Efficient Inference on MI300X: Our Journey at Microsoft, Rajat Monga, Microsoft, CVP AI Frameworks

  • Published on 22 Dec 2024
  • In this Advancing AI 2024 Luminary Developer Keynote, Rajat Monga, CVP AI Frameworks at Microsoft, discusses efforts to deploy key models on AMD Instinct™ MI300X GPUs. Rajat starts with why they believed MI300X was worth trying, then covers the inside story, from what it took to bring up a model on a new machine to the performance optimizations that made it competitive against Nvidia H100.
    Gain access to AMD developer tools and resources.
    www.amd.com/en...
    The information contained in this video represents the view of AMD or the third-party presenter as of the date presented. AMD and/or the third-party presenters have no obligation to update any forward-looking content in the above presentations. AMD is not responsible for the content of any third-party presentations and does not necessarily endorse the comments made therein. GD-84.
    © 2024 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, EPYC, ROCm, and AMD Instinct and combinations thereof are trademarks of Advanced Micro Devices, Inc.
