Maximize GPU utilization with SageMaker HyperPod task governance | AWS OnAir re:Invent 2024
ฝัง
- เผยแพร่เมื่อ 14 ม.ค. 2025
- Teams across organizations are training new models, fine-tuning them with their data, and running inference at scale, all of which requires timely access to accelerated compute resources. Given the overwhelming demand and a finite budget, organizations are unable to allocate accelerated compute resources to each project and task when needed. In this session, learn about how Amazon SageMaker HyperPod’s new governance capability helps dynamically run FM development tasks such as training, fine-tuning, or inference on shared compute resources, ensuring the most important FM development projects get completed on time, while avoiding cost-overruns due to under-utilized compute resources.
Find out all of the details on the web page - awsonair.net/W...
Follow AWS OnAir on
LinkedIn awsonair.net/L...
Twitch awsonair.net/s...
ABOUT AWS
Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers-including the fastest-growing startups, largest enterprises, and leading government agencies-are using AWS to lower costs, become more agile, and innovate faster.
#aws #AWSEvents #generativeai #foundationmodel #sagemaker #awsonair