Scaling RoCE Networks for AI Training | Adi Gangidi

NVIDIA Spectrum-X Network Platform Architecture

Cloud is a Network - How Juniper Enables the Telco Cloud

มวยมันส์สนั่นเมือง 18/06/2024

สาวเกาหลี!! เดินทางเพื่อมากินอาหารอีสาน อย่างกล้า!!

แฟนแนวใด๋ - ยูริ โตเกียวมิวสิค [ SyncVersion ]

Ethernet Fabrics for GenAI workloads

JuniperNetworks

มุมมอง 2 049

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 19 พ.ค. 2024
In this video, Sharada Yeluri, Senior Director of Engineering at Juniper Networks, describes the traffic patterns between the GPUs during LLM and GenAI model training and how to optimize the network topologies for these traffic patterns. She compares the different switch options and the challenges in controlling congestion and improving the performance of training workloads.
Read more about GPU Fabrics for GenAI Workloads:
www.linkedin.com/pulse/gpu-fa...
วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 2

@nabromov 29 วันที่ผ่านมา ⁺¹
insightful! thank you
@jasoniannone9675 29 วันที่ผ่านมา ⁺²
Thank you for sharing!
RE: ~15:30 Modular vs. Fixed Switches - Are deep buffers desirable in RDMA? Jitter and latency variation and associated frame reordering over parallel flows seems like a sensitivity. Don't these absurdly integrated systems (GPU to GPU) have their own transmission control mechanisms and realtime knowledge of the state of the flows?
Is the Ethernet network doing ECN or Pause in these deployments?
I think the slide at 21:50 addresses my questions. Sorry. Impatient.

ต่อไป

เล่นอัตโนมัติ

Scaling RoCE Networks for AI Training | Adi Gangidi

Scaling RoCE Networks for AI Training | Adi Gangidi

NVIDIA Spectrum-X Network Platform Architecture

NVIDIA Spectrum-X Network Platform Architecture

Cloud is a Network - How Juniper Enables the Telco Cloud

Cloud is a Network - How Juniper Enables the Telco Cloud

มวยมันส์สนั่นเมือง 18/06/2024

มวยมันส์สนั่นเมือง 18/06/2024

สาวเกาหลี!! เดินทางเพื่อมากินอาหารอีสาน อย่างกล้า!!

สาวเกาหลี!! เดินทางเพื่อมากินอาหารอีสาน อย่างกล้า!!

แฟนแนวใด๋ - ยูริ โตเกียวมิวสิค [ SyncVersion ]

แฟนแนวใด๋ - ยูริ โตเกียวมิวสิค [ SyncVersion ]

World’s Deadliest Obstacle Course!

World’s Deadliest Obstacle Course!

New Juniper Validated Design: Metro EBS

New Juniper Validated Design: Metro EBS

EN141 Webinar: RoCE Introduction

EN141 Webinar: RoCE Introduction

Arista Networking for AI Workloads

Arista Networking for AI Workloads

Congestion Management in the AI Data Center

Congestion Management in the AI Data Center

LLM Control Theory Seminar (April 2024)

LLM Control Theory Seminar (April 2024)

Meta’s Network Journey to Enable AI | Hany Morsy & Susana Contrera

Meta’s Network Journey to Enable AI | Hany Morsy & Susana Contrera

InfiniBand and RoCE: Artificial Intelligence Data Centers | FiberMall

InfiniBand and RoCE: Artificial Intelligence Data Centers | FiberMall

How to Build, Evaluate, and Iterate on LLM Agents

How to Build, Evaluate, and Iterate on LLM Agents

สอนถ่ายคลิปแฟนกับมอไซค์เก๋ๆง่ายๆใครก็ทำได้ด้วยมือถือ🥰 #spk #สอนถ่ายภาพ #สอนถ่ายวิดีโอ

สอนถ่ายคลิปแฟนกับมอไซค์เก๋ๆง่ายๆใครก็ทำได้ด้วยมือถือ🥰 #spk #สอนถ่ายภาพ #สอนถ่ายวิดีโอ

One To Three USB Convert

One To Three USB Convert

เหตุผลที่ผมซื้อ iPad Air 6 รุ่น 13 นิ้ว

เหตุผลที่ผมซื้อ iPad Air 6 รุ่น 13 นิ้ว

รีวิว realme GT 6 เรือธงที่เกิดมาเพื่อคนเล่นเกม (เทียบ S24Ultra และ 15ProMax)

รีวิว realme GT 6 เรือธงที่เกิดมาเพื่อคนเล่นเกม (เทียบ S24Ultra และ 15ProMax)

lol Apple Intelligence is dumb...

lol Apple Intelligence is dumb...

Do not touch my phone 📱- ดูเฉยๆ ห้ามจับ

Do not touch my phone 📱- ดูเฉยๆ ห้ามจับ

Bluetooth Desert Eagle

Bluetooth Desert Eagle

Samsung S24 Ultra professional shooting kit #shorts

Samsung S24 Ultra professional shooting kit #shorts