AMA: 1000's of LPUs, 1 AI Brain. Scaling with the Fastest AI Inference

  • Published Nov 4, 2024
  • Learn how the Groq architecture powering its LPU™ Inference Engine is designed to scale from the ground up. This AMA will dive into the scaling capabilities of Groq AI infrastructure across hardware, compiler, and cloud. We'll also discuss the unique Groq approach to overcoming scaling limitations of traditional legacy architectures.

Comments • 12

  • @dyter07
    @dyter07 5 months ago +5

    Groq is amazing — the speed leaves me speechless. Is it possible to see some samples with a diffusion model soon?

  • @merchantsvillage
    @merchantsvillage 5 months ago

    Thank you, great intro to your tech!

  • @glorified3142
    @glorified3142 5 months ago +2

    Multimodal with voice and image, plus live video/camera capture, would surely help advance R&D.

  • @joannot6706
    @joannot6706 5 months ago +3

    The future of AI is not just LLMs, it's multimodal. Does the LPU work with any type of data that AI can process? (It's all tokenized, after all.)
    Are you going to rename it the MPU — Multimodal Processing Unit?

    • @MarkHeaps-iu9si
      @MarkHeaps-iu9si 5 months ago +3

      Interesting suggestion — maybe we'll do that with our V2 silicon.
      We're already testing multimodal, and we have a well-documented history of doing inference for many types of data-heavy workloads. Look at the work done with national labs, etc.

  • @vishwamartur
    @vishwamartur 5 months ago

    Need to invest in it.

  • @lokeshart3340
    @lokeshart3340 5 months ago +1

    Can you also do image and audio gen, or video gen?

  • @QinghuaLi-wd1tk
    @QinghuaLi-wd1tk 5 months ago

    Groq said they have the lowest TTFT, and it turns out it is 180 ms, as shown in their slide. That number really sucks. Even GPUs can do it in 100 ms. SambaNova is also doing much better than 180 ms — around 100 ms as well.

  • @thesimplicitylifestyle
    @thesimplicitylifestyle 5 months ago

    Decentralized AGI with virtual substrate independent Machine Learning LLM Nodes working on Multiple servers connected to decentralized search engines being accessed with personal LLM and LAM computers that have WiFi and bluetooth and can learn to operate household appliances and inexpensive interchangeable robot chassis that can be controlled remotely. 😎🤖

  • @QinghuaLi-wd1tk
    @QinghuaLi-wd1tk 5 months ago

    Groq is a liar: it says its 1250 tokens/s for Llama 3 8B is 4x higher than other providers, but they obviously know SambaNova can do 1000+ tokens/s as well.
    Well, liar.

    • @BooleanDisorder
      @BooleanDisorder 5 months ago

      An outlier!

    • @QinghuaLi-wd1tk
      @QinghuaLi-wd1tk 5 months ago

      @@BooleanDisorder oh, right, they are liars