Do we really need NPUs now?

Top Minds in AI Explain What’s Coming After GPT-4o | EP #130

STOP WASTING YOUR MONEY!!! Same PC... DIFFERENT COST!

This pasta was almost APPROVED @kentycook

คุม ‘สุนทร วิลาวัลย์’ พร้อมพวก 7 คนสอบ ปมยิง ‘สจ.โต้ง’ ดับคาบ้าน คาดขัดแย้งการเมืองท้องถิ่น

How Groq’s LPUs Overtake GPUs For Fastest LLM AI!

ipXchange

มุมมอง 26 725

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 11 ธ.ค. 2024

ความคิดเห็น • 7

@Maisonier 5 หลายเดือนก่อน ⁺³⁷
I'd love to have a small black box at home with several Groq LPUs acting as LLMs for my local network. It would serve a typical family of five, each accessing it from their phones via WiFi while at home working, especially since internet connectivity can be an issue. I wonder if they'll ever sell such a device to the general public instead of just focusing on businesses?
@ipXchange 5 หลายเดือนก่อน ⁺⁵
I couldn't say. They do make racks, but I wonder how many you would need to make something viable at home, and whether they'd let you buy not in bulk. That would be cool though. To be fair, you can use Groq cloud, but I guess you want to own your own infrastructure. Groq has deployed their LPU in super small use cases, so there might be a possibility you could get you hands on some private units...
@MariaAntoniaNeves-fw1vs 2 หลายเดือนก่อน
MariaNevesEstou. AquiAmem
@alertbri 5 หลายเดือนก่อน ⁺²
How does an LPU differ from an ASIC please?
@ipXchange 5 หลายเดือนก่อน ⁺⁴
I suppose it could be considered a type of ASIC as it is a processor designed specifically for large language model processing. The way that an LPU differs from a GPU is that it does not do any parallel processing - it's very good at doing things in sequence.
For applications like LLMs or audio, going forward in time is all that's required because the next word depends on the words that came before it. It's pretty much a 1D problem.
This is in contrast to GPUs because a 2D or 3D picture needs to understand the whole context of a scene, hence why it requires parallel processing of all the pixels in order to understand what's going on.
While parallel processing in GPUs can be used to enable faster LLM AI, at a certain point, the recombination of data slows the whole process down. The LPU, however, is able to just keep chugging along at the same pace because any parallelism is done in separate chips. At a certain number of devices, it seems that this wins out in terms of performance as the GPUs stop providing a net gain for more units added to the system.
This is an oversimplification, but you get the idea. Thank you for the comment and question.
@Davorge 5 หลายเดือนก่อน ⁺¹
@@ipXchange interesting, so why are billionaries dropping hundreds of millions in H100 clusters? wouldnt it be better for them to invest in LPU's moving forward?
@kahvac 4 หลายเดือนก่อน
@@Davorge You have to start somewhere..if you keep waiting for the next best thing you will be left behind.

ต่อไป

เล่นอัตโนมัติ

Do we really need NPUs now?

Do we really need NPUs now?

Top Minds in AI Explain What’s Coming After GPT-4o | EP #130

Top Minds in AI Explain What’s Coming After GPT-4o | EP #130

STOP WASTING YOUR MONEY!!! Same PC... DIFFERENT COST!

STOP WASTING YOUR MONEY!!! Same PC... DIFFERENT COST!

This pasta was almost APPROVED @kentycook

This pasta was almost APPROVED @kentycook

คุม ‘สุนทร วิลาวัลย์’ พร้อมพวก 7 คนสอบ ปมยิง ‘สจ.โต้ง’ ดับคาบ้าน คาดขัดแย้งการเมืองท้องถิ่น

คุม ‘สุนทร วิลาวัลย์’ พร้อมพวก 7 คนสอบ ปมยิง ‘สจ.โต้ง’ ดับคาบ้าน คาดขัดแย้งการเมืองท้องถิ่น

Chicken nuggets HACK APPROVED @chefkoudy

Chicken nuggets HACK APPROVED @chefkoudy

Microservices are Technical Debt

Microservices are Technical Debt

The Future Of AI, According To Former Google CEO Eric Schmidt

The Future Of AI, According To Former Google CEO Eric Schmidt

Why Agent Frameworks Will Fail (and what to use instead)

Why Agent Frameworks Will Fail (and what to use instead)

When Optimisations Work, But for the Wrong Reasons

When Optimisations Work, But for the Wrong Reasons

How Nvidia Grew From Gaming To A.I. Giant, Now Powering ChatGPT

How Nvidia Grew From Gaming To A.I. Giant, Now Powering ChatGPT

Warren Buffett Leaves The Audience SPEECHLESS | One of the Most Inspiring Speeches Ever

Warren Buffett Leaves The Audience SPEECHLESS | One of the Most Inspiring Speeches Ever

What Is an AI Anyway? | Mustafa Suleyman | TED

What Is an AI Anyway? | Mustafa Suleyman | TED

Run your own AI (but private)

Run your own AI (but private)

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Homeless Hero Returns Lost Wallet to Its Owner #shorts

Homeless Hero Returns Lost Wallet to Its Owner #shorts

หญิงสาวช่วยชีวิตคนขอทานไว้ เขากลายเป็นมหาเศรษฐีและเริ่มรักเธออย่างบ้าคลั่งและตอบแทนบุญคุณเธอ

หญิงสาวช่วยชีวิตคนขอทานไว้ เขากลายเป็นมหาเศรษฐีและเริ่มรักเธออย่างบ้าคลั่งและตอบแทนบุญคุณเธอ

ด่วน! "สจ.โต้ง" ถูกยิงเสียชีวิต เหตุคนร้ายบุกยิงบ้านอดีต รมช.ศึกษาฯ | 11 ธ.ค. 67 | ไทยรัฐนิวส์โชว์

ด่วน! "สจ.โต้ง" ถูกยิงเสียชีวิต เหตุคนร้ายบุกยิงบ้านอดีต รมช.ศึกษาฯ | 11 ธ.ค. 67 | ไทยรัฐนิวส์โชว์

Scum Rangers LIVE-016 บังเกอร์ร้าง = หนังชีวิต

Scum Rangers LIVE-016 บังเกอร์ร้าง = หนังชีวิต

ลบความเชื่อเรื่องสุขภาพที่คนไทยเข้าใจผิด ของหวาน มัน เค็ม หมอก็กิน! | WOODY FM

ลบความเชื่อเรื่องสุขภาพที่คนไทยเข้าใจผิด ของหวาน มัน เค็ม หมอก็กิน! | WOODY FM

🔴LIVE โหนกระแส "ชาล็อต" เสียท่าถูกมิจจี้ อ้างเป็นตำรวจหลอกโอนเงิน 4 ล้าน

🔴LIVE โหนกระแส "ชาล็อต" เสียท่าถูกมิจจี้ อ้างเป็นตำรวจหลอกโอนเงิน 4 ล้าน

การแข่งขัน RoV นานาชาติ AIC 2024 รอบ Swiss Stage วันที่ 5

การแข่งขัน RoV นานาชาติ AIC 2024 รอบ Swiss Stage วันที่ 5

Đang ngồi chơi bỗng dưng bể cá vỡ kính, may có CCTV chứng minh sự trong sạch cho cô bé

Đang ngồi chơi bỗng dưng bể cá vỡ kính, may có CCTV chứng minh sự trong sạch cho cô bé