LocalAI LLM Testing: Distributed Inference on a network? Llama 3.1 70B on Multi GPUs/Multiple Nodes

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

AI Is Making You An Illiterate Programmer

guncharlie - จากกันโดยสมบูรณ์ | OFFICIAL MV

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

New Colour Match Puzzle Challenge with Cola and McDonald’s Avengers Logo - Incredibox Sprunki

LocalAI LLM Testing: Part 2 Network Distributed Inference Llama 3.1 405B Q2 in the Lab!

RoboTF AI

มุมมอง 2 016

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 2 ก.พ. 2025

ความคิดเห็น •

@tbranch227 5 หลายเดือนก่อน ⁺⁷
Congrats! You unlocked a masssive achievement running this on your own hardware!!! All hail the AI and the kilowatts we feed them
@RoboTFAI 5 หลายเดือนก่อน ⁺²
Skynet is possible in your basement 🦾
@marekkroplewski6760 5 หลายเดือนก่อน ⁺⁴
Dad! Where did my gaming rig go!!! Now listen up there Junior, this is for science. Just don't tell your Mum. And you can have the car keys for Saturday.
@RoboTFAI 5 หลายเดือนก่อน ⁺¹
Better than stealing their GPU's out of their rigs right? 😂
@animationgaming8539 3 หลายเดือนก่อน ⁺¹
I liked every comment on this video!
@RoboTFAI 3 หลายเดือนก่อน
Thanks!
@ckckck12 4 หลายเดือนก่อน ⁺¹
What I see is 1/3 working time caused by using a model built on close to 6 times the data points. That's nice! And... Enabled by distributed mode.
Am I correct? Is there a way that the quant factor affects the computation of those general factors (time/tokens vs size) that would make this more 1:1?
Already it's nice you can get the big models in.
@twinnie38 4 หลายเดือนก่อน ⁺¹
Really impressive, congrats ! Do you know the impact of a limited PCIe bus (1x , 4x GEN3) for those GPU cards ?
@nickmajkic1436 5 หลายเดือนก่อน ⁺²
Would you be able to make a tutorial on getting lovalAI working in kubernetes?
@RoboTFAI 5 หลายเดือนก่อน ⁺¹
Sure, I think that's overdue at this point!
@_zproxy 5 หลายเดือนก่อน ⁺¹
wild. does this work for llava images too?
@unsaturated8482 3 หลายเดือนก่อน
Damn
@mckirkus 5 หลายเดือนก่อน ⁺¹
What's the network bandwidth? I wonder what could be done if you connected to a bunch of buddies with gigabit symmetrical fiber connections.
@RoboTFAI 5 หลายเดือนก่อน ⁺²
As much as you can pump for distributing the model - during inference it's really only about 10-20 MB/s per node
@bechti44 4 หลายเดือนก่อน ⁺¹
around 4 Times faster than cpu only... But around 100x more expensive...
@RoboTFAI 4 หลายเดือนก่อน ⁺¹
It's just money and power! like always.... 😁
@andriidrihulias6197 5 หลายเดือนก่อน ⁺²
First
@RoboTFAI 5 หลายเดือนก่อน ⁺¹
Congrats!
@Anurag_Tulasi 5 หลายเดือนก่อน ⁺¹
It would be more intelligible if your results mention (Higher is better or Lower is better) beside the chart headings.
@RoboTFAI 5 หลายเดือนก่อน ⁺¹
Thanks for the feedback!

ต่อไป

เล่นอัตโนมัติ

LocalAI LLM Testing: Distributed Inference on a network? Llama 3.1 70B on Multi GPUs/Multiple Nodes

LocalAI LLM Testing: Distributed Inference on a network? Llama 3.1 70B on Multi GPUs/Multiple Nodes

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

AI Is Making You An Illiterate Programmer

AI Is Making You An Illiterate Programmer

guncharlie - จากกันโดยสมบูรณ์ | OFFICIAL MV

guncharlie - จากกันโดยสมบูรณ์ | OFFICIAL MV

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

LocalAI LLM Testing: i9 CPU vs Tesla M40 vs 4060Ti vs A4500

LocalAI LLM Testing: i9 CPU vs Tesla M40 vs 4060Ti vs A4500

Fine-tuning, RAG, Llama, prompt-engineering, LLM-арены | Что происходит в LLM

Fine-tuning, RAG, Llama, prompt-engineering, LLM-арены | Что происходит в LLM

Llama 3.1 405b LOCAL AI Home Server on 7995WX Threadripper and 4090

Llama 3.1 405b LOCAL AI Home Server on 7995WX Threadripper and 4090

The Ultimate Mini Server Rack - Size doesn't matter...

The Ultimate Mini Server Rack - Size doesn't matter...

Building Ubuntu Server for AI and LLMs from scratch Part 2: Nvidia Cuda Drivers, Toolkit, LocalAI!

Building Ubuntu Server for AI and LLMs from scratch Part 2: Nvidia Cuda Drivers, Toolkit, LocalAI!

The most beautiful equation in math, explained visually [Euler’s Formula]

The most beautiful equation in math, explained visually [Euler’s Formula]

Using Clusters to Boost LLMs 🚀

Using Clusters to Boost LLMs 🚀

Cheap mini runs a 70B LLM 🤯

Cheap mini runs a 70B LLM 🤯

LocalAI LLM Tuning: WTH is Flash Attention? What are the effects on memory and performance? Llama3.2

LocalAI LLM Tuning: WTH is Flash Attention? What are the effects on memory and performance? Llama3.2

หลอกเพื่อนจับอึ #funny #แกล้ง #แกล้งเพื่อน #อึ #เพื่อนแกล้ง #ละคร

หลอกเพื่อนจับอึ #funny #แกล้ง #แกล้งเพื่อน #อึ #เพื่อนแกล้ง #ละคร

【พากย์ไทย】สาวใช้ในวังจะถูกประหารชีวิต แต่เธอมีฐานะที่ไม่ธรรมดา คือพระราชบุตรีแท้ๆ ของพระราชา!

【พากย์ไทย】สาวใช้ในวังจะถูกประหารชีวิต แต่เธอมีฐานะที่ไม่ธรรมดา คือพระราชบุตรีแท้ๆ ของพระราชา!

ไฮไลท์ฟุตบอล พรีเมียร์ลีก 2024/25 สัปดาห์ที่ 16 : แมนเชสเตอร์ ซิตี้ พบ แมนเชสเตอร์ ยูไนเต็ด

ไฮไลท์ฟุตบอล พรีเมียร์ลีก 2024/25 สัปดาห์ที่ 16 : แมนเชสเตอร์ ซิตี้ พบ แมนเชสเตอร์ ยูไนเต็ด

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

🔴LIVE โหนกระแส ศึกชิงมรดก 500 ล้าน ทายาทฟ้องเด็กรับใช้ปลอมลายเซ็น

🔴LIVE โหนกระแส ศึกชิงมรดก 500 ล้าน ทายาทฟ้องเด็กรับใช้ปลอมลายเซ็น

MARK 마크 '프락치 (Fraktsiya) (Feat. 이영지)' MV

MARK 마크 '프락치 (Fraktsiya) (Feat. 이영지)' MV

🔴LIVE โหนกระแส บาร์โฮสสะเทือน!!! "สุนิสา" อาละวาดไล่หลอกเงิน

🔴LIVE โหนกระแส บาร์โฮสสะเทือน!!! "สุนิสา" อาละวาดไล่หลอกเงิน

ศึกมวยไทยพันธมิตร 16/12/2024

ศึกมวยไทยพันธมิตร 16/12/2024