Hey, I love the channel. I started making my own setup a few months ago and came across this while researching Docker. I'm new to programming environments and Ubuntu; I come from the industrial automation side, PLCs and networking. I am going to do a fresh install following your guide on a node I am setting up for practice.
Much appreciated! Hope you find it all a good starting place to get up and going quickly - getting started is always the hardest part.
@RoboTFAI exactly. I got my primary system set up after a month of configuring. I learned a lot about Linux in the process through troubleshooting, and I think I did 5 installs and builds from the ground up. This will be perfect for the secondary setup I want to connect to. I will get you a coffee for the time and energy, lol
Wow! I went through the entire series of three episodes you made. I am not even a coder, and I am very new to LLMs and AI, but I watched through all that you taught here, because I was thinking of buying AMD cards due to their usually larger VRAM and lower price. I have to say, your patience and style of guiding is very noob friendly. I really thank you for these videos.
If you get time, I have a few noob questions for you, if you can answer:
- I am still confused about how well the 7900 XTX will perform in terms of token speed on this Ubuntu server with ROCm, for 7B, 13B & 32B models at Q8 or FP16 respectively, for inference as well as for training. I couldn't find any specific numbers. Any guess?
- How many GPUs do you suggest for training 32B models?
- Which GPUs would you suggest going for, if not the 7900 XTX?
- The 'GPT-4' you were running during setup, was it a 13B model? I somehow missed that.
- Would you suggest using 13B or smaller models for coding and good problem-solving purposes for local office team use?
- What's your take on DeepSeek or Qwen for similar use?
Thanks a ton again. God bless.
Appreciated! Everyone has to start someplace, and it can be VERY complex if you don't have a background in running/building systems. We have to be able to learn from each other as a community. I don't know it all, but I'm happy to share what I do know and my own learning experiences with others.
I don't have a 7900 XTX in the lab, but we do have a 7600 XT - you can see how it handled itself in the Mistral Leaderboard series here: th-cam.com/video/uwHiOtvm9go/w-d-xo.html There are also several amazing viewers using the 7900 XTX who might be able to chime in with more info.
As many as possible - training is a way different beast than just running inference.
I mainly suggest Nvidia-based cards just because their software and integrations are well ahead of Intel/AMD - but that gap is closing, and you pay a premium with Nvidia. Check out the leaderboard on our site for more info on the cards we have tested: robotf.ai/Mistral_7B_Leaderboard and use some of that info to help with your decisions.
The LocalAI AIO images that come with those "GPT-4" etc. labeled models are actually models like Hermes Pro and the like - LocalAI lists the "mappings" here: localai.io/basics/container/#all-in-one-images just to get people started.
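If anyone wants to see what that looks like in practice, here's a minimal sketch of hitting a LocalAI AIO container through its OpenAI-compatible API - assuming it's running locally on the default port 8080, and keeping in mind the "gpt-4" name is just LocalAI's alias for a bundled open model, not OpenAI's actual GPT-4:

```python
# Minimal sketch: query a LocalAI AIO container via its OpenAI-compatible API.
# Assumes the container is running locally on the default port 8080; the
# "gpt-4" model name is just LocalAI's alias for a bundled open model
# (e.g. a Hermes variant), not OpenAI's GPT-4.
import json
import urllib.request

payload = {
    "model": "gpt-4",  # LocalAI maps this alias to a local model
    "messages": [{"role": "user", "content": "Say hello in one sentence."}],
}
req = urllib.request.Request(
    "http://localhost:8080/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.load(resp)["choices"][0]["message"]["content"])
```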
I prefer to use 32B+ models (I run mostly 70B models for coding/etc., if not larger when possible) at high quants (Q6-Q8) and large context sizes (64-128k), which greatly affects your memory requirements.
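To give a rough sense of why quant level and context size matter so much, here's a back-of-envelope sketch - the layer/head numbers are illustrative assumptions for a 70B-class model, not exact specs for any particular one:

```python
# Back-of-envelope VRAM estimate: quantized weights + KV cache.
# The architecture numbers (80 layers, 8 KV heads, head dim 128) are
# illustrative assumptions for a 70B-class model, not exact specs.

def weights_gb(params_b: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB (params given in billions)."""
    return params_b * bits_per_weight / 8

def kv_cache_gb(context: int, layers: int = 80, kv_heads: int = 8,
                head_dim: int = 128, bytes_per_elem: int = 2) -> float:
    """Approximate KV cache in GB at fp16 (2 bytes), keys + values."""
    return 2 * layers * kv_heads * head_dim * context * bytes_per_elem / 1e9

for ctx in (8_192, 65_536, 131_072):
    total = weights_gb(70, 6.5) + kv_cache_gb(ctx)  # Q6_K is roughly 6.5 bpw
    print(f"70B @ ~Q6, {ctx:>7} ctx: ~{total:.0f} GB")
```

The weights alone for a 70B model at ~Q6 land near 57 GB, and pushing the context from 8k to 128k adds tens of GB more of KV cache - which is why the big-context, high-quant setups need so much memory.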
I have been a huge fan of DeepSeek and Qwen since well before R1, which is getting so much traction these days. Different models are better at different tasks, which is why we want as many open source models as possible! The DeepSeek coder models have been a staple in my lab for the last few years for sure.