Coding Llama 3 from scratch in PyTorch - Part 1
- Published on May 5, 2024
- In this video series, you will learn how to train and fine-tune the Llama 3 model from scratch.
The goal is to code LLaMA 3 from scratch in PyTorch to create models with sizes of 3B, 6B, 35B and 45B params. In this first video, you'll learn about upcycling, downcycling and infini-attention.
📚Papers:
- Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints: arxiv.org/abs/2212.05055
- Pre-training Small Base LMs with Fewer Tokens: arxiv.org/abs/2404.08634
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention: arxiv.org/abs/2404.07143
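The downcycling idea covered in the video (and in the small-base-LMs paper above) boils down to initializing a smaller model from a subset of a large dense checkpoint's transformer blocks. A minimal sketch of the layer-selection step, assuming a "first-k plus last-m blocks" strategy; the function name is hypothetical, not from the video:

```python
def downcycle_layer_indices(n_source_layers: int, n_target_layers: int) -> list:
    """Pick which transformer blocks of a large dense checkpoint to copy
    into a smaller model: roughly half of the target count from the
    bottom of the stack, the rest from the top."""
    assert n_target_layers <= n_source_layers
    front = n_target_layers // 2          # blocks kept from the bottom
    back = n_target_layers - front        # blocks kept from the top
    return list(range(front)) + list(range(n_source_layers - back, n_source_layers))

# e.g. initialize a small model from a 32-layer Llama checkpoint, keeping 8 blocks
print(downcycle_layer_indices(32, 8))  # → [0, 1, 2, 3, 28, 29, 30, 31]
```

In a real setup you would then copy the weights of the selected blocks (plus embeddings and output head) into the smaller model before continued pre-training.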
💻 To follow along you can use this colab notebook:
- github.com/Blaizzy/Coding-LLM...
🎥 Coding Llama 2 from scratch video series
Part 1: th-cam.com/users/liveXHmag4damTg
Part 2: th-cam.com/users/liveLSWDpFmbE90
Part 3: • Coding Llama 2 from sc... - Science & Technology
Well made Prince! Learned a lot
such a high quality content piece
This is a very thoughtful and great initiative! Researchers with enough gray matter but limited means can still be in the game. Thank you PC🙏!
Most welcome!
It’s my pleasure:)
I lived through this so others don’t have to.
this is very impressive and great content. thank you
You're very welcome!
Super impressive. Great value
One question: how do I further train the model on my custom content instead of using LoRA?
Can we do further full training on it and add new memory?
Most welcome!
You can do that, but full fine-tuning can be very expensive.
CS programmers are vampires. My eeeeyyyes. Great content though
Bro, how did you train Llama 3 without a paper?
Could you elaborate?
@@princecanuma As far as I know, no official Llama 3 paper has been released, and no data info either. But I could be wrong... 😅
@@vivekpadman5248 True, they only released a blog post detailing the data, model architecture and performance.
Here is how I did it: Llama-3 has the exact same architecture as Llama-2, which we already covered on this channel.
th-cam.com/play/PLDn_JsyofyfQp4td_ub6LfIg5vxyu6YJK.html&si=0Gyt9mdaA-ydiWOA
Finally, if you understand how these models work, you don't need the paper; the code implementation is more than enough.
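The "same architecture, different hyperparameters" point can be sketched with a minimal config comparison. The decoder block is unchanged (RMSNorm, RoPE, attention, SwiGLU MLP); the values below are my best recollection of the publicly released Llama-2-7B and Llama-3-8B configs, not figures from the video, so treat them as assumptions:

```python
from dataclasses import dataclass

@dataclass
class LlamaConfig:
    # Shared decoder-only architecture: RMSNorm + RoPE + attention + SwiGLU MLP
    dim: int            # hidden size
    n_layers: int       # number of transformer blocks
    n_heads: int        # query heads
    n_kv_heads: int     # key/value heads; < n_heads enables grouped-query attention
    vocab_size: int
    hidden_dim: int     # SwiGLU intermediate size
    rope_theta: float   # RoPE base frequency

# Published configs, to the best of my knowledge (assumed, verify against the release):
llama2_7b = LlamaConfig(4096, 32, 32, 32, 32000, 11008, 10000.0)
llama3_8b = LlamaConfig(4096, 32, 32, 8, 128256, 14336, 500000.0)

# Same block structure and depth; Llama-3 mainly adds grouped-query attention,
# a much larger vocabulary, and a larger RoPE base for longer context.
assert llama2_7b.n_layers == llama3_8b.n_layers
print(llama3_8b.n_heads // llama3_8b.n_kv_heads)  # → 4 query heads per KV head
```

This is why a Llama-2 implementation can run Llama-3 with only config changes.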
@@princecanuma oh understood, thanks I'll check it out and also your video 💙
Most welcome :)