Very up to date! Includes GaLore, etc.
Cheers thanks
Thanks
Can you implement a few papers in PyTorch, like Grad-TTS and others?
What others?
Hey Trelis! Can you help me set up a **multi-node, multi-GPU** training infra using RunPod? I figured this out using the Community Cloud option, where I can set a public IP for my pods and expose TCP ports with the same internal and external port numbers. However, I'm not able to add a shared disk across my Community Cloud pods to save checkpoints in case of node failure. I totally failed to set up communication between two different pods when I launched them in the Secure Cloud, but Secure Cloud does allow a network volume that can be shared across different pods.
Can you help me set up the infra for a multi-node, multi-GPU setup in Secure Cloud? In Paperspace this was easy, but I'm not able to figure it out with RunPod. Any suggestions are welcome.
Did you ask RunPod support?
Try that and let me know. I’ll see if I can help
@TrelisResearch RunPod support asked me to select a machine with more GPUs instead of going multi-node, but that isn't what I'm trying to achieve. I want to run a bunch of experiments on a multi-node, multi-GPU setup.
In RunPod, when launching multiple pods, sometimes they get allocated on different Secure Cloud machines (with different public IPs) and sometimes on the same public IP. The latter case is not a problem, because the pods can communicate with each other using their private IPs and exposed TCP ports. In the former case, though, I couldn't figure out how to establish communication over TCP ports. I tried port forwarding, but I'm getting prompted for a password which I don't have.
Is it just me, or is RunPod not configured to allow port forwarding for communication across different public IPs? Any other ideas to solve this? Paperspace is much better for a multi-node, multi-GPU setup, but it is postpaid and I'm afraid I might run into insane cloud bills. RunPod has a prepaid option, which is much safer in my case.
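For context on what actually has to be reachable: a PyTorch multi-node job only needs every pod to reach one rendezvous address and port. A minimal sketch, untested on RunPod, where the IP, port, and RANK values are placeholders you'd replace with the first pod's public IP and an exposed TCP port:

```python
import os
import torch.distributed as dist

# Run this on every pod, setting RANK=0 on the first pod and RANK=1 on
# the second. MASTER_ADDR / MASTER_PORT are placeholders: use the first
# pod's *public* IP and an exposed TCP port reachable from the other pod.
os.environ.setdefault("MASTER_ADDR", "203.0.113.10")
os.environ.setdefault("MASTER_PORT", "29500")

dist.init_process_group(
    backend="nccl",                # GPU collectives; use "gloo" to test on CPU
    init_method="env://",          # rendezvous via MASTER_ADDR / MASTER_PORT
    world_size=2,                  # one process per node in this sketch
    rank=int(os.environ["RANK"]),
)
print(f"rank {dist.get_rank()} / {dist.get_world_size()} connected")
dist.destroy_process_group()
```

If the rendezvous hangs, the master port is usually the thing that isn't reachable across the two public IPs, which matches the symptom described above.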
@padmasrivaddiparthi7287 Yeah, you're right, what you want is not just adding more GPUs.
I haven't done any multi-node, so I'm unsure.
Perhaps you can look at Latitude? They are also prepaid (although you have to prepay 100 bucks to get started).
I'll see if I do a vid at some point, but it's not high priority right now.
@TrelisResearch Thanks for the suggestion. I checked Latitude; it is expensive indeed!
Can we convert a full fine-tuned model to a LoRA (SVD on the delta weights)?
You could try, but you'd probably lose too much quality.
Also, you have to re-set up the model, which is not trivial.
Maybe I'll try it for a video some time.
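If you want to experiment anyway, the core of the idea is a truncated SVD of each layer's delta weights. A minimal per-matrix sketch, where `extract_lora_from_delta` is a hypothetical helper and the rank is illustrative:

```python
import torch

def extract_lora_from_delta(w_base: torch.Tensor,
                            w_ft: torch.Tensor,
                            rank: int = 16):
    """Approximate a fine-tune delta with a rank-r LoRA pair via
    truncated SVD. The quality loss mentioned above is whatever the
    discarded singular values carried."""
    delta = w_ft - w_base                       # (out, in) delta weights
    U, S, Vh = torch.linalg.svd(delta, full_matrices=False)
    # Keep the top-`rank` components; absorb singular values into B.
    B = U[:, :rank] * S[:rank]                  # (out, rank)
    A = Vh[:rank, :]                            # (rank, in)
    return A, B                                 # delta ≈ B @ A

# Toy usage: how much of the delta does a rank-16 factorization keep?
# A random delta is nearly full-rank, so the error here is large; real
# fine-tune deltas are often closer to low-rank.
w_base = torch.randn(768, 768)
w_ft = w_base + torch.randn(768, 768) * 0.01
A, B = extract_lora_from_delta(w_base, w_ft, rank=16)
err = torch.linalg.norm(w_ft - w_base - B @ A) / torch.linalg.norm(w_ft - w_base)
print(f"relative reconstruction error: {err:.3f}")
```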
Hi. Will this work for continued pretraining on textbooks for domain-specific adaptive learning? All I see on the internet are LoRA videos. I have seen your video on FFT, and that's what I want for my use case.
Yup, this is full fine-tuning. It can be used for pretraining or continued pretraining.
@TrelisResearch I tried GaLore with Subspace Descent and, off the bat, it had a better eval/loss than any of the earlier methods. How could this perform better than AdamW?
@mdrafatsiddiqui Haha, nice. Well, GaLore constrains updates to a low-rank subspace, which helps prevent overfitting, so that could be the reason.
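For anyone wanting to try this, here is a minimal sketch of plain GaLore usage based on the galore-torch README. The hyperparameters are illustrative, and the Subspace Descent variant mentioned above is a separate method this sketch does not cover:

```python
import torch
import torch.nn as nn
from galore_torch import GaLoreAdamW  # pip install galore-torch

model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10))

# GaLore projects gradients of 2D weight matrices into a low-rank
# subspace; 1D params (biases etc.) get ordinary AdamW updates.
galore_params = [p for p in model.parameters() if p.dim() == 2]
regular_params = [p for p in model.parameters() if p.dim() != 2]

param_groups = [
    {"params": regular_params},
    {"params": galore_params,
     "rank": 128,              # rank of the gradient subspace
     "update_proj_gap": 200,   # steps between projector refreshes
     "scale": 0.25,            # GaLore scaling factor
     "proj_type": "std"},      # standard projection
]
optimizer = GaLoreAdamW(param_groups, lr=1e-2)

# One toy step: the training loop itself is unchanged.
loss = model(torch.randn(8, 512)).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```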