NEW: LoRA Models Override Pre-trained Knowledge (MIT)

  • Published Dec 18, 2024

Comments • 15

  • @SashaBaych · months ago · +7

    My favorite data science YouTuber these days! Thank you! So many channels now are pure hype, delivering AI news with no substance... but you are an inspiration. Damn, I want to read at least a paper a day now!

    • @code4AI · months ago

      Do it! Smile ....

  • @vladimirnadvornik8254 · 5 days ago

    I just learned that the PiSSA method from peft probably solves this problem: it initializes the LoRA with singular vectors taken from the pre-trained weights, so it does not have to add new intruder dimensions (see the sketch below).
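
    A minimal sketch of that route in code, assuming a peft release that supports PiSSA initialization via init_lora_weights="pissa"; the model id and target modules are placeholders:

    ```python
    # Sketch: PiSSA-style LoRA initialization via peft.
    # Assumes a peft version that accepts init_lora_weights="pissa".
    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    base = AutoModelForCausalLM.from_pretrained("path/to/base-model")  # placeholder id

    config = LoraConfig(
        r=16,
        lora_alpha=16,
        target_modules=["q_proj", "v_proj"],   # placeholder module names
        init_lora_weights="pissa",             # init A/B from principal singular vectors of W
    )
    model = get_peft_model(base, config)       # adapters start aligned with existing directions
    ```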

  • @deter3 · months ago

    How do you measure a model's generalization capability? It is a really fuzzy and vague concept, and we keep using it without having a clear way to measure it.

  • @monologtr_ · months ago · +1

    How about fine-tuning a vision model with custom OCR and VQA datasets?

  • @vladimirnadvornik8254 · months ago

    If I understand it correctly, doing full fine-tuning and then running SVD on the difference between the fine-tuned and the original model would create a LoRA that does not suffer from this problem. Is that correct?
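
    A minimal sketch of that idea for a single weight matrix, assuming you have the matched base and fully fine-tuned tensors; the function and variable names are illustrative, not from the paper or peft:

    ```python
    # Sketch: distill a full fine-tune into rank-r LoRA factors via truncated SVD
    # of the weight delta.
    import torch

    def delta_to_lora(w_base: torch.Tensor, w_ft: torch.Tensor, r: int = 16):
        """Return LoRA factors (A, B) such that B @ A approximates w_ft - w_base."""
        delta = w_ft - w_base                          # (d_out, d_in)
        U, S, Vh = torch.linalg.svd(delta, full_matrices=False)
        sqrt_s = torch.sqrt(S[:r])
        B = U[:, :r] * sqrt_s                          # (d_out, r)
        A = sqrt_s.unsqueeze(1) * Vh[:r, :]            # (r, d_in)
        return A, B

    # w_base + B @ A then reproduces the top-r directions of the full fine-tune.
    ```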

  • @EvanGoodwin-bl7zq · months ago

    Could you train LoRAs at different ranks, scaling up and measuring performance? You stop the process once you reach an acceptable level of performance, or once the improvement falls below a certain threshold. It might involve some upfront cost, but I assume you would save on inference down the line, because the 'acceptable' LoRA would be computationally more efficient than the fully trained model. It would depend on the use case: if you are doing lots of inference, it would definitely pay off down the line. It would be interesting to see the cost of training multiple LoRAs this way vs. full training.
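
    A minimal sketch of that sweep, where train_lora() and evaluate() are hypothetical helpers standing in for your own training and evaluation pipeline; the thresholds are arbitrary:

    ```python
    # Sketch: increase LoRA rank until quality is acceptable or gains flatten out.
    def rank_sweep(ranks=(4, 8, 16, 32, 64), target=0.90, min_gain=0.005):
        best_rank, best_score = None, 0.0
        for r in ranks:
            adapter = train_lora(rank=r)       # hypothetical: fine-tune a LoRA of rank r
            score = evaluate(adapter)          # hypothetical: task metric in [0, 1]
            gain = score - best_score
            if score > best_score:
                best_rank, best_score = r, score
            if best_score >= target or gain < min_gain:
                break                          # acceptable quality, or diminishing returns
        return best_rank, best_score
    ```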

    • @vladimirnadvornik8254 · months ago · +1

      LoRA is not more efficient for inference. Either you merge the LoRA into the model, in which case it is exactly the same, or you compute the LoRA separately, in which case it is less efficient.
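
      To make the point concrete, here is a small sketch (shapes are arbitrary) showing that a merged LoRA layer is just one dense matrix again, so the forward pass costs exactly the same as the original layer:

      ```python
      # Sketch: merged vs. unmerged LoRA forward pass.
      import torch

      d_out, d_in, r = 512, 512, 8
      W = torch.randn(d_out, d_in)              # pre-trained weight
      A = torch.randn(r, d_in)                  # LoRA down-projection
      B = torch.randn(d_out, r)                 # LoRA up-projection
      scaling = 1.0                             # alpha / r

      W_merged = W + scaling * (B @ A)          # same shape and FLOPs as W alone

      x = torch.randn(d_in)
      y_unmerged = W @ x + scaling * (B @ (A @ x))   # extra low-rank matmuls per token
      y_merged = W_merged @ x                        # single matmul, identical output
      assert torch.allclose(y_unmerged, y_merged, atol=1e-3)
      ```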

    • @EvanGoodwin-bl7zq · months ago

      @vladimirnadvornik8254 OK, then perhaps a better approach would be to train a LoRA on different model sizes (1B, 3B, 8B), which are computationally cheaper, and stop when acceptable accuracy is reached or the improvement falls below a certain level.
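
      The same early-stopping loop as the rank sweep above, applied to base-model size; train_lora_on() and evaluate() are hypothetical helpers, and the model ids are placeholders:

      ```python
      # Sketch: grow the base model until quality is acceptable or gains flatten out.
      def size_sweep(model_ids=("base-1b", "base-3b", "base-8b"), target=0.90, min_gain=0.01):
          best_model, best_score = None, 0.0
          for model_id in model_ids:
              adapter = train_lora_on(model_id)  # hypothetical: LoRA fine-tune on this base
              score = evaluate(adapter)          # hypothetical: task metric in [0, 1]
              if score - best_score < min_gain:
                  break                          # a bigger base is no longer paying off
              best_model, best_score = model_id, score
              if best_score >= target:
                  break
          return best_model, best_score
      ```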

  • @rikhendrix261 · months ago

    What determines whether the task is the same? Is it the instruction prompt? And what defines the size of task that is appropriate for LoRA?

    • @novantha1 · months ago

      Your intuition, basically.
      It’s tricky because some tasks will be in distribution even when dealing with unique data, while some tasks will explicitly not be in distribution. Here are a couple of things to consider:
      For simple math, let’s say addition, subtraction, multiplication and division, do you think that a new equation outside of the example equations is in-distribution or out of distribution?
      For logical reasoning problems, do you think that a problem with a similar structure to a problem in the training set is in distribution or out of distribution?
      For creative writing, do you think that a model being asked to write stories in the same genres as the training examples is in distribution or out of distribution?
      It gets really nuanced, and I think the only way to really understand this is to approach them on a model-by-model and dataset-by-dataset basis.

  • @jonmichaelgalindo · months ago

    What about undertraining LoRAs on each block and merging as you go? You update all the parameters, and no single LoRA "overpowers" the original data vectors.

    • @code4AI · months ago · +1

      ??? If you "undertrain" a fine-tuning mechanism, then you have a broken fine-tuned weight tensor structure. Why merge something that is not working into the pre-trained model?

  • @NLPprompter · months ago

    I'm guessing the Lamini AI company is doing something like this to achieve what they claim is better than RAG...