Thanks for sharing the paper.
In my experience using Llama3-8B on my benchmark dataset, I noticed the LLM has learned an incorrect fact, or one that contradicts my application. I tried to clarify that in the prompt, but found the LLM is actually quite stubborn, which leads to quite fragile responses: it sometimes gets it right and sometimes gets it wrong with minimal changes to the prompt, as small as adding spaces.
I wonder if you have come across a similar situation, or papers that discuss this behavior.
Thanks.
Yes, that kind of brittleness is unfortunately a common issue.
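One way to see how sensitive a model is to this is to run the same question through a few trivially perturbed prompts and compare the greedy outputs. Below is a minimal sketch (not from the paper or this thread) assuming the Hugging Face transformers library; the model name and the example prompt are placeholders, and in practice you would use your own benchmark questions.

```python
# Sketch: probe prompt brittleness by perturbing whitespace and comparing greedy answers.
# Assumes `transformers`, `torch`, and `accelerate` are installed; model/prompt are placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumed; any instruct-tuned causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="auto"
)

base_prompt = "Is feature X enabled by default in our product? Answer yes or no."  # hypothetical
perturbations = [
    base_prompt,
    base_prompt + " ",   # trailing space
    " " + base_prompt,   # leading space
    base_prompt + "\n",  # trailing newline
]

for p in perturbations:
    msgs = [{"role": "user", "content": p}]
    input_ids = tokenizer.apply_chat_template(
        msgs, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    # Greedy decoding, so any change in the answer comes from the prompt itself.
    out = model.generate(input_ids, max_new_tokens=8, do_sample=False)
    answer = tokenizer.decode(out[0][input_ids.shape[1]:], skip_special_tokens=True)
    print(repr(p[-20:]), "->", answer.strip())
```

If the answers flip across these near-identical prompts, that's the fragility described above rather than a genuine knowledge gap.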
But why is the paper's fine-tuning different from the original pre-training and the alignment fine-tuning that came before it? All of them expose the model to a mix of existing and new data...
You are correct -- in principle fine-tuning works the same way as pre-training (updating weights), so FT can be thought of as continued PT.
The difference is in the data used. One fine-tunes when they have a domain-specific dataset that is very different from the PT data.
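To make that concrete, here is a minimal sketch (my own, not from the paper) of fine-tuning as continued pre-training: the objective is the same next-token loss, and only the data changes. It assumes Hugging Face transformers/datasets, a small stand-in model, and a hypothetical local file domain_corpus.txt holding the domain-specific text.

```python
# Sketch: "fine-tuning = continued pre-training" with a domain corpus.
# Assumes `transformers` and `datasets` are installed; file name and model are placeholders.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "gpt2"  # small stand-in; the thread's Llama3-8B would work the same way
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# The domain-specific corpus replaces the broad pre-training mix.
dataset = load_dataset("text", data_files={"train": "domain_corpus.txt"})["train"]
dataset = dataset.map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=512),
    batched=True, remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="ft-out", num_train_epochs=1,
                           per_device_train_batch_size=4, learning_rate=5e-5),
    train_dataset=dataset,
    # mlm=False gives the causal (next-token) objective, i.e. the same loss as pre-training.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

Mechanically it is the same weight-update loop as pre-training; what makes it "fine-tuning" is starting from pre-trained weights and feeding it the narrower domain data.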
Nice video, thank you.
Interesting! Thanks.