
MoME Reduces LLM Hallucinations by 10X!

  • Published Jun 13, 2024
  • More on the announcement: www.lamini.ai/...
    Check out our upcoming live trainings to learn more about building with LLMs:
    maven.com/dair...
    #ai #machinelearning #engineering #coding

Comments • 15

  • @marinepower • 2 months ago +3

    This is interesting and somewhat aligns with how the brain seems to work. We have general capabilities that we use all the time, but we are also able to retrieve memories even after years of not accessing them. So it implies that we have weights that change, and memories that are more static / MoE-like where we can pull them up at will.

  • @bluetensee • 2 months ago +3

    good job again. thank you for your insightful expertise! and thanks for not clickbaiting!!! hope you'll get a lot more followers soon. you deserve it!

    • @elvissaravia • 2 months ago

      I appreciate that!

  • @valtersilva5386 • 2 months ago

    Loved the tone on the content man, you've got a new subscriber! Great job!

  • @aireddy • 2 months ago +1

    It would be fantastic if it really does reduce hallucinations by 10x. Thank you for sharing your thoughts!!

  • @jeffg4686 • 2 months ago +1

    Nice! That does add a lot more confidence that the answers are correct.
    The "mixture of agents" model architecture is coming in with some good stuff too (not as good as this though - this is big).
    We're not far from some really smart agents...

  • @novantha1 • 2 months ago +4

    So, I think it's a bit misleading, or perhaps just unintuitive, that this technique was labelled "MoE". It's more like S-LoRA, where the model actively swaps out relevant LoRAs at inference time. It's not strictly speaking anything "new" as such, but a series of existing techniques tied together into a simple package.
    I'm not sure how useful it really is to the broader community, particularly given that it's not open source, and that there are existing techniques, like mechanistic interpretability, that should essentially do something quite similar at the end of the day, to say nothing of advancements in reinforcement learning that don't eliminate an LLM's ability to express a lack of confidence (raw LLMs actually have a pretty good internal estimate, before instruction tuning, of how accurate the facts they're stating are; we currently destroy that in fine-tuning by forcing them to answer confidently).
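
    For reference, here is a minimal PyTorch sketch of the S-LoRA-style adapter swapping described in the comment above: a frozen base linear layer, a pool of LoRA adapters acting as "memory experts", and a small router that picks one adapter per query at inference time. The class names, the top-1 routing, and the hyperparameters are illustrative assumptions, not Lamini's MoME implementation or API.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class LoRAAdapter(nn.Module):
        """Low-rank update delta(x) = (x @ A^T) @ B^T applied on top of a frozen base layer."""
        def __init__(self, d_in: int, d_out: int, rank: int = 8):
            super().__init__()
            self.A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
            self.B = nn.Parameter(torch.zeros(d_out, rank))   # zero init: adapter starts as a no-op

        def delta(self, x: torch.Tensor) -> torch.Tensor:
            return F.linear(F.linear(x, self.A), self.B)

    class RoutedLoRALinear(nn.Module):
        """Frozen base weights plus a pool of adapters; a router picks one adapter per query."""
        def __init__(self, d_in: int, d_out: int, num_experts: int, rank: int = 8):
            super().__init__()
            self.base = nn.Linear(d_in, d_out)
            self.base.weight.requires_grad_(False)             # base weights stay frozen
            self.base.bias.requires_grad_(False)
            self.experts = nn.ModuleList(
                [LoRAAdapter(d_in, d_out, rank) for _ in range(num_experts)]
            )
            self.router = nn.Linear(d_in, num_experts)         # tiny routing head

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # x: (batch, seq_len, d_in). Route on the mean-pooled query, top-1 (sparse).
            expert_idx = self.router(x.mean(dim=1)).argmax(dim=-1)   # (batch,)
            base_out = self.base(x)
            deltas = torch.stack(
                [self.experts[i].delta(x[b]) for b, i in enumerate(expert_idx.tolist())]
            )
            return base_out + deltas

    # Usage: each query in the batch gets the frozen base plus exactly one swapped-in adapter.
    layer = RoutedLoRALinear(d_in=64, d_out=64, num_experts=16)
    queries = torch.randn(4, 10, 64)        # (batch, seq_len, hidden)
    print(layer(queries).shape)             # torch.Size([4, 10, 64])

    Top-1 routing keeps per-query compute close to the base model; the hard part at scale is paging many adapters in and out of memory, which is the problem S-LoRA focuses on.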

  • @yahm0n • 2 months ago +2

    This seems the same as regular mixture of experts.

  • @mihaitanita • 2 months ago +2

    Hmm. Lots of PR stunts on their blog, so I'm still... skeptical. I really don't get the main trick, and 200 API calls per month is not enough for a proper test run. "Internal memorization. Tuning the weights, not RAG. You can layer them." /via X.

  • @terionname • 2 months ago +7

    not open source =(

  • @williamzhao3885 • 2 months ago +4

    I feel like 95% is hard to believe. Are they really training 1 million models? I'm also not sure how accurate their routing model is.

    • @elvissaravia • 2 months ago +2

      There are a lot of parts to look at more closely. I also wonder how general the approach is across different domains and types of data.

  • @bbrother92 • 2 months ago

    Are you an ML engineer?

  • @pradeepbansal23 • 2 months ago

    But is it right to call this innovation? Can just training a million experts on task-specific facts really be called research?

    • @xt-89907 • 2 months ago +1

      It’s special because it swaps in those experts within a larger architecture. Related research on polysemanticity also suggests that sparsity will enhance explainability and steer ability