Are Claude 3.5 Sonnet, Llama-3 and Gemini choosing speed over quality?

  • Published Jul 11, 2024
  • In this video Chris looks at how model providers are trending toward grouped-query attention (GQA) rather than traditional multi-head attention (MHA) in transformer models, and how this is affecting output in areas such as summarization. He shows that you get more coherent output from models such as Llama-2 or Claude 3 Opus than from newer models such as Llama-3, Gemini, or Gemma. In the end, in certain scenarios such as summarization or generative content, GPT-4o still beats Sonnet.
    repo
    github.com/chrishayuk/mha_gqa...
  • Science & Technology
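
For context while reading the comments below, here is a minimal, illustrative sketch (not taken from the linked repo) of the MHA vs GQA difference the video is about: MHA keeps one key/value head per query head, while GQA shares each key/value head across a group of query heads, shrinking the K/V projections and KV cache at a possible cost in output quality. All shapes, names, and head counts here are assumptions for illustration.

```python
# Illustrative sketch (PyTorch): grouped-query attention (GQA) vs multi-head
# attention (MHA). In MHA every query head has its own key/value head; in GQA
# several query heads share one key/value head.
import torch
import torch.nn.functional as F

def attention(q, k, v, n_heads, n_kv_heads):
    """q: (batch, seq, n_heads*head_dim); k, v: (batch, seq, n_kv_heads*head_dim)."""
    b, s, _ = q.shape
    head_dim = q.shape[-1] // n_heads

    # Split into heads: (batch, heads, seq, head_dim).
    q = q.view(b, s, n_heads, head_dim).transpose(1, 2)
    k = k.view(b, s, n_kv_heads, head_dim).transpose(1, 2)
    v = v.view(b, s, n_kv_heads, head_dim).transpose(1, 2)

    # GQA: each K/V head serves a group of query heads, so repeat it per group.
    # When n_kv_heads == n_heads the group size is 1 and this is ordinary MHA.
    group = n_heads // n_kv_heads
    k = k.repeat_interleave(group, dim=1)
    v = v.repeat_interleave(group, dim=1)

    scores = (q @ k.transpose(-2, -1)) / head_dim ** 0.5
    out = F.softmax(scores, dim=-1) @ v
    return out.transpose(1, 2).reshape(b, s, n_heads * head_dim)

# 8 query heads with 8 KV heads (MHA) vs 8 query heads sharing 2 KV heads (GQA).
b, s, d = 1, 4, 64
q = torch.randn(b, s, 8 * d)
kv_mha = torch.randn(b, s, 8 * d)   # full-width K/V projection
kv_gqa = torch.randn(b, s, 2 * d)   # 4x smaller K/V projection and KV cache
print(attention(q, kv_mha, kv_mha, n_heads=8, n_kv_heads=8).shape)  # torch.Size([1, 4, 512])
print(attention(q, kv_gqa, kv_gqa, n_heads=8, n_kv_heads=2).shape)  # torch.Size([1, 4, 512])
```

With n_kv_heads equal to n_heads the same function reduces to plain MHA, which is why providers can swap GQA in for speed and memory savings without changing the rest of the transformer block.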

Comments • 14

  • @makepeace88
    @makepeace88 8 days ago +1

    I just attended a detailed anatomy-of-an-LLM session.. and it’s just wow! Nobody else is explaining these details. Thanks very much Chris ❤

    • @chrishayuk
      @chrishayuk  8 days ago

      Glad it was useful. I skipped a lot of details, as I wanted to keep the focus on MHA vs GQA. I'll probably do some other videos on the other details.

  • @trsd8640
    @trsd8640 9 days ago +1

    Great video! I didn’t understand it fully and had to watch it again, but I‘m getting an idea of what is happening! Thank you!

    • @chrishayuk
      @chrishayuk  9 days ago +2

      It was quite a tough one to record: I was trying to avoid explaining the entire transformer architecture and attention in full (I'll do that in another video), while doing just enough to show how this architectural change affects model output. It was a weird balance, and apologies that I never explained it enough.

  • @danielhenderson7050
    @danielhenderson7050 9 days ago +2

    This was very interesting

    • @chrishayuk
      @chrishayuk  9 days ago

      Glad you enjoyed, definitely a fun rabbit hole

  • @everyhandletaken
    @everyhandletaken 9 days ago +1

    Interesting!
    Claude 3.5 Sonnet is definitely great for code, much better than GPT-4o, and has really helped me solve things that are well beyond my brain capacity in the last few days.

    • @chrishayuk
      @chrishayuk  9 days ago

      totally agree, much better for code than gpt-4o

  • @Leo-ph7ow
    @Leo-ph7ow 10 days ago +2

    Excellent content! Thanks!

    • @chrishayuk
      @chrishayuk  10 days ago

      Glad you liked it!

  • @seanknowles9985
    @seanknowles9985 9 days ago

    Intel agencies are having their fill first. It's obviously being slowed down so three-letter agencies can get ahead of this.

    • @chrishayuk
      @chrishayuk  9 days ago

      lol, I'm sure three-letter agencies are having their say, but I suspect it's not about MHA vs GQA. Would love to hear that conversation if they were.