How the Gemma/Gemini Tokenizer Works - Gemma/Gemini vs GPT-4 vs Mistral
- Published Feb 24, 2024
- In this video, we go under the hood of the Gemini, Gemma-7B, and Gemma-2B tokenizer. We look at the large vocabulary and the impact it has on the size of the model, and at how Google has prioritized people, places, culture, languages, and things over an efficient vocabulary of frequent sub-words. In this video Chris introduces his new tokenizer benchmark test, dataset, and tokenizer visualizer tools.
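As a rough illustration of the vocabulary-size point above: the token-embedding table scales linearly with vocabulary size, so a ~256k-entry vocabulary (Gemma's reported size) costs several times more embedding parameters than Mistral's ~32k. The vocab sizes and hidden dimensions below are approximate public figures, and the arithmetic is just a sketch, not an exact parameter count.

```python
# Back-of-the-envelope cost of the embedding table: vocab_size * hidden_dim
# parameters (a tied output head would double-count, so it's ignored here).
# Vocab sizes and hidden dims are approximate public figures, not exact.
def embedding_params(vocab_size: int, hidden_dim: int) -> int:
    """Parameters in the token-embedding matrix alone."""
    return vocab_size * hidden_dim

models = {
    "gemma-7b":   (256_000, 3072),  # ~256k vocabulary
    "mistral-7b": (32_000, 4096),   # ~32k vocabulary
}

for name, (vocab, dim) in models.items():
    print(f"{name}: ~{embedding_params(vocab, dim) / 1e6:.0f}M embedding params")
# gemma-7b's embedding table ends up roughly 6x larger despite its
# smaller hidden dimension
```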
github
---------------
github.com/chrishayuk/tokeniz... - Science & Technology
Well, it's 2am, and I can't wait to watch your other videos. I am building some RAG implementations with scientific journals from PDF, and feeling like I'm going in circles. Taking a step back and considering the bigger concepts is helping. Great format for learning, I really appreciate your time!
glad you're enjoying it, you might wanna check out my RAG video, and listen to my stoopid poems
Great video. Companion piece to Andrej Karpathy's most recent. Very insightful. Thanks!
Thank you, glad it’s useful. This one was a video I’ve been trying to get right for a while
Thanks Chris. Very interesting how they have chosen the vocabulary. For representation of programs in Python, how do they tokenise the white-space? I’m looking forward to the video on embedding.
it's a similar approach to llama, because not every language separates words using whitespace. i'll maybe cover that in a future video. i will update the programming languages in the dataset; i didn't have time to merge all the other versions back in (where python was covered)
This is wonderful! The dataset alone is super useful to have, and the video walkthrough was really awesome for someone who's just trying to understand what's what here :D Please keep on doing what you're doing! One thing I have been interested in is visualizing the entire vocabulary inside a tokenizer to actually see what's inside, but have it be done in an easy-to-explore way. I tried word clouds and they didn't work at all. Do you have any ideas?
I'm also super interested in fine-tuning models to teach them another language and using agents, but not just looking at code for 30 mins. Specific, real-world use cases with applied examples. I think YouTube is really lacking that at the moment.
P.S: Cool glasses :)
thank you, glad it's useful. you might find my next video on embeddings useful for visualization (no spoilers :). as for fine-tuning: i recently downloaded a lot of english-welsh translations and was planning to do a video on that. i was going to use llama2-7b as i know it doesn't do welsh. i might do it with gemma, but i'm not sure if it does welsh already. regardless, i'll be doing a language fine-tune video soon
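On the vocabulary-visualization question above: one simple alternative to word clouds (which collapse at a quarter-million entries) is to bucket the vocabulary and drill down, rather than plot everything at once. This is a hypothetical sketch with a toy vocabulary, not the visualizer tool from the video:

```python
# Sketch: make a huge vocabulary explorable by grouping tokens into
# buckets (here, by character length) and browsing bucket by bucket.
from collections import defaultdict

# toy stand-in for a real tokenizer's vocabulary
vocab = ["the", "ing", "\u2581hello", "\u2581Paris", "\u2581def"]

def bucket_by_length(tokens):
    """Group tokens by character length for drill-down browsing."""
    buckets = defaultdict(list)
    for tok in tokens:
        buckets[len(tok)].append(tok)
    return dict(buckets)

for length, toks in sorted(bucket_by_length(vocab).items()):
    print(f"len {length}: {len(toks)} tokens, e.g. {toks[:3]}")
```

The same grouping idea works for other keys, such as Unicode script or leading-▁ status, which tends to surface the language and culture coverage the video discusses.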
Commenting cuz I know Chris will give me a heart :)
because i love you all