Please note, with the automatic dubbing from YouTube/Google you hear a synthetic voice in your regional language. To hear my original voice in English, switch to "Default" or "English" in the settings. Thank you.
Byte-level LLMs are obviously the way forward for that first round of training where you're predicting 1..n tokens given the prefix, particularly for multi-language models. Tokenization is clearly a hack, like in the dark ages of image neural networks, where we would hand-craft feature detection kernels.
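Not the OP, but to make the contrast concrete, here is a minimal Python sketch of the two input representations being compared. It assumes the tiktoken package for the BPE side; any BPE tokenizer would show the same effect.

```python
import tiktoken  # assumed available; any BPE tokenizer illustrates the same point

text = "héllo wörld"

# Byte-level: the model sees raw UTF-8 bytes. The vocabulary is just 256 values
# and every language/script is covered without a learned vocabulary.
byte_ids = list(text.encode("utf-8"))
print(byte_ids)

# BPE: the model sees ids from a learned subword vocabulary, which can fragment
# non-English text into more (and less predictable) pieces.
enc = tiktoken.get_encoding("cl100k_base")
print(enc.encode(text))
```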
Thank you so much for covering this paper! I had been thinking about this specific implementation for a year, and I believe it's a significant step towards a truly general learning architecture that minimizes hand-crafted human priors.
Brother, you are amazing.
Thank you for doing this.
Can you clarify whether the pre-training will have to use the BLT embeddings? I.e., unless models pre-trained with BLT start appearing on Hugging Face or elsewhere, we mere mortals won't be able to take advantage of this new method?
very very cool
Does the small transformer use BPE then? And in H(x_i), is it finding the cross-entropy? 26:13
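As I read the paper, the small transformer works directly on bytes (no BPE), and H(x_i) is the entropy of its next-byte distribution rather than a cross-entropy against labels; patch boundaries are placed where that entropy crosses a threshold. A rough Python sketch of the idea (not the paper's code; the names and the threshold are placeholders):

```python
import numpy as np

def entropy(p, eps=1e-12):
    # H(x_i) = -sum_v p(v | x_<i) * log p(v | x_<i), over the 256 byte values
    return float(-(p * np.log(p + eps)).sum())

def patch_boundaries(next_byte_probs, threshold):
    # next_byte_probs: array of shape (seq_len, 256), one next-byte
    # distribution per position, produced by the small byte-level LM
    boundaries = [0]
    for i, p in enumerate(next_byte_probs):
        if entropy(p) > threshold:  # high uncertainty -> start a new patch here
            boundaries.append(i)
    return boundaries
```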
I'm having a plant-based BLT right now
BLT seems like the way to go in an ideal world, but there are definitely problems with it. I think tokenizers have accomplished tremendous work, and we got to this state thanks to improving vocab sizes and tokenization mechanisms. From this point, though, we may have the technology and resources to try BLT on a model (I still don't think it would work that much better).
Can you expand on the 'definitely problems' with it?
Bacon Lettuce Tomato