Mistral 7B: Smarter Than ChatGPT & Meta AI - AI Paper Explained

  • Published Sep 8, 2024
  • An actually big week in AI: Mistral 7B’s performance demonstrates what small models can do with enough conviction. The new 7B LLaMA killer? Tracking the smallest models scoring above 60% on MMLU is quite instructive: in two years it went from Gopher (280B, DeepMind, 2021), to Chinchilla (70B, DeepMind, 2022), to Llama 2 (34B, Meta, July 2023), and now to Mistral 7B. Mistral 7B is released under the Apache 2.0 license. This AI startup actually "Open"s AI, and it is more efficient than ChatGPT.
    #openai #chatgpt #llama2
    / harrymapodile
    Mistral 7B -
    Report: mistral.ai/new...
    Blog: mistral.ai/new...
    Github: github.com/mis...
    Magnet: x.com/MistralA...
    Longformer: The Long-Document Transformer -
    Paper: arxiv.org/pdf/...
    Mistral AI blows in with a $113M seed -
    TechCrunch: tcrn.ch/3CrOhTC
    Tweets -
    / 1707430998600831424

Comments • 8

  • @coalhater392 • 11 months ago • +15

    Don't really have anything to add so I'm just commenting for the algorithm hope it helps.

    • @harrymapodile • 11 months ago • +3

      Thanks a ton!

  • @jaysonp9426 • 10 months ago

    This is excellent. Please keep making videos. This is extremely high quality. New subscriber 👍

    • @harrymapodile • 9 months ago

      thanks for the support

  • @talroitberg5913 • 11 months ago

    I have a general question on LLM attention. So, if I understand correctly, when an LLM generates a token, it has layers that attend to other words in the context window, and add information about relevant words into the latest token. Say the context window is n tokens, then token n has a lot of info about the rest of the context window, and token n+1 is the output for this iteration.
    To generate token n+2, you give the LLM a new prompt consisting of tokens 2 to n+1 (i.e. ending with what the LLM just generated in the previous step).
    My question is -- does this new prompt keep all the self-attention information from the previous cycle, or do they discard it and start from scratch? Intuitively it seems like you could save a lot of time by caching the results of the last attention layer (or wherever you find the end result of all the work the hidden layers did last time around). Do they do that? If not, do you know why not? Or do some LLMs do it, and others don't?

    • @mohl-bodell2948 • 10 months ago • +1

      Yes, during inference caching is standard: the attention keys and values from earlier steps (the KV cache) are kept around, so each new token only computes its own keys and values. See the sketch below.
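
      A minimal sketch of that KV caching idea (the class and variable names here are illustrative, not from any particular library): keys and values computed for earlier tokens are stored and reused, so each decoding step only runs the projections for the newest token.

import torch
import torch.nn.functional as F

class TinyAttention(torch.nn.Module):
    """Single-head attention that accepts a cache of past keys/values."""
    def __init__(self, d_model: int = 64):
        super().__init__()
        self.q = torch.nn.Linear(d_model, d_model)
        self.k = torch.nn.Linear(d_model, d_model)
        self.v = torch.nn.Linear(d_model, d_model)

    def forward(self, x_new, cache=None):
        # x_new: (batch, 1, d_model) -- embedding of the newest token only.
        q = self.q(x_new)
        k_new, v_new = self.k(x_new), self.v(x_new)
        if cache is not None:
            # Reuse keys/values computed in earlier steps instead of
            # re-running the projections over the whole context.
            k = torch.cat([cache["k"], k_new], dim=1)
            v = torch.cat([cache["v"], v_new], dim=1)
        else:
            k, v = k_new, v_new
        attn = F.softmax(q @ k.transpose(-2, -1) / k.shape[-1] ** 0.5, dim=-1)
        out = attn @ v
        return out, {"k": k, "v": v}  # updated cache for the next step

# Usage: feed one token at a time, carrying the cache forward.
layer = TinyAttention()
cache = None
for step in range(5):
    x_new = torch.randn(1, 1, 64)     # stand-in for the latest token's embedding
    out, cache = layer(x_new, cache)  # only the new token's K/V work per step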

  • @jamesoukassou7544 • 9 months ago

    lol, it doesn't even compare. Sorry, but Mistral doesn't even respond to the simplest questions. I asked it to parse some information from a given text that was way, way simpler than what I actually gave to ChatGPT, and it failed hard.

    • @harrymapodile • 9 months ago • +1

      Haha yeah, ChatGPT is always updating. GPT-4 Turbo just went live in ChatGPT four days ago. What makes Mistral impressive is its density and how well it quantizes, plus the Apache license, which makes it friendlier than Llama 2. If you're just consuming, ChatGPT is perfect; but as a platform for building and SFT, Mistral is king right now.
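
      To make the "building on Mistral" point concrete, here is a minimal sketch of loading Mistral 7B with 4-bit quantization; it assumes the Hugging Face transformers + bitsandbytes + accelerate stack and is not something shown in the video.

from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "mistralai/Mistral-7B-v0.1"  # Apache-2.0-licensed base model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),  # 4-bit weights via bitsandbytes
    device_map="auto",  # place layers on available GPU(s)/CPU
)

prompt = "Explain in one sentence why an Apache 2.0 license matters for builders."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))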