The inner workings of LLMs explained - VISUALIZE the self-attention mechanism

  • Published 6 Oct 2024
  • HOW do LLMs (Large Language Models) work, and WHY do they work?
    Models like ChatGPT or GPT-4. Can we understand them?
    An easy introduction to:
    1. How does the self-attention mechanism work inside LLMs?
    2. What makes all those LLMs different: their weights, their pre-training datasets, or their architectural design?
    3. What makes an LLM perform better (hardware/software), and how do you tune for the optimal number of layers and attention heads in the LLM architecture?
    Simple explanations of how Large Language Models (LLMs), or decoder-based Transformers in general, work. Plus LangChain and vector stores, with their corresponding vector embeddings, explained. Also suitable for beginners to AI.
    We focus only on the decoder stack of the transformer for LLMs and ignore, for the moment, RLHF (reinforcement learning from human feedback).
    Introducing Claude
    www.anthropic....
    Great new pre-print (all rights with authors):
    "AttentionViz: A Global View of Transformer Attention"
    by Catherine Yeh, Yida Chen, Aoyu Wu, Cynthia Chen, Fernanda Viégas, Martin Wattenberg
    arxiv.org/abs/...
    Read the documentation:
    catherinesyeh....
    Interactive demo:
    attentionviz.com/
    #ai
    #languagemodel
    #datascience
    #naturallanguageprocessing
    #gpt4
    #chatgpt
    #bard
    #vectors
    #vectorspaces
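The self-attention mechanism the video walks through can be sketched in a few lines of NumPy. This is a minimal single-head illustration with toy dimensions and random projection matrices (not the video's exact code): each token embedding is projected to queries, keys, and values; the attention weights are a row-wise softmax of the scaled query-key similarities; and a causal mask keeps every decoder token from attending to future positions.

```python
# Minimal single-head, causally masked self-attention sketch.
# Toy sizes and random weights, for illustration only.
import numpy as np

rng = np.random.default_rng(0)

seq_len, d_model, d_k = 4, 8, 8               # 4 tokens, toy dimensions
X = rng.standard_normal((seq_len, d_model))   # token embeddings

# Learned projection matrices (random stand-ins here)
W_q = rng.standard_normal((d_model, d_k))
W_k = rng.standard_normal((d_model, d_k))
W_v = rng.standard_normal((d_model, d_k))

Q, K, V = X @ W_q, X @ W_k, X @ W_v

scores = Q @ K.T / np.sqrt(d_k)               # (seq_len, seq_len) similarities

# Causal mask: a decoder token may attend only to itself and earlier tokens
mask = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
scores[mask] = -np.inf

# Numerically stable row-wise softmax
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)

output = weights @ V                          # (seq_len, d_k) attended values
print(weights.round(2))                       # each row sums to 1
```

Stacking several such heads side by side (and running the whole thing once per layer) is what the architectural questions above, about the number of layers and attention heads, are tuning.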

Comments • 16

  • @derejew109
    @derejew109 1 year ago +6

    I am already addicted to your intro "hello community", brilliant and authentic! Thank you, sir.

  • @MadhavanSureshRobos
    @MadhavanSureshRobos 1 year ago +13

    Wow! I can't believe the quality of this information! Best educational channel

  • @mehmetcandemir5035
    @mehmetcandemir5035 7 months ago

    This explanation is something else; it felt like a teacher was right in front of me. Thank you for this incredible work!

  • @bahramboutorabi5971
    @bahramboutorabi5971 1 year ago +1

    Great work. You made a complex concept easy to visualise and understand.

  • @JonathanYankovich
    @JonathanYankovich 1 year ago +1

    Fantastic. Man, I get so much out of these videos. And the delivery is great.

  • @MadhavanSureshRobos
    @MadhavanSureshRobos 1 year ago +1

    Can't wait for the data lakes video. I tried understanding the concept, but I wasn't sure, and I also wasn't able to run their code.

  • @vitaliiivanov9514
    @vitaliiivanov9514 1 year ago

    That's great! I didn't know there was such a tool available.

  • @norlesh
    @norlesh 1 year ago

    Would really like to see how much LLama2-7B could be reduced by optimizing every head of every layer using AttentionViz and transfer training the standard model weights to the reconfigured layer weights.

  • @jayhu6075
    @jayhu6075 1 year ago

    What an understandable explanation of how self-attention works; for me as a beginner this is a difficult topic. In this case you decided to choose ADA, what is the reason behind this?
    Is it possible to make a tutorial with examples such as a counselor's or lawyer's office with their own AI system, optimized for its task? Many thanks.

    • @code4AI
      @code4AI 1 year ago

      OpenAI only sells you token embeddings from a single, second-generation AI model: their ada-002. They specify that Notion also works with this model.
      See also:
      openai.com/blog/new-and-improved-embedding-model
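As a hedged illustration of what you do with such embeddings once the API has returned them: retrieval in vector stores typically ranks texts by cosine similarity between their embedding vectors. The 4-dimensional vectors below are made-up stand-ins (real text-embedding-ada-002 vectors have 1536 dimensions); only the comparison logic is the point.

```python
# Comparing ada-002-style embeddings with cosine similarity.
# The vectors are hypothetical 4-dim stand-ins for illustration;
# real ada-002 embeddings have 1536 dimensions.
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

emb_query = [0.1, 0.3, -0.2, 0.9]     # hypothetical embedding of a query
emb_doc_a = [0.1, 0.28, -0.19, 0.88]  # near-duplicate text -> similar vector
emb_doc_b = [-0.7, 0.1, 0.6, -0.1]    # unrelated text -> dissimilar vector

sim_a = cosine_similarity(emb_query, emb_doc_a)
sim_b = cosine_similarity(emb_query, emb_doc_b)
print(sim_a, sim_b)   # sim_a is close to 1.0, sim_b is much lower
```

A vector store does exactly this ranking, just over millions of stored vectors with an approximate nearest-neighbor index instead of a loop.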

  • @learnvik
    @learnvik 11 months ago

    Thanks, but I still don't understand how an LLM creates a response from a prompt. I understood how it picks words based on attention weights, but I still have no clue how the whole sentence gets generated. I think I am not capable of understanding it. Will try some other videos.
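For readers with the same question: the whole sentence is produced one token at a time. The model (attention and all) outputs a probability distribution for the *next* token only; that token is sampled, appended to the context, and the model runs again on the longer sequence until a stop token appears. The sketch below uses a hypothetical toy "model" (a lookup table of next-token probabilities) purely so the loop itself is visible; a real LLM computes those probabilities with its transformer layers.

```python
# Toy autoregressive generation loop. The "model" is a hypothetical
# bigram lookup table, but the loop is the same one an LLM runs:
# predict the next token, sample it, append it, repeat.
import random

random.seed(0)

# Hypothetical next-token distributions (an LLM computes these with
# self-attention over the whole context, not a table)
next_token_probs = {
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.7, "ran": 0.3},
    "dog": {"ran": 0.8, "sat": 0.2},
    "sat": {"<eos>": 1.0},
    "ran": {"<eos>": 1.0},
}

def generate(prompt, max_tokens=10):
    tokens = prompt.split()
    for _ in range(max_tokens):
        probs = next_token_probs[tokens[-1]]               # condition on context
        choices, weights = zip(*probs.items())
        nxt = random.choices(choices, weights=weights)[0]  # sample next token
        if nxt == "<eos>":                                 # stop token ends it
            break
        tokens.append(nxt)                                 # feed back in
    return " ".join(tokens)

print(generate("the"))
```

So attention decides *which* earlier words matter at each step, and this outer loop is what turns step-by-step predictions into a full sentence.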

  • @henkhbit5748
    @henkhbit5748 1 year ago

    Thanks, very interesting topic. Is the code for the interactive visualization not on GitHub?

  • @gileneusz
    @gileneusz 1 year ago

    3:18 this is a nice YouTube inception 😆

  • @chivesltd
    @chivesltd 1 year ago

    Any tutorial on how to code our own LLM and fine-tune it for a specialized task?

    • @code4AI
      @code4AI 1 year ago +1

      Yes, more than 30 videos on this channel.