ML Interpretability: feature visualization, adversarial example, interp. for language models

Umar Jamil

7,053 views

Published on Sep 20, 2024
In this video, I introduce Machine Learning Interpretability, a vast field that aims to understand the inner mechanisms by which machine learning models make their predictions, so that we can debug them and make them more transparent and trustworthy.
I will start by reviewing deep learning and the back-propagation algorithm, which are necessary for understanding adversarial example generation and feature visualization for computer vision classification models. In the second part, I will show how to take the knowledge built in the first part and apply it to language models. In particular, we will see how to gain insight into the biases of a language model by generating a prompt that maximizes the likelihood that the next token is a concept of our choice. This allows us to answer questions like:
"What does my language model think of women?"
"What does my language model think of minorities?"
This video was made in collaboration with Leap Labs, an AI research lab focused on machine learning interpretability. They built the Leap Labs Interpretability Engine, which gives insight into how computer vision models work and how to improve them by generating prototypes, isolating features, and understanding entanglement between classes.
Leap Labs: www.leap-labs....
Leap Labs Tutorials: docs.leap-labs...
As usual, the code and PDF slides are available at the following links:
- PDF slides: github.com/hkp...
- Adversarial Example Generation (tricking a classifier): github.com/hkp...
- Generate inputs for language models: github.com/jes...
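
To make the idea of tricking a classifier concrete, here is a minimal sketch in plain Python. It is not the code from the linked repositories: the tiny logistic-regression "classifier", its weights, the input values, and the step size `epsilon` are all hypothetical, and the perturbation rule is a fast-gradient-sign style step chosen for illustration. The point it shows is the one the video builds on: the gradient of the loss with respect to the input (rather than the weights) tells us how to nudge the input to change the prediction.

```python
import math

# Hypothetical fixed weights of a toy binary classifier (logistic regression).
WEIGHTS = [2.0, -1.0, 0.5]
BIAS = 0.1


def predict(x):
    """Return P(class = 1 | x) for the toy classifier."""
    z = sum(w * xi for w, xi in zip(WEIGHTS, x)) + BIAS
    return 1.0 / (1.0 + math.exp(-z))


def loss_grad_wrt_input(x, label):
    """Gradient of the cross-entropy loss with respect to the input x.

    For logistic regression this is (p - label) * w, which is what
    back-propagation would return for this one-layer model.
    """
    p = predict(x)
    return [(p - label) * w for w in WEIGHTS]


def sign(v):
    """Return -1, 0, or 1 depending on the sign of v."""
    return (v > 0) - (v < 0)


def adversarial_example(x, label, epsilon):
    """Shift every feature by epsilon in the direction that increases the loss."""
    grad = loss_grad_wrt_input(x, label)
    return [xi + epsilon * sign(g) for xi, g in zip(x, grad)]


x = [0.5, 0.2, 0.1]  # assumed input, classified as class 1
x_adv = adversarial_example(x, label=1, epsilon=0.4)

print("original prediction:   ", predict(x))
print("adversarial prediction:", predict(x_adv))
```

With these assumed numbers, the original input is assigned to class 1 (probability above 0.5) and the perturbed input falls below 0.5, even though no feature moved by more than `epsilon`. Flipping the direction of the step (descending the loss for a chosen target class) gives the gradient-ascent view behind feature visualization.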

Comments • 43

@sauravrao234 4 months ago ⁺²⁵
You are one incredibly underrated YouTuber
@denishclarke4470 4 months ago ⁺¹
Agreed
@PongsiriHuang 2 months ago ⁺¹
Yup, just found him a few days ago. Definitely underrated.
@arpitanand4693 a month ago ⁺¹
And one hell of a teacher
@JatinKashyap-Innovision 4 months ago ⁺⁸
Can't understand why this channel is free! Thanks a lot for all the content, keep it flowing.
@alivecoding4995 2 days ago ⁺¹
I am very thankful for your high-quality content! 😊
@mosca204 8 days ago ⁺¹
One of the best videos on YouTube. Please do I-JEPA next. And keep on publishing videos and code.
@nmxnunezz8214 2 months ago ⁺³
Andrej Karpathy liked a tweet where some dude said your video on diffusion models was incredibly underrated. You are going to make it far!
@4thlord51 4 months ago ⁺⁵
You are a great teacher.
@Trending-lc6kc 4 months ago ⁺¹
Bruh, I was just looking for this topic and got the notification for this video. Thanks dude
@Wing-sv6ps a month ago ⁺¹
Keep up the amazing work!
@jueying1443 3 months ago ⁺¹
Thanks, could you talk about flash attention?
@hajaani6417 4 months ago ⁺¹
As always, I salute you for this awesome video, keep up the good work 👍
@DoppiaDx 4 months ago ⁺¹
Always interesting topics. Thank you so much
@MENGRUWANG-qk1ip 2 months ago
Hello! I was wondering if the blogger might be interested in Microsoft's recently released Graph RAG algorithm. I'm hoping you could do a video explaining it; your explanations are always so excellent!
@umarjamilai 2 months ago
"Blogger" got mistranslated 😁 I'll consider it
@RadRebel4 4 months ago ⁺¹
Amazing video! Could you also include a training script for the video you made about the transformer model, for general LLM tasks? The earlier one was about translation only.
@oiooio7879 3 months ago ⁺¹
Thank you for this video!
@abhijitrai1349 2 months ago
How do you stay up to date on data science research papers?
@ahmedmohamedabdelhameedabd295 4 months ago ⁺¹
Amazing, interesting topic.
@agiagiagitk 4 months ago
Any plans for a video on 'Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models'? It'd be awesome with your explanation of the math ~.~
@mahyarkarami6572 4 months ago ⁺¹
I thought for a moment that Satya was coming to explain ;)
@umarjamilai 4 months ago ⁺¹
🤓🤓🤓
@daleanfer7449 4 months ago ⁺¹
Great videos!
@Research_work02 4 months ago
Thanks a lot for this! A request: please add the training script to the Stable Diffusion repository, it would be of great help!! Thank you!
@AhmedMohamed-nh2hs 3 months ago ⁺¹
Thank you for this!
@sonned9843 2 months ago ⁺¹
God bless you, you are amazing
@johanvandermerwe7687 3 months ago
Your videos are fantastic, blessings from South Africa 🙌
@umarjamilai 3 months ago
Thank you very much for your support!
@Simplifieddeeplearning 3 months ago
Can you make a tutorial video on a model like Perplexity that uses live web search?
@justrax8466 4 months ago
Please sir, make a complete course on LLM engineering 😊
@Wenming. 3 months ago ⁺¹
Cool tutorial ❤
@Simplifieddeeplearning 3 months ago
Hello sir, can you please make a tutorial on PyTorch so we can follow along with your PyTorch projects? Thank you in advance
@elieelezra2734 3 months ago ⁺¹
Good vid boss
@usr-34-gambaman 2 months ago
Does Leap Labs provide open-source libraries?
@umarjamilai 2 months ago
You can play with the LLM interpretability notebook, which is open source. Link in the description
@olympus8903 4 months ago
The architecture is kind of similar to Stable Diffusion. Stable Diffusion generates an image from text. I am not saying it is exactly the same, but it is kind of similar: both generate images or features from noise.
@johanvandermerwe7687 3 months ago
Thanks
@bibhutibaibhavbora8770 3 months ago
When is the new video coming?
@umarjamilai 3 months ago ⁺¹
Working on it ;)
@bibhutibaibhavbora8770 3 months ago
@umarjamilai Very much excited ❤️
@Wenming. 3 months ago
Thank you!
