Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

  • Published on Nov 7, 2024

Comments • 37

  • @kyuhyoungchoi · 5 years ago · +5

    I really love the way you explain. You use very standard language that the old pre-deep-learning guys are familiar with.

  • @Vroomerify · 2 years ago

    This was great! Your explanation of batch normalization is by far the most intuitive one I've found.

  • @lakerstv3021 · 5 years ago · +8

    Love your content! Some of the best explanations on the internet. Would be amazing if you could go through the Neural ODE or Taskonomy paper next.

  • @madhavimehta6010 · 2 years ago

    Thanks for putting in the effort to explain this in a simpler manner.

  • @deepakarora987 · 5 years ago · +6

    This is a really cool explanation, would love to hear more from you.

  • @dvirginz4001 · 5 years ago · +1

    That's the best video on YouTube about batchnorm, thanks for going over the paper.

  • @KulkarniPrashant · 4 months ago

    Absolutely amazing explanation!

  • @黃一-h6b · 4 years ago · +1

    Thank you so much for your intro.
    I had a hard time grasping the concepts, this helped a lot. Thank you :)

  • @ross825 · 5 years ago · +3

    Why don’t you have more subscribers, so helpful!!!

  • @payalbhatia5244 · 4 years ago · +1

    @Yannic Kilcher Thanks, it has been the best explanation. You simplified the maths as well. Would request you to explain all recent papers in the same way. Thanks!

  • @Konstantin-qk6hv · 2 years ago

    Thank you for the review. I like to watch your videos instead of reading the papers.

  • @manasagarwal9485 · 4 years ago · +4

    Hi, thanks a lot for these amazing paper reviews. Can you make a video about layer normalization as well, and why it is more suited for recurrent nets than batch normalization?

  • @tudor6210 · 4 years ago

    Thanks for going over the paper!

  • @MLDawn · 2 years ago

    Absolutely beautiful, man. I love how you mentioned the model.train() and model.eval() coding implications as well. Out of curiosity: 1) What software are you using to show the paper (not Adobe, right)? 2) What kind of drawing pad are you using? I have a Wacom, but since I cannot see what I'm doing on it, it is really annoying to teach with.
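
    On the model.train() / model.eval() point: PyTorch's BatchNorm layers normalize with the current mini-batch's statistics in training mode and with the stored running averages in eval mode. A minimal sketch of that behaviour (variable names are illustrative):

        import torch
        import torch.nn as nn

        torch.manual_seed(0)
        bn = nn.BatchNorm1d(num_features=4)    # one learnable gamma/beta pair per feature
        x = torch.randn(8, 4) * 3.0 + 5.0      # a batch that is neither zero-mean nor unit-variance

        bn.train()                             # model.train(): normalize with this batch's mean/var
        y_train = bn(x)                        # (also updates running_mean / running_var)

        bn.eval()                              # model.eval(): reuse the stored running statistics,
        y_eval = bn(x)                         # so the output no longer depends on the rest of the batch

        print(y_train.mean(0), y_train.std(0))     # roughly 0 and 1 per feature
        print(bn.running_mean, bn.running_var)     # averages accumulated while in train mode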

  • @Engidea · 3 years ago · +1

    What app are you using to edit and write on the paper?

  • @Laszer271 · 4 years ago · +1

    Batch Normalization doesn't reduce internal covariate shift, see: How Does Batch Normalization Help Optimization? arXiv:1805.11604

  • @BlakeEdwards333 · 5 years ago · +1

    Awesome thanks!

  • @shinbi880928 · 5 years ago

    I really like it, thank you! :)

  • @wolftribe66 · 5 years ago

    Could you make a video about group normalization from FAIR?

  • @matthewtang1489 · 4 years ago · +3

    Wouldn't it be cool if some professors made the students derive these derivatives on a test =)

    • @hoangnhatpham8076 · 3 years ago

      I had to do that in my DL exams. Just feedforward though, nothing this involved =)
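
    For the curious, the derivatives in question are the chain-rule gradients the paper works out in Section 3; transcribed into NumPy they look roughly like this (the function name and eps default are illustrative):

        import numpy as np

        def batchnorm_backward(dout, x, gamma, eps=1e-5):
            # dout is dL/dy for y = gamma * x_hat + beta, computed over a mini-batch of size m.
            m = x.shape[0]
            mu = x.mean(axis=0)
            var = x.var(axis=0)
            x_hat = (x - mu) / np.sqrt(var + eps)

            dgamma = np.sum(dout * x_hat, axis=0)    # dL/dgamma
            dbeta = np.sum(dout, axis=0)             # dL/dbeta

            dx_hat = dout * gamma
            dvar = np.sum(dx_hat * (x - mu) * -0.5 * (var + eps) ** -1.5, axis=0)
            dmu = np.sum(-dx_hat / np.sqrt(var + eps), axis=0) + dvar * np.mean(-2.0 * (x - mu), axis=0)
            dx = dx_hat / np.sqrt(var + eps) + dvar * 2.0 * (x - mu) / m + dmu / m
            return dx, dgamma, dbeta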

  • @abdulhkaimal0352 · 3 years ago

    Why do we normalize the data and then multiply by gamma and add beta? I understand that it produces a better distribution of the data, but can't we just multiply by gamma and add beta without even doing the normalization part?
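
    A small numerical sketch of the trade-off the question is about (NumPy; the gamma/beta values are purely illustrative): normalizing first pins the output's mean and variance to beta and gamma^2 no matter how the incoming activations drift, whereas gamma * x + beta on its own would still inherit whatever mean and variance x happens to have; and since gamma = sqrt(var + eps), beta = mu recovers the identity exactly, the normalization step costs no representational power.

        import numpy as np

        rng = np.random.default_rng(0)
        x = rng.normal(loc=5.0, scale=3.0, size=(256, 1))   # one feature, mini-batch of 256
        eps = 1e-5

        mu, var = x.mean(0), x.var(0)
        x_hat = (x - mu) / np.sqrt(var + eps)    # normalization: mean 0, variance 1

        gamma, beta = 2.0, -1.0                  # learned per-feature scale and shift
        y = gamma * x_hat + beta                 # the full batch-norm transform

        print(y.mean(0), y.var(0))               # ~[-1.] and ~[4.]: set by beta and gamma, not by x

        identity = np.sqrt(var + eps) * x_hat + mu   # choose gamma = sqrt(var + eps), beta = mu
        print(np.allclose(identity, x))              # True: the layer can undo the normalization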

  • @AvielLivay · 4 years ago

    But why do you need the gamma/beta? What's wrong with just shifting to mean 0, variance 1? And also, how do you train them? You mean they are part of the network and so they are trained, but I thought we wanted things not to be shaky, yet you are actually adding parameters that add to the 'shakiness'... what's the point?

    • @rbhambriiit · a year ago

      The idea is to learn the better representation: identity, normalized, or something in between. Think of it as data preprocessing.

    • @rbhambriiit · a year ago

      Agreed, it does call into question the original hypothesis/definition of normalization at the input layer as well.

    • @rbhambriiit · a year ago

      It's not that shaky. It's another layer trying to learn better data dimensions. With images, identity mappings work well, so the batch-norm parameters can learn to effectively reverse the mean/variance shift.
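
    On the "how do you train them" part: gamma and beta are ordinary learnable parameters that receive gradients through backprop like any other weight. A minimal PyTorch sketch (the loss and shapes are illustrative):

        import torch
        import torch.nn as nn

        torch.manual_seed(0)
        bn = nn.BatchNorm1d(3)                   # bn.weight is gamma, bn.bias is beta
        x = torch.randn(16, 3)
        target = torch.randn(16, 3)

        loss = (bn(x) - target).pow(2).mean()    # any downstream loss
        loss.backward()

        print(bn.weight, bn.weight.grad)         # gamma: initialized to 1, updated by the optimizer
        print(bn.bias, bn.bias.grad)             # beta: initialized to 0, updated the same way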

  • @adhiyamaanpon4168 · 4 years ago

    Can someone clear up the following doubt: will the gamma and beta values be different for each input feature in a particular layer?
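
    Per the paper, yes: there is one gamma/beta pair per feature (and one per feature map for convolutional layers). A quick way to check in PyTorch:

        import torch.nn as nn

        bn_fc = nn.BatchNorm1d(128)     # fully-connected layer with 128 features
        bn_conv = nn.BatchNorm2d(64)    # convolutional layer with 64 feature maps

        print(bn_fc.weight.shape, bn_fc.bias.shape)      # torch.Size([128]) each: one gamma/beta per feature
        print(bn_conv.weight.shape, bn_conv.bias.shape)  # torch.Size([64]) each: one gamma/beta per channel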

  • @yamiyagami7141 · 5 years ago · +1

    Nice video! You might also want to check out "How Does Batch Normalization Help Optimization?" (arxiv.org/abs/1805.11604), presented at NeurIPS18, which casts doubt on the idea that batchnorm improves performance through reduction in internal covariate shift.

  • @yasseraziz1287 · 4 years ago · +1

    YOU DA MAN
    LONG LIVE YANNIC KILCHER

  • @dlisetteb · 3 years ago

    I really can't understand it.

  • @michaelcarlon1831 · 5 years ago · +1

    All of the cool kids use SELU

  • @garrettosborne4364 · 3 years ago

    You got lost in the weeds on this one.

    • @rbhambriiit · a year ago

      The backprop was a bit unclear and perhaps the hardest bit.

  • @ssshukla26 · 4 years ago · +2

    An explanation almost as complicated as the one in the paper, if not more so.

  • @stefanogrillo6040 · 9 months ago

    lol