Yuan Cao - Understanding Deep Learning Through Phenomena Discovery and Explanation

One world theoretical machine learning

Views: 914

Published on Feb 6, 2025
Abstract: Deep learning has achieved great success in many applications. However, this success is not yet well understood in theory. In this talk, I will discuss some recent efforts to bridge the gap between theory and practice through phenomenon discovery and explanation. In the first part of the talk, I will discuss the phenomenon of “benign overfitting” in deep learning, and present our recent results characterizing benign and harmful overfitting in training convolutional neural networks (CNNs). In the second part of the talk, I will discuss the recently discovered generalization gap between Adam and stochastic gradient descent (SGD) in image classification tasks. I will present an intuitive explanation for this generalization gap and provide a rigorous theoretical guarantee to support the explanation. Overall, this talk will provide insights into the “feature learning” procedure of neural networks, and how it is related to various interesting phenomena in deep learning.
