This is the best explanation video I can find for Dueling DQN!
Agreed
This is exactly what I wanted to write.
Same!! Fantastic video!
This is a clear, careful, and organized presentation of dueling DQNs that you need to see if you're new to these networks!
Thank you so much, Andrew for your video.
I read the paper and was so confused about the intuition behind the aggregation step that subtracts the mean of the action advantages. Now I get it in just the first 4 minutes of this video. Amazing explanation, clearly illustrates the intuition, thank you so much!
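For anyone else puzzling over that step, here is a minimal PyTorch-style sketch of the mean-subtracting aggregation being described (layer sizes and names are placeholders of my own, not code from the video or the paper):

```python
import torch.nn as nn

class DuelingHead(nn.Module):
    """Separate value and advantage streams, combined as Q = V + (A - mean(A))."""
    def __init__(self, feature_dim, num_actions):
        super().__init__()
        self.value = nn.Linear(feature_dim, 1)                # V(s)
        self.advantage = nn.Linear(feature_dim, num_actions)  # A(s, .)

    def forward(self, features):
        v = self.value(features)       # shape (batch, 1)
        a = self.advantage(features)   # shape (batch, num_actions)
        # Subtracting the per-state mean advantage makes the V/A split
        # identifiable and keeps the advantages centered around zero.
        return v + (a - a.mean(dim=1, keepdim=True))
```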
Phenomenal explanation. Very illustrative and concise. Thank you!
Best explanation
This video did not exactly answer the question I came here for, but it gave me several new ideas that help me understand Dueling DQN better. Good work, mate.
Amazing video, thank you very much for the explanation!
Excellent explanation man. Thanks a lot for your effort.
Very, very good explanation. Now I'm hyped to implement it :)
Good video, deserves more publicity man :)
❤ Such a good tutorial 👍
Great video, very clear explanation of the advantage of dueling networks! Thanks
Thanks
Thanks a lot
This is excellent
Thanks for the awesome video! One question though: I didn't quite understand why the mean term acts as a regularizer at 6:55. I understand that A - A.mean() would have values around 0, but I don't see why A.mean() itself enables the layer output A to be centered around 0. Could you briefly explain it if possible? Thank you :)
I was confused about this too, but if you take the gradient of any action's Q-value with respect to the advantage outputs, the entries turn out to sum to zero, so SGD updates leave the mean of the advantages unchanged and they stay centered around 0.
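To make that reply concrete, here is a tiny autograd check (a standalone sketch, not code from the video): for Q = V + A[a] - mean(A), the gradient of any single Q with respect to the advantage vector has entries that sum to zero, so an SGD step through that Q cannot move the mean of the advantages.

```python
import torch

num_actions = 4
A = torch.randn(num_actions, requires_grad=True)  # advantage outputs for one state
V = torch.randn(())                               # state value

a = 2                                             # any action index
q = V + A[a] - A.mean()                           # Q(s, a) with the mean subtracted
q.backward()

print(A.grad)        # (1 - 1/n) for the chosen action, -1/n for every other action
print(A.grad.sum())  # 0 (up to floating-point error): mean(A) is untouched by the update
```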
Andrew, can you please explain the *Implicit* Quantile Network (IQN)? arxiv.org/abs/1806.06923
There is literally no user-friendly explanation on the web as of November 2018, and I can't understand it (although I understood C51 and quantile-regression DQN).
It would be a great contribution & help!
Do you have the code for the visual example?
Can this be applied to PPO and other on-policy advantage-based methods by simply flipping the equation (instead of Q = A + V, you use A = Q - V)?
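For what it's worth, that flipped form is roughly how on-policy methods already estimate advantages: a return estimate stands in for Q, and the value baseline is subtracted from it. A minimal sketch, assuming a one-step TD return for simplicity (PPO implementations usually use GAE instead); the function name and arguments are my own illustration:

```python
import numpy as np

def one_step_advantages(rewards, values, next_values, dones, gamma=0.99):
    # One-step TD return as a stand-in estimate of Q(s, a): r + gamma * V(s')
    q_estimate = rewards + gamma * next_values * (1.0 - dones)
    # Advantage as the question describes: A = Q - V
    return q_estimate - values

advantages = one_step_advantages(
    rewards=np.array([1.0, 0.0]),
    values=np.array([0.5, 0.2]),
    next_values=np.array([0.3, 0.0]),
    dones=np.array([0.0, 1.0]),
)
print(advantages)  # [ 0.797 -0.2  ]
```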
I find it unfortunate that it's called "dueling" because it's more cooperative, like a pilot (advantage) and a navigator (value).