I'm a beginner. It was your videos that started my journey. Thank you so much!
Nice! Good Luck!
Is there a way to quickly see what the finished machine has learned on the game screen?
I'm not too clear what you are asking here. The agent's final score will usually reflect how much it has learned.
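If the question is about watching the trained agent play on screen, that is roughly what observe.py is for; below is a stripped-down sketch of the same idea. The class name, constructor, checkpoint path, and environment ID are placeholders, and the repo's Atari preprocessing wrappers are omitted, so treat it as an assumption rather than the repo's exact code.

import gym
import torch
from dqn import Network                                 # the same Network discussed below; assumed importable

env = gym.make('BreakoutNoFrameskip-v4')                # placeholder env; the real script wraps it for preprocessing
net = Network(env)                                      # assumed constructor signature
net.load_state_dict(torch.load('trained_model.pack'))   # placeholder checkpoint path

obs = env.reset()
done = False
while not done:
    env.render()                                        # pops up a window showing the game screen
    obs_t = torch.as_tensor(obs, dtype=torch.float32).unsqueeze(0)
    action = net(obs_t).argmax(dim=1).item()            # greedy action from the predicted Q-values
    obs, reward, done, info = env.step(action)
env.close()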
Hello brthor, what version of Python did you use for this project? I used 3.9, but I had problems with ROMs. I read somewhere that atari_py only supports Python 3.7 and lower. Thanks in advance.
I believe I used Python 3.6 for this one.
I have a question. If the model is defined in the dqn.py file, can't you run it from the observe.py file by using dqn.py, as a kind of rotation?
I'm not sure what you mean by rotation here, but the Network model in dqn.py and observe.py is identical, so you could simply import the network into observe.py from dqn.py. You just need to protect the training code from executing on import by checking whether dqn.py is the main script before starting training.
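Concretely, the guard looks like the sketch below. The placeholder layer and the train() stub are not the repo's actual architecture or training loop, just stand-ins to show the pattern.

# dqn.py (sketch)
import torch.nn as nn

class Network(nn.Module):
    def __init__(self, num_actions=4):
        super().__init__()
        self.fc = nn.Linear(84 * 84 * 4, num_actions)   # placeholder layer, not the real conv net

    def forward(self, x):
        return self.fc(x)

def train():
    print("training...")                                 # stands in for the full DQN training loop

if __name__ == '__main__':
    # Runs only when you execute `python dqn.py` directly,
    # not when another file does `import dqn`.
    train()

# observe.py (sketch)
from dqn import Network                                  # reuses the exact same model; training does not start
net = Network()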
Hello brthor. A high LR is faster at the beginning but hits a ceiling in the late phases of training; a low LR is slower, but its ceiling is much higher. Solution: set a high LR and lower it during training, e.g. LR = n / attempt_no. Did you try that?
Anyway, great job, thanks!
This is a common approach for longer training runs, but in this case I was replicating the methods of the paper.
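For anyone who wants to try that kind of 1/t decay, here is a minimal PyTorch sketch. The base LR and the dummy model are placeholders; as noted above, the run in the video keeps the paper's fixed LR instead.

import torch
import torch.nn as nn

model = nn.Linear(4, 2)                                  # stand-in for the DQN network
optimizer = torch.optim.Adam(model.parameters(), lr=2.5e-4)
# lr_lambda scales the base LR, so this gives lr = 2.5e-4 / (1 + step)
scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda=lambda step: 1.0 / (1.0 + step))

for step in range(1000):
    loss = model(torch.randn(8, 4)).pow(2).mean()        # dummy loss, stands in for the TD error
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    scheduler.step()                                     # lowers the LR after each update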