Build a Custom Gymnasium Reinforcement Learning Environment & Train w Q-Learning & Stable Baselines3

  • Published Dec 2, 2024

Comments • 23

  • @johnnycode
    @johnnycode  15 days ago

    Ready to get started with Stable Baselines3? th-cam.com/video/OqvXHi_QtT0/w-d-xo.html

  • @graycomet
    @graycomet 16 days ago

    Thanks for the great video. It makes it easier for me to build my own training environments.

  • @a_samad
    @a_samad 7 months ago +3

    Thank you! 🎉
    We need more videos (a course) like this on creating custom environments with OpenAI Gym.
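For readers asking about custom environments: a minimal sketch of the interface a custom Gymnasium environment implements. This is pure Python and the `GridWalk` class is hypothetical, made up for illustration; a real environment would subclass `gymnasium.Env`, declare `action_space` and `observation_space`, and be registered with `gymnasium.register`.

```python
# Sketch of the gymnasium.Env reset/step contract, without importing the
# library itself. A real environment subclasses gymnasium.Env; the
# hypothetical GridWalk below only mirrors the method signatures.
class GridWalk:
    SIZE = 4  # positions 0..4; reaching SIZE ends the episode

    def reset(self, seed=None):
        self.pos = 0
        return self.pos, {}  # Gymnasium returns (observation, info)

    def step(self, action):
        # action 1 moves right, anything else moves left (clamped at 0)
        self.pos = max(0, self.pos + (1 if action == 1 else -1))
        terminated = self.pos >= self.SIZE
        reward = 1.0 if terminated else 0.0
        # Gymnasium returns (obs, reward, terminated, truncated, info)
        return self.pos, reward, terminated, False, {}

env = GridWalk()
obs, info = env.reset()
while True:
    obs, reward, terminated, truncated, info = env.step(1)
    if terminated or truncated:
        break
print(obs, reward)  # → 4 1.0
```

The five-element `step` return (with separate `terminated`/`truncated` flags) is what distinguishes the Gymnasium API from the older `gym` four-element one.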

  • @Ayhamtechtips
    @Ayhamtechtips 8 months ago +2

    Very helpful and nice explanation.
    Thanks!

  • @hrsharma02
    @hrsharma02 7 months ago +2

    Thanks! 🎉 Please upload more videos on multi-agent RL for robotics, and on path planning for multiple robots with a custom environment in Gymnasium.

  • @navaneethbuilds
    @navaneethbuilds 4 months ago +1

    thanks for the amazing video

  • @ashutoshmishra5901
    @ashutoshmishra5901 7 months ago +4

    Your explanation is very good. I have a humble request: can you show RL training with the MuJoCo Ant, registering it as a custom environment? The gait parameter generation is quite tricky. If possible, please make a tutorial on it.

    • @towerboi-zg3it
      @towerboi-zg3it 7 months ago

      I also want this tutorial, so please, Johnny!

  • @buzzbuzz1691
    @buzzbuzz1691 8 months ago +1

    Thank you

  • @rickyolal
    @rickyolal 5 months ago +1

    Hey Johnny, I was wondering if you know how to make the algorithm learn some already-known states? My challenge is making a DQN learn from, and start with, already-known states stored in a CSV file, and I'm struggling because I have no idea how to do that. Is it possible?

    • @johnnycode
      @johnnycode  5 months ago

      I'm guessing if you know those states, then you would know what action to take or not take in relation to those states. For example, a pawn on a chess board can't go backwards, since you know that state is impossible. If my interpretation of your question is correct, then you might want to look into "action masking", which prevents the agent from taking illegal actions. You can start with this SB3 reference, but the concept is not limited to PPO: sb3-contrib.readthedocs.io/en/master/modules/ppo_mask.html
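The action-masking idea in the reply above can be sketched with plain NumPy. The `masked_argmax` helper is hypothetical, not part of SB3; sb3-contrib's `MaskablePPO` applies the same principle inside the policy, where a mask of legal actions zeroes out the probability of illegal ones.

```python
import numpy as np

def masked_argmax(q_values, action_mask):
    """Pick the highest-value action among those the mask marks legal.

    q_values:    estimated value per action
    action_mask: boolean array, True where the action is allowed
    """
    masked = np.where(action_mask, q_values, -np.inf)  # illegal actions can never win
    return int(np.argmax(masked))

# Action 0 has the highest value but is illegal, so action 2 is chosen.
q = np.array([5.0, 1.0, 3.0])
legal = np.array([False, True, True])
print(masked_argmax(q, legal))  # → 2
```

With masking, the agent never wastes experience on moves the environment would reject (like the backwards pawn in the example above).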

  • @BoxingBytes-y9i
    @BoxingBytes-y9i 6 months ago

    Thanks for your video. Which resources would you advise for learning practical applications of reinforcement learning? I've been trying to implement a bot for a specific game and have to create my own environment and DQN. I'm familiar with neural nets, but good information on all the rest is so hard to find.

    • @johnnycode
      @johnnycode  6 months ago

      Sorry, I'm not an expert. I suggest inquiring at the r/reinforcementlearning subreddit. There are some very knowledgeable people there.

    • @BoxingBytes-y9i
      @BoxingBytes-y9i 6 months ago

      @@johnnycode Thank you for the answer, will do!

  • @tieqi5623
    @tieqi5623 6 months ago

    Thanks, good video. Does Gymnasium support Neo Geo (SNK) ROMs? How can I make it support them?

    • @johnnycode
      @johnnycode  6 months ago

      It doesn’t support Neo Geo ROMs. I think it would be extremely hard to bridge that support.

  • @arnavmodanwal6295
    @arnavmodanwal6295 5 months ago

    Hi, your videos are great and have helped me a lot, since you use the latest version of Stable Baselines3. But I'm facing an issue: the verbose values are not getting printed in the output. I set verbose = 1 and even tried verbose = 2, but I'm not getting the expected output (rewards, loss, iterations, ep_len_mean, etc.) that was printed in your videos. Can you please help me? Is this due to the custom environment I'm using, or something else?
    Also, the TensorBoard logs are not working...

    • @johnnycode
      @johnnycode  5 months ago

      You should try creating a new conda environment and then installing SB3 again. In my SB3 introduction video, I just ran pip install stable-baselines3[extra] and didn't do anything else special: th-cam.com/video/OqvXHi_QtT0/w-d-xo.html

    • @arnavmodanwal6295
      @arnavmodanwal6295 5 months ago

      @@johnnycode Hi, I will try this again... Thanks a lot for the reply and your time! Might need your help again...

    • @arnavmodanwal6295
      @arnavmodanwal6295 5 months ago

      Hi @johnnycode, I tried reinstalling stable-baselines3[extra], but I'm not getting the monitor data, and the TensorBoard logs are also not getting displayed... Is there some issue with the new version of stable-baselines3[extra]? Can you please give me the versions you installed when making the video?

    • @johnnycode
      @johnnycode  5 months ago

      stable-baselines3 2.0.0
      tensorboard 2.13.0
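For reference, a pinned install matching the versions listed above might look like this (a sketch, assuming the [extra] option is wanted, since it pulls in TensorBoard support):

```shell
pip install "stable-baselines3[extra]==2.0.0" "tensorboard==2.13.0"
```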

  • @sergiogirona2988
    @sergiogirona2988 7 months ago

    Could I use it for my own game made with the Godot Engine?? Thanks!!

    • @johnnycode
      @johnnycode  7 months ago

      Yes, of course!