Q-Learning with a Neural Network in TensorFlow

  • Published on Sep 21, 2024

Comments • 13

  • @MultsElMesco · 3 years ago

    I'm just starting with machine learning and this channel has helped me a lot with understanding the basics. Thanks a lot!

  • @rahulsutradhar5759 · 2 years ago

    Man, I love your tutorials so far! You are a life-saver. I had to pause the video every 30 seconds to make out what's happening (the explanation is very fast for me), but this is the video I've actually learned the most about Q-learning from.
    I thank you from the bottom of my heart.

  • @pranjalthakur8115 · 5 years ago +6

    Good tutorial, but the pace is on the higher side.

  • @nubscripters3756 · 4 years ago +4

    Hey, why does your AI always train a lot better than mine? In the first 100 episodes we get similar results, but in the next 100 I get either 3/4 (or 0) and you get something like 50!

  • @gregchance5090 · 4 years ago +1

    Hi, great tutorial!
    I get: Cannot feed value of shape () for Tensor 'Placeholder_2:0', which has shape '(1,)'
    from self.sess.run(self.optimizer, feed_dict=feed).
    tf.reduce_sum(tf.square(agent.target_in - agent.q_action)) returns a tensor with shape=() — is this the issue?
    If I change self.target_in = tf.compat.v1.placeholder(tf.float32, shape=[1]) to shape=[], it runs.
    However, rewards over 200 episodes never go over 1 or 2?
    Thanks!
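[Editor's note: that error usually means a bare Python scalar was fed to a placeholder declared with shape=[1]. A minimal NumPy sketch of the mismatch, using hypothetical values (the names target_in and q_action are the commenter's; the numbers are made up):]

```python
import numpy as np

# A bare Python float has shape (), which cannot feed a shape-(1,) placeholder.
target = 0.5
assert np.asarray(target).shape == ()

# Wrapping it in a list gives the length-1 vector the placeholder expects,
# so shape=[1] can be kept instead of being changed to shape=[].
wrapped = [target]
assert np.asarray(wrapped).shape == (1,)

# Either way, the squared-error loss still reduces to a scalar:
q_action = np.asarray([0.3])
loss = np.sum(np.square(np.asarray(wrapped) - q_action))
assert loss.shape == ()
```

So the loss being shape=() is expected; the fix that keeps shape=[1] is to feed `[target]` rather than `target` in the feed_dict.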

  • @hitinjami1143 · 4 years ago +1

    Great video, bro.
    I just want to know how to work with an image as my observation space.
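[Editor's note: a common approach for image observations, as in the original DQN work, is to grayscale, downsample, and stack recent frames before feeding a convolutional Q-network. A hedged NumPy sketch with hypothetical frame dimensions (not from the video's code):]

```python
import numpy as np

def preprocess(frame):
    """Grayscale and 2x-downsample an RGB frame (hypothetical 210x160x3 input)."""
    gray = frame.mean(axis=2)            # collapse the RGB channels
    small = gray[::2, ::2]               # naive stride-2 downsample
    return small.astype(np.float32) / 255.0

frame = np.random.randint(0, 256, size=(210, 160, 3))
obs = preprocess(frame)
assert obs.shape == (105, 80)

# Stacking the last 4 preprocessed frames gives the network motion cues;
# the stack replaces the flat state vector as the Q-network's input.
stack = np.stack([obs] * 4, axis=-1)
assert stack.shape == (105, 80, 4)
```

The Q-network's first layers then become convolutions over this stack instead of a dense layer over a small state vector.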

  • @yashmandilwar8904 · 5 years ago +1

    Hi Shawn. Thanks for this tutorial. I have a question about lines 30 and 38, after you've coded the DQN. Shouldn't you be calling self.q_action instead of self.q_state and then taking the max of the values?
    Thanks!

  • @sergeypigida4834 · 4 years ago +1

    Hi Shawn, thank you for such a detailed tutorial. Could you please give some tips on how to rewrite your code for TensorFlow 2.0?
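[Editor's note: in broad strokes, a TF 2.x port replaces placeholders and sessions with eager execution, a tf.keras model, and tf.GradientTape for the update. The piece that stays identical across versions is the Q-learning target itself; a NumPy sketch with hypothetical numbers (not the video's code):]

```python
import numpy as np

# Hypothetical values standing in for one transition (s, a, r, s').
gamma = 0.97
reward = 1.0
q_next = np.array([0.2, 0.8, 0.5])   # Q(s', .) as predicted by the network
done = False

# Bellman target: r + gamma * max_a' Q(s', a'), zeroed at terminal states.
target = reward + gamma * np.max(q_next) * (not done)
assert abs(target - 1.776) < 1e-9

# In TF2, the network would then be nudged toward this target with
# tf.GradientTape + optimizer.apply_gradients instead of sess.run
# on a placeholder-fed optimizer op.
```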

  • @AmitYadav-zk8zm · 3 years ago

    Great video. However, I don't think your statement about the NN updating multiple weights per iteration is true. In each iteration only a single state and action are active, so only the weights attached to them have non-zero gradients and get updated: that would be only one weight.
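[Editor's note: the commenter's point holds exactly in the degenerate case of a one-hot state input with no hidden layer, where the "network" reduces to a table. A toy NumPy sketch of that case (all values hypothetical); with a hidden layer, every weight on the path from active hidden units to the chosen action's output gets a non-zero gradient, so multiple weights can move per step:]

```python
import numpy as np

# Tabular case dressed up as a network: Q(s, a) = W[s, a] with a one-hot
# state input and no hidden layer.
n_states, n_actions = 4, 2
W = np.zeros((n_states, n_actions))
s, a = 2, 1
target, lr = 1.0, 0.1

# Loss = (target - Q(s, a))^2; its gradient w.r.t. W is non-zero only
# at the single entry (s, a), so only one weight moves.
grad = np.zeros_like(W)
grad[s, a] = -2.0 * (target - W[s, a])
W -= lr * grad

assert np.count_nonzero(W) == 1   # exactly one weight was updated
```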

  • @alexsmith3974 · 4 years ago

    What version of TensorFlow are you using here, and how would you implement this in 2.0? I'm using 2.0, and when I copy and paste the code you provide (which should work), I get an error: "module 'tensorflow' has no attribute 'placeholder'". Any ideas?

    • @azizrais1526 · 4 years ago +2

      import tensorflow.compat.v1 as tf
      tf.disable_v2_behavior()

  • @ryanbeasley1079 · 4 years ago

    Anyone else having trouble with the reward never going over 1?

    • @nothingtodo3097 · 3 years ago

      yep, mine is not updating randomly anymore
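[Editor's note: one common culprit when rewards plateau near 1, though not necessarily the issue in the video's code, is an epsilon-greedy schedule that never decays, so the agent keeps acting almost entirely at random. A hedged sketch with a typical (hypothetical) decay schedule:]

```python
import numpy as np

rng = np.random.default_rng(0)
eps, eps_min, eps_decay = 1.0, 0.01, 0.99   # hypothetical schedule

def choose_action(q_values, eps):
    """Epsilon-greedy: explore with probability eps, else exploit."""
    if rng.random() < eps:
        return int(rng.integers(len(q_values)))   # explore
    return int(np.argmax(q_values))               # exploit

# Decaying epsilon once per episode lets exploitation take over:
for episode in range(500):
    eps = max(eps_min, eps * eps_decay)

assert eps == eps_min   # after 500 episodes, epsilon has bottomed out
```

If epsilon stays at 1.0 (or the loss target has the shape bug some commenters hit), the policy never improves past random-play rewards.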