Foundation of Q-learning | Temporal Difference Learning explained!

CodeEmporium

Views: 22,001

Published on Feb 8, 2025
Let's talk about Temporal Difference Learning, the foundational concept behind Q-learning and SARSA.
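As a rough illustration of how Temporal Difference Learning underlies both algorithms, here is a minimal sketch of the one-step TD update. It is not taken from the video: it assumes a tabular Q stored in a dict keyed by (state, action), and the function names, the step size alpha and the discount gamma are hypothetical choices made for this example.

```python
from collections import defaultdict


def td_error(reward, gamma, next_value, current_value):
    """One-step TD error: (reward + discounted next estimate) - current estimate."""
    return reward + gamma * next_value - current_value


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """SARSA (on-policy): bootstrap from the action actually taken next."""
    delta = td_error(r, gamma, Q[(s_next, a_next)], Q[(s, a)])
    Q[(s, a)] += alpha * delta
    return delta


def q_learning_update(Q, s, a, r, s_next, actions, alpha=0.1, gamma=0.9):
    """Q-learning (off-policy): bootstrap from the best-valued next action."""
    best_next = max(Q[(s_next, b)] for b in actions)
    delta = td_error(r, gamma, best_next, Q[(s, a)])
    Q[(s, a)] += alpha * delta
    return delta


# Assumption: unseen (state, action) pairs start at 0.0, so a terminal
# state that is never updated contributes a next value of 0.
Q = defaultdict(float)
```

The two update functions differ only in which next-state value is used to bootstrap; the TD error itself has the same form in both.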
ABOUT ME
⭕ Subscribe: www.youtube.co...
📚 Medium Blog: / dataemporium
💻 Github: github.com/ajh...
👔 LinkedIn: / ajay-halthor-477974bb
RESOURCES
[1] Reinforcement Learning book: incompleteideas...
[2] Paradigms of ML: idapgroup.com/...
[3] Model Free vs Model Based RL: spinningup.ope...
[4] Bellman Equation video: • Bellman Equation - Ex...
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know: • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.ne...
📕 Calculus: imp.i384100.ne...
📕 Statistics for Data Science: imp.i384100.ne...
📕 Bayesian Statistics: imp.i384100.ne...
📕 Linear Algebra: imp.i384100.ne...
📕 Probability: imp.i384100.ne...
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.ne...
📕 Python for Everybody: imp.i384100.ne...
📕 MLOps Course: imp.i384100.ne...
📕 Natural Language Processing (NLP): imp.i384100.ne...
📕 Machine Learning in Production: imp.i384100.ne...
📕 Data Science Specialization: imp.i384100.ne...
📕 Tensorflow: imp.i384100.ne...

Comments • 28

@PrymeOrigin a year ago ⁺²³
You have a gift for teaching, and I'm very thankful to find someone who breaks down concepts so simply and makes them so easy to digest
@CodeEmporium a year ago ⁺²
Thanks so much for the kind words. I really appreciate this
@magroubezpieczeniasp.zo.o.2137 a year ago ⁺¹
Totally agree!
@noahgsolomon 9 months ago ⁺¹⁰
The breakdown of the 1 sentence explanation is so useful
@LuthandoMaqondo a year ago ⁺⁹
Nice, quick and straight to the point.
@al_parlam a year ago ⁺³
Man, your explanation is gorgeous! You are remarkable at explaining complex things. Keep doing what you are doing :) I wish you much luck with your channel
@syedmaazbinshameem1884 22 days ago
You are a legend dude. Was stuck in an assignment and this video helped me!
@LaveshNK 11 months ago
Fantastic video... I have an RL assignment due and I had no idea what TD error even meant. You are great at explaining
@benjaminimsi9558 6 months ago ⁺³
I wasn't expecting such a good explanation.
@pareak 3 months ago
I'll need to check out more of your videos... That is so well explained!!
@DevanshSagar-cy8kp 7 months ago ⁺¹
Great work ❤
@gregkondas6457 5 months ago
Thank you so much! This is an awesome resource!
@manojkumar-pp4ky 6 months ago ⁺¹
Excellent
@slitihela1860 11 months ago ⁺¹
Can you prepare a video on Double Q-Learning Network and Dueling Double Q-Learning Network, please?
@yep3659 11 months ago ⁺¹
I'm craving some tempura now
@krishnavinukonda1882 10 months ago
This is the best. Thanks!
@li-pingho1441 a year ago
Awesome explanation!
@akshaypansari111111 a year ago
Thanks a lot. This is really helpful. I will check out the Bellman equation video as well
@minapagliaro7607 10 months ago
Great video!!!!
@krzysztofjarek6476 a year ago
Great lecture 😉
@बिहारीभायजी 6 months ago ⁺¹
This video explains not just the Q-value, but also the value function, action-value function, episodes, etc.
@davidlieber3494 a year ago
Great video, thanks!
@CodeEmporium a year ago
You are very welcome. Thanks for commenting
@Trubripes 5 months ago
Why use an episodic problem as an example for 1-step TD? The advantage of TD is for non-episodic problems.
TD uses the previous value to bootstrap the current estimate; in this case, shouldn't the table be initialized to R for each S instead of zeroes?
@redrose5406 a year ago
Post more about GANs
@agnelomascarenhas8990 a month ago
While watching the video, I wondered how ants find paths, remember them, and change them when disturbed.
@satyamdubey4110 11 months ago
💖💖
@razainul 4 months ago
Why haven't you given proper credit to the original video creator? The voice is not yours, we know it. You are simply lip-syncing the audio. You could've used the video with your own voice! I feel this is not your voice! 99.99%!
