Foundation of Q-learning | Temporal Difference Learning explained!

  • Published on 8 Feb 2025
  • Let's talk about the foundational concept behind Q-learning and SARSA: Temporal Difference Learning.
    ABOUT ME
    ⭕ Subscribe: www.youtube.co...
    📚 Medium Blog: / dataemporium
    💻 Github: github.com/ajh...
    👔 LinkedIn: / ajay-halthor-477974bb
    RESOURCES
    [1] Reinforcement Learning book: incompleteideas...
    [2] Paradigms of ML: idapgroup.com/...
    [3] Model Free vs Model Based RL: spinningup.ope...
    [4] Bellman Equation video: • Bellman Equation - Ex...
    PLAYLISTS FROM MY CHANNEL
    ⭕ Reinforcement Learning: • Reinforcement Learning...
    ⭕ Natural Language Processing: • Natural Language Proce...
    ⭕ Transformers from Scratch: • Natural Language Proce...
    ⭕ ChatGPT Playlist: • ChatGPT
    ⭕ Convolutional Neural Networks: • Convolution Neural Net...
    ⭕ The Math You Should Know: • The Math You Should Know
    ⭕ Probability Theory for Machine Learning: • Probability Theory for...
    ⭕ Coding Machine Learning: • Code Machine Learning
    MATH COURSES (7 day free trial)
    📕 Mathematics for Machine Learning: imp.i384100.ne...
    📕 Calculus: imp.i384100.ne...
    📕 Statistics for Data Science: imp.i384100.ne...
    📕 Bayesian Statistics: imp.i384100.ne...
    📕 Linear Algebra: imp.i384100.ne...
    📕 Probability: imp.i384100.ne...
    OTHER RELATED COURSES (7 day free trial)
    📕 ⭐ Deep Learning Specialization: imp.i384100.ne...
    📕 Python for Everybody: imp.i384100.ne...
    📕 MLOps Course: imp.i384100.ne...
    📕 Natural Language Processing (NLP): imp.i384100.ne...
    📕 Machine Learning in Production: imp.i384100.ne...
    📕 Data Science Specialization: imp.i384100.ne...
    📕 Tensorflow: imp.i384100.ne...

Comments • 28

  • @PrymeOrigin 1 year ago +23

    You have a gift for teaching, and I'm very thankful to have found someone who breaks down concepts so simply and makes them easy to digest.

  • @noahgsolomon 9 months ago +10

    The breakdown of the 1 sentence explanation is so useful

  • @LuthandoMaqondo 1 year ago +9

    Nice, quick and straight to the point.

  • @al_parlam 1 year ago +3

    Man, your explanation is gorgeous! You are remarkable at explaining complex things. Keep doing what you are doing :) I wish you much luck with your channel.

  • @syedmaazbinshameem1884 22 days ago

    You are a legend dude. Was stuck in an assignment and this video helped me!

  • @LaveshNK 11 months ago

    Fantastic video... I have an RL assignment due and I had no idea what TD error even meant. You are great at explaining.

  • @benjaminimsi9558 6 months ago +3

    I wasn't expecting such a good explanation.

  • @pareak 3 months ago

    I'll need to check out more of your videos... That is so well explained!!

  • @DevanshSagar-cy8kp 7 months ago +1

    Great work ❤

  • @gregkondas6457 5 months ago

    Thank you so much! This is an awesome resource!

  • @manojkumar-pp4ky 6 months ago +1

    Excellent

  • @slitihela1860 11 months ago +1

    Can you prepare a video on Double Q-Learning Network and Dueling Double Q-Learning Network, please?

  • @yep3659 11 months ago +1

    I'm craving some tempura now

  • @krishnavinukonda1882 10 months ago

    This is the best. Thanks!

  • @li-pingho1441 1 year ago

    awesome explanation!

  • @akshaypansari111111 1 year ago

    Thanks a lot. This is really helpful. I will check out the Bellman equation video as well.

  • @minapagliaro7607 10 months ago

    Great video!!!!

  • @krzysztofjarek6476 1 year ago

    Great lecture 😉

  • @बिहारीभायजी 6 months ago +1

    This video doesn't just explain the Q-value, but also the value function, action-value function, episodes, etc.

  • @davidlieber3494 1 year ago

    great video, thanks!

    • @CodeEmporium 1 year ago

      You are very welcome. Thanks for commenting

  • @Trubripes 5 months ago

    Why use an episodic problem as an example for 1-step TD? The advantage of TD is in non-episodic problems.
    TD uses the previous value to bootstrap the current estimate; in this case, shouldn't the table be initialized to R for each S,
    instead of zeroes?
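
For reference, the 1-step TD update this comment refers to can be sketched roughly as follows. This is a minimal illustration, not the video's code: the three-state chain, the rewards, `alpha`, `gamma`, and the helper name `td0_update` are all assumptions made for the example. It starts from a zero-initialized table and shows reward information propagating backward over repeated sweeps.

```python
# Minimal tabular TD(0) sketch. State names, rewards, alpha and gamma
# are illustrative assumptions, not values taken from the video.

def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.9, terminal=False):
    """Apply one 1-step TD update to V[s] in place and return the TD error."""
    # Bootstrapped target: observed reward plus the discounted current
    # estimate of the next state (no bootstrap term at a terminal step).
    target = r if terminal else r + gamma * V[s_next]
    td_error = target - V[s]
    V[s] += alpha * td_error
    return td_error

# Value table initialized to zeros for a hypothetical chain A -> B -> C,
# where C is terminal.
V = {"A": 0.0, "B": 0.0, "C": 0.0}

# One episode as (state, reward, next_state, done) transitions:
# A -> B gives reward 0, B -> C gives reward 1 and ends the episode.
episode = [("A", 0.0, "B", False), ("B", 1.0, "C", True)]

# Replay the same episode twice so the value of B can flow back into A.
for _ in range(2):
    for s, r, s_next, done in episode:
        td0_update(V, s, r, s_next, terminal=done)

print(V)
```

In this sketch, after two sweeps `V["B"]` has moved toward the terminal reward, and `V["A"]` has picked up a small positive value purely by bootstrapping from `V["B"]`, even though the table started at zero.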

  • @redrose5406 1 year ago

    Post more about GANs

  • @agnelomascarenhas8990 1 month ago

    While watching the video, I started wondering how ants find paths, remember them, and change them when disturbed.

  • @satyamdubey4110 11 months ago

    💖💖

  • @razainul 4 months ago

    Why haven't you given proper credit to the original video creator? We know the voice is not yours. You are simply lip-syncing the audio. You could have used the video with your own voice! I feel 99.99% sure this is not your voice!