Q-learning - Explained!

  • Published on 28 May 2024
  • Let's talk about one of the more important concepts in reinforcement learning: Q-learning.
    ABOUT ME
    ⭕ Subscribe: th-cam.com/users/CodeEmporiu...
    📚 Medium Blog: / dataemporium
    💻 Github: github.com/ajhalthor
    👔 LinkedIn: / ajay-halthor-477974bb
    RESOURCES
    [1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
    [2] Paradigms of ML: idapgroup.com/blog/types-of-m...
    [3] Model Free vs Model Based RL: spinningup.openai.com/en/late...
    [4] Bellman Equation video: • Bellman Equation - Ex...
    [5] Temporal Difference Learning video: • Foundation of Q-learni...
    PLAYLISTS FROM MY CHANNEL
    ⭕ Reinforcement Learning: • Reinforcement Learning...
    Natural Language Processing: • Natural Language Proce...
    ⭕ Transformers from Scratch: • Natural Language Proce...
    ⭕ ChatGPT Playlist: • ChatGPT
    ⭕ Convolutional Neural Networks: • Convolution Neural Net...
    ⭕ The Math You Should Know: • The Math You Should Know
    ⭕ Probability Theory for Machine Learning: • Probability Theory for...
    ⭕ Coding Machine Learning: • Code Machine Learning
    MATH COURSES (7 day free trial)
    📕 Mathematics for Machine Learning: imp.i384100.net/MathML
    📕 Calculus: imp.i384100.net/Calculus
    📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
    📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
    📕 Linear Algebra: imp.i384100.net/LinearAlgebra
    📕 Probability: imp.i384100.net/Probability
    OTHER RELATED COURSES (7 day free trial)
    📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
    📕 Python for Everybody: imp.i384100.net/python
    📕 MLOps Course: imp.i384100.net/MLOps
    📕 Natural Language Processing (NLP): imp.i384100.net/NLP
    📕 Machine Learning in Production: imp.i384100.net/MLProduction
    📕 Data Science Specialization: imp.i384100.net/DataScience
    📕 Tensorflow: imp.i384100.net/Tensorflow

Comments • 15

  • @henoknigatu7121
    @henoknigatu7121 months ago +3

    Your 12-minute video is worth more than all the playlists about Q-learning on YouTube 👏

  • @anya_forgerrr
    @anya_forgerrr 3 months ago +2

    I watched so many videos on RL, but this one's the best when it comes to explaining and breaking down the formulas 😭❤ thank you

  • @tonihullzer1611
    @tonihullzer1611 months ago

    Very well explained, thanks a lot!

  • @akshaypansari111111
    @akshaypansari111111 6 months ago +3

    Really enjoying the series. Keep it up

    • @CodeEmporium
      @CodeEmporium 6 months ago +1

      Thanks so much! Super glad you are enjoying this

  • @arandomwho
    @arandomwho 2 months ago

    Thanks for your efficient, good-quality videos! They not only save time but also give a complete understanding of the topic 😍

  • @user-qu4is5uk3p
    @user-qu4is5uk3p 17 days ago

    Thank you so much, that was so helpful.

  • @user-pb6yt8qh3w
    @user-pb6yt8qh3w 12 days ago

    Thank you so much!!!!!!!!!!!!

  • @sameertupe6094
    @sameertupe6094 months ago

    Very well explained, sir. It helped a lot.

  • @alexanderlevakin9001
    @alexanderlevakin9001 6 months ago +1

    What classical tasks are solved by off-policy algorithms? Do we use them to write bots that solve simple computer games?

  • @khabibownsmysoul7836
    @khabibownsmysoul7836 21 days ago

    I may be wrong, as I am not an expert, but isn’t the Bellman equation supposed to add the reward of S1, not S2?

  • @justsomegirlwithoutamustac5837
    @justsomegirlwithoutamustac5837 2 months ago

    This is so underrated

  • @djsocialanxiety1664
    @djsocialanxiety1664 2 months ago

    thanks man

  • @friedrichwilhelmhufnagel3577
    @friedrichwilhelmhufnagel3577 6 months ago

    Instead of saying grid, you could almost say DFA.

  • @MrHorse16
    @MrHorse16 6 months ago

    Q*