Q-learning - Explained!
ฝัง
- เผยแพร่เมื่อ 28 พ.ค. 2024
- Let's talk about one of the more important concepts in reinforcement learning: q-learning
ABOUT ME
⭕ Subscribe: th-cam.com/users/CodeEmporiu...
📚 Medium Blog: / dataemporium
💻 Github: github.com/ajhalthor
👔 LinkedIn: / ajay-halthor-477974bb
RESOURCES
[1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
[2] Paradigms of ML: idapgroup.com/blog/types-of-m...
[3] Model Free vs Model Based RL: spinningup.openai.com/en/late...
[4] Bellman Equation video: • Bellman Equation - Ex...
[5] Temporal Difference Learning video: • Foundation of Q-learni...
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know : • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.net/MathML
📕 Calculus: imp.i384100.net/Calculus
📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
📕 Linear Algebra: imp.i384100.net/LinearAlgebra
📕 Probability: imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
📕 Python for Everybody: imp.i384100.net/python
📕 MLOps Course: imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): imp.i384100.net/NLP
📕 Machine Learning in Production: imp.i384100.net/MLProduction
📕 Data Science Specialization: imp.i384100.net/DataScience
📕 Tensorflow: imp.i384100.net/Tensorflow
Your 12 min video worth than all the playlist about q-learning on youtube👏
i watched so many vids in RL, but this ones the best when it comes to explaining and breaking down the formulas 😭❤thankuskajhjhc
very good explained, thanks a lot!
Really enjoying the series. Keep it up
Thanks so much! Super glad you are enjoying this
Thanks, for your pretty efficient good quality videos! not only save time but also gives a complete understanding of topic😍
thank you so much that was so helpful
Thank you so much!!!!!!!!!!!!
Very Well explained by you sir,It helped alot
What classical tasks are solved by off-policy algorithms? Do we use it to write bots that solves simple computer games?
May be wrong I am not an expert but isn’t the Bellman equation supposed to add the reward of the S1 not S2?
This is so underrated
thanks man
Instead of saying grid you could say almost say DFA
Q*