An introduction to Reinforcement Learning

แชร์
ฝัง
  • เผยแพร่เมื่อ 30 พ.ค. 2024
  • This episode gives a general introduction into the field of Reinforcement Learning:
    - High level description of the field
    - Policy gradients
    - Biggest challenges (sparse rewards, reward shaping, ...)
    This video forms the basis for a series on RL where I will dive much deeper into technical details of state-of-the-art methods for RL.
    Links:
    - "Pong from Pixels - Karpathy": karpathy.github.io/2016/05/31/rl/
    - Concept networks for grasp & stack (Paper with heavy reward shaping): arxiv.org/abs/1709.06977
    If you enjoy my videos, all support is super welcome!
    / arxivinsights
    If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: pensight.com/x/xander-steenbr...
    ::Chapters::
    00:00 Intro
    01:03 So what is Reinforcement Learning?
    03:39 Learning without explicit examples
    07:25 Main challenges when doing RL
    15:04 Are the robots taking over now?
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 404

  • @rednassie1101
    @rednassie1101 4 ปีที่แล้ว +208

    People: ANN ARE TAKING OVER THE WORLD AND STUFF WILL NEVER BE THE SAME
    my horribly trained network on a cat: "dog"

    • @I_Lemaire
      @I_Lemaire 4 ปีที่แล้ว +1

      Could they help with the necessary government takeovers associated with COVID-19? Temporary command economies could be more efficient.

    • @revimfadli4666
      @revimfadli4666 4 ปีที่แล้ว +4

      TH-cam's bots: "Robot fighting is animal cruelty"

  • @SukhwinderSingh-fb9qw
    @SukhwinderSingh-fb9qw 5 ปีที่แล้ว +64

    This was one of the best videos on RL that I have seen. Extremely informative. The way you explain things is awesome. Keep up the great work! Cheers man!

  • @yuanyuansun3521
    @yuanyuansun3521 3 ปีที่แล้ว +24

    “If u only give it a positive reward when it successfully stacked a block, it’ll never get to see any of those reward” Only if my tutors realise this.

  • @denebvegaaltair1146
    @denebvegaaltair1146 2 ปีที่แล้ว +14

    Your videos have just the right amount of technical terms such that student engineers can learn something, and also the right amount of summary and rewording such that beginners can get a vague idea of concepts. Thank you so much

  • @DanielHernandez-rn6rp
    @DanielHernandez-rn6rp 6 ปีที่แล้ว +291

    Love this guy. As an RL PhD student, your videos are golden.

    • @nikhillondhe5815
      @nikhillondhe5815 5 ปีที่แล้ว +13

      RL PhD sounds so interesting!

    • @andres18m
      @andres18m 5 ปีที่แล้ว +2

      Institute name?

    • @Ayanwesha
      @Ayanwesha 5 ปีที่แล้ว +1

      hello..sir
      i am a grad stud
      can anyone tell me plzz if back propagation is necessary in supervised and unsupervised learning?or it is only used in reinforcement learning
      thanks

    • @hcgaron
      @hcgaron 5 ปีที่แล้ว

      Ayanwesha 12345 yes, back propagation is used as a basis for gradient based methods of optimization

    • @ernie2111
      @ernie2111 5 ปีที่แล้ว +3

      "RL PhD" didn't know such things exist lol

  • @floriandebrauwer9140
    @floriandebrauwer9140 4 ปีที่แล้ว +2

    Thanks for your work ! I like the way you present such a complex field in a clear manner for poeple without any background. Thanks to you I know where to start in my learning journey !

  • @gusbakker
    @gusbakker 5 ปีที่แล้ว +2

    Great balance between a very well explained content and the interesting facts about current progress in AI at the end. Good work

  • @cemgocer8185
    @cemgocer8185 3 ปีที่แล้ว +3

    Quality of the video is off the charts. Topics u have chosen to explain the field, the way u explain them and especially pointing the common misconceptions that make it harder for us to get into what AI really is... I'm sad that there is no superlike button. Rare to see videos of this quality and honesty

  • @snippletrap
    @snippletrap 5 ปีที่แล้ว +16

    The perils of reward shaping are well understood in a public policy context, where incentives can lead to "unintended consequences".

  • @MuditBachhawatIn
    @MuditBachhawatIn 4 ปีที่แล้ว

    I have been meaning to read about RL for a long time. This video couldn't be more simple and clear introduction to it. Thanks man!

  • @orfeasliossatos
    @orfeasliossatos 5 ปีที่แล้ว +2

    I've been literally looking all over for a video like this, thank you so much

  • @shashankshivakumar4732
    @shashankshivakumar4732 5 ปีที่แล้ว +4

    I love this video. I love his criticial and grounded thinking. Great work !

  • @TY-un4no
    @TY-un4no 3 ปีที่แล้ว +1

    Complex stuff made simple and easy, this is a very good intro video to RL. Starting to learn RL for work and your video gave me a great starting point, thank you!

  • @josefpolasek6666
    @josefpolasek6666 4 ปีที่แล้ว +1

    Your videos are absolutely amazing! Thank you very much for explaining concept of RL in 16 minutes.

  • @angelakong653
    @angelakong653 4 ปีที่แล้ว +1

    This was really helpful. Thank you to people like you for creating this content. Appreciate you, Xander!

  • @nemx4u
    @nemx4u 6 ปีที่แล้ว +2

    You explain hard topics beautifully! great job. Would love to see more RL videos!

  • @davidfield5295
    @davidfield5295 5 ปีที่แล้ว

    The misuse of 'literally' notwithstanding, this was an excellent video. Very clear and concise explanation.

  • @Hyuts
    @Hyuts 4 ปีที่แล้ว +16

    Explains in an elegant manner more than I have learned in half a semester of my AI college course.

  • @PriyanshuGupta-hf2hm
    @PriyanshuGupta-hf2hm 3 ปีที่แล้ว

    You explained so well that I understood each and everything in your video. I am overjoyed!

  • @Lilowillow42
    @Lilowillow42 2 ปีที่แล้ว +1

    Just wanted you to know that in my university course for introduction to AI our professor recommended your videos for machine learning. Your explanation is highly enjoyable and informative. Thank you!

  • @7810
    @7810 6 ปีที่แล้ว

    Good stuff to learn the RL in terms of basic knowledge as well as the challenge it will face. Thanks for your time and sharing!

  • @Alex-gc2vo
    @Alex-gc2vo 5 ปีที่แล้ว

    your videos are some of the best explanations I've found for a lot of these very advanced subjects. I suspect your viewer count is going to jump very quickly. keep it up.

  • @jackwhite9332
    @jackwhite9332 6 ปีที่แล้ว +7

    Impressive explanation, found this very useful. Thank you!

  • @nateshrager512
    @nateshrager512 6 ปีที่แล้ว

    Great job introducing the topic. Very nice job dispelling misconceptions surrounding the topic as well. I put on that notification for your next videos, looking forward to em : )

  • @Krimson5pride
    @Krimson5pride 4 ปีที่แล้ว

    It was both professional and entertaining at the same time. Great and precise explanation.

  • @HARtalks
    @HARtalks 3 ปีที่แล้ว

    It was really interesting and helped me to get a clear picture of what reinforcement learning is... Thank you!!

  • @atcer51
    @atcer51 7 หลายเดือนก่อน +1

    fiiiinnnaaaallly after tons of googling, I finally fund a USEFUL video that accually EXPLAINS how to reward the agent, and not just saying:
    'oh u just reward it'

  • @majeedhussain3276
    @majeedhussain3276 6 ปีที่แล้ว

    You deserve million subscribers hopefully one day you will. So much clarity in every video. Keep going...

  • @ArnauViaMartinezSeara
    @ArnauViaMartinezSeara 6 ปีที่แล้ว

    Really useful. I am preparing a Reinforcement Learning class aplied to finance and it is really helpful. Can't wait to see next episode. Thanks

  • @dean8147
    @dean8147 2 ปีที่แล้ว

    You’re a legend mate. Honestly, thanks for all of your hard work

  • @codyheiner3636
    @codyheiner3636 5 ปีที่แล้ว +1

    Love the philosophical discussion at the end!

  • @lincolnaisagbonhi8953
    @lincolnaisagbonhi8953 4 ปีที่แล้ว

    This is a great presentation on RL, short and clear content.

  • @aanex2005
    @aanex2005 4 ปีที่แล้ว

    I have no idea about RL but your video has given me a good jump start. Thanks man

  • @amitredkar140
    @amitredkar140 5 ปีที่แล้ว +1

    Great video!!!! Explained exceptionally, liked other videos as well from your channel. Would love to see more stuff related to AI/DL or RL. Thanks in advance. Keep up the good work....

  • @allamasadi7970
    @allamasadi7970 6 ปีที่แล้ว +151

    Your channel deserves more views 👍

    • @akramsystems
      @akramsystems 5 ปีที่แล้ว +1

      agree %100

    • @lohithArcot
      @lohithArcot 3 ปีที่แล้ว

      Not many reach these topics.

  • @matfuckk4736
    @matfuckk4736 5 ปีที่แล้ว

    Great quality and well-appreciated content. Please, continue, became your patron.

  • @ingeniouswild
    @ingeniouswild 5 ปีที่แล้ว +1

    Very nice episode! One thing that struck me about your suggestion that without Reward Shaping, the auto-learning of the 2600 games would be intractable: even for a human, this would be extremely difficult - we succeed with new, undocumented games because they often have similar sub-components and sub-goals that we already know from other games (or life). But I'm sure you could easily construct a game which would be impossible for a human to learn without any hints, while still having the same overall complexity.

  • @saaniausaf9621
    @saaniausaf9621 5 ปีที่แล้ว

    I loved the way you explained everything. Thanks!

  • @TheBeansChopper
    @TheBeansChopper 3 ปีที่แล้ว

    I think the comment section speaks for itself. This is a fantastic grasp of the basic concepts and issues with this technologies in such short time, without diving unnecessarily into formalism. Thanks :)

  • @mohammadhatoum
    @mohammadhatoum 5 ปีที่แล้ว

    Great job.. Explained the subject in a simple way. Keep it up and looking forward for new videos

  • @mehdisauvage1234
    @mehdisauvage1234 6 ปีที่แล้ว

    Your videos are so useful and interesting ! This is pure gold to me :)

  • @OliverZeigermann
    @OliverZeigermann 5 ปีที่แล้ว

    Very lively and understandable. Great work!

  • @sharadrawatindia
    @sharadrawatindia 6 ปีที่แล้ว

    Hey Xander! Great videos. Looking forwards for your next video.

  • @soumyakantadash5986
    @soumyakantadash5986 4 ปีที่แล้ว

    These videos are gem!!!..... incredible, precise and knowledgeable!!!!

  • @josephedappully1482
    @josephedappully1482 6 ปีที่แล้ว

    This is a great video; thanks for making it! Looking forward to your next one.

  • @Z4NT0
    @Z4NT0 3 ปีที่แล้ว

    I learned so much in just 16 minutes. Awesome Video!

  • @alirezaparsay8518
    @alirezaparsay8518 11 หลายเดือนก่อน

    The explanation was so clear. Thank you.

  • @jonathaskerber5472
    @jonathaskerber5472 5 ปีที่แล้ว

    Such a great introduction. Keep up the good work!

  • @tnmygrwl
    @tnmygrwl 6 ปีที่แล้ว

    You do an awesome of structuring the content. Loved the video.

  • @rishidixit7939
    @rishidixit7939 8 หลายเดือนก่อน +2

    The sudden surprise of hearing Bruno Mars makes you pause video for other open tabs

  • @mantische
    @mantische 4 ปีที่แล้ว

    One of the best explanations I've seen

  • @RoxanaNoe
    @RoxanaNoe 5 ปีที่แล้ว

    Your channel is a great resource for getting into Deep Learning and AI.

  • @poojanpatel2437
    @poojanpatel2437 6 ปีที่แล้ว +4

    Best Channel on yt for ml/dl/rl/ai... Keep up the good work... Would love to see your new video weekly...

    • @ArxivInsights
      @ArxivInsights  6 ปีที่แล้ว +3

      I'd love to make more videos too! But since I'm currently doing this 100% in my spare time and 1 vid takes about 30hrs of work, there's really no way I can do one per week for now :(

    • @poojanpatel2437
      @poojanpatel2437 6 ปีที่แล้ว

      Arxiv Insights Still amazing work till now... Love to see your more videos in future.. ❤

  • @bjbodner3097
    @bjbodner3097 6 ปีที่แล้ว

    Great video, great channel!
    Thanks so much for making this!
    Can't wait to watch more:)

  • @thanasispappas62
    @thanasispappas62 10 หลายเดือนก่อน

    By far the best video of RL ive ever seen.

  • @gudusangtani
    @gudusangtani 4 ปีที่แล้ว

    So well explained ....I also liked the comments on Boston robotics considering the hype and buzz about AI and ML.. You are doing a very good job !

  • @ArturoMoraSoto
    @ArturoMoraSoto 3 ปีที่แล้ว

    Nice explanation, thanks for taking the time to create this great video.

  • @jorgegarcia-torresfdez2471
    @jorgegarcia-torresfdez2471 6 ปีที่แล้ว

    You did again a really nice work ! Congratulations :D

  • @elvispiss
    @elvispiss 2 ปีที่แล้ว +1

    Even after doing my second course of RL, this video is still so informative in its simplicity. Great videos

  • @empiricistsacademy7181
    @empiricistsacademy7181 6 ปีที่แล้ว

    Thanks youuu for this video. Looking forward to your future videos!

  • @tonakkie635
    @tonakkie635 5 ปีที่แล้ว +1

    Great overview, well explained👍.Thanks

  • @sidharthaparhi7930
    @sidharthaparhi7930 5 ปีที่แล้ว

    Also your intro is very high quality, like an intro to a good TV show

  • @biiigates7381
    @biiigates7381 4 ปีที่แล้ว +1

    I've been learning AI for almost a year now and on all the channels I've spent with this is the best one. Very underrated! (btw its the first time i discovered this channel and I instantly subscribed)

    • @mundeepcool
      @mundeepcool 4 ปีที่แล้ว

      Same here, loved this video and I instantly subscribed... and also oh yeah yeah

  • @khajasaen
    @khajasaen 6 ปีที่แล้ว

    Best channel in the crowd ... keep it up Xander

  • @luiseduardocorralesmendoza9396
    @luiseduardocorralesmendoza9396 4 ปีที่แล้ว

    Great examples and great explanation, thank you i was struggling with this topic

  • @alanator25
    @alanator25 ปีที่แล้ว

    Thank you! This was a great introduction!

  • @thaermashkoor6225
    @thaermashkoor6225 2 ปีที่แล้ว

    Thanks for this clear introduction.

  • @senri-
    @senri- 6 ปีที่แล้ว

    Cant wait for the next videos keep up the great work!

  • @laeeqahmed1980
    @laeeqahmed1980 5 ปีที่แล้ว +1

    Great talk. Humans are not good at multiple sound recognition and you added music to your video.

  • @adammenges6300
    @adammenges6300 5 ปีที่แล้ว

    your videos are so good, keep up the great work 💪🏻

  • @digvijaybhandari9747
    @digvijaybhandari9747 ปีที่แล้ว

    Really enjoyed the content here!

  • @ms_1918
    @ms_1918 4 ปีที่แล้ว

    well came here for a 1 min intro to reinforcement learning for first class of course,
    stopped after 16 minutes what a superb experience.

  • @maisamwasti
    @maisamwasti 6 ปีที่แล้ว

    Your videos = super informative! Thanks a lot for the good work

  • @mujahid1324
    @mujahid1324 3 ปีที่แล้ว

    I would say "Wow'. You nailed it in10 mnts what's "reinforcement learning" is. Please keep sending more and more Ai . keep it up, Xander :)

  • @alenasazanova8331
    @alenasazanova8331 4 ปีที่แล้ว

    That's very interesting and understantable video. Thank you very much!

  • @qandos-nour
    @qandos-nour ปีที่แล้ว

    Great and clear explanation

  • @andreasnatsis3027
    @andreasnatsis3027 5 ปีที่แล้ว

    Amazing video. Keep up the good work and soon your channel will explode!

  • @doctorartin
    @doctorartin 4 ปีที่แล้ว

    Doing part of my PhD on potantial AI-strategies fordecision-making in healthcare, and this was very useful, thank you.

    • @varshinis6930
      @varshinis6930 3 ปีที่แล้ว

      Which university??

    • @doctorartin
      @doctorartin 3 ปีที่แล้ว

      @@varshinis6930 Lund University

  • @williamkyburz
    @williamkyburz 5 ปีที่แล้ว +1

    Xander, extremely well done, lucid and cogent. You should be teaching at M.I.T. or Universiteit Gent). The ability to teach complex subjects in an intuitive and simple way is a gift. Wish you the best in everything. Peace

    • @ArxivInsights
      @ArxivInsights  5 ปีที่แล้ว +1

      Thanks William! I am actually doing my PhD in Gent at the moment :)

  • @mgilson
    @mgilson 6 ปีที่แล้ว

    I can't wait for your next video !! 😍😍😍

  • @DavidSaintloth
    @DavidSaintloth 6 ปีที่แล้ว

    Reinforcement learning is along the path to the complex multidimensional salience models that will drive dynamic cognition.
    "Reward shaping" I assert in the salience theory of dynamic cognition that I proposed publicly in 2013 is performed by a combination of autonomic and emotional signal modifications to experience. The key is to tie the reward to the experience and then use that to vary the prediction...this way you don't reward shape as a separate process ...reward shaping is actually performed BY comparison.
    For those interested in the salience theory of dynamic cognition and consciousness a collection of the articles I've written are available at this public Facebook note:
    facebook.com/notes/david-saintloth/discovering-the-dynamic-cognition-cycle/10152513149708057

  • @bsudharsh
    @bsudharsh 5 ปีที่แล้ว

    succinct; its a brilliant rendition on reinforcement learning

  • @skviknesh
    @skviknesh 5 ปีที่แล้ว

    Awesome!!!!!! Bro!!! Great explanation! !!!! Keep continuing!!!

  • @payam-bagheri
    @payam-bagheri 8 หลายเดือนก่อน

    Brilliant video!

  • @karFLY1
    @karFLY1 6 ปีที่แล้ว +1

    Great as usual. Thank you :)

  • @SageElliott
    @SageElliott 5 ปีที่แล้ว

    What a fantastic channel!! 😊

  • @ipuhbamrash6708
    @ipuhbamrash6708 4 ปีที่แล้ว

    Fabulous!! No other word for you!!

  • @azmathmoosa4324
    @azmathmoosa4324 6 ปีที่แล้ว

    I like how u don't hype up anything. Great mate! I subscribe!

  • @gorillapimpin2978
    @gorillapimpin2978 5 ปีที่แล้ว

    my new favorite channel

  • @rajendrarao3057
    @rajendrarao3057 5 ปีที่แล้ว

    awesome video sir. please keep up the good work in this field....

  • @Vladeeer
    @Vladeeer 6 ปีที่แล้ว

    Awesome video, keep up the good work!

  • @011azr
    @011azr 6 ปีที่แล้ว

    Your explanations are great, thanks :)

  • @lamborghinicentenario2497
    @lamborghinicentenario2497 2 หลายเดือนก่อน

    12:28 what did you use to connect the machine learning to a 3d model?

  • @christopherwolff8443
    @christopherwolff8443 6 ปีที่แล้ว

    These videos are great. Keep it up!

  • @sridhasridharan3600
    @sridhasridharan3600 3 ปีที่แล้ว

    Great Videos! I am recommending these to my students.

  • @stefano3808
    @stefano3808 3 ปีที่แล้ว

    really high quality videos, thanks for that

  • @wzyjoseph7317
    @wzyjoseph7317 2 ปีที่แล้ว

    Very clear explaination! Thanks for the work!!!!XD

  • @jordia.2970
    @jordia.2970 3 ปีที่แล้ว

    Great work man

  • @shahulhameed-xc1to
    @shahulhameed-xc1to 4 ปีที่แล้ว

    Great learning experience. Thank you

  • @shirishbajpai9486
    @shirishbajpai9486 9 หลายเดือนก่อน

    watched in 2023 after all the LLMs stuff going on... still such relevant and pure gold!