Deep Q Learning Pong Tutorial

แชร์
ฝัง
  • เผยแพร่เมื่อ 4 ธ.ค. 2024

ความคิดเห็น • 8

  • @mikepetropavlovskiy8667
    @mikepetropavlovskiy8667 4 ปีที่แล้ว +5

    Jeez, these are the most comprehensive tutorials that I've found so far. Thank you soooo much. Everything that is needed is here: overall overview, necessary mathematic, code.

  • @digitaldrive1192
    @digitaldrive1192 3 ปีที่แล้ว

    By far the best explanation of this material. You have a knack for explaining this in the simplest terms.

  • @anilkurkcu3389
    @anilkurkcu3389 6 ปีที่แล้ว +5

    Hey Colin, when specifying a = max_a(Q(s,a)), I think that the 'argmax' operator would be more suitable, since 'max' is returning a value rather than an action, while 'argmax' returns the action itself that maximizes the term inside the parenthesis.

  • @howardnolan3198
    @howardnolan3198 10 หลายเดือนก่อน

    Thank you for making this, very helpful.

  • @IntoTheSkyy
    @IntoTheSkyy 5 ปีที่แล้ว +3

    So I think we can all agree that the only good question to be asked is where you got that pic of the cute game playing robot

  • @gideonprior4842
    @gideonprior4842 5 ปีที่แล้ว +1

    Shouldnt the subscript in step 6 be s' ?

  • @karthik-ex4dm
    @karthik-ex4dm 6 ปีที่แล้ว +2

    Can we make it interactive with more diagrams ?

  • @billykotsos4642
    @billykotsos4642 4 ปีที่แล้ว +2

    This is a nice copy paste from Hands-on Deep Reinforcement Learning.