Markov Decision Processes

  • Published on 4 Dec 2024

Comments • 8

  • @nantunest 2 years ago +1

    Excellent class, very well explained! Please, make a playlist for the reinforcement learning subject. That would be great!

    • @OlivierSigaud 2 years ago

      Well, the lesson you found is the second in my tabular reinforcement learning playlist...
      So I'm quite sure that with a short search, your wish will be fulfilled. :)

  • @hongkyulee9724 2 years ago

    Your explanations and slides are very intuitive for me :D Thank you very much for the nice video. I wish you happiness :D

    • @OlivierSigaud 2 years ago +1

      Thanks for your kind message

  • @saradehghani1153 4 years ago +1

    Thanks for your video, you explain clearly.
    How can we obtain a transition function to decide which direction to go in?

    • @OlivierSigaud 4 years ago

      The transition function is one of the components of the definition of an MDP. So either you have to give it when defining the MDP, or the agent has to learn it by collecting statistics while it is moving in the environment.
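
To illustrate the second option in the reply above (learning the transition function from statistics), here is a minimal Python sketch. It assumes the agent has logged (state, action, next state) triples and estimates P(s' | s, a) as a relative frequency. The function name `estimate_transitions` and the sample data are hypothetical, chosen only for illustration; they do not come from the video.

```python
from collections import defaultdict


def estimate_transitions(samples):
    """Estimate P(s' | s, a) by counting observed (s, a, s') triples."""
    # counts[(s, a)][s_next] = number of times s_next followed (s, a)
    counts = defaultdict(lambda: defaultdict(int))
    for s, a, s_next in samples:
        counts[(s, a)][s_next] += 1

    # Normalize each row of counts into a probability distribution.
    model = {}
    for sa, next_counts in counts.items():
        total = sum(next_counts.values())
        model[sa] = {s_next: n / total for s_next, n in next_counts.items()}
    return model


# Hypothetical log: from state 0, action "right" reached state 1
# three times and stayed in state 0 once.
samples = [
    (0, "right", 1),
    (0, "right", 1),
    (0, "right", 0),
    (0, "right", 1),
]
model = estimate_transitions(samples)
print(model[(0, "right")])
```

With more samples, the estimates get closer to the true transition probabilities. State-action pairs that were never visited simply have no entry in `model`.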

  • @martinspage 5 years ago +1

    Thank you, it's very well done.