This is the best video I've watched so far for understanding this topic
Very good lecture, maybe the best introduction to this topic I've ever seen on YouTube
This is such a clear explanation!! Ty for this!! I wish I had taken your class while I was in VT!
Thank you, I spent hours on this algorithm and finally understood it!
Thanks a lot! I found this very helpful.
Absolutely amazing lecture!!!
The video provides an intuitive but deep understanding of MDPs
Thank you very much, very well explained.
This was fantastic, thank you!
Thank you so much, you explain the subject very well and have helped me understand.
Really good video about this topic. Thank you
Thank you for this brilliant explanation. I wish there were practice questions with solutions.
Great video. Thank you. Could you please make a similar video while we consider a two-dimensional Markov chain with more states?
Thanks for the explanation!
Is the reward R(s) actually R(s')? And should it also be multiplied by the transition probability, i.e. max_a sum_{s', r} p(s', r | s, a) [r + gamma * V(s')]? I am trying to relate the equation presented in the video to the standard four-argument notation.
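On the notation question: the four-argument form p(s', r | s, a) does multiply each outcome's (r + gamma * V(s')) by its probability before taking the max over actions. A minimal sketch of that backup, run as value iteration. The tiny two-state MDP here (states, actions, probabilities, rewards) is made up for illustration, not the one in the video:

```python
gamma = 0.9

# Transition model in four-argument style:
# p[(s, a)] -> list of (probability, next_state, reward) triples
p = {
    ("s0", "left"):  [(1.0, "s0", 0.0)],
    ("s0", "right"): [(0.8, "s1", 1.0), (0.2, "s0", 0.0)],
    ("s1", "left"):  [(1.0, "s0", 0.0)],
    ("s1", "right"): [(1.0, "s1", 2.0)],
}
actions = ["left", "right"]

def bellman_backup(V, s):
    # V(s) = max_a sum_{s', r} p(s', r | s, a) * (r + gamma * V(s'))
    return max(
        sum(prob * (r + gamma * V[s2]) for prob, s2, r in p[(s, a)])
        for a in actions
    )

V = {"s0": 0.0, "s1": 0.0}
for _ in range(100):  # repeat the backup until the values converge
    V = {s: bellman_backup(V, s) for s in V}

print(V)  # V["s1"] converges to 2 / (1 - gamma) = 20
```

Note the reward r is paired with the successor state s' inside the sum, which is why "R(s) vs. R(s')" dissolves in this notation: the reward rides along with each transition.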
Really great explanation of machine learning
I have a question at 8:30: if we take the action to go left, why isn't Pr(c | b, left) = 0.00? (we'd be going to the other side)
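A common reason Pr(c | b, left) can be nonzero is that the environment is "slippery": the action succeeds only with some probability and slips elsewhere with the rest. Whether the video's model does this exactly is for the lecturer to confirm; the state names and numbers below are placeholders:

```python
# Hypothetical slippery transition model: the intended move succeeds with
# probability 0.8, the agent stays put with 0.1, and slips the opposite
# way with 0.1. States a, b, c are illustrative, not the video's model.

def transition(s, a):
    # Returns {next_state: probability} for taking action a in state b.
    intended = {"left": "a", "right": "c"}
    slip = {"left": "c", "right": "a"}
    return {intended[a]: 0.8, s: 0.1, slip[a]: 0.1}

print(transition("b", "left"))  # even "left" leaves 0.1 mass on state c
```

Under a model like this, Pr(c | b, left) = 0.1 rather than 0, which is what makes the problem a genuinely stochastic MDP instead of a deterministic search.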
This is a great explanation video, thanks so much. Your voice is easy to listen to too haha.
Ryan Flynn Thanks! I’m glad it’s helpful. My smooth voice is a huge disadvantage when I teach morning classes and my students all fall asleep.
Very interesting topic. And I think you would make a fortune if you used your voice in the advertising field. Best regards.
Can anyone explain the switch at 32:00 between the two modes (represented by the green and red arrows)? To me the green one seems like a deterministic rule and the red one a stochastic rule. Can they exist simultaneously?
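On whether the two can coexist: a deterministic policy is just a stochastic policy that puts all its probability mass on one action, so the two notations describe the same kind of object and can be mixed freely. A small sketch (the states and actions here are hypothetical, not the video's):

```python
import random

actions = ["left", "right"]

def deterministic_policy(s):
    # pi(s) -> a: always the same action for a given state
    return "right"

def stochastic_policy(s):
    # pi(a | s): a distribution over actions; acting means sampling from it
    probs = {"left": 0.3, "right": 0.7}
    return random.choices(actions, weights=[probs[a] for a in actions])[0]

def as_distribution(det_policy, s):
    # A deterministic rule viewed as a stochastic one: all mass on pi(s)
    return {a: (1.0 if a == det_policy(s) else 0.0) for a in actions}

print(as_distribution(deterministic_policy, "s0"))  # {'left': 0.0, 'right': 1.0}
```

So a green "deterministic" arrow and a red "stochastic" arrow aren't contradictory: the deterministic case is the degenerate distribution.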
Fantastic video! Thanks a lot!
This is so good!!!
Thanks for your nice tutorial! Is it possible to upload the slides?
Very good, easy to understand.
What software are you using to make this?? It looks like you have like an infinite page which gives a really clean look
Nothing too fancy. This was done with Apple Keynote, and I'm faking that scrolling effect with "Magic Move" animations. I'm always looking for better tools to build useful visuals for lectures.
At 17:35, why isn't it gamma ∈ (0, 1) instead of (0, 1]? If gamma = 1, the influence of actions farther down the road stays the same as all other actions, rather than shrinking... right?
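Right that gamma = 1 stops shrinking future influence; it is still allowed because for episodic (finite-horizon) tasks the return is a finite sum that converges anyway, while gamma < 1 is needed for infinite-horizon tasks. A quick illustration of how gamma weights a reward k steps ahead (the reward stream is made up):

```python
# Return = sum_k gamma**k * r_k. With gamma < 1 the weight gamma**k
# shrinks geometrically; with gamma = 1 every step counts equally
# (fine for a finite episode, divergent for an infinite reward stream).

def discounted_return(rewards, gamma):
    return sum(gamma**k * r for k, r in enumerate(rewards))

rewards = [1.0] * 10  # a made-up episode of ten unit rewards

print(discounted_return(rewards, 0.9))  # geometric weighting: ~6.51
print(discounted_return(rewards, 1.0))  # undiscounted: exactly 10.0
```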
Thank you.
Best video for MDP on youtube
What is the name of the textbook?
What textbook? Thank you very much
much better than my lecturer
excellent
Around 34:00, when the equations are on screen, a pointer or highlight to indicate what you are talking about would help. It's not clear which term you mean.
God bless
honesty
The best I found is [4, 1]. I couldn't reach [4.2, 1.2]. Did anyone achieve [4.2, 1.2]?
YES, I GOT IT
33:27