Lecture 18 - Continous State MDP & Model Simulation | Stanford CS229: Machine Learning (Autumn 2018)

Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming

Andrew Ng: Opportunities in AI - 2023

ว้าแดง แถลงด่วน! ปมข่าวตึงเครียดและจะรบกับไทยไม่จริง ขณะ ทภ.3 สยบก่อนหน้า ยันสัมพันธ์ชายแดนปกติ

น่ากลัวสุดๆว้าแดงโชว์อาวุธหนักพร้อมรบไม่กลัวใครทั้งนั้น

This Is The Worlds Stretchiest Cheese!

Lecture 17 - MDPs & Value/Policy Iteration | Stanford CS229: Machine Learning Andrew Ng (Autumn2018)

Stanford Online

มุมมอง 88 237

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 9 ธ.ค. 2024

ความคิดเห็น • 16

@myao8930 2 ปีที่แล้ว ⁺¹⁷
This is the best instruction among all videos on reinforcement learning. Thank you!
@supersnowva6717 9 หลายเดือนก่อน ⁺³
Such a great lecture on RL! Super clear on these algorithms, thanks so much Profession Ng!
@ali57555 ปีที่แล้ว ⁺²
Thank you very much for explaining this in such simple terms! Been looking for some time for something good to understand MDPs
@KipIngram 2 ปีที่แล้ว ⁺²
1:13:19 - Probably the right model here is the one we used to spread across the planet. Most folks are trying to get by best they can, and are likely to pursue "exploitative" strategies - what they know will bring them what they want. But sometimes we explicitly launch exploration missions, and the "success criterion" of such a mission is very different from that of a "profit oriented initiative." The "payoff" for an exploration mission is *knowledge*. I think keeping the two things cleanly separate is probably the way to go.
@genotabby 8 หลายเดือนก่อน
48:42 this should be for stochastic methods right? If it is deterministic then the value policy V(S) should be calculated based on the 100% chance of the direction in the optimal policy. For stochastic it would be, in this case, 0.8 in the direction of the optimal policy, 0.1 chance for left side of the optimal policy, 0.1 chance for the right side. Since the left side is already at the border, it would return back to it's original state hence 0.1*0.71
@henkjekel4081 2 ปีที่แล้ว ⁺¹
Thank you andrew, u the best
@Ayanshandseals 2 ปีที่แล้ว ⁺²
indeed (1-epsilon) Greedy is the correct term and should have been used!
@gokdeniztingur7515 9 หลายเดือนก่อน
great video man!
@KipIngram 2 ปีที่แล้ว ⁺²
1:12:00 - I feel exactly the same mixed feelings that Dr. Ng seems to feel here. On the one hand, this technology is amazing, and there are so many wonderful things we can do with it, such as helping people get better medical care more quickly, and so on. These things could save lives. But there are also so many nasty things we can do with them; this general category of stuff is part of how we're.. sterilizing the world, so to speak. Removing the "humanity" from things and making our culture colder, more clinical, and less empathic and compassionate. I honestly don't know how to walk that tightrope - in cases like this "if we don't do it, someone else will." I suppose the best we can do is just try every day to keep some sort of "human-ness" in our endeavors. Some of us will do a pretty good job of that - some of us won't. 😞
@griffinbholt 2 ปีที่แล้ว ⁺²
Is there one student in the class with just a crazy deep voice? Or are they masking students' voices?
@griffinbholt 2 ปีที่แล้ว ⁺¹
Nvm. I can confirm they are masking students' voices. One time they accidentally masked Dr. Ng's voice.
@PhucHoang-ng4vh ปีที่แล้ว
@@griffinbholt u can see in another video, they would blurred it whenever a student appeared on screen
@McAwesomeReaper ปีที่แล้ว ⁺¹
The real genius on display here is the guy who invented the silicon based ink of the markers.
@FelixtheSame 8 หลายเดือนก่อน
deep.
@andrewpan1700 ปีที่แล้ว ⁺⁴
robot at 1:08:10 looking a little stiff
@SuzanneWolfe-zc9bt 3 หลายเดือนก่อน
Thompson Margaret Rodriguez Susan Allen Nancy

ต่อไป

เล่นอัตโนมัติ

Lecture 18 - Continous State MDP & Model Simulation | Stanford CS229: Machine Learning (Autumn 2018)

Lecture 18 - Continous State MDP & Model Simulation | Stanford CS229: Machine Learning (Autumn 2018)

Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming

Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming

Andrew Ng: Opportunities in AI - 2023

Andrew Ng: Opportunities in AI - 2023

ว้าแดง แถลงด่วน! ปมข่าวตึงเครียดและจะรบกับไทยไม่จริง ขณะ ทภ.3 สยบก่อนหน้า ยันสัมพันธ์ชายแดนปกติ

ว้าแดง แถลงด่วน! ปมข่าวตึงเครียดและจะรบกับไทยไม่จริง ขณะ ทภ.3 สยบก่อนหน้า ยันสัมพันธ์ชายแดนปกติ

น่ากลัวสุดๆว้าแดงโชว์อาวุธหนักพร้อมรบไม่กลัวใครทั้งนั้น

น่ากลัวสุดๆว้าแดงโชว์อาวุธหนักพร้อมรบไม่กลัวใครทั้งนั้น

This Is The Worlds Stretchiest Cheese!

This Is The Worlds Stretchiest Cheese!

🔴LIVE ลาว vs เวียดนาม | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม B

🔴LIVE ลาว vs เวียดนาม | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม B

Has Generative AI Already Peaked? - Computerphile

Has Generative AI Already Peaked? - Computerphile

Lecture 5 - GDA & Naive Bayes | Stanford CS229: Machine Learning Andrew Ng (Autumn 2018)

Lecture 5 - GDA & Naive Bayes | Stanford CS229: Machine Learning Andrew Ng (Autumn 2018)

Policy and Value Iteration

Policy and Value Iteration

Lecture 6 - Support Vector Machines | Stanford CS229: Machine Learning Andrew Ng (Autumn 2018)

Lecture 6 - Support Vector Machines | Stanford CS229: Machine Learning Andrew Ng (Autumn 2018)

Reinforcement Learning, by the Book

Reinforcement Learning, by the Book

Markov Decision Processes - Computerphile

Markov Decision Processes - Computerphile

AI, Machine Learning, Deep Learning and Generative AI Explained

AI, Machine Learning, Deep Learning and Generative AI Explained

Think Faster, Talk Smarter with Matt Abrahams

Think Faster, Talk Smarter with Matt Abrahams

Lecture 10 - Decision Trees and Ensemble Methods | Stanford CS229: Machine Learning (Autumn 2018)

Lecture 10 - Decision Trees and Ensemble Methods | Stanford CS229: Machine Learning (Autumn 2018)

ด่วนทหารไทยขับรถถังซุ่มโผล่ชายแดนแล้วพร้อมยิง

ด่วนทหารไทยขับรถถังซุ่มโผล่ชายแดนแล้วพร้อมยิง

Store Owner's Social Experiment Unveils True Characters #shorts

Store Owner's Social Experiment Unveils True Characters #shorts

🔴LIVE กัมพูชา vs มาเลเซีย | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม A

🔴LIVE กัมพูชา vs มาเลเซีย | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม A

[Live] : ONE FIGHT NIGHT 26 วันนี้!! “คริสเตียน vs อาลิเบก”

[Live] : ONE FIGHT NIGHT 26 วันนี้!! “คริสเตียน vs อาลิเบก”

IGITT! HAT ER GERADE EINE SCHWAMM GEGESSEN?! ICH BIN RAUS! 😹🧽

IGITT! HAT ER GERADE EINE SCHWAMM GEGESSEN?! ICH BIN RAUS! 😹🧽

เส้นทางคนที่โดนไล่ | Path of Exile 2 วันที่ 1

เส้นทางคนที่โดนไล่ | Path of Exile 2 วันที่ 1

กล้องจับภาพ สัตว์ประหลาดใต้น้ำ!

กล้องจับภาพ สัตว์ประหลาดใต้น้ำ!