The contextual bandit is a simple concept, but I get confused by the mathematical abstraction, subscripts, etc.; at points the indices get tangled up and become inconsistent. In both the Bernoulli and linear models you could use one toy example, like the coin example, to illustrate how the algorithm works end to end.
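To make that end-to-end flow concrete, here is a minimal sketch (my own illustration, not taken from the lecture) of Thompson sampling for a two-coin Bernoulli bandit: each arm is a coin with an unknown bias, a Beta posterior is kept per coin, and the biases 0.4 and 0.6 are assumed purely for the toy example.

```python
# Minimal Thompson sampling sketch for a Bernoulli (coin-flip) bandit.
# Assumptions for the toy: two coins with hidden biases 0.4 and 0.6,
# Beta(1, 1) priors on each bias, 1000 rounds of play.
import numpy as np

rng = np.random.default_rng(0)

true_bias = [0.4, 0.6]     # hidden coin biases (assumed for the toy example)
alpha = np.ones(2)         # Beta posterior parameters, start with Beta(1, 1)
beta = np.ones(2)

for t in range(1000):
    # 1. Sample a plausible bias for each coin from its current posterior.
    theta = rng.beta(alpha, beta)
    # 2. Play the coin whose sampled bias is highest.
    a = int(np.argmax(theta))
    # 3. Observe a Bernoulli reward (heads = 1) from the chosen coin.
    r = rng.binomial(1, true_bias[a])
    # 4. Update that coin's Beta posterior with the observed outcome.
    alpha[a] += r
    beta[a] += 1 - r

print("posterior means:", alpha / (alpha + beta))
```

Running it, the posterior means concentrate near the true biases and the better coin ends up being played most of the time.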
I think there is an error in the posterior predictive distribution at 59:52. First, I suppose it is the posterior predictive of the mean reward rather than of the reward, because an additional σ^2 is missing from the covariance of the posterior predictive. But my main concern is the mean of the posterior predictive distribution: I think it should be x * μ instead of σ^2 * x * μ. Any insights?
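For reference, under the usual Bayesian linear-regression setup this comment seems to assume (parameter posterior θ | D ~ N(μ, Σ), reward r = xᵀθ + ε with ε ~ N(0, σ²)), the standard posterior predictive for the reward and for the mean reward are:

```latex
% Posterior predictive under the standard linear-Gaussian model
% (assumptions: \theta \mid \mathcal{D} \sim \mathcal{N}(\mu, \Sigma),
%  r = x^{\top}\theta + \varepsilon, \ \varepsilon \sim \mathcal{N}(0, \sigma^2)).
r \mid x, \mathcal{D} \sim \mathcal{N}\!\left(x^{\top}\mu,\; x^{\top}\Sigma\,x + \sigma^{2}\right),
\qquad
x^{\top}\theta \mid x, \mathcal{D} \sim \mathcal{N}\!\left(x^{\top}\mu,\; x^{\top}\Sigma\,x\right).
```

In this standard form the predictive mean is xᵀμ in both cases, and the σ² term appears only in the predictive variance of the reward itself, which is consistent with the point raised in the comment (whether the lecture uses a different parametrization at 59:52 is a separate question).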
Beautiful! Thank you so much!!
The math is very clearly explained; a really good lecture.
Thank you so much, explained very well
Thank you, professor, great lecture.
great professor
He is really good.
How do you do contextual exploration in the case of neural-network function approximation?
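One common recipe (offered here only as an illustration, not as something covered in the lecture) is the "neural-linear" approach: a network provides a feature map and Thompson sampling is done with a Bayesian linear model on the last-layer features. Below is a rough numpy sketch where the feature map is just a fixed random one-hidden-layer network, and the dimensions, arm count, and noise level are all assumptions made for the toy; other options include bootstrapped ensembles or dropout-based approximate posteriors.

```python
# Rough sketch of "neural-linear" Thompson sampling: a (here fixed, random)
# network gives features phi(x), and a Bayesian linear model over phi(x)
# supplies the posterior sampling used for exploration. All constants below
# are assumptions made for this toy, not values from the lecture.
import numpy as np

rng = np.random.default_rng(0)
d, h, n_arms, sigma2 = 5, 32, 3, 0.25      # context dim, hidden width, arms, noise var

W = rng.normal(size=(h, d)) / np.sqrt(d)   # fixed random hidden layer
def feat(x):                               # feature map phi(x)
    return np.tanh(W @ x)

# Per-arm Bayesian linear regression on phi(x), prior theta ~ N(0, I).
A = [np.eye(h) for _ in range(n_arms)]     # posterior precision matrices
b = [np.zeros(h) for _ in range(n_arms)]

true_theta = rng.normal(size=(n_arms, d))  # hidden reward parameters (toy)

for t in range(500):
    x = rng.normal(size=d)
    phi = feat(x)

    # Thompson step: sample a weight vector from each arm's posterior and
    # play the arm whose sampled expected reward is largest.
    scores = []
    for arm in range(n_arms):
        cov = np.linalg.inv(A[arm])
        cov = 0.5 * (cov + cov.T)          # symmetrize against round-off
        mu = cov @ b[arm]
        theta = rng.multivariate_normal(mu, cov)
        scores.append(phi @ theta)
    a = int(np.argmax(scores))

    # Observe a noisy reward and update the played arm's posterior.
    r = true_theta[a] @ x + rng.normal(scale=np.sqrt(sigma2))
    A[a] += np.outer(phi, phi) / sigma2
    b[a] += phi * r / sigma2
```

In practice the feature map would be retrained periodically on the logged data rather than kept fixed, but the exploration mechanism (sampling from the last-layer posterior) stays the same.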