Deep Q Learning Pong Tutorial

Policy Gradient Methods Tutorial

Q Learning Tutorial for Ride Sharing (Open AI Taxi)

7 ตำรวจโหด โคตรงามไส้ จับผิดตัวกระทืบสาหัส l EP.1818 l 6 ธ.ค.67 l#โหนกระแส

เล่นเกมรถถัง แพ้โดน! "รถถัง จิตรเมืองนนท์" เตะ

นี่คือสงครามอวกาศ #ตลก #เพื่อน #ละครสั้น #starwars

Augmented Random Search Tutorial - How to Train Robots to Walk!

Skowster the Geek

มุมมอง 5 985

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 8 ธ.ค. 2024

ความคิดเห็น • 15

@craigowsen4501 4 ปีที่แล้ว ⁺¹
Very clear explanation! The one thing that was a little confusing was why you used self.hp.noise to throttle the deltas in the policy evaluate function but not in the update function.
@tibor2077 6 ปีที่แล้ว ⁺²
Great video, explains everything clearly , step by step. Thanks again :)
@joeyng9754 6 ปีที่แล้ว ⁺¹
Thank you brother , I'm a huge fun, I have finally SUBSCRIBED , about time .
@TheMyrkiriad 4 ปีที่แล้ว ⁺¹
Normalization do not constrain inputs betwen 0 and 1 but rather -1 and 1. Correct ?
@anilkurkcu3389 6 ปีที่แล้ว
Could you share the link for the multiprocessing version?
@janwarchocki6154 6 ปีที่แล้ว
Can i use it to different things than robots, for example in some games in which i have reward system ?
@hazemahmed8333 4 ปีที่แล้ว
I am really glad i found your channel !! I can't thank you enough, really appreciate your amazing effort
@damonholden8572 3 ปีที่แล้ว
Sorry to be offtopic but does any of you know a tool to get back into an instagram account..?
I was dumb forgot the password. I would appreciate any tips you can give me!
@bentonhugo2541 3 ปีที่แล้ว
@Damon Holden instablaster :)
@damonholden8572 3 ปีที่แล้ว
@Benton Hugo I really appreciate your reply. I got to the site on google and I'm trying it out now.
Seems to take a while so I will get back to you later with my results.
@damonholden8572 3 ปีที่แล้ว
@Benton Hugo It did the trick and I finally got access to my account again. I am so happy!
Thanks so much, you saved my account !
@bentonhugo2541 3 ปีที่แล้ว
@Damon Holden Happy to help :)
@Chillos100 4 ปีที่แล้ว
I'm loving these series, thanks a lot!! I'm also trying to re-code this to gain better understanding, however I'm getting this error:
Traceback (most recent call last):
File "aug_rand_search.py", line 180, in
trainer.train()
File "aug_rand_search.py", line 156, in train
self.policy.update(rollouts, sigma_rewards)
File "aug_rand_search.py", line 84, in update
for r_pos, r_neg, delta in rollouts:
ValueError: not enough values to unpack (expected 3, got 2)
What am I doing wrong? Any help is much appreciated.. thnx
@Chillos100 4 ปีที่แล้ว
It solved and working! thnx for the upload
@AB-gd8hn 5 ปีที่แล้ว
Does OpenAI gym work on Linux only? Any equivalent in Windows?

ต่อไป

เล่นอัตโนมัติ

Deep Q Learning Pong Tutorial

Deep Q Learning Pong Tutorial

Policy Gradient Methods Tutorial

Policy Gradient Methods Tutorial

Q Learning Tutorial for Ride Sharing (Open AI Taxi)

Q Learning Tutorial for Ride Sharing (Open AI Taxi)

7 ตำรวจโหด โคตรงามไส้ จับผิดตัวกระทืบสาหัส l EP.1818 l 6 ธ.ค.67 l#โหนกระแส

7 ตำรวจโหด โคตรงามไส้ จับผิดตัวกระทืบสาหัส l EP.1818 l 6 ธ.ค.67 l#โหนกระแส

เล่นเกมรถถัง แพ้โดน! "รถถัง จิตรเมืองนนท์" เตะ

เล่นเกมรถถัง แพ้โดน! "รถถัง จิตรเมืองนนท์" เตะ

นี่คือสงครามอวกาศ #ตลก #เพื่อน #ละครสั้น #starwars

นี่คือสงครามอวกาศ #ตลก #เพื่อน #ละครสั้น #starwars

没想到出去了一会儿老公就干出这种事！幸好还有这个可以陪着我！#funny#萌娃#夫妻#剧情

没想到出去了一会儿老公就干出这种事！幸好还有这个可以陪着我！#funny#萌娃#夫妻#剧情

The moment we stopped understanding AI [AlexNet]

The moment we stopped understanding AI [AlexNet]

The Player Type Alignment Tesseract

The Player Type Alignment Tesseract

Proximal Policy Optimization (PPO) Tutorial - Master Roboschool!!!

Proximal Policy Optimization (PPO) Tutorial - Master Roboschool!!!

Dear Game Developers, Stop Messing This Up!

Dear Game Developers, Stop Messing This Up!

Computer Scientist Explains Machine Learning in 5 Levels of Difficulty | WIRED

Computer Scientist Explains Machine Learning in 5 Levels of Difficulty | WIRED

Monte Carlo Reinforcement Learning Tutorial

Monte Carlo Reinforcement Learning Tutorial

Actor Critic (A3C) Tutorial

Actor Critic (A3C) Tutorial

I never understood why you can't go faster than light - until now!

I never understood why you can't go faster than light - until now!

Coding Adventure: Boids

Coding Adventure: Boids

เอาชีวิตรอด 24 ชั่วโมง กับครอบครัว!! บนเรือ DIY HOMEMADE เคลื่อนที่!!

เอาชีวิตรอด 24 ชั่วโมง กับครอบครัว!! บนเรือ DIY HOMEMADE เคลื่อนที่!!

공중에서 다리찢기⁉️😱 Split Balance Challenge

공중에서 다리찢기⁉️😱 Split Balance Challenge

ออร่า (AURA) - JUEPAK feat. SARAN | [OFFICIAL MV]

ออร่า (AURA) - JUEPAK feat. SARAN | [OFFICIAL MV]

ซวยแล้วไล่ออก! 7ตำรวจตื้บชาวบ้าน | HOTSHOT เดลินิวส์ 07/12/67

ซวยแล้วไล่ออก! 7ตำรวจตื้บชาวบ้าน | HOTSHOT เดลินิวส์ 07/12/67

ห้ามพูดคำบนหัว ep9 #คำต้องห้าม

ห้ามพูดคำบนหัว ep9 #คำต้องห้าม

ไฮไลท์ฟุตบอล บุนเดสลีกา | บาเยิร์น มิวนิค 4-2 ไฮเดนไฮม์ | 7 ธ.ค. 67

ไฮไลท์ฟุตบอล บุนเดสลีกา | บาเยิร์น มิวนิค 4-2 ไฮเดนไฮม์ | 7 ธ.ค. 67

ถ้าคุณหาห้องลับเจอ รับไปเลย $1,000,000

ถ้าคุณหาห้องลับเจอ รับไปเลย $1,000,000