Augmented Random Search Tutorial - How to Train Robots to Walk!

  • Published on Dec 4, 2024

Comments • 15

  • @craigowsen4501 4 years ago +1

    Very clear explanation! The one thing that was a little confusing was why you used self.hp.noise to throttle the deltas in the policy evaluate function but not in the update function.
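
A minimal sketch of the asymmetry this comment asks about, assuming the video's code follows the standard Augmented Random Search update. The function names, argument names, and the 1-by-2 weight shape are hypothetical, not taken from the video. In this formulation the noise factor only scales the perturbation tried during a rollout, while the update step is scaled by the learning rate and the standard deviation of the rewards instead, which may be why noise does not appear in the update:

```python
import numpy as np

def perturbed_action(theta, state, delta, noise, direction):
    """Evaluate the policy with weights nudged along +delta or -delta."""
    sign = {"positive": 1.0, "negative": -1.0}.get(direction, 0.0)
    # The noise factor controls how far we explore away from theta.
    return (theta + sign * noise * delta).dot(state)

def ars_update(theta, rollouts, sigma_r, learning_rate):
    """Move theta along each delta, weighted by its reward difference."""
    step = np.zeros_like(theta)
    for r_pos, r_neg, delta in rollouts:
        step += (r_pos - r_neg) * delta
    # Step size comes from the learning rate and reward std, not from noise.
    return theta + learning_rate / (len(rollouts) * sigma_r) * step

theta = np.zeros((1, 2))
delta = np.ones((1, 2))
action = perturbed_action(theta, np.array([1.0, 2.0]), delta, 0.1, "positive")
new_theta = ars_update(theta, [(2.0, 1.0, delta)], sigma_r=1.0, learning_rate=0.5)
```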

  • @tibor2077 6 years ago +2

    Great video, explains everything clearly, step by step. Thanks again :)

  • @joeyng9754 6 years ago +1

    Thank you, brother. I'm a huge fan and I have finally SUBSCRIBED, about time.

  • @hazemahmed8333 4 years ago

    I am really glad I found your channel! I can't thank you enough. I really appreciate your amazing effort.

  • @TheMyrkiriad 4 years ago +1

    Normalization does not constrain inputs between 0 and 1 but rather between -1 and 1. Correct?
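
A small sketch relevant to this question, assuming the video normalizes observations by a running mean and standard deviation, as the ARS paper's V2 variant does. The class and method names here are hypothetical, and the 1e-2 variance floor is an illustrative choice. Under that assumption the output is a z-score, so it is centered near 0 but is not strictly bounded to either [0, 1] or [-1, 1]:

```python
import numpy as np

class RunningNormalizer:
    """Tracks a running mean and variance per input (Welford's method)."""

    def __init__(self, n_inputs):
        self.n = 0
        self.mean = np.zeros(n_inputs)
        self.m2 = np.zeros(n_inputs)  # running sum of squared deviations

    def observe(self, x):
        self.n += 1
        d = x - self.mean
        self.mean += d / self.n
        self.m2 += d * (x - self.mean)

    def normalize(self, x):
        var = self.m2 / max(self.n, 1)
        # Floor the variance so early, near-constant inputs do not blow up.
        std = np.sqrt(np.maximum(var, 1e-2))
        return (x - self.mean) / std

norm = RunningNormalizer(1)
for value in (0.0, 2.0, 4.0):
    norm.observe(np.array([value]))

typical = norm.normalize(np.array([2.0]))   # at the mean, maps to 0
outlier = norm.normalize(np.array([10.0]))  # far from the mean, exceeds 1
```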

  • @janwarchocki6154 6 years ago

    Can I use it for things other than robots, for example in games that have a reward system?

  • @anilkurkcu3389 6 years ago

    Could you share the link for the multiprocessing version?

  • @Chillos100 4 years ago

    I'm loving this series, thanks a lot! I'm also trying to re-code this to gain a better understanding, but I'm getting this error:
    Traceback (most recent call last):
      File "aug_rand_search.py", line 180, in <module>
        trainer.train()
      File "aug_rand_search.py", line 156, in train
        self.policy.update(rollouts, sigma_rewards)
      File "aug_rand_search.py", line 84, in update
        for r_pos, r_neg, delta in rollouts:
    ValueError: not enough values to unpack (expected 3, got 2)
    What am I doing wrong? Any help is much appreciated, thanks.

    • @Chillos100 4 years ago

      It's solved and working! Thanks for the upload.
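
The thread does not say what the fix was. As an illustration only, one common way to hit this exact ValueError is building `rollouts` from two sequences instead of three, so each item is a pair while the loop expects `(r_pos, r_neg, delta)`. The variable names and values below are hypothetical:

```python
import numpy as np

positive_rewards = [1.0, 3.0]
negative_rewards = [2.0, 0.5]
deltas = [np.ones(2), -np.ones(2)]

# Broken: the deltas were left out, so every item is a 2-tuple.
broken = list(zip(positive_rewards, negative_rewards))
try:
    for r_pos, r_neg, delta in broken:
        pass
except ValueError as err:
    message = str(err)  # same error text as in the traceback above

# Fixed: zip all three sequences so each item unpacks into three names.
rollouts = list(zip(positive_rewards, negative_rewards, deltas))
for r_pos, r_neg, delta in rollouts:
    assert delta.shape == (2,)
```

Printing `len(rollouts[0])` just before the failing loop is a quick way to check which case applies.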

  • @AB-gd8hn 5 years ago

    Does OpenAI gym work on Linux only? Any equivalent in Windows?