Advanced Skills through Multiple Adversarial Motion Priors in Reinforcement Learning
- Published on 21 Mar 2022
- Nvidia presented parts of this work at GTC 2022, revealing our humanoid-quadruped transformer!
Title: Advanced Skills through Multiple Adversarial Motion Priors in Reinforcement Learning
Authors: Eric Vollenweider, Marko Bjelonic, Victor Klemm, Nikita Rudin, Joonho Lee and Marco Hutter
Paper submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems in Kyoto.
Preprint: arxiv.org/abs/2203.14912
Abstract: In recent years, reinforcement learning (RL) has shown outstanding performance for locomotion control of highly articulated robotic systems. Such approaches typically involve tedious reward function tuning to achieve the desired motion style. Imitation learning approaches such as adversarial motion priors aim to reduce this problem by encouraging a pre-defined motion style. In this work, we present an approach to augment the concept of adversarial motion prior-based RL to allow for multiple, discretely switchable styles. We show that multiple styles and skills can be learned simultaneously without notable performance differences, even in combination with motion data-free skills. Our approach is validated in several real-world experiments with a wheeled-legged quadruped robot showing skills learned from existing RL controllers and trajectory optimization, such as ducking and walking, and novel skills such as switching between a quadrupedal and humanoid configuration. For the latter skill, the robot is required to stand up, navigate on two wheels, and sit down. Instead of tuning the sit-down motion, we verify that a reverse playback of the stand-up movement helps the robot discover feasible sit-down behaviors and avoids tedious reward function tuning.
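The multi-style idea in the abstract can be sketched in code. Below is a minimal, hypothetical illustration (not the authors' implementation): one discriminator per motion style scores the policy's transitions, the commanded style selects which discriminator supplies the style reward, and that reward is mixed with the task reward. The linear "discriminators", dimensions, and weights are stand-in assumptions; the reward shape `max(0, 1 - 0.25*(d - 1)^2)` is the common least-squares form used in AMP-style training.

```python
# Hypothetical sketch of multiple adversarial motion priors with a
# discretely switchable style. All names, sizes, and weights are
# illustrative assumptions, not the paper's code.
import numpy as np

rng = np.random.default_rng(0)

NUM_STYLES = 3   # e.g. walk, duck, stand-up (illustrative)
OBS_DIM = 8      # discriminator observation size (assumed)

# Stand-in "discriminators": one linear scorer per style.
# In practice these would be trained networks (d -> 1 on reference motion).
disc_weights = rng.normal(size=(NUM_STYLES, OBS_DIM))

def style_reward(obs, style_id):
    """Style reward from the selected style's discriminator,
    using the least-squares form r = max(0, 1 - 0.25*(d - 1)^2)."""
    d = float(disc_weights[style_id] @ obs)
    return max(0.0, 1.0 - 0.25 * (d - 1.0) ** 2)

def total_reward(obs, style_id, task_reward, w_style=0.5, w_task=0.5):
    # The policy also observes the one-hot style command, so a single
    # network can switch styles at run time.
    return w_task * task_reward + w_style * style_reward(obs, style_id)

obs = rng.normal(size=OBS_DIM)
print(total_reward(obs, style_id=1, task_reward=0.8))
```

The reverse-playback trick from the abstract fits the same picture: feeding the stand-up clip backwards to a "sit-down" discriminator gives the robot a style target without hand-tuning a sit-down reward.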
Note: The following parts of the video are sped up:
- Door opening and closing when the robot stands inside the elevator between 00:20 and 00:21 (+200%)
- In-between the stand-up and sit-down sequence between 00:49 and 01:10 (+200%, only the navigation on two wheels)
- Crowd interaction between 1:41 and 1:50 (+150%)
- Last drone footage after 2:01 (+200%)
Learn more at www.swiss-mile.com/
Disclaimer: Robot from ANYbotics; customized by ETH Zürich; strictly for research purposes.
It's amazing when it briefly goes one-legged at 1:31
Good catch :)
Technically, this robot is doing a wheelie!
True 🤔
Bro are you genius
@@leejunja 🤣
Wow. She is amazing!!!
She? He? It? They? Pronouns are hard to define for robots. :)
Wow
😋
This is amazing. I follow all things robots and you guys are on a different level...
What an honor! :)
This is Awesome!
Thank you :)
That's so cool!
Thank you :)
I loved the learning sessions. You were great with translating to the general viewer pretty tough topics. And I have a crush on your robot.
Thank you! 😃
how wonderful!
Nice, I saw this from NVIDIA (in their presentation this year).
The robot was featured a couple of times there :)
If you put the front wheels at the "elbow" instead of "wrist" level then the wrists are freed to mount useful grippers.
Good idea!
I LOVE this machine.
Thank you
Man, that looks amazing! Scale it up and slap a car chassis on this thing. You'd revolutionize the auto industry 😎👍
Working on that!
Wow, this is a really cool robot. Love to see more video about it. Thanks RSL!
More to come!
when do the wheels become ducted fans so it can fly?
That should be the next goal! The all-in-one robot
I was amazed that the robot could push the elevator button to take the elevator at 0:14 !!! Is this an intelligent autonomous action or is it caused by human control?
All you have to do now is transform the wheels into expandable/collapsible fan wheels. 🙂
Great idea 💡
@@leggedrobotics It is, yes 🙂👍 If you need any help, let me know. 🖖
Yeah, like a boat or a jet-powered bicycle.
@@nathandavis1093 How do you mean?
It is so amazing to see ANYmal standing up! Following up on your answer a few days ago: is it possible to send torque or current commands to the wheels directly? Maybe that would give better dynamic characteristics and help ANYmal balance itself in a more robust way. Have you tried it before?
We are controlling torque through velocity commands. Why more robust? The robot never falls, even when going down the stairs.
@@leggedrobotics Velocity commands to PD controllers even for the two-wheel balancing task? Or there exists some type of feedforward torque?
FANTASTIC!
Many thanks!
cool!!!
Thanks!
This looks so good, I love it. 💕 If you attach a robotic arm on its back, the applications will most definitely increase, but I think it might affect the dynamics of the robot. If you can manage that somehow, this will be great 👍
Why add another arm if you can transform two legs into arms? ;)
@@leggedrobotics Because the wheels limit what the arms can do. Are you going to transform the wheels into fingers when it is standing on its other two wheels?
Is the robot able to climb all kinds of stairs? I saw you include an example of it climbing a small flight of stairs in quadrupedal mode. Or is it not going to be used in that kind of application?
Very impressive. Is this all done based on neural networks without any sort of PID control? Or are the transitions handled through the learned behaviour and balance still handled by PID?
All behaviors shown in this video are generated by a neural network that sends position and velocity targets to each motor.
@@leggedrobotics Those position and velocity targets are probably handled by PID control, no? So technically no; but yes, any high-level control relating to the bulk motion, such as balance, is handled by the neural network.
@@eelcohoogendoorn8044 That was my thought, a hybrid system driven by the neural network and maintained by PID. Wondering if there's also inverse kinematics going on to move to target positions or not. Exciting stuff if this could all be handled by a virtually trained NN rather than going through fiddly PID tuning.
@@andymitchell2146 The velocity/position targets for the motors are tracked by a high-frequency (400 Hz) PID controller. The actual movement is decided by a NN policy running at 8 times lower frequency. In this work there is no inverse kinematics going on; the policy directly predicts joint angles/velocities. The style extraction for the AMP motions did involve forward kinematics, though, since the discriminator observations contained the end-effector positions in the body frame.
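The two-rate loop described in this reply can be sketched as follows. This is an illustrative toy (single joint, assumed gains), not the robot's controller: the policy emits joint position/velocity targets at the low rate, and a high-rate PD loop converts them to torques in between policy steps.

```python
# Illustrative two-rate control loop: NN policy at 50 Hz, PD tracking at
# 400 Hz (8 PD steps per policy step, per the reply above). Gains and the
# single-joint setup are assumptions for the sketch.
POLICY_HZ = 50
PD_HZ = 400
DECIMATION = PD_HZ // POLICY_HZ   # PD steps per policy step -> 8

KP, KD = 80.0, 2.0                # illustrative PD gains

def pd_torque(q, dq, q_target, dq_target):
    """High-rate PD tracking of the policy's targets (runs at 400 Hz)."""
    return KP * (q_target - q) + KD * (dq_target - dq)

def control_step(policy, state):
    """One policy inference followed by DECIMATION PD steps."""
    q_target, dq_target = policy(state)       # NN inference at 50 Hz
    torques = []
    for _ in range(DECIMATION):
        torques.append(pd_torque(state["q"], state["dq"], q_target, dq_target))
        # (a real system would integrate the joint dynamics here)
    return torques

# Dummy single-joint example: target 0.3 rad from rest.
policy = lambda s: (0.3, 0.0)
taus = control_step(policy, {"q": 0.0, "dq": 0.0})
print(len(taus), taus[0])   # 8 PD steps; tau = 80 * 0.3 = 24.0
```

This separation is why no explicit inverse kinematics is needed: the policy works directly in joint space, and the PD layer only tracks its targets.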
You can find a link to the paper preprint in the description.
What are the acquisition requirements for these?
I really love your work on developing learning-based controllers for legged robots. I also wanted to ask whether there are any opportunities to work at your lab. I am currently a final-year undergrad. I aim to do a master's degree in electrical engineering (specialized in robotics and control) and then head for a PhD position at a reputed lab like yours. Please share some advice on how I could achieve my goals.
Check out the website of our lab for more information about opportunities and read our papers to see what kind of skill set you need.
@@leggedrobotics Thank you means a lot. I hope to gain expertise and research experience in this field and get a Ph.D. position at your lab. :)
Me too
This tech is terrifying
Mad cool
Thank you :)
Looks like the Wheelers from the Land of Oz
Looks like amazing work! Can you share your paper on arXiv?
The paper will be online soon.
This is amazing. My favorite part was when it took the elevator. Is this going to make it out of the lab? When? If it does, I guess it will first be used for things like industrial plant inspection, like Spot. Y'all need to make dance videos like Boston Dynamics XD
We will work on it :)
I propose implementing a wheel lock so it can overcome obstacles on foot.
You should add a set of arms and hands that can replace the wheels when it is upright.
Good idea 👍
Like like like
Like!
GoBots live!
Finally!
im confused but pretty robot goes brrrrr
Why confused?
42 mins?? what hardware are you running it on?
We used an NVIDIA GeForce RTX 3080 Ti for training
And NVIDIA Isaac Gym for simulation :)
@@leggedrobotics Nice! What are you running the network on in real time?
Is this the next Mars rover? Given its agility and abilities, it should be able to explore the red planet...
That should be our vision :)
The scary part of robots learning is that one can learn for days and all the others can get that knowledge instantly via copy-paste, like in The Matrix 😮.
Exactly!
Me trying to figure out if it's an Autobot or a Decepticon 🤔
Autobot!
Next up: Transformers.
The first of its kind !
github plz
This is no doubt amazing & impressive. But I'm concerned about hygiene when I see the robot standing up and using its wheels to push the elevator buttons.
Not saying that elevator buttons are clean as of today; it's comparable to someone using their shoes to push a button.
If that's your take away, I pity the drugs that keep your mental illnesses from allowing you to destroy yourself.
Make the wheels fold/unfold automatically by splitting the legs and arms into wheel spokes,
in a way like this th-cam.com/video/mt2FpYsQ4IE/w-d-xo.html and this th-cam.com/video/f86N8Xp7DHI/w-d-xo.html
And then?
@@leggedrobotics
Then it will look like a humanoid robot standing up and walking.
Then it unfolds the wheel spokes on its legs and drives away when it needs higher speed.
In hard terrain it just walks on all four legs.
It unfolds all the wheel spokes and goes down on all fours when it needs the highest speed and stability.
The legs and arms unfold like this when it splits its arms and legs into wheel spokes: th-cam.com/video/f86N8Xp7DHI/w-d-xo.html
This is very cool but it freaks me out
Do not worry :)
mannnnnnn
Only a competing team would downvote this vid :D
😎
You could surely mount a .50 cal machine gun on that.
Please don't!