How to train simple AIs to balance a double pendulum

แชร์
ฝัง
  • เผยแพร่เมื่อ 20 พ.ย. 2024

ความคิดเห็น • 747

  • @HolySerega
    @HolySerega 5 หลายเดือนก่อน +1690

    Imagine training for 46 years and the god says, "naaaah, you wiggle too much"

    • @maxim_ml
      @maxim_ml 5 หลายเดือนก่อน +29

      i know RIGHT

    • @Nzargnalphabet
      @Nzargnalphabet 5 หลายเดือนก่อน +57

      I feel like the evolution of quick oscillations was frankly quite cool and made sense with such a chaotic system

    • @AA-cg1wm
      @AA-cg1wm 5 หลายเดือนก่อน +34

      "but, but i did the task sir!"
      *nope echoes in distance*

    • @mr_b_hhc
      @mr_b_hhc 5 หลายเดือนก่อน

      I would argue with him that in such a case "well, you made me so are at fault". Unless of course I had be given free will?

    • @TomasV-s9f
      @TomasV-s9f 5 หลายเดือนก่อน

      😂😂😂😂😂😂

  • @msx80
    @msx80 5 หลายเดือนก่อน +821

    The idea of starting with lower gravity and higher friction is simply genius

    • @research417
      @research417 5 หลายเดือนก่อน +69

      Yeah he immediately solved a problem that even that professional research paper struggled with, and it intuitively makes perfect sense how the learning can transfer over from the simpler problem to the more complex problem. Goes to show that sometimes all you need is to just reframe the problem.

    • @CliffHanger-fg6uy
      @CliffHanger-fg6uy 5 หลายเดือนก่อน

      @@research417If you google it, you’ll find that it was a project report from an undergraduate machine learning course…
      It’s still a cool approach, though.

    • @reyariass
      @reyariass 5 หลายเดือนก่อน +23

      For real, when I saw the research paper I thought Pezzz was going to show something else because the “professionals” said it was not possible. It’s amazing how it just took a small change to make it work. Great work pezzz

    • @oko3717
      @oko3717 4 หลายเดือนก่อน +1

      ingenious

    • @joaosousapinto3614
      @joaosousapinto3614 4 หลายเดือนก่อน +9

      It’s common in the field of reinforcement learning. It’s called curriculum learning.

  • @marklondon9004
    @marklondon9004 5 หลายเดือนก่อน +1568

    I went from "Only a fool thinks he can balance a double pendulum" to "Praise the lord, he walked on water!"

    • @extremechimpout
      @extremechimpout 5 หลายเดือนก่อน +21

      irl it's impossible so you were right

    • @MrHardzio4Fun
      @MrHardzio4Fun 5 หลายเดือนก่อน +100

      ​​​​@@extremechimpoutSo your claim is that all YT videos with robots doing that are fake? Even with triple ones.
      In real life it does not wobble as much, everything has friction. It's actually easier irl.

    • @extremechimpout
      @extremechimpout 5 หลายเดือนก่อน +2

      @@MrHardzio4Fun I looked there are no such videos

    • @mapron1
      @mapron1 5 หลายเดือนก่อน +35

      @@extremechimpout I looked there are many of those video, even quadruple.

    • @MrHardzio4Fun
      @MrHardzio4Fun 5 หลายเดือนก่อน +30

      ​@@extremechimpoutLook harder.

  • @Geosquare8128
    @Geosquare8128 5 หลายเดือนก่อน +595

    the agent UI/visualization, the training graph hyper parameter view, the movitated loss function explanations.. all so well done wow

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +41

      Thank you!

    • @gus2603
      @gus2603 5 หลายเดือนก่อน +7

      ​@@PezzzasWorkcan you make a video doing those?

    • @yalnisinfo
      @yalnisinfo 5 หลายเดือนก่อน

      @@gus2603absolutely support this 🎉, i want to avoid looking at numbers as much as possible.

    • @soumyodiptanath2917
      @soumyodiptanath2917 5 หลายเดือนก่อน

      Is it made using Python?

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +13

      Everything is made using C++

  • @CriticalMonkey623
    @CriticalMonkey623 5 หลายเดือนก่อน +1105

    My toxic trait is believing that I would be extremely good at balancing a double pendulum with zero practice.

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +188

      That would be very impressive :D

    • @CriticalMonkey623
      @CriticalMonkey623 5 หลายเดือนก่อน +82

      @@PezzzasWork Your videos are always so clean and well put together. I've finished the video now and the final solution you came up with is great. The idea to incrementally ramp up the difficulty was a stroke of genius that felt so obvious after you said it. Great work showing that paper what's possible!

    • @smileyp4535
      @smileyp4535 5 หลายเดือนก่อน +7

      @@CriticalMonkey623 yeah maybe if you could somehow do that irl you'd actually be able to do a double pendulum balance

    • @anonanon6596
      @anonanon6596 5 หลายเดือนก่อน +9

      If single pendulum is like balancing a broomstick on your finger, then double is like balancing a broom stick on top of another broomstick, on top of your finger. Suddenly it does not seem so easy.

    • @adora_was_taken
      @adora_was_taken 5 หลายเดือนก่อน +1

      @@anonanon6596 nah i could do it

  • @jonathanhelgesen8800
    @jonathanhelgesen8800 5 หลายเดือนก่อน +264

    The idea of gradually increasing gravity and reducing friction is just genius! Would be interesting to see if this method also would work for a triple pendulum 🤔

    • @andrewferguson6901
      @andrewferguson6901 5 หลายเดือนก่อน +19

      boyo do i have some news for you
      "World's first video of 56 transition controls for a triple inverted pendulum : 3-body problem"

    • @ThePyrosirys
      @ThePyrosirys 5 หลายเดือนก่อน +13

      @@andrewferguson6901 That video is not using an AI controller, it's a controller calculated from methods found in control theory.

    • @Pockeywn
      @Pockeywn 5 หลายเดือนก่อน

      @@andrewferguson6901lmao i found the video and apparently i already started it at some point

    • @Gwarks337
      @Gwarks337 5 หลายเดือนก่อน

      Seems like engineers doing that for years th-cam.com/video/cyN-CRNrb3E/w-d-xo.html this one some years

    • @Conceptofcandid4043
      @Conceptofcandid4043 3 หลายเดือนก่อน +1

      Nah lets skip straight to 50 pivot points

  • @lake5044
    @lake5044 5 หลายเดือนก่อน +77

    I don't think you know this but in the real world, if you attach a double pendulum to a vibration source with the right frequency, like a motor oscillating back and forth, the double pendulum balances without any adaptive balancing mechanism. (That's why I think the high jitter solutions of the AI are actually valid in this scenario.)

    • @kuhljager2429
      @kuhljager2429 3 หลายเดือนก่อน +5

      Interesting. The things you learn

  • @o-..-
    @o-..- 5 หลายเดือนก่อน +202

    I didnt even think that it would be possible, wow, also nice editing and everything is just put in a way that makes the video really enjoyable 🎉

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +12

      Thank you very much!

    • @mage3690
      @mage3690 5 หลายเดือนก่อน

      All AI are essentially just function emulators.

    • @CliffHanger-fg6uy
      @CliffHanger-fg6uy 5 หลายเดือนก่อน

      @@PezzzasWorkDefinitely a cool video.
      If you end up doing a follow-up, here are a couple of things I would like to see (as a control theorist):
      - test with random disturbances that also take place during the swing-up control. You could also think of this as adding a little noise to the inputs of the NN. I think you would have to include this in the training as the swing-up didn’t look like it would tolerate any sort of disturbance.
      - It would be nice if you gave the audience the ratio between maximum acceleration and gravity used.
      - I definitely agree that it makes more sense to use acceleration as the control input.

    • @CliffHanger-fg6uy
      @CliffHanger-fg6uy 5 หลายเดือนก่อน

      @@PezzzasWorkalso, I think that “article” you mention on NN-based control of a double pendulum is just a project report from an undergraduate ML course.
      Finally, if you’re unable to make your NN solution robust to disturbances or noise during the swing-up, you could use the ML approach for the “feed-forward” control, basically as a planned trajectory and add conventional control theory on top to stabilize the trajectory w.r.t. noise/disturbances.

  • @banaankiamanesh
    @banaankiamanesh 2 หลายเดือนก่อน +6

    I'm a control engineer myself and I absolutely enjoyed your way of solving the problem. Well done man :)

  • @amzogm8749
    @amzogm8749 5 หลายเดือนก่อน +129

    The next step is teaching an AI to balance a million pendulums.
    Keep up the great work, it's both educational and entertaining!

    • @saferugdev8975
      @saferugdev8975 5 หลายเดือนก่อน +16

      we just need a million dyson spheres so the model training doesnt take 100 years

    • @wanfuse
      @wanfuse 5 หลายเดือนก่อน

      nah dont need a million just the magic number 7

    • @strecher777
      @strecher777 5 หลายเดือนก่อน +20

      Do you mean balancing a rope? 😂

    • @wanfuse
      @wanfuse 5 หลายเดือนก่อน

      @@strecher777 no but I will share it when the time is best!

    • @caballeronocturno124
      @caballeronocturno124 5 หลายเดือนก่อน +1

      We need to balance the n-pendulum

  • @xwxwvyz1
    @xwxwvyz1 5 หลายเดือนก่อน +247

    it was both surprising and entertaining to see the AI doing the blender move whenever the pendulum drops like a kid throwing tantrums

    • @daniel.watching
      @daniel.watching 5 หลายเดือนก่อน +6

      It's less "doing a move" and more potential energy being converted into kinetic. If the pendulum falls from its highest position it will be moving fast without a correction.
      Since it is more stable lower down, it will tend to spend more time rapidly spinning below the axis that slowly spinning above it, unless intervened.
      And finally, the AI, like any control system has a maximum frequency/speed it can manage, above which it breaks down and essentially become random input.
      So while the AI might be able to take the slow pendulum and balance it up high, if it then falls, it might not have the skill needed to recover. So every failure mode leads to a rapidly spinning "blender".

    • @xwxwvyz1
      @xwxwvyz1 5 หลายเดือนก่อน

      @@daniel.watching i know how the doube pendulum works and i also know the situation is either what you said or the AI gaining reward through some loophole in the reward system since its technically over the parameters once every loop (thought this because blenders became less of a problem with each added condition)
      But i just wanted to make a visual comparison between malfunctions of AI and toddler brains where they start giving seemingly nonsensical inputs to solve their impossible problems (which is your statement anyways)

    • @daniel.watching
      @daniel.watching 5 หลายเดือนก่อน +2

      @@xwxwvyz1 Yeah sorry I didn't mean to imply you didn't understand it. I think I was just looking for an excuse to explain why the blender thing happens.

  • @makebreakrepeat
    @makebreakrepeat 5 หลายเดือนก่อน +46

    Your videos are always a delightful blend of programming, puzzles, science and philosophy. Thank you for making YT a better place

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +7

      Thank you very much!

  • @EchoPrograms
    @EchoPrograms 5 หลายเดือนก่อน +4

    Just finished my first implementation of a standard nn with back propagation. I'm 16, so it took a bit, but I'm proud of it. Right now i have it fitting a given function, but it is pretty versatile.

  • @isaacbutler4262
    @isaacbutler4262 5 หลายเดือนก่อน +79

    Very nice!, I really enjoyed the first video, it helped me create my own NN. Keep up the good work

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +7

      Thank you!

  • @gm4984
    @gm4984 5 หลายเดือนก่อน +27

    Love this approach of setting up an easier environment, for the ai to learn from and slowly increasing the difficulty. I would also love to see how you are going to explore this aspect in the future :D

  • @richardcoppin5332
    @richardcoppin5332 5 หลายเดือนก่อน +32

    👌🏼 Magnificent.
    It was really great idea to use air friction as the simplifying factor.
    I world love to see the cost of work in the evaluation function.

  • @vibaj16
    @vibaj16 5 หลายเดือนก่อน +7

    It's amazing how you go above and beyond. After that first working solution, I would've been really proud of myself and be done with it, but you just make it better and better.

  • @kevintrigg3707
    @kevintrigg3707 5 หลายเดือนก่อน +13

    the progressive difficulty was an amazing training tool! very well put together video

  • @kellymoses8566
    @kellymoses8566 5 หลายเดือนก่อน +5

    starting easy and gradually increasing the difficulty is a very clever idea.

  • @ghomeyshi7
    @ghomeyshi7 5 หลายเดือนก่อน +7

    give this man a Nobel prize! respect!

  • @alxklgn364
    @alxklgn364 5 หลายเดือนก่อน +10

    The editing and the feedback UI are brilliant. Cheers, Pez.

  • @lyuboslavilov
    @lyuboslavilov 5 หลายเดือนก่อน +2

    Starting from easy conditions and gradually making them complex is simply a stroke of genius! I bet this will be a thing in the near future. You should write a paper

  • @naijii
    @naijii 5 หลายเดือนก่อน +3

    The UI is incredible and everything is very well presented!

  • @eelcohoogendoorn8044
    @eelcohoogendoorn8044 5 หลายเดือนก่อน +6

    Really nice! I liked the velocity-commanded version better than the acceleration commanded one. In many practical cases that is also found to matter; 'in theory' equivalent neural networks should exist but controlling acceleration does bias towards smooth accelerations indeed. In practice a motor controller often has an internal PID control loop; so commanding a velocity (and having that quickly realized, up to physical constraints on acceleration; something like min(max_motor_accel, velocity_error/dt)) isnt necessarily an unnatural choice. It still allows for spiky torques to be learned where they are required in a natural manner.

  • @janmuntsiglesias577
    @janmuntsiglesias577 5 หลายเดือนก่อน +43

    keep up making this content!

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +17

      I will try :)

  • @triplezgames3882
    @triplezgames3882 5 หลายเดือนก่อน +9

    It's so interesting how friction and low gravity helped the AI learn. It's like humans practicing something complex in simpler variant, because trying to do something like balancing a double pendulum would completely overwhelm us too having no idea about how you would even start counteracting the imbalances

    • @sebastiangudino9377
      @sebastiangudino9377 3 หลายเดือนก่อน +2

      Yup! AI shows us mathematically something very true about learning. You should always try hard problems (An always positive score function leads to no improvement and over-fitting), but if the problem is too hard, you won't learn anything (An always off score function will also lead to now improvements)
      This is the principle behind human learning techniques like language immersion. Once you understand about 50% of a language the best way to learn is to just jump in, that's the point where your score function leads to the fastest learning

    • @triplezgames3882
      @triplezgames3882 3 หลายเดือนก่อน

      @@sebastiangudino9377 Damn, really well said.

  • @TeamDman
    @TeamDman 5 หลายเดือนก่อน +3

    I'm in love with the interface you designed!

  • @8ightfold
    @8ightfold 3 หลายเดือนก่อน +1

    My dad did a single pendulum in the 90s with fuzzy logic... it's cool to see how far things have come

  • @davidamatolondriz8512
    @davidamatolondriz8512 4 หลายเดือนก่อน +3

    Congrats on the video man! Literally the best 20min in TH-cam since many time ago!

  • @AA-cg1wm
    @AA-cg1wm 5 หลายเดือนก่อน +1

    I'm currently making my own neural network and your videos really helped me understand the concepts and mechanisms of this project

  • @leggyjorington3960
    @leggyjorington3960 5 หลายเดือนก่อน +2

    The UI and graphs for this video look amazing! Keep up the good work!

  • @AE_Sub
    @AE_Sub 5 หลายเดือนก่อน +3

    This is the kind of top tier content youtube was made for!

  • @mzg147
    @mzg147 5 หลายเดือนก่อน +2

    love the visuals, the clean colored borders... perfect!

  • @lanha3385
    @lanha3385 5 หลายเดือนก่อน +1

    You made it absolutely wonderous and intrigued interest to the notion of chaos. Well done

  • @SiddhantSharma181
    @SiddhantSharma181 4 หลายเดือนก่อน

    I like your idea of starting with a simpler problem and gradually increasing the difficulty

  • @raghavrokr
    @raghavrokr 2 หลายเดือนก่อน

    It is absolutely genius to play with the physics to find a solution. Also, amazing visualizations! Such a treat to watch this video.

  • @99totof99
    @99totof99 5 หลายเดือนก่อน +1

    Wonderful, you have mastered the art of getting what you want from a neural network!

  • @martinfisker7438
    @martinfisker7438 5 หลายเดือนก่อน +3

    That pendulum rainbow animation is straight art

  • @M_1024
    @M_1024 5 หลายเดือนก่อน +1

    Increasing the difficulty is a cery good approach, if I ever make a neural network I will probably try it.

  • @phoenixxofficial
    @phoenixxofficial 5 หลายเดือนก่อน +1

    Your work never ceases to amaze me

  • @saurabhtalele
    @saurabhtalele 5 หลายเดือนก่อน +1

    If someone could build a physical system using this, it would be magic

  • @ivocanevo
    @ivocanevo 5 หลายเดือนก่อน +1

    Great video. I love watching the refinements. Human and AI learning together.

  • @liamseanalbarel-hepburn6436
    @liamseanalbarel-hepburn6436 5 หลายเดือนก่อน

    I haven't worked with neural networks before, but you definitely inspired me to start looking into it and giving it a shot. Well done with your solution, and good work for not giving up!

  • @Paplu-i5t
    @Paplu-i5t 4 หลายเดือนก่อน

    Genius. A perfect use case to illustrate what is the special ability of a neural network.

  • @stoobidthing
    @stoobidthing 5 หลายเดือนก่อน +2

    Amazing video as always, I love the visuals on this one! Greetings from Paraguay 🇵🇾

  • @rewolfer
    @rewolfer 5 หลายเดือนก่อน

    The idea to increase difficulty was so good. Loved watching it go through periods of great success and fast difficulty increasing.

  • @kaylor87
    @kaylor87 4 หลายเดือนก่อน

    I've been obsessed with double pendulums for a long time now, I absolutely loved this video. I honestly started the video thinking it would be impossible to balance.

  • @XiaoMingXing
    @XiaoMingXing 2 หลายเดือนก่อน +1

    Anyone else feel like the little animation at 3:15 would make an incredible screensaver?

  • @brambleumink6928
    @brambleumink6928 4 หลายเดือนก่อน

    This was a really awesome video, I enjoyed watching it very much. Thank you for creating this, the visuals were really nice to watch and the explanations were easy to follow. The idea of starting with low gravity and high friction was amazing, great job!!!!

  • @ivovelo
    @ivovelo 2 หลายเดือนก่อน

    This was fantastic, thank you!
    One more thing that would be amazing to teach your network: to bring a double-pendulum from any state into a balanced state. I'd love to see that.

  • @furbyfubar
    @furbyfubar 4 หลายเดือนก่อน +1

    I feel like you now should write the scientific paper on this. Going from "This problem is yet to be solved" to "I solved it with about 2 hours of training for my model" is huge progress!

  • @K0rck
    @K0rck 5 หลายเดือนก่อน

    Amazing work. It was _really_ engenius the way you break up the problem for the evolution algorithm! And the interface you put together to demonstrate the work just adds the cherry on top of it. Keep up the great work, I'm looking forward for the next one!

  • @wanfuse
    @wanfuse 5 หลายเดือนก่อน +1

    fantastic! this has so many uses! hope you publish code soon! I have a few ideas what to use it for! I worked for a brief time on this exact problem!

  • @DjDoGGoD
    @DjDoGGoD 5 หลายเดือนก่อน

    Really amazing implementation! The first moment I saw the jerk movements I immediately blamed floats. I was very satisfied when you confirmed it, and decided to switch to doubles, as I was screaming at my monitor. :D

  • @purrzival
    @purrzival 5 หลายเดือนก่อน +2

    This video is so beautiful, great work!

  • @raiden631
    @raiden631 3 หลายเดือนก่อน

    Real nice video! Loved it :)
    Love the aesthetics for explaining. This makes it very easy to follow, with nice colors and distinct UI elements. Keep it up !

  • @mathewgriffiths1870
    @mathewgriffiths1870 5 หลายเดือนก่อน +1

    This is incredible, such amazing work

  • @amzogm8749
    @amzogm8749 5 หลายเดือนก่อน

    This video is simply fascinating! Seeing how an AI can learn to balance a double pendulum is truly impressive. The explanations are clear, and the visuals are very well done, making the subject both understandable and captivating. I especially enjoyed the demonstration with different weights, and it made me want to try creating my own AI to tackle this challenge. Keep producing such inspiring and educational content, it's a real pleasure to watch!

  • @Pockeywn
    @Pockeywn 5 หลายเดือนก่อน

    its interesting that in the final result you can so clearly see oscillation at two different frequencies in the output velocity thats cool

  • @renakunisaki
    @renakunisaki 3 หลายเดือนก่อน

    The last one didn't even seem like an exploit. It wasn't smooth or perfectly straight, but it kept the pendulum upright enough by compensating for its motion.
    Also you made a groovy screen saver!

  • @sanderbos4243
    @sanderbos4243 5 หลายเดือนก่อน

    The exploit solution at the end is amazing

  • @chrizzzly_hh
    @chrizzzly_hh 5 หลายเดือนก่อน

    Really great video and visualisation. Please add some tests next for pendulum setups that are already in motion to see if it can also stabilize these and not only from resting position. This will be fun!

  • @sbstndbs
    @sbstndbs 5 หลายเดือนก่อน

    That's very great ! Even if the numerical scheme can hide some dissipation, it is the same for a near-perfect physical system.

  • @BCannTV
    @BCannTV 5 หลายเดือนก่อน +2

    Buddy graduated with a phd at 17. Subbed earned.

  • @srijanraghavula
    @srijanraghavula 4 หลายเดือนก่อน

    Loved the video, the visualizations, the breakdown and the thought of using not python for everything is whats impressive, immediate sub, hope ill learn how to do this stuff

  • @olliecook1982
    @olliecook1982 5 หลายเดือนก่อน

    Awesome video! Really shows the complexity in tasks like this, and brings into light the massive challenge of doing this in real life with double and tripple pendulums. I wonder how they managed to get such good control!
    I also find it very interesting how it hits a wall every once in a while than makes 1 small change and just doubles it performance. Thats really interesting!

  • @Alnakera
    @Alnakera 4 หลายเดือนก่อน +1

    We need a 10 hour version of the double pendulum trajectory.

  • @pcgamerforthewin
    @pcgamerforthewin 5 หลายเดือนก่อน +2

    Beautifully mesmerizing! Thank you for this amazing piece of art. Every aspect of this video is so well done. Kept me engrossed until the very end.
    It was very satisfying to see how you solved each problem with seemingly simple solutions. The progressive training scratched the itch I've always had every time I'd see a machine learning project.
    The UI was also fantastic, amazing work!
    One question if you don't mind. What was the song at 3:51? Thank you!

    • @PezzzasWork
      @PezzzasWork  5 หลายเดือนก่อน +1

      Thank you very much for your support!
      The music used is Empyrean by Dear Gravity :)

    • @pcgamerforthewin
      @pcgamerforthewin 5 หลายเดือนก่อน

      ​@@PezzzasWork Thank you!

  • @NaviaryMusic
    @NaviaryMusic 5 หลายเดือนก่อน

    To give it a greater intuition for how to recover the pendulums from any chaotic state, you could start each simulation with some applied random motion, instead of hanging dead center.

  • @VivienLEGER
    @VivienLEGER 5 หลายเดือนก่อน

    as AI most generaly does not impress me, your work and dedication does. also i love the attention to details like graphics, that s truly amazing!

  • @BendoubaAbdessalem
    @BendoubaAbdessalem 5 หลายเดือนก่อน +2

    you should've add to the score function how are the three points are aligned to avoid exploids such ase the last solution you showed us, also i think that we would like to see the result of the training method of controling the acceleration using the last score function when you was still giving the ai control over speed of the cart not the acceleration, and in the end it was a good video, and it was pretty informative and entertaining in the same time!

  • @bmdsch1320
    @bmdsch1320 5 หลายเดือนก่อน

    This is amazing! I have been waiting for this second part and its finally here! Its amazing that you can produce a video with such a good quality and informative in such short time!

  • @ChristofFritz
    @ChristofFritz 5 หลายเดือนก่อน

    Ok, that's hella impressive. I fondly remember visiting "Hannover Messe" with my parents. A convention with a lot of technical stuff. There was a booth where they had a pendulum with a flywheel on the end that automatically uprighted itself and balanced after that. It's stuff like that and what you do here that inspires young people to make stuff like that. Thank you! (And now build a machine IRL that replicates the double pendulum simulation and is controlled by the neural net you trained in the simulation :D)

  • @Cenodis
    @Cenodis หลายเดือนก่อน +1

    Regarding the "none of the solutions can recover after collapsing". I imagine this could be because the AI never encountered such a scenario during training. Now I have never trained my own AI but I am reminded of a video about the training of a Trackmania AI where something similar happened.
    The problem was that the AI got really good at solving the one specific portion of track it was trained on but failed to generalize for the entire track. The solution in that case was to always spawn the AI on a random portion of track. That way any specific solution would fail on the next run and be eliminated which helps avoid overfitting.
    I think the same might happen here where the AI essentially only ever encounters two states: Pendulum hanging straight down with no velocity and pendulum near equilibrium. Here too it might be possible to "force" the AI to generalize by setting the initial angles and velocities to random values. That way the AI has to learn to recover from any possible state instead of just optimizing for the "straight down" case that can't really happen naturally.

  • @poketopa1234
    @poketopa1234 4 หลายเดือนก่อน

    Seriously such a great video. I love the curriculum learning approach!

  • @BORCHLEO
    @BORCHLEO 5 หลายเดือนก่อน

    this is such an amazing visualization and intersection between so many beautiful forms of mathematics. thank you for this @Pezzza's Work

  • @SuperElephant
    @SuperElephant 5 หลายเดือนก่อน +1

    Amazing content would be an extreme understatement.

  • @evandude5000
    @evandude5000 3 หลายเดือนก่อน +1

    It would be really interesting to see how the algorithm solves the problem if its acceleration was severely limited, so that it had to reach equilibrium without small adjustments. I believe that it would find a solution to swing the double pendulum up straight into equilibrium, but I'm not sure.

  • @aw_dev
    @aw_dev 5 หลายเดือนก่อน

    Watched the first video, was amazing. Watching this. Hopefully it's even better

  • @shadowcraftersr
    @shadowcraftersr 5 หลายเดือนก่อน

    Awesome! I've been looking forward to this video since I saw the first one. Did not disappoint

  • @TheMiczu
    @TheMiczu 5 หลายเดือนก่อน

    Amazing video, gives great insight how to make AI tackle seemingly impossible task.

  • @geromiuiboxz765
    @geromiuiboxz765 5 หลายเดือนก่อน

    🇨🇱
    Well, everybody already praised your genius.
    It is so satisfying to encounter people so smart as you, who sort of neutralize the not so smart, like me 😀.
    Not fully understand the process, it is double satisfying, to watch you slowly approaching a solution to a seemingly impossible problem.
    Congratulations and THANKS you ‼️
    Saludos de k🇨🇱

  • @andrewferguson6901
    @andrewferguson6901 5 หลายเดือนก่อน +1

    I find it interesting that the overlay of hundreds of sets of pendulums feels like some kind of self intersecting 3(+?) dimensional surface. It's interesting that the brain tries to make sense of this highly chaotic pattern by perceiving a higher visual-spatial dimension

  • @cedric1731
    @cedric1731 5 หลายเดือนก่อน +1

    I want your pendulum visualization as a screensaver 😂

  • @kuhljager2429
    @kuhljager2429 3 หลายเดือนก่อน

    This reminded me of backing up a double trailer, except you dont get to pause and think about how to move next

  • @MarkHivosk
    @MarkHivosk 3 หลายเดือนก่อน

    Amazing you were able to figure it out! I think there is even another solution but it seems a lot less likely to stumble on

  • @Radu
    @Radu 5 หลายเดือนก่อน

    Amazing work :-) I will send this to my students in a couple of years when I'll teach intelligent automation.

  • @axeldaval3410
    @axeldaval3410 5 หลายเดือนก่อน +1

    mad respect, whole scientist team can get your interview to update the study 😂

  • @adem0s687
    @adem0s687 5 หลายเดือนก่อน

    I just LOVE your interface

  • @Soulergonote
    @Soulergonote 5 หลายเดือนก่อน

    Honestly I thought it would be impossible, you're a real beast !

  • @fwenny
    @fwenny 5 หลายเดือนก่อน +1

    loving the visuals, great video!!

  • @Maxime2bleau
    @Maxime2bleau 5 หลายเดือนก่อน +1

    I love the interface, great video!!

  • @maxim_ml
    @maxim_ml 5 หลายเดือนก่อน

    Oooh, you're real good
    I would've thought it was good enough as soon as it was able to balance at all

  • @Alayric
    @Alayric 5 หลายเดือนก่อน +2

    Love your ideas and visuals! 🥰

  • @bdcoller
    @bdcoller 4 หลายเดือนก่อน

    Technically fascinating, and aesthetically beautiful. Thanks.

    • @PezzzasWork
      @PezzzasWork  4 หลายเดือนก่อน

      Thank you!

  • @Lampe2020
    @Lampe2020 5 หลายเดือนก่อน +3

    The AI basically first learnt to balance a single pendulum in this case and then learnt to make the inputs into it through a second pendulum. At least its response to disturbances looks like that's the way the AI "thinks".

  • @iainmackenzieUK
    @iainmackenzieUK 4 หลายเดือนก่อน

    I laughed at the end when you said "rudimentary" algorithm :)
    Anyhow - Loved this journey with you. Really fascinating. Thanks for sharing (Seems like you have over-come Loki...)

  • @nyhrocx
    @nyhrocx 5 หลายเดือนก่อน

    Man I love this content, I am young and want to know more about this, thank you!

  • @brandoncanfield1725
    @brandoncanfield1725 5 หลายเดือนก่อน +1

    Beautiful visualizations!

  • @NaviaryMusic
    @NaviaryMusic 5 หลายเดือนก่อน

    Or perhaps, the next step would be to balance the double pendulum in 3D space? Each joint can rotate freely in any direction. That, or balancing a triple pendulum in 2D, I couldn't miss!! Anyway this was extremely enjoyable and a wonderful achievement! My favorite video of yours!