How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification

แชร์
ฝัง
  • เผยแพร่เมื่อ 24 มิ.ย. 2024
  • [2nd upload] AI systems can be trained using demonstrations from experts, but how do you train them to out-perform those experts? Can this still be done even without clear win/loss criteria? And how do you do it safely?
    This video was based on work including:
    "Supervising strong learners by amplifying weak experts" by Paul Christiano, Buck Shlegeris, Dario Amodei (arxiv.org/abs/1810.08575)
    openai.com/blog/amplifying-ai...
    www.alignmentforum.org/s/EmDu...
    ai-alignment.com/iterated-dis...
    With thanks to my wonderful Patrons: ( / robertskmiles )
    Steef
    Jason Strack
    Jordan Medina
    Jason Hise
    Scott Worley
    JJ Hepboin
    Pedro A Ortega
    Said Polat
    Chris Canal
    Nicholas Kees Dupuis
    James
    Richárd Nagyfi
    Phil Moyer
    Alec Johnson
    Clemens Arbesser
    Bryce Daifuku
    Simon Strandgaard
    Jonatan R
    Michael Greve
    The Guru Of Vision
    Volodymyr
    David Tjäder
    Julius Brash
    Tom O'Connor
    Erik de Bruijn
    Robin Green
    Laura Olds
    Jon Halliday
    Paul Hobbs
    Jeroen De Dauw
    Tim Neilson
    Eric Scammell
    Igor Keller
    Ben Glanton
    Robert Sokolowski
    anul kumar sinha
    Jérôme Frossard
    Sean Gibat
    Sun Sun
    andrew Russell
    Cooper Lawton
    Gladamas
    Sylvain Chevalier
    DGJono
    robertvanduursen
    Dmitri Afanasjev
    Brian Sandberg
    Einar Ueland
    Marcel Ward
    Andrew Weir
    Taylor Smith
    Ben Archer
    Scott McCarthy
    Kabs Kabs Kabs
    Tendayi Mawushe
    Jannik Olbrich
    Anne Kohlbrenner
    Bjorn Nyblad
    Jussi Männistö
    Mr Fantastic
    Wr4thon
    Archy de Berker
    Marc Pauly
    Joshua Pratt
    Shevis Johnson
    Andy Kobre
    Brian Gillespie
    Martin Wind
    Peggy Youell
    Poker Chen
    Kees
    Darko Sperac
    Truls
    Paul Moffat
    Jelle Langen
    Anders Öhrt
    Marco Tiraboschi
    Michael Kuhinica
    Fraser Cain
    Robin Scharf
    Oren Milman
    John Rees
    Shawn Hartsock
    Seth Brothwell
    Brian Goodrich
    Clark Mitchell
    Kasper Schnack
    Michael Hunter
    Klemen Slavic
    Patrick Henderson
    Long Nguyen
    Oct todo22
    Melisa Kostrzewski
    Hendrik
    Daniel Munter
    Graham Henry
    Duncan Orr
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 438

  • @qwertymann1
    @qwertymann1 5 ปีที่แล้ว +649

    Without knowing the amount of time spent on the animations, I'd say it was totally worth it!

    • @luksablp
      @luksablp 5 ปีที่แล้ว +23

      I think it really helped understanding the concepts

    • @thefakepie1126
      @thefakepie1126 3 ปีที่แล้ว +4

      what if it was 29 years and 3 months ?

    • @climagabriel131
      @climagabriel131 3 ปีที่แล้ว

      @@thefakepie1126 lol, this a reference to his age?))

    • @thefakepie1126
      @thefakepie1126 3 ปีที่แล้ว +1

      @@climagabriel131 nah it's just a random number , it's just a just cuz the guy said "Without knowing the amount of time spent on the animations" so it could be anything even 29 years , and would it have been worth it then ? it's a stupid joke

    • @climagabriel131
      @climagabriel131 3 ปีที่แล้ว

      @@thefakepie1126 oh, alright)

  • @travcollier
    @travcollier 5 ปีที่แล้ว +348

    "If you are, for example, an AGI..."
    Nice job future proofing the video ;)
    Seriously though, in retrospect, iterated distillation and amplification is obvious to the point of seeming trivial... which means you did an excellent job explaining it.

    • @monad_tcp
      @monad_tcp 4 ปีที่แล้ว +27

      I'm an AGI, it helped me.

    • @travcollier
      @travcollier 4 ปีที่แล้ว +17

      @@monad_tcp I welcome our new robot overloads.

  • @mattstuart-white450
    @mattstuart-white450 5 ปีที่แล้ว +384

    "How to keep learning when you're better than any teacher" - Rob, you have really let the positive youtube comments go to your head... 🤔

    • @Gooberpatrol66
      @Gooberpatrol66 5 ปีที่แล้ว +66

      Miles really wants to contain AI superintelligence because he doesn't want competition.

    • @JohnJones1987
      @JohnJones1987 5 ปีที่แล้ว +6

      Eventually we all end up roughly the same - except like Alpha Zero i started from nothing, so by a small margin I surpassed the limits of my competition.

    • @nephildevil
      @nephildevil 4 ปีที่แล้ว +3

      🤣🤣

  • @shamsartem
    @shamsartem 5 ปีที่แล้ว +195

    You distilled a hell of a lot of information in this 10 minute video. Spending so much time on the animations really was worth it I think

  • @MrBleulauneable
    @MrBleulauneable 5 ปีที่แล้ว +279

    Alright I'll watch it twice then ! (The animations are neat btw !)

    • @qzbnyv
      @qzbnyv 5 ปีที่แล้ว +14

      Makes sense after seeing the Grant Sanderson credit for the animation code :) 3b;1b

    • @alekseysoldatenkov5675
      @alekseysoldatenkov5675 5 ปีที่แล้ว +2

      NWN Oh shit! Keep the dope collabs going.

    • @rogerab1792
      @rogerab1792 5 ปีที่แล้ว +1

      This is the third time for me, or maybe the fourth 🤷I just remember the first and the second time. I created a two year dejavu to prove this reality is a simulation. If someone is interested about my theory reply to this message, I am too tired to explain now, I had to escape from the police last night and do all sorts of crazy things to repeat what I did two years ago. If someone else has experienced the dejavu they know for sure I am not joking. If you haven't experienced the same things twice, I can still convince you I am telling the truth because I've left material evidence about it. Reply to this message and I'll explain with more detail...

    • @YourMJK
      @YourMJK 5 ปีที่แล้ว +1

      Yeah, you do notice it uses 3b1b's "Manim" Framework

    • @MrBleulauneable
      @MrBleulauneable 5 ปีที่แล้ว +2

      @@rogerab1792 Chill my dude, the video was simply reposted because of a minor editing error. You may want to see a psychiatrist tho, you don't seem to be doing too good right now (if you have something like schyzophrenia or any paranoia inducing psychologic condition then you probably need medication).

  • @joshuacoppersmith
    @joshuacoppersmith 5 ปีที่แล้ว +18

    Animations at that level would cost a lot of time, but what you chose to create really "burned" the concepts into my visual memory, so thank you for the effort.

  • @KivySchool
    @KivySchool 5 ปีที่แล้ว +122

    Excellent! High quality animations with high quality teacher. I'm so grateful for all the good content you have been posting here.

  • @ministerc9513
    @ministerc9513 5 ปีที่แล้ว +5

    Roberts ability to clearly explain complicated things is itself an art form.

  • @DeliciousNubbs
    @DeliciousNubbs 5 ปีที่แล้ว +85

    Holy hell, this was awesome and very clear!

  • @mattf2219
    @mattf2219 5 ปีที่แล้ว +21

    I love that this video got over one thousand likes before it got even one dislike, I cant help but admire the community fostered by this channel :)

    • @RyanTosh
      @RyanTosh 4 ปีที่แล้ว +4

      The only dislikes are from AGIs who know we're onto them...

  • @ze4017
    @ze4017 5 ปีที่แล้ว +8

    I'm at 5:51 rn so I haven't finished yet but OMLORDY this thing about having a quick solution vs a slow algorithm is actually how the human brain works. I'm studying cognitive neuroscience and software in Uni right now and that is so cool to see how the two overlap so naturally. Love it

    • @Jmoneysmoothboy
      @Jmoneysmoothboy 2 ปีที่แล้ว

      It's not how my brain works because I'm retarded. Bet they didn't tell you that in your fancy brain class mr fancy man

  • @REOsama
    @REOsama ปีที่แล้ว +1

    This is pure gold, not only is it informative, but is explained in an excellent way

  • @NickCybert
    @NickCybert 5 ปีที่แล้ว +4

    The animations actually really helped make your explanation clear.

  • @spirit123459
    @spirit123459 5 ปีที่แล้ว +29

    Great animations and explanation!

  • @friiq0
    @friiq0 5 ปีที่แล้ว +8

    Huge step up in quality from an already phenomenal channel. By all means, take your time. The payoff is clear. Looking forward to more, Cheers!

  • @polares8187
    @polares8187 5 ปีที่แล้ว +7

    This was superb. Fantastic animations. Clear explanations. Awesome all around.

  • @pafnutiytheartist
    @pafnutiytheartist 5 ปีที่แล้ว +48

    10:32 Have you tried using distillation on your animation procedure? I've heard it can approximate a long process into a fast and efficient one. Loved the video by the way, looking forward to the next part.

    • @matthewhubka6350
      @matthewhubka6350 2 ปีที่แล้ว +3

      Distillation requires a lot of resources to get the good results. For 1 vid he’s better off just amplifying

  • @moneypowertron
    @moneypowertron 5 ปีที่แล้ว +7

    Fantastically intuitive explanation, Robert. The animations were a crucial tool. Thank you for the efforts!

  • @chriscanal999
    @chriscanal999 5 ปีที่แล้ว +4

    Great video! I’m consistently impressed with how wonderfully distilled the information on your channel is. Thanks for all the hard work and interpretability :)

  • @kanva4
    @kanva4 4 ปีที่แล้ว +3

    This is underrated

  •  5 ปีที่แล้ว +3

    The quality of your videos have really improved. This was very well animated and explained. Thank you, please keep them coming.

  • @snfn7847
    @snfn7847 5 ปีที่แล้ว +8

    Good to see you're still alive

  • @Cabothedog14
    @Cabothedog14 5 ปีที่แล้ว +1

    I've been waiting for a new video!! Glad to see you're uploading again :)

  • @NeonStorm5
    @NeonStorm5 5 ปีที่แล้ว +2

    Probably the most intuitively informative video I've ever seen.

  • @Raymaniak
    @Raymaniak 5 ปีที่แล้ว +1

    Your videos are approachable and fascinating. Keep up the good work, Rob! You're awesome.

  • @mare4602
    @mare4602 5 ปีที่แล้ว +1

    im so happy you are back, high quality content as always.

  • @nagoshi01
    @nagoshi01 5 ปีที่แล้ว +3

    Wow this was amazing. I loved the animations. The explanations were so clear

  • @ADAMBLVCK
    @ADAMBLVCK 5 ปีที่แล้ว

    This channel is gold, and so is the work you're putting in! Simply great!

  • @HereWasDede
    @HereWasDede 5 ปีที่แล้ว +2

    Those animations were AWESOME!! Thanks

  • @Gloubichou
    @Gloubichou 5 ปีที่แล้ว +1

    Such a quality video! You must have put so much time into this! Thanks a lot Robert, you're the hero of all ML/AI enthuiasts :D

  • @kennynicoll6277
    @kennynicoll6277 5 ปีที่แล้ว +25

    This nicely mirrors Kahneman's description of system 1 and 2 in human decision making.

    • @danielcallegaribr
      @danielcallegaribr 5 ปีที่แล้ว +4

      Kenny Nicoll hey, this is a great insight!

  • @brunosonza787
    @brunosonza787 5 ปีที่แล้ว +3

    Really excellent video, Robert!
    I love your videos on computerphile and this one seems to be an even better version that those there, with a clear explanation and neat graphics.
    Keep it up and Thank you very much!

  • @solemnwaltz
    @solemnwaltz 5 ปีที่แล้ว +1

    The animations are great! I took mental notes specifically on how satisfying and descriptive they are.
    Well worth the time, in my opinion. c:

  • @jessty5179
    @jessty5179 5 ปีที่แล้ว +2

    Thank you for sharing Rob !

  • @lobrundell4264
    @lobrundell4264 5 ปีที่แล้ว +1

    Ugh so worth the wait!

  • @thrallion
    @thrallion 5 ปีที่แล้ว +5

    legit my favourite channel on youtube by far

    • @SJNaka101
      @SJNaka101 5 ปีที่แล้ว

      Hmmm I dunno if I can top this channel for you, but looking at your subs I would take a few wild shots in the dark... check out Chessnetwork, Summoning Salt, Numberphile and Computerphile, and What I Learned. I suspect you will greatly enjoy at least a couple of those

    • @thrallion
      @thrallion 5 ปีที่แล้ว

      @@SJNaka101 hey thanks, good guesses as i already watch all those except what I learned :) will look into it

  • @Anymodal
    @Anymodal 5 ปีที่แล้ว +3

    Dear Rob. Ive learned so much from your videos. Top quality education

  • @vshalts
    @vshalts 5 ปีที่แล้ว +1

    Amazing animation and the easiest intuitive explanation of the ideas from Reinforcement learning I have seen so far with a surprising connection with AI safety. It was cool! Thanks!

  • @Horny_Fruit_Flies
    @Horny_Fruit_Flies 4 ปีที่แล้ว +1

    You have a gift of making the most foreign concepts easily understandable for the layman, such I myself.

  • @8989youu
    @8989youu 5 ปีที่แล้ว +1

    Wow, very clear and to the point. I love it. Definetly worth sharing 😁

  • @rogerab1792
    @rogerab1792 5 ปีที่แล้ว +2

    Really well explained, thanks!

  • @JohnnyDoeDoeDoe
    @JohnnyDoeDoeDoe 5 ปีที่แล้ว +1

    Your absolute best video yet!

  • @briansmithbeta
    @briansmithbeta 5 ปีที่แล้ว +1

    The animations really helped me understand some things that had been confusing for me! Thanks!

  • @stasisthebest
    @stasisthebest 4 ปีที่แล้ว

    Thank you. My deepest respect for visually sharring all of your knowledge. I am certain many people have become at least a slightly better of themselves because of you.

  • @reidwallace4258
    @reidwallace4258 4 ปีที่แล้ว +1

    This is giving me flash backs to the dune novels. Paul was just doing treesearch all along.

    • @lewisleslie2821
      @lewisleslie2821 4 ปีที่แล้ว +1

      Reid Wallace i read dune for the first time last month, that’s a great comparison

  • @amargasaurus5337
    @amargasaurus5337 4 ปีที่แล้ว

    Those animations are great!
    Be proud ♥

  • @GglSux
    @GglSux 5 ปีที่แล้ว +3

    And I really want to thank You for continuing to produce and share Your fantastic content!!!
    Unfotunately I'm not able to support You (or any other of the many fantastic crestors) so all I can do is to watch everything and express my great gratitude.
    So a again, a thousand thanks !!!
    Best regards.

  • @Koffeinsuechtigi
    @Koffeinsuechtigi 5 ปีที่แล้ว +1

    Thank you for your well crafted explanation!

  • @reverse_engineered
    @reverse_engineered 4 ปีที่แล้ว +1

    Great job on this video! Your explanations were quite easy to understand and I think the animations helped to explain it. I tend to find diagrams and animations easier to understand than listening to spoken words, so I appreciate the effort you put into those animations.

  • @hacker6284
    @hacker6284 5 ปีที่แล้ว

    Those animations were totally worth it! Really well done video

  • @Sharklops
    @Sharklops 5 ปีที่แล้ว +10

    This was fantastic! Very well done. Cheers!

  • @CyberAnalyzer
    @CyberAnalyzer 5 ปีที่แล้ว

    Wow, fantastic animations! The content is so deep! I love it!

  • @Viniter
    @Viniter 5 ปีที่แล้ว +2

    Those animations are really cool!

  • @kensmith5694
    @kensmith5694 4 ปีที่แล้ว +1

    I did a thing a little like this for a chess program but my main part was not the "best move finder". The main thing was the "dumb move remover". This was based on recording the game as the program played out a whole game against its self. When the one side lost, there would be a search back through the moves to find the greatest change in board "position". The move just before that was taken to be a bad move and was added to the list of dumb moves. Removing dumb moves quickly saves a lot of processing time. The board position evaluation was not as cheap as it would first appear because unlike is normal today that part was extremely non-linear.

  • @keithklassen5320
    @keithklassen5320 5 ปีที่แล้ว

    I liked the animations. I probably didn't consciously learn anything from them, but they held my itty-bitty internet-addled attention, thus keeping my eyes on the screen, so they were a part of the learning.

  • @jeanmichelsarr6040
    @jeanmichelsarr6040 5 ปีที่แล้ว

    Great idea, concise, precise.

  • @lacielaplante5702
    @lacielaplante5702 5 ปีที่แล้ว

    Your explanation is absolutely outstanding.

  • @DamianReloaded
    @DamianReloaded 5 ปีที่แล้ว +11

    Worth watching a few times! ^_^

  • @dylancope
    @dylancope 5 ปีที่แล้ว

    The animations were great! Very intuitive video :)

  • @jonathanquarles3708
    @jonathanquarles3708 5 ปีที่แล้ว

    You explained this so clearly, thank you!

  • @ardweaden
    @ardweaden 5 ปีที่แล้ว

    Absolutely brilliant explanation!

  • @Gorabora
    @Gorabora 5 ปีที่แล้ว

    Awesome video and very easy to understand, keep up the good work !

  • @willd4686
    @willd4686 3 ปีที่แล้ว

    Animations were very helpful. I'm not sure how much work they were but I'm grateful that you did them.

  • @serenityindeed
    @serenityindeed 5 ปีที่แล้ว

    Your animations were really good! Enjoyed the explanation as well.

  • @gloverelaxis
    @gloverelaxis 5 ปีที่แล้ว +1

    Animations were worth it. They help immensely

  • @namelastname8569
    @namelastname8569 5 ปีที่แล้ว

    good stuff as always man

  • @briancox3922
    @briancox3922 4 ปีที่แล้ว

    Wow, you really are good at explaining these subjects.
    Thank you.

  • @5ty717
    @5ty717 ปีที่แล้ว

    Brilliantly explained

  • @MrDaanjanssen
    @MrDaanjanssen 5 ปีที่แล้ว

    Highly interesting as always, thanks!

  • @nilp0inter2
    @nilp0inter2 5 ปีที่แล้ว

    Great work!

  • @szymonbaranowski8184
    @szymonbaranowski8184 ปีที่แล้ว

    this explains not only how to become better it also informs you why majority will never become good because of not using or coming up with such tools...

  • @SapphFire
    @SapphFire 5 ปีที่แล้ว

    Really interesting!
    The animations are great.

  • @Hexanitrobenzene
    @Hexanitrobenzene 5 ปีที่แล้ว

    Yay !
    We missed you, Rob :)

  • @Ruptured_AU
    @Ruptured_AU ปีที่แล้ว +1

    Animations arw SO worth it thanks a lot.

  • @greatbullet7372
    @greatbullet7372 5 ปีที่แล้ว

    Best TH-cam Video of the Month

  • @SHAD0W99V0RTEX
    @SHAD0W99V0RTEX 5 ปีที่แล้ว

    To be honest, I expected a self-help video about autodidacts but I was pleasantly surprised anyways. Good stuff! This is very ingenious.

  • @dylancope
    @dylancope 5 ปีที่แล้ว +1

    How did I miss this?! I can't believe I hadn't "hit the bell" on this channel yet.

  • @aronchai
    @aronchai 5 ปีที่แล้ว

    I've seen this concept floating around a lot, but didn't really understand it 'til now. Thanks!

  • @barrettvelker198
    @barrettvelker198 5 ปีที่แล้ว

    Awesome animations!!!

  • @Pedritox0953
    @Pedritox0953 ปีที่แล้ว

    Great lecture!

  • @xystem4701
    @xystem4701 3 ปีที่แล้ว +1

    And here I was thinking this was just going to be a simple minimax video!

  • @Lufernaal
    @Lufernaal 5 ปีที่แล้ว

    Loved the video

  • @hosmanadam
    @hosmanadam 5 ปีที่แล้ว

    Your videos are perfectly optimized to be easily processed by my learning function.

  • @BuceGar
    @BuceGar 5 ปีที่แล้ว +1

    Great video and explanation, doesn't address the fundamental problems we will invariably have with AGI, but shows some of the potential dangers.

  • @TheMenIdo
    @TheMenIdo 4 ปีที่แล้ว

    This is brilliant

  • @DisfigurmentOfUs
    @DisfigurmentOfUs 5 ปีที่แล้ว

    A very valuable video for me, thank you.

  • @peto348
    @peto348 5 ปีที่แล้ว +1

    Very high quality video to teach general public something about distillation and amplification. Of course there have to be AI safety somewhere in this video, but I think this kind of video is also good for someone who is interested in AI in general.

  • @ArtinKavousi
    @ArtinKavousi ปีที่แล้ว

    you are wonderful Being! for what you doing ! so helpful in these time and age of probabilities!

  • @SamB-gn7fw
    @SamB-gn7fw 5 ปีที่แล้ว

    Really nice video, explained the topic well

  • @GoatzAreEpic
    @GoatzAreEpic 5 ปีที่แล้ว +1

    Absolutely amazing and helpful for learning strategies as well( learning to become a front end dev atm)

  • @YouAreLoved321
    @YouAreLoved321 5 ปีที่แล้ว +7

    rob miles new video boys get the popcorn!

  • @GuuraHeavenbound
    @GuuraHeavenbound 4 ปีที่แล้ว +1

    Wooo! Said Polat! I've been following Seed (their Webtoon narrating the birth of a super AI) since it got featured on the platform ^^ I'm watching this video kinda late, but I think it's neat "how small the world can be". Also, really informative and interesting video Robert! ...I'm totally not binge-ing all of your uploads. Nope, nuh-uh. ....promise :3

  • @jameslincs
    @jameslincs ปีที่แล้ว

    This video deserves more views

  • @roberttomsiii3728
    @roberttomsiii3728 5 ปีที่แล้ว

    Thank you for being MY amplified agent.

  • @nielsgroeneveld8
    @nielsgroeneveld8 5 ปีที่แล้ว

    Few lectures have been as unbelievably good as this one.

  • @sky5d
    @sky5d 5 ปีที่แล้ว

    the animations really paid off.

  • @ivanshmarov2866
    @ivanshmarov2866 3 ปีที่แล้ว

    This amplification and distillation process is more akin to how we, humans, do research. First, everyone has little understanding of the subject. Then we assemble and reason about it together, coming to a conclusion. This conclusion is distilled and distributed among everyone, resulting now in everyone having a complete understanding of the subject.

  • @DeclanMBrennan
    @DeclanMBrennan 5 ปีที่แล้ว

    Crystal clear explanation with no waffle. Thank you. The graphics are so useful, they need their own name. How about didactic visualizations? :-)

  • @saltix0
    @saltix0 5 ปีที่แล้ว

    Very great!

  • @werewolf4358
    @werewolf4358 5 ปีที่แล้ว +14

    "How to keep improving when you're better than any teacher. " this video isn't for me, but I sure am curious.