AI Learns to Escape (deep reinforcement learning)

แชร์
ฝัง
  • เผยแพร่เมื่อ 18 ธ.ค. 2022
  • AI Teaches Itself How to Escape!
    In this video an AI Warehouse agent named Albert learns how to escape 7 rooms I've designed. The AI was trained using Deep Reinforcement Learning, a method of Machine Learning which involves rewarding the agent for doing something correctly, and punishing it for doing anything incorrectly. Albert's actions are controlled by a Neural Network that's updated after each attempt in order to try to give Albert more rewards and less punishments over time.
    Everything in this video (except for the music) was created entirely by myself using Unity. Check the pinned comment for more information on how the AI was trained!
    Current Subscribers: 57,579
  • บันเทิง

ความคิดเห็น • 3.7K

  • @aiwarehouse
    @aiwarehouse  ปีที่แล้ว +6146

    More information about how Albert was trained:
    Time it took to train:
    Room 1: 0h 12m 35s
    Room 2: 0h 15m 32s
    Room 3: 0h 57m 10s
    Room 4: 1h 06m 40s
    Room 5: 1h 47m 14s
    Room 6: 5h 56m 35s (I overtrained it a bit to get more consistent results)
    Total Training Time: 10 hours, 15 minutes, 46 seconds (plus 3 weeks of trial and error for the last room)
    This is a very long comment going over more of the details of how Albert works and issues he currently has. I've tried to make it as easy as possible to understand, but some parts are complicated. Either way, after the last video there were many people wanting more information, so here you go!:D
    If you're interested in training your own AI like Albert but don't know how, there's now a really easy way to do it! Luda, an AI lab, recently built a web app that allows you to create and train your own AI using deep reinforcement learning (just like Albert) completely for free in your browser! You build your own character (called a Mel) with lego-like building blocks then watch it train in real-time on their website in just a few minutes (really). It's an awesome project, and just like my videos, makes deep reinforcement learning so much more accessible, which is why I love it so much. This section of the comment is sponsored by Luda, but these words are entirely my own, it's an amazing project that I would have been obsessed with had they released it before I built Albert. I've genuinely been looking for a sandbox/game exactly like this since I was a kid. They're still early, but they're giving my audience first access to their closed, pre-alpha build. Make sure you check out their site and create an AI agent for yourself!:D prealpha.mels.ai
    Now, back to Albert:
    NOTE:
    You only see one Albert in the video, but there are actually around 50-100 copies of Albert and the room he’s in training simultaneously behind the camera to significantly speed up the training time (and the time it takes to go through all the footage to edit the video).
    THE BASICS:
    Albert was trained using reinforcement learning, meaning he was rewarded for doing things correctly and punished for doing them incorrectly (the reward is just increasing Alberts score, and the punishment is decreasing it). After Albert finishes each attempt, the actions he took are analyzed and the weights in the neural network (Albert's brain) are adjusted using PPO (proximal policy optimization) to try to prioritize the actions that lead to a positive outcome, and to try to avoid the actions that lead to a negative outcome, using the sum of the rewards and punishments as the evaluation of the outcome. Albert starts off making essentially random decisions until he accidentally hits the pressure plate in the first room and is rewarded, then, as mentioned above, the weights in his neural network brain are adjusted in order to try to replicate that reward. Or, Albert does something to be punished (like hitting the obstacle), in which case the weights in his neural network are adjusted to try to avoid that. This process continues and eventually results in him being able to walk towards the pressure plates and out of the room (into an invisible pressure plate behind the door), where he sees the second room and continues his learning in the new environment. I had to use the invisible pressure plate at the front of the next room to get him to go through the door because once he makes it though once, he’s in the next room permanently, so he can’t learn to go through the door himself.
    REWARD FUNCTION:
    The actual rewards and punishments were given as follows: falling off the platform (-0.5), hitting an obstacle or wall (-0.1), hitting the ground (-0.05) to try to minimize excessive jumping, hitting a pressure plate (+1, +0.9, +0.8, depending on how quickly it’s hit), escaping the room (+1).
    ALBERT’S BRAIN:
    Alberts brain is a neural network with a total of 5 layers (one input layer, 3 hidden layers and one output layer). The network is a Multi-Layer Perceptron (MLP) with 897 nodes, 510 input nodes, 128x3 hidden nodes and 3 output nodes. When his brain changed in room 6, the 510 input nodes became 1230, making the total number of nodes 1617.
    In the first brain, there were a total of 510 inputs, all from raycasts. There were 3 raycasts looking down, 7 at eye level and 7 above Albert’s head, all with an FOV of 70 to try to mimic our own eyesight. Each of these 17 raycasts could detect the 4 types of objects (ground, wall, obstacle, pressure plate) and the distance to the object it hits. This leads to a total of 6 observations per raycast (one observation is whether or not an object is there, one for the distance to the object and one for each of the 4 detectable tags). The raycast observations are then stacked 5 times, allowing Albert to remember the previous 5 observations. Each observation is observed and acted on every 10 academy steps, there are 50 academy steps per second, meaning Albert has a short term memory of exactly 1s. The 10 academy step delay in actions also makes Albert’s movement look a lot smoother than most AI you’ll find on TH-cam, this makes for a slightly less accurate AI, but a much better viewing experience. There were 17 raycasts each responsible for 6 inputs, and they’re each stacked 5 times, so there are 17*6*5=510 total observations in the input layer in the first brain.
    In the last room Albert needed more observations to be able to complete it, so I increased the number of raycasts. He now has 17 raycasts at eye level, 17 above his head, and 7 looking down, changing the number of observations to 41*6*5=1230. These new observations would have allowed him to beat the previous rooms more accurately, but the need for them only arose in the last room.
    I’ve given Albert 3 hidden layers each with 128 nodes because the last video had 2 hidden layers and I figured the moving obstacle adds another layer of complexity that the neural network should account for. 128 nodes per hidden layer was chosen fairly arbitrarily, 128 is just the default number of nodes per hidden layer in ML-Agents.
    There are a total of 3 outputs, one to determine Albert’s forward/backwards movement (go forward, backward or do nothing), one to determine his right/left turning (turn left, right or don’t turn) and one to determine if he jumps or not. Having 3 outputs allows him to perform all 3 actions at the same time.
    ISSUES:
    There are a few issues with Albert’s brain for this task, for starters, I greatly underestimated how many observations are needed to accurately avoid these moving obstacles. This was fixed in the last room by more than doubling the number of raycasts, but I could have improved the AI even more by also giving him his coordinates in the room, so he can more accurately understand which positions in the room are dangerous and get out of them quickly.
    If you're still reading this, you're probably really smart and want to learn more about Albert, so make sure to join my discord server I just made where we can talk more about the details of Albert's AI! discord.gg/jM2WkNuBnG :)
    There also was an issue with the vertical spinner in Room 5, there were too few raycasts for Albert to consistently see that spinner (sometimes the raycasts see to the left and right of the spinner, but not the spinner itself, resulting in Albert being blind to it. This caused Albert to try to jump through the spinner regardless of its position (unless the vertical spinner was directly in front of him), so he couldn’t do it very consistently. This is no longer an issue after Albert’s brain upgrade in the last room, but it was during the training of Room 5.
    Overfitting was also an issue with this training. Overfitting generally isn’t a big concern with reinforcement learning tasks because the training data is exactly the same as the testing data so it’s guaranteed to overfit to some degree, but the issue arises when Albert overfits too much to one room, beats it, then starts the next room. Albert most noticeably overfit to Room 4, making it take a while for him to figure things out in Room 5. This can mostly be fixed by randomizing the locations of the pressure plates, platforms and obstacles in the copies behind the camera, but that would require a lot of strategic limits on the randomization of the positions to make sure it’s always possible, which isn’t ideal. I think a better way to address overfitting for this task, and one that I started implementing in Room 5, is to make very slight and random movements to everything in the copies of the rooms, as well as making copies that don’t have obstacles, and copies that are mirrored. This should get the best of both worlds, where Albert is less likely to overfit, without the need for strategic changes to the rooms; the changes can be made automatically.
    We're looking to hire people to help make these videos! If you're a talented Unity game developer you can apply for a full time position here forms.gle/ko54z1LQmZNUT9Vp8 And if you're a talented AI developer (ML-Agents), you can apply here! forms.gle/Uou1Vwb5Q9VccaAY7 We're looking for full time employees, but part time works too, what we're really looking for are skilled and passionate people, so feel free to apply if you're interested! :D

    • @PembuatKomentarHandal
      @PembuatKomentarHandal ปีที่แล้ว +170

      Its Cool That Ai Is Getting Smarter

    • @realfranklinbadge
      @realfranklinbadge ปีที่แล้ว +72

      Noted

    • @ethanwilde4716
      @ethanwilde4716 ปีที่แล้ว +135

      I still think there should be a test of him learning to not hit a button or else the timer will restart and not the test. Or something to that degree. Like he learns buttons are good but not every button should be pressed

    • @skeleton819
      @skeleton819 ปีที่แล้ว +14

      i dont understand how having copies of him behind the camera helps with video editing, you dont even see him

    • @lamelime1
      @lamelime1 ปีที่แล้ว +17

      @@stonksinwestycje5004 excuse me what

  • @gorillanoodles
    @gorillanoodles ปีที่แล้ว +5480

    The fact that he just gives up occasionally and just 360’s off the map is such a mood

  • @davemustang8173
    @davemustang8173 ปีที่แล้ว +10165

    I'll never stop loving Albert's sick 360 spins when he nails a jump

    • @SoraQuill
      @SoraQuill ปีที่แล้ว +625

      Little dude’s got a lotta personality for something with literally no personality~ god i love him~

    • @RubyPiec
      @RubyPiec ปีที่แล้ว +60

      @@SoraQuill same

    • @kip9793
      @kip9793 ปีที่แล้ว +261

      *360s into the void*

    • @mpkki2499
      @mpkki2499 ปีที่แล้ว +238

      It looks cool, but I also like to think that this way AI scouts area around him without losing any time as Albert can only see forward like humans do but not all the ways at once.

    • @maxbranvall4048
      @maxbranvall4048 ปีที่แล้ว +144

      @@mpkki2499 In Albert’s first video I believe AI Warehouse mentioned that’s exactly why Albert does these 360s, to scope out the area.

  • @Sakkeru96
    @Sakkeru96 11 หลายเดือนก่อน +603

    I like it when Albert occasionally decides to pirouette gracefully into the abyss

    • @melonized_3304
      @melonized_3304 9 หลายเดือนก่อน +10

      Got me laughing

    • @Eyeecuu
      @Eyeecuu 2 หลายเดือนก่อน +2

      sweet release of the abyss

    • @abelhinha_gamer
      @abelhinha_gamer 6 วันที่ผ่านมา +1

      SAME

  • @heyyou9472
    @heyyou9472 ปีที่แล้ว +387

    While very adorable, his jumps and 360 rotations also serve a very clever purpose. Since he can't really see high objects or see around him, when he jumps and does a 360, he memorizes his environment and works from there. Sometimes Albert just.... jumps off or does weird actions, which I don't really get but hey, he's a deep reenforcement learning AI, he's in his own world.

    • @dan_schnider
      @dan_schnider ปีที่แล้ว +17

      Albert doesn’t do it because of that. You can tell that he can’t rennet his surroundings he only remembers how he did int he previous rounds. He does that because the ai is coded to always move(it seems unless he gets sideways or upside down) he wouldn’t really be able to learn that either was though.

    • @tomdebom1346
      @tomdebom1346 11 หลายเดือนก่อน +38

      @@dan_schnider Creator did say that he has some short term memory, but it doesnt last very long

    • @funguy3259
      @funguy3259 7 หลายเดือนก่อน +8

      The concept of object permanence or whatever its called leaves his body after like 3 seconds lol

    • @ishas4421
      @ishas4421 6 หลายเดือนก่อน +4

      He just jumps off bc he is aware that the he couldn’t do it in the time allotted I’ve noticed when it’s a crap run he’ll just jump when there’s no way he could come back from it

    • @CardGamesRule.
      @CardGamesRule. วันที่ผ่านมา

      @@ishas4421the reason is he probably gets punished the more he gets hit by the spinners so hed rather just off himself then get hurt again since I don’t think he gets punished from falling

  • @retr0robbin
    @retr0robbin ปีที่แล้ว +3907

    I’d love to see an attempt counter in the room so we can see how long it took for each highlighted milestone to take place

    • @stegpeng
      @stegpeng ปีที่แล้ว +130

      also maybe a stopwatch to see how long it takes 🤔

    • @depressedpikachu1535
      @depressedpikachu1535 ปีที่แล้ว +22

      Big brain thinking

    • @robertrse
      @robertrse ปีที่แล้ว +7

      I agree

    • @lego_61
      @lego_61 ปีที่แล้ว +4

      yes

    • @abdulazizsukhrobjonov7669
      @abdulazizsukhrobjonov7669 ปีที่แล้ว +12

      Opened comments to suggest same ideas)
      It’d be good to see attempt counter+time for each room separately and total counter+time for all rooms

  • @user-mm5xz8ib8c
    @user-mm5xz8ib8c ปีที่แล้ว +4283

    The moments when Albert "ragequits" and jumps off the cliff after failing are the funniest thing ever.

  • @mothramaster1837
    @mothramaster1837 ปีที่แล้ว +953

    I think a big issue Albert had regarding his AI is that he has zero object permanence. The moment something leaves his cone of vision he doesn't seem to acknowledge it's existence anymore, and thus he ends up getting blindsided by spinners at times or fails to locate the door when he clearly saw it.

    • @yyhhttcccyyhhttccc6694
      @yyhhttcccyyhhttccc6694 ปีที่แล้ว +11

      his issue is only 2 videos

    • @jilljohn2638
      @jilljohn2638 ปีที่แล้ว +1

      ​@@yyhhttcccyyhhttccc6694 what

    • @yyhhttcccyyhhttccc6694
      @yyhhttcccyyhhttccc6694 ปีที่แล้ว +67

      @@jilljohn2638 he has only 2 videos and it makes me mad

    • @yangliu6901
      @yangliu6901 ปีที่แล้ว +47

      @@yyhhttcccyyhhttccc6694 hey there’s a new video

    • @user-kd1ho9bu6g
      @user-kd1ho9bu6g ปีที่แล้ว +1

      @@yyhhttcccyyhhttccc6694 agreed!!!!

  • @Pvkasz
    @Pvkasz 9 หลายเดือนก่อน +121

    I love how it looks like it has a "I wasnt planning on getting this far" moment everytime it gets over an obstacle and has to tackle a new part

  • @CavemanNo.12
    @CavemanNo.12 ปีที่แล้ว +377

    7:40 bro Albert's front flip was sick. My man's a gymnast

  • @CrippIusDungledeen
    @CrippIusDungledeen ปีที่แล้ว +466

    Albert quickly learned that spin-jump technique, showing just how smart AI's really are. He got a quick scope of the area.

    • @davidthecommenter
      @davidthecommenter ปีที่แล้ว +9

      i never realized that the spinjump is actually practical for information, i thought that they saw based on the video's perspective until i realized their sight is _actually from their eyes_

  • @Ocro555
    @Ocro555 2 หลายเดือนก่อน +5

    The last scene suddenly gave a sort of a horror movie vibe where the main character comes to realise that his entire life was dictated and he never was able to achieve freedom, with his sole purpose being an entertainment tool imprisoned forever...

  • @kelast203
    @kelast203 ปีที่แล้ว +55

    I very much love how he learned to spin during jumps so that he could survey the area.

  • @LHS_Shadow
    @LHS_Shadow ปีที่แล้ว +1424

    I would love to see Albert try the whole level over to see how much he really learned rather than memorized

    • @TheSinzy
      @TheSinzy ปีที่แล้ว +139

      He is neither learning nor memorizing anything. He just clicks random buttons and checks if that makes him progress further to the way out.

    • @nocanseegreen3845
      @nocanseegreen3845 ปีที่แล้ว +82

      Or to have him start in randomized positions in the set starting area so he can't follow the same path so strictly

    • @nocanseegreen3845
      @nocanseegreen3845 ปีที่แล้ว +154

      @@TheSinzy if it was not learning it would not get any better at escaping. It is changing the weights of some things in the neural net so its not like its an unchanging randomization machine like you say

    • @g10xz._
      @g10xz._ ปีที่แล้ว +39

      @@TheSinzy if he didn’t learn anything or memorize it then he wouldn’t have made it past the first room and yes AI can both learn and memorize things

    • @alexmag342
      @alexmag342 ปีที่แล้ว +2

      @@g10xz._ they can't, no such thing as "AI" eitheir

  • @heduck
    @heduck ปีที่แล้ว +671

    the ending of albert tipping over at 14:02 on sync with the music is so perfect i love it

    • @DarkKnight-te6wq
      @DarkKnight-te6wq ปีที่แล้ว +2

      No way...I didn't notice that

    • @hello-hb1ll
      @hello-hb1ll ปีที่แล้ว +3

      The wall and ground crashes were also synced lol

  • @daedalus332
    @daedalus332 ปีที่แล้ว +29

    I love that even after he has to learn everything again of getting his brain upgraded, he still learns to do all the 360’s and everything

  • @Rocky4719
    @Rocky4719 ปีที่แล้ว +2195

    I don’t know what gets me more: the fact that Albert will randomly ragequit and jump off the map, or the little celebratory acrobatics he’ll sometimes do when he gets a part right 😂

    • @Nikitashow12355
      @Nikitashow12355 ปีที่แล้ว +24

      Мне кажется он понимает что уже слил этап и сам сливается

    • @Im-not-a-moron
      @Im-not-a-moron ปีที่แล้ว +14

      @@Nikitashow12355 do you like vodka?

    • @Im-not-a-moron
      @Im-not-a-moron ปีที่แล้ว +8

      @@Nikitashow12355 ты любишь водку?

    • @Nikitashow12355
      @Nikitashow12355 ปีที่แล้ว +26

      @@Im-not-a-moron bruh, i like coca cola because still can't drink alcohol. specifically tried some and I dont liked it, but not vodka

    • @jasonriddell
      @jasonriddell ปีที่แล้ว +19

      my guess is the VERY SHORT vision buffer and he "forgets" the cliff is even there

  • @user-wy3id7op5t
    @user-wy3id7op5t ปีที่แล้ว +18

    Watching this is almost like teaching a class full of kids, it takes so much patience until they finally understand in the end.

  • @technofreak39
    @technofreak39 ปีที่แล้ว +22

    The love to the details is sooooo good. The coloring of different things makes this so relatable. Whenever there is "Albert" in orange, I see a half meanless half dumb face which doesnt understand the problem. Green is when something is good and Red pure frustration. And the music stopping (followed by red text).. Love it! :D

  • @EllieCollie
    @EllieCollie ปีที่แล้ว +1399

    “Ai will take over the world!”
    AI : “I have pressed the button to open the door, I will go back in the room I came from!”

    • @natix1_
      @natix1_ ปีที่แล้ว +147

      AI: *Doing sick frontflips and 360s*

    • @elcazador9900
      @elcazador9900 ปีที่แล้ว +54

      AI: time to celebrate with some sick flips

    • @chinabluewho
      @chinabluewho ปีที่แล้ว

      The problem with a higher intelligence is that it will fool us into a false sense of safety, like when you were two and your parents would fool you about everything. if GAI comes into being we are literally doomed, there will be no tomorrows for humanity.

    • @amogus3285
      @amogus3285 ปีที่แล้ว +30

      AI: 3. 2. 1...Deploying neurotoxin!

    • @Matthigast
      @Matthigast ปีที่แล้ว +50

      AI: "I have pressed the button to open the door, I will now jump off this cliff"

  • @snouwballproductions3232
    @snouwballproductions3232 ปีที่แล้ว +290

    It's amazing how easy it is to get emotionally attached to one bouncy orange cube

  • @isaiahrosner3780
    @isaiahrosner3780 ปีที่แล้ว +27

    This is tremendous content. I can tell you spent tons of time on the video editing, not to mention the whole process. I like the added explanations and timer on this video.
    If I could add one thing to the next one, I’d like an attempt counter for each room.

  • @TheGreatMagispeller
    @TheGreatMagispeller ปีที่แล้ว +21

    11:44
    The escape is a lie

  • @Nightshade-Aurora
    @Nightshade-Aurora ปีที่แล้ว +748

    Room 6 having "The cake is a lie" written on the wall followed by Albert being unable to truly escape was great

  • @Quirions
    @Quirions ปีที่แล้ว +1068

    Will he ever have a friend triangle or sphere to escape?
    Would honestly be cool to have teamwork based rooms

    • @dwsel
      @dwsel ปีที่แล้ว +99

      A companion sphere? 🤔

    • @aeterborne
      @aeterborne ปีที่แล้ว +85

      a friend he has to sacrifice to reach the goal

    • @Quirions
      @Quirions ปีที่แล้ว +63

      @@aeterborne or just the idea of pressing multiple buttons simultaneously to progress or needing to jump on one another even tho the last is probably a little bit too hardcore to do

    • @athsmooth2171
      @athsmooth2171 ปีที่แล้ว +5

      @@Quirions or stacking on top of each other

    • @memey6978
      @memey6978 ปีที่แล้ว +19

      Now this is starting to look like Portal all over again

  • @JezzyCrazyTV
    @JezzyCrazyTV ปีที่แล้ว +17

    4:34 when the cool kid Breaks His legs:
    Albert: Look this 720° jump. *Falls*

    • @Sandwhxih
      @Sandwhxih 7 หลายเดือนก่อน

      Albert got combo'd by the spinners

  • @jademonass2954
    @jademonass2954 ปีที่แล้ว +14

    7:35 thats amazing

  • @optimisticori
    @optimisticori ปีที่แล้ว +422

    i think it was really interesting watching Al’s decision to do a 360 every time he jumped. he probably did that so he could better see where everything is. i noticed him do one 360 in place, and then he beelined for a pressure plate in a new room.

    • @harmonicpsyche8313
      @harmonicpsyche8313 ปีที่แล้ว +7

      Makes sense to gather as much information as possible. Probably analogous to turning your head to look around at everything when you enter a room.

    • @olivefontaine2562
      @olivefontaine2562 ปีที่แล้ว +11

      It’s also optimal because spinning horizontally keeps him more stable vertically thanks to the physics. In real life if you throw a cup or something upwards it wobbles and falls pretty randomly. But if you spin it and throw it up it will stay straight thanks to its angular momentum.

    • @DeezNuts-ej6sr
      @DeezNuts-ej6sr ปีที่แล้ว

      Kinda like sniper bots in tf2

    • @omegalul9629
      @omegalul9629 ปีที่แล้ว +7

      @@olivefontaine2562 such physics were very likely not implemented here.

  • @dynhoyw
    @dynhoyw ปีที่แล้ว +1098

    i like to theorize that whoever wrote "the cake is a lie" in room 6 is actually other versions of albert. basically, this albert is not the first AI, but only one of the many AIs the AI Warehouse has created and trained. they'd then be used for very dark, secret, malicious and malevolent intents. i also like to imagine that AI Warehouse is also an AI with its own will and consciousness that has somehow founded this facility to create and train AIs at a large scale

    • @bsm377
      @bsm377 ปีที่แล้ว +141

      I'm GLaD that I am not the only one person to notice this reference.

    • @davidthecommenter
      @davidthecommenter ปีที่แล้ว +73

      @Boathook Animations GLaDOS. it's all a portal reference

    • @lemonsinternet139
      @lemonsinternet139 ปีที่แล้ว +13

      @@davidthecommenterYES WE NEED MORE PORTAL

    • @xfanoentres7344
      @xfanoentres7344 ปีที่แล้ว +7

      When i saw this vid i understood what Glados was doing, completely understood this reference lol

    • @chuperzz2866
      @chuperzz2866 ปีที่แล้ว +12

      Albert lore

  • @Victor_Manuel.--117
    @Victor_Manuel.--117 4 หลายเดือนก่อน +1

    How incredible, I didn't expect Albert to do a front flip, I was very surprised, good video c:

  • @enolp
    @enolp ปีที่แล้ว +32

    I’m at a point halfway between laughing and crying because I see myself so much in this little ai cube dude, as if all my attempts at learning how to live life as a human are being monitored and judged by some alternate higher version of myself

    • @chaoticgood1977
      @chaoticgood1977 ปีที่แล้ว +4

      We're getting out of the psycho house with this one🗣🗣🗣

  • @fuckoff9137
    @fuckoff9137 ปีที่แล้ว +15

    8:35 - "Albert, you suck" *Albert ragequits*

  • @ZeNyfh
    @ZeNyfh ปีที่แล้ว +513

    gotta love albert bouncing around mindlessly at the beginning

    • @ethanwilde4716
      @ethanwilde4716 ปีที่แล้ว +21

      Watching him mess up is humorous dispite him not having any emotion or reaction to messing up

    • @RLSova
      @RLSova ปีที่แล้ว +5

      It's so cute

    • @JackieTheYeen
      @JackieTheYeen ปีที่แล้ว +4

      Just vibing

    • @melonized_3304
      @melonized_3304 9 หลายเดือนก่อน

      And room 6, when he was brainwashed.

  • @duckyboy8169
    @duckyboy8169 ปีที่แล้ว +279

    My favorite part is 7:20 in room 3, where he lands perfectly, but he can't figure out how to get through the door, so he just falls over from the spinner

    • @TotallyTaRz
      @TotallyTaRz ปีที่แล้ว +20

      Nah he just didn’t wanna go through, he even shook his head at the door

    • @ErraticPulse
      @ErraticPulse ปีที่แล้ว +6

      Classic indecision moment

    • @The_Magical_Cat__
      @The_Magical_Cat__ ปีที่แล้ว +9

      He knows if he goes in he has to do another puzzle

    • @ITS_SAMUEL101
      @ITS_SAMUEL101 ปีที่แล้ว +2

      *room 4

  • @meatlemonade3338
    @meatlemonade3338 ปีที่แล้ว +8

    Albert has orange tomcat energy. what a neat little guy. can't wait to see him graduate med school or whatever cubes aspire to

  • @JustLetMeUseNPC
    @JustLetMeUseNPC 11 หลายเดือนก่อน +4

    If this was turned into a game, with the premise that you're an advanced AI learning to do things with an AI narrator that either communicates with text popups or using some TTS (or maybe both), I would instantly buy that game the moment I could.

    • @catface5144
      @catface5144 7 หลายเดือนก่อน

      Basically The Stanley Parable xD

  • @mothgyaru3158
    @mothgyaru3158 ปีที่แล้ว +204

    Dude, this is insane! Blows me away, I don’t know how you even BEGIN to make Albert. I always cheer whenever he completes something LOL

    • @IloveHildasfeet
      @IloveHildasfeet ปีที่แล้ว +1

      Sonic

    • @Logic-ys7rw
      @Logic-ys7rw ปีที่แล้ว +1

      @@IloveHildasfeet Triplets born, the throne awaits

  • @EveCitrus
    @EveCitrus ปีที่แล้ว +832

    It's like 4 o clock on the morning but to me there's something absolutely beautiful about this all. Like we as humans just have such an innate need to give everything a personality. All the comments cheering Albert on, calling the spin he does to check his surroundings "celebratory acrobatics", the commentary throughout the video, even down to the physics and the way Albert moves gives him this loveable aloofness to him, making you cheer him on more as he tries and fails and does his sick tricks. And christ, he has googly eyes. It's just. Such a wonderful thing how much personality we give an AI cube.

    • @waluigihentailover6926
      @waluigihentailover6926 ปีที่แล้ว +12

      Well said!

    • @dorol6375
      @dorol6375 ปีที่แล้ว +2

      Did not expect to see you here

    • @RonnieMcNutt_Mindblowing
      @RonnieMcNutt_Mindblowing ปีที่แล้ว +1

      Nut

    • @funkyfranx
      @funkyfranx 9 หลายเดือนก่อน +2

      Yes it’s weird how we give an unsentient AI so much of our emotions, yet we treat cows, pigs, chickens and other animals like inanimate objects.

    • @inkognito105
      @inkognito105 7 หลายเดือนก่อน +1

      Ааа

  • @adamion1993
    @adamion1993 ปีที่แล้ว

    I LOVE your videos, the extra effort is really showing on all fields with this one! Thank you!

  • @JezzyCrazyTV
    @JezzyCrazyTV ปีที่แล้ว +16

    0:51 Albert: did you Just... PRAISE ME? IM SO HAPPY I COULD JUMP OF YAY YAY YAY WAIT AHHHH

  • @JohnSmith-gj6md
    @JohnSmith-gj6md ปีที่แล้ว +571

    Do you think the reason he's constantly jumping and doing 360s is so he can see the room? That way he's able to survey more of the area around him and figure out where the pressure plates are etc.

    • @morrowmorrow4811
      @morrowmorrow4811 ปีที่แล้ว +86

      this is precisely it.

    • @BrandonWillWin
      @BrandonWillWin ปีที่แล้ว +92

      Has to with maintaining stability as well. Helps him remain upright, sort of like a frisbee, so that he can minimize the chance of tumbling around when he lands

    • @blakksheep736
      @blakksheep736 ปีที่แล้ว +41

      @@BrandonWillWin I doubt this physics sim bothered to account for gyroscopic stability.

    • @1nfinite464
      @1nfinite464 ปีที่แล้ว +6

      STOP RUINING THE JOKE NOOO
      YOU MONSTER

    • @thegameman9046
      @thegameman9046 ปีที่แล้ว +12

      Bro does a 360 no scope

  • @renskedunnewold1995
    @renskedunnewold1995 ปีที่แล้ว +89

    I love that he still does his little jump spins after hitting a pressure plate, it's his little celebration

  • @katiekawaii
    @katiekawaii ปีที่แล้ว +1

    Your editing and commentary really make these compelling. I watched every second and it was great. Sub'd for sure.

  • @HappyMatt12345
    @HappyMatt12345 ปีที่แล้ว +2

    Machine learning is something I'm really interested in learning about tbh. Also this was quite fun to watch! I also noticed the portal reference in one of the levels which is cool!

  • @paperpauperplayer
    @paperpauperplayer ปีที่แล้ว +150

    I would love to see after he completes all the chambers, if he's able to do all of them back to back without a mistake. As in, when he completes the last chamber, you save file his memory, reset the coding to where any room he's in, if he fails, he starts back in the first room. Doesn't matter if he's in room 2 or room 6. I want to see if he is able to retain the patterns and methods to get through all the rooms in one run!

    • @JanMaynz
      @JanMaynz ปีที่แล้ว +11

      That would be AWESOME

    • @salttolerant2690
      @salttolerant2690 ปีที่แล้ว +1

      I originally thought that was gonna happen. I would also like to see that

    • @dogsushienjoyer
      @dogsushienjoyer ปีที่แล้ว +1

      Yes please I thought that on the first video already, would be exciting

    • @DABiDo.O
      @DABiDo.O ปีที่แล้ว +4

      This would essentially mean the obstacle is one big room rather than seven, so given enough chances he should be able to get it.

  • @DeathByKittenz
    @DeathByKittenz ปีที่แล้ว +479

    I love "learning AI" content on TH-cam so much, but there is such a lack of little snippets of editing that give it character. It makes content like this so perfect and less for "educational purposes only", which are so cold occasionally. Please, keep making more because I know there are many more people like myself out there that crave this stuff. Amazing work and I can't wait to see more! 😁

    • @DeathByKittenz
      @DeathByKittenz ปีที่แล้ว +3

      @@kyro7482 Definitely will, thanks!

  • @JezzyCrazyTV
    @JezzyCrazyTV ปีที่แล้ว +10

    3:08 Albert: SPINNER YOU PUSH ME? ILL 360° OVER ALL SPINNERS

  • @JessicaLovesFoxes
    @JessicaLovesFoxes ปีที่แล้ว +1

    This was so much fun to watch! I love his little happy dances as well as his frustrated jumps. 😂❤

  • @meatlover33
    @meatlover33 ปีที่แล้ว +142

    There was something about Albert emotionlessly casting himself into the void that I found very inspiring

  • @dylanparrish-subda7141
    @dylanparrish-subda7141 ปีที่แล้ว +314

    This is like watching my mom learn to play video games when I was a teenager, but nobody's yelling at me when I laugh 🤣

  • @AnnetteBlack
    @AnnetteBlack ปีที่แล้ว +2

    Man, that's so sick. I laughed all the time. No, I really worried about Albert but I was waiting for that Portal's easter egg all the time. Your sense of humor is great and all skills that allow you to make such a masterpiece. Thank you very much and we'll be waiting for Albert's next journey. Wish you both a good luck!

  • @icecremmester
    @icecremmester ปีที่แล้ว

    The extra details, like the blinking, better gates, and timer really added something to this!

  • @ywatcher2106
    @ywatcher2106 ปีที่แล้ว +188

    Yes! more Albert! love these videos. so interesting from a computer perspective but also just as much from the "Albert is adorable and I love watching him try and eventually succeed on these challenges with the text commentating" the *sigh* at the door was perfect. everything was perfect :)

    • @obhwg
      @obhwg ปีที่แล้ว +2

      I think it could be more perfect if Albert got himself some kind of object permanence? Kind of unusual to see him spin around on each jump. Is it because the AI has figured out this is a good way to see things? Or is that itself proof it has memory?
      EDIT: Got time to (skim) read the pinned comment. I think I'm pretty wrong now, but I'm still curious about why he spins so much. Hope I didn't miss that explanation.

    • @adoplayzz9725
      @adoplayzz9725 ปีที่แล้ว

      @@obhwg I think he spins Like that so he has a straight line to the next part

  • @elemental_ofmusic390
    @elemental_ofmusic390 ปีที่แล้ว +267

    I love how at certain points, like 3:59, Albert just decides to do a sick 360 and hit the button in style. He may be a learning AI, but he still has style!

    • @neofalz7643
      @neofalz7643 ปีที่แล้ว +16

      Professionals have standards

    • @lukabrasi001
      @lukabrasi001 ปีที่แล้ว +6

      he's actually scouting the area when doing those so there's a reason he does them

  • @victorportable3892
    @victorportable3892 10 หลายเดือนก่อน +2

    Those 360 jumps are the best way for him to see his whole environment. Since he doesn't seem to have a memory about the objects in his world, that's the best way for him to stay focused.

  • @ZippyCoons
    @ZippyCoons 3 วันที่ผ่านมา

    I personally am inspired by Albert's long journey. I was attracted to your most recent post where Albert fearlessly faces off against a secondary AI, and was inspired to watch more! Though there are not many videos you've posted, I like to see Albert coming up with new and interesting ways to complete different tasks! (Also, i saw 'the cake is a lie' and it was so hard to not laugh XD so in short, you're awesome, keep on going!

  • @greenbean102
    @greenbean102 ปีที่แล้ว +169

    I love it when Albert just does a front flip like everything’s fine 7:39

    • @Felipe_9999
      @Felipe_9999 ปีที่แล้ว +1

      Albert's AI really decided that he wanted to show off for the lolz (and i would like those sick flips to get him a reward)

  • @hattyhat261
    @hattyhat261 ปีที่แล้ว +471

    What really makes me impressed is that Albert is kind of like us, as he makes mistakes and doesn’t understand anything at first, but eventually learns what to do.

    • @_isabxlla
      @_isabxlla ปีที่แล้ว +29

      that's the definition of learning 🤦‍♂️

    • @Helipelicoptro6939
      @Helipelicoptro6939 ปีที่แล้ว +23

      Yeh but we would just give up eventually

    • @hattyhat261
      @hattyhat261 ปีที่แล้ว +11

      @@Helipelicoptro6939 yeah Albert is a legend he never gives up bro got the best mindset (definitely not because he is an AI)

    • @xenai.
      @xenai. ปีที่แล้ว +2

      ​@@hattyhat261 6:10

  • @Kazomix._.
    @Kazomix._. ปีที่แล้ว +1

    i love how you add some text like you're talking to the ai, it makes these videos way more enjoyable to watch

  • @JezzyCrazyTV
    @JezzyCrazyTV ปีที่แล้ว +14

    7:04 Go in DOOR
    Albert: OK
    Spinner: YEET
    Albert: Stop
    Spinner: YEEEEEET
    Albert: AHHH

    • @laineylain
      @laineylain 11 หลายเดือนก่อน +2

      bro decided to make 6 timed comments

    • @treebacon
      @treebacon 7 หลายเดือนก่อน +2

      ​@@laineylainnot to be that person but erm ACKSHUALLY 7 ☝️☝️

  • @dappah302
    @dappah302 ปีที่แล้ว +151

    Your videos have been extremely well made so far! Can't wait to see what awaits Albert in the future!

    • @aiwarehouse
      @aiwarehouse  ปีที่แล้ว +33

      Thank you so much!

    • @dappah302
      @dappah302 ปีที่แล้ว +9

      @@aiwarehouseOf course! :D

  • @guy_roh
    @guy_roh ปีที่แล้ว +70

    Albert caving in due to pressure at 3:47

    • @LowBudgetBoi
      @LowBudgetBoi 3 วันที่ผ่านมา

      "I don't wanna do this no more :("

  • @aichi337
    @aichi337 ปีที่แล้ว

    Very interesting to watch an Ai learn! And good music. Would love to see more of those videos

  • @MistikTavuk
    @MistikTavuk ปีที่แล้ว +13

    11:11 the cake is a lie :)

  • @anmolagrawal5358
    @anmolagrawal5358 ปีที่แล้ว +19

    12:17 "The cake is a lie"
    I see, you're a man of culture as well

  • @HalfDecentDucc
    @HalfDecentDucc ปีที่แล้ว +58

    6:16 Albert got so annoyed from the locked door and wall spinner, he killed himself.

  • @lunakepio5387
    @lunakepio5387 ปีที่แล้ว +4

    What's interesting is, through the eyes, the thought you gave him, the name, the way he looks conscious it's really weird for my brain to process Albert, but you gotta love those spinning jump he doesn't only nail an obstacle but does it with style

  • @user-ly1qu3nd6h
    @user-ly1qu3nd6h ปีที่แล้ว +1

    I love your project that related to reinforcement learning. This is very interesting and Albert is becoming creature that considered error processing and reward prediction like us !!!! I would like to implement and simulate such this work in my study. Thank you for inspiring me :) V

  • @ethanwilde4716
    @ethanwilde4716 ปีที่แล้ว +593

    Question: does Albert remember these tests for every new experiment run? Or does he get a memory wipe and has to start at square 1?

    • @aiwarehouse
      @aiwarehouse  ปีที่แล้ว +752

      He keeps the same brain throughout the video (except for Room 6), but between videos his brain is wiped clean. It can sometimes look like he has a new brain when he enters new rooms, that happens when he overfits to the previous room too much, that makes him perform very poorly with anything even slightly different from the room he just left. A series that would be really interesting though is keeping the same brain and just constantly training on top of it, I might do that in the future:)

    • @silverstar1726
      @silverstar1726 ปีที่แล้ว +176

      @@aiwarehouse you should try out that series! I wonder how smart he would get if his memory wasn’t wiped.

    • @brackencloud
      @brackencloud ปีที่แล้ว +43

      i think it would also be cool to do runs through the whole system, so he has to train for everything. though im sure that would increase the simulation time exponentially

    • @StateCorp.
      @StateCorp. ปีที่แล้ว +16

      @@silverstar1726 Heh, imagine Albert becoming sentient and start to develop a hate towards its own creator...

    • @korettopun1203
      @korettopun1203 ปีที่แล้ว +5

      @@aiwarehouse please do it! Would love to see the same brain develops

  • @TheCakeIsNotaVlog
    @TheCakeIsNotaVlog ปีที่แล้ว +108

    Seeing him jumping on the spot and spinning in the air was really quite fascinating. For all his dumbassery, he got the hang of scoping out his surroundings real quick

  • @DJG_Studios
    @DJG_Studios ปีที่แล้ว

    This one’s my favorite in the trilogy so far, excited to see where this series goes

  • @flamingdog9207
    @flamingdog9207 ปีที่แล้ว

    I love the Portal influence with the style, plus it's just interesting to watch Albert try and navigate these

  • @bacondude420
    @bacondude420 ปีที่แล้ว +43

    10:58 the cake is a lie

  • @Kalaggu2020
    @Kalaggu2020 ปีที่แล้ว +105

    Dude, these series are so fun to watch. There's nothing like this on yt, sure there's the rare chance that you find 1 or 2 vids but this is magic, gold, true comedy and an actual series, not just 1 vid. Keep it up!

    • @aiwarehouse
      @aiwarehouse  ปีที่แล้ว +13

      Thank you so much!:D

  • @gec101
    @gec101 7 หลายเดือนก่อน +4

    12:21 the cake was always a lie, Albert.

  • @tannerh7774
    @tannerh7774 ปีที่แล้ว +3

    Hey, this is a cool series, excited to see more. I was wondering though, would Albert be able to generalize better if he had to start from level 1 each time he beat a new level?

  • @helizteil2625
    @helizteil2625 ปีที่แล้ว +44

    This honestly reminds me of a player who has absolutely zero idea what is happening and is just trying to learn by failing.
    Like, ramp up the difficulty by 10 and I can see myself making these same mistakes. It's honestly quite impressive how far we've come.

  • @giddygoon73
    @giddygoon73 ปีที่แล้ว +92

    I love how this channel has so many subscribers but just two videos and a short. They totally deserve it.

    • @nardalis4832
      @nardalis4832 ปีที่แล้ว

      We be waiting on the good stuff... patiently xD

    • @PieletPi
      @PieletPi ปีที่แล้ว

      I can't tell if youre sarcastic there with them deserving it.

    • @RiingMsn
      @RiingMsn ปีที่แล้ว

      Didnt they have more before

    • @giddygoon73
      @giddygoon73 ปีที่แล้ว +1

      @@PieletPi I'm not, I was gonna point it out but I just decided not to.

    • @adil0028
      @adil0028 ปีที่แล้ว

      Plot twist, they're all bots

  • @JezzyCrazyTV
    @JezzyCrazyTV ปีที่แล้ว +13

    5:11 hitting this is difficult
    Albert: whats difficult?

  • @Demavuratheannoying
    @Demavuratheannoying ปีที่แล้ว +1

    The one thing I would want in a simulation like this is a reward for the ai for being experimentative, that would encourage finding faster ways to do levels..

  • @insertedfailed3586
    @insertedfailed3586 ปีที่แล้ว +56

    Interesting way of teaching AI to more people!

  • @murphthasmurf5923
    @murphthasmurf5923 ปีที่แล้ว +87

    I wish we could see a post-learning run after he finishes the gauntlet to see if he actually retains the knowledge and can do it in a reasonable amount of time

  • @MyNameIsRati
    @MyNameIsRati 23 ชั่วโมงที่ผ่านมา

    It's like going through rooms blindfolded and hearing sounds when you succeed or lose.

  • @gorgsbagofchips3511
    @gorgsbagofchips3511 ปีที่แล้ว

    pls upload more this is so interesting and fun to watch

  • @DamonTorro
    @DamonTorro ปีที่แล้ว +45

    There's a sense of dread and depression whenever Albert just...jumped off...into the abyss.

  • @Faber779
    @Faber779 ปีที่แล้ว +29

    7:25 top 10 saddest anime deaths

  • @LuCarlio
    @LuCarlio ปีที่แล้ว

    These videos are so cool, i love this. 🤗

  • @saddesklunch2544
    @saddesklunch2544 ปีที่แล้ว +2

    I love that Albert does a little pirouette every time he jumps, it’s so adorable 🥰

  • @darkacadpresenceinblood
    @darkacadpresenceinblood ปีที่แล้ว +136

    it's so interesting how we all get emotionally attached to an ai cube if it has eyes, a name and gets treated as a living thing... we're all here calling him cute (and i mean he is, look at the lil celebration bounces when he gets something!)

  • @TheCoriKat
    @TheCoriKat ปีที่แล้ว +44

    8:00 it feels as if its alive and is looking at him and jumping in excitement for finishing the course!

  • @MoonGlow22
    @MoonGlow22 2 วันที่ผ่านมา +1

    7:40 He should be awarded for his backflips, like 10 times of pushing a plate

  • @fuzzyotterpaws4395
    @fuzzyotterpaws4395 หลายเดือนก่อน +1

    Albert randomly jumping off to end it faster when you know you failed is a thing I do in video games too!😂

  • @TristonNightshade
    @TristonNightshade ปีที่แล้ว +80

    7:50 a prime example of ai. He waited for a better opportunity to push forward

    • @Artameful
      @Artameful ปีที่แล้ว

      He couldn't see yet.

    • @OmniSync
      @OmniSync 8 หลายเดือนก่อน

      ​@@Artamefulhe can see

  • @Pixelcraftian
    @Pixelcraftian ปีที่แล้ว +47

    I wonder what would happen if you put all of the training from the other videos and applied it to something new, would it overcome it faster or slower?
    Awesome video!!

    • @kakyoindonut3213
      @kakyoindonut3213 ปีที่แล้ว +3

      slower

    • @Jeb4100
      @Jeb4100 ปีที่แล้ว

      hey

    • @Gardengap
      @Gardengap ปีที่แล้ว +1

      You again?

    • @itzdubstep1265
      @itzdubstep1265 ปีที่แล้ว

      I see your comments all around TH-cam on just about every video I watch.
      How do you do good sir?

    • @Gardengap
      @Gardengap ปีที่แล้ว

      @@itzdubstep1265 exactly

  • @TheAdvertisement
    @TheAdvertisement ปีที่แล้ว +8

    The text "speaking" to Albert, especially when it turns red, has this vibe of this dystopian testing facility pretending everything is safe.
    "The cake is a lie" on the wall definitely helps cement that-
    13:40 WAIT I PREDICTED IT

  • @Wallemations
    @Wallemations ปีที่แล้ว +41

    It'd be cool to see an extra screen under the timer that tells you what iteration is currently being shown! I'd love to see how many attempts he takes to complete a room.

  • @wheezeardjack
    @wheezeardjack ปีที่แล้ว +28

    I love him. I need more of him- please. I didn’t know watching AI learn would be so funny (especially with your comments and notes) and entertaining.

  • @mx.fuzzypants1911
    @mx.fuzzypants1911 ปีที่แล้ว

    Nice Portal reference with “The Cake is a Lie”! This was very entertaining. Albert is fun.

  • @NaiveCat45446zeroSEVEN
    @NaiveCat45446zeroSEVEN 8 หลายเดือนก่อน +7

    6:12 relatable

  • @d-boy
    @d-boy ปีที่แล้ว +11

    Goddamn man this is your second vid and you're popping off so hard already! And no doubt you deserve it. your videos are both informative and hilarious!Really happy for you man keep it up!!