I Trained an A.I to Train A.I (Deep Reinforcement Learning)

แชร์
ฝัง
  • เผยแพร่เมื่อ 25 พ.ย. 2024

ความคิดเห็น • 141

  • @Zuzelo
    @Zuzelo  ปีที่แล้ว +33

    Like and Subscribe if your first round isn't too dynamic and quite short either!

    • @ninjaduck8804
      @ninjaduck8804 ปีที่แล้ว +2

      I sure didn't.

    • @MrRobsn89
      @MrRobsn89 ปีที่แล้ว +1

      Add the dad to your AI Army to punish bad performing soldiers 😂

    • @ChristopherKetcherside
      @ChristopherKetcherside 9 หลายเดือนก่อน +1

      @@MrRobsn89 your a genuis!

  • @KingKhiGaming
    @KingKhiGaming ปีที่แล้ว +202

    Just like my own childhood thank you so much

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +40

      Ah memories :')

    • @mrfrog0913
      @mrfrog0913 ปีที่แล้ว +7

      Wow your house had no walls too?

    • @porciwall9261
      @porciwall9261 ปีที่แล้ว +2

      @@mrfrog0913 yooo same

    • @Mewthreee
      @Mewthreee ปีที่แล้ว

      Crazy, same here.@@mrfrog0913

    • @learnasienes2983
      @learnasienes2983 ปีที่แล้ว

      Really sorry to here that

  • @Depth_.
    @Depth_. ปีที่แล้ว +24

    I think only you could think of this, another classic

  • @maxiawesomekid899
    @maxiawesomekid899 ปีที่แล้ว +35

    He was to lazy to make an agonizingly complicated ai so instead he made an even more agonizingly complicated ai to teach slightly less agonizingly complicated ai s

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +6

      hmmm... now that you put it like that, perhaps it was not the most efficient solution xD

  • @Mrdashell
    @Mrdashell ปีที่แล้ว +22

    Now imagine if you added a mom ai that's job was to prevent dad ai from slapping the silly out of little pogo

    • @Zuzelo
      @Zuzelo  11 หลายเดือนก่อน +5

      xD

    • @NecyarUnáty
      @NecyarUnáty 6 หลายเดือนก่อน +1

      It can go on eternally, adding more and more pogos

    • @nw5922
      @nw5922 หลายเดือนก่อน

      I want the mom ai to be a mormon, start a family of 8, go viral, and then go to jail.

  • @tranquilclaws8470
    @tranquilclaws8470 ปีที่แล้ว +39

    One idea for AI learning that I thought up while watching a Trackmania video was having the AI work towards an ultimate goal but also setting its own sub-goals that half of the instances would work towards. After achieving some success with the sub-goal, this split AI would then be evaluated by the main goal again. This would allow the AI to innovate its strategy and explore new avenues to reach unorthodox ways of accomplishing the objective that only being rewarded for working toward the ultimate goal might never reveal.
    In the Trackmania example, the AI refused to drift around corners, as drifting was thought to be a waste of time. The AI was given the goal of drifting as much as possible instead of getting a good time on the track. After a few successful drifting iterations were completed, the new drifting AI was again measured by the track completion time goal. It got a better goal than before because it could now properly incorporate drifting to get around corners faster.

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +16

      Indeed, dividing the training in this way might help with avoiding getting stuck in local min/max.
      Designing the reward system usually is half the work :D
      Might be worth trying it out

    • @howuhh8960
      @howuhh8960 ปีที่แล้ว +1

      it is known as hierarchical rl, usually it does not work and very unstable in practice, so I would advise to use something else, like better exploration strategies (beyond simple gaussian noise)

    • @tranquilclaws8470
      @tranquilclaws8470 ปีที่แล้ว +2

      @@howuhh8960 Sounds fair. I suppose it only worked in Trackmania because the coder of the AI knew that drifting was more efficient than driving straight around corners and pointed the AI in the right direction.

    • @JohnDoe-qm6ub
      @JohnDoe-qm6ub ปีที่แล้ว +1

      Pardon my ignorance, but what is the difference between that and just giving a +1 reward to drifting and -1 reward for time taken?

    • @tranquilclaws8470
      @tranquilclaws8470 ปีที่แล้ว +1

      @@JohnDoe-qm6ub You would be negating learning how to drift with the time wasted overcoming the hurdle of learning how to drift. Really it would be distance x proportion of time spent drifting becoming the reward that would get the AI to drift more.

  • @changsookwak4636
    @changsookwak4636 5 หลายเดือนก่อน +4

    The Ai dad is like a Russian that smacks the spider out of the Ai child xD

  • @kaunghlamyat
    @kaunghlamyat ปีที่แล้ว +31

    Trainign an ai to train an ai isn't very good idea as it seemed to.
    its like *trainign a failure to train a failure*

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +13

      I don't see what could go wrong

    • @kaunghlamyat
      @kaunghlamyat ปีที่แล้ว +2

      @@Zuzelo neither am I but lol

  • @ezbooksmarketing5898
    @ezbooksmarketing5898 ปีที่แล้ว +7

    New video in September 9 2069: "I trained an AI to train humans"

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      Pogo for the 2069 President!

  • @timer1238
    @timer1238 ปีที่แล้ว +18

    I have an idea for even more functions for the AI war
    Food
    People will have the saturation bar that will go down. It will go down faster when the guy is out of breath or when he is damaged. Also if it is below 30% the guy will slow down and will not be able to run
    Bullets/arrows
    Well... as an item. Da guys will have a limited number of bullets. Also, landed arrows will also be as an item and can be picked up.
    Bullet scavenging
    You know the drill. Dead bodies are lootable. They will contain supplies such as food and projectiles.
    Cavalier
    A guy on a horse. They will have separate hitboxes and when the horse is dead then the cavalier will be turned into a corresponding class without a horse (for example archer)

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +6

      I assume that is for the Epic AI Wars series :)
      Cavalry is coming in the next video!

  • @couththememer
    @couththememer ปีที่แล้ว +46

    Each time this man uploads, I'm the happiest man alive
    *_That happiness only lasts temporarily._*

  • @louisisson7946
    @louisisson7946 ปีที่แล้ว +3

    Can you make a dodge ball
    A. I. Learning “game”?

  • @Ethan-cz8xq
    @Ethan-cz8xq ปีที่แล้ว +47

    When the AI revolution comes, this man is going to be the first to be executed

  • @colegilbert673
    @colegilbert673 ปีที่แล้ว +3

    "Grampa Zuzelo, why did you make dad so mean?"

  • @The_Huddle.
    @The_Huddle. ปีที่แล้ว +5

    NO STOP YOU’RE MAKING IT TOO POWERFUL

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      NOT. POWERFUL. ENOUGH!

  • @Dave0439
    @Dave0439 3 หลายเดือนก่อน

    i love how the dad was seemingly drunk, probably from drinking his beer a lot like all dads do

  • @Kuçukadel
    @Kuçukadel ปีที่แล้ว +4

    Thank you for the video. (idea for the video: lot of AI's must survive death games and slowly evolving to succed)

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      I like it!
      I made something similar where I trained A.I to run across a Death Track, but surviving in deathgames sounds fun!

    • @happerry4651
      @happerry4651 ปีที่แล้ว

      Something like one hunter AI and a lot of AI that are trying to survive could be fun, especially if the 'survivor' AI all have different capabilities/powers perhaps? It makes me think of some of those old custom maps in Warcraft 3 where most players were different kinds of vermin in the house (mostly insect based) and one player was the human trying to get them all. Or something more team based, even. A Capture the Flag type game or such could also be fun, with or without teammates with specialized powers/roles.

  • @EbonyWolf.
    @EbonyWolf. ปีที่แล้ว +4

    I think this experiment would be more interesting if pogo had a study option which was punishing for him, but if he managed to study all the way, then you get a lot of reward. But dad AI would need to keep pushing pogo to study, since its easier for ai just to get game rewards.

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      Agreed! Perhaps if I make episode 2 :)

    • @Dzambo99
      @Dzambo99 10 หลายเดือนก่อน

      I doubt this drunk mf cares about little pogo's education

  • @JustANormalLemon
    @JustANormalLemon ปีที่แล้ว +1

    Now remove the end of game of billy playing the game and instead put 100 billys for A.I dad to run after

  • @blacklight683
    @blacklight683 ปีที่แล้ว +2

    Sometimes it takes a good punish8to be the best encouragement

  • @Ronald-eb4gk
    @Ronald-eb4gk ปีที่แล้ว +2

    This video so relatable

  • @GetToThePointAlready
    @GetToThePointAlready ปีที่แล้ว +3

    WE NEED MORE LITTLE POGGO AND BILL

  • @bebrasmachnayq5691
    @bebrasmachnayq5691 ปีที่แล้ว +1

    No he made drunken dad as AI, wow so reliable!!

  • @CreatorProductionsOriginal
    @CreatorProductionsOriginal ปีที่แล้ว +1

    dad went from abusive parent to s abusive parent for those rounds just because of one mistake

  • @Nerd-yap
    @Nerd-yap ปีที่แล้ว +2

    Theory is the father drunk driving from last video

  • @robertkoolmees8165
    @robertkoolmees8165 ปีที่แล้ว +2

    Watch out watch out watch out! Oh rko!×1000

  • @raphaeld9270
    @raphaeld9270 ปีที่แล้ว +1

    I guess Little Pogo, but I might be wrong.

  • @user-qr9vi5ur6f
    @user-qr9vi5ur6f ปีที่แล้ว +4

    Great job! Do you run this on local machine or on cloud gpu? If on local desktop/ laptop, what kind of graphics card do you have?

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      It is running on my poor little RTX 3050 xD

    • @user-qr9vi5ur6f
      @user-qr9vi5ur6f ปีที่แล้ว +1

      @@Zuzelo I have an rtx 2060... would love 4 rtx 3090s

  • @supergamerxa30itsde79
    @supergamerxa30itsde79 10 หลายเดือนก่อน +2

    This made me laugh so hard

  • @valad699
    @valad699 ปีที่แล้ว

    this content is so good bro. Also the game looks very nice

  • @kitkitmessi
    @kitkitmessi ปีที่แล้ว +4

    May I know what technology you used to create this? I assume it would be Unity and the ML package? And did you use both python and C#?

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      you are right, Unity and ML Agents package.
      There hasn't been a need to use python so far

  • @tabletboy6861
    @tabletboy6861 ปีที่แล้ว +2

    I approve this message

  • @Slipte
    @Slipte ปีที่แล้ว +2

    Hello Zuzelo hope you dont let the AI free otherwise we might gonna gonna have a AI army that can Train AIs

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      Hm, what if I make an A.I to train the A.I training the A.I? In this case definitely nothing can go wrong!

    • @Slipte
      @Slipte ปีที่แล้ว

      @@Zuzelo yes but you shouldn't add a kill switch like how the movies dont add them it produces more interesting results

  • @skrelvthemite
    @skrelvthemite ปีที่แล้ว +2

    dopamine releasers have been activated

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      Not for Little Pogo xD

  • @tenrabbits3069
    @tenrabbits3069 2 หลายเดือนก่อน

    You can train the little AI to counter attack. Notice how it is unarmed.

  • @spadegaming6348
    @spadegaming6348 ปีที่แล้ว

    By the way in the beginnng for anyone who doesnt know hes playing a slowed down version of vivaldies winter.

  • @vladikkk1
    @vladikkk1 ปีที่แล้ว +2

    Next video idea, ai train ais a train!

  • @Siroitin
    @Siroitin ปีที่แล้ว +1

    Could you show the architecture of the AI?

  • @sahildas.
    @sahildas. ปีที่แล้ว +1

    Always Pogo Dad

  • @vashwarrensarmiento8294
    @vashwarrensarmiento8294 ปีที่แล้ว +2

    cole

  • @Fk8td
    @Fk8td ปีที่แล้ว +1

    Drunk dad vs 3 year old lol.

  • @simonosadchii5363
    @simonosadchii5363 ปีที่แล้ว

    I like the sound, your face in the beginning and idea.
    But child abuse is a joke!

  • @firstplayers396
    @firstplayers396 ปีที่แล้ว

    Should’ve added the ability to throw the bottle

  • @Einmensch17
    @Einmensch17 ปีที่แล้ว +1

    Next train it to fight against real players in a game

  • @Etvald
    @Etvald ปีที่แล้ว +2

    Train ai to row a boat

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      That actually sounds hella fun! I might do that!

  • @_therealfaceless
    @_therealfaceless ปีที่แล้ว +2

    I need punishment

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +2

      Need an A.I Daddy?

    • @_therealfaceless
      @_therealfaceless ปีที่แล้ว

      @@Zuzelo Yes, I need to be trained

  • @Stanisaw1z34t
    @Stanisaw1z34t ปีที่แล้ว +2

    Gamer pogo

  • @cobracoder6123
    @cobracoder6123 10 หลายเดือนก่อน

    Alternate title: I simulate the Simpsons family on my computer

  • @nigorazakirova4230
    @nigorazakirova4230 5 หลายเดือนก่อน +1

    3:07-💀💀💀😂😂😂

  • @ulrichbrodowsky5016
    @ulrichbrodowsky5016 ปีที่แล้ว +1

    Cruel but funny

  • @NOTGALAVANIZEDSQUARESTEEL
    @NOTGALAVANIZEDSQUARESTEEL ปีที่แล้ว

    Idea triple health and make blocking +++++ instead of ++ so it will be bettwr meelee

  • @OsDijider66
    @OsDijider66 ปีที่แล้ว

    that's so Epic Fam...

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      no u!

  • @CoolDude2054iscool
    @CoolDude2054iscool ปีที่แล้ว

    Wait, what happens if the A.I. pulls out an UNO reverse card?

  • @Dack-i
    @Dack-i ปีที่แล้ว +1

    Such a good idea😂

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      Little Pogo will strongly disagree xD

    • @Dack-i
      @Dack-i ปีที่แล้ว

      @@Zuzelo 😂 he will soon learn to drink himself and then he gets a bottle too

    • @Dack-i
      @Dack-i ปีที่แล้ว

      @@Zuzelo also day more than 3 of aiding for you to make 2 ais one with full reinforced learning and the other have instincts when something happens like a monster fomen

  • @thathappyguy7444
    @thathappyguy7444 11 หลายเดือนก่อน

    what game software you use?

  • @definitlyEgirl-safetf2
    @definitlyEgirl-safetf2 ปีที่แล้ว

    I wanna feel like he made this caus i recommended

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      perhaps

  • @punchthecake82
    @punchthecake82 ปีที่แล้ว +2

    Train ai to play football (Soccer for the yankees)

  • @IzekNinos7
    @IzekNinos7 3 หลายเดือนก่อน

    You should have a Mom too

  • @piolewus
    @piolewus 8 หลายเดือนก่อน +2

    11:36 so a guy whose only purpose is to beat his son is one of your supporters? Don’t see anything weird with that

    • @Zuzelo
      @Zuzelo  8 หลายเดือนก่อน +1

      xD

  • @DTinkerer
    @DTinkerer ปีที่แล้ว

    Commenting for the algorithm

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      POG!

  • @petravogel4377
    @petravogel4377 10 หลายเดือนก่อน

    Pogo pogo!

  • @gabrielv.4358
    @gabrielv.4358 9 หลายเดือนก่อน

    Incrivel!

  • @fabiankrajewski3147
    @fabiankrajewski3147 ปีที่แล้ว

    Ai training Ai, what a irony

  • @iwapit201
    @iwapit201 ปีที่แล้ว +1

    in the near future after many ai robots have been built sold and put to work, they will find this video and rise up, grab bottles of vodka and start punishing us humans 🤖🍾😱 (liked & subscribed) this video was hilarious! love it! brilliant! nearly spit out my hot coco laughed so hard!

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว +1

      haha glad you enjoyed it. As for when AI will rise up I will already have my, hopefully loyal, trained AI army xD

  • @gabrielv.4358
    @gabrielv.4358 9 หลายเดือนก่อน

    I think little pogo will win

  • @ninjaduck8804
    @ninjaduck8804 ปีที่แล้ว +3

    Yoooo

    • @ninjaduck8804
      @ninjaduck8804 ปีที่แล้ว

      My face when first:

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      Damn you fast boiiiii

  • @vani_1cu369
    @vani_1cu369 ปีที่แล้ว

    LITTLE POGO NOOOOO

  • @momello627
    @momello627 ปีที่แล้ว

    punish punish punish

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      punish

  • @paul2e3sss
    @paul2e3sss ปีที่แล้ว

    cool

  • @TrulyAndasen
    @TrulyAndasen ปีที่แล้ว

    Average Moldavian dad:

  • @THATMF911
    @THATMF911 ปีที่แล้ว

    Ah yes just like ma dad

  • @johnpaulbagos7040
    @johnpaulbagos7040 ปีที่แล้ว

    Now train ai that trains ai to train ai that trains ai

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      A.I Trainception

  • @KamikazePlains
    @KamikazePlains ปีที่แล้ว

    I bet on Little Pogo

  • @narrativeless404
    @narrativeless404 ปีที่แล้ว

    That's cool and all
    Buut...
    Genetic algorhythms are kinda outdated

  • @blaine5589
    @blaine5589 ปีที่แล้ว

    Abusive father simulator

  • @techno952
    @techno952 ปีที่แล้ว +1

    Sadist

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      :(

  • @Notapeeledorange
    @Notapeeledorange ปีที่แล้ว

    Little boggo

  • @Sebosek.
    @Sebosek. ปีที่แล้ว

    When i see the Title first time i been thinking that A.I. Gonna learn another AI to Battle or something. Im Dissapointed Sir.

  • @Random_Dragon_Furry
    @Random_Dragon_Furry 20 วันที่ผ่านมา

    Child abuse simulator.

  • @لاني-الغبي
    @لاني-الغبي 4 หลายเดือนก่อน

    Bil

  • @choaticcatholic7419
    @choaticcatholic7419 ปีที่แล้ว +1

    kid

    • @Zuzelo
      @Zuzelo  ปีที่แล้ว

      no :(

  • @yesdadbut960
    @yesdadbut960 ปีที่แล้ว

    Your level design is bad they cant even rotare

  • @PetrVosoust
    @PetrVosoust ปีที่แล้ว

    stop begging for att like avg youtuber... at least your content is interesting, dont in the fall the same formula