AI Learns to DESTROY old CPUs | Mario Kart Wii

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 พ.ค. 2024
  • #ai #mariokart #reinforcementlearning
    AI uses a variant of the Reinforcement Learning algorithm Rainbow DQN to learn how to play SNES Ghost Valley 2 against Mario Kart Wii's own CPUs
    0:00 Intro
    2:46 Training
    4:54 A bit better
    7:58 Final AI
    If you are a C++ developer please help on this GitHub! Would allow me to get AI to play many other cool games.
    github.com/Felk/dolphin/issue...
    Also thank you to my editor for editing this video!
    @benji.botterill
    www.benji-bott.com
    Music:
    1) 8-bit Sheriff - Wave Saver
    2) A bit of power - Soara
    3) Press X Twice - Lexica
    4) Through the prism - Ava low
    5) Backseat rider - Lupus Nocte
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 1.6K

  • @memcgee7832
    @memcgee7832 8 หลายเดือนก่อน +7539

    *turns on items*

    • @aitango
      @aitango  8 หลายเดือนก่อน +1829

      Perhaps in my next video your wish will be granted ;)

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +110

      @@aitango if you are ready, so am I

    • @mkgeostar
      @mkgeostar 8 หลายเดือนก่อน +87

      ​@@aitangomake it learn mushroom peaks

    • @kaishidow
      @kaishidow 8 หลายเดือนก่อน +24

      ​@@aitangoYou know the rules, and so we are

    • @manoahmelching1415
      @manoahmelching1415 8 หลายเดือนก่อน +25

      @@kaishidowNice try but those aren’t the lyrics

  • @aname4390
    @aname4390 8 หลายเดือนก่อน +2873

    Not having offroad on this track really seems to make it easier for the AI. cool stuff!

    • @aitango
      @aitango  8 หลายเดือนก่อน +581

      Yeah true, it helps the AI as if it does something wrong (like hit a wall or fall off), the feedback is pretty instant, making it clear what the AI needs to avoid!

    • @Mmmeeee4
      @Mmmeeee4 8 หลายเดือนก่อน +50

      @@aitango have you though about how it might deal with items and if it could learn how to use them as well?

    • @CaptainBuggyTheClown
      @CaptainBuggyTheClown 8 หลายเดือนก่อน +11

      @@aitango Can you not program it to detect the dif maps roads and whats off road?

    • @stormycd392
      @stormycd392 8 หลายเดือนก่อน +25

      @@CaptainBuggyTheClown
      ​​⁠
      He *could* give the AI a preconceived idea of what the tracks are is, but that defeats the purpose of having the AI Learn Mario Kart.
      Not just how to play Mario Kart, but learning how to recognize what bottomless pits are, optimize driving, and hell, even learn what driving *is* based on the 2 goals given to it.
      He’s not teaching the AI what Mario Kart is, the AI is learning by itself to win by wanting to maintain speed, and Strive for first place.
      If I understood him right, the AI might not even know what a race is, as much as it’s just trying to keep 2 numbers up very high, and is determining what actions will allow it to do that with the best possible advantage.

    • @dougmasters4561
      @dougmasters4561 8 หลายเดือนก่อน

      What is a reward so far as its concerned?

  • @omegahaxors3306
    @omegahaxors3306 8 หลายเดือนก่อน +1587

    I love how aroused the reward function gets right before they cross the finish line on their final lap.

    • @aitango
      @aitango  8 หลายเดือนก่อน +597

      I gave it a big reward for finishing, so it really makes the reward spike which I guess for an AI counts as arousal haha

    • @LilacMonarch
      @LilacMonarch 8 หลายเดือนก่อน +460

      how do i delete someone else's comment

    • @emanwonkon2205
      @emanwonkon2205 8 หลายเดือนก่อน +144

      @@LilacMonarch says the (admittedly adorable) furry

    • @OtisGaming54
      @OtisGaming54 8 หลายเดือนก่อน +80

      ​@@LilacMonarchsays the (creepy) furry

    • @offbranddorito9668
      @offbranddorito9668 8 หลายเดือนก่อน +132

      @@LilacMonarchsays the (neutral as the furry is a stranger) furry

  • @IrishAnonymous01
    @IrishAnonymous01 8 หลายเดือนก่อน +733

    I appreciate how you do these on actual software and not just code your own software version of the game to simplify the task.

    • @GunnerSiIva
      @GunnerSiIva 8 หลายเดือนก่อน +47

      Coding again Mario Kart Wii wouldn’t be easier

    • @IrishAnonymous01
      @IrishAnonymous01 8 หลายเดือนก่อน +185

      @@GunnerSiIva yes but a lot of these "AI learns game" channels code a simplified version of the game that they can integrate their AI with easier

    • @manelneedsaname1773
      @manelneedsaname1773 8 หลายเดือนก่อน +31

      Truuue, lowkey annoying when they do that

    • @Granolora
      @Granolora 8 หลายเดือนก่อน +15

      ​​@@manelneedsaname1773yeah, cause its not really beating mariokart if its not even the real mariokart, is it?

    • @theraytech54
      @theraytech54 8 หลายเดือนก่อน +33

      @@IrishAnonymous01 That's because you need a programming interface to interact with the game. In some cases, coding the game from scratch is just way easier !

  • @DrizzyJeremiah
    @DrizzyJeremiah 8 หลายเดือนก่อน +761

    it's so crazy to see the AI even go for the lowtrick during the shortcut after 48-80 hours of training lol

    • @aitango
      @aitango  8 หลายเดือนก่อน +197

      In most of the Mario Kart stuff I've done with AI, it always seemed to do low tricks surprisingly well which always baffled me, as I have no idea why I learned that so fast

    • @DrizzyJeremiah
      @DrizzyJeremiah 8 หลายเดือนก่อน +94

      @@aitango Yeah, it does hop a lot at the start so maybe it's just something it does on accident then realises it can get the most amount of points there by lowtricking. It's so cool how your reward system makes it unintentionally learn it though lol

    • @BeMaeDoCrimes
      @BeMaeDoCrimes 8 หลายเดือนก่อน +19

      With the right rewards it might figure out the instant finish glitch.

    • @the_platapus3522
      @the_platapus3522 8 หลายเดือนก่อน +1

      The fact it pulled the shortcut lff at 12 hours of training is still very impressive

    • @kultw1837
      @kultw1837 8 หลายเดือนก่อน

      Whats so crazy about it?

  • @Plazix999
    @Plazix999 8 หลายเดือนก่อน +9

    Ok, but right as you change the map the ai starts screwing up again. People need to start training ai’s to actually use the positioning of the walls, jumps, etc. to make them actually play instead of just following a list of instructions that was mutated until perfection.

  • @dielachmowe8613
    @dielachmowe8613 8 หลายเดือนก่อน +269

    I think it'd be pretty interesting to see it compete against the staff ghost of that track once it learns to use items such as mushrooms.

    • @juanca2825
      @juanca2825 หลายเดือนก่อน +4

      That would be so hard for it to learn. Using a shroom would give the AI huge reward (Thats a lot of speed), so it would most likely just use the 3 shrooms one after the other every time.

  • @EVPointMaster
    @EVPointMaster 8 หลายเดือนก่อน +833

    Have an AI and a human learn a game at the same time.
    Would be very interesting to see how the learning curves differ.

    • @bobkreme2175
      @bobkreme2175 8 หลายเดือนก่อน +211

      Chess...
      Average human after 80 hours - "wait, en passant is actually a rule?"
      Engine after 80 hours -
      *able to beat the best human in the world*

    • @greattitan371
      @greattitan371 8 หลายเดือนก่อน +64

      ​@@bobkreme2175"Send me your god, I must consume their ELO"

    • @BrawlLegendLink
      @BrawlLegendLink 8 หลายเดือนก่อน +82

      Of course it heavily depends on the task and the model architecture, but humans will generally learn MUCH faster. The AI however has the much higher ceiling.
      For real-time usecases that is. If the AI can play millions of games in the same time a human plays one it's obviously different

    • @MikeyCyan
      @MikeyCyan 8 หลายเดือนก่อน +27

      ​@@bobkreme2175yea 80 hours is such a large sample size for the AI to body us.
      I have I feeling we win first hour or two, then the AI starts beating us with the tricks we showed it

    • @lajont
      @lajont 8 หลายเดือนก่อน +17

      It wouldn't be a fair comparison at all. If we are talking about a regular game (let's say mario cart), the human player (even if this person has never seen mario cart before) would start with prior knowledge, since the game is designed to be easily understood by humans. The AI on the other hand would not understand anything, and play like a human when the screen is disconnected, but maybe quicker on changes in state. If a human player played as if this was a full time job for 2 weeks (80 hrs), there would be an improvement, but the progress would probably look fairly linear (or would maybe flatten a bit if completely new to the game), while the AI would have a much larger rise in the first few hours, which would flatten quite much after just a few hours. The change in the AIs curve would most likely make the human curve look constant in comparison.
      If we would do this in a more fair manner, the "game" should have a few buttons that could be pressed, and randomized pixel graphics that change depending on the inputs and timings, with a scoring system that gives points based on some determined criteria, which in the start is seemingly random to both the player and the AI. In this case the AI would win consistently over the player almost certainly all the time with a scoring gap that would increase with every single run.

  • @MoreInsane96
    @MoreInsane96 8 หลายเดือนก่อน +89

    Imagine leaving it on for thousands of hours and learning how to do the Ultra Shortcut

    • @myithspa25
      @myithspa25 8 หลายเดือนก่อน

      The what

    • @opo33333
      @opo33333 8 หลายเดือนก่อน +11

      That's sadly improbable fue to how the rewards are set up

    • @BrawlLegendLink
      @BrawlLegendLink 8 หลายเดือนก่อน +24

      With this manually set up reward system, it would be impossible because it wouldn't even be optimal for the AI. It technically isn't learning how to beat the track in the fastest way possible, it is learning how to maximize the handcrafted reward function.

    • @jeremycase3047
      @jeremycase3047 หลายเดือนก่อน

      @@myithspa25 look up history of unltrashortcuts by Summoning Salt

  • @legendlambohcr2785
    @legendlambohcr2785 8 หลายเดือนก่อน +196

    love how the reward bars perfectly sync with the music for a second at 6:34-6:36

    • @aitango
      @aitango  8 หลายเดือนก่อน +33

      I didn't even notice that haha

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +21

      As the editor I can tell you this is a beautiful coincidence

    • @skyscrapersx5877
      @skyscrapersx5877 หลายเดือนก่อน

      @@aitango do you watch the videos though or when they are done they are done

  • @uppeltje1532
    @uppeltje1532 8 หลายเดือนก่อน +245

    It's fascinating to see how much AI has improved as a whole, not only in the game itself but in general. 5 years ago we wouldn't have been able to do this much with AI already. Also you're criminally underrated, keep it up!

    • @aitango
      @aitango  8 หลายเดือนก่อน +30

      Yeah its pretty amazing, it feels like new AI research is coming out constantly and it just keeps getting better and better so fast. Thank you so much, I really appreciate the kind words!

    • @an2thea514
      @an2thea514 8 หลายเดือนก่อน +17

      Depends. This is hardly different from MarI/O, an AI that happened years ago.

    • @frl_panda
      @frl_panda 8 หลายเดือนก่อน +2

      Simply: a electric powered programmable calculator, calculating how to get the most reward numbers by trying every possible input. It's not really much more than a algorythm that saves it's inputs and at what time it has to do said inputs to get the highest reward number possible.

    • @mediocrestu8238
      @mediocrestu8238 8 หลายเดือนก่อน +16

      nah this stuff was definitely possible when I took an AI class over 10 years ago

    • @Essence1123
      @Essence1123 8 หลายเดือนก่อน +10

      This is just factually wrong. Here's a MUCH more advanced mario kart AI by Sethbling (actually learning the *game* not just a track) from 4 years ago. th-cam.com/video/Tnu4O_xEmVk/w-d-xo.html and 3 years before that he did the same thing but with Super Mario World. This stuff (and much more advanced) has been available for over a decade easily.

  • @dbeast03
    @dbeast03 8 หลายเดือนก่อน +55

    For a bit of reference, sub 1 minute on this track is a decent time for a human in time trials, and a rather good time without powerups in vs mode, so a mid 1:06 is intriguing for sure, definitely showing some potential room for improvement but also demonstrating some clear progress to go with the gaps in it's methods.

    • @aitango
      @aitango  8 หลายเดือนก่อน +12

      The second to last clip was slightly faster, managing to get 1:05. There's definitely still room for improvement though, mainly missing the odd mini-turbo and missing the boost panel at the end

  • @holepino
    @holepino 8 หลายเดือนก่อน +168

    I’m really curious how the AI was able to take input from the game. Was it using computer vision to actually process the whole screen in real time or was it integrated with the game engine in some way that let it get position data? I’m especially curious how it dealt with knowing where the other CPUs were and responding to bumps from them

    • @natew4724
      @natew4724 8 หลายเดือนก่อน +2

      +1

    • @MarshMello00
      @MarshMello00 8 หลายเดือนก่อน +38

      When looking at 0:40 , I assume they've used the package dolphin-memory-engine and has access to read the memory.

    • @NFITC1
      @NFITC1 8 หลายเดือนก่อน +29

      It would have to be using some kind of memory hooks. Whole screen processing is very slow and probably couldn't handle processing a race in real-time. It also wouldn't know what speed it was going just by screen alone.

    • @edsheeransdad9985
      @edsheeransdad9985 8 หลายเดือนก่อน +22

      I'm curious too. It'd be very underwhelming if the AI is essentially blind and doing this by trial and error. It seems like sometimes it just runs into a wall.

    • @Toby-Wan-Kenobi
      @Toby-Wan-Kenobi 8 หลายเดือนก่อน

      +1

  • @foosamooncreations5497
    @foosamooncreations5497 8 หลายเดือนก่อน +55

    No way this video has so little views! You put so much effort into this, keep up the good work, definitely subbing.

    • @aitango
      @aitango  8 หลายเดือนก่อน +5

      Thank you so much, always great to hear! Will look to keep making more content

    • @That_Sonic_fan
      @That_Sonic_fan 8 หลายเดือนก่อน

      Me too man

    • @That_Sonic_fan
      @That_Sonic_fan 8 หลายเดือนก่อน +1

      I’ll check the other videos

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Thanks, will really help me out!

  • @alexandrialee1705
    @alexandrialee1705 8 หลายเดือนก่อน +139

    i'm very happy this popped up in my recommendations; i'm on the spectrum and i've had pretty consistent mario kart hyperfixation periods since about 2017, and recently I've been very interested in the process of machine learning, so this video was basically a perfect match of two of my special interests. target audience reached! :)

    • @aitango
      @aitango  8 หลายเดือนก่อน +25

      I'm really glad you liked the video! I remember years ago something similar happened to me! Back around 2019 I was really into StarCraft 2, so when Google released an AI to play it just as I was getting into AI it absolutely blew my mind!

  • @harrylane4
    @harrylane4 8 หลายเดือนก่อน +9

    I like how the reward skyrockets at the end of lap 3, as if it’s getting excited

    • @aitango
      @aitango  8 หลายเดือนก่อน +4

      Yeah that final lap finish is like a drug to the AI haha

  • @luiscossio296
    @luiscossio296 8 หลายเดือนก่อน +16

    Having the AI receive greater rewards based on its speed seems to work great. It sounds like common sense, but every other attempts I've seen at training racing AI just sets checkpoints and rewards for crossing them, no variable to increase the reward if the time between checkpoints is lower or the speed higher.

  • @sam_bamalam
    @sam_bamalam 8 หลายเดือนก่อน +77

    I’m really curious about two things. 1) How we're rewards determined? You showed us how you received rewards as a human, but I'm left curious about how rewards are calculated. 2) What difficulty are the CPUs set to? And how did that affect it?
    Really neat to see the results, but so many more questions have arrived!

    • @antenant9294
      @antenant9294 8 หลายเดือนก่อน +5

      I was wondering exactly the same.
      It feels as though the AI is just reverse-engineering the reward algorithm, where the algorithm needs to be known in advance in order to determine "successful driving".
      I think I've missed something in the explanation here.

    • @AidenJukes-oh9lm
      @AidenJukes-oh9lm 8 หลายเดือนก่อน

      I kind of just want to know what difficulty they were set to

    • @Guaycuru
      @Guaycuru 8 หลายเดือนก่อน +1

      I wanted to know that too, how are rewards calculated?

  • @NekoNinja13
    @NekoNinja13 8 หลายเดือนก่อน +2

    good job Ai-chan. im proud of you for improving so much ❤

  • @aspertomrine
    @aspertomrine 8 หลายเดือนก่อน +7

    This is probably one of the more original ideas I've seen about MKWii, I have not seen any other video like this but this is great!

  • @timberlin4075
    @timberlin4075 8 หลายเดือนก่อน +4

    pretty cool the ai learned to drive like someone really trying to avoid wheelie bumps

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      There's a good chance that was on purpose, as getting wheelie bumped causes a massive drop in speed and therefore reward. I actually trained other mario kart AIs which avoided wheelieing for a really long time because of this

  • @batlin
    @batlin 8 หลายเดือนก่อน +16

    Would love to see more details about how intermediate rewards were determined and how game state was encoded, like was it seeing a downscale greyscale screen grab, or peeking at position and map data in emulator memory?

  • @mattlan37x21
    @mattlan37x21 8 หลายเดือนก่อน

    Really good video, excited to see where this channel is going

  • @sitbar1
    @sitbar1 8 หลายเดือนก่อน +18

    This is some real quality Mario Kart Wii content! Glad this popped up in my recommendations, looking Forward for part 2 :D

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Really glad you like it! I'll make sure to make the next part as good as possible!

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +1

      Next video should be out on Friday 1st September

  • @blahblahblahblahblahblahblahbl
    @blahblahblahblahblahblahblahbl 8 หลายเดือนก่อน +3

    Wtf how does this have only 1.3k views… this is sick. Really cool video.

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Really glad you like it! I'm hoping it gets lots more haha

    • @blahblahblahblahblahblahblahbl
      @blahblahblahblahblahblahblahbl 8 หลายเดือนก่อน

      It seems like you are on the way there already! 😄 @@aitango

    • @blahblahblahblahblahblahblahbl
      @blahblahblahblahblahblahblahbl 8 หลายเดือนก่อน

      Congrats on 100k!!

    • @barneybetsington7501
      @barneybetsington7501 8 หลายเดือนก่อน +1

      I agree, it is sick. Utterly disgusting. Videos like this should be outlawed!

  • @hugom2317
    @hugom2317 8 หลายเดือนก่อน +26

    Hey, great video as always ! That's be cool to see you race the AI, maybe someday !

    • @aitango
      @aitango  8 หลายเดือนก่อน +6

      Thanks! I'll have to give it a try at some point

  • @theawesome8155
    @theawesome8155 8 หลายเดือนก่อน +1

    You have no idea how long I’ve been waiting for something like this to happen!!

  • @PinkyNardo
    @PinkyNardo 8 หลายเดือนก่อน +8

    The fact that self learning AI has become an innovation for success, a lot of gaming companies are using it to provide a much more Dynamic and Random experiences for gamers. Can't wait to see how it evolves. This video is a great way to show, Practice makes perfect. A real life skill that must be taken into account! ❤

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      I've always thought that would be such a cool idea, infact I would love to do this for job if it becomes popular! Really glad you liked it!

    • @PinkyNardo
      @PinkyNardo 8 หลายเดือนก่อน

      @@aitango Honestly most artificial intelligence in games use random number generation but never actually learn themselves they just use outcomes defined by numbers not outcomes defined by what it actually knows and what it doesn't. I feel like learning itself is a much better way of it to become overly powerful over current methods used to day with artificial intelligence in video games. it's a hard prospect to grasp but if you ever decide to reach out for some company or get a job for this specifically, please let us know about it, As I would definitely like to follow you on that journey!

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Will do! I think it could lead to so much more interesting gameplay so would love to be apart of it

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +1

      Well said

    • @theraytech54
      @theraytech54 8 หลายเดือนก่อน

      @@aitango Some companies are already working on that like Sony AI, Unreal Engine Learning Agents... But please keep in mind we don't necessarily want our butts kicked as players 😂 Sometimes, predictible AIs are funnier to play against !

  • @danythomas550
    @danythomas550 8 หลายเดือนก่อน +22

    Imagine if you could replace the original CPUs with this better AI what hard challenges would come out

    • @aitango
      @aitango  8 หลายเดือนก่อน +8

      Yeah I would love to see, I could imagine Mario Kart TH-camrs doing vs AI challenges and stuff like that

    • @alpha4935
      @alpha4935 8 หลายเดือนก่อน +2

      ​@@aitangoYou might want to reach out to the MKW modding community to see if its feasible. That idea is overflowing with potential.

    • @catsup1308
      @catsup1308 8 หลายเดือนก่อน +1

      You can already race ghosts of other fast players through CTGPR

  • @louie3975
    @louie3975 8 หลายเดือนก่อน +6

    Great video as usual, like i have mentioned in a community post making the AI learn the track moonview highway would be a great video imo

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Thanks! Yeah Moonview would definitely be a great track to try as the AI would have to try so hard to learn to avoid cars haha

    • @redrangers12330
      @redrangers12330 8 หลายเดือนก่อน +1

      Having it use an outside drift kart on moonview highway would be sick

  • @salih2d486
    @salih2d486 8 หลายเดือนก่อน +2

    I love the subtitles helps watching the video with out audio

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      I'm glad you find them helpful!

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน

      Glad they are helpful!

  • @nickdoesnotlikepie
    @nickdoesnotlikepie 8 หลายเดือนก่อน +2

    Would love to see a tutorial series on how you could get something like this running on your own PC. It would be so interesting to try this with all sorts of different variables. Great content man, very interesting!

  • @ModBros8434
    @ModBros8434 8 หลายเดือนก่อน +22

    Cool video. It's nice to see how sophisticated these AIs are getting, especially during the livestream you did. I think it would be interesting to train an AI with other players online with Wiimmfi. Might have to check if that's allowed first, though

    • @aitango
      @aitango  8 หลายเดือนก่อน +11

      Thanks! Glad you saw the livestream, I wasn't sure how it would go! Yeah I'm not sure, I think the AI would definitely get banned for cheating since it would probably set off any bot detection that exists. Would be really cool to try though, would love to get in touch with some of the Wiimmfi Devs and try and make it happen though

    • @Leseratte
      @Leseratte 8 หลายเดือนก่อน

      @@aitango Hi AI Tango, Wiimmfi dev here. TH-cam seems to remove my comments for some reason (spam filter) - feel free to contact me through the info on my youtube profile.

    • @ewoly5
      @ewoly5 8 หลายเดือนก่อน +8

      @@aitango you can set up a private room and livestream the bot racing to encourage people to join to try and race the ai

  • @fastestghost
    @fastestghost 8 หลายเดือนก่อน +6

    Fascinating video man, I wonder how far this can go if someone put lots of time into perfecting this AI. Nice work.

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Thanks a ton!

  • @BoazVr
    @BoazVr 8 หลายเดือนก่อน +1

    thanks so much, the music, the clips, the talking, so good, this absolutly made my day better

    • @aitango
      @aitango  8 หลายเดือนก่อน

      So glad you enjoyed it, great to hear such kind comments!

  • @cascadegaming1608
    @cascadegaming1608 8 หลายเดือนก่อน

    This is amazing content! I will be sticking around for more like this.

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Glad you enjoy it, and good to hear you’re looking forward to more!

  • @rasmuspedersen4891
    @rasmuspedersen4891 8 หลายเดือนก่อน +3

    now we just need it to train on multiple tracks!
    on a side note, is it possible to increase the playback speed of the game in dolphin to speed up the training?

    • @aitango
      @aitango  8 หลายเดือนก่อน +2

      That's definitely coming at some point in the future! I'll have to get some suggestions for tracks I should do! It is possible, and while training this AI the game was actually sped up! The AI also plays multiple games at once (4 games in parallel) to further increase the speed!

    • @rasmuspedersen4891
      @rasmuspedersen4891 8 หลายเดือนก่อน

      @@aitango nice!
      I'd suggest doing Mario raceway (N64), Mario circuit (DD), Peach gardens (DS) and Luigi circuit (Wii) since they're kinda visually similar

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Yeah those are some good ideas, you would hope the AI should be able to use its learnt knowledge on all of those tracks since they're similar. I think Peach Gardens could be interesting though, at the start the AI might end up going around the round about at the start haha

  • @FranXiT
    @FranXiT 8 หลายเดือนก่อน +5

    Keep it up, love these vids

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Thanks, will do! Really glad you're enjoying it!

  • @nirraku
    @nirraku 8 หลายเดือนก่อน

    This is so cool! Great video

  • @pepdog1
    @pepdog1 8 หลายเดือนก่อน

    I know very little about Mario Kart strats but this was a chill and fun video to watch
    also kudos for putting music credits in the description, now I've added a few to my spotify lmao

  • @TheCarpenterUnion
    @TheCarpenterUnion 8 หลายเดือนก่อน +3

    Honestly the most rudimentary AI training video I've seen in 3 years

  • @dinglemcdangledongle
    @dinglemcdangledongle 8 หลายเดือนก่อน +3

    I'd say keeping it on so long was worth it, it looked really good at the end

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Yeah I was really happy with how it turned out, I might consider running others for a while longer!

  • @julienarmand8065
    @julienarmand8065 7 หลายเดือนก่อน +2

    Amazing video! I'm curious to see what happens if we modify the reward function as performance improves to assist the AI in learning new elements or overcoming plateaus, such as taking the yellow boost at the end. This could involve manually adjusting the reward function or using some form of meta-learning.

  • @bluewuppo
    @bluewuppo 8 หลายเดือนก่อน +1

    incredible video. cannot wait to see more from this channel ❤

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Thank you so much!! Will try my best to keep new videos coming!

  • @harubutjojo
    @harubutjojo 8 หลายเดือนก่อน +6

    I want to see how an AI would cope on a track with more randomness such as koopa cape or toad's factory. I would also like to see how would fare with items on.

  • @picklejack1221
    @picklejack1221 8 หลายเดือนก่อน +5

    You should see how long it would take for the AI to beat a world record in mario kart!

  • @pinkthun2944
    @pinkthun2944 8 หลายเดือนก่อน +2

    This was a good ass video man! Keep doing what you do and success will come your way 🙏

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน

      We are gonna try and post a video every 2 weeks on Fridays, thank you for the support. Takes a while to train the AI, create voice overs and edit. We really appreciate the love. 🙏

  • @stardust-kyun
    @stardust-kyun 8 หลายเดือนก่อน +11

    Absolutely incredible! Question: does this AI only work for Ghost Valley 2? It seems like with its learning process, it would brute force other tracks rather than making decisions based on the track itself. It would be interesting to see an AI that can learn new tracks quickly!

    • @aitango
      @aitango  8 หลายเดือนก่อน +21

      Thanks! So this AI was only trained for Ghost Valley 2, so would likely struggle if I put it down on another track. If however it was trained on many different tracks at once, the AI would likely start to understand how tracks really work, rather than a specific track, so could probably try tracks its never seen before! This is definitely something I'm looking to do in the future!

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +5

      @@aitango not sure how hard it would be to do, but love this idea and we should make it happen.

    • @LankyMF
      @LankyMF 8 หลายเดือนก่อน +2

      ​@@aitangoOooh, you could have one version of this AI (Player 1) learn every track in sequence, and each race put it up against a fresh AI in the Player 2 slot to see how much P1's previous training helps or hinders learning new tracks.

  • @darthfastball1150
    @darthfastball1150 8 หลายเดือนก่อน +4

    This is really cool! Is the AI actually seeing the visuals / have access to knowing its position and the position of other racers or is it just running based on sequence and rewards? (is that a correct term?)

    • @aitango
      @aitango  8 หลายเดือนก่อน +6

      Really glad you like it! Yes, the AI is learning from looking at the screen, the same information people use to play the game. It only knows the position of the other racers from its current placement (1st, 2nd etc), and the minimap. The AI actually only uses rewards when training, not in the actual decision making process. This means that once the AI is trained, no rewards are needed for it to drive, just the screen!

    • @darthfastball1150
      @darthfastball1150 8 หลายเดือนก่อน

      @@aitango incredible! Thanks for letting me know and keep up the great work!

  • @Sammy7426
    @Sammy7426 8 หลายเดือนก่อน +1

    I really wish I knew how to do this! You are amazing and you have a great things ahead of you for yt! Keep it up and Godspeed. You have earned a new subscriber

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Thanks you so much, it really means a lot to hear such kind words!

  • @blubbfisch98
    @blubbfisch98 8 หลายเดือนก่อน +1

    Genuinely happy I subscribed to you. I forgot about you but TH-cam reminded me and I was amazed again.

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Really glad to hear, I'm grateful to TH-cam for reminding you of the channel!

  • @firebirdude2
    @firebirdude2 8 หลายเดือนก่อน +3

    It's interesting how the AI skips the boost arrow just before the finish line, every single lap.

    • @fandomguy8025
      @fandomguy8025 6 หลายเดือนก่อน

      By the time it reached the end it learned to stick to the middle so it had no chance to encounter it.

  • @Katastrophe9009
    @Katastrophe9009 8 หลายเดือนก่อน +6

    Just proof that practice makes perfect. Even computers understand this. Really neat to see it actually lapping the CPU's in the end. Imagine if the CPU's in game were at this level.

  • @yellowtitan6040
    @yellowtitan6040 8 หลายเดือนก่อน

    Would love to see this on more tracks, great vid

  • @pausegill
    @pausegill 8 หลายเดือนก่อน +1

    first video ive seen by you, definitely watching more this is great

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +1

      Next video should be out in the 1st September, we try and post every other Friday.

  • @billz8934
    @billz8934 8 หลายเดือนก่อน +4

    surely given enough time to train (and mushrooms) the ai could beat the world record? is this theoretically possible even if it were to take months of training? also keep up these videos i love ai and mario kart you're the goat

    • @tdiogo_gamer
      @tdiogo_gamer 8 หลายเดือนก่อน +4

      I don't think so, the WR has some insane strategies, I don't the current reward system would be able to do it.
      Even with a more advanced reward system, the AI can get stuck and not evolve or take higher risks because that would drastically affect the reward it receives, so it is very unlikely a AI like this could beat the WR.

    • @billz8934
      @billz8934 8 หลายเดือนก่อน +1

      @@tdiogo_gamer very unlikely or impossible?

    • @tdiogo_gamer
      @tdiogo_gamer 8 หลายเดือนก่อน +3

      @@billz8934 I don't exactly know, but if it isn't impossible, it is EXTREMELY unlikely.

    • @aitango
      @aitango  8 หลายเดือนก่อน +4

      I can verify this is pretty accurate. Most AIs do sadly hit a plateau at some point, rather than just continuing to improve forever. There are however other AI models out there designed to use much more data which probably could beat world records, but running these models is only really possible for big companies with huge amounts of compute like google and openai

  • @santiagoacosta3372
    @santiagoacosta3372 8 หลายเดือนก่อน +3

    This video is cool
    I wonder what would happen if you put the AI on other courses; would it perform well because it learned the basics from this track or would it have difficulties with basic things that it took for granted before?

    • @ZoomerNostalgia
      @ZoomerNostalgia 8 หลายเดือนก่อน

      It makes sense that it would be able to use what it learned here on certain tracks that have no obstacles on them. It has basic driving skills, understands the benefits to drifting, popping wheelies, tricks, and shortcuts, and actively avoids falling off. I feel like when things like Toad's Factory's stampers or Moonview Highway's cars come into play, it would need a lot more training before it wouldbe ready.

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Yeah this reply is pretty accurate; it might be able to drive a little on other tracks since it understands the basics, but would definitely struggle with anything too different to what its seen before during training. If the AI was trained on multiple different tracks though, there's a decent chance it would be able to play new tracks straight away since it would start to have a good general knowledge of the game

  • @huskygxd
    @huskygxd 8 หลายเดือนก่อน +2

    Great job on this. 👏🏾

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Thank you 🙌

  • @_cocont_
    @_cocont_ 8 หลายเดือนก่อน +1

    "Mario kart but every CPU learns and shares a collective consciousness"

  • @ColemakDH
    @ColemakDH 8 หลายเดือนก่อน +8

    This is sick, I wonder how much better it would be if it went on for say 200 hours or watching Time trials also seeing it learn items would be cool, thanks for making this video

    • @aitango
      @aitango  8 หลายเดือนก่อน +7

      Really glad you like it! I always wonder that too! Many AIs usually hit a plateau after a while, but this one looks like it was still improving from the graph, so would've been really cool! Perhaps in my next video you might see some items :)

  • @JustAPersonWhoComments
    @JustAPersonWhoComments 8 หลายเดือนก่อน +4

    Watching this AI learn to drive in Mario Kart is like watching me try to be the best player: lots of crashes, occasional moments of brilliance, and always hitting a wall when things get tough

    • @aitango
      @aitango  8 หลายเดือนก่อน

      We all have to learn somehow!

  • @Average.Goober
    @Average.Goober 8 หลายเดือนก่อน +2

    I think giving the AI rewards for uncommon events would make it even better, like lapping a racer (which should increase the score exponentially for each racer passed), or letting it keep track of its personal records and giving it points for each time they break it, bc at the end the ai isn’t learning the track to get better but rather it’s learning the track to figure ways to optimize the amount of points it could get

    • @LagMasterSam
      @LagMasterSam 6 หลายเดือนก่อน

      It actually wouldn't, at least not without a substantially larger and more complex model, and that would take much more computation to deal with.
      Rewards work better the more frequent, immediate, and relevant they are (this is actually true of human learning too.)
      You could say beating a previous time is the most relevant thing possible, but it's also the least frequent, least immediate thing possible, because it happens once at the end of the race.
      How does the AI relate beating a previous time to all it's previous work? It has to "think", "OK. I did something better this time, but what was it?" How does it figure out which thing or things it did that lead to beating a previous time?

  • @sez1742
    @sez1742 8 หลายเดือนก่อน

    Think you proved just how vital proper training is. Awesome video.

  • @autumn4442
    @autumn4442 8 หลายเดือนก่อน +4

    I wonder how the ai might be affected if you gave it a slight punishment for rapidly changing its prediction, or if that's something you can feasibly program.
    Ideally it would force it to commit more to its choices, thus making it take more direct lines while retaining the ability to change course at the last second to avoid a collision.
    As I think about it though, I think it might just make it more difficult for it to learn how to avoid crashes, since it may punish early avoidance measures.

    • @aitango
      @aitango  8 หลายเดือนก่อน +4

      There is actually an algorithm that attempts to do something similar this, called Advantage Learning which looks to increase the gaps between the predictions, forcing it to change its choice less. I really like the idea though, as for games like Mario Kart where the constant action swapping is really detrimental, it would definitely help!

    • @renakunisaki
      @renakunisaki 8 หลายเดือนก่อน

      In theory the game already does it since you lose speed.

    • @autumn4442
      @autumn4442 8 หลายเดือนก่อน +2

      @@renakunisaki Well, the issue is that while it loses speed to what we know to be faster, there's almost a sort of, what's it called, a local optima or something? While the ai COULD commit more to turns, committing even slightly more would have little improvement but increase the chances of a crash. It's entirely possible that the ai is stuck trying to optimize this tradeoff, and without shifting reward/punishment values, it may never get out of that rut. In order to improve without that, it must intentionally take actions which it knows will cause more crashes. It doesn't want to crash, so it doesn't try to improve.

  • @michaellyga4726
    @michaellyga4726 8 หลายเดือนก่อน +3

    For all the AI generated images masquerading as art and other ways AI has been used for evil, you're the one using it for good.
    Aamazing video!

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Thanks, I'm really glad you like it!

  • @Wolfedit
    @Wolfedit 8 หลายเดือนก่อน

    That's crazy
    Really nice!

  • @ANK-550
    @ANK-550 8 หลายเดือนก่อน +2

    I really like how throughout you were while explaining it in a very simple way. I was able to understand what you were saying completely. I’m fascinated with AI, but most people don’t explain it in simple terms, thus making it hard for me to understand what they are saying. Thanks for explaining it in a simple way so I could enjoy the video.

    • @doorman2374
      @doorman2374 8 หลายเดือนก่อน

      then just learn how to code AI :P

  • @afish12
    @afish12 8 หลายเดือนก่อน +1

    Would love to see more tracks and items on! Video was done very well. New sub👍

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน

      Glad you liked it, hope it was fun and engaging.🙂

  • @htms2855
    @htms2855 8 หลายเดือนก่อน +1

    Always gotta love q learning, it can either work, or never work, I appreciate the time that was needed to complete this

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Yeah you really never know haha. Even with advanced variants of Q-Learning like this one, things can still be a bit unpredictable

  • @MvpCentral21
    @MvpCentral21 8 หลายเดือนก่อน +1

    Great video! Keep up the great work

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Thanks, will do!

  • @hansjmo
    @hansjmo 8 หลายเดือนก่อน

    Subscribed, great content 👏

  • @CaptainBadger69
    @CaptainBadger69 8 หลายเดือนก่อน

    This video is intresting. I hope to see more content like this. :)

  • @ominousoptimism8620
    @ominousoptimism8620 8 หลายเดือนก่อน

    Absolute unit of a music taste man. I was jamming out the entire video. Oh, the AI thing is cool too!

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Glad you enjoyed the music! And of course the AI too haha

  • @kcotajhonson6494
    @kcotajhonson6494 8 หลายเดือนก่อน +2

    Imagine that the CPU of the game has an extreme difficulty with this training

  • @youtubzer
    @youtubzer 8 หลายเดือนก่อน

    Very cool thanks for sharing!

  • @Tobi202
    @Tobi202 หลายเดือนก่อน

    vro did make my day a lil better yess

  • @-_chira_-
    @-_chira_- 8 หลายเดือนก่อน

    I'm so happy to see a video like this, they always remind me of witnessing the birth of Neuro-Sama

  • @FirstnameLastname81206
    @FirstnameLastname81206 8 หลายเดือนก่อน

    To me, this feels like a parent teaching their kid how to do something. Your AI is clearly learning not just from the track layout, but also from the game's CPUs. Obviously your AI is far more advanced than the CPUs' AI, but it seems to be taking what the CPUs do and combining that info with the track layout to come up with the fastest route. The CPUs are like the parents, teaching their kid, your AI, to do something in the hopes of the kid surpassing them one day.
    Loved the video! I'd love to see you do more with this AI, and push it's limits!

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Really glad you enjoyed the video, and I hope to do lots more with the AI too! My dream as that the AI will be able to surpass my own skill level in any game I play!

    • @FirstnameLastname81206
      @FirstnameLastname81206 8 หลายเดือนก่อน

      That's a dream I'm gonna watch you reach. You earned a subscriber today, my friend.

  • @OGThunderPlus
    @OGThunderPlus 8 หลายเดือนก่อน +1

    underrated channel this is so interesting also
    200cc rainbow road vs the wr ghost non tas would be sick

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Thanks! That sounds like quite the challenge, might have to work up to that one haha

  • @lukasimus984
    @lukasimus984 8 หลายเดือนก่อน

    Really good video, please dont stop making videos

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Don't worry, there's definitely more coming!

  • @_GTX1650
    @_GTX1650 8 หลายเดือนก่อน +2

    The AI just collects and replicates gameplay from the average funky kong user

  • @FtwXXgigady
    @FtwXXgigady 8 หลายเดือนก่อน

    What input does the network receive? Is it some encoded positional/velocity data or does it take in the video feed?

  • @Sir_Opus
    @Sir_Opus 8 หลายเดือนก่อน

    I was so happy to see it lap rosalina at the very last second of the video!!

  • @mr_clean575
    @mr_clean575 8 หลายเดือนก่อน

    Besides knowing its velocity and how much of the track it had completed, did the ai have any other inputs from the game? Like it's actual physical position on the map, the visual display on the screen, or anything like that?
    Really cool video, I'd love to see more

  • @GranVlog
    @GranVlog 8 หลายเดือนก่อน

    does it have to play in real-time?
    how do you train it if hitting the wall dont restart the level?

  • @duckdudette
    @duckdudette 8 หลายเดือนก่อน +1

    Really impressive! The music goes hard on this one too 😂

    • @aitango
      @aitango  8 หลายเดือนก่อน +1

      Thanks, glad you liked the music!

    • @benji.botterill
      @benji.botterill 8 หลายเดือนก่อน +1

      glad you appreciated my music choices :) maybe some hardstyle for the next video😂

  • @LarvaTubaShow
    @LarvaTubaShow 8 หลายเดือนก่อน

    I can watch your videos the whole day

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Good to hear! I'll have to make more so you don't run out of videos to watch!

  • @SupLuiKir
    @SupLuiKir 8 หลายเดือนก่อน +2

    What are your practical computation limits for training AIs when it comes to input variables? Because even if the network only allows binary choices, wiimote input data isn't floating point, is you can just give your NN bitwise inputs for the joystick and accelerometers and put them together on the other end for the emulator, unless that's infeasible for practical reasons.

    • @aitango
      @aitango  8 หลายเดือนก่อน

      The AI actually plays by using an emulating gamecube controller rather than a Wii remote, since it makes the controls a fair bit simpler. The NN doesn't actually take any input from the controls either, just the screen (its a Convolutional NN).

  • @Cxmplexe
    @Cxmplexe 8 หลายเดือนก่อน +2

    I really love ai vids but most people dont train the ai far enough in my opinion this ones different very nice

    • @aitango
      @aitango  8 หลายเดือนก่อน

      I tried to give it enough time, but it still maybe could’ve improved. I think my poor pc needed a rest though!

  • @jeremy1234823
    @jeremy1234823 7 หลายเดือนก่อน

    If you took the same a.i. and made it race on any other track, would it start its learning curve from before the first try on Ghost Valley 2 or would it start its learning curve from where it left off after mastering Ghost Valley 2? (In other words, would knowledge acquired from Ghost Valley 2 translate to other tracks?)

  • @somediggthing3649
    @somediggthing3649 8 หลายเดือนก่อน

    Cant wait to see this with every track in the game

  • @ComedyFunAndMore
    @ComedyFunAndMore 8 หลายเดือนก่อน

    Insane!

  • @lopzinc
    @lopzinc 8 หลายเดือนก่อน

    does the ai retry stuff to try and get a better reward? or does it just go for whatever works with a good reward

  • @remi1771
    @remi1771 8 หลายเดือนก่อน

    How did you manage to connect the ai to game information? (Speed, where it's moving etc)

  • @JGamingTV
    @JGamingTV 8 หลายเดือนก่อน

    Oh yeah this one doing numbers. Amazing video!!!!

    • @aitango
      @aitango  8 หลายเดือนก่อน

      Thank you so much, glad you enjoyed!

  • @linkesocke4533
    @linkesocke4533 8 หลายเดือนก่อน +1

    Cool video. That's quite impressive driving for an AI.
    How about you do a race against your AI? It would be very cool to see who can do it better.

  • @LYkyle
    @LYkyle 8 หลายเดือนก่อน

    I’m seriously curious, how did u do this? How did u train it?

  • @flying_horse
    @flying_horse 8 หลายเดือนก่อน +1

    What did you use as inputs for the AI ?
    What does it see ?

  • @sinno96
    @sinno96 8 หลายเดือนก่อน +2

    Since it started to plateau at the end, do you think introducing other variables like tricks and boost pads would see the AI improve or confuse it further?
    Great vid!

  • @bigPandaMan01
    @bigPandaMan01 8 หลายเดือนก่อน +1

    This video made my day so much better

    • @aitango
      @aitango  8 หลายเดือนก่อน

      I'm really glad to hear that!