Linesight
Linesight
  • 9
  • 1 673 184
I Trained an AI for 2 Years on Trackmania. It's Breaking Records.
I trained an AI that plays Trackmania with reinforcement learning. It's getting good enough to challenge Official World Records. This is the story of how my AI reached such an incredible level.
Link to raw run playlist: th-cam.com/play/PLZ-JKrjYnD1GBlp-WS3Mr8bKmqsqSSO-U.html
Link to replay pack: drive.google.com/file/d/19p10V3g4RfFUoUUdVWNN2RXwJmcXfHC2/view?usp=drive_link
Link to input files: drive.google.com/file/d/12Jfp6nDNhAb7vQIeUIao_VU0W8652s_r/view?usp=drive_link
Project Github: github.com/Linesight-RL/linesight
Game: Trackmania Nations Forever
Buymeacoffee: buymeacoffee.com/linesight
00:00 Intro
00:43 map5 (v1 and v2)
03:58 A01
05:10 map5 (v3)
07:27 Official Campaign intro
08:42 A02
11:11 D06
14:22 Official Campaign
15:59 vs TAS
17:14 Minitrial
22:04 Method
23:14 Zero-shot
24:03 Next steps
25:38 Finale - Hockolicious
27:04 Credits & Times
#AI #reinforcement #learning #trackmania #world #record #official #campaign #tmnf
Music Tracks:
Luge - Geographer
Instant Crush - Corbyn Kites
Subway Dreams - Dan Henig
Hall of the Mountain King - Kevin McLeod
I got a stick - Kevin McLeod
Envision
Arp Bounce - Geographer
Up the steps - Geographer
Shitsubo Shita - Josh Lis
Just Breathing - NEFFEX
Tinker Time - Nathan Moore
Contact us:
First join the TMInterface Discord discord.gg/PvWYGkGKqd, then head over to the #programming-talk channel discord.com/channels/847108820479770686/885481860539498508
This project would not have been possible without the awesome TMInterface tool, provided by @Donadigo.
มุมมอง: 1 278 118

วีดีโอ

Trackmania AI Learns To Drift and Beat Pros ? | Hockolicious
มุมมอง 350Kปีที่แล้ว
We trained an AI to play Trackmania on Hockolicious, one of the game's most prestigious tracks. The AI set a strong time, beating the 2012 World Record previously set by CarlJr. Link to map: tmnf.exchange/trackshow/414041 Link to replay: drive.google.com/file/d/1b0p2QSXAea1v7lVCGuWkNr-5xQ6PdqHo/view?usp=drive_link Game: Trackmania Nations Forever Project Github: github.com/Linesight-RL/linesigh...
Superhuman Trackmania AI Demo | map5
มุมมอง 29Kปีที่แล้ว
Linesight project. We used reinforcement learning to train an AI that plays Trackmania. On its training track, the AI is faster than the current human world record. We put this video out as a challenge: can a human still overcome our AI? Game: Trackmania Nations Forever (TMNF) Map: tmnf.exchange/trackshow/10460245 Replay (.gbx file): drive.google.com/file/d/1jfOyhRQCvOC5XSEhhajE2OvmJcCHKDjR Wir...
AI Plays Trackmania - Bloopers
มุมมอง 3.6Kปีที่แล้ว
In this video, and AI is trained with reinforcement learning to accumulate speed and finish a map as fast as possible. The AI learned a behavior where it turns around right before the finish line. This is not a one-off mistake, the AI repeatedly did similar things in back-to-back runs. Can you guess why ?
AI Plays Trackmania - Map5 2:04:91
มุมมอง 9Kปีที่แล้ว
The AI is trained via reinforcement learning. Game: Trackmania Nations Forever (TMNF) Map: tmnf.exchange/trackshow/10460245 Replay (.gbx file): drive.google.com/file/d/1hp1Mz0ooR2YBNpqNjvvxjFrBOGXs8DYD/view?usp=sharing
AI Plays Trackmania - Training Progression Side by Side
มุมมอง 3.3Kปีที่แล้ว
In this video, an AI is trained via reinforcement learning. In order from the top left corner, top right corner, bottom left corner and bottom right corner the AI has received progressively less training time. The video compares the lines taken by the various AIs in different parts of the map. Game: Trackmania Nations Forever (TMNF) Map: tmnf.exchange/trackshow/10460245
AI Plays Trackmania - Map5 2:07:00
มุมมอง 1.2Kปีที่แล้ว
The AI is trained via reinforcement learning. Game: Trackmania Nations Forever (TMNF) Map: tmnf.exchange/trackshow/10460245
AI plays Trackmania - Map5 2:09:12
มุมมอง 1.2Kปีที่แล้ว
The AI is trained via reinforcement learning. Game: Trackmania Nations Forever (TMNF) Map: tmnf.exchange/trackshow/10460245
(Teaser 01) AI learns to play Trackmania with reinforcement learning
มุมมอง 2.3Kปีที่แล้ว
This video is a recording of an AI currently training to finish a custom map as fast as possible with reinforcement learning. This run was played on Trackmania Nations Forever, with TMInterface to link the game and our AI. The run was played on a custom map. The .Gbx map file is available for download here : tmnf.exchange/trackshow/10460245 The AI took 2mn12s25 to complete the map during that t...

ความคิดเห็น

  • @bl1zze
    @bl1zze 18 ชั่วโมงที่ผ่านมา

    Now make the ai play deep dip 2

  • @timeshiftedprios5564
    @timeshiftedprios5564 วันที่ผ่านมา

    Maan, I would love to see this AI learn deep fear for 3 months

  • @kodac4256
    @kodac4256 วันที่ผ่านมา

    Boring

  • @marcelmeyer4746
    @marcelmeyer4746 วันที่ผ่านมา

    What a video. Enjoyed every second.

  • @jamesgray5021
    @jamesgray5021 2 วันที่ผ่านมา

    Hi, Love the video. At about 20 mins you talk about rewarding the AI if close to the world record line. I am just wondering if this 'ham strings' from finding more optimal lines once it finds the path.

  • @AdamsWorlds
    @AdamsWorlds 2 วันที่ผ่านมา

    Would love to see this IRL in those Formula-E cars (or whatever they called) obviously wont be able to wall bash, but would be nuts to see it turn into a competition. Its the ultimate showcase of AI+Human achievement together. Humans do mechanicals and AI does the driving. That really turns it on its head as you would assume it should be other way around with a machine making a machine and human driving it.

  • @pedro.alcatra
    @pedro.alcatra 2 วันที่ผ่านมา

    At this point you just w Should rent some AI server for Training

  • @drunkpaulocosta
    @drunkpaulocosta 2 วันที่ผ่านมา

    Nobody cares bro. It's not AI at all. It's literal basic scripting

  • @root......
    @root...... 2 วันที่ผ่านมา

    The work you are doing is really awesome, never give up on it, RL has always fascinated me. Can't wait to see some human intervention creating supervised IQN!!

  • @vosechu
    @vosechu 3 วันที่ผ่านมา

    Incredible results! Congratulations!

  • @LincolnButterbean
    @LincolnButterbean 3 วันที่ผ่านมา

    You should make an ai for the f1 games and then sell it to f1 teams to train their riders, easy million

  • @haysdixon6227
    @haysdixon6227 3 วันที่ผ่านมา

    this is awesome. I wonder if there’s a good way to “direct” the AI to take a general pattern (like bouncing off walls or leaping a shortcut) by incentivizing it to stay within a certain distance of an example run (in a satisficing and not a maximizing way.) This could have similar benefits to how human input with machine help via Foldit sometimes has better results than straight AI. Often the best protein folds use human intelligence with machine tools (Shake and Wiggle, AlphaFold.)

  • @chrishenderson666
    @chrishenderson666 3 วันที่ผ่านมา

    Ok the part where the AI stops at the bridge to maximize its reward. This happened over a decade ago with an AI that was trained to play a video game and beat it as quickly as possible. It started freezing or crashing the game in order to "not be punished"

  • @ChargedPulsar
    @ChargedPulsar 4 วันที่ผ่านมา

    So basically, instead of playing for 2 years, you worked for 2 years on some robot to play for you instead.

  • @illarionbykov7401
    @illarionbykov7401 4 วันที่ผ่านมา

    Top AI is now solid at chess--almost unbeatable, but at Go AI loses to skilled amateur humans with some anti-computer strategies, and at poker and DotA2 can only play pro level at artificially simplified versions of the games, while playing significantly worse than pros at the full verions of the games, and also loses to the best human pros even at some of the simplified versions. People in the field of AI have a habit of misrepresenting and overstating their achievements. AI has not "mastered" or "conquered" nearly as much as claimed. Not yet, at least.

  • @AnimaLector007
    @AnimaLector007 4 วันที่ผ่านมา

    This remeber me of half size micromouse. Nice work!

  • @MrProy33
    @MrProy33 4 วันที่ผ่านมา

    Dumb. And truly sad. Watching a computer repeatedly playing a video game is just about the most worthless, wasteful use of one's time, maybe ever. Go outside. Learn a trade. Take cooking lessons. Get in a few fistfights. Go on dates and get your heart broken. Get an actual career where you accomplish something and provide a bebefit that is truly useful to the world--farming, carpentry, etc. If you are fascinated by racing, learn to drive a real car and get some track experience. You know, go do some real man things. Computers are supposed to help make our actual lives better, to improve our practices and procedures, not to replace our experiences with digital serotonin and dopamine hits on little blue screens.

  • @hello4fghr
    @hello4fghr 5 วันที่ผ่านมา

    how is this done and how difficult would it be to set it up?

  • @rbstyle1201
    @rbstyle1201 5 วันที่ผ่านมา

    Not expert ai but yes if its trained properly ai will smart enough knowing the timing and limit of something under its control, to give best output

  • @Jesus_Christ_is_Lord_
    @Jesus_Christ_is_Lord_ 5 วันที่ผ่านมา

    So just to out this in perspective the worlds vest time holders were beaten by a fruitfly brain.

  • @cosmolightfoot103
    @cosmolightfoot103 5 วันที่ผ่านมา

    Kind of funny that an intelligence biased towards "immediate rewards" is better at racing ;)

  • @aadamdilshan788
    @aadamdilshan788 5 วันที่ผ่านมา

    23:13 Not us being destroyed by a FLY

  • @gabeitches6955
    @gabeitches6955 6 วันที่ผ่านมา

    I hope we get to see the guy eat his hat

  • @TheDazedCS
    @TheDazedCS 7 วันที่ผ่านมา

    dudes teaching skynet how to chase us down

  • @Name-ot3xw
    @Name-ot3xw 7 วันที่ผ่านมา

    I'm guessing your low res "could you even imagine" image is at least half the pixels of a gameboy. We did alright. You even have greyscale, back in my day it was black on neutral grey/brown.

  • @christophedejonge918
    @christophedejonge918 8 วันที่ผ่านมา

    AI or Machine Learning?

  • @Fluiding
    @Fluiding 8 วันที่ผ่านมา

    19:56 He essentially programmed anxiety

  • @JohnSmith-mf3dh
    @JohnSmith-mf3dh 8 วันที่ผ่านมา

    TAS proves how superior humans are. Humanity, FUCK YEAH!

  • @JohnSmith-mf3dh
    @JohnSmith-mf3dh 8 วันที่ผ่านมา

    So... you cheated! Using humans to teach the "ai!...

  • @gabrielplourde6791
    @gabrielplourde6791 8 วันที่ผ่านมา

    awesome stuff! IS the AI able to be used on any races? Or is it only trained on 1 race at a time and thus 'memorizes' the single best route for a single map?

  • @DarkRedman31
    @DarkRedman31 8 วันที่ผ่านมา

    14:19 I laughed so hard! 🤣

  • @helterskelter416
    @helterskelter416 8 วันที่ผ่านมา

    16:35 your AI *is* a tool, so it *is* a tas. sorry to break it to you :c

  • @imcrow6674
    @imcrow6674 9 วันที่ผ่านมา

    theres gonna be a time where accusations float around of people submitting speedruns done by AI whoo boy we are getting there aint me

  • @dtar380
    @dtar380 9 วันที่ผ่านมา

    Did bro just use AI to beat a WR in Trackmania

  • @ShadowNatsumi
    @ShadowNatsumi 9 วันที่ผ่านมา

    for people saying he didnt train the AI its just stupid cus he infact made a machine do the runs instead of him doing the runs so its AI

  • @一_一一一一一一一一
    @一_一一一一一一一一 9 วันที่ผ่านมา

    I think the reason AI cant find shortcuts is that in order to find them, the time would have to go down for many runs before going up again. since I presume the ai gets punished for its time going up, that would mean shortcuts, at least complex ones will be impossible for the ai to find unless found early on in its training. its for this same reason that animals cant evolve to have extra pairs of limbs or breath fire. because in order to do that, it would have to evolve a bunch of other stuff first, which would lead to the creature getting "punished" by being less efficient and adapted for its environment. evolution is very similar to ai learning if you think about it that way.

  • @AmiciCherno
    @AmiciCherno 9 วันที่ผ่านมา

    Yosh already beat you to it.

  • @MsForzen
    @MsForzen 10 วันที่ผ่านมา

    wehen you dont have skill^^

  • @michaeldubery3593
    @michaeldubery3593 10 วันที่ผ่านมา

    Surely v4 (or 3.5) could just be v3 with higher resolution frames? I think human wr's would be slower if we were playing a lower resolution game too, so I'd expect there to be unpredictable benefits to the existing ai if it has more information input. It would also go some way to providing the AI with a way to look ahead, in an analogous way to how humans actually look ahead in game- by literally looking ahead in the track in any given frame of the race. Since training times per race have already come down so much due to your other optimisations, it seems like increasing the processing requirement by increasing the resolution needn't make training take weeks and weeks. Also final (small) recommendation- collab with Code Bullet! He might have some clever ideas having done a lot of ML projects in different games.

  • @404_BrainNotFound
    @404_BrainNotFound 11 วันที่ผ่านมา

    When the AI agents achieves the fastest time, can it repeat it consistently? (Percentagewise)

  • @Ayelis
    @Ayelis 12 วันที่ผ่านมา

    Are you saying we can train fruitflies to race?! :D

  • @VictoriaVoltia
    @VictoriaVoltia 12 วันที่ผ่านมา

    Uh, hey dude. Your AI training gonna make some people time travel to get you.

  • @mechaboy95
    @mechaboy95 13 วันที่ผ่านมา

    Have you looked into using cloud computing to run your code on? Can be massive help to the training process

  • @Рустам-е9ч
    @Рустам-е9ч 14 วันที่ผ่านมา

    Idk about the rest, but in dota 2 open ai unable to beat pro teams(it cant win a single game). But it winning against amateurs very often, about 98% win rate

  • @cmeier_lifting4542
    @cmeier_lifting4542 14 วันที่ผ่านมา

    could you train an AI to play rocket league? it wouldn't be easy at all but i'd love to see it

  • @PaulsPickles
    @PaulsPickles 14 วันที่ผ่านมา

    Have it try deep dip

  • @Tsskyx
    @Tsskyx 14 วันที่ผ่านมา

    Hmm, I have some thoughts about this. As I play Trackmania now, I'm coming to realize that this AI model is probably inadequate for several reasons. Consider what human players do when they drive. They don't _just_ drive, they also _learn._ Of course, the AI learns too, but it's learning the wrong things, it's constructing an incorrect gameplay model in its "head". First of all, it should somehow be able to learn certain essential skills, such as breaking, drifting, speedsliding, etc, not to mention the tricks on all other surfaces and with other vehicles too. Next, it needs enough memory to build a "visual map" of the track. Us humans do it by eye, but for the AI, I recommend logging the exact spatial coordinate where it plans to execute a given trick. And only _then_ it could begin to start reasonably learning. Though, you probably still can't do much about tracks where one needs to severely slow down or go down a very specific and odd route. Solving such tasks requires creative thinking.

  • @tutacat
    @tutacat 15 วันที่ผ่านมา

    It is not intelligence because it does not understand anything, it's just a database. It is a neural network.

  • @einarsalamon8426
    @einarsalamon8426 15 วันที่ผ่านมา

    Can the ai speedslide?

  • @tutacat
    @tutacat 15 วันที่ผ่านมา

    Using random population and darwinism, can find more local minima/maxima