I created an AI to Play Poker

  • Published on 28 Jun 2024
  • I created a Poker AI and challenged my friends to a Heads-Up No-Limit Texas Hold'Em tournament. This involved solving for the Nash Equilibrium on an abstracted version of the Poker game.
    My code: github.com/Gongsta/Poker-AI/
    Note*: For the hand starting at 12:28, it's supposed to be a 6 of hearts in my hands (not a 6 of diamonds).
    ----------
    SUPPORT ME
    Not running a Patreon at the moment, but you can support me financially by donating directly through YouTube, or buying me a coffee here ☕: ko-fi.com/stevengongg
    ----------
    MY SOCIALS
    🌐 Website: stevengong.co
    ✍️ Blog: blog.stevengong.co
    📸 Instagram: / stevengongg
    📱 Twitter: / stevengongg
    ----------
    PRODUCTS I USE
    The links below are Amazon affiliate links, they don't cost extra for you and it helps support the channel!
    Lumix G7 Camera 📷: amzn.to/3JgNPLp
    Blue Yeti Microphone 🎙️: amzn.to/43X4vRw
    10$ Wired Lavalier Microphone 🎤: amzn.to/3Q2rBjZ
    Wireless Lavalier Microphone 🎤: amzn.to/3VRQ9js
    Softbox 💡: amzn.to/3xsGTYL
    HyperX Alloys Origins Core Keyboard ⌨️: amzn.to/3xADGqh
    Razer Deathadder Mouse 🖱️: amzn.to/3TYbCob
    NVIDIA Jetson AGX Orin (for projects) 💻: amzn.to/3xF9N8o
    ----------
    Subscribers at the time of uploading: 31,908
    ----------
    ABOUT ME
    I'm Steven, a university student currently studying Software Engineering at the University of Waterloo. I like to build cool things with code. I started this YouTube channel to share my love for learning and engineering with the world. I do everything from electronics, CAD, control theory, and programming to 3D printing, physics, and design.
    My ultimate goal is to be able to make anything that I want. I'm working towards this by building increasingly difficult engineering projects that push the limits of my knowledge. Consider subscribing if you would like to follow my journey and learn alongside me as a student of the universe!
    #poker #ai #stevengong
  • Entertainment

Comments • 154

  • @AskiFin
    @AskiFin 2 days ago +57

    There's the famous guy who won the AI-only poker tournament. His AI just always went all in, and the other AIs assumed that since it was a large bet, it had the better hand, and just folded.

    • @Goldandus_30
      @Goldandus_30 1 day ago

      😂😂

    • @Ohrami
      @Ohrami 1 hour ago

      Then those AIs were completely horrible, or the one that only went all-in got incredibly lucky.

  • @stevengongg
    @stevengongg  5 days ago +58

    This has been a project that I've been working on for a pretty long time... so I'm happy to finally release it! Let me know if you guys want a follow-up, since there are still quite a few things I can improve the AI on. I'd like to challenge an actual professional poker player next time. If you enjoyed watching the video, make sure to leave a like, it helps grow the channel :))

    • @thecodingmentor7701
      @thecodingmentor7701 4 days ago +1

      This is an incredible video. You combined two of my favorite things, AI and poker. Can't believe this does not have more views! Such high production quality, too. As for me, I would definitely like a follow-up to this video!

    • @JimStanfield-zo2pz
      @JimStanfield-zo2pz 3 days ago +2

      It sucks dude. I built a way better one. My poker bot would destroy yours

    • @stratabegood
      @stratabegood 3 days ago +1

      AI will take over eventually. It's not smart to be making videos like this. It will definitely ruin online poker one day if it hasn't already... Sad shame

    • @ahsidodna3355
      @ahsidodna3355 3 days ago

      for sure, this is very interesting, please continue

    • @perfectionist8380
      @perfectionist8380 3 days ago

      Maybe try training the AI on all the top poker tournaments (or just Dan Negreanu strategy) & also use facial recognition for player emotion/stress.
      Or simply take into account the time to react factor.
      Great project overall though! Kudos

  • @ffishcakess
    @ffishcakess 5 days ago +40

    Hahhaa the punt jam on AA2 followed by the even bigger punt call "I'm folding, don't get used to it" - I believe it lmao 😂

    • @JonCookeBridge
      @JonCookeBridge 4 days ago +1

      Confused why it calls the huge jam on AA2 with QT. Is it because pre-flop and flop are decoupled, somehow? Or is it not realising the difference between all in and small bet?

    • @BarvGwydh
      @BarvGwydh 3 days ago +5

      It could mainly be because this spot never comes up in nash equilibrium strategy. I think the bot is not playing GTO in general and has some flaws, but even a well made bot might struggle with this spot because 6x pot jam should never be played.

    • @JonCookeBridge
      @JonCookeBridge 2 days ago +1

      @@BarvGwydh Possibly. But watching the film I suspect his preflop model isn’t putting enough aces in this line because it’s decoupled from post flop action. Obviously with more aces in his range here, the need to call weaker hands reduces. To get the algorithm to compute a balanced preflop strategy requires CFR to be done across multiple streets, and he implies that he solved preflop and postflop separately. Basically if you just train a preflop model based on maximising equity before you see the flop, your AI won’t have enough board coverage when it’s deep, and it won’t slow play enough because it won’t see the value it gets from those things post-flop.

  • @MarijnRoorda
    @MarijnRoorda 5 days ago +33

    I went from an honest government ad warning me about AI, to this...

    • @Christoff070
      @Christoff070 3 days ago

      Great vid that one

    • @authenticallysuperficial9874
      @authenticallysuperficial9874 3 days ago +3

      "Honest government" is an oxymoron

    • @Christoff070
      @Christoff070 3 days ago

      @@authenticallysuperficial9874 yup, that's the point of it

    • @MarijnRoorda
      @MarijnRoorda 3 days ago

      @@authenticallysuperficial9874 It's a YouTube channel that warns people about the bullshit during elections. It has pretty girls and plenty of swearing in Australian. The Juice Media.

  • @PsylocibeChillZen
    @PsylocibeChillZen 3 days ago +9

    Why are there two 6 of diamonds cards in the deck at 12:55?

    • @Johnny2Feathers
      @Johnny2Feathers 16 hours ago

      Nice catch

  • @CreativeMindstorms
    @CreativeMindstorms 4 days ago +5

    I have been wanting to see this for so long! Great job on the AI, it seemed to be making the right plays most of the time!

    • @fauge7
      @fauge7 4 days ago

      Yeah but most of the time is bad enough to make you go broke...

    • @B00nater
      @B00nater 1 day ago

      It really isn’t. Seems easy to exploit.

  • @Kingcrowish
    @Kingcrowish 3 days ago +3

    The timing on the Hellmuth rage clip was great

  • @AbomidablePlayer
    @AbomidablePlayer 3 days ago +3

    The Phil Hellmuth reference is gold!

  • @haviick1014
    @haviick1014 4 days ago +4

    This needs a part 2

  • @marcuslaxson6237
    @marcuslaxson6237 3 days ago

    wow this channel is going to blow up, great idea and very fun execution!

  • @icanyo9301
    @icanyo9301 4 days ago +2

    this is so sick! i need to learn to code cus my god it seems so satisfying completing a project like this

  • @JoseLopez-qm4ro
    @JoseLopez-qm4ro 4 days ago

    Good job man, 👏 props to you regardless of the loss... Take failure, learn from it and embrace it to always make yourself better.

  • @pipipupukok
    @pipipupukok 5 days ago +2

    I absolutely love your videos man, you inspired me a lot, and now I will be an intern at the biggest IT company in Poland at 16 years old!

    • @stevengongg
      @stevengongg  5 days ago +1

      good job!! keep pushing

  • @rainluo4831
    @rainluo4831 5 days ago

    Love your videos!!! Keep pushing❤

  • @Bot-Frenzy
    @Bot-Frenzy 5 days ago +1

    Nice work Steven! Keep it up 👍🏻

  • @TrygveVang
    @TrygveVang 5 days ago +1

    This vid deserves 1000x more views than what it is at now, holy moly

  • @jujubaloolian
    @jujubaloolian 5 days ago +3

    It would be interesting to see how this would develop further if the AI took into account its position at the table, to see how many players bet before it or would have an opportunity to bet after. It's also interesting to see how other players could interpret the AI's decisions. Most poker players would tell you to always bet/raise/reraise the same amount preflop for the hands you play, as changing the bet amount would only signal that you have a strong hand or are bluffing. 😃

    • @BarvGwydh
      @BarvGwydh 5 days ago +1

      The bot will use randomization so that its sizing does not make it exploitable; it will also simply be balanced no matter what, with an unexploitable mix of value bets and bluffs. A Nash equilibrium strategy cannot be beaten in expectation heads-up; at best the opponent breaks even by also playing a Nash equilibrium strategy, as in the rock paper scissors example, where the NE strategy is to pick each action 1/3 of the time. If the bot uses multiple sizes, it will still balance them. For example, if it wanted to open-raise a hand to two different sizes preflop, it might use 2bb half the time and 2.5bb the other half.
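
The rock paper scissors point can be checked numerically. This is a minimal sketch, assuming two players who both update by regret matching (the building block of the CFR method mentioned elsewhere in this thread) using exact expected payoffs rather than sampled throws. The function names and the lopsided starting regrets are illustrative and are not taken from the video's code.

```python
# Payoff to the player choosing the row action against the column action.
# Order of actions: rock, paper, scissors.
PAYOFF = [[0, -1, 1],
          [1, 0, -1],
          [-1, 1, 0]]


def strategy_from_regrets(regrets):
    """Regret matching: play each action in proportion to its positive regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total == 0:
        return [1 / 3] * 3
    return [p / total for p in positive]


def action_values(opponent_strategy):
    """Expected payoff of each pure action against the opponent's mix."""
    return [sum(PAYOFF[a][b] * opponent_strategy[b] for b in range(3))
            for a in range(3)]


def train(iterations=20000):
    """Return each player's average strategy after self-play."""
    # Start both players off balance so there is something to correct.
    regrets = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
    strategy_sums = [[0.0] * 3, [0.0] * 3]
    for _ in range(iterations):
        strategies = [strategy_from_regrets(r) for r in regrets]
        for player in range(2):
            values = action_values(strategies[1 - player])
            expected = sum(s * v for s, v in zip(strategies[player], values))
            for a in range(3):
                # Regret: how much better action a would have done.
                regrets[player][a] += values[a] - expected
                strategy_sums[player][a] += strategies[player][a]
    return [[s / iterations for s in sums] for sums in strategy_sums]


if __name__ == "__main__":
    for player, average in enumerate(train()):
        print(player, [round(p, 3) for p in average])
```

It is the average strategy over all iterations, not the strategy on any single iteration, that approaches 1/3 per action.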

    • @jujubaloolian
      @jujubaloolian 4 days ago +3

      @@BarvGwydh That makes sense for the betting, but I still think it would make a difference based on the position you are sitting in. If the bot calls/raises a lower-equity hand at the bottom of its range, it has a higher chance of winning when it bets last at the table and there is only one other person in the pot than when it's UTG and is betting the same hand into 6 people, or calling with 6 people in the pot. Nash equilibrium can be beaten in a game with more than two people if others have shifting strategies; it only ensures that a single player can't improve by deviating.

  • @777BrokeDegen
    @777BrokeDegen 1 day ago

    before watching the video I haven't decided whether you need to be taken out or not

  • @tiagocarneiro9743
    @tiagocarneiro9743 4 days ago +2

    Hey! Poker player here. First of all, great job, it must not be easy at all!
    I think you would have more success playing against your friends (and bad poker players in general) if you programmed your AI not to play in equilibrium but to play unbalanced in order to exploit these weaker players' leaks!

  • @tehf00n
    @tehf00n 3 days ago +1

    A friend of mine coded a poker bot for texas hold 'em back around 2006. He set one instance running 1c/2c cash game tables and was returning around 10c/day profit playing the odds and position only. I haven't seen him for years to see how he went once he weighted it for variance and stake styles.

    • @spacebomb9126
      @spacebomb9126 3 days ago

      He's probably in prison now for theft on gambling sites.

    • @tehf00n
      @tehf00n 3 days ago

      @@spacebomb9126 I highly doubt it. The guy was a top level coder and wrote most of the original office polymorphic viruses for bug bounties with Microsoft. And worked a government job. More likely he knew his way around a VPN and a dummy account.

  • @edwrdw
    @edwrdw 5 days ago

    Worth the wait

  • @francolauciello4429
    @francolauciello4429 1 day ago

    You should’ve made a results part at the end, good video thooo 🎉🎉

  • @TheOnTiltBoys
    @TheOnTiltBoys 4 days ago +4

    we will demolish that AI

    • @jttech44
      @jttech44 4 days ago +1

      ChatGPT? More like ChatATM

    • @matteovc2158
      @matteovc2158 3 days ago

      You guys don't always seem to be playing all that great, but I think even you would be able to beat this so called GTO playing AI. I think every action was just horrible.

  • @NSelsted
    @NSelsted 1 day ago

    It didn't make only one mistake. After it calls the first all-in with QT, it is clear that it is easy to exploit. You wait until you hit a hand where the AI hasn't shown particular strength, and then you go all-in if you have hit. That it wants to call 1/3 of the time in the last hand shows that this simple strategy will most likely win close to 100% of the time. There are other exploits, but you are far from winning against any pro poker player. You have a lot more work to do before it stands a chance against even semi-pros.

  • @RandomFunZer
    @RandomFunZer 4 days ago +1

    As an actual full-time professional, it was quite interesting to watch. But this AI in its current state would get absolutely destroyed by a real pro :).
    Still a fun endeavour and for sure very hard to come up with making this system. Good job

  • @mattloulou123123
    @mattloulou123123 5 days ago +1

    Lets go Steven!!

    • @stevengongg
      @stevengongg  5 days ago +1

      my number 1 supporter ❤️

  • @bayslive8720
    @bayslive8720 3 days ago +2

    says he loves poker: Actively creates a program to help ruin it.

    • @spacebomb9126
      @spacebomb9126 3 days ago

      Exactly.
      Idiots like these fools are the reason so many people are cheating online in poker.

  • @fanouzhang3887
    @fanouzhang3887 4 days ago +1

    The AI didn't seem to be playing very close to a gto solution of most spots

  • @queenstown9534
    @queenstown9534 4 days ago +2

    12:51 Heads up, I wouldn't exactly call this a semi-bluff as BB's range should be way weaker than the AI's, so it's a value raise. Very standard.
    13:26 This hand is atrocious. I'm guessing you guys are playing 50/100 this hand, hard to tell. In which case we're ~30bb effective and this should just be a fold for the AI as even against the bottom of SB's range, it's not looking too hot. Plus, AI blocks the more natural semi-bluffs like Qxhh for a second nut flush draw. If we can really call a 7.25x shove a natural semi-bluff.
    Why does the AI call? If I *had* to rationalize a call, I would say that this shove is either the nuts or air and because we block the Qh, we have more equity against most air hands, except Khx. This is a horrible justification for a call but, it's what it might be thinking.
    Interesting note, at 20bb effective, solver calls QhTd at ~10% frequency.
    15:02 It might seem insane but this hand is not so horrible from your AI's perspective. It's actually correct from a GTO perspective. Reason for the call is that you are beating 137/185 hands (~74.1% equity) and getting ~1.53:1 pot odds (~40% equity required to call and break even). If we folded every time, we would be over-folding and losing money in the long run, given the price is just so good.
    I think your AI plays kinda okay, considering it's only learnt over 1m hands. The QT hand is crazy, one other consideration is that it's not gonna adjust to the mistakes made by other players so it's gonna make some whack decisions like this one. I'd be interested to play it and also see how it plays against GTO Wizard.
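
The pot-odds arithmetic for the 15:02 hand can be written out as a small sketch. It takes the numbers in the comment above at face value (ahead of 137 of 185 combos, about 1.53:1 on the call) and treats "share of combos beaten" as equity, which is a simplification. The function names are illustrative.

```python
def break_even_equity(pot_odds_ratio):
    """Equity needed to break even on a call when the pot lays pot_odds_ratio:1."""
    return 1 / (1 + pot_odds_ratio)


def call_ev(equity, pot, to_call):
    """Chip EV of calling: win `pot` with probability `equity`, else lose `to_call`."""
    return equity * pot - (1 - equity) * to_call


if __name__ == "__main__":
    equity = 137 / 185                       # share of combos the AI beats
    needed = break_even_equity(1.53)         # roughly 0.395
    print(f"equity {equity:.3f}, needed {needed:.3f}")
    # EV per chip risked, with the pot expressed in units of the call size.
    print(f"EV per unit called: {call_ev(equity, pot=1.53, to_call=1.0):+.3f}")
```

Under these assumptions the call clears the break-even threshold by a wide margin, which is why folding every time would be over-folding.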

  • @RT-he6zf
    @RT-he6zf 3 days ago

    I've been getting crushed by these "AIs" in online rooms for years

  • @fabrizzolla
    @fabrizzolla 3 days ago +2

    Just imagine: thousands of cheaters letting their PCs playing poker to farm money using this AI and AI vision. R.I.P. online

    • @spacebomb9126
      @spacebomb9126 3 days ago

      Yeah, these guys are losers and should be arrested for showing people how to cheat and steal money.

  • @JMRSplatt
    @JMRSplatt 3 days ago

    "Private" Cards.. I love it. I'm using this instead of hole cards from now on. Sir! These are my own cards, they are private!

  • @philipepeters
    @philipepeters 3 days ago

    so cool

  • @MrWhatsyourflavour
    @MrWhatsyourflavour 4 days ago

    Nice work !

  • @TomRauhe
    @TomRauhe 4 days ago +1

    So how is this different from all the existing solvers out there?

    • @stevengongg
      @stevengongg  4 days ago +2

      not that much different, I implemented it for my own intellectual curiosity :)

  • @anotherperson76
    @anotherperson76 4 days ago +1

    That was unreal

  • @slendi9623
    @slendi9623 5 days ago

    Good stuff

  • @justinlau9377
    @justinlau9377 3 days ago

    what a sick video!

  • @andycai9866
    @andycai9866 4 days ago

    Sick video!

  • @Techn0man1ac
    @Techn0man1ac 17 hours ago

    8:12 LOL

  • @TheKooldude75
    @TheKooldude75 4 days ago

    pretty cool idea, imagine if you got glasses w/ a camera and an earpiece and took this AI to a casino

    • @jttech44
      @jttech44 4 days ago +1

      Sounds like a great way to get kicked out of a casino

    • @spacebomb9126
      @spacebomb9126 3 days ago +1

      There's an old saying "F*ck around and find out."
      Try it and see what happens, kiddo.

  • @authenticallysuperficial9874
    @authenticallysuperficial9874 3 days ago

    0:30 Please fix this issue in your audio where you played both channels at once and they got garbled.

  • @joebazooks
    @joebazooks 1 day ago

    as a player who's won +250k in earnings and a programmer, super entertaining

  • @parasences
    @parasences 4 days ago

    Wow, surely no one ever did that before...

  • @djziomsuper
    @djziomsuper 1 day ago

    nice vid!!

  • @jonathanwei2457
    @jonathanwei2457 4 days ago +1

    very cool

  • @jonathanbeer9955
    @jonathanbeer9955 4 days ago +3

    I agree with the general reaction to the last hand. I don't see how the math could be right: staying in should have a significantly lower EV (easy to calculate), especially if you weigh the bet and pot size against the percentages you reported. In fact, I think your opponent started to notice that issue or calculation bug on the previous all-in bet. I haven't read any papers on this, but even with imperfect information the bet and pot size should have a significant effect on the odds (even if some Nash optimization requires extra randomness injected into that decision). I would wager that a normal CNN with a lot of training would vastly outperform this (in its current form, you should be able to beat it if you can figure out the true Nash value of each state).
    Either way it's a cool project, but I think there are much simpler approaches and code implementations that could perform better here. Thumbs up for a great project idea regardless.

  • @cikw7492
    @cikw7492 3 days ago

    It would be better if it could exploit the opposing player and factor that into its action percentages!

  • @williamwith9859
    @williamwith9859 4 days ago

    why don't you just use reinforcement learning? it's pretty ez to code, just needs a long time to train

  • @michiel97531
    @michiel97531 4 days ago

    Honestly it just played loose but still made good calls and good bets

  • @GutsFreedom
    @GutsFreedom 4 days ago +1

    ................................................................................ That is how I felt watching this video

    • @spacebomb9126
      @spacebomb9126 3 days ago

      I agree.
      They are teaching people how to scam people online in poker.

  • @wesch6354
    @wesch6354 4 days ago +3

    You need to rework your AI. Facing a river jam with bottom pair is a call at most maybe 5% not 30%.

    • @OmateYayami
      @OmateYayami 4 days ago

      Not if you have infinite chips or games, i.e. you can afford the consequences of losing a few times.

    • @cpatterson365
      @cpatterson365 4 days ago +5

      @@OmateYayami I've been playing poker wrong all this time. Turns out I just need infinite money!

    • @OmateYayami
      @OmateYayami 4 days ago

      @@cpatterson365 you've been playing different poker. Not necessarily wrong. You manage risk differently depending on the relative bet, stack and pot sizes. The bot didn't. Look if you feel your chance of winning is 1 in 3 on a bluff, you won't go into it as likely with all-in as opposed to having 10 more games ahead.
      If you have infinite chips or games there is no difference.
      I'm just saying that calling vs folding being 5 or 30% is wrong. This percentage should change depending on the chips involved.

    • @NSelsted
      @NSelsted 1 day ago

      @@OmateYayami theoretically the Martingale system works on roulette. The reason it does this, is because it assumes you have infinite money, and infinite bet sizes. For these things to have any relevance, you need to look at non infinite. And funny enough, as soon as you look at non infinite at the Martingale system, it theoretically always fails.
      Creating a system with an assumption that is not real, will make a system that is not relatable. Even then, this bot would not stand a chance against a pro, even with infinite money and time.
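
The Martingale point can be made exact with a few lines of arithmetic. This is a sketch under stated assumptions: a single-zero roulette wheel (win probability 18/37 on an even-money bet), a base stake of 1 unit, and a bankroll that covers only a fixed number of doublings. The function name is illustrative.

```python
from fractions import Fraction


def martingale_cycle_ev(max_doublings, win_prob=Fraction(18, 37)):
    """Exact EV of one Martingale cycle with a capped number of bets.

    The player stakes 1, 2, 4, ... and stops at the first win (net +1)
    or after `max_doublings` straight losses (net -(2**max_doublings - 1)).
    """
    lose_prob = 1 - win_prob
    bust_prob = lose_prob ** max_doublings
    bust_loss = 2 ** max_doublings - 1
    return (1 - bust_prob) * 1 - bust_prob * bust_loss


if __name__ == "__main__":
    for n in (1, 5, 10, 20):
        print(n, float(martingale_cycle_ev(n)))
```

The result simplifies to 1 - (2 * 19/37)^n, which is negative for every finite n and gets worse as n grows, so a deeper bankroll only delays the loss.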

    • @OmateYayami
      @OmateYayami 1 day ago

      @@NSelsted I might have oversimplified, but you do not have infinite bet sizes here. There is always a limit. Despite the name "no limit", the most that can go into the pot is the sum of all stacks, which is always finite. You can't always place a bigger bet and reduce the game to needing one win.
      I oversimplified in a way that lets you argue that infinite aggression will always win in an infinite game, but that was not the point. The point was that the bot's finite aggression was too much and unstable. With infinite games that would have been alleviated, but that was too broad a relaxation, which you properly exploited.
      I am not sure if a pro could reliably win against a properly written bot. I have my doubts.

  • @aspuzling
    @aspuzling 5 days ago

    Great work. Loved the video. It seems like your friends worked out how to read the AI by the last round. When it checked after seeing each Ace and then the King, Brandon correctly guessed it had a bad hand. A human in the AI's position probably would have bluffed.

    • @BarvGwydh
      @BarvGwydh 5 days ago +1

      If the AI is working correctly, this is similar to saying that you picked up on the strategy of a nash equilibrium bot in rock paper scissors - where the bot's strategy is to randomly pick each action 1/3 of the time.

  • @F0ll0wTheWh1teRabbit
    @F0ll0wTheWh1teRabbit 3 days ago

    I'd say that you programmed it in a mathematical way using equity. I don't know much about programming, so please correct me if I am wrong, but a better way would be for the AI to narrow the opponent's range down using the information the player gives, and use that to make accurate decisions

    • @tehf00n
      @tehf00n 3 days ago

      You could have all kinds of heuristics for a player if you were recording their decisions, but that might contravene the rules of a provider. For personal use, though, you could do everything you suggested and go even further to exploit the Mersenne Twister algorithm that all casinos use to shuffle their poker decks.

  • @robbolewis6878
    @robbolewis6878 4 days ago

    is it allowed to use AI in online poker tournaments?

    • @timonix2
      @timonix2 3 days ago

      No, it's against the terms of service for just about every platform. I bet there are tons of them out there though

    • @spacebomb9126
      @spacebomb9126 3 days ago

      Nope, and it could get you in prison in some cases if you try it.

  • @blackopal3138
    @blackopal3138 3 days ago

    Just tweak the algo. Tell it, when it faces elimination, it should tighten its range and only call when it's 90% and above, or whatever .... why am I helping you? lol

  • @losyart
    @losyart 23 hours ago

    I thought Greg Goes Allin uploaded a video

  • @daylen577
    @daylen577 5 days ago +3

    What I'm missing in this is the AI trying to figure out the cards other players have based on their individual strategies, actions, and known data. For example if it figures out after a round or two that a specific player bets high and ends up winning, that's a pretty good indicator that when they bet high they have cards that would let them win. The AI knows the cards it has and the cards on the table, so by matching all of that together it could make a reasonable assumption about how likely each player is to have a winning hand.

    • @BarvGwydh
      @BarvGwydh 5 days ago +10

      It doesn't need to. In poker it's just extremely hard to not lose tremendously vs a nash equilibrium strategy and it's impossible to win. The bot can just play as if it's playing against an unexploitable opponent, and it will crush someone whose strategy has nothing to do with the unexploitable strategy, and it will comfortably beat a human who has spent a decade studying nash equilibrium strategies. If it adjusted to play against the poor strategy it would absolutely decimate them even moreso. However, in such a small sample, we will not discover someone's strategy. And trying to make significant adjustments based on a small sample just opens up the bot to being exploited itself. Sometimes human vs human you can find ways to exploit someone's strategy in a comically small sample like 100 hands, mostly because a lot of humans are terrible at the game, but a bot would really need like 10,000 hands just to get a vague idea of someone's strategy. Plus, if it adjusts before that, a smart player could intentionally make it adjust one way then overadjust their own strategy the other way to exploit the bot.
      If anything, it might be better to datamine online heads-up games and take an aggregate strategy of all profitable humans, and the bot could maybe even play a strategy that's mostly GTO with some adjustments mixed in from this online data.
      TLDR trying to make a bot adjust its strategy based on its opponent is probably a bad idea unless it's making very small careful adjustments over 10k+, 100k+ hands.
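
The sample-size argument above can be illustrated with a rough margin-of-error calculation. This sketch assumes a hypothetical spot that comes up in 2% of hands and an opponent who takes some action 30% of the time in that spot; both numbers are made up for illustration. It uses the normal approximation, which is itself unreliable at very small counts.

```python
import math


def margin_of_error(frequency, hands, spot_rate, z=1.96):
    """Approximate 95% margin of error on an observed action frequency.

    `spot_rate` is the fraction of hands in which the spot occurs, so only
    hands * spot_rate observations are available to estimate `frequency`.
    """
    observations = hands * spot_rate
    return z * math.sqrt(frequency * (1 - frequency) / observations)


if __name__ == "__main__":
    for hands in (100, 1_000, 10_000, 100_000):
        moe = margin_of_error(0.30, hands, spot_rate=0.02)
        print(f"{hands:>7} hands: 30% +/- {moe:.1%}")
```

Under these assumptions, 100 hands give about two observations and an estimate that is essentially noise, while 10,000 hands narrow it to roughly plus or minus 6 percentage points.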

    • @BarvGwydh
      @BarvGwydh 5 days ago +8

      Actually I take back some of what I said. I think a bot could squeeze out a lot of EV by making adjustments over small samples as small as like 50 hands. But it could open the bot up to being exploited for sure. And in any case, the nash equilibrium strategy doesn't need to be improved, it's unbeatable.

    • @queenstown9534
      @queenstown9534 4 days ago

      It's just playing according to its preset understanding of optimal play. If it was an actual AI that was taking in other players' tendencies, it wouldn't bother making theoretically correct calls like the 54o hand as it'd know it's rarely good against Brandon. Against Olivier Busquet, though, I'd say it's not a horrible call.
      Edit:
      Think of solvers (like this AI) more as extremely complex and detailed flow charts.

  • @jttech44
    @jttech44 4 days ago

    Heads up all in calls need to be much more conservative. Really, overall, it needs to have less weight put on the opponent's bet if it's going to survive any real players at all.
    Also understand that, poker solvers already exist, no need to reinvent the wheel there, all the AI needs to do is take a broader look at equity, not just a single momentary hand but the totality of hands, and what the pot values are because pot value has a whole lot to do with strategy. Think of a bet as buying a ticket to the lotto, and the cards in play determine the odds of winning. It appears to not be doing any of that.
    Also understand that, data is available on how the professional players play certain hands, as is data on how people play online as well. That's a powerful tool that you don't have access to at the moment, but it's simple enough to get access, you just buy it. You could even have a way to have your system ingest that data, and a separate play mode for unknown players, such that you could eventually learn a given player and profile them.
    Good luck.

  • @SchmidtyBrothers
    @SchmidtyBrothers 2 days ago

    This video was awesome! Don't really have anything else to say just commenting so this gets more views.

  • @SuhendraLie
    @SuhendraLie 4 days ago +2

    Maybe I'm missing the point here, but so this is a 16-minute video on reinventing GTO Wizard / PIO Solver?

    • @Gatorbait1869
      @Gatorbait1869 2 days ago

      This bot isn’t even close to as sophisticated as those softwares

    • @SuhendraLie
      @SuhendraLie 2 days ago

      @@Gatorbait1869 I was kidding, this is still a commendable endeavor.

  • @TheStein36
    @TheStein36 4 days ago +1

    GTO bots already exist. Most of these poker sites don’t do anything to stop them.

    • @wesleykim1758
      @wesleykim1758 4 days ago

      GTO doesn't know what to do when a donkey overplays marginal hands and doesn't follow standard range conventions.
      The strategy assumes both players play very rationally/reasonably with specific ranges of hands. Once you have a non-conformist player the strategy starts to show gaps. For example, OMC only plays AA and KK; GTO doesn't factor that in and will therefore consistently be at a disadvantage, since it assumes the player is playing a more reasonable range (i.e. ~50% from the button, ~10% UTG). If you adjust your bot on the fly for this it will perform as intended, but you'd need to know and make that adjustment for every player. Also, when you watch top players, they play close to GTO but break away from it on occasion to exploit their opponents.
      This AI tries to adapt to the player, which is interesting since it makes it closer to exploitative than GTO, which is more profitable if done correctly.

    • @TheStein36
      @TheStein36 4 days ago

      @@wesleykim1758 Of course, exploitative will always be more profitable and deviating is the correct thing to do. GTO will always be net zero (or lose to rake) vs other GTO bots. However, an exploitative player will never beat a GTO bot (if it's truly playing perfectly). Not to mention bots have no human emotions and will always be playing their A game.
      With all that being said, most bots aren't playing perfect GTO as of yet, and a good number of them are losing players that only profit thanks to rakeback/promos.
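
The trade-off argued over in this thread (equilibrium play never loses in expectation, exploitative play earns more but can itself be exploited) shows up even in rock paper scissors. This is a toy sketch; the rock-heavy opponent is a made-up leak and the names are illustrative.

```python
# Payoff to the row action against the column action: rock, paper, scissors.
PAYOFF = [[0, -1, 1],
          [1, 0, -1],
          [-1, 1, 0]]

PURE_STRATEGIES = ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0])


def expected_payoff(mine, theirs):
    """Expected payoff per round of mixed strategy `mine` against `theirs`."""
    return sum(mine[a] * theirs[b] * PAYOFF[a][b]
               for a in range(3) for b in range(3))


def worst_case(mine):
    """Payoff of `mine` against the opponent's best counter-strategy."""
    return min(expected_payoff(mine, counter) for counter in PURE_STRATEGIES)


EQUILIBRIUM = [1 / 3, 1 / 3, 1 / 3]
ROCK_HEAVY = [0.5, 0.25, 0.25]    # hypothetical opponent with a leak
ALWAYS_PAPER = [0.0, 1.0, 0.0]    # maximally exploits the rock-heavy leak

if __name__ == "__main__":
    for name, strategy in (("equilibrium", EQUILIBRIUM), ("always paper", ALWAYS_PAPER)):
        print(name,
              "vs leak:", round(expected_payoff(strategy, ROCK_HEAVY), 3),
              "worst case:", round(worst_case(strategy), 3))
```

The equilibrium mix breaks even against the leak and against everything else, while always playing paper wins 0.25 per round against the leak but loses every round to an opponent who notices and switches to scissors.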

  • @RishabhBohra13
    @RishabhBohra13 5 days ago

    damn W

  • @emotionless9887
    @emotionless9887 4 days ago

    looks like this AI learned from viktor blom 😂😂😂😂

  • @Alexandertygreat
    @Alexandertygreat 3 days ago

    Making a bot to beat the market through options would make boatloads. Remove the human element, and options are closer to poker than you would ever believe. Or in-game sports betting, especially basketball. A video on how to create said bot would be amazing. I would kiss you on the lips 💋

  • @OmateYayami
    @OmateYayami 4 days ago

    I think it doesn't consider consequences enough. Fold vs call, or aggressiveness, should probably be weighted against stack and pot size. You wouldn't call with a 5% chance hand because of a coin flip on an all-in, while you could on a small stake.
    The bot went too aggressive on two occasions and lost.
    The bot played just for a single game and didn't factor in stack and pot sizes, did it?

    • @stevengongg
      @stevengongg  4 days ago +1

      It's due to the fact that I abstract away bet sizes, so it sometimes sees raising and going all-in as essentially the same, which can lead to sub-optimal decisions at times
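
To see how abstracting bet sizes can blur a raise and an all-in, here is a toy sketch. The two abstract sizes (half pot and full pot) are hypothetical and are not the buckets used in the author's code; the point is only that every real bet is translated to the nearest size the strategy was trained on.

```python
# Hypothetical abstract bet sizes, as fractions of the pot.
ABSTRACT_BET_SIZES = (0.5, 1.0)


def to_abstract_bet(bet, pot):
    """Map a real bet to the nearest abstract size the strategy knows about."""
    fraction = bet / pot
    return min(ABSTRACT_BET_SIZES, key=lambda size: abs(size - fraction))


def required_equity(bet, pot):
    """Break-even equity for calling `bet` into `pot` (pot size before the bet)."""
    return bet / (pot + 2 * bet)


if __name__ == "__main__":
    pot = 100
    for bet in (50, 100, 600):
        print(f"bet {bet}: abstract size {to_abstract_bet(bet, pot)}, "
              f"true equity needed {required_equity(bet, pot):.1%}")
```

With these buckets a pot-sized bet and a 6x-pot shove look identical to the strategy, even though the true break-even equity rises from about 33% to about 46%, so a calling frequency tuned for the smaller bet is too loose against the shove.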

    • @OmateYayami
      @OmateYayami 4 days ago

      @@stevengongg Thanks for a fast reply. By "abstract away", do you mean completely ignore or substitute with a simpler idea?
      To me it's not only a raise/call being mixed with all in. It is about how many games you will have left if you lose. Maybe that will make the bot more exploitable, because it would play less aggressively when low on chips but on the other hand it would have more games to come back.
      I think the optimal strategy changes depending on chip state. Bet vs pot vs stack.
      To me it seems the bot lost because it got heavily punished for taking big risks at high stakes and getting unlucky.

  • @davemuso1958
    @davemuso1958 4 days ago

    Great work... but somehow a relief that a human won... for now!

  • @DMalek
    @DMalek 3 days ago

    Your AI needs to understand pot odds lmao

  • @perc_popper27
    @perc_popper27 4 days ago

    This video is really cool and it looks like you worked really hard, however how close do you think your AI is actually playing to an unexploitable strategy? Considering the best solvers currently on the market cannot solve things instantly like how you are doing in this video, I'm skeptical about the quality of your AI's play vs an actually unexploitable strategy.
    I've always wanted to code a solver myself, never really considered involving AI in it but some of the newer tools have been implementing it somewhat so that is interesting. Nice job.

  • @greymatters14
    @greymatters14 4 days ago +4

    When did we normalize dudes wearing grandma's pearl necklaces?

    • @stevengongg
      @stevengongg 4 days ago +2

      i know fashion these days... it's getting out of hand

  • @gregstrauss3888
    @gregstrauss3888 3 days ago

    here before this blows up

  • @robgreen022
    @robgreen022 3 days ago

    That AI is dumb AF 😅😅😅😅😅😅

  • @extremelylucky999
    @extremelylucky999 3 days ago

    Well done. But slightly too educational on the "edutainment" meter. I would definitely do another video with a different spin on this later.
    GPT-4o vision + 10M pre-solved spots + online poker + documenting your results would slap. Play a bunch of $1 or $3 tournaments using GPT to tell you every single play, but it would essentially destroy online poker as we know it, and sites like Americas Cardroom would vanish. They are not very good people.

  • @cameronmciver7588
    @cameronmciver7588 1 day ago

    Cool video. I feel like this bot would do better in a non-tournament setting. Its ultimate loss highlights the issue with sampling rather than using sampling to inform decision making.
    A person would see bottom pair and know they still have a chance, but the cost of losing this hand is higher than usual because you don't get to play another if you lose.

  • @gourabsarker9552
    @gourabsarker9552 5 days ago +1

    Sir do you earn 150k dollars a year? Plz reply. Thanks a lot.

  • @ReaperAHHH
    @ReaperAHHH 4 days ago

    GTO only works against players who are good and making correct decisions

  • @k.a.8725
    @k.a.8725 4 days ago +1

    I see one problem with this ai. If folding is the better choice 67% of the time, then it should fold 100% of the time. I think it's pretty obvious why.

    • @Danicker
      @Danicker 4 days ago

      Not necessarily. If you always act the same way in the same position, then it becomes easier for your opponent to predict your cards. Think of the scissors-paper-rock example: you can't always play the same move; you have to switch it up.

    • @k.a.8725
      @k.a.8725 4 days ago

      @@Danicker The same position basically doesn't exist. There will always be either a different hand, a different pot, a different opponent's bet, a different balance, or different cards shown. By your logic, the AI should in very, very rare cases go all-in with 7-2 offsuit pre-flop because there are some cases where that would've been the best choice... Or in very rare cases it should call an all-in with 8-high because sometimes that's the best choice. You will never see any poker pro making an irrational decision because "sometimes stupid shit ends up being the best thing you can do". No good player would ever play poorly on purpose to "switch things up", because that is at its core what you were suggesting. The AI should be allowed to make terrible choices to switch things up... That was the exact reason he lost the tournament.

    • @Danicker
      @Danicker 4 days ago

      @@k.a.8725 Yes, you're right, the AI shouldn't be using those percentages to make its decision; however, there is merit to having a non-deterministic strategy. There is a difference between switching things up for the heck of it and playing a balanced strategy that can't be exploited. The latter is what you want, but it's not easy to find. And I can almost guarantee that the Nash equilibrium is not deterministic, just as in scissors-paper-rock.
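
The scissors-paper-rock point can be checked numerically. The sketch below is only an illustration (the helper name `exploitability` is made up): it measures how much a best-responding opponent can win per round against a fixed strategy. The uniform mix gives up nothing, while any always-the-same-move strategy loses every round.

```python
ACTIONS = ("rock", "paper", "scissors")

# PAYOFF[a][b] is the payoff to a player choosing `a` against an opponent choosing `b`.
PAYOFF = {
    "rock": {"rock": 0, "paper": -1, "scissors": 1},
    "paper": {"rock": 1, "paper": 0, "scissors": -1},
    "scissors": {"rock": -1, "paper": 1, "scissors": 0},
}


def exploitability(strategy: dict) -> float:
    """Best expected payoff per round an opponent can get against `strategy`,
    where `strategy` maps each action to the probability of playing it."""
    return max(
        sum(prob * PAYOFF[reply][action] for action, prob in strategy.items())
        for reply in ACTIONS
    )


uniform = {"rock": 1 / 3, "paper": 1 / 3, "scissors": 1 / 3}
always_rock = {"rock": 1.0, "paper": 0.0, "scissors": 0.0}

uniform_loss = exploitability(uniform)          # opponent cannot gain anything
always_rock_loss = exploitability(always_rock)  # opponent just plays paper
```

The uniform mix is the Nash equilibrium of this game, and it is not deterministic.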

    • @k.a.8725
      @k.a.8725 4 days ago

      @@Danicker I agree. I think the problem was within the learning process already. He should've found a way to train an AI to adapt to people's playstyles. What he was training the AI with was a purely statistical approach. That's like me telling someone to study the movement patterns of fish and then giving him an aquarium with a toy fish in it to look at. Instead of playing through random hands a million times, the AI should ideally be trained on a variety of real poker players, as that would also be the environment it is competing in. Being able to adapt to people's playstyles would also make you "unpredictable" to some extent, as any attempt to adapt to the AI's way of playing would just be counteracted by the AI adapting to the changes you made in your playstyle. His current AI basically just knows the best play against random stuff, but not against actual players. It's basically the best player against idiots but loses to any good poker player.

    • @MiamiConfusion
      @MiamiConfusion 4 days ago +1

      The AI wants to be balanced and unexploitable, so it calls 33% of the time to protect itself from people over-bluffing. Calling with 8-high or shoving 72o is not the same thing as sometimes calling with a bluff catcher that mainly folds, since in poker you can't outright know how often someone is bluffing in a given situation.
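
One way to see where a number like 33% can come from is the calling frequency that makes an opponent's bluff break even. This is a sketch under a loud assumption: the bet is twice the pot, which is picked only so the arithmetic lands on one third. The actual spot in the video may differ, and the function names are made up for illustration.

```python
def indifference_call_frequency(pot: float, bet: float) -> float:
    """Calling frequency that makes a pure bluff break even.

    `pot` is the pot before the bet, `bet` is the size of the bluff.
    """
    return pot / (pot + bet)


def bluff_ev(call_freq: float, pot: float, bet: float) -> float:
    """Bluffer's expected chips: wins the pot when we fold, loses the bet when we call."""
    return (1.0 - call_freq) * pot - call_freq * bet


# Assumed example: a 200-chip bet into a 100-chip pot.
call_freq = indifference_call_frequency(100, 200)
ev_if_we_always_fold = bluff_ev(0.0, 100, 200)       # bluffing any hand prints money
ev_at_indifference = bluff_ev(call_freq, 100, 200)   # bluffing gains nothing
```

Folding 100% of the time in that spot would let the opponent bluff with any two cards for a guaranteed profit, which is why a balanced strategy keeps some calls in its range.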

  • @bayslive8720
    @bayslive8720 3 days ago

    How long until A.I leads to the depopulation of the poor?

  • @passivepulse4788
    @passivepulse4788 3 days ago

    If you suck, your AI is gonna suck as well

  • @JustinShaw
    @JustinShaw 4 days ago

    Nice vid! Thx!
    😅

  • @ryansunnnboii7535
    @ryansunnnboii7535 5 days ago +2

    Early gang 🔥

  • @JJ-yx6jy
    @JJ-yx6jy 2 days ago

    How bout you stop?