I created an AI to Play Poker

Steven Gong

มุมมอง 215 514

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 11 ม.ค. 2025
บันเทิง

ความคิดเห็น •

@AskiFin 6 หลายเดือนก่อน ⁺⁹⁹²
There is the famous guy who won the AI only poker tournament.. His AI just always went all in, and the other AI's thought since its a large bet, they have better hand and just folded.
@Sandeeppandek21 6 หลายเดือนก่อน ⁺¹⁵
😂😂
@Ohrami 6 หลายเดือนก่อน ⁺⁷⁷
Then those AIs were completely horrible, or the one that only went all-in got incredibly lucky.
@AskiFin 6 หลายเดือนก่อน ⁺⁶³
@@Ohrami It was a school project kinda thing, so I assume it was mix of both.
@preedeelosoponkul993 6 หลายเดือนก่อน ⁺³
Others got no AA,KK ?
@djdev182 6 หลายเดือนก่อน ⁺¹
Lol😂
@ffishcakess 6 หลายเดือนก่อน ⁺¹¹⁹
Hahhaa the punt jam on AA2 followed by the even bigger punt call "I'm folding, don't get used to it" - I believe it lmao 😂
@JonCookeBridge 6 หลายเดือนก่อน ⁺⁴
Confused why it calls the huge jam on AA2 with QT. Is it because pre-flop and flop are decoupled, somehow? Or is it not realising the difference between all in and small bet?
@BarvGwydh 6 หลายเดือนก่อน ⁺¹¹
It could mainly be because this spot never comes up in nash equilibrium strategy. I think the bot is not playing GTO in general and has some flaws, but even a well made bot might struggle with this spot because 6x pot jam should never be played.
@JonCookeBridge 6 หลายเดือนก่อน ⁺⁵
@@BarvGwydh Possibly. But watching the film I suspect his preflop model isn’t putting enough aces in this line because it’s decoupled from post flop action. Obviously with more aces in his range here, the need to call weaker hands reduces. To get the algorithm to compute a balanced preflop strategy requires CFR to be done across multiple streets, and he implies that he solved preflop and postflop separately. Basically if you just train a preflop model based on maximising equity before you see the flop, your AI won’t have enough board coverage when it’s deep, and it won’t slow play enough because it won’t see the value it gets from those things post-flop.
@worldofpoker 6 หลายเดือนก่อน ⁺¹¹²
Well done on the poker AI project, challenging your friends. It was definitely worth it, but there are obviously a few areas that use some improvement. Your explanation of Nash and CFR was quite solid, while the practical implementation seemed somewhat missing in depth, especially handling complex game states beyond simple abstractions.
This, in consequence of the pre-flop and flop separation, is what some may consider a big flaw for a more robust AI. Your idea to use Equity for card abstraction is great, but it seems that your method hugely oversimplifies the complex situations, as you noted with very similar equity hands playing differently.
Now, this would be of interest: a tournament of poker AI systems. If you're game for it, let's set up a match against Poker AI from pokerbotAI. It will be impressive to see how different strategies and their various implementations fare against each other.
@DalePumento 2 หลายเดือนก่อน
Variable change is a big factor too. Like the 66 and KQ equity, and each of their distinct strategy. In a single raise pots, both 66 and KQ are 100% raise to an unopened pot in any position. Post flop is where it differs. 66 needs to pot control on 1 over card board texture. Meanwhile, KQ can Cbet 1/3 pot to deny equity and/or semibluff as the aggressor. Variable change for poker is then the product of nut advantage and range advantage.
@worldofpoker 2 หลายเดือนก่อน
@@DalePumento The adaptation of strategy according to hand strength and position is key, and the variations of changes in post-flop decisions are huge. Distinctions between hands like 66 and KQ, and their respective strategies based on board texture, all point to complexities that a more advanced AI should address.
@DalePumento 2 หลายเดือนก่อน
@@worldofpoker I agree with you that OP oversimplified his AI model. The decision tree is too large for the model that he designed. Hopefully with better time and more model training, he could improve it.
@Kingcrowish 6 หลายเดือนก่อน ⁺²⁶
The timing on the Helmuth rage clip was great
@stevengongg 6 หลายเดือนก่อน ⁺¹⁸⁵
This has been a project that I've been working on for a pretty long time... so I'm happy to finally release it! Let me know if you guys want a follow-up, since there are still quite a few things I can improve the AI on. I'd like to challenge an actual professional poker player next time. If you enjoyed watching the video, make sure to leave a like, it helps grow the channel :))
@thecodingmentor7701 6 หลายเดือนก่อน ⁺⁵
This is an incredible video. You combined two of my facorite things, AI and Poker. Cant believe this does not have more views! Such a high production quality also. As for me, I would definitely like a follow up to this video!
@Imokay304 6 หลายเดือนก่อน ⁺²
AI will take over eventually. Its not smart to be making videos like this. It will definitely ruin online poker one day if it hasn't already... Sad shame
@ahsidodna3355 6 หลายเดือนก่อน
for sure, this is very interesting, please continue
@perfectionist8380 6 หลายเดือนก่อน
Maybe try training the AI on all the top poker tournaments (or just Dan Negreanu strategy) & also use facial recognition for player emotion/stress.
Or simply take into account the time to react factor.
Great project overall though! Kudos
@matthewgarcia9164 6 หลายเดือนก่อน
@JimStanfield-zo2pz its bugged forsure
@tfilipeac 6 หลายเดือนก่อน ⁺²²
Hey! Poker player here, first of all grat job, must not be easy at all!
I think you would have better success playing against your friends (and bad poker player in general) if you programme your AI not to play in equilibrium but to play unbalanced in order to exploit these weaker players' leaks!
@CreativeMindstorms 6 หลายเดือนก่อน ⁺¹⁵
I have been wanting to see this for so long! Great job on the AI, it seemed to be making right plays most of the time!
@fauge7 6 หลายเดือนก่อน ⁺¹
Yeah but most of the time is bad enough to make you go broke...
@B00nater 6 หลายเดือนก่อน
It really isn’t. Seems easy to exploit.
@kaitan5508 4 หลายเดือนก่อน ⁺⁸
AI was on tilt after the punt all-in call, its ok, we've all been there 😫😫
@MarijnRoorda 6 หลายเดือนก่อน ⁺⁶³
I went from a honest government ad warning me about AI, to this...
@Christoff070 6 หลายเดือนก่อน
Great vid that one
@authenticallysuperficial9874 6 หลายเดือนก่อน ⁺⁸
"Honest government" is an oxymoron
@Christoff070 6 หลายเดือนก่อน
@@authenticallysuperficial9874 yup,that's the point of it
@MarijnRoorda 6 หลายเดือนก่อน
@@authenticallysuperficial9874 It's a youtube channel that warns people for the bullshit during elections. It has pretty girls and plenty of swearing in australien. The Juice media.
@HamguyBacon 6 หลายเดือนก่อน
"government warning about ai" 😂
@icanyo9301 6 หลายเดือนก่อน ⁺⁶
this is so sick! i need to learn to code cus my god it seems so satisfying completing a project like this
@belmicah หลายเดือนก่อน ⁺¹
His AI was a total calling station lol
@fanouzhang3887 6 หลายเดือนก่อน ⁺¹⁰
The AI didn't seem to be playing very close to a gto solution of most spots
@tehf00n 6 หลายเดือนก่อน ⁺⁴
A friend of mine coded a poker bot for texas hold 'em back around 2006. He set one instance running 1c/2c cash game tables and was returning around 10c/day profit playing the odds and position only. I haven't seen him for years to see how he went once he weighted it for variance and stake styles.
@spacebomb9126 6 หลายเดือนก่อน ⁺³
He's probably in prison now for theft on gambling sites.
@tehf00n 6 หลายเดือนก่อน
@@spacebomb9126 I highly doubt it. The guy was a top level coder and wrote most of the original office polymorphic viruses for bug bounties with Microsoft. And worked a government job. More likely he knew his way around a VPN and a dummy account.
@pipipupukok 6 หลายเดือนก่อน ⁺²
I absolutely love your videos man, you inspired me alot, and now i will be an intern in the biggest it company in poland at 16 years old!
@stevengongg 6 หลายเดือนก่อน ⁺¹
good job!! keep pushing
@JoseLopez-qm4ro 6 หลายเดือนก่อน ⁺¹
Good job man, 👏 props to you regardless of the loss...Take failure, learn from it and embrace it to always make your self better..
@haviick1014 6 หลายเดือนก่อน ⁺⁷
This needs a part 2
@ChrisMaveric 5 หลายเดือนก่อน ⁺¹²
Just a heads up... your video shows two 6 of Diamonds, one is in the AI hand and then a second is the turn card...
@AbomidablePlayer 6 หลายเดือนก่อน ⁺⁵
The Phil Helmuth reference is gold!
@Zarl451 3 หลายเดือนก่อน ⁺¹
Interesting effort, part of the problem you may be encountering is that the GTO solvers generally need an input on your opponents expected range.
GTO by default assumes the other person is also playing GTO. But as they deviate the optimal strategy is for you to also deviate.
@MrZweene 3 หลายเดือนก่อน ⁺²
That’s bs. How can a game theory optimum depend on other people behavior?
@Zarl451 3 หลายเดือนก่อน
@@MrZweene The maths has no problem at all with the "optimum" strategy existing at the level that your response would be unique to every possible opponent in every possible spot. The need for a generalised set of actions is an engineering and practical one not a mathematical one.
@adamkajdi5600 8 วันที่ผ่านมา
@@MrZweene GTO never changes, highest EV line does.
@marcuslaxson6237 6 หลายเดือนก่อน
wow this channel is going to blow up, great idea and very fun execution!
@MarioOLL6251 28 วันที่ผ่านมา
Oh heck yeah, the Daniel vs Phil clip is so iconic
@PsylocibeChillZen 6 หลายเดือนก่อน ⁺²²
why there is 2 cards 6 of diamonds on the deck? at 12:55
@JPatel1995 6 หลายเดือนก่อน
Nice catch
@VeriStrawberi 6 หลายเดือนก่อน ⁺²
At 13:09 its a completely different board ?3QQK too lol He wins with 6d7c
@jasonchan64 6 หลายเดือนก่อน ⁺¹
nice work!! keep going! dont be surprised that AI would ask you to call the last hand or even QTs hands, just the solution come from the AI is based on the optimum play against the player play perfectly (or at least much more bluff than your friends) so to compete with your friends you might need to do a lot of adjustment which your AI couldnt understand
@rogerzhang5993 6 หลายเดือนก่อน
It just means the humans are not playing at Nash equilibrium
@woosix7735 2 วันที่ผ่านมา
I feel the pain on the research paper experience
@terencehill3972 6 หลายเดือนก่อน
why the dancing animals? WHY?? 3:58..."Poker is a game of people" - D. Brunson check and mate no seriously, well-made video you made me laugh thanks, love the AI voice its hilarious 🤭
@Isaac-bb3pj 5 หลายเดือนก่อน
This was really fascinating, would love to see a tournament against pros (but like a table of 6)
@jujubaloolian 6 หลายเดือนก่อน ⁺⁴
It would be interesting to see this how this would develop further if the AI takes into account the position it is sitting at the table to see how many players bet before or would have an opportunity to bet after. Its interesting to see how other players could interpret the AI's decisions, most poker players would tell you to always bet/raise/reraise the same amount preflop for hands you play, as changing the bet amount would only signal that you have a strong hand or were bluffing. 😃
@BarvGwydh 6 หลายเดือนก่อน ⁺¹
The bot will use randomization so that sizes do not make it exploitable, it will also simply be balanced no matter what, it will have an unexploitable mix of value and bluffs. Nash equilibrium cannot be beaten, at best the opponent can break even if they also use a nash equilibrium strategy, like the rock paper scissors example where the NE strategy is to pick each action 1/3 of the time. If the bot used multiple sizes it will still balance them, or for example, if it uses multiple sizes preflop then it might use 2bb half the time and 2.5bb the other half of the time if it wanted to open raise a hand to 2 different sizes.
@jujubaloolian 6 หลายเดือนก่อน ⁺³
@@BarvGwydh That makes sense for the betting, but I still think it would make a difference based off the position you are sitting. If the bot calls/raises a lower equity hand at the bottom of its range, it has a higher chance of winning when it bets last at the table and there is only one other person in the pot then vs when its UTG and is betting the same hand into 6 people or calling the hand with 6 people in the pot. Nash equilibrium can be beaten in a game with more than two people if others have shifting strategies - it only ensures that a single player can't improve by deviating.
@rainluo4831 6 หลายเดือนก่อน
Love your videos!!! Keep pushing❤
@Bot-Frenzy 6 หลายเดือนก่อน ⁺¹
Nice work Steven! Keep it up 👍🏻
@pocket8s 6 หลายเดือนก่อน ⁺¹
okay now make it solve multiway hands! nice vid 😊
@qlf9_ 2 หลายเดือนก่อน
did not expect cornell cs 2850 to make a cameo in this video
@Rocketjake 19 วันที่ผ่านมา
You should implement more than one player into the ai, as i think it would be interesting to see its preflop strategy.
@AlessandroOrlandi83 5 หลายเดือนก่อน
Thanks, this was very interesting as for curiosity I wanted to implement it myself but never got the time to do it.
@lemmyorleans 6 หลายเดือนก่อน
Youd want to get a complete solve of the game tree for several different opening bet sizes and feed that into your ai. It wouldnt even need to do any ai magic unless you end up in a branch of the tree you didnt solve for. Every presolved scenario would just be a database lookup.
@natereg6558 28 วันที่ผ่านมา
Can tell this guy was struggling to say equilibrium lol
@seanrobinson5740 4 หลายเดือนก่อน
you should secretly scan your friends cards and make your Ai win 100% of the time 😂😂
@TrygveVang 6 หลายเดือนก่อน ⁺¹
This vid deserves 1000x more views that what it is at now holy moly
@kailinaleekuk 2 หลายเดือนก่อน ⁺¹
The goal of Texas Holdem is not to make the best 5 card hand. Your goal in Texas Holdem is to put yourself in situations where you have the larger share of equity.
@TheOnTiltBoys 6 หลายเดือนก่อน ⁺⁵
we will demolish that AI
@jttech44 6 หลายเดือนก่อน ⁺²
ChatGPT? More like ChatATM
@matteovc2158 6 หลายเดือนก่อน
You guys don't always seem to be playing all that great, but I think even you would be able to beat this so called GTO playing AI. I think every action was just horrible.
@marcusmartin7576 5 หลายเดือนก่อน
I wanted to attempt a project like this myself I would consider using poker solvers as the basis of a strategy and using solvers as a database because they implement gto and give you the proper move in every scenario
@ortizbryan27 3 หลายเดือนก่อน
I love by the end, the A. I, through training of other people playing, become a degenerative gambler. 😂😂😂
@freakoutz2707 5 หลายเดือนก่อน
Love the video man. Just wish your Mike was a little bit bit better. Only complaint though keep running.
@RandomFunZer 6 หลายเดือนก่อน ⁺¹
As an actual full-time professional kt was quite interesting to watch. But this ai at its current state would get absolutely destroyed by a real pro :).
Still a fun endeavour and for sure very hard to come up with making this system. Good job
@TheKooldude75 6 หลายเดือนก่อน
pretty cool idea, imagine if you got glasses w/ a camera and an earpiece and took this AI to a casino
@jttech44 6 หลายเดือนก่อน ⁺¹
Sounds like a great way to get kicked out of a casino
@spacebomb9126 6 หลายเดือนก่อน ⁺¹
There's an old saying "F*ck around and find out."
Try it and see what happens, kiddo.
@u10-bb5lf 6 หลายเดือนก่อน
it would be so lovely if you would do the Python tutorial or some kind of Playlist to educate your audience. Keep up, love your content!!
@SeanP1982 หลายเดือนก่อน
The hand at 12:55 has 2 x 6 of Diamonds, how did your AI not throuw an error for 2 of the same card (an invalid outcome in poker) did you give the AI rules to follow? How can the AI calculate anything like this?
@JMRSplatt 6 หลายเดือนก่อน
"Private" Cards.. I love it. I'm using this instead of hole cards from now on. Sir! These are my on cards, they are private!
@francolauciello4429 6 หลายเดือนก่อน
You should’ve made a results part at the end, good video thooo 🎉🎉
@Jordan-Tech หลายเดือนก่อน
the problem with playing optimal strat against relatively not good players is that they don't play correctly so it's harder to beat them.. "The unpredictability of suboptimal play"
- In game theory, optimal strategies assume rational decision-making and predictable behavior based on maximizing utility or winning chances. Bad players often deviate from these rational strategies, making their moves harder to predict. Good players, who are skilled at countering optimal strategies, may find themselves struggling against chaotic or irrational play.
@frey8893 6 หลายเดือนก่อน
this AI plays worse than the biggest fishes I have ever seen lmao
@carter3144 5 หลายเดือนก่อน
It would be difficult to have the AI factor in the opponents body language and other tells, but what about their strategy? Would it be possible for the AI to utilize in game machine learning to adapt its playing style based on the opponents hands (that they don't muck), raise frequency, bet sizing... etc.
Additionally, as we all know, playing poker with fake money doesn't really work super well. I was wondering if you and your friends were using real money or just chips? This would drastically change how your friends were playing.
Great video, I learned a lot. Thanks for taking the time!!
@AshtonRiddell 6 หลายเดือนก่อน
I wonder if the ai treats calling the all in as a semi bluff
@tehjoch 2 หลายเดือนก่อน ⁺¹
10:06 that card bending was so cringe
@zoroark567 4 หลายเดือนก่อน
The concept of a style of play that can't be exploited is interesting but not actually the 'best' strategy in poker. The biggest winners are the ones who are able to exploit their opponents blindspots and weaknesses, which necessarily involves exposing themselves to risk of counter-exploitation.
@RT-he6zf 6 หลายเดือนก่อน
Ive been getting crushed by these "AI`s" in online rooms for years
@TsarHare 4 หลายเดือนก่อน ⁺¹
the cards delt, of 52 say exactly the cards that will come. like it am known. play the new drones am greater fun
@timhutter 25 วันที่ผ่านมา
Wow you are absolutely impressive!
@NSelsted 6 หลายเดือนก่อน
It didn't only make 1 mistake. After it calls with the first all-in with the QT, it is clearly that it is easy to exploit. You wait until you hit a hand, where the AI hasn't shown particular strength, and then you go all-in if you have hit. That it want's to call 1/3 of the time in the last hand, shows that this simple strategy will most likely win close to 100% of the time. There are other exploits, but you are far behind to win against any pro poker player. You have a lot more work to do, before it stand a chance against even semi-pros.
@xxtripl3thr33xx 2 หลายเดือนก่อน
An I wrong to think the AI saw quickly that “he who bets last and bets the biggest usually wins” then figured the best way to do that was to bet as much as it could as soon as possible?
@WowkoHrac หลายเดือนก่อน
So the whole time i've been playing like AIs best performance?
@D3MONFIEND 2 หลายเดือนก่อน
That why the paper he derived from say in the end, you still need human data to trained on. The ai self play expect the opposition to play optimal. But human played differently and each player has different behavior to exploit into or being exploit by.
@puffypastels1172 6 หลายเดือนก่อน
Hi! I was wondering what tutorial you watched at 5:55, i'm super interesting in this stuff lol
@TsarHare 4 หลายเดือนก่อน ⁺¹
like the percentage of the cards yourself have increase every fold, as well with every card delt.
@gameaddict3884 27 วันที่ผ่านมา
Pro beat AI in the end.
@authenticallysuperficial9874 6 หลายเดือนก่อน
0:30 Please fix this issue in your audio where you played both channels at once and they got garbled.
@tunglee4349 5 หลายเดือนก่อน
impressive! I love this topic! Did u use RL to train?
@GutsFreedom 6 หลายเดือนก่อน ⁺¹
................................................................................ That is how I felt watching this video
@spacebomb9126 6 หลายเดือนก่อน
I agree.
They are teaching people how to scam people online in poker.
@Menschnskind 28 วันที่ผ่านมา
"2nd top pair" isnt a term and also isnt a semibluff
@BadMouth หลายเดือนก่อน ⁺¹
give me that program
@junaeddhiman9001 6 หลายเดือนก่อน
looks like the AI developed it's own ego😂
@CurtisCanby 3 วันที่ผ่านมา
So the training was good and it learned but then RNJesus took the wheel and sold. You just need a setting to switch it from training mode to game playing mode so it plays what it thinks is optimal
@aspzx 6 หลายเดือนก่อน
Great work. Loved the video. It seems like your friends worked out how to read the AI by the last round. When it checked after seeing each Ace and then the King, Brandon correctly guessed it had a bad hand. A human in the AI's position probably would have bluffed.
@BarvGwydh 6 หลายเดือนก่อน ⁺¹
If the AI is working correctly, this is similar to saying that you picked up on the strategy of a nash equilibrium bot in rock paper scissors - where the bot's strategy is to randomly pick each action 1/3 of the time.
@edwardwav 6 หลายเดือนก่อน
Worth the wait
@nicksommer5916 3 หลายเดือนก่อน
I might be late to the party but you could make it easier on yourself by hard coding some preflop charts and only focusing on your postflop strategy.
@anghungnguyen5115 2 หลายเดือนก่อน
I can not train . it always fails on the turn or river . please help me
@ElKolimar 2 หลายเดือนก่อน
Awesome! Congratulations on getting so far with the project! I find it very interesting how much you achieved without knowing anything about poker. I believe that, precisely because of your lack of knowledge about the game, you may have underestimated it in some way or expected positive results where they don't necessarily hold significant value.
First of all, playing GTO is the foundational study for any good poker player, but it’s not how they actually play. Why? Because they constantly rely on adaptations, with GTO being the pivot. For example, if you play against someone who goes All-In on every post-flop hand, you would likely need to start calling with a stronger range of hands. That’s how you would win, and it has less to do with strict GTO and more with adapting to the player's behavior and moves.
An interesting approach could be to implement a model similar to reinforcement learning or perhaps include more parameters that indicate the game context when training the model. The reality is that GTO play is exploitable; to be unexploitable, it would need to face another player who is also using GTO.
@KtoBylPierwszy หลายเดือนก่อน
What you wrote above is fundamentally untrue. The GTO paradigm assumes that this strategy cannot be exploited. By playing GTO, you guarantee that your opponent cannot gain any advantage over you. While GTO is a defensive strategy, in cases where your opponent does not play GTO, this strategy generates an automatic profit as a result of the opponent's suboptimal play. Deviating from GTO, meaning adapting to your opponent, serves to maximize EV (expected value). However, this always comes with the risk of opening yourself up to potential counter-exploitation.
@ElKolimar หลายเดือนก่อน
@@KtoBylPierwszy Hi! I understand your point, and I agree with the theoretical premise of GTO being unexploitable. That’s absolutely correct in a vacuum. However, my argument is more about the practical application of GTO in real games.
The reality is that while GTO guarantees no one can exploit you, it doesn’t inherently guarantee that you’ll maximize your Expected Value (EV) against non-GTO players. In real games, players deviate from GTO all the time, often significantly. By sticking strictly to GTO, you might leave money on the table against opponents who make obvious mistakes. That’s why strong poker players adapt to maximize EV in those situations, even if it means stepping away from GTO temporarily.
Of course, adaptation introduces the risk of counter-exploitation, as you pointed out, but the best players know how to manage that balance. GTO is an amazing baseline, a theoretical foundation, but it’s not always the most practical way to play in a dynamic and unpredictable environment.
@KtoBylPierwszy หลายเดือนก่อน
@@ElKolimar I'am a former professional.
As I mentioned, GTO is a defensive strategy, so it is obvious that it does not maximize EV against suboptimal oponents. The paradox of the situation is that there is no way to maximize EV in the long term without a very good understanding of GTO, because you need a reference point to do so-a point to which you can return if you are being exploited.
In practice, it is exactly the opposite of what you write: the higher the level, the smaller the edge; the smaller the edge, the pool must play closer to GTO.
@SchmidtyBrothers 6 หลายเดือนก่อน
This video was awesome! Don't really have anything else to say just commenting so this gets more views.
@jttech44 6 หลายเดือนก่อน
Heads up all in calls need to be much more conservative. Really, overall, it needs to have less weight put on the opponent's bet if it's going to survive any real players at all.
Also understand that, poker solvers already exist, no need to reinvent the wheel there, all the AI needs to do is take a broader look at equity, not just a single momentary hand but the totality of hands, and what the pot values are because pot value has a whole lot to do with strategy. Think of a bet as buying a ticket to the lotto, and the cards in play determine the odds of winning. It appears to not be doing any of that.
Also understand that, data is available on how the professional players play certain hands, as is data on how people play online as well. That's a powerful tool that you don't have access to at the moment, but it's simple enough to get access, you just buy it. You could even have a way to have your system ingest that data, and a separate play mode for unknown players, such that you could eventually learn a given player and profile them.
Good luck.
@angelma9087 หลายเดือนก่อน
Jajaja me gustó mucho, si le agregas el razonamiento del rango de manos que puede tener el rival según sus acciones, su nivel va a mejorar drásticamente, por eso hacia call en los all in, porque no consideraba la mano del rival fuera buena, solo que fuera una al azar.
@marekbalco9594 6 หลายเดือนก่อน
How can I run this code to be able adding my hand cards and board cards?
@TheInternetofRandom 6 หลายเดือนก่อน
Id say that you programmed it in a mathematical way using equity. I dont know much about programming so please correct me if I am wrong but a better way would be the AI narrowing the opponents range down using the information that the player gives and use that to make accurate decisions
@tehf00n 6 หลายเดือนก่อน
You could have all kinds of heuristics for a player if you were recording their decisions but that might contravene any kind of rules of a provider. But for personal use you could do all what you suggested and go even further to exploit the Mersenne Twist algorithm that all casinos use to shuffle their poker decks.
@evadecaptcha 2 หลายเดือนก่อน
The problem is that calling there 30% of the time is WAY too much lol. It seems to provide a percentage chance to take actions that should be a 0% frequency (like the call with Q high). This AI will hemorrhage money. I'm guessing the issue is that it's not able to account for your opponent's range, and assumes they could have any 2 random cards for any situation, though I don't know the exact details of setting the AI up, so that may just be an assumption on my part.
@mrjaekwan73 หลายเดือนก่อน
What RL model did u use?
@spyseefan975 5 หลายเดือนก่อน ⁺⁴
Isn't poker solved already?
@Ярослав-н3р3ь 2 หลายเดือนก่อน
Wdym
@chancelyons7401 2 หลายเดือนก่อน
Heads up poker is "solved" but add one more person and all the math becomes gibberish (for now)
@queenstown9534 6 หลายเดือนก่อน ⁺²
12:51 Heads up, I wouldn't exactly call this a semi-bluff as BB's range should be way weaker than the AI's, so it's a value raise. Very standard.
13:26 This hand is atrocious. I'm guessing you guys are playing 50/100 this hand, hard to tell. In which case we're ~30bb effective and this should just be a fold for the AI as even against the bottom of SB's range, it's not looking too hot. Plus, AI blocks the more natural semi-bluffs like Qxhh for a second nut flush draw. If we can really call a 7.25x shove a natural semi-bluff.
Why does the AI call? If I *had* to rationalize a call, I would say that this shove is either the nuts or air and because we block the Qh, we have more equity against most air hands, except Khx. This is a horrible justification for a call but, it's what it might be thinking.
Interesting note, at 20bb effective, solver calls QhTd at ~10% frequency.
15:02 It might seem insane but this hand is not so horrible from your AI's perspective. It's actually correct from a GTO perspective. Reason for the call is that you are beating 137/185 hands (74.5% equity) and getting ~1.53:1 pot odds (~40% equity required to call and break even). If we folded every time, we would be over folding and losing money long run given the price is just so good.
I think your AI plays kinda okay, considering it's only learnt over 1m hands. The QT hand is crazy, one other consideration is that it's not gonna adjust to the mistakes made by other players so it's gonna make some whack decisions like this one. I'd be interested to play it and also see how it plays against GTO Wizard.
@parasences 6 หลายเดือนก่อน
Wow, surely noone ever did that before...
@noThankyou-g5c 5 หลายเดือนก่อน
what is second top pair? idk if that’s a real term, not trying to be an asshole. also im not sure id call that raise a semi bluff, i at least normally associate semi bluffs w a hand that has weak showdown value but has considerable equity to a hand that could win at showdown. I’m not very good at poker all things considered and dont play a lot of heads up but from what I do know about heads up I’d say it’s more likely that second pair is just a good enough hand to raise. In heads up poker things like second, third, even bottom pair, are relatively good hands in the grand scheme of things, or at the very least theyre considerably better than second pair in 6max or full ring poker where 2nd pair is generally just seen as a bluff catcher/showdown hand
@MrWhatsyourflavour 6 หลายเดือนก่อน
Nice work !
@cococam3411 5 หลายเดือนก่อน
Well, first I'd like to congratulate and thank you for your immaculate work. But I'm running into some problems, I did everything by the book, downloaded the stuff with git, but when I run the script, I see a table, with 2 players, cards folded, Total pot $800, but how does it work, I'm completely clueless, I'm running Ubuntu 22.04.
Thanks in advance for your reply.
@anghungnguyen5115 2 หลายเดือนก่อน
Hi , I also have some problems. Please help me. It not work
@StatisticianInStilettos หลายเดือนก่อน
Love this!
@anotherperson76 6 หลายเดือนก่อน ⁺¹
That was unreal
@OmateYayami 6 หลายเดือนก่อน
I think it doesn't consider consequences enough. Fold Vs call or agressivness should probably be weighted against stack and pot size. You wouldn't call 5% chance hand because of a coin flip on an all-in, while you could on a small stake.
Bot went too aggressive on two occasions and lost.
Bot played just for a single game and didn't value in stack and pot sizes, did it?
@stevengongg 6 หลายเดือนก่อน ⁺¹
It's due to the fact that I abstract away bet sizes, so it sometimes sees raising and going all-in as essentially the same, which can lead to sub-optimal decisions at times
@OmateYayami 6 หลายเดือนก่อน
@@stevengongg Thanks for a fast reply. By "abstract away", do you mean completely ignore or substitute with a simpler idea?
To me it's not only a raise/call being mixed with all in. It is about how many games you will have left if you lose. Maybe that will make the bot more exploitable, because it would play less aggressively when low on chips but on the other hand it would have more games to come back.
I think the optimal strategy changes depending on chip state. Bet vs pot vs stack.
To me it seems the bot lost because it got heavily punished for taking big risks at high stakes and getting unlucky.
@pipishrimp_poker 6 หลายเดือนก่อน
why dont you just use reinforcement learning? itspretty ez to code, just need a long time to train
@Techn0man1ac 6 หลายเดือนก่อน
8:12 LOL
@Marcus-v4e 2 หลายเดือนก่อน
I would love to play your AI great job 🎉
@anghungnguyen5115 2 หลายเดือนก่อน
Hi. i want to test this AI
@daylen577 6 หลายเดือนก่อน ⁺³
What I'm missing in this is the AI trying to figure out the cards other players have based on their individual strategies, actions, and known data. For example if it figures out after a round or two that a specific player bets high and ends up winning, that's a pretty good indicator that when they bet high they have cards that would let them win. The AI knows the cards it has and the cards on the table, so by matching all of that together it could make a reasonable assumption about how likely each player is to have a winning hand.
@BarvGwydh 6 หลายเดือนก่อน ⁺¹⁰
It doesn't need to. In poker it's just extremely hard to not lose tremendously vs a nash equilibrium strategy and it's impossible to win. The bot can just play as if it's playing against an unexploitable opponent, and it will crush someone whose strategy has nothing to do with the unexploitable strategy, and it will comfortably beat a human who has spent a decade studying nash equilibrium strategies. If it adjusted to play against the poor strategy it would absolutely decimate them even moreso. However, in such a small sample, we will not discover someone's strategy. And trying to make significant adjustments based on a small sample just opens up the bot to being exploited itself. Sometimes human vs human you can find ways to exploit someone's strategy in a comically small sample like 100 hands, mostly because a lot of humans are terrible at the game, but a bot would really need like 10,000 hands just to get a vague idea of someone's strategy. Plus, if it adjusts before that, a smart player could intentionally make it adjust one way then overadjust their own strategy the other way to exploit the bot.
If anything, it might be better to datamine online heads-up games and take an aggregate strategy of all profitable humans, and the bot could maybe even play a strategy that's mostly GTO with some adjustments mixed in from this online data.
TLDR trying to make a bot adjust its strategy based on its opponent is probably a bad idea unless it's making very small careful adjustments over 10k+, 100k+ hands.
@BarvGwydh 6 หลายเดือนก่อน ⁺⁸
Actually I take back some of what I said. I think a bot could squeeze out a lot of EV by making adjustments over small samples as small as like 50 hands. But it could open the bot up to being exploited for sure. And in any case, the nash equilibrium strategy doesn't need to be improved, it's unbeatable.
@queenstown9534 6 หลายเดือนก่อน
It's just playing according to its preset understanding of optimal play. If it was an actual AI that was taking in other player's tendencies, it wouldn't bother making theoretically correct calls like the 54o hand as it'd know it's rarely good against Brandon. Against Olivier Busquet, though, I'd say it's not a horrible call.
Edit:
Think of solvers (like this AI) more as extremely complex and detailed flow charts.
@cikw7492 6 หลายเดือนก่อน
It will be better if she can exploit from opposite player and applies it in to option percentages!
@wesch6354 6 หลายเดือนก่อน ⁺³
You need to rework your AI. Facing a river jam with bottom pair is a call at most maybe 5% not 30%.
@OmateYayami 6 หลายเดือนก่อน
Not if you have infinite chips or games i.e. you can afford consequences of losing few times.
@cpatterson365 6 หลายเดือนก่อน ⁺⁵
@@OmateYayami I've been playing poker wrong all this time. Turns out I just need infinite money!
@OmateYayami 6 หลายเดือนก่อน
@@cpatterson365 you've been playing different poker. Not necessarily wrong. You manage risk differently depending on the relative bet, stack and pot sizes. The bot didn't. Look if you feel your chance of winning is 1 in 3 on a bluff, you won't go into it as likely with all-in as opposed to having 10 more games ahead.
If you have infinite chips or games there is no difference.
I'm just saying that calling vs folding being 5 or 30% is wrong. This percentage should change depending on the chips involved.
@NSelsted 6 หลายเดือนก่อน
@@OmateYayami theoretically the Martingale system works on roulette. The reason it does this, is because it assumes you have infinite money, and infinite bet sizes. For these things to have any relevance, you need to look at non infinite. And funny enough, as soon as you look at non infinite at the Martingale system, it theoretically always fails.
Creating a system with an assumption that is not real, will make a system that is not relatable. Even then, this bot would not stand a chance against a pro, even with infinite money and time.
@OmateYayami 6 หลายเดือนก่อน
@@NSelsted i might have oversimplified but you do not have infinite bet size here. There is always a limit. Despite the name "no limit" the pot limit is the sum of all stacks which is always finite. You can't always place a bigger bet and reduce game to one win need.
I have oversimplified allowing to argue that infinite aggression will always win in an invite game, but that was not the point. The point was that finite aggression was too much and unstable. With invite games it would have been alleviated but that was too broad relaxation, which you properly exploited.
I am not sure if a pro could reliably win against a properly written bot. I have my doubts.
@jb123rnd 6 หลายเดือนก่อน ⁺³
I agree with the general reaction to the last hand I don't see how the math could be right, it should be a significantly lower EV to stay in (easy to calculate) especially if you consider the weight of the bet / pool size against the percentages you reported, in fact I think your opponent started to realize that issue/calculation bug in the previous all in bet, I haven't read any papers on this but even with imperfect information the bet/pool size should have a significant effect on the odds (even if some Nash optimization requires some additional randomness injected into that decision) . I would gamble a normal CNN with a lot of training should outperform this vastly (in its current form, you should be able to beat it if you can figure out the true Nash value of each state).
Either way its a cool project, but I think there are much more simpler approaches and code implementations that could perform better here. Thumbs up for a great project idea regardless.
@justinlau9377 6 หลายเดือนก่อน
what a sick video!

ต่อไป

เล่นอัตโนมัติ