Deep Mind AI Alpha Zero's Positional Masterpiece With the Black Pieces

agadmator's Chess Channel

Views: 334,693


Published on 11 Dec 2017
Download Mproov and Improve Your Chess Today! app.mproov.me/AgadTH-cam1
Follow MprooV on Twitter / mproovapp #agadmator Check out all my videos on this match
• Google Deep Mind Alpha...
Read more about Deep Mind Alpha Zero here arxiv.org/pdf/1712.01815.pdf
Link to the other games lichess.org/study/wxrovYNH
A chess game between Deep Mind Alpha Zero and Stockfish
Google Deep Mind Alpha Zero vs Stockfish
One of the games
1. e4 e5 2. Nf3 Nc6 3. Bb5 Nf6 4. d3 Bc5 5. Bxc6 dxc6 6. O-O Nd7 7. c3 O-O 8. d4 Bd6 9. Bg5 Qe8 10. Re1 f6 11. Bh4 Qf7 12. Nbd2 a5 13. Bg3 Re8 14. Qc2 Nf8 15. c4 c5 16. d5 b6 17. Nh4 g6 18. Nhf3 Bd7 19. Rad1 Re7 20. h3 Qg7 21. Qc3 Rae8 22. a3 h6 23. Bh4 Rf7 24. Bg3 Rfe7 25. Bh4 Rf7 26. Bg3 a4 27. Kh1 Rfe7 28. Bh4 Rf7 29. Bg3 Rfe7 30. Bh4 g5 31. Bg3 Ng6 32. Nf1 Rf7 33. Ne3 Ne7 34. Qd3 h5 35. h4 Nc8 36. Re2 g4 37. Nd2 Qh7 38. Kg1 Bf8 39. Nb1 Nd6 40. Nc3 Bh6 41. Rf1 Ra8 42. Kh2 Kf8 43. Kg1 Qg6 44. f4 gxf3 45. Rxf3 Bxe3+ 46. Rfxe3 Ke7 47. Be1 Qh7 48. Rg3 Rg7 49. Rxg7+ Qxg7 50. Re3 Rg8 51. Rg3 Qh8 52. Nb1 Rxg3 53. Bxg3 Qh6 54. Nd2 Bg4 55. Kh2 Kd7 56. b3 axb3 57. Nxb3 Qg6 58. Nd2 Bd1 59. Nf3 Ba4 60. Nd2 Ke7 61. Bf2 Qg4 62. Qf3 Bd1 63. Qxg4 Bxg4 64. a4 Nb7 65. Nb1 Na5 66. Be3 Nxc4 67. Bc1 Bd7 68. Nc3 c6 69. Kg1 cxd5 70. exd5 Bf5 71. Kf2 Nd6 72. Be3 Ne4+ 73. Nxe4 Bxe4 74. a5 bxa5 75. Bxc5+ Kd7 76. d6 Bf5 77. Ba3 Kc6 78. Ke1 Kd5 79. Kd2 Ke4 80. Bb2 Kf4 81. Bc1 Kg3 82. Ke2 a4 83. Kf1 Kxh4 84. Kf2 Kg4 85. Ba3 Bd7 86. Bc1 Kf5 87. Ke3 Ke6
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
If you realllly enjoy my content, you're welcome to support me and my channel with a small donation via PayPal or Crypto.
Link to PayPal donation www.paypal.me/agadmator
Maiar Wallet @agadmator or get.maiar.com/referral/pv0mam...
BTC address bc1qckd3ut0hqyymzv33eus97ld8klj02xhk2kcwld
BCH address qzmfclyn69hqhjslls40r7r0dsttwe3tcsl946w4fr
LTC address Laarf1RmvCpLt2BcSwC1PBLG3hRC4HjBrz
NANO nano_1h1kgfaq88t1btwadqzx73rbha5hwbb88sxmfns851kwj8hnosdj51w388xx
Monero 4AdvvqmC4xhPyyRSAEDxTTAoXdxAtX2Py6b8Eh4EQzBLGbgo5rY5Khcap1x76JrDJH87yibAE9b6TPwTsvBAiFFCLtM8Be7
For any other currency address, contact me via agadmator@gmail.com
Check out ALL my videos here • "Grand Opening" - Ande...
Facebook: / agadmatoryoutube
Twitter: / agadmator
Instagram: / agadmator
Lichess: lichess.org/@/agadmator
Chess.com: agadmator
Skype: agadmator
League of Legends: agadmator :)
Watch me without ads on your Amazon devices (bit.ly/Agadmator_Amazon) and Roku TV (bit.ly/Agadmator_Roku)
Entertainment

Comments • 485

@Johaylon 6 years ago ⁺⁶¹⁹
And stockfish resigned the game... I can never hear enough of this 👏
@Nash9r 5 years ago ⁺³
Alpha Zero has no ego as well.
@inlovewithi 5 years ago ⁺¹⁵
I don't think he meant it for ego reasons, but rather because it's such a rare phrase. A situation that rarely happens.
@dwm20ll 5 years ago ⁺²
it's a super CPU vs a laptop with outdated software
@seandesir7272 6 years ago ⁺³⁹⁰
Here is my take on Alpha Zero. I watch all its games. From what I have seen, here are its tactics: it locks the middle, immobilizes a few of its opponent's minor pieces deep in their ranks, and sacrifices a minor piece or pawns to create files and mobilize all its minor pieces for positional gains. It can be down a piece or two, but in reality it is actually up because some of its opponent's pieces are immobilized or locked. Very clever. It creates multiple traps so that its opponents have the option to die slowly or die fast. Very naughty machine
@Iodestarr 5 years ago ⁺²¹
Something I've noted from watching AlphaZero is that it seems to have a tendency to use its knights to force its opponent into zugzwang. As bait, sacrificing the knight to (usually) a bishop, in turn taking the bishop.
@shankernarayan5028 5 years ago ⁺³²
It is a fan of Anatoly Karpov
@outtabubblegum7034 4 years ago ⁺⁸
It's called ACTIVITY
@shillhuntingseason9707 4 years ago ⁺⁷
It's like watching killer whales force their prey to the surface of the ocean where there is nowhere left for the prey to swim to
@misteratoz 4 years ago
It just seems like it just does everything perfectly well....
@GLu-tb1pb 5 years ago ⁺⁴⁰⁰
stockfish: e4?
alphazero: you lose.
@dannygjk 5 years ago ⁺⁴
lol savage.
@Hermes1548 5 years ago
@@dannygjk HA!
@deanaraula 4 years ago ⁺¹⁸
Just straight up “Lmaooo mate in 211 after e4”
@ojasdighe991 4 years ago ⁺¹
@@deanaraula meanwhile me proving alphazero by getting mated in 7
@kirill.borisov 6 months ago
That’s brutal.
@12345DJay 6 years ago ⁺³⁵¹
No light square Bishop was harmed in the making of this video
@niels8195 6 years ago ⁺¹⁰
underrated comment
@elliotmcgee8918 5 years ago ⁺⁴
not really. A light square bishop was captured
@erkintunca 6 years ago ⁺³¹²
So many AlphaZero videos even though you said no more :D keep up the good work, we love them all
@vvinny8 6 years ago ⁺⁷
Erkin Tunca very hard to resist the temptation!
@TaohRihze 6 years ago ⁺¹⁷
Guess Alpha Zero forced the move :)
@HargroveNation-100 3 years ago
Taoh Rihze 😂😂
@BeerdyBruceLeeCentral 6 years ago ⁺³¹¹
YEAH, whenever I see a deep mind video I insta-click. The last few days I sat down and learned about how deepmind actually works, and it turns out deepmind is learning even when it's playing games against stockfish. This answers your question about the refusing of the draw. Deepmind probably found a good continuation after buying some time with the repeat moves. Keep making deepmind videos please. Deepmind is now my new favorite chess player :)
@MadaxeMunkeee 6 years ago ⁺³⁵
Beerdy - Bruce Lee Central while it's certainly true that AlphaZero could learn while playing Stockfish, there are two reasons why I think they wouldn't have bothered:
Firstly, it might learn bad moves from Stockfish. AlphaZero learns through self-play, because it gets the best game data from itself.
Secondly, because it would waste computation that could be spent focusing on the match.
It is true though that after the match the game data could be run through AlphaZero to make it better, but 100 games would be such a small contribution to its training set that I wouldn't see the point. In the four hours, it would have played over 700 billion games with itself.
@NathanBurnham 6 years ago ⁺³⁰
I teach deep learning, and I agree that they likely didn't train deep mind while playing the matches against stockfish. If they did, the games would have little impact on its play vs the millions of games it has already played.
@Sky2042 6 years ago ⁺²³
What is probably happening with the refusals is that A0 is making the move with the next-highest probability of winning.
@omerulger8 6 years ago
Hey, I have some questions about deep learning, just basic questions on how to learn it. If you have time to answer them, I can give you contact details? Shouldn't take more than 10 mins tops
@Nathan Burnham
facebook.com/omer.ulger.397
@EebstertheGreat 6 years ago ⁺⁸
I'm not sure where you got 700 billion from, but I believe AlphaZero has played only a few million matches against itself. The preprint mentions only 700,000 games and claims that its performance exceeded Stockfish's after just 300,000.
@curtisbrown547 6 years ago ⁺³⁵
we like stockfish vs alpha zero because it's like watching the chess equivalent of a dragon ball z fight
@brianbernstein3826 6 years ago ⁺¹¹¹
Agadmator you are an amazing channel thank you for all your work
@agadmator 6 years ago ⁺¹¹
Thanks Brian
@alexcerullo3143 4 years ago ⁺¹
agadmator's Chess Channel damn this was 2 years ago is alpha any better now
@davidegallo2185 4 years ago
I really hope it can't improve further
@argonthesad 6 years ago ⁺¹⁶
It's great to see that bully get a taste of its own medicine:)
@xyon9090 6 years ago ⁺⁵²
*Agadmator said,*
"No more AlphaZero vs. Stockfish videos..we may not be able to appreciate human chess.."
You may be a victim of your own words my friend haha.
@winterguyVV 6 years ago ⁺²⁸
There is a fitness function in AI that says how good your solution is. It can be as simple as winning = 1, losing = -1. If they set a draw to -0.1 or something similar, it's going to refuse the draw by repetition. They should release the progression of learning from those 4 hours. Usually it's funny stuff. The first games are random moves ending in perpetual checks. Then it learns not to draw and comes up with some crazy attacks and trades, and eventually it would come up with openings and basic strategies. Ending in this fish-eating beast.
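The reward idea in the comment above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `REWARD` table, the -0.1 draw penalty and the outcome probabilities are all hypothetical, taken from the commenter's guess rather than from any documented AlphaZero setting.

```python
# Hypothetical reward table. The -0.1 draw penalty is the commenter's guess,
# not a documented AlphaZero setting.
REWARD = {"win": 1.0, "draw": -0.1, "loss": -1.0}


def expected_reward(p_win, p_draw, p_loss):
    """Expected reward of playing on, given outcome probabilities summing to 1."""
    if abs(p_win + p_draw + p_loss - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    return (p_win * REWARD["win"]
            + p_draw * REWARD["draw"]
            + p_loss * REWARD["loss"])


def prefers_repetition(p_win, p_draw, p_loss):
    """True if the certain draw by repetition scores higher than playing on."""
    return REWARD["draw"] > expected_reward(p_win, p_draw, p_loss)


# Made-up odds for playing on: 10% win, 80% draw, 10% loss.
print(expected_reward(0.10, 0.80, 0.10))
print(prefers_repetition(0.10, 0.80, 0.10))
```

With these made-up odds, playing on is worth slightly more than the penalized draw, so the repetition is declined. If a draw were scored as 0 instead, the two options would be exactly equal.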
@yixunnnn 6 years ago ⁺⁸³
4:26 you can smell the fear in Stockfish repeating his moves
@UnXPLO1Table 6 years ago ⁺⁷
I guess, Alpha0 was programmed to go for 2-time repetitions whenever possible, in order to induce the horizon effect in typical chess engines that examine the game tree to relatively small depths (usually up to 20 moves ahead in the middlegame). One of the ideas that Matthew Lai (one of the Alpha0 contributors) had expressed in his master thesis is to explore the tree deeper in those variations that are assessed (by a special neural network) as the most likely to be in 'the principal variation' (i.e. to be played if both sides play optimally), which is closer to how humans calculate variations, as opposed to usual chess engines that waste time on a lot of improbable variations and extend the search depth only in very specific 'violent' situations (like captures). Alpha0 uses this 'neural network for move probabilities' approach in its Monte-Carlo tree search (search for 'AlphaZero' on arXiv.org and read that preprint) and sees further than Stockfish in the critical variations that end up appearing on the board.
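A rough Python sketch of how network move probabilities can steer a tree search, using a PUCT-style selection rule of the kind described for AlphaZero-like programs. The exploration constant `c_puct`, the move names and all statistics below are illustrative assumptions, not values from the match.

```python
import math


def puct_score(q, prior, parent_visits, child_visits, c_puct=1.5):
    """Mean value so far plus an exploration bonus scaled by the network prior."""
    return q + c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)


def select_move(children, c_puct=1.5):
    """Pick the child to explore next.

    children maps a move to {"q": mean value, "prior": network probability,
    "visits": number of simulations through that move}.
    """
    parent_visits = sum(c["visits"] for c in children.values())
    return max(
        children,
        key=lambda m: puct_score(children[m]["q"], children[m]["prior"],
                                 parent_visits, children[m]["visits"], c_puct),
    )


# Hypothetical statistics for three candidate moves.
children = {
    "Rf7": {"q": 0.10, "prior": 0.60, "visits": 40},
    "g5": {"q": 0.12, "prior": 0.30, "visits": 10},
    "Kh8": {"q": -0.20, "prior": 0.10, "visits": 2},
}
print(select_move(children))
```

Moves with a high prior get searched early and deeply, while rarely visited moves keep a bonus that shrinks as they are explored. That is one way a search can spend most of its effort on the likely principal variation.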
@albo_ar 6 years ago ⁺⁸⁸
Alpha tries to win the game every time, playing what he knows is the best move. The rook is the best move until the possible threefold repetition.
@boblavey3474 6 years ago
+Albo Nice
@cuervo3097 6 years ago ⁺³
It's not just that. In the second set of the three-move repetition, the rook and the bishop end up in the opposite position compared with the first set, which I think is what Alpha wanted. So you are right, the best move for black is the rook, but not until the threefold; it's because of it. Greetings from a Cuervo fan in Huerta Grande, Córdoba
@albo_ar 6 years ago ⁺³
Hi Cuervo, I don't think that AlphaZero ever hopes for a threefold. He just wants to avoid it as long as he can find another node that's better than a draw.
@Dragon7Ball 6 years ago ⁺⁴
Albo's native language is Spanish. Since in Spanish Alpha Zero's word 'gender' would be masculine we automatically think of "he". It's a mistake we commonly make.
@ThePotaToh 6 years ago ⁺¹
Cuervo 10 It's not what AlphaZero wanted, but rather it was forced to play a different move as playing the same move gave Stockfish the chance to draw.
@strengthman600 6 years ago ⁺¹⁰
My theory for the threefold repetition thing is that both of the bots are playing their best move, which just so happens to be a repetitive move. The thing is, when they get to the third time that move is no longer the best, because it leads to a draw, so it rethinks its move and does a better one
@rohangeorge712 a year ago ⁺¹
ah so they do the second best move as the "best move" would lead to a draw otherwise so basically the second best move becomes the best move. i think that kinda makes sense yea
@liljackypaper 10 months ago
This doesn't really make sense to me. If the second move is better than a draw then how could the first move be the best move if it leads to a draw? That is counterintuitive
@gJonii 5 months ago
@@liljackypaper The draw only happens after 3 repetitions. The best move doesn't lead to a draw the first 2 times, so playing it in the tiny hope that the opponent plays a less than optimal move is worth it.
@liljackypaper 5 months ago
@@gJonii engines don't play like that though. They don't make suboptimal mate in one threats in hopes that opponents miss it
@gJonii 5 months ago
@@liljackypaper If they lose nothing from doing it, why not? Alphazero specifically, being MCTS, would treat the tiny chance of opponent playing wrong worth the extra move. Stockfish I think would treat the moves equally good, with or without mate-in-1 trap
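The "best move until the third repetition" theory from this thread can be sketched as follows. It is a toy model under stated assumptions: `choose_move`, the position keys and the win estimates are hypothetical, and a real engine would also fold side to move, castling rights and en passant rights into the position key.

```python
from collections import Counter


def choose_move(candidates, seen_positions):
    """Pick the move with the best effective estimate.

    candidates: list of (move, resulting_position_key, win_estimate), where the
    estimate is in [-1, 1] and 0.0 means a draw.
    seen_positions: Counter of how often each position has already occurred.
    A move that would bring a position onto the board for the third time is
    scored as a draw, whatever its usual estimate.
    """
    def effective(candidate):
        _move, position, estimate = candidate
        return 0.0 if seen_positions[position] >= 2 else estimate

    return max(candidates, key=effective)[0]


# Hypothetical estimates: the rook move is slightly better than the pawn push.
candidates = [("Rf7", "posA", 0.30), ("g5", "posB", 0.25)]

seen = Counter({"posA": 1})
print(choose_move(candidates, seen))  # rook move still allowed

seen["posA"] = 2
print(choose_move(candidates, seen))  # rook move would now be a draw
```

The same move is first choice twice and then drops behind the second-best option, which matches the pattern in the game without requiring the engine to have changed its mind about the position.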
@mechanicalmind07 6 years ago ⁺⁹⁴
You know what would be interesting? If they gave Alpha and Stockfish different opening positions like the Nimzo or Sicilian, or some well-known gambit positions like the King's Gambit etc., and let them play
@isolatedprawn6592 6 years ago ⁺¹⁵
Debjyoti Bose i'd love to see alphazero play the kings gambit :)
@yuyurtrtrt2160 6 years ago ⁺⁶
IIRC in the paper we have alpha vs itself 100 times for a few popular openings. But they don't show any games, only the winrates.
@fedra2866 6 years ago
on arxiv
@SniperMonkeh 6 years ago ⁺¹
The king's gambit is a forced win for black.
@isolatedprawn6592 6 years ago ⁺¹
Old man eating a cookie since when?
@EebstertheGreat 6 years ago ⁺¹¹⁴
"Is e4 a refuted opening?" Let's not go crazy, here.
@sovietai2595 6 years ago ⁺¹¹
EebstertheGreat Well, AlphaZero is probably the best chess player ever, and it never plays 1.e4
So there could be something to it.
@EebstertheGreat 6 years ago ⁺⁶
If, back when he was the best player in the world, Paul Morphy had decided to never play 1. e4, that wouldn't have meant he had refuted it. If AlphaZero never plays 1. e4, that may be because it is less successful at that opening, but there are all sorts of reasons why that might be the case beyond it simply being a bad opening.
@Pintkonan 5 years ago ⁺⁴
@@EebstertheGreat if A0 never plays 1. e4 and this is because it assesses it as less successful, this is exactly what a refutation is dude. after all, it never plays 1. e4 :o and in this video you can clearly see why.
@EebstertheGreat 5 years ago ⁺¹¹
@@Pintkonan That is not what a refutation means. A refutation does not mean the world champion is bad at that opening (or marginally less good than d4). A line is refuted if a refutation is found--that is, if a defense is found that proves the line is worse than another move. There is no defense to e4 that has been demonstrated to be successful, it's just that over many games, some engines win more with d4. If you want to be strict about it, by your logic, every opening is refuted except the single opening that Lc0 or Stockfish prefers. And if that ever changes in a better engine, suddenly the opening becomes unrefuted.
@lelik0911 4 years ago ⁺¹⁴
It’s an interesting question. To definitively refute any opening, one would have to solve the game
@znxftw 6 years ago ⁺⁴⁰
ALPHAZERO IS THE FUTURE.
@Jonathan-ec9pp 6 years ago ⁺⁴
Maybe... but if Alphazero is the future, we humans are the past...
@benjamineinhorn2314 5 years ago ⁺²
Really strikes me how visually beautiful AlphaZero's development is.
@samsmith9764 6 years ago
Love these alpha zero videos man! keep up the good work :D
@xyon9090 5 years ago ⁺²⁹
*I'd treat AlphaZero to a drink*
for winning against stockfish as payback for beating me a lot.
@thearmyofiron 5 years ago ⁺⁷
Then alpha zero beats you 10x more than stockfish
@SpaceEag11 11 months ago
I am late but the enemy of my enemy is my friend so he would still buy Alpha zero that drink 😂
@monkeysrightpaw 6 years ago ⁺⁸
Hooray! Alpha zero returns :)
@bardhanjoy 6 years ago ⁺¹⁴
No matter how hard I try to define the game with a suitable word, I am heading for the same word over and over again - "Poetry".
@onetouchtwo 6 years ago
I'm a visitor, VERY much enjoying the AlphaZero coverage. Thanks for doing these videos.
@fokkusuh4425 6 years ago ⁺¹⁴
2017 - DeepMind AI
2018 - By the time DeepMind became self-aware...
@kirill.borisov 3 years ago
It did. Now it's developing a master plan to conquer Earth.
@barbosagiordano 6 years ago ⁺⁷
Your dog is back! Great! =)
@FloydMaxwell 5 years ago
Such great analysis of Deep Mind's maneuvering of the bishop
@eshneto 6 years ago ⁺¹⁷
When refusing a draw, probably, Alpha Zero considers itself better in both positions so the draw is worse than going for the "less good" position.
@Amethyst_Friend 6 years ago ⁺⁶
Yep, this is so obvious and I find it strange that so many people don't get it.
@james_carmichael 5 years ago ⁺¹
Agreed, Alpha repeats moves because he thinks he has a better position and is trying to 'bait' his opponent or exhaust all chances that his opponent will make a different move and not repeat... After Alpha repeats twice he doesn't want the draw because, idk, Alpha thinks the position is still favorable or playable (in Alpha's mind!). So he avoids the draw and moves on. Either he always has a draw in his back pocket, or he learns a new move after the same position is repeated.
@YotamPiano 6 years ago
Alphazero just had a fish for dinner. loving those videos. his type of thinking is astonishing!!
@randomlife7935 6 years ago ⁺¹⁴
Alpha Zero overprotected the e5 pawn and maneuvered the knight to d6 for the blockade. Was Nimzowitsch correct all along? Even A0 used it.
@agadmator 6 years ago ⁺¹⁰
Maybe Alpha read "My System" :D
@randomlife7935 6 years ago
My thoughts exactly.
@xDMrGarrison 4 years ago
@@agadmator Don't scare us man :P
@realways6173 6 years ago
I really like the fact that these games are a real grind, and not just ownage in just a few moves. For humans this game is great (it's well balanced for both sides) and certainly has a lot of future!!
@dodgecoates8760 6 years ago
Great video!
@markusalanko9134 6 years ago
It would be so interesting to force these engines into a certain opening and let them continue from there, just to see how it would turn out. Gotta say I did not expect to like these engine games, but they are pure gold and I'm so excited when I see you have uploaded another one! Keep up the good work, cheers from Finland \o
@liljackypaper 10 months ago
Isn't that what originally happened? I thought neither engine has an opening book?
@benl3988 4 years ago ⁺⁸
Imagine playing stockfish with black and you're offered a draw.
But, you just think: "Nah, a4 is winning."
@pegion6275 3 years ago ⁺¹
i would love to see alpha zero playing black with various openings like the Nimzo defense, Sicilian, Caro-Kann, and against some gambit positions too.
@raisethecurve 6 years ago
I pray the algorithm is developed for commercial distribution because this method of search is beautiful to behold. Could go a long way towards training the next generation of chess players.
@LunchThyme 5 years ago ⁺⁷
The Berlin defense is better, provided you're a strong enough player to consistently beat Stockfish.
@feliscorax 2 years ago
Stockfish is the Soviet Red Army. There is no Berlin defence.
@skaterfugater 6 years ago
What I find interesting about the repetition breaks by Alpha is not the question of whether it found a winning variation in the meantime or just tries to win while having the draw up its sleeve all the time, but whether it *cares* about not losing a game and goes on with a move it considers less optimal because it *wants* to go on.
@meladezzat 6 years ago
+agadmator , plz keep making more AlphaZero videos, we need all the games against stockfish
@orlenespinal5788 6 years ago ⁺⁴
This is really funny because I have not lost with the Berlin defence yet. 15 matches.
@donny.3775 6 years ago
ur the best chess youtuber :)
@GowthamChakkravarthyNS 6 years ago
Agadmator I love your channel and have watched most of your videos. Am planning to watch the rest of the videos too. I would like you to do videos on chess openings and discuss the variations in each main opening. There are few good chess opening videos on YouTube and I am sure the entire community would learn from your videos. Please do chess opening videos.
@Themozartthug a year ago
@9.40
Look at the pattern with the pawns and the king, it's completely symmetrical. Alpha moved that king loads of moves earlier, not sure how many......it knew where to place the king, it knew what colour bishop was best, it basically knew the whole
@Superawesomebob9 6 years ago ⁺⁵
What do you think would happen if Google let Alpha Zero train for more than 4 hours? What kind of God would they create if they let it train for weeks????
#suggestion if you can find an Alpha Zero vs Alpha Zero game that would be very interesting.
@jaimeduncan6167 6 years ago
In your variation, you can simply play Bc3 with white and you do stop both pawns for a while. It seems that the pawn on the h-file will fall, but the black king will take d6 and from there it is not difficult to win.
@FloydMaxwell 6 years ago
7:55 Never has a doubled pawn looked so powerful -- Alpha's pawn fortress...wow.
@malpigwalt 6 years ago
Yes, we want them all.
@rangedfighter 6 years ago
I personally think that alpha chooses the best position, by repeating the position 2 times, and then doing it again, in the end it will be in the same position that it actually wanted to be (because after 2 repetitions it will be 1 turn away from its optimal position and after doing it twice it's exactly where it wanted to be)
It circumvents the 3 fold repetition rule so to say to force the opponent to accept a position where they normally would want to draw.
@cukbeu4662 6 years ago
i was looking forward to see where it is going to blow today
@trebledawson 6 years ago
With respect to AlphaZero *almost* doing three-fold repetition twice in a row: It is very likely that the rook moves (in response to the bishop moves) are in fact the moves that are most likely to lead to a win, if threefold repetition were not a rule. However, AlphaZero is trained to recognize when a move will result in a win, loss, OR draw; the first two repetitions are simply AlphaZero taking the most winning move at the time, but the most winning move changes when it will directly lead to a draw. What is most impressive is that AlphaZero learned the threefold repetition rule; it was not hardcoded into the neural network as it would be for a classical engine. Considering how few games end in threefold repetition, it's truly amazing how DeepMind was able to generate enough games for AlphaZero to learn threefold repetition from scratch.
@fujiapple9675 6 years ago
7:05 this position reminds me of Alpha's French Defense game, just reversed with the black pieces.
@georgiosvavliaras1066 5 years ago ⁺²
At 6:41 how did black capture after white moved to f4? They were both on the 4th row next to each other (?)
Am I missing something? Please help, I'm fairly new to chess, excuse my lack of knowledge
@jayd7948 5 years ago ⁺¹
Google En passant
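For the en passant question above (44. f4 gxf3 in the game score), here is a small Python sketch of the rule. The helper names are hypothetical, and the check assumes `last_move` was made by an enemy pawn; it does not look at the rest of the board.

```python
def can_capture_en_passant(capturer, last_move):
    """Check the geometric conditions for an en passant capture.

    capturer: square of the capturing pawn, e.g. "g4".
    last_move: (from_square, to_square) of the enemy pawn move just played.
    """
    src, dst = last_move
    double_step = src[0] == dst[0] and abs(int(src[1]) - int(dst[1])) == 2
    same_rank = capturer[1] == dst[1]
    adjacent_file = abs(ord(capturer[0]) - ord(dst[0])) == 1
    return double_step and same_rank and adjacent_file


def en_passant_target(last_move):
    """Square the capturing pawn lands on: the one the enemy pawn skipped."""
    src, dst = last_move
    return src[0] + str((int(src[1]) + int(dst[1])) // 2)


# Move 44 of the game: White plays f2-f4 next to the black pawn on g4.
print(can_capture_en_passant("g4", ("f2", "f4")))
print(en_passant_target(("f2", "f4")))
```

The capture is only available on the very next move, and the capturing pawn ends up on the skipped square (f3 here) while the pawn on f4 is removed.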
@aconsideredmoment 6 years ago
Deepmind Alpha Zero's tight knit structure and movement of play reminds me of a sliding tile puzzle, both interlocking and spiral. A snapshot of Stockfish seems a looser version of the same structure and movement relative to Deepmind Alpha Zero (e.g. 5:58).
@alexandre588 6 years ago
"And in this position, stock-fish resigned" Kreygasm
@muhammadfahad1187 6 years ago ⁺¹
Hey can you provide us with a download link to the chess engine you are using on your computer? Thanks
@stillnessinmovement 6 years ago
I first learned about the technology that DAZ uses (parallel distributed processing) in the early 90's and it was revelatory; AI is smarter when it tries to act like a real brain than a computer. I use some of the lessons from this in my personal work (making mistakes is GOOD, as it helps you learn, don't be afraid of making a mess of something, you might learn something!) and now seeing how DAZ makes such interesting, elegant moves, it's very cool to see.
@outtabubblegum7034 4 years ago
6:45 I think that this Bishop x Knight exchange has multiple purposes: strategically that's a bad Bishop in a closed position, so it's great to exchange it for a centralized knight; also that knight was defending c4, which will now depend on the Queen; as she can't move now, the obvious Alekhine Gun that Stockfish was planning to create on the f-file won't happen.
@Trynottoblink 6 years ago
This is now the DeepMind Alpha Zero chess channel.
@raamshankar4121 6 years ago
It was given clear instructions during the initial programming. It works on "Minimum defense and Maximum Attack". Stockfish has it the opposite way.
@FelixIsGood 2 years ago
That is not how deep learning works.
@armaanmalhotra9042 2 years ago ⁺¹
🔥🔥
@pashapasovski5860 4 years ago
It's fkn unbelievable! AI is going to rule the World and this game shows how!
@Xenon777channel 6 years ago
If you look in the PDF paper on this, they did put Alpha Zero against Stockfish in the Ruy Lopez for 100 games, as they did for several openings. However, it's not clear which position it started from: either 3. Bb5 - a3 as in the picture, or 7. Bb3 - 0-0 as in the "PV". Nonetheless, Alpha Zero as black won 6 games, drew 44 and lost 0 from whichever position. As white, it won 27, drew 22 and lost 1.
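The win/draw/loss figures quoted in this comment can be turned into match scores with a little arithmetic. The sketch below uses only the commenter's numbers, which have not been checked against the paper, and the usual convention of 1 point per win and half a point per draw.

```python
def score(wins, draws, losses):
    """Return (games, points, score fraction) with a draw worth half a point."""
    games = wins + draws + losses
    points = wins + 0.5 * draws
    return games, points, points / games


# Figures as quoted in the comment above (not verified against the paper).
as_black = score(6, 44, 0)
as_white = score(27, 22, 1)
overall = score(6 + 27, 44 + 22, 0 + 1)

print(as_black)
print(as_white)
print(overall)
```

On these numbers the two colours account for 50 games each, and the gap between them shows how much of the overall score came from the games with White.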
@untwerf 6 years ago
Hey agadmator, can you offer general recommendations on the best chess books available, with reference to particular authors and publishers? I would also be interested to hear specific titles that you think are particularly good!
@columbus8myhw 6 years ago ⁺⁷
I wonder if all the drawn games were drawn because of threefold repetition.
@bruceli9094 6 years ago ⁺²
More more more MOREEEEEEEEEEEZ
@brandons4240 6 years ago
What would be interesting is how fast A0 could indisputably solve chess if allowed to play long enough and self-learn to the point where it always picks the same move for any of the estimated 10^43 possible chess positions (there are an estimated 10^120 possible chess games). It constantly refines its strategy based on past learnings...it must already be close if not finished if it can beat Stockfish.
@RLinares22 6 years ago ⁺¹
I wonder if there's a way to deconstruct scenarios and outcomes from various playing scenarios by forcing Alpha 0 to play itself and set its opening sequences (Queen's Indian / Belgian v e4 or others) then release the analysis to discover why... Could be interesting. Either way it's incredible play and thank you for sharing
@spikebtvs 6 years ago
Hi, I study machine learning. I think the 3 move repetition has to do with how self-learning neural networks train themselves -- it has probably learned that the same position happening 3 times means a draw -- but if all it was given is the rules of chess it would have never explored past 3 repeated positions because it would have considered that position "known" or solved for -- the end implication is that it is now forced to pick its second best move, which it also probably thinks is winning.
@Vampiracho 6 years ago
Hello everyone! Love your videos and accent.
@Vampiracho 6 years ago
I sent $10.
@suezix8689 4 years ago
#agadmator I'm trying to find the Leela game (or Alpha) where the white queen spent much of her time on h1 but am failing. Can you or someone else point me in the right direction?
@dannygjk 4 years ago
I think you mean AZ vs SF, where SF played the QID. One of the QID games. Several YouTube people covered it. Maybe you mean this game? :
th-cam.com/video/NaMs2dBouoQ/w-d-xo.html
@pgyore3111 6 years ago
I am glad there is some discussion in the comments regarding the apparent handicaps Stockfish was dealt at the beginning of the match. Has anyone suggested a rematch yet?
@abebuckingham8198 6 years ago
To understand why the position repeats but the draw is refused we can look at the algorithm they used to train Alpha Zero. It uses a kind of Monte Carlo method which is a randomization procedure to decide which moves to try next. This means while training if you allow your opponent more opportunities to deviate from the best line you have a higher probability of winning in the position just because you get that extra roll of the dice. I would interpret this behavior as showing that alpha zero felt Stockfish's defense is optimal and that deviation from that line significantly improves Alpha Zero's evaluation of black's position.
@dakbabu 4 years ago
what is the difference between engine approach and alpha0 approach?
@redgekagaoan9462 6 years ago
nice
@DarkestValar 6 years ago
I love these no more videos series :p jk agadmator's love all ur content as usual
@arielperez3434 6 years ago
I thought you'd decided to let us keep enjoying human chess.
Won't complain, these videos are awesome.
@dickbrazen 4 years ago
When you said it's difficult to imagine how Alpha makes progress, in situations like that, I just try to find the most likely candidate and start busting stuff up. More successful for someone of my level than you might think.
@dickbrazen 4 years ago
Alpha is smarter than me tho
@jdamage68 6 years ago
Interesting..
@nipunpratap6602 6 years ago
more alpha and stockfish matches pls
@jcsmith5984 6 years ago
honestly, the only people i watch when it comes to chess commentary and analysis are MatoJelic and Agadmator's chess channel!
They give the most accurate analysis and they are entertaining to listen to and watch!
@shrimp569 6 years ago
The key here is that AlphaZero calculates move probabilities, and not just what is the best move for its opponent. So there is always a small but non-zero probability that white will play something else, thus giving black an advantage. Since AlphaZero is not penalized for playing cycles (until it leads to a draw), it is always better to play the cycle and see whether the opponent will make a suboptimal move or not.
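The argument in this comment is an expected-value comparison, which can be written out in Python. This is a sketch with made-up numbers: the 5% error probability and the position values are hypothetical, and it only models the commenter's reasoning, not how AlphaZero actually computes anything.

```python
def value_of_repeating(p_opponent_errs, value_if_error, value_if_best_reply):
    """Expected value of repeating once more before deviating.

    p_opponent_errs: assumed chance the opponent leaves the drawing line.
    value_if_error: value of the position if the opponent does go wrong.
    value_if_best_reply: value if the opponent repeats correctly, after which
    the second-best move is still available.
    """
    return (p_opponent_errs * value_if_error
            + (1 - p_opponent_errs) * value_if_best_reply)


# Hypothetical numbers on a -1..1 scale.
deviate_now = 0.25
repeat_first = value_of_repeating(0.05, 0.60, deviate_now)

print(repeat_first)
print(repeat_first > deviate_now)
```

As long as the repetition is free and the error probability is above zero, repeating first is never worse than deviating at once, which is the point the comment makes.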
@MadaxeMunkeee 6 years ago
The reason AlphaZero plays for two repetitions is because it's designed to play the best move for the position on the board.
In those situations, it really is playing the move it thinks is best. And only when the 'best' move would force a draw by threefold repetition does it consider another move.
I think the takeaway you should probably have is that in those positions, AlphaZero prefers the move only if it does not cause a draw. The move it plays instead is its second choice, but still has winning chances.
@stateofdecay2210 4 years ago
I played against stockfish with an extra queen that I added to my army lol and winning the game was a real pain in the ass because stockfish's defending is so strong
@hardkur 6 years ago
AI brings the views ;-) i would never have heard about your channel if not for the Deep mind games
@existenence3305 6 years ago
Hey Agadmator, did you find any ratings for AlphaZero??
@SimplyApollo 6 years ago
i fucking love your dry humor 0:09
@lapulgaatomica9280 6 years ago
For me it just looks like Alpha Zero doesn't lose anything playing Rf7, because if the opponent responds in the drawish way it can just go back and nothing has changed in the position. It is just scouting SF to see if it will answer in the best way possible, because if it doesn't there might be some crushing lines behind it
@jasonq7504 4 years ago
4:37 Maybe alpha zero is giving up a move to reposition the White bishop, since it placed the knight in a square blocking the bishop.
@Jacob32905 6 years ago
Alpha vs Magnus is gonna be an awesome match!
@dannygjk 5 years ago
Um yeah... no
@Tutdelasmore 2 years ago
not really, no human stands a snowball's chance in hell against a top tier engine
@ErnestoAE 6 years ago
I was expecting a bit more at 6:44 regarding the lines with Qxe3 capture and Re2xe3
@MrYonch 6 years ago
Amazing video, thank you! I have some questions. It seems to me, from the AlphaZero games and the paper, that its power lies in super-advanced strategic thought (or maybe strategic calculation? It's hard to choose words to describe this "entity"). Whereas, from my limited knowledge, the strength of Stockfish (and chess engines in general) lies in brute-force calculation. So, with opening books and various middlegame and endgame tables added, Stockfish is merely "mimicking" strategic thinking, but isn't actually considering positional aspects such as space usage, flexibility, activity and synergy. These ARE eventually "taken into consideration" indirectly via brute force, because the consequences of such elements are evident in the lines Stockfish calculates. Against a human or an inferior engine, the force of calculation is enough to "hide" the inability to think about or calculate strategy. But it seems this is how it is outplayed by AlphaZero. Also, Stockfish is engineered by humans to evaluate a position not only by calculating possible lines of play but also through numeric material values. Maybe we humans "misled" Stockfish by "teaching" it a wrong or incomplete process for evaluating material and position... Maybe AlphaZero can teach us a new way of thinking about material value. Either we will learn that a knight is actually worth 3.5 and a bishop is worth 2.7, for example, or that it's wrong to even go down that line of thinking.
What's also interesting, in my opinion, is that SF's brute force makes it a "god" of tactics, as tactics are based on calculation rather than "thought". (They could also be based on 'post-calculation'. A GM doesn't always have to calculate a full line to spot a tactical trap; he/she can train to see it by noticing patterns and structures, or known lines of "theory" based on calculations made by them or someone else (including engines) in the past.)
I believe Stockfish is bound to always calculate, and it can't develop these abilities that GMs can. Though it probably doesn't mind (pun intended ;) ). It is a pretty f***ing good calculator.
But is it possible that AlphaZero DOES develop (like a human would) the ability to recognize tactics without calculating all the time?
Is it possible AlphaZero is "thinking" strategy in a broad and complex way?
Is it possible Stockfish is still superior in tactics? It would be interesting to present them both with very complicated chess puzzles to see which is better. (Though probably even AlphaZero's inferior calculation power of 'only' 80,000 positions per second can handle any chess puzzle we humans have created, and the gap between SF's and AlphaZero's tactical quality, if such a gap indeed exists, would be insignificant or impossible to notice unless both of them are given only fractions of a second to solve the puzzle.)
I want to add that all the assumptions I based my thoughts upon could be false. I am new to chess and know almost nothing about computing and AI tech.
Also, as some people find it somewhat depressing that AlphaZero belittled centuries of game development in 4 hours, I want to add an encouraging thought:
Even though AlphaZero outclassed us and our programs so effortlessly, it still isn't capable of INVENTING AND DEVELOPING the game of chess. Or even if it is, if instructed to come up with a game, it can't do so just because it WANTS to and is INTRIGUED by it. We still have the ability to do something for the sake of pure enjoyment going for us. For now. :)
@brianniemi7051 6 years ago
You have won the TL;DR award, my friend יונתן ריבק
@pagesehj9260 4 years ago
PLEAAAAASE DO MORE ALPHA ZERO GAMES
@Stl71 6 years ago
It is time for the best GMs to unite in one team. We demand a match of this team against A0 robot now!
@SafetyBoater 6 years ago ⁺¹
Hey from Alabama
@Koew 6 years ago ⁺³
Hi agadmator. I just want to share: wouldn't it be interesting if Alpha Zero played against itself? I mean, what if the same moves are played every time? What if white always wins? What would it mean?
@CesarGomez-kp5lm 5 years ago ⁺¹
That has already been done...
like 1 million times.
@00tact 5 years ago ⁺¹
Sheppard. That’s exactly how Alpha0 learns.
@simonemiglioli1165 years ago
Draw.
@keepthingssimple 6 years ago
The idea of repeating the move ... is to check whether your opponent finds the correct sequence .. I have seen Stockfish do this to me many times when I am analysing my games .. even with a +4 advantage .. there is a chance that your opponent might do something wrong that will increase your advantage and let you finish in fewer moves ^_^
@BLUEGENE13 5 years ago
I don't understand why white would ever play c3 with the pawn. Can someone explain? Why would you ever block your knight like that?
@rooksman64 6 years ago
take a shot every time Agadmator says “captures on d4”
@krzysztofjaneczek506 6 years ago
It is striking to me that in all of those games Stockfish ends up repeating moves without good reason. Stockfish looks for the best move possible; AlphaZero is always prepared and looks for the best move along with the best plan.
@sauravkumar3278 6 years ago
Last time you said you couldn't show any more Alpha Zero videos.
