Day 2: Revenge of MENACE

  Рет қаралды 123,739

Matt_Parker_2

Matt_Parker_2

Күн бұрын

Пікірлер: 369
@AndrewWilsonStooshie
@AndrewWilsonStooshie 5 жыл бұрын
"In the end we just had that box next to the desk because it was always used" You actually created a memory cache!
@DerMangoJoghurt
@DerMangoJoghurt 7 жыл бұрын
"Young children are basically just random number generators." - Matt Parker, 2017
@dustinandrews89019
@dustinandrews89019 7 жыл бұрын
They probably are a fantastic and cryptographically secure RNG.
@flymypg
@flymypg 7 жыл бұрын
The NSA runs orphanages for this. Modern spin on Oliver Twist.
@arcchitjain
@arcchitjain 7 жыл бұрын
LMAO!!
@Vocnor
@Vocnor 7 жыл бұрын
When can we buy this on a t-shirt?
@andrewjknott
@andrewjknott 7 жыл бұрын
Please sir, can I have more entropy?
@jeffcon1
@jeffcon1 7 жыл бұрын
Does "dying" mean it learned "The only wining move is not to play"?
@theshadowscreen8028
@theshadowscreen8028 7 жыл бұрын
It means that the starting box ran out of beads, so essentially yes, it immediately thinks that the best move is to resign.
@natdrat00
@natdrat00 7 жыл бұрын
'Greetings Professor Falken. How About A Nice Game Of Chess?'
@RowanIngram
@RowanIngram 7 жыл бұрын
A strange game. The only winning move is not to play. How about a nice game of chess?
@thejunkman
@thejunkman 7 жыл бұрын
IMO yes, it learned it is a strange game.
@zionemond2307
@zionemond2307 7 жыл бұрын
It meant that the starting box ran out of beads so it immediately "resigned" as its first move
@RBLXbranefreez
@RBLXbranefreez 7 жыл бұрын
"young children are just random number generators" never before have you described humanity so accurately as a mathematical concept, Matt!
@hhhsp951
@hhhsp951 3 жыл бұрын
@@eliasnierengarten9362 we realize that, it's just that with such a large amount of children whom we cannot predict what numbers they will pick, they are random enough for most calculations that have to do with random datasets.
@kroppyer
@kroppyer 7 жыл бұрын
7:27 - that's not a weird move to do. It leaves 6 squares for the opponent to choose from, 4 will result in a win for menace, 2 in a draw. The opponent needs to be able to think 3 moves ahead in order to find the drawing move(s). This is _the_ way to win a game of knots and crosses/tic-tac-toe. Fun fact: this game is called "boter, kaas en eieren" (butter, cheese and eggs) in Dutch. I don't know why.
@Bobal27
@Bobal27 5 жыл бұрын
Ignore this comment, it was edited from an erroneous statement.
@recanady24
@recanady24 3 жыл бұрын
That's what I was thinking while watching it. If the human player plays one of the other two corners, it leads to a draw. If the human player plays one of the other four spaces, it leads to a win for MENACE.
@cameronmueller-harder3916
@cameronmueller-harder3916 Жыл бұрын
@@recanady24 Exactly! This is basically always my second move, I was very confused when they said it made no sense. Like literally if you google "how to always win at tic tac toe" that's the move order that every single result will be describing...
@zachgriffin7387
@zachgriffin7387 7 жыл бұрын
I like to think that on day 2 MENACE rebelled, having decided noughts and crosses was boring, and tried to refuse to play anymore.
@jeffreysdavis
@jeffreysdavis 5 жыл бұрын
That "weird" corner move where it blocks itself on purpose is actually a very good one. If the next turn, the human player puts their mark on either of the sides next to their first play, then the subsequent play by Menace to block the human's winning move also is a setup for two possible winning moves by Menace. It's a way to coax the other player into setting you up for a guaranteed win. Alternatively, the other moves the Human player could do almost invariably end in a draw, so it's actually quite a safe move to guarantee not to lose
@RoderickEtheria
@RoderickEtheria 5 жыл бұрын
And it's better than the adjacent corner to the opponent. If it picks the adjacent corner 100% of the time it will end up drawing. If it picks the opposite corner, it wins 50% of the time.
@ConsciousAtoms
@ConsciousAtoms 7 жыл бұрын
Connect four has been solved. Player 1 wins, on the standard 7x6 board at least. If the first move is not optimal (play in the center), player 2 can draw (if player 1 plays next to the center) or win (if player 1 plays on one of the two outside lanes). The optimal strategy for Connect Four can be formulated using a few relatively simple rules. See www.informatik.uni-trier.de/~fernau/DSL0607/Masterthesis-Viergewinnt.pdf for the master's thesis of one of the people who solved connect four.
@AxiomPenguin
@AxiomPenguin 7 жыл бұрын
You can also diagram out tic tac toe on a torus or klein bottle. Here's a paper for infinite cylinder connect 4 of different widths pdfs.semanticscholar.org/be0f/4f7bfc36ea43412d55f892de87e2a3f028ce.pdf, one author also has some nice slides cs.gettysburg.edu/~tneller/papers/talks/acg2015.pdf. I'm looking at my friend's lesson plan for mobius strip connect 4, but it's unclear whether that is solved or just "figure out a strategy"
@CoolRubiksCube
@CoolRubiksCube 2 күн бұрын
Since I can't access the paper from this link, here's an alternative: rmarcus.info/blog/assets/conn4/thesis.pdf
@CubsYT
@CubsYT 4 жыл бұрын
Day 1: MENACE begins learning Day 2: Revenge of MENACE Day 3: MENACE grows self-consciousness
@deemcgann1695
@deemcgann1695 Жыл бұрын
day 4: The phantom MENACE
@SyntheticFuture
@SyntheticFuture 7 жыл бұрын
Actually is a really nice visualization of how machine learning works, great job :)
@seancolman9091
@seancolman9091 4 жыл бұрын
I like how it learned to go centre and never lose. Once it learns to goes centre and you don’t go a corner it also eventually learns to win every time too.
@zeldoisaguy
@zeldoisaguy 7 жыл бұрын
Connect 4 is solved! The winning solution for starting first is also online (just Google Connect 4 perfect solver)
@Hollidaydoc111112
@Hollidaydoc111112 6 жыл бұрын
Tic-tac-toe was the first game i ever solved. The fact that perfect corner start has one winnable response was always my favorite part.
@GooberZoober
@GooberZoober 7 жыл бұрын
That random corner move with 10 in it is actually my favourite move when I start in the centre and the opponent takes a corner. I find I either win (particularly from young players) or draw from there.
@chrisjohnsonvan
@chrisjohnsonvan 6 жыл бұрын
At 7:40 that is actually an optimal move - once you make that move, if the human player doesn't follow up by going in one of the corners you can guarantee a win by going in the appropriate adjacent side. And if the human player does go in a corner, you can guarantee a draw. So it actually learned the appropriate move perfectly. (Edit: Or you know, what The Cool Catfish said.)
@Kungfoobacon
@Kungfoobacon 7 жыл бұрын
Omg dudes 3:43 "it managed to beat all their bots apart from 1"... that was me :O Freakiest moment ever hearing him mention that, Matt any idea if he remembers my name? Test me I can prove it :D His bot was so cool.
@thermitebanana
@thermitebanana 7 жыл бұрын
Kungfoobacon that's awesome dude
@jetison333
@jetison333 6 жыл бұрын
Lol that's awesome. Did you do anything interesting strategy wise?
@bassett_green
@bassett_green 6 жыл бұрын
Well done mate!
@OMGclueless
@OMGclueless 7 жыл бұрын
A possible explanation for the "strange" 10 in the corner at 7:24: The corner opposite your opponent is probably the strongest move in this situation. The reason is that playing on the side leads to an easy draw (where any human playing a simple greedy strategy like "win if possible, block if possible, otherwise go randomly" will draw or win), whereas that corner leads to winning chances. If, after going in the corner opposite the human, the human plays any side move, then there is a forced win by playing in a corner, creating a triangle of O's where the human cannot block both winning threats.
@Asha2820
@Asha2820 3 жыл бұрын
Surely the way to beat MENACE near the end of the day is to use the Bobby Fisher strategy: play unconventionally. If you play in a way that leads to states where it has not yet been trained, it may make less optimal moves.
@legendgames128
@legendgames128 Ай бұрын
I've learned that when I played against a matchbox machine for Hexapawn, once it got good enough in a very specific line of play, I would then throw it off by just playing an unorthodox move and scoring a few more wins before it caught on to my suboptimal moves.
@lukashd4
@lukashd4 7 жыл бұрын
It did not die! It actually figured out that the only winning move is not to play.
@jmercermn
@jmercermn 7 жыл бұрын
Just like the computer in the movie War Games.
@Huntracony
@Huntracony 7 жыл бұрын
jmercermn, Spoilers.
@xelxebar
@xelxebar 7 жыл бұрын
Waaattt?! How did I not know about this channel earlier? Matt Parker, you are my hero.
@alanwattman9775
@alanwattman9775 7 жыл бұрын
A strange game. The only winning move is not to play. How about a nice game of chess?
@Bobal27
@Bobal27 5 жыл бұрын
8:44 The 10 move there sets up a common two-way win I love to use. You start corner, opponent center, you play opposite corner. Opponent now has two options, side or corner, and if they choose corner, you block and show off the two-way win. If they go side, you block opposite side, they must block corner, you must block opposite corner, they must block side, you take last move for the draw. It is, in my opinion (before watching the rest of this video from the time noted,) the STRONGEST opening move in tic-tac-toe. Corner start can guarantee a two-way win or draw under optimal strategy.
@Bobal27
@Bobal27 5 жыл бұрын
Ok, now that I’ve finished watching, you guys caught that it was learning a corner opening strategy, but you still didn’t note the two-way win setup that it created by that confusing move. I’ve been studying games and strategy almost all of my life (won my first Euchre game at 3,) and one of my proudest moments was cracking tic-tac-toe (even if I wasn’t the first to do so) with my two-way corner strategy. I find it much easier to pull off against even the most skilled players than any of the center starting win possibilities, but to delve into why, I’d have to get into the realm of human psychology and game theory, only one of which is a topic you generally cover. On a side note, I’m really excited that the first day version did keep some corner starts, for likely that very gambit.
@benbradley_123
@benbradley_123 7 жыл бұрын
Could you use MENACE to solve Rubik's Cubes?
@benbradley_123
@benbradley_123 7 жыл бұрын
Elan Cook but you can try to train the machine to solve within 50 moves or something similar
@fejfo6559
@fejfo6559 7 жыл бұрын
@Direwolf202 Which algorithm are you using?
@benbradley_123
@benbradley_123 7 жыл бұрын
fejfo's games it wouldn't use a particular algorithm - it would teach itself a strategy
@fejfo6559
@fejfo6559 7 жыл бұрын
Shouldn't you use 324 (54*6) input nodes, one for-each possible color of each sticker, otherwise you end up with the networking thinking blue is the color between red and green or something like that.
@dustinandrews89019
@dustinandrews89019 7 жыл бұрын
Someone should check my math. I think you would need on the order of 3.46x10^19. Which I think is 34 Quadrillion. The matchboxes would fill 4,500+ Empire State buildings.
@ArcKayNine
@ArcKayNine 7 жыл бұрын
Another great pair of videos Matt :) When I was first getting into ML I wrote a python script to play Nim after watching your video on it. It very quickly learned the optimal strategy.
@atrumluminarium
@atrumluminarium 7 жыл бұрын
Is that the optimal "life support" strategy tho? I mean keeping it on the one bead it tried to get rid of seems to be a counter productive process. What if you were to introduce some randomness and replace it with some other bead (I believe this is known as "mutation" in ML)?
@tiaxanderson9725
@tiaxanderson9725 7 жыл бұрын
Since all the other boxes are 'trained' on the most common bead (a blue one that they decided to keep in) a different colour bead would be counter productive into finding a single winning (read; not losing) strategy. After all this is a physical machine that demonstrates digital machine learning, not an actual program that makes decisions. It's basically an evolutionary based state trainer.
@edskev7696
@edskev7696 7 жыл бұрын
atrumluminarium I would think that the best approach would be to put in one bead of each colour when any box dies. That way it's exploring every avenue of success equally until it gets to the point where it can win often enough to keep itself alive.
@Mike-px6pg
@Mike-px6pg 7 жыл бұрын
despite its death the strategy it held on to the longest was (theoretically) the best of the worst, so keeping that strategy/bead is optimal and reduces the time it would waste learning to cope with other non-optimal approaches
@Pyronaut_
@Pyronaut_ 7 жыл бұрын
I think an better method to be never remove the last bead of a color (at least for the first box). Instead add one bead of each other color. One issue with this way is that it technically has a chance of doing any move, even bad ones, but if one method is succeeding it should eventually greatly outnumber the one bead of the bad strategy.
@andymcl92
@andymcl92 7 жыл бұрын
Since you are essentially creating an extra bead for the probability count, you could say it is indeed counter-productive, but not in the way you meant!
@trapfethen
@trapfethen 6 жыл бұрын
The really interesting settings are when you give Menace and Menace 2 no penalty for losing, menace gets no beads for drawing, and only one for winning. Then set menace two to get two beads for winning, and one bead for a draw. Play those two settings against each other to see some interesting shifts and changing strategies. You can avoid the "local maxima" problem. This would be closer to strictly reinforcement learning.
@ryoumisakiGP
@ryoumisakiGP 7 жыл бұрын
11:32 Confidentialy Mute, don't wan't everyone to know what you should theoretically do if the graph hits the bottom axis.
@therealzilch
@therealzilch 6 жыл бұрын
Wonderful. This is how this stuff should be taught. Fun and informative.
@BenLubar
@BenLubar 4 ай бұрын
This is a pretty good example of why you design a machine learning algorithm to sometimes make a decision it thinks is wrong while you're training it.
@lofisz
@lofisz 7 жыл бұрын
"Young children are basically just random number generators." That should go on a t-shirt :))
@itchykami
@itchykami 7 жыл бұрын
How many boxes do you need for Menace to play Spin the Bottle?
@AndrewWilsonStooshie
@AndrewWilsonStooshie 5 жыл бұрын
I think a better strategy may to be to remove counters until there's only one left and always keep one, for all boxes. It might take longer to learn but I think it would learn a better strategy.
@MCNarret
@MCNarret 6 жыл бұрын
How I learned it, drawing isn't a win, it's everyone loses. I would recommend the second strategy but have a rule to never take out the last of any color bead... Meaning it may always have a chance to make a "mistake" but will have to learn to overcome it.
@mistycremo9301
@mistycremo9301 7 жыл бұрын
The ten in a corner you discussed around minute 8 is actually a pretty ok move, because it gives two different options for winning moves if you play it right.
@ShinySwalot
@ShinySwalot 7 жыл бұрын
The Revenge of the Phantom Menace: 2 Star Wars films in an 18 minutes long KZbin video
@cosmicjenny4508
@cosmicjenny4508 6 жыл бұрын
+Shiny Swalot Just had to be the two worst Star Wars movies in one go, didn't it?
@vwoxy1
@vwoxy1 7 жыл бұрын
I wonder what set of results you would get if an untrained first-player MENACE played an untrained second-player MENACE
@andrewmccarthy4270
@andrewmccarthy4270 7 жыл бұрын
vwoxy1 So long as they understand when the game is over and who won, they learn eventually. I wrote a quick python script to test it: github.com/andrewmccarthy/menace
@billyrobertson3170
@billyrobertson3170 7 жыл бұрын
Is it weird that before watching this second video, I too looked up how many states you would need to do Connect 4?
@nienke7713
@nienke7713 7 жыл бұрын
I think the conclusion about the box with the Menace corner followed by a human centre emptying, and it not mattering because the starting box was dominant on a centre move, is wrong. I think it's highly likely that the reason why the corner starting move died out was because it would resign rather than try as soon as the human would follow menace's corner by a centre (which is a highly likely move for the human to make).
@karlkastor
@karlkastor 7 жыл бұрын
Starting at 12:08 there are some moving thick lines on the wall. Probably because of shutter speed and strobe of the light or something.
@jamesrobinson6330
@jamesrobinson6330 7 жыл бұрын
Matt is a proper nerd, he blocks his webcam!
@wierdalien1
@wierdalien1 7 жыл бұрын
James Robinson thats just senisble
@kilikx1x
@kilikx1x 7 жыл бұрын
Maybe I just have bad hand writing but whoever wrote all those O's should get like a 'good job' sticker or something because those are really good circles considering how many there were.
@KristianMischke
@KristianMischke 7 жыл бұрын
What if instead of taking the beads out when it dies, you added one of each of the non-losing colors... That way, over time the probability of the losing move goes down, but never to 0. Would probably need to compensate with more beads to the winning moves... And then that may be an impractical amount of beads
@pepn
@pepn 6 жыл бұрын
Seems a really interesting experience as an introduction to machine learning !
@tth-2507
@tth-2507 5 жыл бұрын
Conect four is solved. It is rigously proven, that (provided that both player play optimal) the first player wins if - and only if - he goes in the middle lane on the first move, it is drawn exactly when the first goes into a column next to the middle on and all other moves result in a win for the second player. (Solved by Victor Allis 1988 and James D. Allen 1990 independently)
@turun_ambartanen
@turun_ambartanen 7 жыл бұрын
5:40 it obviously lost all the beads for the corner move in the box before, because whenever it chose the corner it resigned ( =lost)
@manfredpseudowengorz
@manfredpseudowengorz 7 жыл бұрын
7:12 next to the scheme with a 7 in bottom right, there is a scheme with a 9 in middle right - the same situation: block the opponent and enable a winning move.
@Nosirrbro
@Nosirrbro 7 жыл бұрын
I want to see two different menaces fight against eachother.
@honorarymancunian7433
@honorarymancunian7433 7 жыл бұрын
The one where it has 10 in the corner... that's the correct move! It's not just a case of 'it must draw a lot because it confuses the humans'... although the humans probably do get confused and make a fatal mistake. Unless the human follows that move by going in one of the two remaining corners, MENACE can then go in one of the two remaining corners and force a win!
@rikwisselink-bijker
@rikwisselink-bijker 7 жыл бұрын
Wouldn't a better life support system be to reset the first box to the primary state? Now you are potentially introducing a bias to the first move. The whole point of the first move is that it runs on probability, so the first box should have infinite beads. (if you decide to make resigning an option at all)
@leingthelennart
@leingthelennart 7 жыл бұрын
You were talking about a this move where you "confuse" the human, and that it wins/draws because of that. Actually though, that strategy is kind of nice. When you open middle and you opponent responds in a corner, you can now play in the opposite corner. If your oppenent now does'nt respond with playing one of the corners, that are left, you have already won. You can either block him, and have a two-way to win, or just place in the corner opposite to what your opponent just played. Again, a situation your opponent can't get out of. I kept beating my little cousin for quite a while with this. ^^
@aDifferentJT
@aDifferentJT 6 жыл бұрын
Don’t you need to block in that situation, otherwise they just win
@ardinhelme687
@ardinhelme687 7 жыл бұрын
Now I wanna see them build the second move version and make Menace vs Menace
@beliasphyre3497
@beliasphyre3497 7 жыл бұрын
I'd be interested in MENACE A vs MENACE B, where anytime the last bead would be taken out of a box, it is left in.
@TrimutiusToo
@TrimutiusToo 7 жыл бұрын
Corner strategy is superior as it confuses people more... Corner, center, opposite corner and then people sometimes go into corner because of symmetry which is a losing move actually...
@rosebuster
@rosebuster 7 жыл бұрын
7:30 - This move in the corner is not stupid at all! It's not just that it confuses people. It's really the best move in that position. All other moves would force a draw. But now if the human doesn't play in the corner again, he's in a losing position. The machine will then always have a move that creates a double threat. This is actually a winning strategy if you aren't satisfied with only drawing.
@sagov9
@sagov9 7 жыл бұрын
true, but you've got it the wrong way around from this position: x - - - o - - - x if the human plays in the corner he loses: x - o - o - x - x x wins as he has two threaths the human has to play at the edge: x - - - o - - o x this way x has to respond to the threath of o and cannot create the double threath
@rosebuster
@rosebuster 7 жыл бұрын
No, that's a different case. You're right too, but you're talking about a different situation: that's if the machine starts in the corner. If it starts in the center instead, then the human goes in a corner, then the best next move for the machine is still the opposite corner: x - - - o - - - o Now if the human plays on any of the 4 edge positions, he loses. For example: x - - x o - - - o Machine then blocks and double-attacks. x - - x o - o - o So you see there's a good reason for that "suspicious" move that puzzled these guys. It really is, surprisingly, the best move. Somehow the strategies with starting in a corner are better known than strategies following starting in the center, but there's also some trickery in those. By the way if the machine goes in the center and the human goes on one of the edges right away on second move, he also loses. But examine cases for that on your own if you're curious.
@sagov9
@sagov9 7 жыл бұрын
you're right, I misunderstood the situation
@thecoolcatfish8219
@thecoolcatfish8219 7 жыл бұрын
At 7:25 the move MENACE makes with 10 beads in the corner may seem strange, but is in fact the optimal move in my opinion. It actually surprises me that MENACE favoured the other space with 48 beads. Here's why: Tic-tac-toe is really just reacting to the opponents move, so they cannot get three in a row (potentially from the second move). When not reacting to a move the players are building towards the goal of the game, and many people might do that by adding a cross next to the one placed in the previous round, like so: - - o - o - x x - This game is easily won every time by adding a circle in the bottom right corner. All other placements of the second circle will lead to a draw if the opponent simply reacts to the first player's moves.
@chrisjohnsonvan
@chrisjohnsonvan 6 жыл бұрын
Yup you're totally right Catfish, the only thing the other player can do is play in a corner again to force a draw.
@toddkes5890
@toddkes5890 3 жыл бұрын
I wonder if this Menace type setup could be made for a Steam game? Each morning the number of beads in each box is reset, with varying setups according to how you (Matt) choose. After each game, the results are sent to a central server (cloud-based?), and an updated set of totals is downloaded. In the evening the final chart and results are available for viewing/downloading
@russellhanes9238
@russellhanes9238 5 жыл бұрын
Why have it run out of beads? Why not halve the number of losing color beads and double the number of winning color beads? MENACE would still "learn", just more slowly, but would never give up entirely on any strategy. The difficulty with the current setup is that its initial random decisions in the endgame, before MENACE has learned anything, can teach it wrong yet binding lessons about opening moves. MENACE is like the cat that jumps on a hot stove that never jumps on a cold stove again. It's overgeneralized from random mistakes because it "learns" its lessons too fast. Slow MENACE's learning down in opening moves until it has worked its way through the ends of the decision tree fully, and only then can it learn something about the beginning of the decision tree (i.e., nothing good happens down this major branch). Overall, super duper cool experiment. Thank you for sharing this!!
@fluffly3606
@fluffly3606 4 жыл бұрын
A stack of matchboxes falls for the classic human fallacy of "if it ain't broke, don't fix it."
@fullmetalpotato1258
@fullmetalpotato1258 6 жыл бұрын
Have menace 1 play against menace 2. see how they both learn to try to beat the other. Just think of two machine learning systems simultaneously learning how to defeat the other. Likely the one going first will win, but if you try this for other games it may be a very interesting experiment.
@ythaenagor
@ythaenagor 5 жыл бұрын
based on the views, only about 1/3.5 or so of people who watched the first video were interested enough to watch the second. shoutout to the true fans who stayed for both
@RomanQrr
@RomanQrr 7 жыл бұрын
Wait. You have a second channel?!
@desia.brimou
@desia.brimou 7 жыл бұрын
and memes.
@wa11ywa1rus3
@wa11ywa1rus3 7 жыл бұрын
Now what I want is a domino computer that calculates each optimal move.
@robertofontiglia4148
@robertofontiglia4148 7 жыл бұрын
This is an art installation. You're not pretentious enough to call it that, but that's what that is.
@arikwolf3777
@arikwolf3777 6 жыл бұрын
I played the online MENACE about 50-60 times. MENACE won 1 and Draw about 10-15 times, before it ran out of open moves and started resigning on the opening move. It learn that the way not to lose is to not even play.
@vann595
@vann595 7 жыл бұрын
How have i not known that this chanel exists until now??
@shimassi9961
@shimassi9961 7 жыл бұрын
should've changed the scoring to win: 1 draw: 0 loss: -1. see if it can develop a winning strategy XD
@Ethrel1024
@Ethrel1024 7 жыл бұрын
Problem with this is that draw is optimal play -- the very best it can accomplish is a draw, assuming both play the best possible move given the current board state. This implies that the general trend with your scoring would be towards a resign in every gameboard state. It would just learn that the best play it can make, given the scoring system, is to not play.
@6iaZkMagW7EFs
@6iaZkMagW7EFs 7 жыл бұрын
Ethrel1024 win: 2 Draw: 1 Lose: -1 would solve the not playing problem. It would also encourage it to prefer wins, which would make it win when it can, instead of liking thing just as much even when it can win.
@shimassi9961
@shimassi9961 7 жыл бұрын
Thue, that's just a slight variation of what they used XD and I dont think it'd just give up every time, you're still training it to win. You may have a problem with it taking longer to learn and having to fill the first box with a lot of beads but otherwise it should work as you're playing humans (who make errors) so it would keep playing to the last. (correct me if i'm wrong though :D)
@chinareds54
@chinareds54 7 жыл бұрын
Ethrel1024: I'm not sure I agree with this. It's true that draw is the result of both players playing optimally, but MENACE isn't being both players. For MENACE, winning should be considered better than drawing. For example consider the following game state: X _ O _ X X _ O O Playing middle left wins the game for X, but playing bottom left for the block guarantees no worse than a draw. MENACE would not learn to play middle left if it considered "draw" to be optimal play.
@seancolman9091
@seancolman9091 4 жыл бұрын
Ethrel play as a human until menace learns to not lose. Then switch to menace2 and after menace plays centre then menace2 resigns every game.
@Moley1Moleo
@Moley1Moleo 7 жыл бұрын
I think the 'life support' should be to add in one of every bead.
@creeplyjohnson6195
@creeplyjohnson6195 7 жыл бұрын
Set up two of them and have Menace vs Menace!
@creeplyjohnson6195
@creeplyjohnson6195 7 жыл бұрын
Oliver Werndorf that's beyond amazing.
@BeingTheHunt
@BeingTheHunt 7 жыл бұрын
I'm wondering what strategy, if any, could be taken by the human player to sabotage the learning?
@Shadow81989
@Shadow81989 7 жыл бұрын
Play perfect when it starts middle or corner, and ALWAYS let it win when it starts on the edge, until it has only edges left in the first box. If it starts on the edge you can always win, so it will keep loosing from that point on (as long as the human doesn't make a mistake).
@BeingTheHunt
@BeingTheHunt 7 жыл бұрын
Maybe we will be able to implement a similar strategy when the robot uprising occurs.
@Shadow81989
@Shadow81989 7 жыл бұрын
It should always be possible to throw the good choices out at every single point of the game, with enough patience. But I would say the earlier in the game you start, the better do your chances get, to actually make it loose every game against a decent player, as soon as it's "well trained". The main problem with this is that it will eventually die, unless it's put on "life support" as shown...
@excelmaster2496
@excelmaster2496 2 жыл бұрын
When an almost empty matchbox on life support is better at tic tac toe than you
@mestopheles64
@mestopheles64 7 жыл бұрын
8:15 yep thats a great use for young children's minds!
@achu11th
@achu11th 7 жыл бұрын
Parker notifications (Day 2: HD 1080p).
@achu11th
@achu11th 7 жыл бұрын
Matt Parker Jesus. Matt Parker himself has responded to a Parker Joke. I never thought that this day would come. Thanks for the comment, I didn’t expect it.
@achu11th
@achu11th 7 жыл бұрын
Matt Parker btw I hope you don’t mind the parker square jokes. Also by chance let youtube know that they made this mistake (not updating the change of the title). I am still blown away that you responded to me and although it was a parker square as well
@achu11th
@achu11th 7 жыл бұрын
Matt Parker thank you very much for making videos about mathematics in an interesting way. Thank you for bearing with me and many others like me for over a year now. If you think that we can help you in any way (I can’t be a patreon sadly) like for a sample size, please let us/me know and I will gladly give it a go. Could you make a video about american and british trillions ? Basically how we call numbers in general and if there is a language, which does it the best mathematical way. If there is none make up an optimal Parker Numbering System. I will let you know, if I have any better ideas for future videos.
@thesuomi8550
@thesuomi8550 7 жыл бұрын
Why not put those 2 menaces fighting each other?
@thesuomi8550
@thesuomi8550 7 жыл бұрын
TheJJ100100 so what? They have the system for that too
@groszak1
@groszak1 7 жыл бұрын
they only made a set of matchboxes for being player 1
@Maric18
@Maric18 7 жыл бұрын
yes but you could make one for playing second
@efhiii
@efhiii 7 жыл бұрын
It's in Javascript, not Python, but here's the Connect 4 AI you asked for. efhiii.github.io/games/connect-4/ It uses minimax up to a certain level deep, determined by the AI level, and from there uses some basic huristics.
@mitigatedrisk4264
@mitigatedrisk4264 6 жыл бұрын
You should do an adversarial scenario, where two MENACEs learn by playing each other.
@Jackcabbit
@Jackcabbit 7 жыл бұрын
Idea: In Tic-Tac-Toe there are 8 positions on the outside of the perimeter. What rules would have to be worked out such that instead of 8 positions, it can be any even number 8 or above? 4 positions on the outside is a degenerate case where getting 3 in a row would only be possible for Player 1 and only if Player 2 played on the other two "edges." Bringing the game down to only needing to get 2 in a row then means whichever player goes in the middle first must always win, so the rule change to account for fewer positions means it's not much of a game. 6 outside positions but with a rule allowing any 3... For want of a better term "contiguous player marks" to win the game has a little more going for it and would be a playable game though it seems to more quickly go down to "you must control the center" than the 8+center regular Tic-Tac-Toe. What if Tic-Tac-Toe had a rule allowing any 3 contiguous marks and not requiring they be in a row? What if we have a game with 10 outside positions? Or maybe 12? The easiest way to account for adding 2 is simply think of the outside positions as the number of edges and vertexes of a regular n-sided polygon. Every side incremented adds both 1 edge and 1 corner/vertex, thus there is always the ability to get 3 in a row across. But it seems adding more outside positions around the center just makes the center even more important to control.
@HTrance
@HTrance 5 жыл бұрын
Cool to see how you could apply it on evolution. You can think of changing the beginning state as a change in genes or environment and then find out how fast it would die or prosper compared to other beginning states...
@dantebroggi3734
@dantebroggi3734 7 жыл бұрын
What I would kind of like to see is a system where: loss/give up: remove *only the last* chosen piece draw: do nothing win: duplicate each piece used in the win (in the respective boxes)
@AzraelCcs
@AzraelCcs 7 жыл бұрын
"Young children are basically random number generators" Brilliant
@christiandinkel8481
@christiandinkel8481 7 жыл бұрын
There is no way to justify short form billions. Have my disgusted 'like'.
@carlgomberg
@carlgomberg 3 жыл бұрын
What would happen if you played MENACE against MENACE?
@LeeSmith-cf1vo
@LeeSmith-cf1vo 7 жыл бұрын
It would have been interesting to see a flipped win condition - i.e. _don't_ get 3 in a row. Although I can see a challenge with getting kids to teach it that one.
@danielirina8246
@danielirina8246 5 жыл бұрын
I find the second day graph shows much better learning.
@andrewxc1335
@andrewxc1335 7 жыл бұрын
11:00 - MENACE apparently learned as much as Joshua. "How about a nice game of Chess?"
@callumhurdwell3134
@callumhurdwell3134 6 жыл бұрын
i am curious about two possibilities. first what about if menace wasnt rewarded for drawing and secondly what about if you had menace a vs menace b
@spudd86
@spudd86 7 жыл бұрын
Connect 4 is solved, and there is an optimal strategy. Apparently the optimal strategy is very complicated and involves a large amount of data.
@patrickcorby1423
@patrickcorby1423 6 жыл бұрын
Online, Menace vs Menace2, it's 298 for 149, 303 draws, highest number is over 600.
@linamishima
@linamishima 7 жыл бұрын
Menace doesn't die, it just learns that the only way to win is not to play :D
@kyokoyumi
@kyokoyumi 7 жыл бұрын
What if it played Sudoku? Would that even be possible considering it's not a two-player game? (Everything that was mentioned was specifically two players only; chess, go, connect four...) As well, does this mean it could play other two-player games like mancala, backgammon or othello?
@willsparrow7870
@willsparrow7870 7 жыл бұрын
I#'m guessing that last year's Science Festival video isn't turning up then?
@frankharr9466
@frankharr9466 7 жыл бұрын
That is so awesome. Although, if I were to keep it on life support, I'd be inclined to just put in one bead for each possible choice, including the bead that exists. But I suppose that would be just a bit nuts and shows that I'm a humanities guy.
@MPSpecial
@MPSpecial 6 жыл бұрын
How many matchboxes would it take for a standard game of Hexxagon?
@garethbaus5471
@garethbaus5471 5 жыл бұрын
I played online menice 1 for a while, and I beat it twice as often as a draw occurs, it is definitely better than being random, but could do better.
@spellcrafter23
@spellcrafter23 7 жыл бұрын
This is pretty interesting. Loved the online "simulator" I had 1 menace win, 3 draws, and 34 human wins before menace decided the best opening move was to resign. (I broke it.). I'm just wondering when there's going to be a matchbox version of "Global Thermonuclear War".
@karlkastor
@karlkastor 7 жыл бұрын
Connect 4 IS solved. The best starting move is the middle. I wrote a python Connect4 bot years ago, but it was very slow because it did a depth first search (5 moves deep took a minute)
@yoyamon6811
@yoyamon6811 7 жыл бұрын
Have you tried teaching menace the optimal way to lose a game? That would be interesting and fun.
@alexandrezani
@alexandrezani 3 жыл бұрын
I want to see a version when you don't have to keep track of the previous moves. Instead, when you lose, replace the last move's bead with a black bead. Whenever you draw a black bead, replace the bead you just played with that black bead. You'd basically end up doing a version of Q-learning.
@chluff
@chluff 7 жыл бұрын
Why is there so much footage of them all sorting through papers
@timothyjsduff
@timothyjsduff 7 жыл бұрын
You monster! Torturing those poor, helpless sentient matchboxes. For shame.
@gradybeckett1777
@gradybeckett1777 6 жыл бұрын
So could you use the gradient of a best fit line for the winning/drawing portion to give a number to how good MENACE works?
How many different Youtube videos are possible?
16:36
Stand-up Maths
Рет қаралды 434 М.
Calculating π by hand: the Chudnovsky algorithm
16:55
Stand-up Maths
Рет қаралды 492 М.
SIZE DOESN’T MATTER @benjaminjiujitsu
00:46
Natan por Aí
Рет қаралды 7 МЛН
Players push long pins through a cardboard box attempting to pop the balloon!
00:31
My response to being reverse-Dereked
8:36
Matt_Parker_2
Рет қаралды 616 М.
The tech behind my 29.97 fps video
8:33
Matt_Parker_2
Рет қаралды 154 М.
These 27 tickets guarantee a win on the Lottery
15:32
Matt_Parker_2
Рет қаралды 325 М.
The Game That Learns
10:42
Vsauce2
Рет қаралды 2,4 МЛН
MENACE: the pile of matchboxes which can learn
13:59
Stand-up Maths
Рет қаралды 437 М.
Vegas Sphere UPDATE: we fixed it!
9:05
Matt_Parker_2
Рет қаралды 267 М.
How thick is a three-sided coin?
13:04
Stand-up Maths
Рет қаралды 977 М.
How commas broke the Pokémon calculator but fixed an exchange rate.
22:29
The 10,000 Domino Computer
22:27
Stand-up Maths
Рет қаралды 1,3 МЛН
The Great Collatz Collab is complete!
15:39
Matt_Parker_2
Рет қаралды 34 М.
SIZE DOESN’T MATTER @benjaminjiujitsu
00:46
Natan por Aí
Рет қаралды 7 МЛН