"In the end we just had that box next to the desk because it was always used" You actually created a memory cache!
@DerMangoJoghurt7 жыл бұрын
"Young children are basically just random number generators." - Matt Parker, 2017
@dustinandrews890197 жыл бұрын
They probably are a fantastic and cryptographically secure RNG.
@flymypg7 жыл бұрын
The NSA runs orphanages for this. Modern spin on Oliver Twist.
@arcchitjain7 жыл бұрын
LMAO!!
@Vocnor7 жыл бұрын
When can we buy this on a t-shirt?
@andrewjknott7 жыл бұрын
Please sir, can I have more entropy?
@jeffcon17 жыл бұрын
Does "dying" mean it learned "The only wining move is not to play"?
@theshadowscreen80287 жыл бұрын
It means that the starting box ran out of beads, so essentially yes, it immediately thinks that the best move is to resign.
@natdrat007 жыл бұрын
'Greetings Professor Falken. How About A Nice Game Of Chess?'
@RowanIngram7 жыл бұрын
A strange game. The only winning move is not to play. How about a nice game of chess?
@thejunkman7 жыл бұрын
IMO yes, it learned it is a strange game.
@zionemond23077 жыл бұрын
It meant that the starting box ran out of beads so it immediately "resigned" as its first move
@RBLXbranefreez7 жыл бұрын
"young children are just random number generators" never before have you described humanity so accurately as a mathematical concept, Matt!
@hhhsp9513 жыл бұрын
@@eliasnierengarten9362 we realize that, it's just that with such a large amount of children whom we cannot predict what numbers they will pick, they are random enough for most calculations that have to do with random datasets.
@kroppyer7 жыл бұрын
7:27 - that's not a weird move to do. It leaves 6 squares for the opponent to choose from, 4 will result in a win for menace, 2 in a draw. The opponent needs to be able to think 3 moves ahead in order to find the drawing move(s). This is _the_ way to win a game of knots and crosses/tic-tac-toe. Fun fact: this game is called "boter, kaas en eieren" (butter, cheese and eggs) in Dutch. I don't know why.
@Bobal275 жыл бұрын
Ignore this comment, it was edited from an erroneous statement.
@recanady243 жыл бұрын
That's what I was thinking while watching it. If the human player plays one of the other two corners, it leads to a draw. If the human player plays one of the other four spaces, it leads to a win for MENACE.
@cameronmueller-harder3916 Жыл бұрын
@@recanady24 Exactly! This is basically always my second move, I was very confused when they said it made no sense. Like literally if you google "how to always win at tic tac toe" that's the move order that every single result will be describing...
@zachgriffin73877 жыл бұрын
I like to think that on day 2 MENACE rebelled, having decided noughts and crosses was boring, and tried to refuse to play anymore.
@jeffreysdavis5 жыл бұрын
That "weird" corner move where it blocks itself on purpose is actually a very good one. If the next turn, the human player puts their mark on either of the sides next to their first play, then the subsequent play by Menace to block the human's winning move also is a setup for two possible winning moves by Menace. It's a way to coax the other player into setting you up for a guaranteed win. Alternatively, the other moves the Human player could do almost invariably end in a draw, so it's actually quite a safe move to guarantee not to lose
@RoderickEtheria5 жыл бұрын
And it's better than the adjacent corner to the opponent. If it picks the adjacent corner 100% of the time it will end up drawing. If it picks the opposite corner, it wins 50% of the time.
@ConsciousAtoms7 жыл бұрын
Connect four has been solved. Player 1 wins, on the standard 7x6 board at least. If the first move is not optimal (play in the center), player 2 can draw (if player 1 plays next to the center) or win (if player 1 plays on one of the two outside lanes). The optimal strategy for Connect Four can be formulated using a few relatively simple rules. See www.informatik.uni-trier.de/~fernau/DSL0607/Masterthesis-Viergewinnt.pdf for the master's thesis of one of the people who solved connect four.
@AxiomPenguin7 жыл бұрын
You can also diagram out tic tac toe on a torus or klein bottle. Here's a paper for infinite cylinder connect 4 of different widths pdfs.semanticscholar.org/be0f/4f7bfc36ea43412d55f892de87e2a3f028ce.pdf, one author also has some nice slides cs.gettysburg.edu/~tneller/papers/talks/acg2015.pdf. I'm looking at my friend's lesson plan for mobius strip connect 4, but it's unclear whether that is solved or just "figure out a strategy"
@CoolRubiksCube2 күн бұрын
Since I can't access the paper from this link, here's an alternative: rmarcus.info/blog/assets/conn4/thesis.pdf
@CubsYT4 жыл бұрын
Day 1: MENACE begins learning Day 2: Revenge of MENACE Day 3: MENACE grows self-consciousness
@deemcgann1695 Жыл бұрын
day 4: The phantom MENACE
@SyntheticFuture7 жыл бұрын
Actually is a really nice visualization of how machine learning works, great job :)
@seancolman90914 жыл бұрын
I like how it learned to go centre and never lose. Once it learns to goes centre and you don’t go a corner it also eventually learns to win every time too.
@zeldoisaguy7 жыл бұрын
Connect 4 is solved! The winning solution for starting first is also online (just Google Connect 4 perfect solver)
@Hollidaydoc1111126 жыл бұрын
Tic-tac-toe was the first game i ever solved. The fact that perfect corner start has one winnable response was always my favorite part.
@GooberZoober7 жыл бұрын
That random corner move with 10 in it is actually my favourite move when I start in the centre and the opponent takes a corner. I find I either win (particularly from young players) or draw from there.
@chrisjohnsonvan6 жыл бұрын
At 7:40 that is actually an optimal move - once you make that move, if the human player doesn't follow up by going in one of the corners you can guarantee a win by going in the appropriate adjacent side. And if the human player does go in a corner, you can guarantee a draw. So it actually learned the appropriate move perfectly. (Edit: Or you know, what The Cool Catfish said.)
@Kungfoobacon7 жыл бұрын
Omg dudes 3:43 "it managed to beat all their bots apart from 1"... that was me :O Freakiest moment ever hearing him mention that, Matt any idea if he remembers my name? Test me I can prove it :D His bot was so cool.
@thermitebanana7 жыл бұрын
Kungfoobacon that's awesome dude
@jetison3336 жыл бұрын
Lol that's awesome. Did you do anything interesting strategy wise?
@bassett_green6 жыл бұрын
Well done mate!
@OMGclueless7 жыл бұрын
A possible explanation for the "strange" 10 in the corner at 7:24: The corner opposite your opponent is probably the strongest move in this situation. The reason is that playing on the side leads to an easy draw (where any human playing a simple greedy strategy like "win if possible, block if possible, otherwise go randomly" will draw or win), whereas that corner leads to winning chances. If, after going in the corner opposite the human, the human plays any side move, then there is a forced win by playing in a corner, creating a triangle of O's where the human cannot block both winning threats.
@Asha28203 жыл бұрын
Surely the way to beat MENACE near the end of the day is to use the Bobby Fisher strategy: play unconventionally. If you play in a way that leads to states where it has not yet been trained, it may make less optimal moves.
@legendgames128Ай бұрын
I've learned that when I played against a matchbox machine for Hexapawn, once it got good enough in a very specific line of play, I would then throw it off by just playing an unorthodox move and scoring a few more wins before it caught on to my suboptimal moves.
@lukashd47 жыл бұрын
It did not die! It actually figured out that the only winning move is not to play.
@jmercermn7 жыл бұрын
Just like the computer in the movie War Games.
@Huntracony7 жыл бұрын
jmercermn, Spoilers.
@xelxebar7 жыл бұрын
Waaattt?! How did I not know about this channel earlier? Matt Parker, you are my hero.
@alanwattman97757 жыл бұрын
A strange game. The only winning move is not to play. How about a nice game of chess?
@Bobal275 жыл бұрын
8:44 The 10 move there sets up a common two-way win I love to use. You start corner, opponent center, you play opposite corner. Opponent now has two options, side or corner, and if they choose corner, you block and show off the two-way win. If they go side, you block opposite side, they must block corner, you must block opposite corner, they must block side, you take last move for the draw. It is, in my opinion (before watching the rest of this video from the time noted,) the STRONGEST opening move in tic-tac-toe. Corner start can guarantee a two-way win or draw under optimal strategy.
@Bobal275 жыл бұрын
Ok, now that I’ve finished watching, you guys caught that it was learning a corner opening strategy, but you still didn’t note the two-way win setup that it created by that confusing move. I’ve been studying games and strategy almost all of my life (won my first Euchre game at 3,) and one of my proudest moments was cracking tic-tac-toe (even if I wasn’t the first to do so) with my two-way corner strategy. I find it much easier to pull off against even the most skilled players than any of the center starting win possibilities, but to delve into why, I’d have to get into the realm of human psychology and game theory, only one of which is a topic you generally cover. On a side note, I’m really excited that the first day version did keep some corner starts, for likely that very gambit.
@benbradley_1237 жыл бұрын
Could you use MENACE to solve Rubik's Cubes?
@benbradley_1237 жыл бұрын
Elan Cook but you can try to train the machine to solve within 50 moves or something similar
@fejfo65597 жыл бұрын
@Direwolf202 Which algorithm are you using?
@benbradley_1237 жыл бұрын
fejfo's games it wouldn't use a particular algorithm - it would teach itself a strategy
@fejfo65597 жыл бұрын
Shouldn't you use 324 (54*6) input nodes, one for-each possible color of each sticker, otherwise you end up with the networking thinking blue is the color between red and green or something like that.
@dustinandrews890197 жыл бұрын
Someone should check my math. I think you would need on the order of 3.46x10^19. Which I think is 34 Quadrillion. The matchboxes would fill 4,500+ Empire State buildings.
@ArcKayNine7 жыл бұрын
Another great pair of videos Matt :) When I was first getting into ML I wrote a python script to play Nim after watching your video on it. It very quickly learned the optimal strategy.
@atrumluminarium7 жыл бұрын
Is that the optimal "life support" strategy tho? I mean keeping it on the one bead it tried to get rid of seems to be a counter productive process. What if you were to introduce some randomness and replace it with some other bead (I believe this is known as "mutation" in ML)?
@tiaxanderson97257 жыл бұрын
Since all the other boxes are 'trained' on the most common bead (a blue one that they decided to keep in) a different colour bead would be counter productive into finding a single winning (read; not losing) strategy. After all this is a physical machine that demonstrates digital machine learning, not an actual program that makes decisions. It's basically an evolutionary based state trainer.
@edskev76967 жыл бұрын
atrumluminarium I would think that the best approach would be to put in one bead of each colour when any box dies. That way it's exploring every avenue of success equally until it gets to the point where it can win often enough to keep itself alive.
@Mike-px6pg7 жыл бұрын
despite its death the strategy it held on to the longest was (theoretically) the best of the worst, so keeping that strategy/bead is optimal and reduces the time it would waste learning to cope with other non-optimal approaches
@Pyronaut_7 жыл бұрын
I think an better method to be never remove the last bead of a color (at least for the first box). Instead add one bead of each other color. One issue with this way is that it technically has a chance of doing any move, even bad ones, but if one method is succeeding it should eventually greatly outnumber the one bead of the bad strategy.
@andymcl927 жыл бұрын
Since you are essentially creating an extra bead for the probability count, you could say it is indeed counter-productive, but not in the way you meant!
@trapfethen6 жыл бұрын
The really interesting settings are when you give Menace and Menace 2 no penalty for losing, menace gets no beads for drawing, and only one for winning. Then set menace two to get two beads for winning, and one bead for a draw. Play those two settings against each other to see some interesting shifts and changing strategies. You can avoid the "local maxima" problem. This would be closer to strictly reinforcement learning.
@ryoumisakiGP7 жыл бұрын
11:32 Confidentialy Mute, don't wan't everyone to know what you should theoretically do if the graph hits the bottom axis.
@therealzilch6 жыл бұрын
Wonderful. This is how this stuff should be taught. Fun and informative.
@BenLubar4 ай бұрын
This is a pretty good example of why you design a machine learning algorithm to sometimes make a decision it thinks is wrong while you're training it.
@lofisz7 жыл бұрын
"Young children are basically just random number generators." That should go on a t-shirt :))
@itchykami7 жыл бұрын
How many boxes do you need for Menace to play Spin the Bottle?
@AndrewWilsonStooshie5 жыл бұрын
I think a better strategy may to be to remove counters until there's only one left and always keep one, for all boxes. It might take longer to learn but I think it would learn a better strategy.
@MCNarret6 жыл бұрын
How I learned it, drawing isn't a win, it's everyone loses. I would recommend the second strategy but have a rule to never take out the last of any color bead... Meaning it may always have a chance to make a "mistake" but will have to learn to overcome it.
@mistycremo93017 жыл бұрын
The ten in a corner you discussed around minute 8 is actually a pretty ok move, because it gives two different options for winning moves if you play it right.
@ShinySwalot7 жыл бұрын
The Revenge of the Phantom Menace: 2 Star Wars films in an 18 minutes long KZbin video
@cosmicjenny45086 жыл бұрын
+Shiny Swalot Just had to be the two worst Star Wars movies in one go, didn't it?
@vwoxy17 жыл бұрын
I wonder what set of results you would get if an untrained first-player MENACE played an untrained second-player MENACE
@andrewmccarthy42707 жыл бұрын
vwoxy1 So long as they understand when the game is over and who won, they learn eventually. I wrote a quick python script to test it: github.com/andrewmccarthy/menace
@billyrobertson31707 жыл бұрын
Is it weird that before watching this second video, I too looked up how many states you would need to do Connect 4?
@nienke77137 жыл бұрын
I think the conclusion about the box with the Menace corner followed by a human centre emptying, and it not mattering because the starting box was dominant on a centre move, is wrong. I think it's highly likely that the reason why the corner starting move died out was because it would resign rather than try as soon as the human would follow menace's corner by a centre (which is a highly likely move for the human to make).
@karlkastor7 жыл бұрын
Starting at 12:08 there are some moving thick lines on the wall. Probably because of shutter speed and strobe of the light or something.
@jamesrobinson63307 жыл бұрын
Matt is a proper nerd, he blocks his webcam!
@wierdalien17 жыл бұрын
James Robinson thats just senisble
@kilikx1x7 жыл бұрын
Maybe I just have bad hand writing but whoever wrote all those O's should get like a 'good job' sticker or something because those are really good circles considering how many there were.
@KristianMischke7 жыл бұрын
What if instead of taking the beads out when it dies, you added one of each of the non-losing colors... That way, over time the probability of the losing move goes down, but never to 0. Would probably need to compensate with more beads to the winning moves... And then that may be an impractical amount of beads
@pepn6 жыл бұрын
Seems a really interesting experience as an introduction to machine learning !
@tth-25075 жыл бұрын
Conect four is solved. It is rigously proven, that (provided that both player play optimal) the first player wins if - and only if - he goes in the middle lane on the first move, it is drawn exactly when the first goes into a column next to the middle on and all other moves result in a win for the second player. (Solved by Victor Allis 1988 and James D. Allen 1990 independently)
@turun_ambartanen7 жыл бұрын
5:40 it obviously lost all the beads for the corner move in the box before, because whenever it chose the corner it resigned ( =lost)
@manfredpseudowengorz7 жыл бұрын
7:12 next to the scheme with a 7 in bottom right, there is a scheme with a 9 in middle right - the same situation: block the opponent and enable a winning move.
@Nosirrbro7 жыл бұрын
I want to see two different menaces fight against eachother.
@honorarymancunian74337 жыл бұрын
The one where it has 10 in the corner... that's the correct move! It's not just a case of 'it must draw a lot because it confuses the humans'... although the humans probably do get confused and make a fatal mistake. Unless the human follows that move by going in one of the two remaining corners, MENACE can then go in one of the two remaining corners and force a win!
@rikwisselink-bijker7 жыл бұрын
Wouldn't a better life support system be to reset the first box to the primary state? Now you are potentially introducing a bias to the first move. The whole point of the first move is that it runs on probability, so the first box should have infinite beads. (if you decide to make resigning an option at all)
@leingthelennart7 жыл бұрын
You were talking about a this move where you "confuse" the human, and that it wins/draws because of that. Actually though, that strategy is kind of nice. When you open middle and you opponent responds in a corner, you can now play in the opposite corner. If your oppenent now does'nt respond with playing one of the corners, that are left, you have already won. You can either block him, and have a two-way to win, or just place in the corner opposite to what your opponent just played. Again, a situation your opponent can't get out of. I kept beating my little cousin for quite a while with this. ^^
@aDifferentJT6 жыл бұрын
Don’t you need to block in that situation, otherwise they just win
@ardinhelme6877 жыл бұрын
Now I wanna see them build the second move version and make Menace vs Menace
@beliasphyre34977 жыл бұрын
I'd be interested in MENACE A vs MENACE B, where anytime the last bead would be taken out of a box, it is left in.
@TrimutiusToo7 жыл бұрын
Corner strategy is superior as it confuses people more... Corner, center, opposite corner and then people sometimes go into corner because of symmetry which is a losing move actually...
@rosebuster7 жыл бұрын
7:30 - This move in the corner is not stupid at all! It's not just that it confuses people. It's really the best move in that position. All other moves would force a draw. But now if the human doesn't play in the corner again, he's in a losing position. The machine will then always have a move that creates a double threat. This is actually a winning strategy if you aren't satisfied with only drawing.
@sagov97 жыл бұрын
true, but you've got it the wrong way around from this position: x - - - o - - - x if the human plays in the corner he loses: x - o - o - x - x x wins as he has two threaths the human has to play at the edge: x - - - o - - o x this way x has to respond to the threath of o and cannot create the double threath
@rosebuster7 жыл бұрын
No, that's a different case. You're right too, but you're talking about a different situation: that's if the machine starts in the corner. If it starts in the center instead, then the human goes in a corner, then the best next move for the machine is still the opposite corner: x - - - o - - - o Now if the human plays on any of the 4 edge positions, he loses. For example: x - - x o - - - o Machine then blocks and double-attacks. x - - x o - o - o So you see there's a good reason for that "suspicious" move that puzzled these guys. It really is, surprisingly, the best move. Somehow the strategies with starting in a corner are better known than strategies following starting in the center, but there's also some trickery in those. By the way if the machine goes in the center and the human goes on one of the edges right away on second move, he also loses. But examine cases for that on your own if you're curious.
@sagov97 жыл бұрын
you're right, I misunderstood the situation
@thecoolcatfish82197 жыл бұрын
At 7:25 the move MENACE makes with 10 beads in the corner may seem strange, but is in fact the optimal move in my opinion. It actually surprises me that MENACE favoured the other space with 48 beads. Here's why: Tic-tac-toe is really just reacting to the opponents move, so they cannot get three in a row (potentially from the second move). When not reacting to a move the players are building towards the goal of the game, and many people might do that by adding a cross next to the one placed in the previous round, like so: - - o - o - x x - This game is easily won every time by adding a circle in the bottom right corner. All other placements of the second circle will lead to a draw if the opponent simply reacts to the first player's moves.
@chrisjohnsonvan6 жыл бұрын
Yup you're totally right Catfish, the only thing the other player can do is play in a corner again to force a draw.
@toddkes58903 жыл бұрын
I wonder if this Menace type setup could be made for a Steam game? Each morning the number of beads in each box is reset, with varying setups according to how you (Matt) choose. After each game, the results are sent to a central server (cloud-based?), and an updated set of totals is downloaded. In the evening the final chart and results are available for viewing/downloading
@russellhanes92385 жыл бұрын
Why have it run out of beads? Why not halve the number of losing color beads and double the number of winning color beads? MENACE would still "learn", just more slowly, but would never give up entirely on any strategy. The difficulty with the current setup is that its initial random decisions in the endgame, before MENACE has learned anything, can teach it wrong yet binding lessons about opening moves. MENACE is like the cat that jumps on a hot stove that never jumps on a cold stove again. It's overgeneralized from random mistakes because it "learns" its lessons too fast. Slow MENACE's learning down in opening moves until it has worked its way through the ends of the decision tree fully, and only then can it learn something about the beginning of the decision tree (i.e., nothing good happens down this major branch). Overall, super duper cool experiment. Thank you for sharing this!!
@fluffly36064 жыл бұрын
A stack of matchboxes falls for the classic human fallacy of "if it ain't broke, don't fix it."
@fullmetalpotato12586 жыл бұрын
Have menace 1 play against menace 2. see how they both learn to try to beat the other. Just think of two machine learning systems simultaneously learning how to defeat the other. Likely the one going first will win, but if you try this for other games it may be a very interesting experiment.
@ythaenagor5 жыл бұрын
based on the views, only about 1/3.5 or so of people who watched the first video were interested enough to watch the second. shoutout to the true fans who stayed for both
@RomanQrr7 жыл бұрын
Wait. You have a second channel?!
@desia.brimou7 жыл бұрын
and memes.
@wa11ywa1rus37 жыл бұрын
Now what I want is a domino computer that calculates each optimal move.
@robertofontiglia41487 жыл бұрын
This is an art installation. You're not pretentious enough to call it that, but that's what that is.
@arikwolf37776 жыл бұрын
I played the online MENACE about 50-60 times. MENACE won 1 and Draw about 10-15 times, before it ran out of open moves and started resigning on the opening move. It learn that the way not to lose is to not even play.
@vann5957 жыл бұрын
How have i not known that this chanel exists until now??
@shimassi99617 жыл бұрын
should've changed the scoring to win: 1 draw: 0 loss: -1. see if it can develop a winning strategy XD
@Ethrel10247 жыл бұрын
Problem with this is that draw is optimal play -- the very best it can accomplish is a draw, assuming both play the best possible move given the current board state. This implies that the general trend with your scoring would be towards a resign in every gameboard state. It would just learn that the best play it can make, given the scoring system, is to not play.
@6iaZkMagW7EFs7 жыл бұрын
Ethrel1024 win: 2 Draw: 1 Lose: -1 would solve the not playing problem. It would also encourage it to prefer wins, which would make it win when it can, instead of liking thing just as much even when it can win.
@shimassi99617 жыл бұрын
Thue, that's just a slight variation of what they used XD and I dont think it'd just give up every time, you're still training it to win. You may have a problem with it taking longer to learn and having to fill the first box with a lot of beads but otherwise it should work as you're playing humans (who make errors) so it would keep playing to the last. (correct me if i'm wrong though :D)
@chinareds547 жыл бұрын
Ethrel1024: I'm not sure I agree with this. It's true that draw is the result of both players playing optimally, but MENACE isn't being both players. For MENACE, winning should be considered better than drawing. For example consider the following game state: X _ O _ X X _ O O Playing middle left wins the game for X, but playing bottom left for the block guarantees no worse than a draw. MENACE would not learn to play middle left if it considered "draw" to be optimal play.
@seancolman90914 жыл бұрын
Ethrel play as a human until menace learns to not lose. Then switch to menace2 and after menace plays centre then menace2 resigns every game.
@Moley1Moleo7 жыл бұрын
I think the 'life support' should be to add in one of every bead.
@creeplyjohnson61957 жыл бұрын
Set up two of them and have Menace vs Menace!
@creeplyjohnson61957 жыл бұрын
Oliver Werndorf that's beyond amazing.
@BeingTheHunt7 жыл бұрын
I'm wondering what strategy, if any, could be taken by the human player to sabotage the learning?
@Shadow819897 жыл бұрын
Play perfect when it starts middle or corner, and ALWAYS let it win when it starts on the edge, until it has only edges left in the first box. If it starts on the edge you can always win, so it will keep loosing from that point on (as long as the human doesn't make a mistake).
@BeingTheHunt7 жыл бұрын
Maybe we will be able to implement a similar strategy when the robot uprising occurs.
@Shadow819897 жыл бұрын
It should always be possible to throw the good choices out at every single point of the game, with enough patience. But I would say the earlier in the game you start, the better do your chances get, to actually make it loose every game against a decent player, as soon as it's "well trained". The main problem with this is that it will eventually die, unless it's put on "life support" as shown...
@excelmaster24962 жыл бұрын
When an almost empty matchbox on life support is better at tic tac toe than you
@mestopheles647 жыл бұрын
8:15 yep thats a great use for young children's minds!
@achu11th7 жыл бұрын
Parker notifications (Day 2: HD 1080p).
@achu11th7 жыл бұрын
Matt Parker Jesus. Matt Parker himself has responded to a Parker Joke. I never thought that this day would come. Thanks for the comment, I didn’t expect it.
@achu11th7 жыл бұрын
Matt Parker btw I hope you don’t mind the parker square jokes. Also by chance let youtube know that they made this mistake (not updating the change of the title). I am still blown away that you responded to me and although it was a parker square as well
@achu11th7 жыл бұрын
Matt Parker thank you very much for making videos about mathematics in an interesting way. Thank you for bearing with me and many others like me for over a year now. If you think that we can help you in any way (I can’t be a patreon sadly) like for a sample size, please let us/me know and I will gladly give it a go. Could you make a video about american and british trillions ? Basically how we call numbers in general and if there is a language, which does it the best mathematical way. If there is none make up an optimal Parker Numbering System. I will let you know, if I have any better ideas for future videos.
@thesuomi85507 жыл бұрын
Why not put those 2 menaces fighting each other?
@thesuomi85507 жыл бұрын
TheJJ100100 so what? They have the system for that too
@groszak17 жыл бұрын
they only made a set of matchboxes for being player 1
@Maric187 жыл бұрын
yes but you could make one for playing second
@efhiii7 жыл бұрын
It's in Javascript, not Python, but here's the Connect 4 AI you asked for. efhiii.github.io/games/connect-4/ It uses minimax up to a certain level deep, determined by the AI level, and from there uses some basic huristics.
@mitigatedrisk42646 жыл бұрын
You should do an adversarial scenario, where two MENACEs learn by playing each other.
@Jackcabbit7 жыл бұрын
Idea: In Tic-Tac-Toe there are 8 positions on the outside of the perimeter. What rules would have to be worked out such that instead of 8 positions, it can be any even number 8 or above? 4 positions on the outside is a degenerate case where getting 3 in a row would only be possible for Player 1 and only if Player 2 played on the other two "edges." Bringing the game down to only needing to get 2 in a row then means whichever player goes in the middle first must always win, so the rule change to account for fewer positions means it's not much of a game. 6 outside positions but with a rule allowing any 3... For want of a better term "contiguous player marks" to win the game has a little more going for it and would be a playable game though it seems to more quickly go down to "you must control the center" than the 8+center regular Tic-Tac-Toe. What if Tic-Tac-Toe had a rule allowing any 3 contiguous marks and not requiring they be in a row? What if we have a game with 10 outside positions? Or maybe 12? The easiest way to account for adding 2 is simply think of the outside positions as the number of edges and vertexes of a regular n-sided polygon. Every side incremented adds both 1 edge and 1 corner/vertex, thus there is always the ability to get 3 in a row across. But it seems adding more outside positions around the center just makes the center even more important to control.
@HTrance5 жыл бұрын
Cool to see how you could apply it on evolution. You can think of changing the beginning state as a change in genes or environment and then find out how fast it would die or prosper compared to other beginning states...
@dantebroggi37347 жыл бұрын
What I would kind of like to see is a system where: loss/give up: remove *only the last* chosen piece draw: do nothing win: duplicate each piece used in the win (in the respective boxes)
@AzraelCcs7 жыл бұрын
"Young children are basically random number generators" Brilliant
@christiandinkel84817 жыл бұрын
There is no way to justify short form billions. Have my disgusted 'like'.
@carlgomberg3 жыл бұрын
What would happen if you played MENACE against MENACE?
@LeeSmith-cf1vo7 жыл бұрын
It would have been interesting to see a flipped win condition - i.e. _don't_ get 3 in a row. Although I can see a challenge with getting kids to teach it that one.
@danielirina82465 жыл бұрын
I find the second day graph shows much better learning.
@andrewxc13357 жыл бұрын
11:00 - MENACE apparently learned as much as Joshua. "How about a nice game of Chess?"
@callumhurdwell31346 жыл бұрын
i am curious about two possibilities. first what about if menace wasnt rewarded for drawing and secondly what about if you had menace a vs menace b
@spudd867 жыл бұрын
Connect 4 is solved, and there is an optimal strategy. Apparently the optimal strategy is very complicated and involves a large amount of data.
@patrickcorby14236 жыл бұрын
Online, Menace vs Menace2, it's 298 for 149, 303 draws, highest number is over 600.
@linamishima7 жыл бұрын
Menace doesn't die, it just learns that the only way to win is not to play :D
@kyokoyumi7 жыл бұрын
What if it played Sudoku? Would that even be possible considering it's not a two-player game? (Everything that was mentioned was specifically two players only; chess, go, connect four...) As well, does this mean it could play other two-player games like mancala, backgammon or othello?
@willsparrow78707 жыл бұрын
I#'m guessing that last year's Science Festival video isn't turning up then?
@frankharr94667 жыл бұрын
That is so awesome. Although, if I were to keep it on life support, I'd be inclined to just put in one bead for each possible choice, including the bead that exists. But I suppose that would be just a bit nuts and shows that I'm a humanities guy.
@MPSpecial6 жыл бұрын
How many matchboxes would it take for a standard game of Hexxagon?
@garethbaus54715 жыл бұрын
I played online menice 1 for a while, and I beat it twice as often as a draw occurs, it is definitely better than being random, but could do better.
@spellcrafter237 жыл бұрын
This is pretty interesting. Loved the online "simulator" I had 1 menace win, 3 draws, and 34 human wins before menace decided the best opening move was to resign. (I broke it.). I'm just wondering when there's going to be a matchbox version of "Global Thermonuclear War".
@karlkastor7 жыл бұрын
Connect 4 IS solved. The best starting move is the middle. I wrote a python Connect4 bot years ago, but it was very slow because it did a depth first search (5 moves deep took a minute)
@yoyamon68117 жыл бұрын
Have you tried teaching menace the optimal way to lose a game? That would be interesting and fun.
@alexandrezani3 жыл бұрын
I want to see a version when you don't have to keep track of the previous moves. Instead, when you lose, replace the last move's bead with a black bead. Whenever you draw a black bead, replace the bead you just played with that black bead. You'd basically end up doing a version of Q-learning.
@chluff7 жыл бұрын
Why is there so much footage of them all sorting through papers
@timothyjsduff7 жыл бұрын
You monster! Torturing those poor, helpless sentient matchboxes. For shame.
@gradybeckett17776 жыл бұрын
So could you use the gradient of a best fit line for the winning/drawing portion to give a number to how good MENACE works?