This is still by far the best lecture I've seen on the topic so far
@Frost_Byte_Tech2 жыл бұрын
It's because of content like this that I'll never get bored of trying to solve complex problems, really insightful and thought provoking 💫
@mohamedyasser20682 жыл бұрын
attending such a lecture for me is a dream , I can't believe that most of them don't play chess !!
@michaelmassaro43757 ай бұрын
I play chess I’m subscribed to a you tuber he shows many Stockfish game Jozarovs Chess I figured I’d take a look to see how the engines actually work
@philj9594 Жыл бұрын
Just started learning chess and I know only a little about computer science/programming but this was wonderful to gain a better understanding of what chess engines are actually doing under the hood when I use them and also a better understanding of their limitations. I've noticed many people talk about people over-relying on engines so I figured it would be a good use of my time to gain a deeper understanding of what a chess engine even is if I'm going to be using them regularly. Also, it's just interesting and fun to learn! Thanks for the amazing lecture. :)
@Magnulus76 Жыл бұрын
Yes, it's possible to over-rely upon computer chess. Stockfish and Leela are powerful engines but they can have problems in their own understanding (particularly with chess concepts, such as endgame fortresses, something that still baffles engines out there). They also don't always produce data that is particularly relevant to learning chess, in particular Stockfish's "thinking" is very alien and sometimes difficult to learn from.
@michaelmassaro43757 ай бұрын
So engines can be used for studying lines etc or analyzing a game that was played but it seems many players can use them to cheat on line
@LouigiVerona4 жыл бұрын
An incredible lecture, thank you!
@christrifinopoulos8639 Жыл бұрын
about the stockfish evaluation function, is it completely prewritten or are there some (handwritten) parameters that can be optimised through learning? (
@A_Swarm_of_Waspcrabs4 жыл бұрын
Great lecture- that hour whizzed by too fast.
@vladimirtchuiev22183 жыл бұрын
I don't understand why do you need the value function, if you have probabilities over possible moves, you will always during deployment select the argmax of the probability vectors... Is it for victory/defeat flags or something like that? Also, after each iteration of the MCTS, is the network trained until convergence or do you go over the self-played game only once?
@fisheatsyourhead2 жыл бұрын
for timed games is a value function not faster
@vladimirtchuiev22182 жыл бұрын
@@fisheatsyourhead After some month of digging, ye it's faster because you don't have the time usually to go towards the end of the game, and instead you consider the values of the leaf nodes.
@amanbansll Жыл бұрын
I think there is another reason: the task of learning the policy vector alone doesn't teach the model about whether the current position is good or bad, it only learns whats the best thing to do in the situation. While this is enough to play, augmenting the learning process by adding another objective (multi-task learning style, because the model is shared between both objectives, only the head is different) helps the model learn better. Just my thoughts though, feel free to correct me if I'm wrong.
@sunnysunnybay2 жыл бұрын
Without analysing as a chess engine i can see it's actually better for black. Count 9 pieces around the king, both queens are at the 5th rank of the king so they are not included, but black has a rook while no rook is near the white king and the pawn structure has 1 shape out for them too, agains't 3 move on black and good defense around them with both pawn and major pieces. Black has 1 pawn on the 5th rank in font of this weak king also, while it's 1 pawn for H rank & G & E for white.
@kevingallegos94662 жыл бұрын
Please what is the song at the beginning of the video! I've heard it before and now I want to listen to it! Thankyou!
@berndmayer3984 Жыл бұрын
the best investigation yielded approx. 10^42 positions and that is what counts.not the rough estimate of 10^120 possible games.
@allorgansnobody2 жыл бұрын
Wow just 4 minutes in and this is an excellent explanation. Just knowing whether or not stockfish had these "handcrafted" elements is so important to understanding how it works.
@themanwhoknewtoomuch66672 ай бұрын
Also didn't know Stockfish is classical. We tend to take engines as gospel.
@ruffianeo34182 жыл бұрын
There is one point, usually never mentioned. I will try to explain that (rather valid question) below, hoping, someone else will explain, why this is not a concern: Neural networks (deep or otherwise) act as function estimators. Here, it is the value function F(position) -> Value. As was pointed out early in the talk, this must be an approximation, because it would be cheating the universe if it managed to be exact in the presence of those high numbers of possible positions. (Store more information than number of atoms in the universe). So, an assumption is being made (and that is what is usually not elaborated): Positions, never seen before by the network still yield a value and the means of doing that is some form of interpolation. But for this to work, you assume a smooth value function (how ever high dimensional it is), you assume: V(P+Delta) = Value + Delta` for small enough deltas. So for this to work, the value function for chess has to be smooth-ish. But where did anyone ever prove, that this is the case? Here a simple example of the difference I try to point out: f1: Float -> Float f1 x = x * x If you sample F1 at some points, you can interpolate (with some errors) values between the samples: So, you train the network for, say: x in [1,3,5,7,...]. And when the network is trained and applied to values in [2,4,6], you get some roughly useful value (hopefully). Why? Because the function the network approximates is smooth. Here another function, not smooth: f2: Float -> Float f2 x = random x Training a network at the x in [1,3,5,7] cases does not yield a network which gives good estimators for the even x values. Why? Because that function is not smooth (unless you got lucky with your random numbers). So, which of the above F1, F2 is more akin to a chess value function V(position) -> Value? Who has shown, that chess is F1-ish?
@marcotroster8247 Жыл бұрын
You don't just learn a value function but also a strategy to weigh the trajectories sampled during training, so you can concentrate on good moves and their following moves to distill critical positions out of the game tree. It's quite clever actually. The strategy pi provides a random distribution indicating how likely each move is to be picked by the agent. When sampling a trajectory, you pick moves according to the distribution (stochastically), so the training experiences are just an empirical sample of the real game tree. Then you fit the distribution to explore good trajectories more intense by increasing their probabilities and vice versa. (Have a look at policy gradient / actor-critic techniques if you're interested) So to answer your question about smooth functions. You're usually only guaranteed to converge towards a local minimum of your estimator's error term. It's an empirical process, not an analytical one, so you cannot expect that from AI anyways. After all, you pick moves by sampling from a random distribution to model the intuition of "this move looks good" 😉
@congchuatocmay4837 Жыл бұрын
@@marcotroster8247 There are a lot of ways to go in higher dimensional space. If you get blocked one way, there are still 999,999 ways to go for example, or if you get blocked 1000 ways there are still 999000 ways to go. And that is how these artificial neural networks can be trained at all. They completely turn the tables on the 'curse of dimensional.'
@peterwinters85875 жыл бұрын
Monte Carlo probability has been around decades
@ME0WMERE2 жыл бұрын
10:45 as someone who is actually making a chess engine: Haha no.
@zeldasama2 жыл бұрын
The disconnect from professors to students. Lmao
@samreenrehman66432 жыл бұрын
They probably just suck at chess
@michaelmassaro43757 ай бұрын
@@samreenrehman6643they might suck at chess but than again they probably have a greater understanding of the elements the man is speaking on
@danruth108910 ай бұрын
tHANK YOU, but I disagree that rook usage
@michaelmassaro43757 ай бұрын
That Rook move was the best move it’s the only move to draw the game otherwise there is no stopping the Queen from checkmating you can try battling Rook vs Queen but that’s a losing effort
@desertplayz3955 Жыл бұрын
I wanna see stockfish pull a Jerome opening now
@A_Swarm_of_Waspcrabs Жыл бұрын
There's a KZbinr Joe Kempsey that forced Stockfish 14 to play the Jerome Gambit against MagnusApp
@GNU_Linux_for_good4 жыл бұрын
sacrifice the rook [Rc1+] in order to prevent an immediate mate.
@michaelmassaro43757 ай бұрын
After Queens captures than stalemate Whites king can not move correct
@avlavas3 жыл бұрын
Intel Core i7 11700K Motherboard Asus Z590 32GB RAM DDR4 1TB SSD GTX 1060 3GB DDR5 Hi this is my computer, and I use SF14, plus I have a 100 gb chess moves, but in a way I didn't get 100% of it. Can you help me? Ty
@beattoedtli10404 ай бұрын
Nice talk, but in 2024, Stockfish is still better than alpha zero. Why?
@urimtefiki2266 ай бұрын
I play and I do not think, just repeat the same things wasting my time while waiting for the bullshiter since 2016
@Best-go8cd4 жыл бұрын
u need a degree in computer science to know how AlphaZero work
@alexcwagner4 жыл бұрын
I have a degree in computer science. I still don't understand it.
@skinnyboystudios97224 жыл бұрын
you only need to read three books
@AshikVetrivelu4 жыл бұрын
Or just one paper.
@regularname18253 жыл бұрын
You cannoy know how alpha zero works Thats why ai is unpredictable A good way to put is thay It just works
@dustinjames12683 жыл бұрын
@@regularname1825 AI is not unpredictable unless it's self-learning. Not all AI improves on its own, and follows simple rules
@plunderersparadise Жыл бұрын
He sounds like he has no idea what he is talking about lol. Just the speech issue I think. Sorry for hate.
@michaelmassaro43757 ай бұрын
He’s not doing a good job in breaking down the mechanics in my view probably because of my own ineptness but I was hoping for more simpler terms and mechanisms to explain how it is the engines function
@saydtg78ashd3 жыл бұрын
They should show the slide fullscreen so we don't have to zoom our eyes. We don't need the footage of the presenter speaking.
@ME0WMERE2 жыл бұрын
they did? (Or very close to fullscreen anyway)
@your_average_joe57812 жыл бұрын
Footage is a term used for movie film. No film was used here so... No 'footage'👍
@geezlepuss78634 жыл бұрын
This video needs editing; it is over an hour long, for crying outloud.
@user-zb8tq5pr4x4 жыл бұрын
It's a lecture footage, not even meant for a youtube video, wtf are you expecting
@brettselph75913 жыл бұрын
I started getting dizzy with the spotlight darting around. The idea of the spotlight is to indicate to the audience, WHERE to focus their attention on a chart or visual aid with lots of details. But with the contrast set so high that the chart effectively DISAPPEARS, the effect is to scramble visual focus, opposite of what is intended. All we see is a dancing circle. Also, the video would be much better if MOST of the time, we don't see the stage and the ML CONFERENCE frame, but rather, the main window. It would be MUCH easier to follow. This is just a software choice. Hey, we DON'T need to see the frame, and we don't need to see the speaker. At least, not ALL of the time. GeezlePuss is right. This video needs editing. But not just for time. There too many things VISUALLY wrong with this video, so many things that get in the way of our SEEING the visual aids, that it's just impossible to watch. It's so bad, I decided to find a different video to watch. I'm ten minutes in, and I'm bailing out.
@douggale59623 жыл бұрын
Why do you want it to end? So you can go look through a hundred more stupid videos to find another good one? You should see a little X button at the top left or right. Click that, and you can edit out the rest of the video.
@dustinjames12683 жыл бұрын
If you don't have the attention span for an hour long video, then good luck in the real world where things aren't condensed all the time
@ivoplsek3 жыл бұрын
and what problem is there with a 1 hour video? I find it great..not sure why it needs editing?