Man this is your 1st video (probably not, according to professionalism in result) yet it is very well done, good luck to you and your channel, you deserve it, keep rocking!
@morganrademan137521 сағат бұрын
Incredible video! Would be facinating to see how the simulation changes with skill based matchmaking instead of a randomly selected opponent
@jhuyt-17 сағат бұрын
Nice video! IIRC modern games often use the glicko rating system insteadof elo. Might be material for a future video
@Tkdriverx9 сағат бұрын
Came down to suggest a video about the differences between Glicko and Elo (as well as other, less common rating systems)
@thatnhoxiu15 сағат бұрын
This is an amazing video omg. Im pretty sure many math enthusiasts are gonna enjoy this pop up in their YT feed. Would you ever consider doing a video on the Glicko rating system? Both Glicko and Glicko-2.
@bmenrigh7 сағат бұрын
This video was so good! Subscribed. I hope you tackle more related topics!
@aaugustinwwezel456821 сағат бұрын
13:09 the formula for transitivity has an extra equal sign i think. Anyway, great explaination !
@francescovultaggio254020 сағат бұрын
I loved this video! The fact you concluded by dispelling common misconceptions was very welcome.
@jhpenguinn20 сағат бұрын
Keep up the great work! I love the effort that you put into the video. Very impressive for your first upload!
@logician12347 сағат бұрын
great video! I always wondered how to derive elo system. btw, why did elo assume the variance stays the same? what if someone is more consistent in their play, while other players are more chaotic and have 'good' and 'bad' days? if we relax this assumption, do we get a more accurate model?
@therealbuggles15 сағат бұрын
THIS IS YOUR FIRST VIDEO??? I NEED MORE OF THESE!!
@notapplicable89577 сағат бұрын
Wanted to say again, I'm very impressed with this video. I really wish I was making content like this. Keep up the good work!
@victorwindahl49033 сағат бұрын
But how does the Thurstone model work in practice? I understand that it is mostly impossible to see which model is the most correct, but how do they differ and what are the strengths of both models? Great video and I hope you upload more!
@uddhavsaikia739Күн бұрын
Gonna go big for sure...keep doing it
@ghoststartaken17 сағат бұрын
Really good video! would love to see something similar on the Glicko system :D
@g4_6815 сағат бұрын
The effort put into this video amazes me, cant wait for a new video 🙏
@mie59539 сағат бұрын
Well done! Very clear and enjoyable explanations
@anaslakchouch20212 сағат бұрын
Great video, I’m really looking forward to see your upcoming videos, keep it up 👍
@vladthemagnificent905214 сағат бұрын
What an amazing video! clear visual style aids the explanations perfectly. I liked it a lot, and understood everything. I am only left with a question why didn't the original Elo rating based on the normal probability of performance ever catch up? is the assumption not correct? does it converge more slowly? I guess I'm gonna research those question son my own.
@xenmaifirebringer55212 сағат бұрын
Yeah! I was left humgry for a part 2 covering the gaussian-based model further!
@MikaKullu5 сағат бұрын
An excellent question. While I don't know what the complete answer is, I can say this: the integral of the Gaussian distribution at 19:38 (i.e. the cumulative distribution function of the Gaussian distribution) is what's called an Error function, denoted erf(x). The Error function belongs to a class of functions known as sigmoids, named as such due to their resemblance to the letter "s". The Logistic function is also a sigmoid, so already roughly the same shape. But in fact, the derivative of the Logistic function, somewhat confusingly called the Logistic distribution, is actually very similar to the Gaussian distribution - with the only qualitative difference being heavier tails. So in short, the Logistic function model should yield very similar results to a model based off the Gaussian distribution, with discrepancies most noticeable when the difference in player ratings is large.
@ghastor139323 сағат бұрын
Really excellent stuff! Subscribed and excited for anything you may make next!
@shrekeyes241020 сағат бұрын
Great video, it really showed the fundamentals of probability. If im gonna be honest, I didn't know what odds were since the term is so saturated in gambling lol. Did you use manim for this btw?
@j3m-math20 сағат бұрын
Yup, manim and Flash. I also never really knew what odds were before doing this project! That's why I felt the need to include a discussion about them.
@shrekeyes241020 сағат бұрын
@@j3m-math Did you voice this yourself?
@j3m-math20 сағат бұрын
@@shrekeyes2410 Yup
@derickd6150Күн бұрын
Man this is the best video I have seen on the Elo system. I love it. Do you have an explanation on how physicists may have come up with this model from their physics knowledge? It looks like the Fermi function in Fermi-Dirac statistics?
@j3m-math20 сағат бұрын
I didn't dive too deep into the history, but I think basically Elo used the Thurstone model originally, then they swapped in a logistic for a normal distribution later. To be clear (I've seen people get confused about this, including me): this does NOT mean that the Zermelo/Bradley-Terry model is equivalent to both players generating numbers from a logistic distribution, and then the bigger number wins. It's just that the Thurstone and BT models are both of the form p = f(R1 - R2), where f is an increasing function from R into [0, 1]. For the Thurstone, f is the CDF of a Gaussian, and for the BT it's a logistic function. I think they basically just swapped in a different f that gave better results. As for what motivated this specific choice of f... I don't know. In this video I motivate it the way Zermelo does in his paper, using this notion of "strengths", but I'm sure there are different angles that could be motivated by statistical physics as you suggest. I suppose you could start by looking at sections 8.3 and 8.4 of Elo's book (Elo 1978). In those sections he also cites (Elo 1966) and (Berkson 1929, 1944). I haven't looked into those. References: - Elo 1978, The Rating of Chess Players Past and Present - Elo 1966, Use of the Standard Sigmoid and Logistic Curves in Pairwise Comparisons (sounds very relevant!) - Berkson 1929, Application of the Logistic Function to Experimental Data - Berksen 1944, Application of the Logistic Function to Bioassay
@octopodes761912 сағат бұрын
@@j3m-math It's true that the Zermelo model is not equivalent to both players generating numbers from a logistic distribution with the bigger number winning, but there's something almost as good. The issue is that the difference of two iid logistic random variables is not logistically distributed, but if we could find a (family of) distribution(s) (say D) such that when X ~ D(a), Y ~ D(b) independent, then X - Y ~ Logistic(a-b,1), we'd be all set. The Zermelo model would then be equivalent to player 1 generating X, player 2 generating Y, and the largest number wins. It turns out there is a distribution that works, called the Gumbel distribution. You can go the other way too. For example, Elo's original model has a nice transitivity property; knowing p_ij and p_jk uniquely determines p_ik.
@Mathymagical8 сағат бұрын
Didn't expect to watch the whole video.
@Zcon1810 сағат бұрын
As good as 3B1B but it cites and explores the papers behind it? HOLY SHIT S TIER ❤️
@kevinpipich64963 күн бұрын
Really good stuff, well presented
@already_takent18 сағат бұрын
This is the best video on this topic, thank you
@MaxPicAxe15 сағат бұрын
i like the music in this
@timo684316 сағат бұрын
This video is how I wish I could have learnt the content from my Maths undergrad lectures!
@senbatifanola13 сағат бұрын
Wow, great video. I would live to see a video from you on Swiss score.
@nikanzamani154512 сағат бұрын
Hay, great video. Really enjoyed watching it. I was wondering about some other use cases of elo rating. Like in puzzle chess games where I would presume each puzzle gets a rating of its own and win or lose rating according to result of player getting the puzzle right or not. Or the rating system in the context of competitive programming, like in codeforces. It would be fascinating to see a video going deeper on less known use cases of these systems. thanks again for the video, great job.
@duarteribeiro152011 сағат бұрын
19:00 sigma 🗿🗿🗿 But for real very nice video
@another-youtube-user21 сағат бұрын
thanks for the great video! it seems like your target audience is math nerds that like chess. im here for it.
@jmarvins17 сағат бұрын
super high quality video!
@TheBodyOnPC17 сағат бұрын
If one where to simulate strengths from the historical Elo and use the algorithm of the modern Elo, what would the results be? In other words it would be interesting to investigate how robust the Elo system is to different models of the true strength data generating process.
@haniB226 сағат бұрын
lowkey goated video remember me when u hit 10k
@ChinmayKalegaonkarКүн бұрын
Really good video
@friendly_sitie17 сағат бұрын
phenomenal video!
@panagiotiskouzaris954621 сағат бұрын
Very nice video!
@sashaa973817 сағат бұрын
Great video! Keep it up!
@MathHunter12 сағат бұрын
Congrats, you got a subscriber
@pd28cat21 сағат бұрын
The noises they make are crazy
@Jonathan-ij2ky13 сағат бұрын
Incredable for a first vid.
@prokopiy28213 сағат бұрын
i am really really hoping this gets picked up by The Algorithm
@dimlanier342419 сағат бұрын
Really great video ! Now I'll always wonder to my self does the input variable of my sigmoid Neural Net satisfy the Zermalo model. If yes I'll see each neurone as a battle field between my variable 😂. 19:49 Hurts my math eyes to see a 4 under a square root 😅
@illusion655915 сағат бұрын
fantastic stuff
@Nirgranth17 сағат бұрын
You will be a big soon
@sidathsandaruwan7902Күн бұрын
Great content ❤
@vik_you25417 сағат бұрын
Hey, did you build this using Manim library, would be really interested to know...
@omri.d16 сағат бұрын
Is the option for a tie changes the model? I played a chess game on lichess and the rating of me and the opponent didn't change by the same value. What is different
@wofgh473617 сағат бұрын
very cool
@aitehs23 сағат бұрын
Впевнений ти станеш відомим, цікаве відео!
@forayer3 күн бұрын
Is this value C changing over time and is this behavior what people refer as elo inflation?
@j3m-math3 күн бұрын
(15:35 for context) I haven't really thought about Elo inflation, so everything I'm about to say is based on thinking about it while on a 20 minute walk. But... maybe? First of all, to be clear, the value of C definitely does not change "over time" as in over the course of one of the simulations in this video. C is constant during a single simulation since we can calculate it as the difference between the average estimated rating and the average true rating. But of course, what these simulations don't take into account is that true ratings shift as players get better... If everyone got better over time, then yes, I suppose that would cause "inflation", basically measured by the value of C? After all, the Elo system is "self-normalizing" in that it constrains the mean rating to be 1500 (or whatever initial rating you're using). So a 1500 now that everyone's better would not mean the same thing as a 1500 from 10 years ago. Changing player skill is actually one of the things that would be really cool for someone to look at in a follow-up video with more sophisticated simulations. Another thing I don't take into account here when I say average Elo is constant is that in real life, players _leave_. So, imagine after a while all the amateurs get bored and only the grandmasters are still playing. Their Elos won't all magically shift downward, so now, the average Elo would effectively go up from 1500 to, say, 2500. Alternatively, if you win a bunch of games and then bounce, you're basically pulling an Elo heist - hoarding a bunch of points and then disappearing with them. I'm not even sure how to factor that into this whole story.
@jhpenguinn20 сағат бұрын
@@j3m-math Thanks for the detailed explanation! That explains a lot. Love the research you put into this
@patlaciuga19 сағат бұрын
Reminds me of La linea
@weakspirit_8 сағат бұрын
3:50
@uddhavsaikia739Күн бұрын
Great stufff
@jakeaustria544523 сағат бұрын
Thank You
@VicExe_17 сағат бұрын
does it work like this on faceit in cs2?
@sodiboo5 сағат бұрын
No. Faceit uses a system they also call "elo", which in that context just means "any system where players are given a numeric rating, and their rating changes after each match based on the probability of winning". The models described in this video (the Zermelo model, and the historical elo system) fundamentally only work on two-player games. CS2 is not a two-player game, so the same system *cannot* be applied. (you could apply a similar model if every player has a fixed team they only ever play on, then the ratings apply to teams and it is effectively a two-player game. that's not how CS2 works, though) As far as i can tell, the exact mathematical model in Faceit is proprietary, but the "likelihood of winning" is based on the ratio of the *team ratings* between the two teams playing. The *team rating* is calculated on demand based on the team compositions, not just based on players' elo (the score shown on their profiles), but also takes into account the amount of matches played as well as the variance in skill among players. In the system used for chess, these are not taken into account, so there is no difference (in a single match) between being a complete newbie versus having played a thousand matches and ending up dead average (of course over time the newbie might converge to a different true rating). In faceit, this is not the case, and the team rating will be different depending on how many matches have been played previously to end up at these ratings. Fundamentally, its elo system has *some* similar properties to that used in chess, but the underlying mathematical model is nothing like what this video describes.
@darklion13Күн бұрын
This has the underling assumption that there is one true strength, which might not always be the case. For example if there are 3 players (p1,p2,p3), where p1 wins against p2, p2 wins against p3, and p3 wins against p1. Then the elo system will not be a good model for it. I believe that most times elo comes to use, is in online games to put fair matches. This example shows it might not be good enough and lacks the nuances to deal with the reality of fair matches.
@j3m-mathКүн бұрын
Absolutely! The property of odds transitivity shows very clearly that this model can't deal with "rock paper scissors" kinds of situations, where there are different "kinds" of good players that non-transitively beat each other. Of course, really that assumption is inherent in the very idea of a rating - the moment you're assigning ratings to people, you're saying player skill is transitive. Looking at how Elo performs with a game like that would be another fun idea to look at in some follow-up simulations.
@darklion1323 сағат бұрын
@@j3m-math I am not so sure that the assumption is inherent to the idea of rating. In the rock paper scissors example, you could say they all have the same rating. But we could also change that. for example, rock wins against paper 10% of times. Now rock is certainly the "better player", however if I understand correctly the current elo system will not diverge even though there seems to be a way you can rate them.
@j3m-math22 сағат бұрын
@@darklion13 Well, if Rock, Paper and Scissors all have the same rating, then the probability of any of them beating the other would have to be 50% - assuming that the win probability is some function of the ratings. What I mean is that any system in which P(P1 beats P2) = f(R1, R2) and the Ri are real numbers is doomed to be transitive - at least if you also throw in some kind of assumption of "monotonicity", like f(x, R2) is monotonically increasing in x or something (I haven't worked out the details). I feel like the simplest way to model a game with "non-transitive skill" would be to have a multi-dimensional rating. A really simple example off the top of my head would be to model the game internally as being rock-paper-scissors. Each player's "true rating" is modelled as a probability distribution over the set {Rock, Paper, Scissors}, and when two players play, they choose a strategy from that distribution (this has the added advantage over the Zermelo/Bradley-Terry model of automatically accounting for draws, which is cool). This is really a two-dimensional rating since the sum of the three probabilities has to add to one, so we lose a degree of freedom. Then maybe you could come up with some algorithm to estimate that "2d rating vector" from the outcomes of games. It's an interesting rabbit hole.
@octopodes761912 сағат бұрын
It depends on what you want to model! If you dump a bunch of players into a round robin tournament, and all you care about is everyone's final win-loss record, Elo works fine. All of the cycles where p1 beats p2 who beats p3 who beats p1 average out to zero. If you care about the outcomes of specific matchups, Elo simply doesn't carry enough information, although if you ask me it's a good place to start. If there are 3 players in a cycle like that, call them rock, paper, and scissors, and they played a lot of games, all of them would have an even record and the same rating. If rock wins against paper 10% of the time instead of 0%, then rock would have a slightly positive record and a higher Elo rating, while paper would have a lower rating.
@qbinwiz3 күн бұрын
are you michael fassbender you sound just like him
@lmarsh54077 сағат бұрын
The zermello system you developed is quite literally the shape functions we do in mechanical engineering for Finite Element Analysis of linear elements. (FEA) The idea is the same, draw a line and the weight of the value at node i is proportional to x/l, and at node j is x/l - 1. With Si and Sj, same notation!!
@lmarsh54077 сағат бұрын
And i have the exam for this tomorrow lol
@JusttLeagueoflegends12 сағат бұрын
Can anyone send this to Riot Games
@michaellatsky23 сағат бұрын
Helllllll yeahhhhhhhh
@nikilmanu33420 сағат бұрын
Who are you? Why haven't you been shown in my feed till now?
@thatnhoxiu15 сағат бұрын
This channel only has one vid, lol
@nikilmanu33411 сағат бұрын
@ nice lol
@shaunmodipane1Сағат бұрын
why dont IQ test work like this?
@nevokrien9521 сағат бұрын
The proof of convergnce is not that hard... the cost function is obviously convex and so a gradient decent algo should work.