The Elo Rating System

  Рет қаралды 12,778

j3m

j3m

Күн бұрын

Пікірлер: 83
@AnnPartie
@AnnPartie 3 күн бұрын
Man this is your 1st video (probably not, according to professionalism in result) yet it is very well done, good luck to you and your channel, you deserve it, keep rocking!
@morganrademan1375
@morganrademan1375 21 сағат бұрын
Incredible video! Would be facinating to see how the simulation changes with skill based matchmaking instead of a randomly selected opponent
@jhuyt-
@jhuyt- 17 сағат бұрын
Nice video! IIRC modern games often use the glicko rating system insteadof elo. Might be material for a future video
@Tkdriverx
@Tkdriverx 9 сағат бұрын
Came down to suggest a video about the differences between Glicko and Elo (as well as other, less common rating systems)
@thatnhoxiu
@thatnhoxiu 15 сағат бұрын
This is an amazing video omg. Im pretty sure many math enthusiasts are gonna enjoy this pop up in their YT feed. Would you ever consider doing a video on the Glicko rating system? Both Glicko and Glicko-2.
@bmenrigh
@bmenrigh 7 сағат бұрын
This video was so good! Subscribed. I hope you tackle more related topics!
@aaugustinwwezel4568
@aaugustinwwezel4568 21 сағат бұрын
13:09 the formula for transitivity has an extra equal sign i think. Anyway, great explaination !
@francescovultaggio2540
@francescovultaggio2540 20 сағат бұрын
I loved this video! The fact you concluded by dispelling common misconceptions was very welcome.
@jhpenguinn
@jhpenguinn 20 сағат бұрын
Keep up the great work! I love the effort that you put into the video. Very impressive for your first upload!
@logician1234
@logician1234 7 сағат бұрын
great video! I always wondered how to derive elo system. btw, why did elo assume the variance stays the same? what if someone is more consistent in their play, while other players are more chaotic and have 'good' and 'bad' days? if we relax this assumption, do we get a more accurate model?
@therealbuggles
@therealbuggles 15 сағат бұрын
THIS IS YOUR FIRST VIDEO??? I NEED MORE OF THESE!!
@notapplicable8957
@notapplicable8957 7 сағат бұрын
Wanted to say again, I'm very impressed with this video. I really wish I was making content like this. Keep up the good work!
@victorwindahl4903
@victorwindahl4903 3 сағат бұрын
But how does the Thurstone model work in practice? I understand that it is mostly impossible to see which model is the most correct, but how do they differ and what are the strengths of both models? Great video and I hope you upload more!
@uddhavsaikia739
@uddhavsaikia739 Күн бұрын
Gonna go big for sure...keep doing it
@ghoststartaken
@ghoststartaken 17 сағат бұрын
Really good video! would love to see something similar on the Glicko system :D
@g4_68
@g4_68 15 сағат бұрын
The effort put into this video amazes me, cant wait for a new video 🙏
@mie5953
@mie5953 9 сағат бұрын
Well done! Very clear and enjoyable explanations
@anaslakchouch202
@anaslakchouch202 12 сағат бұрын
Great video, I’m really looking forward to see your upcoming videos, keep it up 👍
@vladthemagnificent9052
@vladthemagnificent9052 14 сағат бұрын
What an amazing video! clear visual style aids the explanations perfectly. I liked it a lot, and understood everything. I am only left with a question why didn't the original Elo rating based on the normal probability of performance ever catch up? is the assumption not correct? does it converge more slowly? I guess I'm gonna research those question son my own.
@xenmaifirebringer552
@xenmaifirebringer552 12 сағат бұрын
Yeah! I was left humgry for a part 2 covering the gaussian-based model further!
@MikaKullu
@MikaKullu 5 сағат бұрын
An excellent question. While I don't know what the complete answer is, I can say this: the integral of the Gaussian distribution at 19:38 (i.e. the cumulative distribution function of the Gaussian distribution) is what's called an Error function, denoted erf(x). The Error function belongs to a class of functions known as sigmoids, named as such due to their resemblance to the letter "s". The Logistic function is also a sigmoid, so already roughly the same shape. But in fact, the derivative of the Logistic function, somewhat confusingly called the Logistic distribution, is actually very similar to the Gaussian distribution - with the only qualitative difference being heavier tails. So in short, the Logistic function model should yield very similar results to a model based off the Gaussian distribution, with discrepancies most noticeable when the difference in player ratings is large.
@ghastor1393
@ghastor1393 23 сағат бұрын
Really excellent stuff! Subscribed and excited for anything you may make next!
@shrekeyes2410
@shrekeyes2410 20 сағат бұрын
Great video, it really showed the fundamentals of probability. If im gonna be honest, I didn't know what odds were since the term is so saturated in gambling lol. Did you use manim for this btw?
@j3m-math
@j3m-math 20 сағат бұрын
Yup, manim and Flash. I also never really knew what odds were before doing this project! That's why I felt the need to include a discussion about them.
@shrekeyes2410
@shrekeyes2410 20 сағат бұрын
@@j3m-math Did you voice this yourself?
@j3m-math
@j3m-math 20 сағат бұрын
@@shrekeyes2410 Yup
@derickd6150
@derickd6150 Күн бұрын
Man this is the best video I have seen on the Elo system. I love it. Do you have an explanation on how physicists may have come up with this model from their physics knowledge? It looks like the Fermi function in Fermi-Dirac statistics?
@j3m-math
@j3m-math 20 сағат бұрын
I didn't dive too deep into the history, but I think basically Elo used the Thurstone model originally, then they swapped in a logistic for a normal distribution later. To be clear (I've seen people get confused about this, including me): this does NOT mean that the Zermelo/Bradley-Terry model is equivalent to both players generating numbers from a logistic distribution, and then the bigger number wins. It's just that the Thurstone and BT models are both of the form p = f(R1 - R2), where f is an increasing function from R into [0, 1]. For the Thurstone, f is the CDF of a Gaussian, and for the BT it's a logistic function. I think they basically just swapped in a different f that gave better results. As for what motivated this specific choice of f... I don't know. In this video I motivate it the way Zermelo does in his paper, using this notion of "strengths", but I'm sure there are different angles that could be motivated by statistical physics as you suggest. I suppose you could start by looking at sections 8.3 and 8.4 of Elo's book (Elo 1978). In those sections he also cites (Elo 1966) and (Berkson 1929, 1944). I haven't looked into those. References: - Elo 1978, The Rating of Chess Players Past and Present - Elo 1966, Use of the Standard Sigmoid and Logistic Curves in Pairwise Comparisons (sounds very relevant!) - Berkson 1929, Application of the Logistic Function to Experi­mental Data - Berksen 1944, Application of the Logistic Function to Bioassay
@octopodes7619
@octopodes7619 12 сағат бұрын
@@j3m-math It's true that the Zermelo model is not equivalent to both players generating numbers from a logistic distribution with the bigger number winning, but there's something almost as good. The issue is that the difference of two iid logistic random variables is not logistically distributed, but if we could find a (family of) distribution(s) (say D) such that when X ~ D(a), Y ~ D(b) independent, then X - Y ~ Logistic(a-b,1), we'd be all set. The Zermelo model would then be equivalent to player 1 generating X, player 2 generating Y, and the largest number wins. It turns out there is a distribution that works, called the Gumbel distribution. You can go the other way too. For example, Elo's original model has a nice transitivity property; knowing p_ij and p_jk uniquely determines p_ik.
@Mathymagical
@Mathymagical 8 сағат бұрын
Didn't expect to watch the whole video.
@Zcon18
@Zcon18 10 сағат бұрын
As good as 3B1B but it cites and explores the papers behind it? HOLY SHIT S TIER ❤️
@kevinpipich6496
@kevinpipich6496 3 күн бұрын
Really good stuff, well presented
@already_takent
@already_takent 18 сағат бұрын
This is the best video on this topic, thank you
@MaxPicAxe
@MaxPicAxe 15 сағат бұрын
i like the music in this
@timo6843
@timo6843 16 сағат бұрын
This video is how I wish I could have learnt the content from my Maths undergrad lectures!
@senbatifanola
@senbatifanola 13 сағат бұрын
Wow, great video. I would live to see a video from you on Swiss score.
@nikanzamani1545
@nikanzamani1545 12 сағат бұрын
Hay, great video. Really enjoyed watching it. I was wondering about some other use cases of elo rating. Like in puzzle chess games where I would presume each puzzle gets a rating of its own and win or lose rating according to result of player getting the puzzle right or not. Or the rating system in the context of competitive programming, like in codeforces. It would be fascinating to see a video going deeper on less known use cases of these systems. thanks again for the video, great job.
@duarteribeiro1520
@duarteribeiro1520 11 сағат бұрын
19:00 sigma 🗿🗿🗿 But for real very nice video
@another-youtube-user
@another-youtube-user 21 сағат бұрын
thanks for the great video! it seems like your target audience is math nerds that like chess. im here for it.
@jmarvins
@jmarvins 17 сағат бұрын
super high quality video!
@TheBodyOnPC
@TheBodyOnPC 17 сағат бұрын
If one where to simulate strengths from the historical Elo and use the algorithm of the modern Elo, what would the results be? In other words it would be interesting to investigate how robust the Elo system is to different models of the true strength data generating process.
@haniB22
@haniB22 6 сағат бұрын
lowkey goated video remember me when u hit 10k
@ChinmayKalegaonkar
@ChinmayKalegaonkar Күн бұрын
Really good video
@friendly_sitie
@friendly_sitie 17 сағат бұрын
phenomenal video!
@panagiotiskouzaris9546
@panagiotiskouzaris9546 21 сағат бұрын
Very nice video!
@sashaa9738
@sashaa9738 17 сағат бұрын
Great video! Keep it up!
@MathHunter
@MathHunter 12 сағат бұрын
Congrats, you got a subscriber
@pd28cat
@pd28cat 21 сағат бұрын
The noises they make are crazy
@Jonathan-ij2ky
@Jonathan-ij2ky 13 сағат бұрын
Incredable for a first vid.
@prokopiy282
@prokopiy282 13 сағат бұрын
i am really really hoping this gets picked up by The Algorithm
@dimlanier3424
@dimlanier3424 19 сағат бұрын
Really great video ! Now I'll always wonder to my self does the input variable of my sigmoid Neural Net satisfy the Zermalo model. If yes I'll see each neurone as a battle field between my variable 😂. 19:49 Hurts my math eyes to see a 4 under a square root 😅
@illusion6559
@illusion6559 15 сағат бұрын
fantastic stuff
@Nirgranth
@Nirgranth 17 сағат бұрын
You will be a big soon
@sidathsandaruwan7902
@sidathsandaruwan7902 Күн бұрын
Great content ❤
@vik_you254
@vik_you254 17 сағат бұрын
Hey, did you build this using Manim library, would be really interested to know...
@omri.d
@omri.d 16 сағат бұрын
Is the option for a tie changes the model? I played a chess game on lichess and the rating of me and the opponent didn't change by the same value. What is different
@wofgh4736
@wofgh4736 17 сағат бұрын
very cool
@aitehs
@aitehs 23 сағат бұрын
Впевнений ти станеш відомим, цікаве відео!
@forayer
@forayer 3 күн бұрын
Is this value C changing over time and is this behavior what people refer as elo inflation?
@j3m-math
@j3m-math 3 күн бұрын
(15:35 for context) I haven't really thought about Elo inflation, so everything I'm about to say is based on thinking about it while on a 20 minute walk. But... maybe? First of all, to be clear, the value of C definitely does not change "over time" as in over the course of one of the simulations in this video. C is constant during a single simulation since we can calculate it as the difference between the average estimated rating and the average true rating. But of course, what these simulations don't take into account is that true ratings shift as players get better... If everyone got better over time, then yes, I suppose that would cause "inflation", basically measured by the value of C? After all, the Elo system is "self-normalizing" in that it constrains the mean rating to be 1500 (or whatever initial rating you're using). So a 1500 now that everyone's better would not mean the same thing as a 1500 from 10 years ago. Changing player skill is actually one of the things that would be really cool for someone to look at in a follow-up video with more sophisticated simulations. Another thing I don't take into account here when I say average Elo is constant is that in real life, players _leave_. So, imagine after a while all the amateurs get bored and only the grandmasters are still playing. Their Elos won't all magically shift downward, so now, the average Elo would effectively go up from 1500 to, say, 2500. Alternatively, if you win a bunch of games and then bounce, you're basically pulling an Elo heist - hoarding a bunch of points and then disappearing with them. I'm not even sure how to factor that into this whole story.
@jhpenguinn
@jhpenguinn 20 сағат бұрын
​@@j3m-math Thanks for the detailed explanation! That explains a lot. Love the research you put into this
@patlaciuga
@patlaciuga 19 сағат бұрын
Reminds me of La linea
@weakspirit_
@weakspirit_ 8 сағат бұрын
3:50
@uddhavsaikia739
@uddhavsaikia739 Күн бұрын
Great stufff
@jakeaustria5445
@jakeaustria5445 23 сағат бұрын
Thank You
@VicExe_
@VicExe_ 17 сағат бұрын
does it work like this on faceit in cs2?
@sodiboo
@sodiboo 5 сағат бұрын
No. Faceit uses a system they also call "elo", which in that context just means "any system where players are given a numeric rating, and their rating changes after each match based on the probability of winning". The models described in this video (the Zermelo model, and the historical elo system) fundamentally only work on two-player games. CS2 is not a two-player game, so the same system *cannot* be applied. (you could apply a similar model if every player has a fixed team they only ever play on, then the ratings apply to teams and it is effectively a two-player game. that's not how CS2 works, though) As far as i can tell, the exact mathematical model in Faceit is proprietary, but the "likelihood of winning" is based on the ratio of the *team ratings* between the two teams playing. The *team rating* is calculated on demand based on the team compositions, not just based on players' elo (the score shown on their profiles), but also takes into account the amount of matches played as well as the variance in skill among players. In the system used for chess, these are not taken into account, so there is no difference (in a single match) between being a complete newbie versus having played a thousand matches and ending up dead average (of course over time the newbie might converge to a different true rating). In faceit, this is not the case, and the team rating will be different depending on how many matches have been played previously to end up at these ratings. Fundamentally, its elo system has *some* similar properties to that used in chess, but the underlying mathematical model is nothing like what this video describes.
@darklion13
@darklion13 Күн бұрын
This has the underling assumption that there is one true strength, which might not always be the case. For example if there are 3 players (p1,p2,p3), where p1 wins against p2, p2 wins against p3, and p3 wins against p1. Then the elo system will not be a good model for it. I believe that most times elo comes to use, is in online games to put fair matches. This example shows it might not be good enough and lacks the nuances to deal with the reality of fair matches.
@j3m-math
@j3m-math Күн бұрын
Absolutely! The property of odds transitivity shows very clearly that this model can't deal with "rock paper scissors" kinds of situations, where there are different "kinds" of good players that non-transitively beat each other. Of course, really that assumption is inherent in the very idea of a rating - the moment you're assigning ratings to people, you're saying player skill is transitive. Looking at how Elo performs with a game like that would be another fun idea to look at in some follow-up simulations.
@darklion13
@darklion13 23 сағат бұрын
@@j3m-math I am not so sure that the assumption is inherent to the idea of rating. In the rock paper scissors example, you could say they all have the same rating. But we could also change that. for example, rock wins against paper 10% of times. Now rock is certainly the "better player", however if I understand correctly the current elo system will not diverge even though there seems to be a way you can rate them.
@j3m-math
@j3m-math 22 сағат бұрын
@@darklion13 Well, if Rock, Paper and Scissors all have the same rating, then the probability of any of them beating the other would have to be 50% - assuming that the win probability is some function of the ratings. What I mean is that any system in which P(P1 beats P2) = f(R1, R2) and the Ri are real numbers is doomed to be transitive - at least if you also throw in some kind of assumption of "monotonicity", like f(x, R2) is monotonically increasing in x or something (I haven't worked out the details). I feel like the simplest way to model a game with "non-transitive skill" would be to have a multi-dimensional rating. A really simple example off the top of my head would be to model the game internally as being rock-paper-scissors. Each player's "true rating" is modelled as a probability distribution over the set {Rock, Paper, Scissors}, and when two players play, they choose a strategy from that distribution (this has the added advantage over the Zermelo/Bradley-Terry model of automatically accounting for draws, which is cool). This is really a two-dimensional rating since the sum of the three probabilities has to add to one, so we lose a degree of freedom. Then maybe you could come up with some algorithm to estimate that "2d rating vector" from the outcomes of games. It's an interesting rabbit hole.
@octopodes7619
@octopodes7619 12 сағат бұрын
It depends on what you want to model! If you dump a bunch of players into a round robin tournament, and all you care about is everyone's final win-loss record, Elo works fine. All of the cycles where p1 beats p2 who beats p3 who beats p1 average out to zero. If you care about the outcomes of specific matchups, Elo simply doesn't carry enough information, although if you ask me it's a good place to start. If there are 3 players in a cycle like that, call them rock, paper, and scissors, and they played a lot of games, all of them would have an even record and the same rating. If rock wins against paper 10% of the time instead of 0%, then rock would have a slightly positive record and a higher Elo rating, while paper would have a lower rating.
@qbinwiz
@qbinwiz 3 күн бұрын
are you michael fassbender you sound just like him
@lmarsh5407
@lmarsh5407 7 сағат бұрын
The zermello system you developed is quite literally the shape functions we do in mechanical engineering for Finite Element Analysis of linear elements. (FEA) The idea is the same, draw a line and the weight of the value at node i is proportional to x/l, and at node j is x/l - 1. With Si and Sj, same notation!!
@lmarsh5407
@lmarsh5407 7 сағат бұрын
And i have the exam for this tomorrow lol
@JusttLeagueoflegends
@JusttLeagueoflegends 12 сағат бұрын
Can anyone send this to Riot Games
@michaellatsky
@michaellatsky 23 сағат бұрын
Helllllll yeahhhhhhhh
@nikilmanu334
@nikilmanu334 20 сағат бұрын
Who are you? Why haven't you been shown in my feed till now?
@thatnhoxiu
@thatnhoxiu 15 сағат бұрын
This channel only has one vid, lol
@nikilmanu334
@nikilmanu334 11 сағат бұрын
@ nice lol
@shaunmodipane1
@shaunmodipane1 Сағат бұрын
why dont IQ test work like this?
@nevokrien95
@nevokrien95 21 сағат бұрын
The proof of convergnce is not that hard... the cost function is obviously convex and so a gradient decent algo should work.
@KeithBoehler
@KeithBoehler 18 сағат бұрын
Subscriber 225
Why is This Curve Called "The Witch"?
29:37
Another Roof
Рет қаралды 46 М.
СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️
01:01
DO$HIK
Рет қаралды 3,3 МЛН
Is the Future of Linear Algebra.. Random?
35:11
Mutual Information
Рет қаралды 375 М.
Why is there no equation for the perimeter of an ellipse‽
21:05
Stand-up Maths
Рет қаралды 2,2 МЛН
Solving Wordle using information theory
30:38
3Blue1Brown
Рет қаралды 10 МЛН
Arch Linux Experience - Virtualization
38:43
Bog
Рет қаралды 21 М.
Sherlock Holmes NEVER 'Deduced' Anything
29:38
Another Roof
Рет қаралды 347 М.
Python laid waste to my C++!
17:18
Sheafification of G
Рет қаралды 120 М.
Where does “e” come from?
14:45
Ali the Dazzling
Рет қаралды 24 М.
This Physics Problem is Unsolvable
7:33
Sabine Hossenfelder
Рет қаралды 219 М.
How to Master Deep Learning Math?
13:42
Deep Learning with Yacine
Рет қаралды 8 М.
Why We've Only Cured HIV Seven Times
12:05
SciShow
Рет қаралды 57 М.