The Z Factor - Numberphile

  Рет қаралды 115,313

Numberphile

Numberphile

3 ай бұрын

Featuring Tom Crawford. Jane Street skyscraper puzzle (and info on the AMP program) at bit.ly/numberphile-amp --- More links & stuff in full description below ↓↓↓
Jane Street’s Academy of Math and Programming is now accepting applications for their summer 2024 program, which will run from June 29th-August 2nd in NYC... Or you can just check out the puzzle for fun too - bit.ly/numberphile-amp
Tom Crawford's website, with links to his work and other outreach: tomrocksmaths.com
More Tom videos on Numberphile: bit.ly/Crawford_Videos
Tom on the Numberphile Podcast: • The Naked Mathematicia...
Patreon: / numberphile
Numberphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: bit.ly/numberphile-janestreet
We're also supported by the Simons Laufer Mathematical Sciences Institute (formerly MSRI): bit.ly/MSRINumberphile
Our thanks also to the Simons Foundation: www.simonsfoundation.org
NUMBERPHILE
Website: www.numberphile.com/
Numberphile on Facebook: / numberphile
Numberphile tweets: / numberphile
Subscribe: bit.ly/Numberphile_Sub
Video by Brady Haran and Pete McPartlan
Numberphile T-Shirts and Merch: teespring.com/stores/numberphile
Brady's videos subreddit: / bradyharan
Brady's latest videos across all channels: www.bradyharanblog.com/
Sign up for (occasional) emails: eepurl.com/YdjL9

Пікірлер: 425
@idonno87
@idonno87 3 ай бұрын
I think the best way to interpret the Z-score here is 'how much better than the competition were they in their time'. If you use the Z-score to directly compare two teams, you could argue that someone in an amateur league with a better Z-score is a better team than a professional team.
@plackt
@plackt 3 ай бұрын
Correct. We use IQ as an example in my classes because it is *always* M = 100 and SD = 15, because it is “normed” to the current population. What this ignores is what’s called the Flynn Effect, which is that IQ is constantly increasing (or we’re getting better at taking tests). So someone with an average IQ now (100) is smarter than someone with that same average IQ score 50 years ago.
@wobaguk
@wobaguk 3 ай бұрын
Yes there isnt an allowance for the whole league being stronger over time. You can kind of get away with it in this example, but its the 'Babe Ruth vs Modern Baseball or Muhammed Ali vs Modern Boxer' questions that get posed where it falls apart, never mind the 'who is the greatest motorsport driver' questions where the actual demands and rules of the sport change heavily over time.
@davejacob5208
@davejacob5208 3 ай бұрын
yeah, the point about somehow getting rid of the comparative/relative nature of the data irritated me pretty much, you cannot turn the information "how many times did they win/lose AGAINST OTHER TEAMS" into a datapoint that gives us information independant of the teams they played against.
@GlacialScion
@GlacialScion 3 ай бұрын
That's what he says in the video.
@EebstertheGreat
@EebstertheGreat 3 ай бұрын
@@wobaguk Yeah for a lot of sports that won't work. For some events like swimming, the sport has changed only a little, but times have improved a lot. Evidently, the average professional swimmer today is faster than the best professional swimmers from 50 years ago. So a Z-score comparison wouldn't be very helpful there.
@seanbucket
@seanbucket 3 ай бұрын
To answer the question at the end, the lowest z-score to win in the end theat i found was 1900 VFL Melbourne team who won from 6th with 24 points (league average of 28). They had a z-score of -0.33072.
@TimmehTRP
@TimmehTRP 3 ай бұрын
You got the result for a regular system, where it's just 1st place points = 1 place, not like Belgium/Australia?
@seanbucket
@seanbucket 3 ай бұрын
4 points for a win, 2 for a draw, 0 for a loss
@mad_vegan
@mad_vegan 3 ай бұрын
20:50 Yeah, just a 6.5, no big deal. For those who don't know, this has been cited as the most impressive statistics in all of sports.
@esotericVideos
@esotericVideos 3 ай бұрын
Didn't the guy say he'd never seen one higher than 4? Then we end the video with a 6.5? Heck, the whole video maybe should have been about Bradman.
@matthiasgreen4042
@matthiasgreen4042 3 ай бұрын
I plot Z-scores every day and all the axes cut off at 4. I've only seen a couple players barely above 4, 6.5 is ridiculous!
@Kleyguerth
@Kleyguerth 3 ай бұрын
@@esotericVideos He never saw higher than 4 for team games won in a season. Bradman data is not a team winning games
@CheeseAlarm
@CheeseAlarm 3 ай бұрын
I sat through the whole video waiting them to get to Bradman.
@joshuaychung
@joshuaychung 3 ай бұрын
He was beyond 6 sigmas away from the mean. There is a GE executive somewhere who just orgasmed talking about this.
@djsmeguk
@djsmeguk 3 ай бұрын
That bradman fact at the end. He was just totally god tier. 20:49
@Lashb1ade
@Lashb1ade 3 ай бұрын
A 1-in-25Billion performance! Edit: If there have been 117Billon humans in history, then only 4-5 humans have ever perfected their craft to this level.
@Mark_Williams.
@Mark_Williams. 3 ай бұрын
Don Bradman at Z=6.5 is insane!
@kiltysalter2966
@kiltysalter2966 3 ай бұрын
I spent half the video wondering about Don Bradman’s Z-score. Had a feeling it would be huge. Thanks for including that.
@jursamaj
@jursamaj 3 ай бұрын
7:45 No. It doesn't remove the issue at all, it just ignores it. It remains true that the same Z factor from different sets may mean totally different things as far as which team is better.
@davidgustavsson4000
@davidgustavsson4000 3 ай бұрын
Yeah, it sounds like he has it backwards. You can't compare historical teams directly, but Z scoring lets you compare how they played relative to their competition.
@jonpaxman
@jonpaxman 3 ай бұрын
And in the NBA it's virtually impossible because of how dramatically teams can change from one year to another. In other sports you can use an ELO rating idea to compare teams from different times. But the further apart they are in time, the harder it is to draw a sound conclusion.
@matthiasgreen4042
@matthiasgreen4042 3 ай бұрын
As a football data analyst, I can confirm that z-scores are used everywhere, for instance comparing metrics that are measured differently (i.e. goals scored and possession %). They also allow us to create compound metrics which are more reliable and meaningful. Normalizing data without losing the details of how it's distributed (as you would with percentile ranking) is fantastic.
@therealax6
@therealax6 3 ай бұрын
Why would you lose that data with a percentile ranking? A z-score can be trivially converted to a fractile and back, after all.
@matthiasgreen4042
@matthiasgreen4042 3 ай бұрын
@@therealax6 if your data perfectly follows the normal distribution that is true, but not with real data. For example the top competitor's percentile ranking will always be 100%, no matter how far ahead of the others they are. Z-scores however reflect that gap.
@therealax6
@therealax6 3 ай бұрын
@@matthiasgreen4042 That makes sense! I always assumed you'd just adjust fractiles for discrete data so that they represent the midpoint of a continuous range (e.g., you have four data points: use fractiles .5/4, 1.5/4, 2.5/4 and 3.5/4 instead of 0/3, 1/3, 2/3, 3/3), but that adjustment is probably too inaccurate to adequately capture the edges of the underlying distribution.
@derektrombley589
@derektrombley589 3 ай бұрын
Somewhere, Michael Jordan is taking this video personally...
@cliftonchurch6039
@cliftonchurch6039 3 ай бұрын
If Matt Parker isn't already using this to comedically compare F1 driver Lewis Hamilton's career against NASCAR driver Jimmie Johnson's in order to troll as many people as possible, I bet he is after reading this.
@spudmanwsmb
@spudmanwsmb 3 ай бұрын
Don Bradman having a Z Factor of 6.5 is insane
@RetroMoviesAndGames
@RetroMoviesAndGames 3 ай бұрын
I just want to say that I recently have gotten a job tutoring math to fellow college students - something that i would have never even considered before stumbling upon Numberphile! math used to be a subject I lamented, now it's my favorite.
@therealax6
@therealax6 3 ай бұрын
Throughout the years I've come to realise that people don't hate math; they hate math class. Math at general education levels (primary school, secondary school, entry-level courses at university, etc.) is taught in a way that makes it about as uninteresting and uninspiring as a subject can be. Students are taught to pointlessly memorise formulas that they then have to echo back in the correct order without knowing why or what for. Students are almost never exposed to actual problems that let them exercise their problem-solving skills, sadly.
@grahamwilson8843
@grahamwilson8843 3 ай бұрын
Same here!! I love getting opportunities to show people that math isn't so bad.
@emilyrln
@emilyrln 2 ай бұрын
@@therealax6yes, I was very lucky to have parents who were huge math nerds and multiple teachers who made the subject engaging… and even then, I still had huge math anxiety and took computer science instead of math at university to fulfill the requirement.
@unvergebeneid
@unvergebeneid 3 ай бұрын
The Z Factor sounds like a game show where teen contestants face each other in a battle royale-style TikTok dance-off.
@elkjaer28
@elkjaer28 3 ай бұрын
I am actually quite unsure if you can use the analogy of z-score to portray greatness. An underlying assumption is that the observations are i.i.d. (independent and indentically distributed) and they are clearly not independent. The total number of wins is equal for each season - if one wins, another one looses. And because you compare how much of an outlier you are relatively to the seasonal distribution, you end up just comparing how much your win share is different from the others - while their win distribution are dependent on eachother - which could lead to some weird constructs of high z-scores occuring more often than they actually should. I think a much better estimate is the combinatorical calculation of how likely that win number (with all the other teams win number also happening) is to occur for a season!
@bberk1286
@bberk1286 3 ай бұрын
I completely agree, using the Z-score of win totals across different sports leagues makes no sense. I don't think it's even very useful to compare across eras of a single sports league as many others have said. The wins aren't independent, the league average win rate is absolutely necessarily 41 wins in an 82 game season, and there is a limit on the standard deviation that's related to the number of games too. So comparing the biggest outlier for wins in the NBA vs the Premier league vs the MLB doesn't really work. Much less comparing Z-scores of wins in these leagues to independent statistics. My back of the napkin math says it's mathematically impossible for an NBA team's "win rate" Z-score to reach the Bradman batting score of 6.5 that's quoted. But that's not to discount that Bradman score as an incredible outlier. Use Z-score to show me how far above league average Jordan's midrange efficiency was and compare that to Curry's insane 3 point shooting and I'll listen, because you can use independent data points.
@B3Band
@B3Band 3 ай бұрын
Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.
@trademarked2476
@trademarked2476 3 ай бұрын
@@B3Bandbro this is a MATH video
@4DINK
@4DINK 4 сағат бұрын
Hey, would you be able to further expand on how you would use the combinatorial calculation? What would you define as the 'n' and 'r' for example in the case of the Bulls vs Warriors situation? Thanks!
@bertblankenstein3738
@bertblankenstein3738 3 ай бұрын
Very nice. I can understand this math and it is also put into context of "how exceptional" performances were.
@ultramarathonman100
@ultramarathonman100 3 ай бұрын
This approach is used all the time in physics. Theoretical physicists commonly plot in terms of dimensionless parameters to distill the physics without influence from individual parameters
@marksteers3424
@marksteers3424 3 ай бұрын
Not comfortable with three aspects of the assumptions. First we take discrete data (not a particularly large dataset) and fit it to a continuous function. It is not possible to win 13.2 games of basketball and there is no possibility of winning 200 games in a season even though the normal distribution says there is a very small but finite probability. Second, the wins could be distributed very differently in different seasons. In one extreme the overall winners could win every game with all other teams winning just above or just below half of their games. In another extreme, the winning team could win by the narrowest of margins (overall points scored and same number of wins as the team that came second. Third, the winning margins etc could be identical but it could be that all teams in the second dataset are ALL either much better or much worse than in the comparison dataset.
@soloban81
@soloban81 3 ай бұрын
There are discrete distributions that could be used as well.
@marksteers3424
@marksteers3424 3 ай бұрын
@@soloban81 Agreed - though not mentioned in this video
@AbelShields
@AbelShields 3 ай бұрын
Yes, but the normal distribution is a close approximation to the discrete distribution (in this case probably a binomial) given large enough sample size and p not too close to 0 or 1. Agree with the other two points though :)
@user-tp2tu8jp2x
@user-tp2tu8jp2x 3 ай бұрын
I was expecting an ELO model video. I believe this is the best way to compare teams and performance across different leagues, sports, etc. Even adding some modifiers to the model can take into consideration home and away factors, best results against certain opponents and so
@aaronb.5185
@aaronb.5185 3 ай бұрын
Beautiful as always!
@MrConverse
@MrConverse 3 ай бұрын
It’s cute to see how Tom really knows some (mostly math) stuff but clearly also has no clue about other (mostly sports) stuff. :-)
@ANunes06
@ANunes06 3 ай бұрын
So I have often wondered about "adding dimensions" to this process. Like, if you wanted to examine two different but connected measures (say total wins AND points differential), could you just slap the second measure down as an orthogonal axis and glean anything useful from that? And if so, can you just kind of Linear Algebra + MV Calculus your way to more robust conclusions? It naively feels like it should be an option, but I can't even begin to conceive of the methodology you'd use to ... do it.
@agmessier
@agmessier 3 ай бұрын
What about the '95/'96 Bulls? That was supposed to be the best year for the Bulls.
@B3Band
@B3Band 3 ай бұрын
Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.
@B3Band
@B3Band 3 ай бұрын
It was an error. He meant to use the 95/96 team.
@Iztari
@Iztari 3 ай бұрын
Would love to see the Z factors around Wayne Gretzky. Every statistical fact I've read of him seems unreal.
@allank8497
@allank8497 3 ай бұрын
Amateur NBA statistical analyst here! This concept is exactly what i use to compare player statistics across eras. I don't like using it for team comparisons, however, because throughout nba history the number of teams in the league has changed a bunch of times, and when theres fewer teams, one particular team's dominance counts against them more than when theres more teams. They play opposing teams more often, and so if theyre really good, opposing teams will naturally lose more of their games. For instance, in the 1967 nba season, the Philadelphia 76ers set an NBA record in winning 84% of their games (68-13). In the 1960s is was infrequent for the best teams in the league to win more than even 60 games, let alone 68. There were only 10 teams in the league at the time, however, so the bottom teams that year, like the Baltimore Bullets, who went 20-61, might seem like an left hand outlier in our distribution, but they played the 76ers 9 times and went 1-8 against them, whereas nba teams today with 30 teams play each other between 2 and 4 times. For this reason you have to exclude the team in question from the distribution. Doing so, the '67 Sixers have a z score of 2.82, compared to the 16 warriors score of 2.5 What i use z-score for is individual player statistics, and doing "inflation adjustments" between eras. I think this works much better because this means there are way way more data points and so you dont run into the problem of one outlier player inflating the standard deviation of the distribution, working against their own chances of producing a high z score
@pedroyochinori8371
@pedroyochinori8371 3 ай бұрын
So, applying this technique, who wins then? The 1994 brazilian world champion football team, Mike Tyson or the Nigerian bobsled team they made that movie about? In a more serious note: Does the quality of what makes a sport change little enough over time that these comparisons can be made? Is Basketball played primarily for 3 pointers the same sport as Basketball played mostly for 2 pointers?
@levibernard1838
@levibernard1838 3 ай бұрын
Really??? Nigerian bobsled team? Completely wrong continent dude
@pedroyochinori8371
@pedroyochinori8371 3 ай бұрын
@@levibernard1838 actually Jamaican, found it after searching. Movie name is Cool Runnings 😅
@hus390
@hus390 2 ай бұрын
The man compared two basketball teams in two different eras! The 3 points were there in the 1990s! While you are ridiculously trying to dismiss him by talking about three different sports! .. go away!
@pedroyochinori8371
@pedroyochinori8371 2 ай бұрын
Dismiss 😅? It's just a question. A reply with yes or with no and maybe some reasoning are both fine. I guess you can also try to dunk (get it?) on other people for internet brownie points too though. Cheers! 🎉
@dontich
@dontich 3 ай бұрын
This is a big thing in baseball stats - IE OPS+, ERA+ are all similar type metrics but just divide by the average instead of taking the z score. It would be the same ranking if you assume that the in season personal variance is not useful
@trygswyrmwoodside3229
@trygswyrmwoodside3229 3 ай бұрын
When you look into salary cap changes, you get a better feel for just how strong MJ was. There was two years where he got paid over 100% of the bulls normal salary cap, and all the other players signed for tiny contracts that were fully in the luxury tax area. If you gave Lebron James or Steph Curry 100% of a teams salary cap, and only put players around them that would sign for bare minimums, they would literally never win a single match.
@fernandocarrazzoni
@fernandocarrazzoni 3 ай бұрын
The 1995-96 Bulls, actually.
@misterkefir
@misterkefir 3 ай бұрын
Both 95-6 and 96-7 Bulls. Easily, too.
@turmericgarage8509
@turmericgarage8509 3 ай бұрын
Assumption is that rules, athleticism, and technical skills have remained the same.
@peregrinef3203
@peregrinef3203 3 ай бұрын
The biggest problem with this (in my mind) is that the two teams were playing under different sets of rules. The NBA changed rules over this time, so how are you accounting for the different sets of rules? Would the 1990s Bulls performed better under the new rules that promoted offense?
@phyphor
@phyphor 3 ай бұрын
Indeed. It would assist that one could use the same mathematics to compare, say, a basketball team and a javelin thrower, because everything becomes a unitless Z-score
@phyphor
@phyphor 3 ай бұрын
Heck, you could compare a basketball team to a singer (where the number and length of time at number 1 in the charts is used as the metric), or to anything else that you can rank. I'm not sure I'm ok with that, though.
@B3Band
@B3Band 3 ай бұрын
Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.
@hus390
@hus390 2 ай бұрын
@@phyphorthat’s a lame way to dismiss the video! Dude compared same sports and same league but different eras. Not any meaningful changes in the rules. I still put my eggs on Bulls though!
@KipIngram
@KipIngram 2 күн бұрын
It's also important to note that what you're really saying is that Golden State might beat Chicago if they played a statistically significant number of games. You really can never tell how one particular game is going to go.
@mathewlegge639
@mathewlegge639 3 ай бұрын
This is why statistics is the best 😄 Great video!
@realdreamerschangetheworld7470
@realdreamerschangetheworld7470 3 ай бұрын
I really like the clarification of what you can actually compare using this methodology. And that there are a whole host of unlisted factors which contribute to a win in a head to head matchup. Love the video!
@frankharr9466
@frankharr9466 3 ай бұрын
That's pretty interesting. Thank you. I feel like e's in the standard deviation there somewhere.
@maverator
@maverator 3 ай бұрын
I'd like a tour of the office.
@Dr_Tim_Sidnell
@Dr_Tim_Sidnell 3 ай бұрын
As an engineer, you'll see some very impressive Z factors from manufacturing lines. Particularly in industries where performance is literally life or death. For example, in aviation engineering, a manufacturer may achieve Z factors of 7 or 8 for a given part made in large quantities. Anything above 6 is equivalent to a process making a faulty part less than one in a million times, hence there's an entire branch of engineering rigour known as Six Sigma.
@12tone
@12tone 3 ай бұрын
Another interesting wrinkle in all this is that most sports statistics are competitive. Like, The 2015/16 Warriors getting so many wins reduced the number of available wins for all the other teams, because all of them played (and, mostly, lost) games against GSW. This makes it harder to meaningfully compare across eras, because your sense of how strong the competition is is, in part, self-correcting, so just dividing out the standard deviation doesn't necessarily account for it entirely. (For a toy example, consider a tournament where matches are decided by rolling a 6-sided die, but then one year they add a rule where every player adds 4 to their result. The win distribution should have exactly the same standard deviation, but even mediocre players from the +4 year will obviously have a huge advantage in hypothetical head-to-head matches against top players from other years.) This comes up a lot in baseball discussions: Modern pitching has advanced so far that it's pretty questionable whether Babe Ruth could even get to play in the major leagues today, whereas a replacement-level modern hitter sent back to his time would likely be a superstar because they'd trained against better opponents. I don't think any of this discounts Tom's actual point (in fact, I think it supports it) but it's interesting to think about how these statistics are shaped.
@monkerud2108
@monkerud2108 3 ай бұрын
yeah, you can argue which team is better in the context of their season. but you can't say which is more likely to win in a match on the grounds of that, for example it is easy to imagine the Chicago bulls data set coming from a wheel chair basket ball league team, identical numbers, but that team would lose badly against the actual Chicago bulls unless by some miracle wheelchairs beat legs. the direct comparison cannot be made between the teams, because the conclusion is a function of the dataset not the teams ability to play. it just means they were better in their time, playing the teams they did :).
@B3Band
@B3Band 3 ай бұрын
Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.
@VINTERIUM..EXPLORIUM.1
@VINTERIUM..EXPLORIUM.1 3 ай бұрын
Nice Work & Video 👍
@swill128
@swill128 3 ай бұрын
Normally I have no clue wtf they’re talking about on numberphile. Today however, is intro to statistics stuff. A class I’ve taken 3 times. 😅
@ItachiUchiha-ns1il
@ItachiUchiha-ns1il 3 ай бұрын
The rules were also a bit different back then, so it’s difficult to compare.
@DeGuerre
@DeGuerre 3 ай бұрын
Now I kind of want to see Sachin Tendulkar against a bodyline bowler.
@B3Band
@B3Band 3 ай бұрын
Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.
@hus390
@hus390 2 ай бұрын
@@B3Bandno meaningful changes in the rules. I still would put my eggs on the bulls’ basket
@RobertJordan7
@RobertJordan7 3 ай бұрын
The Z-Score compares unique data sets assuming the competition level is equal. It doesn’t account for modern teams being more developed from decades of improvement in sports science, conditioning, & play-calling. It might work for 2 teams 5-10 years apart with similar strength leagues but over time most sports drift. At 14:55 he does mention that it doesn’t directly say who would win in a modern head to head game.
@ThePeadar2211
@ThePeadar2211 3 ай бұрын
That skyscraper solve at the end was dire.
@maxid87
@maxid87 3 ай бұрын
"The numbers outside the grid indicate how many skyscrapers are visible in that direction" has nothing to do with height - I was wondering if that makes the outside numbers completely meaningless!?
@ThePeadar2211
@ThePeadar2211 3 ай бұрын
@@maxid87 It's a standard puzzle type but the way it's presented here is really confusing. The number indicates how many absolute increases are in that row or column. Or how many times a new height record is broken from that direction. So when the 6 is outside the grid that should indicate that there are 6 increases in that direction which is forced to be 1 2 3 4 5 6. But in this solve they've put a 6 in the first position so it can't increase any further.
@rmsgrey
@rmsgrey 3 ай бұрын
Yeah, putting a 6 inside the grid next to the 6 outside makes it clear they're not actually attempting to solve the puzzle. Also, they have two 2s in that left column, which breaks the "Latin Square" rule..
@joinedupjon
@joinedupjon 3 ай бұрын
My half baked thought is how does fitting a standard distribution to the Scottish premier league work - afaik the SPL normally has 2 super teams (Rangers & Celtic) that win almost every game they play and then 'the rest' who are scrapping amongst themselves for the lower positions.
@awaedin
@awaedin 3 ай бұрын
You might want to look into the ELO rating system. Its been in use for over 60 years in rating chess players, and probably will do a better job of comparing across different time periods.
@goldnutter412
@goldnutter412 2 ай бұрын
Revisit this in 100 years please someone, see what has changed..
@richardmellish2371
@richardmellish2371 3 ай бұрын
Integrating from minus infinity to x1 makes sense for the mathematical function f(x) but not for the distribution of people's heights.
@empmachine
@empmachine 3 ай бұрын
Wait a second... so what happens when the stddev -> 0?? (i.e. all the teams tie, and the final winner is from a league-wide-tie-breaker)? That team, that simply won the tie-breaker, would have A Z-FACTOR THAT EXPLODES.. no..??
@heeerrresjonny
@heeerrresjonny 3 ай бұрын
Is there anything invalid about finding the Z-scores for multiple data points and then averaging them to create sort of a "z-index"? I know often times doing things like that has unintended effects lol, but on the surface it seems like it might provide an even more "informed" comparison between the teams.
@NaR00W
@NaR00W 3 ай бұрын
Depends on if you care about the result being correct. If you just want a result then it's fine
@alexandrecolautoneto7374
@alexandrecolautoneto7374 3 ай бұрын
Pele mentioned, Brazil mentioned. Let's go!
@aonpl
@aonpl 3 ай бұрын
Chicago bulls with mj, any time, always. No math needed.
@collapsibletank
@collapsibletank 3 ай бұрын
Maths Schmaths - Lovely t-shirt!
@kirkanos771
@kirkanos771 3 ай бұрын
This method allows to compare competitivity but it cannot answer the question as stated. Because it cannot estimate technicity (or team and personal performances, each time you try, you'll confront their data to some sets of numbers of their adversaries in their era). Each generation its own. Young pele at his peak couldnt shine in today's football the same. However you choose your dataset, choosen numbers are anchored to their generational context. So after such a time gap, it's mostly possible competitors became stronger in their art. Adversity grows, competitivity oscillates (it's also circumstantial), and technicity grows. Thus statistics are super important and powerful on medium periods.
@clarkaustin6294
@clarkaustin6294 3 ай бұрын
The Bulls’ 72-10 season was in 1995-1996 and not 1996-1997.
@FedeDragon_
@FedeDragon_ 3 ай бұрын
So cool!
@laggyy3987
@laggyy3987 3 ай бұрын
He already explained clearly that this is just about one parameter (number of wins in the season) among peers of their period. If you want to level up the complexity, you could add more parameters to consider then get their combined Z scores. For “changing rules” adding complexity alone: you need to establish how you would have gotten that rule-changing factor in a numeric scale… that’s math for some other days. Again, you can’t ignore the bravery of them bringing this topic up while telling others his favorite Premier League team… and the comment with the most upvote proves people went out of their way to be emotional for just, one, parameter’s Z-factor analysis… (that’s in the field of psychology on team sports)
@laz001
@laz001 3 ай бұрын
Soooo - my question. Is there a way of more directly comparing teams from different years by trending the z-factors of ALL the teams, to create a lineage from team from era A to team from era B?
@gameofquantity96
@gameofquantity96 3 ай бұрын
Thank you so much
@unnamed7225
@unnamed7225 3 ай бұрын
I think the real champions are our dads when we were in primary school
@WrongParadox
@WrongParadox 3 ай бұрын
nope! lots of dads are not even remotely thought of in a positive light by their primary school aged children. There are even lots of examples in literature.
@Yggdraseed
@Yggdraseed 2 ай бұрын
Just to be especially contrarian a month after my opinion would have been welcome: while I'm no basketball expert, my understanding is that the rule changes since Michael Jordan retired from basketball have also made the defense's job harder than it was when Jordan had to get in to score points. Back when the defense had more options for positioning, he was still doing incredible numbers each game. So you need to factor in the fact that the 2016 Golden State Warriors also had less obstacles to scoring than the Bulls did when Jordan played, meaning if he were playing under 2016 rule sets, he'd likely be scoring more than he was when he was playing professionally.
@blinblue
@blinblue 3 ай бұрын
My statistics brain is a bit rusty... Does it make a difference that data points for "Win %" are very much not independent? In that, for one value to go up, some one else's value *must* go down? Compared to say, "Total Points" which could be higher or lower for a given team without necessarily affecting another team's outcome.
@awaedin
@awaedin 3 ай бұрын
I assume that if you simply ranked using win percentage you'd get the same ranking as using the z factor?
@eliasmochan
@eliasmochan 3 ай бұрын
About Brady's last question. This is not the actual actual answer, but let me talk to you about 2004 Pumas (UNAM) in Liga MX (although I think it wasn't called that back then). In those times the Liga used to have a very weird format, in which 18 teams were split into 3 random groups, but everyone would play everyone (even if they were in a different group). Then, the best of 2 of each group and the best 3rd places would go on to the playoffs. This made no sense, but it was the format. As a ressult, Pumas got second place in their group despite being actually 9th overall. They went on to win the playoffs and win the championship, with a Z-factor of -0.1 (yes, that's a "minus" sign on front). EDIT: Ok... I only counted the regular season and not the playoffs. If we add the playoffs the Z-factor goes up to 1.63. But it's a bit unfair as it gives some teams more games than others. If we take teh playoffs into account and measure by (points gained)/(number of games) we get Z=1.07.
3 ай бұрын
Let me understand this... On one side you have 29 players with an Elo ranking under 1000 and one with a 2500 Elo, playing a double round robin league. On another side you have 29 players with an Elo ranking between 2400 and 2490 and one with a 2500 Elo, in a similar league. In both cases the Z factor of the 2500 Elo player will be the same? Won't there be much more distance between the 29 players group and the winner in the first case than in the second? PS. If my memory is not wrong, per each 400 Elo points difference, you have about 90% chance to win the game. So, 800 points is 99,01%. 100 points more is about 65%, 50 points is 57%.
@MadMan123654
@MadMan123654 3 ай бұрын
Could you do the NFL by breaking down the games played into each individual play? For example, could you measure the offense's performance in terms of American Yards gained on the field per play against the Yards prevented by the defense on each play? Obvious pitfalls here is it doesn't account for long drives from one end of the field toward the goal that does *not* result in a score, but I think it would help to more accurately determine the output of the team's performance in term of their athletic ability?
@thatguyintheback1656
@thatguyintheback1656 3 ай бұрын
Goes to show just how impressive it was that Cleveland beat GS that year!
@kevinstewart2572
@kevinstewart2572 3 ай бұрын
Acknowledging, as you said yourself, a limitation of this analysis is "based on level of skill at that time," let's just assume that the average level of competition in different times is equal, then maybe you could do the following statistical calculation: Given z1 = 2.06 (Bulls) and z2 = 2.54 (Warriors), what is the probability that Warriors would win over Bulls? Or for another way of looking at it, if the two teams would play 100 games, what's the most likely split of wins between them?
@markconrad9619
@markconrad9619 3 ай бұрын
Baseball is the ultimate statistical nerd sport. Sheer number of data points is endless.
@thenashville5withchriseber738
@thenashville5withchriseber738 3 ай бұрын
The only thing I can think of that has happened in baseball that might have a Z=3.5+ is two grand slam home runs, in the same inning, by the same player off the same pitcher. Fernando Tatis Sr in 1999. Even that isn’t 70 goals in 20 games level.
@likenem
@likenem 3 ай бұрын
This doesn't count rule changes. I think it depends on what rule set the twi teams have to play with.
@NataliaBazj
@NataliaBazj 3 ай бұрын
However, there cannot be a normal distribution of values whose distribution is fundamentally asymmetric. For example, the height of people. If it had a normal distribution, it would mean that, although very rare, there are people with negative height. The central limit theorem states that standard distributed are the *mean* value of peoples' height measured by different researchers or taken from various reference books. Not the heights of different people in any - even as big as UK - set.
@clayspillars
@clayspillars 3 ай бұрын
I would like to see an average of the Z factors between several stats, from important like wins to seemingly unimportant like hand size and team average height. Also Pele’s 3.8 is amazing but what about Michael Phelps and his Gold medals?
@christianellegaard7120
@christianellegaard7120 3 ай бұрын
You could argue that the strongest team was the one that won the season with the lowest standard deviation.
@davidhasen7983
@davidhasen7983 3 ай бұрын
I think that doesn't account for the strength of the winner. Something like the ratio of Z-score to standard deviation might be better.
@B3Band
@B3Band 3 ай бұрын
Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.
@magnus425
@magnus425 2 ай бұрын
Can you please share datasets I can use to test this out?
@alexzavoluk2271
@alexzavoluk2271 3 ай бұрын
The 1996/97 bulls was not the Bulls' highest winning season. They won 69 games that season, but 72 games the year prior. Brady's question around 13:10 is quite prescient, because for smaller sample sizes of games, average point differential is a better predictor that team's future win record than their past win/loss record. I'm not sure where the crossing point is (i.e. is it more or less than 82 games). Doesn't this analysis require that each season, individually, is at least approximately normal? There are (well, at least since shortly before the Bull's season in questions) about 30 teams in the NBA, which is right on the cusp of the commonly used threshold to apply a normal approximation, but looking at the histograms around 6:00, these look only very roughly normal. A few other factors that one would want to take into consideration: -teams play more games against other teams in their conference (East or West) than in the other conference, and the 2 conferences are often of unequal strength. I believe both teams were in the stronger conference in their own time, but I don't have a good measure of by how much. I'm actually not sure how this affects their z-score: In some sense, it might be less "surprising" that a particularly good team would emerge from a group with a higher mean, but their performance is also further above the *league* average than their W/L implies. -many teams will deliberately rest late in the season, especially good teams who have secured a playoff spot. But a team that is close to setting a record might continue to play as hard as possible, and in fact I believe the Warriors seemed to do this, and it might have contributed to their comparatively poor performance in the playoffs.
@batlrar
@batlrar 3 ай бұрын
A lot happened in '96, so this all depends on whether Jordan has met and trained with the Cartoon All-Stars and saved the planet from the Monstars already. You'd also have to ban his world-destroying Chaos Dunk, for obvious reasons.
@patrese993
@patrese993 3 ай бұрын
Is Steve Kerr playing or coaching then?
@hypejuice1321
@hypejuice1321 3 ай бұрын
Fascinating video! Backing warriors any-day though😅
@srwapo
@srwapo 3 ай бұрын
As someone who grew up in Chicago-land in the 90's, it's obviously the Bulls.
@CaioTrinchinato
@CaioTrinchinato 3 ай бұрын
I would love to see how to correctly combine multiple types of data into one graph. Something like point per match and free throw conversion and win by how many points and so on... My first instinct is to transform everything in percentages... but its just wrong lol
@Nemelis0
@Nemelis0 3 ай бұрын
Sounds to me that to have an ultimate Z-factor for sports you should combine a number of Z-factors in all kinds of statistics which are really key for a sport and work that out in 1 Z-factor.
@EliezerGrawe
@EliezerGrawe 3 ай бұрын
I would love to see F1 comparision. Senna, Shumacher, Prost, Hamilton...
@basieluxanno7909
@basieluxanno7909 3 ай бұрын
now I am interessting how strong Verstappen was in the view of the Z-factor
@ccoderproject
@ccoderproject 3 ай бұрын
I always wondered how the positive values like human height can be normally distributed.
@oldcowbb
@oldcowbb 3 ай бұрын
it's not, it's really close because the mean is high and the standard deviation is small, so the non-negative effect is negligible, it's just another instance of central limit theorem
@mykolanikolayev1714
@mykolanikolayev1714 3 ай бұрын
Tom, can we set sigma not just as real value. But as Divisor function(perfect number?)
@AndersHaalandverby
@AndersHaalandverby 3 ай бұрын
If you were talking about the actual best team, you would have to team up the 90's team with the 2020's teams, and in that case I think the old teams would lose every match. If you look at a more interesting and much more popular sport, like juggling for instance, almost no-one could even do 7-clubs juggling in the 90's, but now "everyone" does it. I believe this massive improvement is probably occuring in lesser sports like football and basketball as well.
@steeevealbright
@steeevealbright 3 ай бұрын
Isn't it possible that ALL the other teams in 2001 were really bad, but the same amount of bad as each other? The distribution being tighter doesn't have to mean that the competition was tougher. In fact, I don't see any relation between those two things.
@goldnutter412
@goldnutter412 2 ай бұрын
Bradman practiced with a gold ball and a wicket as the bat Genius
@NaR00W
@NaR00W 3 ай бұрын
Who's better Ninja Turtles or Street Sharks. Figure that out math boy
@flickingbollocks5542
@flickingbollocks5542 3 ай бұрын
He ran out of paper so had to scribble on his arms.
@R_V_
@R_V_ 3 ай бұрын
​@@flickingbollocks5542 On his arms and probably elsewhere too, but I'm not sure I want to know...
@GhettoFabulousLorch
@GhettoFabulousLorch 3 ай бұрын
Could you check the Z factor of all teams in the league for each season and then measure who had better win records against the rest of their respective leagues' Z factors? That could determine a more answer the question better.
@reidflemingworldstoughestm1394
@reidflemingworldstoughestm1394 3 ай бұрын
To clarify, when he says the z factor is just an alternate pronunciation for the z factor.
@trevorbennett8438
@trevorbennett8438 3 ай бұрын
I think what bugs me about using wins is that it's zero-sum. So, within a league # of wins of each team is not an independent event. When working with statistics, don't you want each of your datapoints to be independent?
@DekarNL
@DekarNL 3 ай бұрын
This is very flawed unless you intended to compare the relative percormance compared to their respective fields. To me it sounded like you wanted to compare who would win in a match between the two basketball teams if they were to play against one another.
@Tker1970
@Tker1970 2 ай бұрын
In MLB, there 162 games every year. Its said each team wins 54, loses 54. Its what you do with the other 54 that matters.
@IrishEye
@IrishEye 3 ай бұрын
Liverpool's Z score the year they won the league. Nice.
@shruggzdastr8-facedclown
@shruggzdastr8-facedclown 3 ай бұрын
If no one has corrected Tom yet re. his knowledge of the total regular season games in an NFL season, a couple/few years ago the League decided to add an additional game to the regular season; so, now it's 17 rather than 16 games -- not that that additional data point for the sample size is all that much more statistically significant, of course
@Typical.Anomaly
@Typical.Anomaly 3 ай бұрын
Holy Sh!t, guys, you had me at Jordan! Insta-upvote.
@mobilephil244
@mobilephil244 3 ай бұрын
The usual problem with applying statistical methods to real-world things: It doesn't work well. The real world has too many "other factors".
@SubtleForces
@SubtleForces 3 ай бұрын
Is Ole Einar Bjørndalen the King of Biathlon? I wonder if there is a calculation of biathletes z-factor per year and since they are individual athletes, whether the evolution of the z-factor for all athletes can allow us to do two things: normalize an athletes performance over time and hence make an actual inter-generational comparison (without the mess of players changing teams). Trouble is, I would have to work with ranks in races not wins...
@gustrindade
@gustrindade 3 ай бұрын
Please please do a part 2 focusing on various z-scored stats for Messi v. Ronaldo
The Beautiful Math of Snakes and Ladders - Numberphile
21:46
Numberphile
Рет қаралды 203 М.
Error Correcting Curves - Numberphile
17:46
Numberphile
Рет қаралды 230 М.
OMG 😨 Era o tênis dela 🤬
00:19
Polar em português
Рет қаралды 3,8 МЛН
Stupid man 👨😂
00:20
Nadir Show
Рет қаралды 29 МЛН
О, сосисочки! (Или корейская уличная еда?)
00:32
Кушать Хочу
Рет қаралды 6 МЛН
1. Introduction to Statistics
1:18:03
MIT OpenCourseWare
Рет қаралды 1,9 МЛН
Why do all shapes lie in the Polyhedron Plane?
24:01
Stand-up Maths
Рет қаралды 221 М.
Archimedes' Genius | Proof of the Archimedes Quadrature formula
5:00
The Rookie Nerds
Рет қаралды 10 М.
But what is the Central Limit Theorem?
31:15
3Blue1Brown
Рет қаралды 3,3 МЛН
A Problem with Rectangles - Numberphile
17:12
Numberphile
Рет қаралды 467 М.
The Brick Factory Problem - Numberphile
14:51
Numberphile
Рет қаралды 418 М.
What are these strange dice? - Numberphile
16:28
Numberphile
Рет қаралды 131 М.
How do you DERIVE the BELL CURVE?
35:21
Mathoma
Рет қаралды 108 М.
Carregando telefone com carregador cortado
1:01
Andcarli
Рет қаралды 1,3 МЛН
ПРОБЛЕМА МЕХАНИЧЕСКИХ КЛАВИАТУР!🤬
0:59
Добавления ключа в домофон ДомРу
0:18
How Neuralink Works 🧠
0:28
Zack D. Films
Рет қаралды 30 МЛН
IPad Pro fix screen
1:01
Tamar DB (mt)
Рет қаралды 3,4 МЛН