Statistical Inception: The Bootstrap (

  Рет қаралды 27,647

Very Normal

Very Normal

10 ай бұрын

Stay updated with the channel and some stuff I make!
👉 verynormal.substack.com
👉 very-normal.sellfy.store
An entry for the 2023 Summer of Math Exposition (#SoME3)

Пікірлер: 73
@MultiWilliam15
@MultiWilliam15 9 ай бұрын
As a mathematics graduate student, it’s interesting to me how a lot of upper-level statistical tools are derived from real analysis. It’s like one of those courses that no mathematician wants to touch unless they’re working with PDEs or statistics.
@user-nf6jl9cg1t
@user-nf6jl9cg1t 8 ай бұрын
Why do mathematicians want avoid this?
@micayahritchie7158
@micayahritchie7158 8 ай бұрын
​@@user-nf6jl9cg1tit's hard
@pavlevilotijevic7266
@pavlevilotijevic7266 8 ай бұрын
@@user-nf6jl9cg1t It is a really technical course, lots of theorems, proofs and abstract thinking. This isn't unusual for mathematics but it is usually held in the first year so for a lot of people it takes getting used to. Also it is really big, basically every year a third of what you study is just different kinds of analysis (real, complex, functional, differential equations etc.). Still, it is objectively really useful and, if you can navigate through the mathematical rigor and detail, it is also really interesting and satisfying once you grasp the meaning.
@ivanjorromedina4010
@ivanjorromedina4010 7 ай бұрын
For me, real analysis is like the pretty popular girl, everyone wants to go with her. I know beacuse of the flatmates (I'm from Europe) that I had during my undergraduate, that there is also this type of subject in other degrees. For biology someone told me it is genetics, for medicine it is neurology and for physics it is astrophysics. Of course people don't want to mess around with real analysis and end up doing some descriptive set theory or other difficult stuff, but a lot of people like it because it is somehow intuitive at a very basic level (calculus, not foundational analysis). I want to mention that there is other places were you end up messing somehow with real analysis, but they are more higher level or advanced like harmonic analysis or measure theory.
@CsyroxXx
@CsyroxXx 5 күн бұрын
Finally someone who guides you in the right direction of WHY it works
@swaree
@swaree 9 ай бұрын
here from the SoME judging page, nice video that I hope gets recommended to more people. you earned a new subscriber. good luck with the contest!
@very-normal
@very-normal 9 ай бұрын
Thanks for your time! Glad you enjoyed it
@movax20h
@movax20h 8 ай бұрын
There is a restricted variant of bootstrap, where instead of doing independent sample samples using Monte Carlo, you use correlated sample sample by doing it iteratively. In each iteration you replace few random elements of the sample, from the full sample, making a new sample sample, instead of replacing everything. This is called Jack Knife method. I learned this when I was studying physics, and it is very rarely used method, but it is very powerful, because with proper coding (all usual statistics like mean, variance, moments, can be implemented using incremental and reversible updates, that do not depend on a sample sample size, but rather just on number or replaced elements), can speed up things A LOT. Nuclear physicist and physconomics people you this quite a bit, because it also gives you all kinds of estimates and error estimates for free.
@Singularity24601
@Singularity24601 7 ай бұрын
Huh? Don't know if we're talking about the same thing, but your description doesn't sound like jackknife resampling to me. Jackknife is rather oldschool and pre-dates bootstrapping by several decades. Any chance you're thinking of Gibbs sampling?
@patrickmoehrke4982
@patrickmoehrke4982 8 ай бұрын
Thanks for the great video! One thing that would have tied it up is bringing it back to the original example and briefly illustrating why bootstrap could work in that case
@asdf56790
@asdf56790 8 ай бұрын
What I don't get is that we wanted to avoid using the CLT, because of our small sample size. However, our reasoning for the bootstrap still seems to be asymptotics, so the assumption of a large n. Also: Where exactly do we use Monte-Carlo methods here? I understand them as an approximation method used for numerical estimates, but not for proofs? I guess it would've helped me a lot to apply the reasoning to our specific example. But this is a great topic and I'm super happy to see someone mathematically justifying it on KZbin
@andrewhubbard4043
@andrewhubbard4043 8 ай бұрын
I think the idea is that while you only have a sample n of 30, you can have a bootstrap n of much larger, say 200.
@icodestuff6241
@icodestuff6241 7 ай бұрын
the CLT only gives info about the mean, not the median, since we can see a visible skew
@Camstraction
@Camstraction 9 ай бұрын
I’m super glad this video was recommended to me, I really enjoyed it! Thanks for putting in the time and effort to make it. After subscribing I took a look at your channel to see if you had any other similar videos, but they’re mostly very short form. I hope you make more videos like this that are of a mid/long format. Good luck and I hope more people see this video and subscribe (:
@very-normal
@very-normal 9 ай бұрын
Thanks! More long form videos are definitely coming in the future!
@fiona8081
@fiona8081 6 ай бұрын
oh man, one of my thesis committee members just literally gave the comment "you should consider bootstrapping" with no further detail (for a project with several types of analyses), and all I knew was that when we learned how to run mediation in R using PROCESS in our first year stats class it used bootstrapping. So, this vid randomly getting suggested to me was pretty funny. Thank you for explaining the actual basic concept, bc like you said, mostly I had just seen high level math-heavy explanations, or instructions on how to do it in specific contexts.
@the_math_behind
@the_math_behind 8 ай бұрын
Great video! I had always wondered what was going on under the hood with bootstrapping. Can’t wait check out some of your other vids!
@nickdinenis9883
@nickdinenis9883 26 күн бұрын
Great video!!
@Mutual_Information
@Mutual_Information 8 ай бұрын
Very well done! Keep going!
@Danylux
@Danylux 8 ай бұрын
I've heard a lot about bootsraping from my phycisist friends and didn't know the details, I'm taking probability rn with a bit of measure theory and seeing how análisis and and measure theory make this idea useful is a really good insight!
@BleachWizz
@BleachWizz 8 ай бұрын
4:38 - ok before I get to the explanation here, when you said the concept of the idea it's just brilliant! I know it's unsatisfying to not know a proof but it feels like it just has to work if you're not with an exceptional case.
@nelson6814
@nelson6814 8 ай бұрын
This video is art, so good 😊🎉
@FlamenLion
@FlamenLion 9 ай бұрын
Delightfully interesting!
@davidmoore5846
@davidmoore5846 8 ай бұрын
Woohoo, I saw 999 subscribers and had to hit subscribe! Excellent explanation. I always introduce it by starting from an algorithm ("you just resample the data") and then moving forward to the more complicated terminology (bootstrap estimator, monte carlo estimate of bootstrap estimator). It's nice to see the historical and mathematical details being discussed!
@very-normal
@very-normal 8 ай бұрын
I will remember you forever (thank you for subscribing!)
@alexoprea1000
@alexoprea1000 9 ай бұрын
Really nice explanation
@papetoast
@papetoast 8 ай бұрын
I like the topic. A small criticism is that the video doesn't seem to be made with a clear target audience in mind, and as a result the math difficulty jumps around too much. For example, PMF and CDF probably don't need any explanation here. If someone doesn't know those, they likely won't understand everything else anyways.
@very-normal
@very-normal 8 ай бұрын
Thanks for watching and for the feedback! After watching 3b1b’s video on the SoME3 winners, I definitely see the importance of thinking of a target audience more. I’ll do my best to do better in future videos!
@SimGunther
@SimGunther 8 ай бұрын
Was gonna respond to another thread on this, but it's more convenient to have it here: ​The reason that the Monte Carlo method is named the way it is is due to Stanislaw Ulam's uncle and how he would borrow money from his relatives in Monaco to support his gamblin'. In knowing this, Nicholas Metropolis made the suggestion based off the Monaco Caseeno just to give Ulam and John von Neumann a code name required for a neutron simulation program. Yep, you can thank Metropolis for giving the name to methods used in these simulations for the Manhattan Project. That's some dark lore for you for some innocent non-deterministic math.
@sjoerdglaser2794
@sjoerdglaser2794 8 ай бұрын
Maybe I don't really understand it. But as you mention in the beginning (and I heard in my education), the bootstrap is mostly used when datasets are too small to provide meaningful estimates on their ow. However, the final part of why bootstrapping works still hinges mostly on asymptotics: in the limit of infinite datasets. So maybe I understand it wrong, but don't the asymptotics destroy the point of bootstrapping?
@very-normal
@very-normal 8 ай бұрын
I also learned early on that it could be used with smaller datasets, but it seems this isn’t the case. After some thinking, I think it was a mistake on my part to say that bootstrap can be used with smaller datasets. I don’t think there’s anything stopping us from using it with small datasets, but then we aren’t assured that the bootstrap estimate will converge to the underlying CDF. But from a practical perspective, it can still tell us a little bit about the distribution which can still be helpful. Thanks for watching the video and the feedback!
@trinalmage9446
@trinalmage9446 8 ай бұрын
This 'flaw' of the bootstrap is well-known: all justifications of the bootstrap are also asymptotic (there's some great articles by Le Cam on this). However, if you delve deep within these asymptotics, the bootstrap approximates something known as the Edgeworth expansion, and as such is expected to be a "faster", more reliable asymptotic than relying on the usual methods.
@ollie-d
@ollie-d 5 ай бұрын
Would’ve been helpful to walk through an example of applying the bootstrap to the data you framed the narrative around.
@AVeryLargeSon
@AVeryLargeSon 7 ай бұрын
Hey you probably used Latex to generate the text in this. Just FYI, a double backtick `` is used to compile a forwards facing quotation mark.
@kaiserruhsam
@kaiserruhsam 7 ай бұрын
specifically the phrase was meant as an example of something impossible to do, not simply lacking outside help.
@AllemandInstable
@AllemandInstable 7 ай бұрын
nice one
@yorailevi6747
@yorailevi6747 8 ай бұрын
Can you show an example of the bootstrap method?
@treelight1707
@treelight1707 8 ай бұрын
Although I kind of understand what bootstrap is about, I didn't get to the part where I can actually implement it. Nevertheless, I enjoyed the video very much, and I really like the channel name :) Looking forward for part 2.
@very-normal
@very-normal 8 ай бұрын
Thanks! I’ll always be kicking myself for not including that last analysis part lol, I thought people wouldn’t care about it but I was wrong
@trinalmage9446
@trinalmage9446 8 ай бұрын
This video displays some wonderful intentions and craftsmanship, especially when it comes to animation and the early parts! However, I hope you will accept some feedback: - 5:17 This phrase can cause confusion: I know you know that the PDF is not P(X=x), but "the likelihood of seeing different values" makes it sound like that for someone who doesn't know better. More importantly, the discussion can be had entirely without densities, and a quick comment on how the CDF describes distributions most generally since it encompasses both the discrete and continuous cases would probably have been clearer. - There are a lot of unclear phrases that hint at depth but are also just a tad confusing. "They (PDF and CDF) both describe the randomness in a random variable" doesn't really help me understand anything that's going on. - Talking about "dependence on n" wrt the cdfs and bootstraps is fine, but isn't the bigger, aching question the fact that you are approximating two **random** functions? This discussion seems to have been ignored, but kind of underlines the whole thing. Then again, seems like a hard task for a 10-minute video. Maybe it would've been better to leave the Monte Carlo explanation for a second part, discussing implementation? - The infamous middle step of the proof is a little confusingly laid out to me, still. But, seems like a frankly herculean task to try to explain all the baggage to properly grasp at why that proof-idea makes complete sense. - Given that you cite the Wasserman notes in the video, probably would've been nice to include them in the description? - On a minor note, these videos could do with a little more focus. You seem to go off on a few tangents related to you, your experience and how you learned the subject a fair few times, which are of course valid, but could be a little sparser, given the objective of the video is teaching others. Best of luck in your present and future academic and KZbin career!
@nujabraska
@nujabraska 7 ай бұрын
It would’ve been nice to return to your original example to see how a bootstrap method would transform your data, but other than that, good video. I have taken a couple stats classes at uni and that helped me understand, so someone less statistically inclined may have trouble understanding this video.
@imotvoksim
@imotvoksim 7 ай бұрын
Unless you've gone through a graduate statistics course, I don't think you have a chance at taking away much from this video. But since I went through a few such courses, I feel like you left me hanging in there with a teaser for an actual proof. Nice explanation overall, but I feel like the target audience could handle more depth. Also, it's funny that the intro finishes with "you can't always ask asymptotic to save you" and the video proceeds to proof the bootstrap through asymptotics.
@SebWilkes
@SebWilkes 7 ай бұрын
Thanks for the video! I just wonder if there could have been a demonstration on the clinical trial? Perhaps your point is that is still a lot of work, but if it was doable to show then I would be curious to see how the ideas behind bootstrapping then apply to "the real world". This is to say, some people learn better from examples!
@Singularity24601
@Singularity24601 7 ай бұрын
We generally design clinical trials in such a way that bootstrapping is not relevant. Bootstrapping is not in my starting lineup of methodological tools, it's more like a fallback for when all else fails.
@user-wn2vr3dg4z
@user-wn2vr3dg4z 8 ай бұрын
Not sure I understand what is the benefit in precision from bootstrap, as we could directly compute the CDF of the sampling distribution ?
@very-normal
@very-normal 8 ай бұрын
It can be used as a general tool since we may not always be able to compute the CDF directly. For example, if we’re dealing with functional data, we can use it to construct bootstrap intervals to understand the variability in the coefficient function.
@lorenzoplaserrano8734
@lorenzoplaserrano8734 9 ай бұрын
❤❤❤
@TheOneMaddin
@TheOneMaddin 8 ай бұрын
"It is called Monte Carlo because we are picking multiple boot strap samples at random" ... okay, cool ... but why is it called Monte Carlo?
@very-normal
@very-normal 8 ай бұрын
Great question! That’s worth talking about in a video
@tomasalvim1022
@tomasalvim1022 7 ай бұрын
Casino from Monaco
@tomasalvim1022
@tomasalvim1022 7 ай бұрын
Really liked the video, but the music is a little loud.
@gingeral253
@gingeral253 7 ай бұрын
Interesting
@jestingrabbit
@jestingrabbit 7 ай бұрын
My brother in christ, the content is good, the repetitive flashing of a horizontal white bar about a third from the bottom and left of the screen is incredibly distracting. The 'this is a video' conceit is no worth this distraction.
@Possumman93
@Possumman93 8 ай бұрын
So, what was the treatment effect
@very-normal
@very-normal 8 ай бұрын
forty-two
@htomerif
@htomerif 8 ай бұрын
I didn't like the part where we didn't get to see some actual analysis of the original data set presented to us at the beginning of the video. What I got out of it was that this bootstrapping "works" but the degree to which it works depends on the size of the data set. I don't get why the number of unique bootstrap data sets is relevant compared to n^n. I'm guessing it's not relevant and computationally constructing only unique data sets would be very bad for accurately embiggening the original data set. I also don't get if this is supposed to lend some continuity to the data set that "works"? My guess is "no". For example, in the real world there's the TAS2R38 gene that lets people taste certain chemicals. There are (so far as I can tell) 8 genotypes for this gene. If I sampled 200 people for sensitivity to tasting a chemical relevant to TAS2R38, all 200 of them would fall into one of exactly 8 bins (I don't think it's partially dominant). If I didn't know this already, any statistical analysis that assumed the continuity of the represented total population and generated so much as one extra bin (much less a continuity of bins) would be really very broken. So that's what I'm worried about with this bootstrapping. There are some situations where I would want a nice curve and some where the discreteness is pretty critical information, but I would have to know which I was looking for in advance for the bootstrapping to work, which kind of unstraps the bootstrapping. Does that make sense? Anyway, its not that I didn't like the video, I did. It just felt like instead of being handed a tool, I was handed an advertisement for a tool. I'm obviously not a statistician but I'm also not an idiot, so I guess I'll have to do some more looking into this on my own.
@wanfuse
@wanfuse 8 ай бұрын
can I get permission to use this content in an open source project I am making?
@very-normal
@very-normal 8 ай бұрын
Go for it!
@sherifffruitfly
@sherifffruitfly 8 ай бұрын
YOU CAN'T JUST USE ASYMPTOTICS TO SOLVE ALL YOUR PROBLEMS! (dies laughing)
@sherifffruitfly
@sherifffruitfly 8 ай бұрын
and naturally he immediately moves on to use asymptotics with his bootstrap estimators :P love the video tho - thank you! :)
@daniellebalouise9596
@daniellebalouise9596 8 ай бұрын
What are the prereqs for me to watch this video?
@AP-dc1ks
@AP-dc1ks 8 ай бұрын
Since you choose the median by looking at the data, don't you just invalidate the method as a whole? If you pick your methods that you like most after checking the data (especially a method like the median, that effectively invalidates parts of the data) you might as well forge your own data instead? You literally say you choose the median precisely because it removes data you don't like.
@very-normal
@very-normal 8 ай бұрын
I think you’re referring to p-hacking. If I had done a statistical test first and it gave me a result I didn’t like, it would be wrong of me to select a new model to try to get a new result, like you said. But in this case, I’m trying to pick a model that better suits this dataset; the sample median can perhaps perform better than the mean because it’s resistant to outliers that I notice in the data.
@trinalmage9446
@trinalmage9446 8 ай бұрын
@@very-normal He's not wrong, actually. By conditioning the estimator you use on the data that you've observed, the correct distribution for the median that you're calculating refers not to median(X), but median(X)|there are outliers. Frequentist statistics requires a very "multiversal" approach to these things; you have to pay attention to the choices you haven't made. You can probably easily show via simulation that this procedure of "mean, but if there are outliers, median" will not lead to the same distribution for the median as simply using it in the first place. This difference may be small, but can be important, and as a general practice is best left avoided.
@Singularity24601
@Singularity24601 8 ай бұрын
I would argue that medians are overrated and rarely a good idea. High school tells us that medians are like means but better, because they're robust to outliers, while neglecting to tell us that such robustness comes with costs. Besides, "robustness" to outliers is just another way of saying "ignorant" to outliers. Yes, many clincal trials do report medians, but these are generally for illustrative purposes only and have nothing to do with the actual hypothesis testing. I think you could have done the same presentation without going into medians.
@Hamiltonianmcmc
@Hamiltonianmcmc 7 ай бұрын
Robustness is very much not overrated. The reason why it is not overrated has to do with the fact that we aren't really ever actually interested in the average treatment effect itself; we have to settle for it. A doctor is truly interested in knowing how the medicine he prescribes will impact the patient right in front of him, not how it will impact patients on average. Looking for average/median treatment effects is a compromise with reality; we are truly interested in how medicine will TEND to impact people so that we can talk about how we think it will impact someone in front of us. For this reason, robustness can become important; especially, for example, when we are comparing two medicines against each other. What if most people get a small effect from medicine A with huge effects for some others, while medicine B tends to give much better effects for most people but with no outliers so that it looks like there's not much of a difference between the two? Consider this! People who discount robustness or say "never throw away data unless you can absolutely show that it was a mistake" are showing that they are not focused on the problem on hand and the needs of the end users of their analyses, but on dogma. Every single choice must be informed by the problem domain you are working in and the relative cost/benefit of being right or wrong when it comes to the decisions or actions that the analysis informs.
@Singularity24601
@Singularity24601 7 ай бұрын
​@@Hamiltonianmcmc Robustness is indeed overrated. I am a doctor (and a biostatistician). Are you? Allow me to address some of your points individually: 1. "A doctor is truly interested in knowing how the medicine he prescribes will impact the patient right in front of him, not how it will impact patients on average..." Firstly, the hard reality is that no one can predict the future with perfect accuracy. Hence, we cannot know how the medicine will impact the patient, we can only estimate a distribution of possible outcomes (preferably based on the data of other patients with similar details to the patient in front of us). We can summarize this distribution with a mean or a median. Secondly, you claim to value the details of the individual patient in front of us... this strengthens my argument, not yours, for means incorporate all the details whereas medians throw away details. 2. "What if most people get a small effect from medicine A... Consider this!" Consider what? I think it'll be more effective if you state your point rather just a rhetorical question. The scenario you paint is not that unusual for a day at my work, yet I still can't answer your question because it's not clear what question you're trying to ask. 3. "People who discount robustness or say 'never throw away data'... [are focused] on dogma." That's a rather sweeping statement to claim... dogmatic even. I am a cancer doctor. When we are trying to cure someone with a small chance of cure, outliers can be everything. Two treatments can have the same median overall survival, but one gives a 49% chance of long-term survival and the other gives a 0% chance of long-term survival. The median dogmatist claims that these two treatments are equal. I claim they are not.
@mv3845
@mv3845 Ай бұрын
The music is so distracting:(
@very-normal
@very-normal Ай бұрын
Sorry about that, there is a version without the music in the description available
@twandekorte2077
@twandekorte2077 Ай бұрын
Great video!
The most important ideas in modern statistics
18:26
Very Normal
Рет қаралды 105 М.
100+ Statistics Concepts You Should Know
13:46
Very Normal
Рет қаралды 12 М.
Is it Cake or Fake ? 🍰
00:53
A4
Рет қаралды 17 МЛН
1❤️#thankyou #shorts
00:21
あみか部
Рет қаралды 83 МЛН
Универ. 10 лет спустя - ВСЕ СЕРИИ ПОДРЯД
9:04:59
Комедии 2023
Рет қаралды 1,3 МЛН
WHY IS A CAR MORE EXPENSIVE THAN A GIRL?
00:37
Levsob
Рет қаралды 21 МЛН
Explaining Probability Distributions
12:54
Very Normal
Рет қаралды 12 М.
The Fascinating Math behind Piston Extenders #SoME3
20:08
mattbatwings
Рет қаралды 557 М.
Monte Carlo and Bootstrap Methods Introduction
27:07
Fourth Z
Рет қаралды 2,2 М.
The Boundary of Computation
12:59
Mutual Information
Рет қаралды 955 М.
What is Bootstrapping? - Computerphile
8:19
Computerphile
Рет қаралды 422 М.
The most important skill in statistics
13:35
Very Normal
Рет қаралды 302 М.
The Concept So Much of Modern Math is Built On | Compactness
20:47
Morphocular
Рет қаралды 373 М.
What does a complex function look like? #SoME3
20:38
mathematimpa
Рет қаралды 118 М.
A problem so hard even Google relies on Random Chance
12:06
Breaking Taps
Рет қаралды 1,1 МЛН
The better way to do statistics
17:25
Very Normal
Рет қаралды 165 М.
Is it Cake or Fake ? 🍰
00:53
A4
Рет қаралды 17 МЛН