Bootstrapping and Resampling in Statistics with Example| Statistics Tutorial #12 |MarinStatsLectures

  Рет қаралды 112,231

MarinStatsLectures-R Programming & Statistics

MarinStatsLectures-R Programming & Statistics

Күн бұрын

Пікірлер: 133
@marinstatlectures
@marinstatlectures 5 жыл бұрын
👋🏼 Hello there! In this statistics lecture we learn the Bootstrap method (a brute force method) in statistics, along with why one may want to use such an approach. Bootstrap in statistics is a re-sampling based approach, useful for estimating the sampling distribution and standard error of an estimate. If Like to support us you can Donate (bit.ly/2CWxnP2), Share our Videos, Leave us a Comment and Give us a Like 👍🏼 ! Either way We Thank You! 🦄
@Isuppose12
@Isuppose12 4 жыл бұрын
Thank you Mike! I have a question (I probably got it wrong...). In the lesson, you used a small sample of 5 (5 observations), so is it true that there would only be 5 to the power of 5 = 3125 ways of different sampling? If so, how would it help to have B bigger than 3125?
@evon4441
@evon4441 3 жыл бұрын
Your video saved me. Thank you soo much. Really appreciated :) :)
@raminessalat9803
@raminessalat9803 Жыл бұрын
So many youtube videos that try to explain bootstrapping and yet this guy explains it so well you don't even want have to try hard or anything to understand! He understands it well and explains it well!
@rachelzhao6624
@rachelzhao6624 6 жыл бұрын
The last question is the high light of the whole video!!! You teach much clearly than my prof!
@marinstatlectures
@marinstatlectures 6 жыл бұрын
thanks, we appreciate that :)
@mrvvrm5951
@mrvvrm5951 4 жыл бұрын
Do you guys need this for.your brain to come somewhere, o my god professor needed to come some where in your own theori. It oke there are some gow need guidance line from a PROF. your the prof in your brain or not
@mrvvrm5951
@mrvvrm5951 4 жыл бұрын
You mean you not we follow the leader is you you will never be a leader but a follower. Like following mami to come to the playingyard
@pedronucci2095
@pedronucci2095 5 жыл бұрын
the best explanation available in the internet!
@marinstatlectures
@marinstatlectures 5 жыл бұрын
thanks, we agree ;)
@MarkoRadulovic
@MarkoRadulovic 4 жыл бұрын
ABSOLUTELY BRILLIANT!!! This concept of random and sequential measurement selection is so simple, but this is the only spot on the internet which manages to explain it well
@djgulston
@djgulston 6 жыл бұрын
What a coincidence! We just started with bootstrapping in class. I'm currently doing second year stats. I am in my second semester right now. I didn't quite get what my lecturer was trying to say in class, but you explained it so well here. Thank you so much for this video!
@marinstatlectures
@marinstatlectures 6 жыл бұрын
Great to hear! I’m teaching bootstrapping in my class this week :)
@ltdata5282
@ltdata5282 2 жыл бұрын
Thank you so much for this video!! KZbin university is a life saver
@dharmawangsa9592
@dharmawangsa9592 4 жыл бұрын
Best explanation about bootstrapping in yt
@marinstatlectures
@marinstatlectures 4 жыл бұрын
I agree ;)
@lhodeniz
@lhodeniz 5 ай бұрын
Your explanation is so clear! Thank you.
@TheZchristina97
@TheZchristina97 5 жыл бұрын
Incredibly clear and tangible. Rare to find in stats videos. Thank you!
@-lll7585
@-lll7585 4 жыл бұрын
OMG your videos have literally saved my life!!!! Thanks!!!!
@marinstatlectures
@marinstatlectures 4 жыл бұрын
You’re welcome, happy to help :)
@echoecho5244
@echoecho5244 Жыл бұрын
brilliant, much better than my uni days
@rainsein
@rainsein 4 жыл бұрын
Hello, professor! I am learning a lot and contents are just.... incredibly clear and informative!! Thank you so so much for this contents!
@kamrangurbanov4364
@kamrangurbanov4364 4 жыл бұрын
It was very helpful. Thank you very much
@alexandermrkich8734
@alexandermrkich8734 4 жыл бұрын
Very well done. Really appreciate the example at the end.
@aviahuja5024
@aviahuja5024 5 жыл бұрын
Amazing and very lucid! Thanks Marin, you make life easy for grad students struggling with dense notation from their professors.
@marinstatlectures
@marinstatlectures 5 жыл бұрын
thanks, i teach grad students as well, and im trying to do the same for them, so glad to hear it's working ;)
@josephphillips7231
@josephphillips7231 3 жыл бұрын
Brilliantly clear description. Thank you!
@stevehof
@stevehof 4 жыл бұрын
Just stumbled across your channel. Fantastic work! Please keep them coming
@theopronk6095
@theopronk6095 4 жыл бұрын
A great thanks from the Netherlands
@nezuki7995
@nezuki7995 4 жыл бұрын
Wow so much great reviews, I’m going to show this to my Computer Science teacher for a project that I have to do :(
@daesoolee1083
@daesoolee1083 4 жыл бұрын
Great video!
@stellahkilawe8208
@stellahkilawe8208 Жыл бұрын
Hello there, in this session it was informative and helpful. Thanks Professor for the incredible content
@gzitterspiller
@gzitterspiller 4 жыл бұрын
You guys have to understand bootstrap is a simple idea but there are not any formal proof that is works... so it is difficult for a professor to explain it, it is always a handwaving explanation on why it works. But you put the concept very clear I liked it.
@danielmonroy6874
@danielmonroy6874 5 жыл бұрын
You are such a great teacher! Thank you!
@marinstatlectures
@marinstatlectures 5 жыл бұрын
you're welcome :)
@dandiaran
@dandiaran 3 жыл бұрын
Amazing and absolutely clear video. Thank you!
@wgwandawg
@wgwandawg 4 жыл бұрын
Very well explained!
@frankie59er
@frankie59er 3 жыл бұрын
Great video, really helped!
@flamboyantperson5936
@flamboyantperson5936 6 жыл бұрын
Great lecture
@RajeshSharma-bd5zo
@RajeshSharma-bd5zo 4 жыл бұрын
Beautifully explained!! One point w.r.t Bootstrapping, via resampling we create child samples out of the first sample. But doesn't it introduce a dependency between the first and the subsequent samples as we will always get the same data values in child samples? Let's say if I have Blood pressure data of 500 patients and out of these records there are only 200 unique BP values then the child samples after resampling will always have values from these 200 unique values. So, can't we say that just like the parametric approach we should have an adequate amount of observations in the parent sample for bootstrapping?
@carlosbarros6705
@carlosbarros6705 4 жыл бұрын
Great what you're doing, Marin. Thank you so much for everything.
@branalfeirantrigo9350
@branalfeirantrigo9350 3 жыл бұрын
Very helpful indeed, and writing reverse!!!
@leetingfung
@leetingfung 3 жыл бұрын
Very nice one
@lorrainewaters6189
@lorrainewaters6189 2 жыл бұрын
Thank you! Now I understand.
@IkaTra95
@IkaTra95 4 жыл бұрын
Very nice video, helped me alot in understanding the principle of Bootstrapping!
@krishln7830
@krishln7830 4 жыл бұрын
Nice informative video. I have a couple questions though: 1. If we randomly sample 10,000 times out of a small sample space (of 5 in our case) isn't that going to tend towards a normal distribution since it's a large collection of random sampled values? 2. Isn't the point of Bootstrapping to estimate the population standard deviation when we don't have enough samples, and won't a t-test be better in that case? I know that a T-test works only for normally distributed data and bootstrapping I believe is especially effective when the distribution of the population is not normal, in which case we assume the distribution to be the same as the small number of samples. But doesn't this get skewed when we do random sampling 10,000 times and get a normal distribution through that? Thanks,
@ivanbukac4618
@ivanbukac4618 3 жыл бұрын
Did you find out?
@baobaocai1969
@baobaocai1969 5 жыл бұрын
7:42: Resampling for B times may result in a "B+1" at the foot of X-bar-*
@marinstatlectures
@marinstatlectures 5 жыл бұрын
here, we are taking repeated samples 1,2,3,4,...,B to have B total samples. although it really doesn't matter how many you take, and the concepts is the exact same if you take R=B+1
@heinerbuchholz3935
@heinerbuchholz3935 Жыл бұрын
Great mirror-writing skills
@iceerabanillo5120
@iceerabanillo5120 5 жыл бұрын
Thank you! This helps a lot ❤️
@marinstatlectures
@marinstatlectures 5 жыл бұрын
You’re welcome, great to hear!
@KnowledgeHub79
@KnowledgeHub79 5 жыл бұрын
quite helpful and baby's beautiful words just make my day.
@marinstatlectures
@marinstatlectures 5 жыл бұрын
good to hear! our boy wanted to be part of the video creation, and so he's taken on that role ;)
@KnowledgeHub79
@KnowledgeHub79 5 жыл бұрын
@@marinstatlectures great
@ftg4864
@ftg4864 4 жыл бұрын
I am wondering why the standard error for your example is just using sqrt(n) and not sqrt(n-1) considering the data size is small?
@SNPolka56
@SNPolka56 5 жыл бұрын
Great Video. Thank you very much.
@ruturajmane4663
@ruturajmane4663 4 жыл бұрын
In the parametric we were taking samples of same size from population and getting distribution, but in case of bootstraping we are taking data from one sample(not population) then how are u comparing these two things?
@yannanzhao5779
@yannanzhao5779 5 жыл бұрын
GOD IT IS SO HELPFUL! THANK YOU FOR MAKING THIS VIDEO!
@marinstatlectures
@marinstatlectures 5 жыл бұрын
you're welcome :)
@nupatowoch3063
@nupatowoch3063 2 жыл бұрын
Interesting one
@jeffreylin235
@jeffreylin235 6 жыл бұрын
This is an excellent presentation. I am wondering what is the point of calculating bootstrapping standard error. We used SE to calculate 95%CI in a parametric approach. When we do bootstrap, we can directly obtain 95%CI from bootstrap data. If we create 10000 bootstrap samples and sort them from minimum to maximum. The 251st and the 9750th are the lower and upper bound of 95%CI. Correct me, if I am wrong.
@statisticscuriosity
@statisticscuriosity 3 жыл бұрын
Thanks a lot Sir...The intro you provided is the best i have ever seen.. it helped a lot!! Please make some videos regarding Bayesian methods using R whenever it is possible!!!!
@ngonhatnam131
@ngonhatnam131 Жыл бұрын
Hi. I want to ask about SE of 1 specific percentile. I understand that SE is on average how far sample means are likely to be from the population mean. My question is what is that going to do with percentile? Why a percentile has its own SE?
@n.briglia3574
@n.briglia3574 6 жыл бұрын
Very useful! Thank you! Are you going to realize a video regarding Cross-validation and Bootstrap methods (in R) used for validating the regression models?
@marinstatlectures
@marinstatlectures 6 жыл бұрын
Probably at some point, but in the near term we’re focusing on building videos for intro stats, and next for regression modeling (of all sorts)
@karenhalpern
@karenhalpern 4 жыл бұрын
Thank you!! I has been really helpfull!!
@waisyousofi9139
@waisyousofi9139 3 жыл бұрын
I got a question for you : Is there any difference between bootstrapping and the central limit theorem? if yes, I just wanna know, exactly when to use bootstrapping in inferential statistics? Thanks for all the effort u r doing.
@abonady6747
@abonady6747 3 жыл бұрын
Same question here, could you please share your feedback if you get any answer?
@rfatorhanckmazel7979
@rfatorhanckmazel7979 4 жыл бұрын
Thanks for the clear explanation. However, I wonder that which kind of bootstrapping this is
@gasimhoda
@gasimhoda 6 жыл бұрын
Thanks a lot Marin, Can you do some videos in Factor analysis
@MrDp297
@MrDp297 6 жыл бұрын
How can u write in reverse?? Thats so cool!!
@marinstatlectures
@marinstatlectures 6 жыл бұрын
Lots of practice ;) but it’s actually using something called a “light board”, where the image is recorded and then reversed like a mirror...so I’m not actually writing backwards. I get access to it at UBC Studios :)
@MrDp297
@MrDp297 6 жыл бұрын
U mentioned in the video that u have some more examples of bootstrapping.....is there perhaps a link?
@Katurha
@Katurha 5 жыл бұрын
@@marinstatlectures Wait, so how do you seem to be writing on the right or the left, and it displays on the same size of the board. That knots my brain way harder than bootstrapping
@brendanredler3666
@brendanredler3666 3 жыл бұрын
@@Katurha I was pretty distracted by this apparently amazing ability at first, too! If you somehow haven't figured it out by now...you can test it out with a smartphone's front-facing camera. Get a thin/cheap piece of paper and a thick black marker so your text will show through. Write "Test" on the piece of paper. Hold up the paper so you can read it, and turn on the front-facing camera on the phone held out in front of you. You'll see that you've able to read the text even though the camera is looking at it from the back side! Same principle, barely different application.
@WonderfulLife73
@WonderfulLife73 4 жыл бұрын
Thank you..!
@marinstatlectures
@marinstatlectures 4 жыл бұрын
You’re welcome
@ironstark_007
@ironstark_007 3 жыл бұрын
Sir small sample means how small for bootstrapping?
@KristoferPettersson
@KristoferPettersson 6 жыл бұрын
I don't understand if the sample size is the number of samples from the population or the number of elements in each sample. This gets particularly confusion later when he talks about resampling from a sample of 5 elements. What am I missing?
@emresdance
@emresdance 3 жыл бұрын
Having to work with a low number of samples seems to be a statistician's nightmare...
@nazmurrahmannobel11
@nazmurrahmannobel11 Жыл бұрын
All resamples data size should be equal but should it need to equal to the sample data size I mean if sample has 5 data Can we take 3 data randomly from the sample for each resampling?
@GD-uy9td
@GD-uy9td 4 жыл бұрын
I am new to statistics and I have a doubt regarding the calculation of Standard error of bootstrapping example. How did the Standard error of those 3 resamples come out to to be 5.57. Here's what I did, Could you tell me where I am wrong: I calculated the standard deviation of the 3 examples (84,73,86) and it was 7. The Standard Error is hence, 7/√3 which is 4.04.
@bozhou1454
@bozhou1454 Жыл бұрын
the 3 sample mean should be (84, 73, 80), and the SD/SE of them is 31^0.5 = 5.57
@mmmmmm6510
@mmmmmm6510 4 жыл бұрын
Thank you very much for this video. It was easy to understand. I do have one quick question, how did you get the sample error 5.57? Thank you in advance if you can help me answer my question!!!
@marinstatlectures
@marinstatlectures 4 жыл бұрын
That was by calculating the SD of the bootstrap means. In reality we would do this for many more bootstrap resamples than I did in this video
@abonady6747
@abonady6747 3 жыл бұрын
@@marinstatlectures thank you, please it is important to explain how did you get 5.57? so i can correct mine and get the full knowledge
@abonady6747
@abonady6747 3 жыл бұрын
i am asking Sir, because my calculation gives 5.244044 :) not 5.57
@yuvenmuniandy8202
@yuvenmuniandy8202 6 жыл бұрын
You made this easier to understand. Marin could you do a video on power analysis in R studio
@marinstatlectures
@marinstatlectures 6 жыл бұрын
hi, we are about to release a video explaining the concept of Power, in the context of tests for a mean. we hope to record a complimentary video showing some of that stuff in R...but have many things recorded and in need of editing before we can get to that
@pratikbhangale3538
@pratikbhangale3538 4 жыл бұрын
Hello Sir, in large sample theory if we increase number of observation it will eventually leads to normal distribution. While in bootstrap I don't think so. Consider marks for exams. If we use large sample, we will eventually end in normal distribution. While during bootstrap I use only 5 random elements. Eg. 10,11,20,35,23 and I do resampling does bootstrap will give closest answer to large sample
@orsonhey
@orsonhey 5 жыл бұрын
Thanks for your video!!! May I ask that is the boostrap in SPSS able to do internal validation for predictive model?
@marinstatlectures
@marinstatlectures 5 жыл бұрын
I’m not sure about using SPSS...I know the basics, but I’m an R user...
@mxfglsthlr
@mxfglsthlr Жыл бұрын
only question that I now have: where did you learn to write in mirrored letters... :D
@claudiamesaaparicio8517
@claudiamesaaparicio8517 4 жыл бұрын
bravo!
@farhanputra2857
@farhanputra2857 3 жыл бұрын
Can we make a statistical model from bootstrap sample distribution?
@marinstatlectures
@marinstatlectures 3 жыл бұрын
You can use bootstrapping with statistical modeling. This video introduces the concept as it applies to a sampling distribution, but you can use a bootstrap approach as an alternative approach to most methods
@zainabkhan2475
@zainabkhan2475 5 жыл бұрын
thank you sir for this wonderfully explained video, please make another video on how to do sampling for generating the sample means of sampling distribution. I don't understand how do we do that practically. Thanks in Advance...
@lemyul
@lemyul 5 жыл бұрын
thanks mari
@marinstatlectures
@marinstatlectures 5 жыл бұрын
you're welcome :)
@michaelhaskins6627
@michaelhaskins6627 5 жыл бұрын
How did you calculate the bootstrap standard error using the 3 resamples? Is the formula (1/ n^0.5)( (Σ (resample mean - sample mean) ^2 ) / n-1) ^.05
@marinstatlectures
@marinstatlectures 5 жыл бұрын
here is would just be the SD of all of the bootstrap-sample means. basically what you have written, but without the (1/ n^0.5).... it would just be the: (Σ (resample mean - mean.of.all.resample.mean) ^2 ) / n-1) ^0.5
@alfcnz
@alfcnz 5 жыл бұрын
Why does the screen keep flipping? It's making me dizzy!
@alfcnz
@alfcnz 5 жыл бұрын
@@The_Real_Goodboy_Link I'm talking about the annoying transition effect…
@The_Real_Goodboy_Link
@The_Real_Goodboy_Link 5 жыл бұрын
@@alfcnz AHHHHHH, thought you meant the reverse writing screen. Watching again I see what you mean. That's some old school screen transitioning right there!
@The_Real_Goodboy_Link
@The_Real_Goodboy_Link 5 жыл бұрын
AHHHHHH, thought you meant the reverse writing screen. Watching again I see what you mean. That's some old school screen transitioning right there!
@hemantdhoundiyal1327
@hemantdhoundiyal1327 5 жыл бұрын
Maybe some editing was done by him to skip some irrelevant part of the video.
@saumyamishra9004
@saumyamishra9004 4 жыл бұрын
Marin can you plzzz explain how had u calculated the SE value as I'm getting "11.51/root 3=6.65"??? plzz explain m i putting wrong values?
@anandruparelia8970
@anandruparelia8970 3 жыл бұрын
Calculate Resample Mean (84,73,80) => 79 Now the SD of the resamples This would lead you to 5.57 SE
@erniyunita1285
@erniyunita1285 6 жыл бұрын
Thank you for explaining this !
@marinstatlectures
@marinstatlectures 6 жыл бұрын
you're welcome :)
@donolegario
@donolegario 5 жыл бұрын
"This bootstrapping appoach" aahaha Awesome explanation! Thanks!
@marinstatlectures
@marinstatlectures 5 жыл бұрын
you're welcome
@anastasia_wang17
@anastasia_wang17 4 жыл бұрын
what technology is this, it automatically mirror the whiteboard??? amazing!
@stefanhoi8016
@stefanhoi8016 4 жыл бұрын
the whole video is just mirrored ;)
@bruninshiotani
@bruninshiotani 5 жыл бұрын
Hello, thanks for the video!!! helped a lot, but can you give me some hint for doing bootstrapping on the R software? (please, don't mind my english , i'm from another country) =)
@marinstatlectures
@marinstatlectures 5 жыл бұрын
Hi, sure, we have a few different videos for that. if you check out the following playlist (kzbin.info/aero/PLqzoL9-eJTNAz0IuV1nAV7KMkGBf4QcQX) you'll see in the middle 4 videos on bootstrap hypothesis tests and confidence intervals, both explained in concept, as well as implemented in R.
@bruninshiotani
@bruninshiotani 5 жыл бұрын
@@marinstatlectures thanks so much!!!!
@kunalbali810
@kunalbali810 6 жыл бұрын
Can you provide this stat example in R with some file samples ?
@marinstatlectures
@marinstatlectures 6 жыл бұрын
we have another video in editing showing how to use this to construct a confidence interval...and we will also record an R compliment to that, showing that example with R. we also have recorded videos explaining the use of Bootstrap (and Permutation/Re-Shuffling Tests) in the context of comparing 2 groups...we wont get to editing that one for about a month or more, and plan to also record the R compliment for that, showing how to implement the concepts in R. it will take a bit of time for those to get up, as we haver many others ahead in the editing cue...but we do plan to haver those up in time...
@vuminhquanle1426
@vuminhquanle1426 4 жыл бұрын
Video should be called, how man wrote backwards in 17m
@meribel7071
@meribel7071 6 жыл бұрын
I need to do bootstrap on Gretl please. for estimation NARDL model
@The_Real_Goodboy_Link
@The_Real_Goodboy_Link 5 жыл бұрын
no
@luckyhubbie
@luckyhubbie 9 ай бұрын
If your small sample is limited to observed data points within a short term trend there is no way account for this. If you are trying to predict sea level rise but you just bootstrap the data taken as the tide rolls in one evening you will never account for the cycles of high and low tide. Seems disingenuous to claim outliers effect large sample statistics just the same as bootstrapped samples. But I get it. I did bootstrapping all the time in the 80s and 90s when I did my science fair projects last min.
@marinstatlectures
@marinstatlectures 8 ай бұрын
What you are describing is a poor sampling design. If you collect data in the way you describe, any analysis will lead to incorrect conclusions. I must say it’s very impressive that as a high school kid you knew bootstrapping! The first paper on the topic was published in 1979, and it didn’t become commonly used until high powered computers. Very impressive!
@luckyhubbie
@luckyhubbie 8 ай бұрын
@@marinstatlectures I just mean I reused the inadequate sample I had to pretend I had collected a sufficient sampling of the population.
@songlinchua1212
@songlinchua1212 4 жыл бұрын
Wow? I thought the word i see there above the scatter plot was "PORN" 0.o
@rohitpant6473
@rohitpant6473 3 жыл бұрын
didnt help me
@mrvvrm5951
@mrvvrm5951 4 жыл бұрын
We life in a some kind of new world this is so boring and old theori
@marinstatlectures
@marinstatlectures 4 жыл бұрын
Incorrect. Classical approaches to statistical inference are old theory, this is a much more modern approach, made possible by high computing power
@larissacury7714
@larissacury7714 2 жыл бұрын
Hi, thank you! I'm completly lost at how many times I should bootstrap my sample...I'm making a regression model, but my errors are not normally distributed, so I'm considering bootstrapping. Question: how many times should I bootstrap the original data set? I have 21 participants, each has 2 observations of 2 tests in 2 different years (totalling 4 per participant) @MarinStatsLectures-R Programming & Statistics
@KareemHusseini
@KareemHusseini 4 жыл бұрын
This is great. Thank you.
Hypothesis Testing: Calculations and Interpretations| Statistics Tutorial #13 | MarinStatsLectures
16:22
MarinStatsLectures-R Programming & Statistics
Рет қаралды 26 М.
Bootstrap Hypothesis Testing in Statistics with Example |Statistics Tutorial #35 |MarinStatsLectures
16:56
MarinStatsLectures-R Programming & Statistics
Рет қаралды 43 М.
Air Sigma Girl #sigma
0:32
Jin and Hattie
Рет қаралды 45 МЛН
Counter-Strike 2 - Новый кс. Cтарый я
13:10
Marmok
Рет қаралды 2,8 МЛН
Statistical Inception: The Bootstrap (#SoME3)
13:50
Very Normal
Рет қаралды 31 М.
Bootstrapping Main Ideas!!!
9:27
StatQuest with Josh Starmer
Рет қаралды 485 М.
Monte Carlo and Bootstrap Methods Introduction
27:07
Fourth Z
Рет қаралды 5 М.
Time Series Forecasting Example in RStudio
37:53
Adam Check
Рет қаралды 144 М.
26: Resampling methods (bootstrapping)
9:40
Matthew E. Clapham
Рет қаралды 148 М.
Permutation Hypothesis Testing with Example | Statistics Tutorial # 37 | MarinStatsLectures
17:19
MarinStatsLectures-R Programming & Statistics
Рет қаралды 45 М.
Sampling methods (for the CFA Level 1 exam)
57:31
Let me explain
Рет қаралды 3,7 М.
Power Calculations in Hypothesis Testing | Statistics Tutorial #17 | MarinStatsLectures
19:59
MarinStatsLectures-R Programming & Statistics
Рет қаралды 32 М.