An explanation of why parametric families are so commonly used in statistics Stay updated with the channel and some stuff I make! 👉 verynormal.substack.com 👉 very-normal.sellfy.store
Пікірлер: 33
@TriglycerideBeware7 ай бұрын
This is my favorite non-entry-level statistics channel. Well done!
@EduardoVodopives7 ай бұрын
Agree
@varbias7 ай бұрын
Great video! A lot of shade thrown at the Poisson distribution 😅 count data arise all the time in health research and beyond, and the Poisson is (for better or worse) the go-to standard model
@academyofuselessideas7 ай бұрын
The sign of a great presentation is to make explicit what for an expert is obvious! This is important because the expert usually forgets that what is obvious to them is far from obvious to the novice. Thanks for making the concept of parametric distribution to everyone!... Parametric distributions are taught in every basic statistics class, they are so obvious to professors that sometimes they forget to stop a bit and just explain why they are important and what we are doing when we choose a parametric distribution... In another note, one common mistake is to pick the wrong parametric distribution to model a population... This is one of the main complains about the use of the normal distribution (the classical example is the use of normal distribution in finances which produced misleading estimates of risk during the sub mortgage financial crisis)... But to complement your example on the binomial distribution, it could also happen that each of the trials are not independent
@frankmazza53597 ай бұрын
I rarely comment on videos, but your channel definitely warrants one. Great work!
@very-normal7 ай бұрын
ERRATA (aka my brain on editing) 7:28: Poisson PMF contains the letter "k", but these should all be "x". I let my disdain for the Poisson slip. EDIT: I have no real beef with the Poisson, don’t worry Poisson stans lol
@AnthonyBerlin7 ай бұрын
Oh I can't beilieve what I'm hearing! Young man, your disdain is *unfounded*. This distribution is *very* useful. Especially since it isnt limited to modelling events that occur in intervals of time, but it can also model events in intervals of space (any kind of space, not just physical space). I hereby officiall call for an end to the hate for the Poisson distribution, effective immediately!
@barutaji7 ай бұрын
I think it is P and not C because the main concept is the probability, and not the cumulative. It just so happen that working with cumulative distributions is way easier than with the pdf themselves, specially when mixing discrete and continuous distributions.
@kprabhakar9757 ай бұрын
Thank you for great presentation. I learnt a lot from your presentation sir.
@Michallote7 ай бұрын
Love your humor!
@plaza38252 ай бұрын
The chance of a continuous distribution yielding any specific value versus the infinite other possibilities is so unlikely as to be considered 0. Thus, we instead calculate the probability of getting something within a range of values. Meaning that the values that dnorm returns are Not the real probabilities but rather just a y-value from the density function. It's pnorm that calculates the probability of a range, but the first endpoint would need another input, so for convenience we assume the first endpoint is the distribution's minimum. That endpoint makes it identical to the cumulative function. If Rstudio lets you change the first endpoint, then pnorm wouldn't be the cumulative function
@pauls.6686Ай бұрын
Thank you very much!
@daltakid7 ай бұрын
First time seeing the DPQR acronym, very clearly summarized! Perhaps pnorm and qnorm are inverses of each as p and q look to be opposite of each other so easier to remember?
@psl_schaefer4 ай бұрын
In computational biology the Poisson distribution and the Gamma-Poisson (negative binomial) distribution are used quite often :)
@very-normal4 ай бұрын
Oh that’s cool! What kinds of stuff we usually approached with these models?
@psl_schaefer3 ай бұрын
@@very-normal In computational biology there is all kinds of count data, but I am most familiar with (single-cell) RNA-sequencing where you basically count mRNA molecules in a sample (or in single cells). So if you want to model those data in a statistically robust way you have to use the Poisson / Gamma-Poisson or the Zero-inflated versions (ZIP, ZINB). Otherwise we commonly log normalize data to "account" for the heteroskedasticity...
@xavierlarochelle27427 ай бұрын
This is amazing. I want to teach stats one day and I'm definitely gonna steal some ideas from this video. Hope you don't mind! With a proper shout out of course :)
@kezza77737 ай бұрын
Great video
@karangarg46317 ай бұрын
Great video! Just pointing out a small typo, @7:28 you've got k instead of x for the Poisson pmf where the function states it's f(x) not f(k)
@very-normal7 ай бұрын
Thank you for catching that! I've added a pinned errata post
@karangarg46317 ай бұрын
@@very-normal Great! btw in case you're looking for video ideas, I'd love to hear some thoughts on parametric vs non-parametric hypothesis tests (esp coming from someone in biostats since you guys tend to have such small sample sizes in experimental trials etc). I'm often surprised to see how often I see t-tests and the like when CLT seems absurd for that sample size and the distribution is almost certainly going to be not normal!
@very-normal7 ай бұрын
That’s a great idea! I think that slots nicely with other material I have planned, thank you!
@karangarg46317 ай бұрын
@@very-normal looking forward to it!
@SampleroftheMultiverseКүн бұрын
Thanks for your interesting video. 14:08 Can I have tomorrow off from work? I find myself doing bad things to my normal density distribution curve. I’m so obsessed with it but I made a video and I think it relates to quantum particles in a box. But hey what are the chances? Area under a curve is often equivalent to energy. Buckling of an otherwise flat field shows a very rapid growth of this area to a point. If my model applies, it may show how the universe’s energy naturally developed from the inherent behavior of fields. Your subscribers might want to see this 1:29 minutes video showing under the right conditions, the quantization of a field is easily produced. The ground state energy is induced via Euler’s contain column analysis. Containing the column must come in to play before over buckling, or the effect will not work. The sheet of elastic material “system”response in a quantized manor when force is applied in the perpendicular direction. Bonding at the points of highest probabilities and maximum duration( ie peeks and troughs) of the fields “sheet” produced a stable structure when the undulations are bonded to a flat sheet that is placed above and below the core material. Some say this model is no different than plucking guitar strings. You can not make structures with vibrating guitar strings or harmonic oscillators. kzbin.info/www/bejne/raOlpKSfepWpfZYsi=waT8lY2iX-wJdjO3 If this model has merit, seeing the sawtooth load verse deflection graph produced could give some real insight in what happened during the quantum jumps between energy levels. The mechanical description and white paper that goes with the video can be found on my LinkedIn and KZbin pages. You can reproduce my results using a sheet of Mylar* ( the clear plastic found in some school essay folders. Seeing it first hand is worth the effort!
@user-kg5ii5lq1v7 ай бұрын
very good
@yorailevi67477 ай бұрын
I am mainly waiting for the mode advanced stuff to be covered, like those other distributions mentioned
@very-normal7 ай бұрын
I’ll be real with you, it’s going to be a while for this format lol, but I’ll try to cover more advanced stuff in other videos
@yorailevi67477 ай бұрын
@@very-normalI liked the video about bootstrap although I don't understand it enough in practicality, I didn't know about it nor I knew about the other resources mentioned
@OhInMyHouse7 ай бұрын
So, is it correct to assume that the utilization of the parametric family facilitates the estimation process because we only need to estimate the parameters that shape the function instead of trying to estimate the probability distribution itself because in that case, we would need to estimate a lot of values ?
@very-normal7 ай бұрын
Yes! I think you’ve phrased it well
@braineaterzombie39817 ай бұрын
Very nice video. Can you please make videos of distributions at 9:14 (not gaussian)in future.
@jameyhall52557 ай бұрын
Good video, but R feels less relevant every year
@AnthonyBerlin7 ай бұрын
It is still relevant, good sir! For real though, I still find R to be more accurate than most of the commonly used Python libraries for many numerical approximations of common functions. There have been times in my work where the difference in errors have been as big as 10e6 between Python and R, which in many applications can be catastrophic.