As a PhD student who has used frequentist statistics for as long as I can remember, I’d only ever heard gossip and rumours about Bayesian statistics, but your video hooked me from start to finish on such a fascinating subject! Great video!!!
@very-normal9 ай бұрын
Thanks! Despite the title name, I think having both under your belt is better than just choosing a “side” in this nerdy debate lol
@fernandojackson72078 ай бұрын
I went out with a Bayesian probabilist. She had a great posterior, but too many priors.
@mathboy81888 ай бұрын
That must be a well-known joke in stats circles. If you came up with that, dude, that's brilliant.
@exentrikk6 ай бұрын
Ba-dum tschhh
@kadaj13131310 ай бұрын
Half my professors would fight you over this title, the other half would agree with you
@very-normal10 ай бұрын
😈
@jasondads950910 ай бұрын
I swear the title changed, what was it before?
@very-normal10 ай бұрын
nah it didn’t change
@ThatOneAmpharos8 ай бұрын
@@very-normal what was the probability it would have changed if the probability of change was the probability of it not changing?
@tracywilliams79298 ай бұрын
Lol!
@rtg_onefourtwoeightfiveseven10 ай бұрын
I'm an astrophysicist, and in our field Bayesian statistics is the way. Great and all, except everyone seemingly expected me to know what an MCMC analysis was (it wasn't mentioned anywhere in the refresher lectures at the start of my PhD) despite never having heard of it before I started. This video was a massive help.
@very-normal10 ай бұрын
If that isn’t the PhD student experience, I don’t know what is. Also, I’m using your comment to brag to my friends that I spoke to an astrophysicist
@rtg_onefourtwoeightfiveseven10 ай бұрын
@@very-normal Haha, glad I count as clout to someone. I'm just a humble 1st-year PhD student, but no need to tell them that. ;-)
@very-normal10 ай бұрын
Oof, first year is tough, it was definitely the most character building I’ve done in a short span of time. Hang in there, and best of luck!
@RomanNumural910 ай бұрын
Math finance PhD student here. Great video! Just so you know there's a book called "Deep Learning" by Ian Goodfellow et al. It covers Bayesian stats, including MCMCs and other things. It's a great resource and if you wanna know more about this stuff I found it a pretty reasonable read! :)
@Nino213709 ай бұрын
🔥
@JetJockey879 ай бұрын
Ray Kurzweil's book "How to Create a Mind" goes into this as well with his description of the Monte Carlo Markov Chain Models he used to build his software that has been an absolute staple in the Medical Industry for dictation and still continues to outperform Transformer models. Dragon Naturally Speaking. Which is really hilarious if you are the kind of nerd to know what that software is as well as knowing who Ray Kurzweil is and what he is more famous for - Singularity-esque Neohuman Futurism propaganda
@lupino6529 ай бұрын
Yep, a classic, well paced for someone with background
@raphaelscaff23999 ай бұрын
Cool
@luisjuarez72919 ай бұрын
Hey, out of curiosity, if you have a doctorate in math finance, do you find that a lot of job opportunities are still based on whether you have a CFA or CPA on top of your degree? (If you don’t pursue being a professor or researcher)
@cameronhill776910 ай бұрын
I used to be a frequentist, but then I updated my beliefs.
@tracywilliams79298 ай бұрын
Lol! Very good!
@hoppybrewologist5 ай бұрын
Just Mean
@qwerty1111112210 ай бұрын
As an introduction to Bayes' theorem, I think that 3b1b really helped me form an intuition about these statistics using his "Bayes ratio": multiplying your prior by a ratio formed by the likelihood and the marginal to form the posterior, which becomes a new prior
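For reference, the update described above can be written out as a single line. The labels H (hypothesis) and E (evidence) are chosen here for illustration and are not taken from the video:

```latex
P(H \mid E) \;=\; \underbrace{P(H)}_{\text{prior}} \cdot \underbrace{\frac{P(E \mid H)}{P(E)}}_{\text{likelihood / marginal}}
```

When more evidence arrives, the left-hand side is reused as the prior in the next update.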
@barttrudeau923710 ай бұрын
3b1b was where I first heard of Bayes Theorem. I've been hooked ever since.
@leassis9110 ай бұрын
this video from 3b1b is a life saver
@fetilu09753 ай бұрын
The video about screening diseases from a Bayesian perspective is especially enlightening. The prior distribution is of such importance even from a frequentist point of view.
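A rough sketch of why the prior matters so much in screening, in Python. The prevalence, sensitivity, and specificity values are made-up numbers for illustration, and the function name is hypothetical:

```python
def posterior_given_positive(prevalence, sensitivity, specificity):
    """P(disease | positive test) via Bayes' theorem."""
    true_pos = sensitivity * prevalence                    # sick and flagged
    false_pos = (1.0 - specificity) * (1.0 - prevalence)   # healthy but flagged
    return true_pos / (true_pos + false_pos)

# Same test, two different priors (prevalences). Hypothetical values.
rare = posterior_given_positive(prevalence=0.01, sensitivity=0.9, specificity=0.91)
common = posterior_given_positive(prevalence=0.30, sensitivity=0.9, specificity=0.91)
print(f"rare disease: {rare:.3f}, common disease: {common:.3f}")
```

With the rare-disease prior, a positive result still leaves the disease unlikely, while the same test under the higher prior makes it quite probable.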
@avenger182510 ай бұрын
I always get excited when I see one of your uploads; I've been studying heavily about statistics coming from a pure mathematics background, and your videos are always very helpful to build the conceptual foundations that textbooks often obscure in favor of specialized, theoretical language. This has already cleared up several things I didn't quite understand about Bayesian statistics, so thank you (for this and your other videos)! :^)
@very-normal10 ай бұрын
These kinds of comments are the ones that get me really fired up (in a good way). The videos are doing what I want them to do, thanks for letting me know and taking the time to comment!
@KokomiClan4 ай бұрын
The MCMC approach is one we used with Stan to generate models for the returns of financial time series (daily feeds). We had our GARCH models set up and every parameter of each model had its own posterior distribution. The power of this was that we could a.) make forecasts continually every day and update them but, more importantly, b.) detect changes across all the models when new data arrived. Further to this, being able to run scenario analysis was important and we used this during Covid to estimate recovery times for the assets (i.e. when to switch back into risky positions - out of cash into equities) and it worked really well. We didn't write the model to forecast returns but rather it proved useful when forecasting volatility. It helps solve the problem of fitting a model with fixed parameters and then updating it with a new set every 6 months. Instead, uncertainty was baked into the model via its parameters being distributions. The benefit of the Bayesian approach was that we could account for uncertainty better and it worked with our risk management approach: Deploy, monitor, collect, update, test, deploy...etc.
@tomalapapa10010 ай бұрын
I've studied math as a degree and specialized in statistics and finance. Had the same experience with numerous frequentist classes but few Bayesian ones. I've studied on my own and with a couple of classes that were available to me in grad. Struggled a lot to get the gist of Bayesian statistics. This video is perfect for people with knowledge of the frequentist view who then wish to learn Bayesian
@charlesbwilliams10 ай бұрын
It's so cool to see MCMCs get some love. The only use I’ve ever seen of it in my field (Psychology) is in Item Response Theory. Awesome video!
@Bayesian_Wrapper9 ай бұрын
It's also used quite a bit in economics industrial organization or in quantitative marketing!
@elinope47459 ай бұрын
YouTube recommended this video to me on my recommended feed. Only about one in twenty videos is any good. The odds were low that someone would make a video worthwhile, subbed, liked.
@joelbeeby8669 ай бұрын
My university UG finance course has taught me rudimentary statistics but not to the level that I want. Your videos are genuinely amazing for self-studying and really bring out the logic in statistics, which textbooks almost never do. Thank you! Please keep it up!
@very-normal9 ай бұрын
Thanks for watching! It’s always very encouraging to see they’re helping people out, thanks for taking the time out to tell me
@nzt299 ай бұрын
Best video I’ve seen on this so far. I like the comparison between the two methods and the fact that you map the data and parameter variables back to the typical A and B seen in the definition of Bayes’ theorem. edit: I should have phrased this instead as how you connected Bayes’ theorem to distributions.
@mmilrl576810 ай бұрын
I’m currently finishing up my first of many statistics courses, and we spent the first month of the course on Bayesian statistics, then started focusing more on the frequentist side of things. I had no idea these were even considered different things. Very cool video!
@user-jn7ic7un1e9 ай бұрын
Check out conformal prediction
@danielerdody1605 ай бұрын
I took a class on stochastic models which relied heavily on Bayesian methods. This video is helping me better understand my old notes. Thank you!!
@adw1z8 ай бұрын
I’m in final year of undergrad, my exams on all these topics in Bayesian inference and constructing credible intervals, Bayes decision rules, minimax and admissibility, and sampling methods such as MCMC/Metropolis Hastings are all on it next week 😭 Thank u for explaining this so simply in a way anyone could understand and enjoy, you really did earn a new subscriber
@very-normal8 ай бұрын
Good luck with your exam!
@gapsongg7 ай бұрын
Broooo you are insanely good at explaining stuff. Really nice video! Perfect speed. Perfect visuals. Everything made so much sense.
@cameronkhanpour300210 ай бұрын
Great video once again! You mentioned MCMC algorithms, to add some details, they work by constructing a regular/ergodic markov chain that has a unique stationary distribution, and we want that stationary distribution to be the target distribution so you can, say, sample it for inference. So the real question is now, how do you design a transition kernel that (if converges) leads an initial vector to the proper stationary distribution you wish to sample after some burn in time. I know this from a probabilistic graphical models perspective where this is used extensively in the form of Gibbs sampling, a special case of Metropolis-Hastings algorithm, and Rao-Blackwellized particles, which sample certain (more complex/loopy) parts of a network then does exact (analytical) inferencing on the rest. Variational inference is a form of inferencing as an optimization problem, such as in mean field approximation where you choose a simpler distribution Q and compute new parameters that gets it close to your actual distribution P. The main way I have learned is to minimize the KL divergence (relative entropy) between Q and P (in that order since KL is not symmetric). If anyone would like to read more Bayesian Reasoning and Machine Learning by David Barber is really good (IIRC variational inference in particular is in chapter 28).
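To make the "design a transition kernel whose stationary distribution is the target" idea concrete, here is a minimal random-walk Metropolis sketch in plain Python. The coin-flip data (7 heads in 10 flips), the flat prior, the step size, and all function names are assumptions picked for illustration, not anything from the video:

```python
import math
import random

def log_target(p, heads=7, flips=10):
    """Unnormalized log posterior of a coin's heads probability p
    under a flat prior: log(p^heads * (1 - p)^(flips - heads))."""
    if p <= 0.0 or p >= 1.0:
        return -math.inf  # zero density outside (0, 1)
    return heads * math.log(p) + (flips - heads) * math.log(1.0 - p)

def metropolis(n_samples, step=0.1, burn_in=1000, seed=0):
    """Random-walk Metropolis: symmetric Gaussian proposals, so the
    acceptance ratio reduces to target(proposal) / target(current)."""
    rng = random.Random(seed)
    current = 0.5
    samples = []
    for i in range(n_samples + burn_in):
        proposal = current + rng.gauss(0.0, step)
        log_accept = log_target(proposal) - log_target(current)
        if rng.random() < math.exp(min(0.0, log_accept)):
            current = proposal  # accept; otherwise stay where we are
        if i >= burn_in:        # discard the burn-in portion of the chain
            samples.append(current)
    return samples

samples = metropolis(20000)
posterior_mean = sum(samples) / len(samples)
print(f"estimated posterior mean: {posterior_mean:.3f}")
```

In this toy case the exact posterior is Beta(8, 4) with mean 2/3, so the sampler's estimate can be sanity-checked against a known answer. Gibbs sampling and the other schemes mentioned above differ in how the proposal is built, not in the overall accept-and-move structure.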
@RohanKuntoji6 ай бұрын
Incredibly well explained with realistic and easily relatable examples! Highly recommend watching this video for a quick & easy grasp over basics, or even to refresh fundamentals.
@justdave919510 ай бұрын
Could you please make a video on Generalized Linear Models too? These explanations are soooo helpful.
@xanmos9 ай бұрын
Very comprehensible video about Bayesian Statistics. I have seen most of your videos and I will recommend them to my students. I am teaching undergraduate basic Statistics and I must say, your videos are very well-made. I will put them on my course sites so my students could learn from you as well. ❤
@very-normal9 ай бұрын
Thank you! I’m honored!
@jpnye5 ай бұрын
Thanks!
@very-normal5 ай бұрын
Wow thank you, I appreciate it!
@derWeltraumaffe10 ай бұрын
I'm still learning frequentist statistics right now in university (first year psychology) so this video still goes way over my head, but it was a really nice overview to get a general idea of the concept of bayesian statistics. All we learned about it is "btw there's a thing called bayesian statistics. ok... let's move on." not kidding.
@very-normal10 ай бұрын
This was my experience word for word in undergrad too
@laitinlok15 ай бұрын
10:43 this is essentially the total probability theorem in a continuous probability density function, for discrete probability functions, it should use summation.
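Written out, the two cases look like this (the notation D for data and θ for the parameter is assumed here to match the usual convention):

```latex
% Continuous parameter: integrate the likelihood against the prior density
p(D) = \int p(D \mid \theta)\, p(\theta)\, d\theta

% Discrete parameter: sum over the possible parameter values
p(D) = \sum_{i} p(D \mid \theta_i)\, p(\theta_i)
```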
@TheOnlyNightmare10 ай бұрын
Loved this! Definitely a subscriber now 🎉 I got confronted with Bayes in a Seminar where we used various Machine learning and deep learning models with the expectation of already knowing all this prior to starting. It led to me having no confidence in the model results even though they outperformed some other approaches.
@postblitz5 ай бұрын
This video is fairly difficult because of the maintenance of jargon. I got the sense that the Conjugate or the property of Conjugacy is when the prior distribution has the same shape as the posterior distribution i.e. the new observation doesn't change the prior distribution which means the chosen prior is stable despite the new evidence i.e. you've chosen a particular belief/set of parameters and they fit the data processed so far. I may be wrong on this but that's the gist. Arbitrarily choosing a prior distribution is still a fanciful and mysterious thing given the information of this video.
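For what it's worth, conjugacy is usually taken to mean that the posterior lands in the same family as the prior, while its parameters still move with the data. A small sketch for the Beta-Binomial case follows; the Beta(2, 2) prior and the 7 successes / 3 failures are hypothetical numbers, and the helper names are made up:

```python
def beta_binomial_update(alpha, beta, successes, failures):
    """Conjugate update: a Beta(alpha, beta) prior with binomial data
    gives a Beta(alpha + successes, beta + failures) posterior."""
    return alpha + successes, beta + failures

def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return alpha / (alpha + beta)

prior = (2.0, 2.0)  # hypothetical prior belief centred on 0.5
posterior = beta_binomial_update(*prior, successes=7, failures=3)

print("prior    :", prior, "mean", beta_mean(*prior))
print("posterior:", posterior, "mean", round(beta_mean(*posterior), 3))
```

Both are Beta distributions, but the mean shifts from 0.5 toward the observed success rate, so the data does change the belief; only the functional form is preserved.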
@fetilu09753 ай бұрын
Look up 3blue1brown's two videos on Bayes. One is about the general idea behind Bayes and the other is about the importance of the priors and of the Bayes factor (which hasn't been explained here but is at the core of Bayesian inference).
@hyunsunggo85510 ай бұрын
The cool thing about variational inference is that it converts the problem of computing the intractable integral into a more manageable optimization problem of, with respect to the parameters, optimizing some quantity, the variational free energy! This not only makes the problem often easier (through the more flexible variational graphical model) and more tractable (than e.g. MCMC, etc..), but also enables borrowing insights from mathematical optimization theory, to solve the particular formulation of the problem. By the way, this connection to mathematical optimization is why it is called "variational" inference in the first place, directly connected to calculus of variations! Also, VI has amazing applications in deep learning, namely, variational autoencoders (VAEs), in which it's applied to the latent space for the induced probability distribution, for explaining the data distribution, to become much, much more complex, compared to the classical examples you've shown in this video. For example, diffusion models, that can create those amazing images, can indeed be seen as an instance of VAE! Thank you for this great video! I learned a lot! :)
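The optimization view above rests on a standard identity, sketched here with assumed notation (q is the variational distribution over parameters θ, D is the data):

```latex
\log p(D)
  = \underbrace{\mathbb{E}_{q(\theta)}\!\left[\log p(D, \theta) - \log q(\theta)\right]}_{\text{ELBO (negative variational free energy)}}
  + \underbrace{\mathrm{KL}\!\left(q(\theta)\,\|\,p(\theta \mid D)\right)}_{\ge 0}
```

Since the left-hand side does not depend on q, pushing the first term up is the same as pushing the KL term down, which is why the intractable integral never has to be computed directly.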
@waylonbarrett34569 ай бұрын
I'm developing an AI model based on variational inference
@hyunsunggo8559 ай бұрын
@@resnon Well, LDMs such as SD make use of VAEs to reduce resource requirements but that's not what I was talking about. You see, mathematically, you can think of the noising & denoising steps themselves as an instance of a VAE. Which includes non-LDMs that operate on the original data domain such as the pixels themselves.
@hyunsunggo8559 ай бұрын
@user-ju2pu8cf2l LDMs such as SD indeed utilize VAEs for reducing resource requirements, but I wasn't talking about that specific use case. The noising and de-noising steps as a whole are also an instance of a VAE under a certain interpretation, which applies not only to LDMs but also to pure non-LDMs that operate on the original data domain, such as the pixels themselves. (Idk why but my previous reply was deleted. 🤔)
@Y45HV1N9 ай бұрын
Really cool video and it's easy to follow. I just think it would be better without the frequentist bashing. I think the parts about what frequentists hope they could do or fool themselves into thinking are more a reflection of misinformed/poorly trained frequentists. Ultimately frequentism has itself two main approaches, the Neyman-Pearson NHST approach and the Fisher compatibility approach. The search for p
@very-normal9 ай бұрын
Bayesians aren’t allowed to take two steps without making fun of frequentist methods
@wulfrion10 ай бұрын
This video was great. As someone who is studying data science on my own online, I'm 32 right now, I had a hard time understanding the Bayesian Theorem for a bit there. I wish I saw this video way earlier, it would have saved me weeks of wrapping my brain around it.
@very-normal10 ай бұрын
Thank you! Best of luck on your data science journey, I hope I can help you out more with the statistical portion of it
@lbognini3 ай бұрын
Me again! Yet other trivialities😂🤣 At 04:00, using S and W instead of A and B in the formulas would have helped many people out there. This may seem banal but it's important to move from the academic notations, that most people struggle with, to getting closer to the real world. It also helps in thinking about the different events, their likelihood and conditional probabilities, since we're switching them around. P(S|W) is more intuitive than P(A|B)
@thegimel10 ай бұрын
Great video as the rest of your content. You have a pleasantly simple, intuitive and concise way of presenting the D :) I would very much like for you to dive deeper into the Likelihood in particular, and why it isn't a real PDF even though it can look like one. Cheers!
@DedicatedStudentIITMАй бұрын
I am in love with your videos! What a treasure of information
@KirinDave10 ай бұрын
What's funny about this presentation is that it makes it look like the MCMC approach is the one only the deep practitioners use. In reality, non-statisticians who need to use this stuff prefer the MCMC approach because it's *very flexible* and just requires that we provide a model and priors, and then an optimizer and the data fight it out in the computation. So in a very real sense, the MCMC approach is easier for non-statisticians and preferred.
@very-normal10 ай бұрын
Ah yeah, in hindsight my “levels of Bayes” framing does give off this feeling. Not intended, but something for me to think about for future videos. Thanks for your insight!
@glauco6454Ай бұрын
Dude you're like a God I was understanding nothing about this and u helped me a lot thanks
@mathieudespriee66465 ай бұрын
Nice video, it helped me, thanks. Your explanation of "conjugate prior" at 12:40 just made things click. I read these words so many times without understanding....
@entivreality10 ай бұрын
Really great explanation! Love the progression from elementary to more advanced topics. A video on empirical Bayes methods could also be cool :)
@andrewhancock24515 ай бұрын
At 6:40, a Binomial distribution is described as a "probability that an event will happen". I think that's the Bernoulli trial. If one draws from a random variable with a Binomial distribution, one gets an integer count rather than a head/tail or zero/one. This ambiguity has consequences for my understanding later on. I'm a bit confused by the slide at 10:20. The data D is described as the series of n Bernoulli trials in the 1st equation, but the last equation shows that D is a count of the heads/successes in the n Bernoulli trials. I suspect that D must be the count itself if the distribution is going to be parameterized by p. If so, however, then D is just a randomly drawn count (i.e., a scalar number) and I suspect that there is no "joint probability" associated with D. In the general case, perhaps for a more complicated example, D could be a collection of scalars (i.e., measurements, "observations", or observed categorical/quantitative data) and there is a joint probability to think about. As one possible example, I suppose that D *could* be chosen to be a sequence of n Bernoulli trials, but unlike the above case where D is a sampling of a random variable with a Binomial distribution, there are different arrangements of a given number of heads and tails, i.e., one sample value of a Binomially distributed random variable corresponds to many possible sequences of Bernoulli trials. Each such arrangement would be considered a different D. I was intrigued by the reference to non-analytical priors and likelihood functions. Douglas Hubbard has a book "How to Measure Anything" where he has spreadsheets of formulas and data representing distributions. It's tough to grind through, but I found that it was possible to catch a glimpse of the Bayes formula in them. Not enough to make me confident, which is why I've been prowling the internet for years to solidify my intuitive understanding.
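One way to probe the sequence-versus-count question numerically: the two likelihoods differ only by the binomial coefficient, which does not depend on p, so it cancels once the posterior is normalized. The sketch below assumes a flat prior evaluated on a grid, and the specific sequence of flips is invented for illustration:

```python
from math import comb

def sequence_likelihood(p, flips):
    """Probability of one specific ordered sequence of 0/1 outcomes."""
    k = sum(flips)
    return p**k * (1 - p) ** (len(flips) - k)

def count_likelihood(p, k, n):
    """Binomial probability of observing k successes in n trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def grid_posterior(likelihood, grid):
    """Posterior over grid points under a flat prior (normalized weights)."""
    weights = [likelihood(p) for p in grid]
    total = sum(weights)
    return [w / total for w in weights]

flips = [1, 0, 1, 1, 0, 1, 1, 1, 0, 1]  # hypothetical data: 7 heads in 10 flips
k, n = sum(flips), len(flips)
grid = [i / 100 for i in range(1, 100)]

post_seq = grid_posterior(lambda p: sequence_likelihood(p, flips), grid)
post_cnt = grid_posterior(lambda p: count_likelihood(p, k, n), grid)
max_gap = max(abs(a - b) for a, b in zip(post_seq, post_cnt))
print("largest difference between the two posteriors:", max_gap)
```

So treating D as the ordered sequence or as the count gives different likelihood values but, up to floating-point error, the same posterior for p.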
@hugomsouto6 ай бұрын
This video is a work of art. Thank you very much!
@CakeIsALie995 ай бұрын
Being Bayesian gives you the ability to conjure up a prior that perfectly matches the result you are trying to demonstrate, truly miraculous
@very-normal5 ай бұрын
if only it worked like that lol
@figmundsreud936310 ай бұрын
Very nice introduction to Bayesian Statistics. What I like about Bayesian Statistics is that one can in many contexts interpret Frequentist methods just as a special case of Bayesian methods with uninformative priors. Also from my experience many people are just Bayesian for pragmatic reasons because for many problems Bayesian methods just work better (frequentists also just discover the importance of shrinkage estimation and the most generalized way to apply shrinkage methods is through Bayesian priors). So I think the philosophical debate between Frequentist and Bayesian methods is somewhat overhyped. The biggest downside of Bayesian methods is their computational cost. Currently working with a model that doesn't even have a computable expression of the Likelihood. MCMC somehow still works (magic) but estimation even for a relatively small model takes several hours
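A tiny illustration of the "frequentist as a special case" and shrinkage points, assuming a coin-flip setting with a Beta prior (the data and the Beta(5, 5) prior are hypothetical, and the function name is made up):

```python
def beta_posterior_mode(alpha, beta, k, n):
    """Mode (MAP estimate) of the Beta(alpha + k, beta + n - k) posterior.
    Valid when both updated parameters are greater than 1."""
    a, b = alpha + k, beta + (n - k)
    return (a - 1) / (a + b - 2)

k, n = 7, 10                                   # hypothetical: 7 successes in 10 trials
mle = k / n                                    # frequentist maximum likelihood estimate
flat_map = beta_posterior_mode(1, 1, k, n)     # flat Beta(1, 1) prior
shrunk_map = beta_posterior_mode(5, 5, k, n)   # informative prior centred on 0.5

print("MLE:", mle, "| flat-prior MAP:", flat_map, "| Beta(5,5) MAP:", round(shrunk_map, 3))
```

With the flat prior the MAP estimate coincides with the MLE, while the informative prior pulls the estimate toward 0.5, which is the shrinkage effect in its simplest form.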
@very-normal10 ай бұрын
I think the debate is overhyped too, it’s truly just nerd drama. I feel your MCMC pains, best of luck 🫡
@TerabyteTy3009 ай бұрын
For some reason I always love when someone says “hi mom” in a video. It’s just wholesome and nice to know they are getting their mom’s support.
@tobiaspeelen43958 ай бұрын
when i first saw the concept of bayesian statistics, i thought: "wow, thats dumb, having probabilities only rely on your own belief" but now i see that it is a way of deducing what is likely the real probability and im like"WOW, AMAZING"
@stevenjackson82269 ай бұрын
Cool. Nice overview. This is the most rigorous presentation I've seen. I get Bayesian statistics at an intuitive level, but was curious about how it works mathematically. And there it is.
@marcovitturini948110 ай бұрын
Thanks to your channel I'm considering choosing a biostat MSc. Thanks for explaining and inspiring
@trevorgalvez91279 ай бұрын
I used Bayes Theorem for a simple learning model for establishing categories for various phrases that were similar but not exactly the same. Going through thousands of records manually was possible, but using this allowed me to do it in a day with the help of excel and python.
@jamesmcadory132210 ай бұрын
It’s funny because when I was a Physics major two professors in the department argued over whether frequentist or Bayesian Stats were better and would teach their labs differently based on their preference.
@ottoludewig124410 ай бұрын
I began studying Bayesian Statistics last year to develop the tools for an applied study with Climate Change model data, and the main tool that facilitates the posterior calculation is somewhat recent, called INLA. The theory that gives it structure is pretty dense and difficult, but it is by far the easiest to implement and has the lowest computational cost for the cases that it is applied to. A year deep into this I've found it fascinating how these perspectives open up so much compared to the restrictive nature of frequentist analysis. I recommend the Statistical Rethinking course for all audiences and the Gelman book (Bayesian Data Analysis) for a more mature audience with background in math and Statistics.
@VTdarkangel8 ай бұрын
I don't know much about Bayesian methods beyond the most basic premises of it. However, even at that basic level, I can see the power of it. As an engineer, I was really only taught the fundamentals of frequentist statistics. While it has proven useful to understand that (despite my grumbling at the time I took the class), I could see the problem of assumptions being required in the analysis. Bayesian methods seem to account for that.
@DataTranslator6 ай бұрын
“None of them were sufficient” , clearing throat I see what you did there 😀
@barttrudeau923710 ай бұрын
This is a great video on the subject. I really hope you produce more content on Bayesian stats. (Maybe dive into PyMC?) Thank you!
@bringbackthedislikecount676710 ай бұрын
Currently taking statistical physics, found out some similarities between the two, such as prior probability in Bayesian statistics and priori theorem for each microstates of a system in statistical physics. Interesting to learn about Bayesian statistics from a physics major’s perspective nonetheless
@davidl.e52038 ай бұрын
If my understanding is correct, Bayesians are basically frequentists plus moving average. Frequentists draw conclusions about probabilities based on historic observations and take for granted of its probability. Bayesians subset the historic observation by time-scale, then make predictions for the next time-scale. If the next time-scale probabilities don't match the historic probabilities, update probability.
@jamesvaughan7487 ай бұрын
The sound bite at 8:16 is what made me subscribe 😂
@very-normal7 ай бұрын
wooooow
@dandandan1810 ай бұрын
My university teaches both approaches to statistics, but professors and lecturers don't make the distinction known (at least for a bachelor's degree). Bayes' theorem is taught, including how probability densities may differ, how we arrive at the prior belief (or distribution), and the difference in philosophy that affects where Bayesian statistics are often used. Ultimately though, we employ frequentist approaches for theses since it's easier to teach and more common for bachelor's degrees since there's less credibility to design models (i.e., there's little mastery of the field to justify the prior beliefs that will affect the distribution). However, as a civil engineering student, I most often see studies that do not incorporate prior beliefs when modeling real world phenomena that would incredibly affect data interpretation. For instance, studies on flooding, landslides, groundwater flow, the structural health of bridges and concrete buildings, and project management are heavily time-dependent, which makes prior beliefs more significant, but I only encounter risk reports and models that only focus on the current data. I do think that for most studies, frequentist methods are more applicable and would quite suffice given the more theoretical nature of the data and of the methodologies, since most call for single-parameter hypothesis testing. But given the cost of testing materials and tools, I believe the Bayesian approach (incorporating expert knowledge and credible intervals instead of "typical confidence intervals") could be incredibly helpful for handling smaller sample sizes.
@frankjohnson12310 ай бұрын
Love the videos, bit of friendly criticism: when working with a specific example (like 3:35 on), it would help to switch from generic variable names like A and B to something more specific to the example (e.g., S and W in this case). It could even be looser for pedagogical purposes, like replacing A with "Sub" and B with "Watch", though I know not everyone likes that.
@very-normal10 ай бұрын
Ah that’s a good idea. I also find it helpful to see the events in the expression but didn’t make the connection there. Thanks for pointing that out!
@tommys48099 ай бұрын
Using p notation sometimes indicates discrete random variables, where calculating the marginal would be a summation; f would denote probability density functions of continuous random variables, for which the marginal could be calculated by integrating with respect to theta
@RabbitLLLord9 ай бұрын
To understand variational distribution better, perhaps understanding variational autoencoder can be a good start
@very-normal9 ай бұрын
Thanks! I’ve heard about it vaguely, but I’ll look more deeply into it!
@jrlearnstomath9 ай бұрын
Looking forward to more on variational inference, it's really doing my head in
@BrakeForLoop9 ай бұрын
Very helpful! I did get a little lost when trying to think about applications from my experiences.
@lbognini6 ай бұрын
Great video. A little thing about the format: -distracting images -a bit hard to read the text and focus on what you say since the phrasing is often different -The fonts with borders (contours) are hard to read.
@eschares5 ай бұрын
At 4:08, you say A is the prior probability of watching this video. Shouldn't that be the prior of subscribing instead?
@simonpedley97299 ай бұрын
what I love about bayesian statistics is the way i can get almost whatever results i want by changing the prior
@very-normal9 ай бұрын
lol i wish that’s how it worked
@ResearchStatisticsCorrectly5 ай бұрын
Wonderful presentation, definitely better than I have been able to do so far. However (maybe it's somewhere down there in the comments), Bayes did it in 1763, not '1963.'
@anecetcetera78614 ай бұрын
This almost feels like it could be used as a really transparent way to disclaim biases.
@very-normal4 ай бұрын
how can something disclaim bias but also be transparent about them lol
@Jaylooker10 ай бұрын
The Bayesian method sounds similar to how neural networks update their nodes to new data. There are Bayesian neural networks that implement Bayes' theorem, which allows them to have a confidence percentage instead of just an answer to some presented data.
@very-normal10 ай бұрын
Oh that’s cool, I didn’t know that was a thing. I really do be learning a lot from my comment section nowadays
@q-tuber70344 ай бұрын
Today I learned: some people pronounce Bayesian like “beige” rather than “Bayes”
@very-normal4 ай бұрын
beigians
@idrankwhatt10 ай бұрын
Fantastic as always, I am starting on the biostatistics track myself!
@very-normal10 ай бұрын
Best of luck!
@logaandm7 ай бұрын
I started off with "Yeh, yeh, yeh. Conditional Probability, meh" and ended with "This is how the world works. I must learn this!" Great introduction on why Bayesian Statistics is so important.
@alishermirmanov56089 ай бұрын
Amazing video, provides great intuitive understanding!
@brainsify9 ай бұрын
A pdf is a density function not a distribution function. At least in the text books I’ve taught from. I understand this isn’t a big deal, but you made a whole thing.
@very-normal9 ай бұрын
Based on my experience, the terms can be used interchangeably, but I can see where the confusion can come from
@James-bv4nu9 ай бұрын
Isn't a distribution function, a density function? The area under the curve, f(x), is probability; therefore, f(x) is the probability density. That is, f(x) dx is in unit of probability; that makes f(x) in unit of probability per unit x.
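The "probability per unit x" reading can be checked with a quick numerical sketch. The Uniform(0, 0.5) example and the helper names are assumptions chosen here to show that a density value can exceed 1 while areas still behave like probabilities:

```python
def uniform_pdf(x, low=0.0, high=0.5):
    """Density of a Uniform(low, high) random variable."""
    return 1.0 / (high - low) if low <= x <= high else 0.0

def integrate(f, a, b, steps=100_000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

density_at_point = uniform_pdf(0.25)                   # a density, not a probability
total_probability = integrate(uniform_pdf, 0.0, 0.5)   # area under the whole curve
prob_interval = integrate(uniform_pdf, 0.1, 0.2)       # P(0.1 <= X <= 0.2)

print(density_at_point, round(total_probability, 6), round(prob_interval, 6))
```

The density at 0.25 is 2, which would be impossible for a probability, yet the total area is 1 and the area over an interval of width 0.1 is 0.2.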
@very-normal9 ай бұрын
For me, it’s mostly a semantics thing. The pdf f(x) can be referred to as a “probability density function” or a probability distribution function. When I refer to the cumulative distribution function, I’ll make sure to say “cumulative” instead of just saying “distribution”. This is one of those topics where it’s really easy to get lost in the sauce. If I say “probability density function”, then someone will invariably say that I should also include “probability mass function”. It just becomes too wordy for the script, so I stick to probability distribution. As long as I show what I’m referring to, my hope is that people will get what I’m saying
@mohammadnoori92786 ай бұрын
Hi, great video, I would appreciate it if you would suggest a few reference books on these topics to study. Thanks
@very-normal6 ай бұрын
Bayesian Data Analysis by Gelman and Statistical Rethinking by McElreath are good references to start from
@mohammadnoori92786 ай бұрын
@@very-normal I didn't expect to get my answer so quick 😍 thanks 🙏
@mikestein59838 ай бұрын
Anyone wanting a deep dive into Bayesian stats as it applies to research should consider the book Statistical Rethinking by Richard McElreath. He has also prepared a semester’s worth of lectures available on YouTube. This is not a quick fix, but essentially a graduate course. It is IMHO quite accessible and doesn’t assume too much in terms of math background.
@NoMoreToxicRule9 ай бұрын
So, I never knew the thing I hated most about statistics was called the frequentist approach, and I always hated p-values and null hypotheses, which in my opinion are worthless. I knew of Bayes, but when I went to school, it was never taught. Great video and now I'm subscribed.
@KinomaroMakhosini10 ай бұрын
What a coincidence, I am starting Bayesian analysis next week in my Stochastic Probability class😂
@maltez64469 ай бұрын
Where was this video two weeks ago when I was writing my exam project on HMMs? This shit is too hard to comprehend for a second-year bachelor student :( Such a great video!!
@kennethgottfredsen7674 ай бұрын
Hooked me. Can you recommend some books that start from the basics of Bayesian statistics?
@very-normal4 ай бұрын
Statistical Rethinking by McElreath and A Student's Guide to Bayesian Statistics by Lambert were two books that I used to get up to speed!
@kennethgottfredsen7674 ай бұрын
@@very-normal Thank you for the answer.
@durg89098 ай бұрын
Aspiring biostatistician here. My target school has some current PhD students specializing in Bayesian inference, so I came here to learn what I might be in for. A part of me worries that specializing in Bayesian techniques could shrink my potential job pool. Is there any validity to this concern?
@very-normal8 ай бұрын
Hi! I assume you’re doing an MS or PhD in biostat, so I’ll answer from this perspective. Let me know if I’m off the mark. In your coursework, you will most likely be trained in frequentist methods. If you’re lucky, you’ll get some exposure to Bayesian methods, but I think it’s unlikely you’ll use them much. Therefore, your coursework will help you cover your bases for the basic skills expected of a statistician. I’m of the opinion that learning how to do Bayesian analyses will help expand your opportunities, since you simply have more tools/skills. I suppose if you were a PhD student specializing in esoteric Bayesian methods, you might have trouble finding positions where you apply those methods, but that’s a general PhD problem.
@durg89098 ай бұрын
@@very-normal Thanks for the speedy reply! That’s a fair point about the general PhD problem, sadly. It sounds like Bayesian inference could be a tool in the belt that may or may not be used in the workplace, but it’s definitely something I want to study. Thanks for the awesome video man!
@chaosenergy19904 ай бұрын
Can we use the frequentist model to generate the prior knowledge if there is none?
@浴中哲思-j5f9 ай бұрын
The inference at 4:30 is very wrong. That term is not independent, so you cannot increase it alone without affecting the other terms. Also, the point should not be that you need current subscribers to watch your videos more, but that you need more of the watchers to come from your subscriber pool, since neither the total number of watchers nor the number of subscribers is constant.
@very-normal9 ай бұрын
Not my best example, thanks for your point
@Ltsoftware31396 ай бұрын
In the example about subscribing and watching the video, you describe P(A|B) as the probability of subscribing (future), given that you watched the video, and P(B|A) as the probability of watching the video, given that I'm already a subscriber. Shouldn't P(A|B) be the probability that I am subscribed (present) given that I watched the video? Is there a difference between these two ways of defining P(A|B)?
@very-normal6 ай бұрын
Admittedly it’s a messy example in retrospect 😅 That’s also another way to think about it that I didn’t think of when I was writing this script.
@GM-zt6ti3 ай бұрын
Glad someone mentioned it. I was starting to think my understanding of Bayes' rule was incorrect
@TN-cx4qi9 ай бұрын
We used Bayes' theorem a lot in discrete math, stats, machine learning, and AI classes. I used Markov chains in a couple of personal programming projects.
@very-normal9 ай бұрын
Man the machine learning classes get more exposure to it than the statistics classes lol
@TN-cx4qi9 ай бұрын
@@very-normal They really do. When it first popped up on the screen and the professor asked if anyone had seen this formula before, it was like a crazy joke. It begs to be used in something like HVAC for temperature control.
@HiVisl7 ай бұрын
So I should frequently use Bayesian statistics? 🤔
@thesoundofscience8 ай бұрын
I clearly missed something ... if P(D) is just a number, and we know that the posterior must be normalized over some range of theta, then isn't P(D) just the normalization constant?
@very-normal8 ай бұрын
It is! We know that it’s a number, but this integral is usually difficult to calculate
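As a rough illustration of P(D) being the normalization constant, here is a small grid-approximation sketch. The binomial likelihood, the flat prior on θ, the data (7 successes in 10 trials), and the function names are all assumptions made up for this example. With a single parameter the integral is easy to approximate; the difficulty mentioned above shows up when θ has many dimensions and a grid like this stops being practical.

```python
import math

def likelihood(theta, k, n):
    """Binomial probability of k successes in n trials given theta."""
    return math.comb(n, k) * theta**k * (1 - theta) ** (n - k)

def prior(theta):
    """Flat prior density on [0, 1] (an illustrative assumption)."""
    return 1.0

def evidence(k, n, grid_size=10_000):
    """Midpoint-rule approximation of P(D) = integral of likelihood * prior."""
    d = 1.0 / grid_size
    return sum(
        likelihood((i + 0.5) * d, k, n) * prior((i + 0.5) * d)
        for i in range(grid_size)
    ) * d

# Hypothetical data: 7 successes in 10 trials.
k, n = 7, 10
p_data = evidence(k, n)

def posterior(theta):
    """Posterior density: likelihood times prior, divided by the number P(D)."""
    return likelihood(theta, k, n) * prior(theta) / p_data

# Dividing by P(D) is exactly what makes the posterior integrate to 1.
posterior_area = sum(posterior((i + 0.5) / 1000) for i in range(1000)) / 1000

print(p_data, posterior_area)
```

For this particular flat-prior setup the integral has the closed form 1 / (n + 1), so `p_data` should come out close to 1/11.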
@provocateach9 ай бұрын
Likelihoodists: are we a joke to you?
@Megasteel3210 ай бұрын
lmao im taking the required entry level stats/prob class for my comp sci major and this was all our last test was about, good to know that me being super confused was normal.
@very-normal10 ай бұрын
it’s very normal
@laitinlok15 ай бұрын
I think in some ways the Bayesian method for testing vaccines makes sense: the probability of getting COVID given that someone has taken the vaccine is often used as a metric of how effective the vaccine is, instead of the overall probability of getting COVID.
@realalehomebrewer82738 ай бұрын
Did my Masters Degree using the concepts from Bayesian Reliability Analysis applied to radiation carcinogenesis.
@dullyvampir839 ай бұрын
Am I correct that frequentists just set P(θ) = 1, which probably also makes the integral easy?
@very-normal9 ай бұрын
I’m not sure, I’ve never heard of that before. I do know that Bayesian analyses agree with Frequentist analysis when you use uniform priors, since it essentially boils down to maximum likelihood. But I don’t think they can assign a probability to the parameter since it’s viewed as just a number, not random
@dullyvampir839 ай бұрын
@@very-normal Thanks for the answer. I made a mistake. What I meant to say was: shouldn't they agree if we set the prior to dirac_delta(x-θ)? Wouldn't that express the conviction that θ is simply a number? Then the numerator would equal the denominator and the posterior would be 1, so not worth investigating further.
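One way to see the uniform-prior remark in this thread: with a flat prior, the posterior is proportional to the likelihood, so the posterior mode lands on the maximum likelihood estimate. The sketch below checks that on a grid. The binomial setup, the data, the unnormalized Beta(2, 2)-shaped comparison prior, and all names are hypothetical choices for illustration.

```python
def argmax_on_grid(score, grid):
    """Return the grid point with the highest score."""
    return max(grid, key=score)

# Hypothetical data: k successes in n Bernoulli trials.
k, n = 7, 10
grid = [i / 1000 for i in range(1001)]

def likelihood(theta):
    # Binomial likelihood up to a constant; the constant does not move the argmax.
    return theta**k * (1 - theta) ** (n - k)

def flat_prior(theta):
    # Uniform prior on [0, 1].
    return 1.0

def curved_prior(theta):
    # Unnormalized Beta(2, 2) shape, chosen only as a non-flat comparison.
    return theta * (1 - theta)

# Maximum likelihood estimate on the grid.
mle = argmax_on_grid(likelihood, grid)

# Posterior mode under the flat prior: same location as the MLE.
map_flat = argmax_on_grid(lambda t: likelihood(t) * flat_prior(t), grid)

# Posterior mode under the non-flat prior: pulled toward 0.5.
map_curved = argmax_on_grid(lambda t: likelihood(t) * curved_prior(t), grid)

print(mle, map_flat, map_curved)
```

Note that this only shows the point estimates coinciding; the interpretation of θ (fixed number versus random quantity) still differs between the two camps.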
@Onecool03 ай бұрын
In the timestamp label, please fix the typo: it should be "Bayes' Theorem", not "Bas Theorem".
@laitinlok15 ай бұрын
I have learned both ways in uni, it is interesting
@feifeizhang77576 ай бұрын
I need to listen again for understanding 😂❤
@cremildamondlane39007 ай бұрын
Hi, I did math stats, ANOVA and regression at the end of last year; this year I did stochastic processes and I'm currently doing time series. I want to deepen my stats knowledge. Any suggestions on which courses I should do next?
@very-normal7 ай бұрын
I think it would be good to solidify your understanding of the basic hypothesis tests. You can also get familiar with logistic regression and survival analysis.
@NoufKh-e1l7 ай бұрын
wish u included some refs that u used in this vid
@very-normal7 ай бұрын
what refs did you want in particular
@NoufKh-e1l7 ай бұрын
@@very-normal Just wanted to know what source material you used for this video, great video nonetheless :D
@신선규-p1z7 ай бұрын
In this video, there are many p variables: P (probability), p (the parameter in the binomial distribution, 8:35), and p (a probabilistic variable, 9:28 and 12:08). Please clarify each p, or at least use different letters!!!!
@very-normal7 ай бұрын
🆗
@RexAstrum9 ай бұрын
Thank you for this!
@titong_totong10 ай бұрын
What a time to be alive!
@very-normal10 ай бұрын
👀 two minute papers watcher?
@nyx21110 ай бұрын
Ice cream for my eyes!
@Richard-ft6zp6 ай бұрын
It should be pointed out that if you used the 'informative prior', which is a delta function in the video, then no matter what the data are, you wouldn't learn anything. Your prior has to be non-zero over the interval of possible values.
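The delta-function point can be checked with a toy grid version of Bayes' rule. Everything here (the grid of θ values, the point mass at 0.5, the 9-out-of-10 data, and the function name `grid_posterior`) is an assumption for illustration and not taken from the video. A prior with all its mass on one value returns the same distribution after updating, while a flat prior moves toward the data.

```python
def grid_posterior(prior_weights, thetas, k, n):
    """Discrete Bayes update for k successes in n trials on a grid of thetas."""
    unnormalized = [
        w * theta**k * (1 - theta) ** (n - k)
        for w, theta in zip(prior_weights, thetas)
    ]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]

# Grid of candidate values for theta: 0.00, 0.01, ..., 1.00.
thetas = [i / 100 for i in range(101)]

# Point-mass ("delta") prior: all belief on theta = 0.5.
point_prior = [1.0 if i == 50 else 0.0 for i in range(101)]

# Flat prior: equal weight on every grid value.
flat_prior = [1.0 / 101] * 101

# Hypothetical data that strongly disagree with theta = 0.5.
k, n = 9, 10

point_posterior = grid_posterior(point_prior, thetas, k, n)
flat_posterior = grid_posterior(flat_prior, thetas, k, n)

# Location of the most probable theta under each posterior.
point_mode = thetas[point_posterior.index(max(point_posterior))]
flat_mode = thetas[flat_posterior.index(max(flat_posterior))]

print(point_mode, flat_mode)
```

Any θ value given zero prior weight stays at zero in the posterior, which is why the prior needs to be non-zero wherever the true value could plausibly be.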
@austenmoore73269 ай бұрын
Bayesian stats is cool in theory, but I’ve never found anything on how to deal with confounders with it. Do they just deal with univariate data?
@very-normal9 ай бұрын
Theoretically, you can handle confounders through your design and just use a Bayesian analysis, but I’m not sure about the observational setting. I know Bayesian causal methods exist but I haven’t used them myself
@me5ng310 ай бұрын
I had to take Bayesian statistics for my machine learning classes. I didn't know that they aren't taught as much in other places, since in Germany they're fairly popular. They are even taught in high school.
@PR-cj8pd10 ай бұрын
No, everyone will see Bayesian probability, not Bayesian statistics
@huhuboss82748 ай бұрын
I am from Germany too and I don't think Bayesian statistics are taught in high school. Might you be confusing it with Bayes' theorem, which also has a frequentist interpretation?