Using Linear Models for t tests and ANOVA, Clearly Explained!!!

61,581 views

StatQuest with Josh Starmer

1 day ago

Comments: 54
@statquest 2 years ago
Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/
@newtonfamily2274 1 year ago
I have to say that I really appreciate the fact that this video is bringing me back to 2009 KZbin while teaching me stats. Thanks!
@statquest 1 year ago
bam! :)
@flaetus217 1 year ago
I cannot express how grateful I am for such wonderful videos!
@statquest 1 year ago
Glad you like them!
@rahulbahadur1 1 year ago
Thanks
@statquest 1 year ago
Hooray! Thank you so much for supporting StatQuest! TRIPLE BAM! :)
@za7607 1 year ago
Thanks for understanding non-native speakers, and for your clear explanation.
@statquest 1 year ago
Thanks!
@user-ob3gy3zo6y 2 years ago
Thanks so much for posting this! We are going into two-way ANOVA; I hope this helps.
@statquest 2 years ago
Good luck!!
@natiajavakhishvili8517 2 years ago
Wow! This is really good! We need more on means and effects parametrization!
@statquest 2 years ago
Thanks!
@2460z_htdja 8 months ago
I am really grateful for having found this site. I just want a simple suggestion as to what type of statistics (ANOVA, chi-square, etc.) I should use to determine my goal below. I am truly confused yet optimistic for someone generous out there, and I would greatly appreciate any additional comments or suggestions to clarify or simplify my statement or claim here. Thank you in advance. "Among the randomly selected senior students in the eight (4 public and 4 private) US south-west states (California, Arizona, New Mexico, and Texas), their responses are unanimous (strongly agree, agree, neutral, disagree, strongly disagree) as regards participating in home gardening rather than school gardening."
@massisenergy 4 months ago
Look for 'Chi-square test of independence' and 'Friedman's test'
@ryanmckenna2047 1 year ago
Here the p-value for the mean of the data is equal to the number of parameters for the equation of the mean, and the p-value for the fit of the data is equal to the number of parameters for the fit of the data. How does this relate to the typical notion of p-values (probability we should not reject the null hypothesis)? Also, once F has been computed, is F itself a p-value or a value that we need in order to compute a p-value?
@statquest 1 year ago
You might want to start with the first video in this series (this is "part 2") because that will answer your questions about how the p-values are computed and what they mean: kzbin.info/www/bejne/pJyVdIR_idKSm9E
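For reference, a sketch of the F statistic the question refers to, written as I understand it from the linear regression video in this series. Here p_fit and p_mean are parameter counts (not p-values), and n is the number of data points:

```latex
F = \frac{\bigl(SS(\text{mean}) - SS(\text{fit})\bigr) / (p_{\text{fit}} - p_{\text{mean}})}
         {SS(\text{fit}) / (n - p_{\text{fit}})}
```

F is not itself a p-value. The p-value is the probability, when the null hypothesis is true, of seeing an F at least this large, and it is read off an F distribution with (p_fit - p_mean) and (n - p_fit) degrees of freedom. For a two-group t-test, p_mean = 1 and p_fit = 2.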
@hameddadgour 9 months ago
Great video. Thank you for sharing!
@statquest 9 months ago
Thank you!
@camillesylvain804 2 years ago
Hii, could you do videos on the different GLMs (Poisson, binomial, log, Tweedie) and also the link function and some R examples plsss? I loveee your videos 🙌🏼
@statquest 2 years ago
I'll keep those topics in mind.
@nizogos 3 months ago
In the end, doesn't the second design matrix assume that the first column is the baseline, since it's always on (a column of 1s)? The results would then be interpreted with respect to the difference between the two groups rather than their magnitudes.
@statquest 3 months ago
What time point, minutes and seconds, are you referring to?
@nizogos 3 months ago
@@statquest At the end you present an alternative design matrix that has all 1s in the first column.
@statquest 3 months ago
@@nizogos The answer to your question is "yes". This is explained in the follow up video on design matrices: kzbin.info/www/bejne/eaKveKmtnpJohsU
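A minimal sketch of the two design matrices being discussed, assuming made-up expression values for a control group and a mutant group (the numbers, variable names, and the `predict` helper are hypothetical, not taken from the video). In the first design each group has its own on/off column and the coefficients are the group means. In the second, the column of 1s makes the control mean the baseline and the second coefficient becomes the difference between the groups.

```python
# Hypothetical gene-expression values; not taken from the video.
control = [1.8, 2.0, 2.4, 2.6]
mutant = [3.0, 3.4, 3.8, 4.2]


def mean(values):
    return sum(values) / len(values)


mean_control = mean(control)
mean_mutant = mean(mutant)

# Design 1: one on/off column per group; the coefficients are the group means.
design_means = [[1, 0]] * len(control) + [[0, 1]] * len(mutant)
coef_means = [mean_control, mean_mutant]

# Design 2: the first column is all 1s (the baseline, always on);
# the second column switches on the mutant offset from that baseline.
design_offset = [[1, 0]] * len(control) + [[1, 1]] * len(mutant)
coef_offset = [mean_control, mean_mutant - mean_control]


def predict(design, coefs):
    """Multiply each design-matrix row by the coefficients and sum."""
    return [sum(x * b for x, b in zip(row, coefs)) for row in design]


fitted_means = predict(design_means, coef_means)
fitted_offset = predict(design_offset, coef_offset)

print("group-means design:", fitted_means)
print("baseline + offset design:", fitted_offset)
```

Both designs give the same fitted values, so SS(fit) is the same; only the meaning of the coefficients changes.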
@SNAKE1375 1 year ago
Hello Josh, it's me again... I had a question concerning ANOVA. I still don't understand why we have to use an ANOVA when we have to compare a reference condition to several drug treatments (A, B, C) done in the same experiment. For instance, I am only interested in the effect of drug A or drug B compared to a control condition. So I only need a t-test. Why should I have to do an ANOVA?
@statquest 1 year ago
An ANOVA is just a generalized t-test. Technically, you can do an ANOVA with just 2 conditions, Drug A vs a control, and you'll get the exact same results as a t-test (which is why people call it a t-test).
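A small sketch of that equivalence, assuming hypothetical measurements for a control and one drug (the data and names are made up for illustration). It compares the classic equal-variance (pooled) two-sample t statistic with the F statistic computed from SS(mean) and SS(fit); with two groups, F should equal t squared. Note that this holds for the pooled t-test, not for Welch's unequal-variance version.

```python
import math

# Hypothetical measurements; not taken from the video.
control = [1.8, 2.0, 2.4, 2.6]
drug_a = [3.0, 3.4, 3.8, 4.2]


def mean(values):
    return sum(values) / len(values)


def sum_sq(values, center):
    """Sum of squared differences between the values and a center."""
    return sum((v - center) ** 2 for v in values)


n1, n2 = len(control), len(drug_a)
n = n1 + n2

# Residual sum of squares around each group's own mean: SS(fit).
ss_fit = sum_sq(control, mean(control)) + sum_sq(drug_a, mean(drug_a))

# Pooled two-sample t statistic (assumes equal variances).
pooled_var = ss_fit / (n - 2)
t_stat = (mean(drug_a) - mean(control)) / math.sqrt(pooled_var * (1 / n1 + 1 / n2))

# F statistic from the linear-model view: overall mean vs. one mean per group.
ss_mean = sum_sq(control + drug_a, mean(control + drug_a))
p_mean, p_fit = 1, 2
f_stat = ((ss_mean - ss_fit) / (p_fit - p_mean)) / (ss_fit / (n - p_fit))

print("t squared:", t_stat ** 2)
print("F:", f_stat)
```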
@pavoldzama4641 10 months ago
Hi, thanks for the awesome video. I have a question: I still cannot get my head around the concept of this matrix. You have the formula y = 1*2.2 + 0*3.6 + the residual for that value. I thought that only the residuals go into the calculation. How do we get from this stack of y = equations, one below the other, to the final calculation? How does it fit in there? I am missing a bridge. Also, sometimes there is a residual at the end of the y equation and other times there is not; is this notation used later when working with the residuals? If I understand correctly, SS(fit) = the sum of all these y equations together, and I don't get why we need the matrix if we only sum the residuals anyway. Or, if I follow the equation literally, do I have the sum of the squared residuals (nobody wrote that they are squared, but I suppose so) plus the mean (which is on or off according to the matrix)? Thanks a lot for your response.
@statquest 10 months ago
What time point, minutes and seconds, are you asking about?
@pavoldzama4641 10 months ago
It is spread between 5:00 and 7:00, where you start to show the y = equations and end up with a simple equation at 7:00, and then you move on. I am confused about how this fits together.
@statquest 10 months ago
@@pavoldzama4641 First, when we use the equations to calculate the exact y-axis value for each of the known data points, we include the residuals because the mean value + the residual = the exact y-axis coordinate for the original data value. However, when we use the equations to make predictions - say someone asks us to predict gene expression for a new control mouse - and we don't actually know what that value is, then we leave the residual off the equation, since we don't know what the difference between the mean and the "true" value is. Thus, when the equation is used with known values for the y-axis, we include the residual. When the equation is used to make predictions (and we don't know the y-axis value) we leave off the residual. Now, the reason we keep track of the equations in a matrix is that we can change the coefficients in one place and see how they affect the residuals and thus how they minimize SS(fit).
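A rough sketch of the idea in the reply above, assuming made-up data and hypothetical helper names (`predict_row`, `ss_fit`). Each row of the design matrix turns one group's coefficient on or off; the residual is the observed value minus that prediction, and SS(fit) is the sum of the squared residuals. Changing the coefficients in one place changes every residual at once.

```python
# Hypothetical data: four control mice followed by four mutant mice.
observed = [1.8, 2.0, 2.4, 2.6, 3.0, 3.4, 3.8, 4.2]

# Design matrix: column 1 is "on" for control, column 2 is "on" for mutant.
design = [[1, 0]] * 4 + [[0, 1]] * 4


def predict_row(row, coefs):
    """Prediction for one mouse; no residual, because it is a prediction."""
    return sum(x * b for x, b in zip(row, coefs))


def ss_fit(coefs):
    """Sum of squared residuals for a given set of coefficients."""
    total = 0.0
    for row, y in zip(design, observed):
        residual = y - predict_row(row, coefs)  # known value minus prediction
        total += residual ** 2
    return total


# The group means (2.2 and 3.6 for this made-up data) minimize SS(fit).
best = ss_fit([2.2, 3.6])
# Any other coefficients, such as these, give a larger SS(fit).
worse = ss_fit([2.0, 3.6])

# Predicting a new control mouse: the equation is used without a residual.
new_control_prediction = predict_row([1, 0], [2.2, 3.6])

print("SS(fit) at the group means:", best)
print("SS(fit) with a shifted control coefficient:", worse)
print("prediction for a new control mouse:", new_control_prediction)
```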
@pavoldzama4641 10 months ago
@@statquest Thanks for explaining. I think maybe we misunderstood each other; I just could not connect the dots. I think I get it now after watching again: you use this formula for the fit, and it should not include the residual, since we calculate the residuals based on it. Then I got pretty confused by those matrices, but your follow-up video saved the day! Great work, keep it up.
@fanghou8924 29 days ago
The second example should be a 2 x 2 + 1 design?
@statquest 29 days ago
I actually don't know what you'd call it. For me it's easier to figure out what the model and contrasts are than it is to figure out what to call it.
@saude-5online422 2 years ago
First, thank you for your awesome videos! Is there any advantage of General Linear Model over a simple ANOVA, for example?
@statquest 2 years ago
ANOVA is just a specific type of General Linear Model.
@unlearningcommunism4742 1 year ago
When I hear your voice, I want to make chemometric papers with you
@statquest 1 year ago
:)
@ToniSkit 2 years ago
Woah, I was like, that's weird, a video… and then it was like, that's a lot of videos. BAMMMMM
@computerconcepts3352 2 years ago
yeah lol, idk if it's a re-upload though
@statquest 2 years ago
It's a re-upload. For some reason KZbin put the originals behind a paywall...so I re-uploaded them so that they would still be free.
@computerconcepts3352 2 years ago
@@statquest oh ok, interesting 🤔, I thought you manually did that, lol. Thanks 👍 for the information and clarification. I'm curious whether you can access your own videos, or do you have to pay KZbin to watch them as well?
@statquest 2 years ago
@@computerconcepts3352 I can watch them, so at first I had no idea what was going on. The content worked for me, but no one else. It was very confusing and stressful because I got a lot of negative comments. Ugh. A bad day in StatLand. :(
@computerconcepts3352 2 years ago
@@statquest oofty doof oof oof, I guess uploading videos to other platforms could help reduce damages against something like this but then I really hate how KZbin does random things like this. I remember one of my videos got deleted for the wrong reason and I lost a bunch of views. Fixing KZbin is actually one of my eventual goals and me watching your KZbin videos is my first step towards that goal 👍
@aiswaryanair9033 1 year ago
Pardon me if this is a dumb question, but I am having a hard time wrapping my head around the idea! Why would one prefer to do a linear regression for a t-test? How does it make this better?
@statquest 1 year ago
It's actually the exact same test - there are just two ways to do it. The advantage of doing it this way (using regression) is that we have more flexibility - we can easily generalize it into an ANOVA test or even something more fancy.
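To illustrate the flexibility mentioned above, here is a minimal sketch assuming hypothetical data and a made-up helper, `f_statistic`. The same calculation that handles two groups (a t-test) handles three or more groups (an ANOVA) simply by fitting one mean per group; nothing else changes.

```python
def f_statistic(groups):
    """F for 'one mean per group' versus 'one overall mean'."""
    all_values = [v for g in groups for v in g]
    n = len(all_values)
    grand_mean = sum(all_values) / n

    # SS(mean): squared residuals around the single overall mean.
    ss_mean = sum((v - grand_mean) ** 2 for v in all_values)
    # SS(fit): squared residuals around each group's own mean.
    ss_fit = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)

    p_mean = 1            # parameters in the overall-mean model
    p_fit = len(groups)   # one parameter (a mean) per group
    return ((ss_mean - ss_fit) / (p_fit - p_mean)) / (ss_fit / (n - p_fit))


# Hypothetical measurements; not taken from the video.
control = [1.8, 2.0, 2.4, 2.6]
drug_a = [3.0, 3.4, 3.8, 4.2]
drug_b = [2.0, 2.2, 2.6, 2.8]

f_two_groups = f_statistic([control, drug_a])            # the t-test case
f_three_groups = f_statistic([control, drug_a, drug_b])  # the ANOVA case

print("F with two groups:", f_two_groups)
print("F with three groups:", f_three_groups)
```

Turning either F into a p-value still requires an F distribution with the matching degrees of freedom, which this sketch leaves out.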