How to interpret (and assess!) a GLM in R

  Рет қаралды 24,902

Chloe Fouilloux

Chloe Fouilloux

Күн бұрын

Hi! New to stats? Did you just run a GLM and now you have an output that you have no idea how to interpret? Then this video is just for you! In addition to interpreting the output of standard GLM models in R, we also go over diagnosing the suitability/appropriateness of a GLM for your data.
*Our mantra:* Just because it runs, doesn't mean it's right!
Jump around the video:
0:00 Introduction
01:06 Loading Libraries
01:06 *Introduction to Iris Data*
02:34 First GLM table
03:01 Understanding *intercepts*
03:33 Understanding *estimates*
04:28 Changing the levels of comparison in a GLM
05:49 Understanding *standard errors and t-values*
06:59 Understanding *null deviance and residual deviance*
09:09 Understanding *deviance residuals*
09:24 Model quality checks and DHARMa
12:06 *EXAMPLE 2* Diamonds dataset
12:26 Building diamonds GLM
12:52 Knowledge check
13:58 DHARMa analysis for continuous GLM
14:35 Patterns in residuals
15:21 GLM with multiple predictors
15:57 Understanding intercept with multiple predictors
16:40 Are do your data and intercept agree?
17:17 Outro
Find the code for this video on my GitHub: github.com/chloefouilloux/GLM...
Disclaimer: I definitely misspeak/misuse some terms throughout this video, but the general concepts are correct. I was just kind of free-balling with no script here, but I still hope you find the content useful! *hugs*

Пікірлер: 59
@livinglyrics2778
@livinglyrics2778 3 ай бұрын
This video is the first video of yours that I’ve come across and I just wanted to say, I absolutely love your teaching and presentation style!! Your enthusiasm and explanation style are so engaging, it’s awesome; and, the way you break things down whilst also simplifying concepts is great, especially because such concepts are generally taught/explained in a much more complex way in university courses, textbooks, and in other KZbin/online tutorials - together, I feel this all really helps with improving understanding of all concepts discussed. I’m a postgrad student and would have loved to have access to this type of content in my earlier years when learning stats - I must say though, I’ve still learnt some new info from this tutorial!! Would love to see more R programming tutorials like this one - if you’re thinking about posting more, please do because you definitely have the gift of making stats engaging and fun (descriptive words that you don’t usually find when people are talking about stats 😅). Thanks for this content!! 🙌
@user-mh7px2uy1k
@user-mh7px2uy1k 6 ай бұрын
I am learning mixed effect linear models - could you do a video on how to interpret the outcome of those types of models? I have tons of info on the modeling aspect but not entirely sure how to leverage the output effectively. I appreciate the humor and thoughtfulness in your videos to make them interesting.
@karlaandreoli1986
@karlaandreoli1986 18 күн бұрын
I just arrived here, and I have to say thank you soooo much for this video! You are very didactic Hugs from Brazil 🥰
@rhodrambles3943
@rhodrambles3943 Жыл бұрын
This was super useful, not come across the DHARMA package before and its so much simpler than what I was trying to do. Thank you so much!
@emilybrayton4457
@emilybrayton4457 3 ай бұрын
Thanks so much for this video, I feel like I have some clarity in understanding GLMs and my outputs so much more now. It feels good to have this confidence!!!
@MV-wn6kc
@MV-wn6kc Жыл бұрын
This is exactly what i needed for my university report. Thank you so much!
@isabelvictoriamoralesbelpa9649
@isabelvictoriamoralesbelpa9649 10 ай бұрын
Thank you very much Chloe, you are the best for explaining this tricky things. Please if you can do a video about GLM including interactions among factors
@martinabautista
@martinabautista 2 ай бұрын
You are incredible! I enjoy every second I watch your video
@icefunkdark8555
@icefunkdark8555 4 ай бұрын
I love how you present it :) thank you!
@fionac5717
@fionac5717 10 ай бұрын
Hi Chloe, this was a fabulous explanation of how GLM works, clear, concise and helped me no end to get to grips with my GLMM on factors affecting pollinators visiting annual bedding plants! thanks so much, not least for the introduction to DHARMa!! More please, love your friendly style.
@adeyemiblessing
@adeyemiblessing 7 ай бұрын
Hi Fiona, Hope you are good? I came here for this same reason as I am a student working on pollinator interactions and effect of different factors on them. Is there a better way we can connect? I'm looking forward to your reply
@Cintiams95
@Cintiams95 11 ай бұрын
Omg! Thank youuu ❤ The way you explained.... amazing 😊
@paulobarrosbio
@paulobarrosbio 7 ай бұрын
Thank you! Amazing explanation! Really helped me understand key aspects of a GLM. And thanks to the tip on the DHARMa package!
@jsc0625
@jsc0625 4 ай бұрын
This really helped me fill in some knowledge gaps I had about the GLM, thanks so much 😊
@user-nv6fq7qb1n
@user-nv6fq7qb1n 6 ай бұрын
This was really helpfull, clear, and fun to watch ! thank you very much :)
@yuvalgal-shahaf2782
@yuvalgal-shahaf2782 3 ай бұрын
You manage to make statistics fun anc cool! wow. Thank you so much. You are great
@user-mh7px2uy1k
@user-mh7px2uy1k 6 ай бұрын
Very good explanation, helpful reminder. And appreciate the tip on the Dharma package.
@keniadanielareyesochoa1224
@keniadanielareyesochoa1224 3 ай бұрын
OMG, this is pure gold! Thank you so much
@Sseyedsalehi
@Sseyedsalehi 10 ай бұрын
Thank you so much Chloe!
@juanlb1105
@juanlb1105 Жыл бұрын
The model is modelling. that´s meme material there. Thanks for the video Chloe! finally learned some tricks with GLMs
@chacmool2581
@chacmool2581 2 ай бұрын
Statisticians like to generalize and GLM is a generalization of lots of survival cases. For example, OLS regression is a surgical case of a GLM with a Gaussian link. Fit an lm() and a Gaussian GLM, and you'll get identical results.
@mattounou
@mattounou 10 ай бұрын
Merci beaucoup pour les explications claires ! Précieux notamment pour juger la validité du glm et ce joli package DHARMa
@agustinabayon1887
@agustinabayon1887 9 ай бұрын
Great explanation! thank you so much for the video. Could you please make a video about which glm models can be used when the data is not normally distributed?
@bkarim7349
@bkarim7349 11 ай бұрын
Thank you great video
@user-xy5ko8xr9i
@user-xy5ko8xr9i 5 ай бұрын
chloe ily this is such a good video
@bes2963
@bes2963 Жыл бұрын
Hi Chloe, just watched this and I have to say thank you so much for speaking in English for all of us not super familiar with statistics. This was so easy to understand, it puts most professors I've had to shame. Any chance you could explain working with a non-normal distribution, interpreting a GLM Poisson? I'm struggling with my data analysis for my thesis :)
@chloefouilloux
@chloefouilloux 11 ай бұрын
I will make this the focus of my next video!
@adeyemiblessing
@adeyemiblessing 7 ай бұрын
Is that video out now? 😊
@katerynarusnyak947
@katerynarusnyak947 Жыл бұрын
You are great!
@chloefouilloux
@chloefouilloux Жыл бұрын
Wow, that means the world. Thanks! If there's anything you'd like to learn in data viz, don't hesitate to ask! :-)
@gabrielbatista4329
@gabrielbatista4329 9 ай бұрын
Super helpful, what model would use for data that is not normally distributed?
@user-ro9ex5im2p
@user-ro9ex5im2p 4 ай бұрын
Thanks!
@Maddawg31415
@Maddawg31415 11 ай бұрын
Very good. Now lets say you had the 3 flower variables as categorical, and you wanted to generate ORs based on whether the species had long or short (1/0) septal length. How would you do that for a model where the coefficients are expressed as differences off of the reference's coefficient?
@yusmanisleidissotolongo4433
@yusmanisleidissotolongo4433 11 ай бұрын
Thanks so much. Do we need to include in the code the distribution?
@user-ge6ee1sf5h
@user-ge6ee1sf5h Ай бұрын
Hello! Thank you for the video! May I ask to explain in details what Estimates mean in GLM please? Or where can I read more about it?
@gmasji
@gmasji 11 ай бұрын
Thanks for the video. I want to ask you, If I have 2 categorical factors and one numeric response, Can I do a glm? Thank you, I am just starting with glm😅
@bobmandinyenya8080
@bobmandinyenya8080 Ай бұрын
Thanks Chloe, how can I make the plot as the one you have at 3:06 minutes for the different species?
@markelov
@markelov 8 ай бұрын
Loved your video! Have you ever used check_model() from the performance() package?
@chloefouilloux
@chloefouilloux 8 ай бұрын
I haven't! I just looked it up and it looks pretty cool. It seems very similar to DHARMa but perhaps a bit more flexible, which can be good or bad depending on your handling on stats (for example, I see that you can compare models with different parameters from different datasets within the same call! that seems. . . dangerous. . .and can be super misleading if you don't know what is underlying the output).
@markelov
@markelov 8 ай бұрын
For sure! I am slowly but surely making the transition to R by way of SPSS and then Stata, and am constantly amazed at how flexible R can be-for better or for worse! I have only tinkered with check_model(). I like that it offers a vehicle to visually inspect the most salient OLS assumptions at once, and especially love the added guidance of what you should be looking for to guide your interpretation. Merci mille fois !
@alexanderstosich3584
@alexanderstosich3584 Жыл бұрын
is DHARMA only for GLM's? Is there something similar for GLMM's? great video!
@chloefouilloux
@chloefouilloux Жыл бұрын
It actually works best for GLMMs! More troubleshooting options. Check out their super detailed vignettes here: cran.r-project.org/web/packages/DHARMa/vignettes/DHARMa.html
@alcinaxavier3623
@alcinaxavier3623 2 ай бұрын
What if I want to test interections (they were significant for Tukey test)? What commends should I write?
@sucinovita4422
@sucinovita4422 Жыл бұрын
Hi there, Thank you for sharing ❤, but i have a question. If the model have multiple predictor, and one of them is continous data. How to change the intecept for that continous variable after i transform the data? Thank you
@chloefouilloux
@chloefouilloux Жыл бұрын
Hi Suci! Great question. Short answer: (1) First transform the data, and **save it as a new column in your data sheet**, (2) run the model with this updated variable. Long answer (example, lol): Let's say we had mass as a predictor. We have a data frame called *df*. Now, let's say we want to transform mass. I would first load the tidyverse package, and then use the function "mutate" to make a new (transformed) variable! #some code! library(tidyverse) df1% mutate(mass_new = mass-mean(mass)/sd(mass)) #Now, see above, we have our NEW variable called "mass_new. So, all we have to do now is use this in our model! (In the fake code, I have saved it here as a new data frame to avoid confusion) glm( y ~ mass_new + x2, data = df1) The model above will then be using your transformed variable
@sucinovita4422
@sucinovita4422 Жыл бұрын
Thank you for your answers, i’ll try it first 🙏🙏☺️
@rubyanneolbinado95
@rubyanneolbinado95 2 ай бұрын
Hi, why is R studio producing different results even though I am using the same call and data.
@chloefouilloux
@chloefouilloux 2 ай бұрын
Hmmmmm, I wouldn't know without looking at your code, but you can check out the code of this video that I have annotated on my GitHub to see if there are any mismatches. github.com/chloefouilloux/GLMOutput/blob/main/GLM_Output.Rmd
@Hamromerochannel
@Hamromerochannel 9 ай бұрын
Hi Chloe what’s your background (profession) ? Academics or …. ???
@chloefouilloux
@chloefouilloux 9 ай бұрын
Hi! I am in academia, yes! Which is why the videos are quite irregular, but I am going to try to get one up before the holidays!
@rubyanneolbinado95
@rubyanneolbinado95 3 ай бұрын
thank you for the information.
@chloefouilloux
@chloefouilloux 3 ай бұрын
Thanks for the feedback 😸 I'm working on a follow-up video that might include interactions and other model families. If it's okay could you let me know what info you felt was lacking? I'm always trying to improve on explanations!
@ALIENwaveENGINEER
@ALIENwaveENGINEER 3 ай бұрын
🤐🤐🤐🤐🤐🤐🤐
@rubyanneolbinado95
@rubyanneolbinado95 3 ай бұрын
@@chloefouilloux ohh thank you so much for the prompt reply. I am just frustrated and confused on how to select the best model for my 7 response variables. Should I use the AIC (via backward selection) to select the best fitted model or should I just use 3 models (of which I selected the explanatory variables, one with only 2, one with 5 and one with 5 explanatory variables+interactions). Please help me what should I do on this. I've done too many researches but they have used different methods and just confused me more. Huhu
@rubyanneolbinado95
@rubyanneolbinado95 3 ай бұрын
@@chloefouilloux one more things please. Is it okay to use just one model for my different 7 response variables?
@chloefouilloux
@chloefouilloux 3 ай бұрын
@@rubyanneolbinado95 Hi hi! Okay, let me tackle these one at a time. (1) One glm model for 7 predictors is probably not going to be great (especially if there are interactions!). These models tend to be *overfit* which means that you are trying to split your data into too many little boxes-- fewer predictors means more explanatory power (check dharma part of the video-- you can check dispersion of your model using dharma too!). (2) So, how to reduce the number of predictors? Well, you can do the backward selection that you mention, for sure. I don't love to use this method *initially* because it can get rid of the variables you are actually interested in! (because stepwise isn't a biologist, you are!). I would first check if any of your predictors are collinear/autocorrelated! (ex. mass and length are two variables that often are highly correlated-- when you have too much autocorrelation between predictors, they get mad at each other and wreck your model) -- here, you can check correlation between variables *and choose which one is more biologically reasonable* to keep in the model-- drop the other ones. (3) If option 2 isn't working out for you, a GLM just might not be the right model for your data! I would start thinking about a PCA or more advanced modelling, like mixed models. Hope this helps :)
Create a Google API key for R from Scratch
4:51
Chloe Fouilloux
Рет қаралды 1,4 М.
Having Fun with Random Effects in Mixed Models (GLMMs)
12:37
Chloe Fouilloux
Рет қаралды 1,6 М.
Неприятная Встреча На Мосту - Полярная звезда #shorts
00:59
Полярная звезда - Kuzey Yıldızı
Рет қаралды 7 МЛН
Smart Sigma Kid #funny #sigma #comedy
00:25
CRAZY GREAPA
Рет қаралды 5 МЛН
THEY WANTED TO TAKE ALL HIS GOODIES 🍫🥤🍟😂
00:17
OKUNJATA
Рет қаралды 8 МЛН
Understanding Generalized Linear Models (Logistic, Poisson, etc.)
20:19
R package reviews | glmulti | Find The Best Model !
13:27
yuzaR Data Science
Рет қаралды 11 М.
Interpreting R's regression table
17:11
James Scott
Рет қаралды 15 М.
Mixed Effect Regression for Factorial Designs
34:17
Keith Lohse
Рет қаралды 2,2 М.
Understanding the glm family argument (in R)
16:15
Kasper Welbers
Рет қаралды 19 М.
Introduction to Generalized Additive Models with R and mgcv
3:22:40
Bottom of the Heap
Рет қаралды 70 М.
What is GLMM, and When Should You Use It?
1:01:13
statistics complete course
Рет қаралды 6 М.
Visualizing Regression Models in R
1:03:44
Elijah Appiah
Рет қаралды 3,9 М.
Неприятная Встреча На Мосту - Полярная звезда #shorts
00:59
Полярная звезда - Kuzey Yıldızı
Рет қаралды 7 МЛН