Combining Random Forests and GLMs in R

Views: 5,215

Quant Psych

1 day ago

Comments: 22
@decimus6 1 year ago
Thanks a lot for this video! It's awesome as usual. A question arose in my mind: RFs are good at selecting predictors, yet how do they handle multicollinearity? Is it possible that two or three good predictors are multicollinear? Thanks a lot.
@damaranaidoo9855 3 months ago
Hi there, I am looking into RF analysis for my MSc thesis but I am struggling a bit. I am looking at the effect of various environmental variables (predictor variables) on a binomial response variable (absence/presence). The data are non-linear and somewhat skewed. I have tried running a GLM and a GAM, but neither model fits well; both underfit the data. Do you think an RF analysis combined with the GLM would be more appropriate in this case?
@nikidiogou4203 10 months ago
Another very useful video for stats, thank you for all of them! The estimates from the rf using flexplot seem not to align with the variable importance score (vi). Shouldn't we have the same ranking of variables when we look at the estimates and when we look at the vi?
@OnLyhereAlone 1 year ago
Very informative as usual. The case for linear mixed models (LMM) brought me to your channel. Question: could random forests be used to determine which variables to include in an LMM too? Thanks again for the great work you do.
@QuantPsych 1 year ago
Yes. There was a paper I saw recently that builds random forest atop mixed models: onlinelibrary.wiley.com/doi/abs/10.1002/sam.11505 Alternatively, I've averaged the scores within cluster, used RF to find variables, then used mixed models on those variables.
@mohamedrefaat197 3 years ago
Thanks for the quality content! I wonder what you mean by transportable at the beginning?
@QuantPsych 3 years ago
Have you checked out this video? kzbin.info/www/bejne/jKKudquQfJaWl6s I believe that explains what "transportable" means.
@tatjanajak 2 years ago
cforest() from the party package takes a loooong time. But when I try to use the result of randomForest() from the randomForest package within estimates(), I get the following error: Error in x$r.squared : $ operator is invalid for atomic vectors. I guess the results of these two functions are different and only cforest() can be used within flexplot functions. I hope you will introduce randomForest() into this whole process in the future. I think it's worth it because cforest is just too memory- and time-consuming.
@christoph3933 10 months ago
What do you do in case of missing values? Do you recommend doing Multiple Imputation before?
@QuantPsych 7 months ago
Yes!
@gimanibe 3 years ago
Thanks for the videos you make. I learn a lot! Is this R script available somewhere?
@QuantPsych 3 years ago
The code in the video should work. If I find time, I'll put it in the description.
@scottnelson7841 1 year ago
No matter how many times I load the package and library, I get this error message: Error in variable_dropout(explained_rf, type = "raw") : could not find function "variable_dropout". Any help?
@francisolsson9728 2 years ago
Can you use random forests with both categorical and numeric variables?
@tatjanajak 3 years ago
@QuantPsych it seems you did not use GLMs but rather a standard lm.
@QuantPsych 3 years ago
Possibly. I haven't watched this video for a while :) But I used to use GLM to refer to general linear models and GLIM to refer to general*ized* linear models. I switched the notation somewhat recently. I might have meant to refer to LMs instead of GLMs.
@tatjanajak 3 years ago
@QuantPsych 20:58 is where I believe the error occurs. It is really just a small mistake and not a big deal. The whole presentation is awesome as usual, and the only thing to do is to say at 20:58 "I meant glm instead of lm".
@woosterjeeves 1 year ago
@tatjanajak GLM is variously used to refer to the General Linear Model (fitted with lm() in R) or the Generalized Linear Model (fitted with glm() in R), which is what you are referring to. He is talking about the General Linear Model, so it is not a "mistake".
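To make the lm() versus glm() distinction concrete, here is a minimal sketch in base R. It uses the built-in mtcars data purely for illustration (an assumption; this is not the data or code from the video), and the variable choices are arbitrary. It fits the same model with lm() and with glm() under a Gaussian family, which should give matching coefficients, and then fits a logistic model, which only glm() can do.

```r
# Illustrative sketch only: mtcars and these formulas are assumptions,
# not the data or models used in the video.

# General linear model: ordinary least squares via lm()
fit_lm <- lm(mpg ~ wt + hp, data = mtcars)

# Generalized linear model with a Gaussian family and identity link;
# this is the special case that coincides with the general linear model
fit_glm <- glm(mpg ~ wt + hp, data = mtcars, family = gaussian())

# Compare the two sets of coefficients up to numerical tolerance
same_coefs <- isTRUE(all.equal(coef(fit_lm), coef(fit_glm)))
print(same_coefs)

# Generalized linear model with a non-Gaussian family:
# logistic regression for a binary outcome (am is coded 0/1)
fit_logit <- glm(am ~ wt, data = mtcars, family = binomial())
print(coef(fit_logit))
```

So whether "GLM" is a slip or not depends on which expansion of the acronym is meant: lm() covers the general linear model, while glm() is needed once the family is something other than Gaussian.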
@tatjanajak 1 year ago
@woosterjeeves thanks.
@Martyr022 3 years ago
still waiting on that paper!
@QuantPsych 3 years ago
Here it is! psyarxiv.com/ebsmr
@Martyr022 3 years ago
@QuantPsych Huzzah! Thank you! Your channel has been super helpful!