I have been trying to find workflow videos on regression analysis for a while now, this is the first (and only one) that I found. It helped me immensely, thank you.
@mronkko7 ай бұрын
You are welcome. It is surprising that very few people teach how to actually use the analyses in empirical research practice.
@THEPSYCHOTIC7 ай бұрын
@@mronkko that's true. Most videos cover only interpretation of results or are focused on let's say one part of the analysis but no one covers the whole process in a single video, with a single dataset. Just an idea - You could maybe consider doing a workflow series focusing on how to do analysis with different combinations of explanatory/response variables? Let's say one categorical explanatory variable, 1 exp and 1 quantitative, 2 categorical exp var, and so on. And the same logic with explanatory - quantitative vs qualitative. I'm not sure if you've done it already, but it'd be so so helpful! Thanks again, keep up the good work. I wish you good luck!
@BrinderSadler5 ай бұрын
A very informative video that is clear and uses examples so that viewers can better follow. Thank you.
@mronkko5 ай бұрын
You are welcome!
@magnusjensen58673 жыл бұрын
Best explabation I’ve come across on KZbin! Keep up the good work
@mronkko3 жыл бұрын
Glad it helped!
@newtonocharimenyenya24583 жыл бұрын
A Great Piece. Simple to understand.
@mronkko3 жыл бұрын
Glad you think so!
@bezaeshetu54543 жыл бұрын
Thank you for the nice and clear explanation.
@mronkko3 жыл бұрын
You are welcome!
@Youtube304s6 ай бұрын
Subscribed. Very good
@mronkko5 ай бұрын
You are welcome.
@whx20443 жыл бұрын
Thank you for teaching !
@mronkko3 жыл бұрын
You are welcome.
@newtonocharimenyenya24583 жыл бұрын
A very Great piece.
@mronkko3 жыл бұрын
Thanks
@harijha6279 Жыл бұрын
best explanation
@mronkko Жыл бұрын
Good that you liked it!
@statistikochspss-hjalpen8335 Жыл бұрын
Great video. My question is what to do when ln transformation doesn't help? Imagine a regression with only Likert scale variables (1-5). Customer satisfaction as the dependent variable and product quality, customer service as independent variables. Most customers score 4 or 5 on the all variables. Almost all of the MLR assumptions are not met. How to approach the problem? I read about PLS being an alternative instead of OLS, but my coefficients are almost identical with both OLS and PLS (don't know if it's because of a fairly big dataset, n=8000).
@mronkko Жыл бұрын
If your scales are poorly calibrated so that you get just 4s and 5s in a 1-5 scale, then I do not think that there is anything that you can do except to collect better data. How to approach the "allmost all assumptions are not met": I would start by looking at a specific assumption first and what you can do about it. For example, if the relationships are not linear, then I would start thinking about using nonlinear functional forms.
@statistikochspss-hjalpen8335 Жыл бұрын
@@mronkko Thank you for taking the time to respond. The data is real and based on real customers. The satisfaction metric (dependent variable) is already well established in the industry. If I'm interpreting my normal probability (y axis shows percent and x axis shows residual) plot it looks like 7% of the observations are off the line. The residuals go from minus 10 to positive 5. The residual vs fits, the residuals slope downwards as the fitted value increases.
@mronkko Жыл бұрын
@@statistikochspss-hjalpen8335 If the residual slopes downward, then you might have nonlinearity and you need to consider other functional forms. The fact that a measure is well-established does not necessarily mean that the data are good. For example if you want to assess the effect on persons height on persons weight, but only measure people between 180 and 181 cm, then normal measurement tape would not suffice because it is not precise enough. The same can happen in your data, if you have little variation in satisfaction you might need a measure that is calibrated differently. I think I talk about measurement calibration in one of the measurement presentations, but I am not 100% sure about that.
@ltang2 ай бұрын
Around 7:49 are farmers less prestigious than the model predicted or more? What does sitting below the y=x line mean?
@mronkko2 ай бұрын
They are more prestigious. Check the residual on the y-axis. Anything above zero is respected more than what the model predicts.
@ltang2 ай бұрын
@@mronkkoSo it is below the y=x line just means that theoretically on that percentile we would expect the residual to be even higher? What does the theoretical percentile mean? Is it just based on rank
@rutwikkadane24093 жыл бұрын
Thanks for the explanation!
@mronkko3 жыл бұрын
Glad it was helpful!
@faemillongo68392 жыл бұрын
Thanks. So clear
@mronkko2 жыл бұрын
Happy that you find it helpful. The lack of reporting that regression diagnostics were done is a big problem in published research. And this would be so easy to fix. Pay attention to your model assumptions and justify them.
@kar21943 жыл бұрын
Hi Thanks for the content! 3:09, you said you have a video of the regression coefficient, I can't find it, I would like to check it out :)
@mronkko3 жыл бұрын
Good question. The videos are from a course that I run and I have organized them as KZbin playlists. This video is from the third study unit and the video that I refer to is from the second unit: kzbin.info/www/bejne/oXyoYmCffM6src0
@kar21943 жыл бұрын
@@mronkko Thanks!
@auddssey Жыл бұрын
i want to see the r code for residual vs leverage plot, how the occupation outliers appear :-)
@mronkko Жыл бұрын
The slides are linked in the video description and contain some R code in the slide notes library(car) data(Prestige) reg1
@zwan18863 жыл бұрын
In your AV plots around 15:00 isn't it showing that the women regressor doesn't add anything to the model?
@mronkko3 жыл бұрын
Yes. that is what the model shows. Also he regression coefficient in the table at 2:58 shows that the effect of women is nonsignificant.