Binary logistic regression using Stata (2018)

  Рет қаралды 133,808

Mike Crowson

Mike Crowson

Күн бұрын

This video provides a demonstration of the use of Stata version 14 to carry out binary logistic regression. It covers menu options and syntax, and reviews post-estimation options that are available to you.
You can download a copy of the data file used in the video here: drive.google.com/file/d/13ioH...
You can download a copy of the referenced "do file" here: drive.google.com/file/d/15i_Q...
UPDATE: I have uploaded a new video using Stata 17 to KZbin, with a focus on using drop-down menus for generating and interpreting results. You can access that video here: • Binary logistic regres...
For more instructional videos and other materials on various statistics topics, be sure to my webpages at the links below:
Introductory statistics:
sites.google.com/view/statist...
Multivariate statistics:
sites.google.com/view/statist...

Пікірлер: 62
@willhauscarriague9639
@willhauscarriague9639 4 жыл бұрын
Thank you! The first video I have found that not only goes over how to run the code, but how to interpret results. My Econometrics Capstone project is entirely better because of this video.
@mikecrowson2462
@mikecrowson2462 4 жыл бұрын
Glad to be of help Will! Also, although you may be finished with your project, you might visit this newer video I put together in 2019 on binary logistic regression in Stata. It also has a Powerpoint that goes into more details (see under video description). Please share with your fellow students! Best wishes!
@kaizoku4life
@kaizoku4life 4 жыл бұрын
Thank you!! you are a lifesaver! Well done with the overview!
@user-kj9ms4it9q
@user-kj9ms4it9q 2 жыл бұрын
Clear message, clear structure, easy to understand, thank you
@noorrizvi1657
@noorrizvi1657 6 жыл бұрын
Thanks for such a good explanation!
@mikecrowson2462
@mikecrowson2462 6 жыл бұрын
Thank you for watching! I appreciate it!
@instantexodus209
@instantexodus209 5 жыл бұрын
incredible job at explaining !
@dayakarpeddi892
@dayakarpeddi892 4 жыл бұрын
Thank you, it is so useful for beginners
@jingxu1388
@jingxu1388 3 жыл бұрын
Thanks! This video is so helpful. Really benefit from it~
@mikecrowson2462
@mikecrowson2462 5 жыл бұрын
DEAR VIEWERS: I have recently uploaded a new video (2019) on binary logistic regression in STATA that INCLUDES MUCH MORE INFORMATION ON RUNNING AND INTERPRETING RESULTS. There are also ADDITIONAL SUPPLEMENTAL LEARNING MATERIALS THAT ARE LINKED UNDER THE VIDEO DESCRIPTION. PLEASE CHECK OUT: kzbin.info/www/bejne/jWaWkoB8hKiasK8
@AK-md1mr
@AK-md1mr 4 жыл бұрын
Hi Mike! Thanks for the video!! Do you have any recommended reading on when to treat ordinal items as metric? I am writing my final paper and ne some source on that... best
@Milica_Mirkov
@Milica_Mirkov 4 жыл бұрын
Thank you very much! Very useful!! 😀
@gayatreesahoo9839
@gayatreesahoo9839 Жыл бұрын
Excellent👍. Your video really helped me a lot. Thank you so much sir.
@emptyxnes
@emptyxnes Жыл бұрын
Thank you for going over it and explaining. Question, let say in the polint variable, if it was to increase by a factor of 1, 2, 3, etc. How would the odds change? How can one model or calculate that? Thank you.
@wasafisafi612
@wasafisafi612 3 жыл бұрын
Thanks a lot for the videos. I am very grateful
@hosnywahyana6332
@hosnywahyana6332 3 жыл бұрын
thank you, you are lifesaver
@rahulgupta6440
@rahulgupta6440 5 жыл бұрын
How to find percentage of correct predictions in ordered probit models? estat classification command is not working. Please guide
@Jdonovanford
@Jdonovanford 3 жыл бұрын
Excellent and fun explanation. I would like to test if a series of continuous IV predict two categorical variables (income level and gender). As you see, the DV are two, one of them is binary (gender) and the other has several levels (low, medium, high, extreme). What type of regression shall I use? I considered bivariate logistic regression, but here one of the DV is not binary.
@tajfaa
@tajfaa 3 жыл бұрын
In my case I have a multinomial logit and my problem is that STATA has much limited options regarding goodness-of-fit comparing with binary logit (basically only AIC and BIC)
@sidrazain5251
@sidrazain5251 4 жыл бұрын
If the predictor is a continuous variable then how to interpret the odds ratio?
@tewoderosmeleaku6627
@tewoderosmeleaku6627 Жыл бұрын
Thanks Mike for sharing document and analysis on logit output and its interpretation. What dose indicate the result of ldfbeta? additionally, if you have on multivariate probit analysis please share as.
@carlosvaz6338
@carlosvaz6338 7 ай бұрын
So very helpful! Thanks
@mikecrowson2462
@mikecrowson2462 3 жыл бұрын
Hi folks, I have just added a new 2021 video on binary logistic regression using Stata version 17 (kzbin.info/www/bejne/hqeom5Weo655n8U), with a focus on using the drop-down menus in the program. This video (and the accompanying Powerpoint) demonstrates how to run the analysis and how to interpret results. I have an additional video (kzbin.info/www/bejne/aKe8ooyjgJuVoac) demonstrating how to obtain fully and partially standardized regression coefficients for your logistic regression model. I hope you try these out!
@mariaqeight6986
@mariaqeight6986 4 ай бұрын
Great video but there was some mixed information regarding the negative coefficient. First, you said that males would be more likely to fall in the intent category than females and then later on you said that a negative coefficient means it is less likely to fall in the target category. So a negative coefficient means more likely or less likely to fall in the target category?
@allaboutlife40
@allaboutlife40 2 жыл бұрын
Hello how i can paired my data set or matched the case controls for logistic regression analysis
@Cihan_Quotes
@Cihan_Quotes 5 жыл бұрын
Do you know how to interpret Oaxaca results in stata? Best wishes
@dehiole6463
@dehiole6463 3 ай бұрын
What a lifesaver❤❤❤
@mikecrowson2462
@mikecrowson2462 3 ай бұрын
I'm so happy you found this video useful!
@RobertSisoev
@RobertSisoev Жыл бұрын
Best video ever
@johnedwards4440
@johnedwards4440 4 жыл бұрын
I'm so confused as to why when I my run my logit model (with results as coefficient), I get different coefficients and standard errors depending on how I order the independent variables. Each of these variables is binary.
@sdgskenyaforum1178
@sdgskenyaforum1178 3 жыл бұрын
There are only two binary variables and actually gender is nominal and the rest treated as continuous, i think that should pretty give the same results
@user-ki9co1xw1f
@user-ki9co1xw1f 5 ай бұрын
which tools you have to use to write the text during presentations
@martijnvaningen768
@martijnvaningen768 6 жыл бұрын
Any tips on what to do with my data when most variables seem to be insignificant?
@jibinjoseph5133
@jibinjoseph5133 3 жыл бұрын
Discard the analysis, quit academia/job and live your life in the woods
@thidachawhlaing3494
@thidachawhlaing3494 5 жыл бұрын
Thanks so much for this video which helps my research. But, I am still confusing the Stata result table from logistics regression of "Odds ration" and "Coefficient". You mean, in Stata result, the interpretation of "coefficient" differs with OLS, but "Odds Ration" shows the same interpretation with OLS? Is it? Sorry for silly questions as I confused a lot on that point. Because I got my Stata result with "Odd rations" but I am now facing difficulties how to interpret my logistics regression model. Could you please give me more explanation and guidance how to interpret this "Odds ratio" please. Thanks in advance.
@dipenmodi1807
@dipenmodi1807 4 жыл бұрын
The coefficient in OLS is not the same as the coefficient in Probit/logit. In order to get the same meaning, you need to calculate the marginal effects. In Stata, you can do this by using the ''mfx'' command. Odds ratio is basically the ratio of the probabilities occurring vs not occurring and this can only be done for the logit model
@user-ck6eh3dt1q
@user-ck6eh3dt1q 3 жыл бұрын
Thanks for the video! If I have a unbalanced dataset, I have different observations for two different logistic regressions. How can I do to check whether there is an increase of model fitness? I am looking forward to your response. Thanks in advance!
@awfan221
@awfan221 4 жыл бұрын
So Mike, what do we do if our specificity or sensitivity is really low. I ran into a low specificity (18%), while my sensitivity was 92%. What would one do in this situation?
@mikecrowson2462
@mikecrowson2462 4 жыл бұрын
Hi David, the difference in sensitivity and specificity generally indicates that the model is doing a better job of predicting one outcome than the other. Your model does not necessarily correctly predict group membership on the DV with the same level of accuracy. If you are wanting to increase accuracy of true negatives (specificity) then you might consider other theoretically relevant predictors that might do a better job of predicting group membership. I don't know what the nature of your research is, but the difference in classification accuracy could be less important than the type of classification errors. Just an FYI, I have a much new video on logistic regression with Stata - and there are accompany learning materials you can download at links underneath the video description. I hope you check it out: kzbin.info/www/bejne/jWaWkoB8hKiasK8 Best wishes!
@teresacorreia2866
@teresacorreia2866 Жыл бұрын
Hi Mike! First of all: thank you very much for sharing your knowledge with us. However, I have a small doubt over the definition of some exogenous variables you have considered. For istances, when you say DOG you mean the extent to which a person perceives herself as being dogmatic, right? And regarding EPE, what really does it mean? I cannot understand at 1:26... Thank you in advance and sorry for bothering
@mikecrowson2462
@mikecrowson2462 Жыл бұрын
Hi Teresa. Yes, higher scores on DOG indicate a person who is higher in dogmatism (lower scores indicate lower levels of dogmatism). EPE refers to External Political Efficacy. Higher scores indicate greater efficacy and lower scores indicate lower efficacy. Thanks for visiting :)
@giantafrica7204
@giantafrica7204 3 жыл бұрын
Dear May I ask you if I can use binary logistic for the data on drivers of land use land cover change that collected from three agroecology?
@mikecrowson2462
@mikecrowson2462 3 жыл бұрын
Hi there, and thanks for your question. The fact is that the choice of statistical approach is heavily dependent heavily on the type of data you are working with. If you have a dependent variable that is binary and you are seeking to predict the probability of a case falling into one category or the other on that DV, where the probability of a given outcome is modeled as a function of one or more predictors, then binary logistic regression is a viable approach. By the way, I do have a more recent video that you should check out on this topic. I hope you check it out! kzbin.info/www/bejne/jWaWkoB8hKiasK8 Cheers!
@chrisisah8006
@chrisisah8006 5 жыл бұрын
Very helpful video. Many thanks! I am a bit confused about the nature of data to use in BINARY log regression. My concern is with the basic of data type. while binary suggests dichotomous values, you used continous values - POLINT, DOG & EPE. will it make any difference in the result if these values a dichotomized prior to regression?
@mikecrowson2462
@mikecrowson2462 5 жыл бұрын
Hi Chris, thanks for visiting! I'm not sure I totally understand your question. Are you asking about dichotomizing the IV's or the DV? The DV is binary. The IV's can be either continuous OR binary. For the record, it is permissible to include a binary predictor directly into the model as a predictor (moreover dummy coding the predictor to 0 and 1 can aid in interpretation). You could also, if you wish, treat the binary predictor as a factor and use the 'i.' prefix. [Because I tend to like to have more control over the dummy coding (rather than relying on program defaults for dealing with factor variables, which can be annoying in programs such as SPSS where the default may select the reference category for you ), I often just do the recoding myself and then enter the dummy variable directly into the model. If you have a factor with more than two levels, then could still manually choose to do the dummy coding yourself (and enter the dummy variables directly, as described above), or you could (again) use the 'i.' prefix and allow the program to dummy code your variables (but with the limitation that the default will select your reference category).] By the way, iif you want a more "up-to-date" presentation on binary logistic regression (and includes some additional supplemental learning materials under the video description), check out this KZbin video I've recently put together kzbin.info/www/bejne/jWaWkoB8hKiasK8 . I hope I have successfully answered your question!
@chrisisah8006
@chrisisah8006 5 жыл бұрын
@@mikecrowson2462 Many thanks for the feedback, Yes indeed, i was referring to dichotomizing the IVs. now, i understand that the IV can be continous given a binary DV, and one can play safe by dummy-coding to 0 and 1 (for ease of interpretation, yea?). Although i'm a bit fuzzy on the usefulness of having a "factor" predictor in the model. does it help in the model "fit" or ensure the analyst (not the program) has control on the reference category choosen? I will most definitely view your recent videos as suggested. Thank you.
@mikecrowson2462
@mikecrowson2462 5 жыл бұрын
Hi Chris, yes using dummy coding (0 and 1) can facilitate interpretation - particularly of the intercept in regression models. What I mean with 'factor' variables is a variable that is nominal or ordinal. With appropriate re-coding of a variable of this type, you can include it in a regression model (including binary logistic regression). Here's another video I have made on that topic (although the demonstration uses SPSS): kzbin.info/www/bejne/jnjPk3qXhNiJb7c Best wishes.
@chrisisah8006
@chrisisah8006 5 жыл бұрын
@@mikecrowson2462 Ok. I will watch the video. Many thanks for your time and explanations. cheers!
@chrisisah8006
@chrisisah8006 5 жыл бұрын
@@mikecrowson2462 Hi Mike. thanks once more for your awesome videos, they were helpful. Please, I have details of a BLR analysis; DV (outcome) = transition (0=unsuccessful, REF; 1=successful) Predictor = religion (0=christian, REF; 1=islam)...inserted in model as "i." factor variable coefficient -0.0524; p>(z) 0.909 odds ratio 0.948; p>(z) 0.909 I concluded that "religion" is non-significant predictor of "successful" transition, loosely. in further explaining the output; odds of successful transition is .909 for practicing Islams than for Christians. Do you think my explanations of the results are good? Thank you.
@alfredoncsantana8515
@alfredoncsantana8515 5 жыл бұрын
How can I define what variable is continuous OR categorical , before including these variables in logistic regression?
@nkiruanyika
@nkiruanyika 5 жыл бұрын
c.(variable name) = continuous i.(variable name)= categorical , the dot is included in the syntax.
@peteramutungire9576
@peteramutungire9576 Жыл бұрын
What do you do if you find out that there is no good model fitness?
@lakshmipriya-ql5py
@lakshmipriya-ql5py 6 жыл бұрын
Can u quickly help me how to calculate inverse mills ratio
@mikecrowson2462
@mikecrowson2462 6 жыл бұрын
My understanding is that you don't compute inverse mills ratio when using binary logistic regression. It appears to be used in the context of probit regression. But here's a link that might give you the information you are looking for: statalist.org/forums/forum/general-stata-discussion/general/1348264-probit-inverse-mills-ratio
@anhnguyen-um1lw
@anhnguyen-um1lw 2 жыл бұрын
Character In the video It's great, I like it a lot $$
@FulTV444
@FulTV444 3 жыл бұрын
Good
@CristianArm13
@CristianArm13 6 жыл бұрын
Mike I Need your help pleaseeee
@mikecrowson2462
@mikecrowson2462 6 жыл бұрын
Hi Alejandro, did you have a question you need help with?
@sandeepsivakumar4959
@sandeepsivakumar4959 4 жыл бұрын
Difference between Odds and Odds ratio 14:35
@sandeepsivakumar4959
@sandeepsivakumar4959 4 жыл бұрын
Thank you Prof for a clear explanation of the Logistics regression model. I have a request, If you give time stamps for important parts of the video, it would be helpful.
@mikecrowson2462
@mikecrowson2462 4 жыл бұрын
@@sandeepsivakumar4959 Thanks for your posting. I'll have to look into the time stamps. I've never done that. However, what I have been doing over the last year is putting together Powerpoints and supplemental info with my videos. There is a much newer video here (kzbin.info/www/bejne/jWaWkoB8hKiasK8), and the Powerpoint that accompanies it can be downloaded here (drive.google.com/open?id=1HLjPcZZqXGwwa7Uee1A38L2WWoRbz5HK). Cheers!
Multilevel modeling using STATA (updated 2/9/18)
33:20
Mike Crowson
Рет қаралды 58 М.
Logistic Regression with Stata
39:03
Dominic Wells
Рет қаралды 112 М.
Викторина от МАМЫ 🆘 | WICSUR #shorts
00:58
Бискас
Рет қаралды 5 МЛН
Doing This Instead Of Studying.. 😳
00:12
Jojo Sim
Рет қаралды 23 МЛН
Они так быстро убрались!
01:00
Аришнев
Рет қаралды 2,3 МЛН
The very basics of Logit and Probit models in Stata.
28:33
Mike Jonas Econometrics
Рет қаралды 23 М.
Binary logistic regression using Stata syntax (March 2021)
27:56
Mike Crowson
Рет қаралды 24 М.
Testing assumptions of Logistic Regression using STATA
11:35
Stata: Interpreting logistic regression (Low)
8:35
Dana R Thomson
Рет қаралды 66 М.
Binary Logistic Regression with Categorical predictor in STATA
10:37
Dr. Mahmoud Omar (Statistics)
Рет қаралды 6 М.
Викторина от МАМЫ 🆘 | WICSUR #shorts
00:58
Бискас
Рет қаралды 5 МЛН