Great video. Please upload videos implementing various other tests
@roksanarezaei96084 жыл бұрын
Krish, Thank you for the explanation. I have a question. Why didn't you use the P-Value and Chi_Square values that the contingency function provides and you calculated them separately? even the numbers you got are not the same.
@lenaara4569 Жыл бұрын
your explanations are really awesome! Thank you😊
@pramodkumargupta18244 жыл бұрын
Krish, Really nice video. what steps should we take after we perform these test. I have following question- 1. What should we do if two features are related with each another. Do we need to exclude one in feature selection? Or what should we do? 2. If there are independent, then we are good to take both features in our model for prediction?
@6shipra4 жыл бұрын
I have the same questions to ask
@Rahul-gn7px4 жыл бұрын
if one feature can be derived or is highly dependent on another variable it would be wise to remove it for example age and birth date
@ankitayadav26903 жыл бұрын
Very nice explanation sir
@pranabmishra26094 жыл бұрын
Thanks, Explanation is Clear and Concise. Able to understand properly.
@amalsunil47224 жыл бұрын
Guys just a tip here...u can simplify the process of obtaining the X2 statictic-> X2_statistic=(observed_values - estimated_values)**2/estimated_values X2_statistic=X2statistic.sum() and make sure observed_values and estimated_values are numpy arrays
@vanessaleiko4 жыл бұрын
Thank you so much for this explanation!
@kaifahmed3163 жыл бұрын
Grate explanation sir thank you 😊
@ajithshenoy55664 жыл бұрын
Hey Krish , Love your videos. Kindly upload more videos in the machine learning pipeline section. The last one is feature selection . Interpretation and deployment videos would be largely appreciated.
@krishnaik064 жыл бұрын
Sure
@badiyabhargav85973 жыл бұрын
Sir I have doubt.. At 11:42 u said that chi2_statistic should always be greater than critical value then only we retain null hypothesis but in the code our chi2_statistic value smaller than critical value in if condition u gave if(chi2_statisti>=critical value): print(reject ho and accept h1 there is a relation) else: print(retain ho there is no relation) I think we have to reject the ho null hypothesis if chi2_statistic is greater than critical value
@akashsoni58704 жыл бұрын
Am I the only one who saw statquest by josh starmer? I am following statquest before krish Naik sir lecture...... believe me statquest is very good for indepth knowledge
@bhagyashreemohanta78264 жыл бұрын
Thank you so much... 🙂 Highly Obliged..... 🙏
@omkarpatil28544 жыл бұрын
hello Krish, awesome video series as always. if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
@sandipansarkar92114 жыл бұрын
thanks krish .great explanation
@mohammedahtesham20214 жыл бұрын
if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
this is what im confused, it already returns at [1] index (0.925417020494423) from chi2.contigency result, then why we should recalculate the p value. hopefully someone can explain
@varunupadhyay15762 жыл бұрын
@@hilmanrevisionery130 Did you got it?
@AK-ws2yw3 жыл бұрын
Hey Krish, I had a doubt. If i have 4 columns which have 2 character type data. Eg. Let 4 columns name be A,B,C,D and all these 4 columns are categorical data that is all 4 columns have Yes and No type data. My aim is to find whether all 4 columns have a Yes. Which Test should i go for in that case.
@PravinKumar-zc2eq2 жыл бұрын
Hi, Krish ur videos are really helping me understand these concepts in a easy way thank you . Is there any possibility a video on ANOVA??
@biranchinath84283 жыл бұрын
Thank you sir for your help.
@shobitjain96194 жыл бұрын
Sir, can you Make videos on different different pairwise metrics in sklearn like cosine similarity, sigmoid krnel, rbf kernel etc..
@SHIVAMBAJPEYIMIM4 жыл бұрын
Thank you so much, this makes my day:)
@abhinavsharma72913 жыл бұрын
Krish, Thank You! Any video on ipynb file explaining ANOVA test ?
@solar_girl_here3 жыл бұрын
Amazing. Thanks
@lokanathshroff33014 жыл бұрын
Sir, not able to see the big data playlist
@abhinaygupta82433 жыл бұрын
suppose there are around 50 features in my data set so, should i do the chi square test for each of two features and same with others also , it will more time consuming...........or we will directly find the correlation as per pair plot and select one out of similar ones
@priyaduttbhatt56913 жыл бұрын
simply perfect!
@pratikchatterjee59924 жыл бұрын
Hi Krish. Nice video. Where are the big data videos?
@mohammedahtesham20214 жыл бұрын
if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
@AsifMarazi2 жыл бұрын
Instead of writing chi_square_statistic=chi_square[0]+chi_square[1].... for number of rows just replace this line with chi_square_statistic=chi_square.sum(), So you need not to worry about the writing all the rows in case of having more rows
@nanditasharma67664 жыл бұрын
Krish, you told there is a relationship & one will have some effect on another as they have relation. So we have to consider one or both variable?????????? if we consider one then it will definitely get effected as they are related with each other... considering one will give miss effect on target then????
@Balubindass4 жыл бұрын
Hi Krish Naik, I-am following you channel and it is very clear and easily understandable. After your z test and T test video, i tried doing some hypothesis test. Here is my example and would need your help if i am doing it wrong. I have a file with 5000 rows And i am considering as a population and i have assuming hypothesis. Null Hypothesis as age 30 This is one tailed test. So here is my question do i need create a sample from population or else i need filter age >=30 and consider it as sample? And if z score table 1.694 and z test gave 3.54 the do i need reject null hypothesis? Please kindly help me.
@akshayvishnukishore22823 жыл бұрын
Question: why did we calculate the p-value again? cant we just use the p-value returned from the chi2_contingency() ?
@amansinghrathore83083 жыл бұрын
+1
@varunupadhyay15762 жыл бұрын
@@amansinghrathore8308 Did you got it?
@rajulshakya48994 жыл бұрын
Nice video
@mohammedahtesham20214 жыл бұрын
if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
@sushantshekhar80824 жыл бұрын
Krish, please upload similar implementation video for Anova test aslo
@MJAYRECORDS4 жыл бұрын
Hi Krish the video is good can u tell me the solution for the chi square test coding for marital status and different education level problem
@AkshayDudvadkar3 жыл бұрын
What do we do when we have multiple categorical columns ??
@anupamasonnad2204 жыл бұрын
Hi Krish, If I have to figure out the association/ relation between more than 2 categorical variable , will that be done using Chi2? If I have to test the multicollinearity between more than 2 categorical variables, can we convert them into numeric and apply VIF?
@mohammedahtesham20214 жыл бұрын
if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
@saumyamishra52033 жыл бұрын
plzz tell me what 1st 2 values in the result of function chi2_contingency is ....as i was thinking that 1st one is chi_statistics_value & 2nd one is p_value.???
@gurdeepsinghbhatia28754 жыл бұрын
too gud sir ,
@mohammedahtesham20214 жыл бұрын
if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
@gurdeepsinghbhatia28754 жыл бұрын
@@mohammedahtesham2021 Dear Mohd , p_value is just a probability value that assures our result , the main result is in correlation , let us take an example , suppose we got correlation of +1 with p_value of 0.05 , then it means that the 2 variables have positive 1 correlation with 0.05 probability ie with 5% of accuracy , now why only 5 percentage for this u must see the hypothesis testing video of the Krish Sir , for further doubts mail me at gsbhatia111@gmail.com , if u feel , i hope my reply helps u thanks
@alextjflorida3 жыл бұрын
Thank you for the video. It seems Python is not efficient in running statistical tests. You have to get one single test results by taking too many steps. Other software packages can do a better job in this department.
@PrinceKumar-eb8hd4 жыл бұрын
e to upar sea nikal gaya..koi nai..jab jarurat hoga...tab dubara sea research kiya jaye ga...vese thanq sir..
@Tejashri_Kate2 жыл бұрын
And how do we know if there is type1 or type2 error?
@vincetechclass33902 жыл бұрын
What about negative values?
@sushantrauthan57044 жыл бұрын
Thanks for the video but i have a doubt , i've never really grasped the concept how of how you choose the hypothesis in some cases you choose NULL hypothesis for the motion and in some cases you choose the hypothesis against the motion.LIke how does that work?
@amalsunil47224 жыл бұрын
yes it's very important...as we always assume the H0 hypothesis to true while testing/finding the p-value. H0: there's no significant difference (just do this for all cases...it can be btw 2 variables,a sample mean nd a given population mean etc)
@amitjajoo95104 жыл бұрын
Thanks
@mohammedahtesham20214 жыл бұрын
if p-value is high then both samples are related to each other right? in your code, there is a condition where if p_value
@dheerajkumark22684 жыл бұрын
Sir while finding p value, can we give pdf instead of cdf
@vaibhavmohite4684 жыл бұрын
Can you explain goodness of fit test in python
@rayhankabir6458 ай бұрын
Please would you help me with this dataset
@minakshi_1193 жыл бұрын
Can anyone please help me with Expected_Values=val[3], why here val[3] means..
@parikshitgurjar55453 жыл бұрын
Hello guys , Plese can anyone explain :- The Degree of freedom = 1 in the output what this "1" signifies.
@rushin30902 жыл бұрын
can anyone send this playlist?
@amadoum.jallow6204 жыл бұрын
Sir, can you please recommend me a very good book for statistics.
@chetanmazumder3104 жыл бұрын
What about anova ?
@manishbolbanda98724 жыл бұрын
getting error for sns.load_dataset('tips') even though i have imported seaborn
@minakshi_1193 жыл бұрын
Can anyone please help me with Expected_Values=val[3], what here val[3] means..