Hindi Machine Learning Tutorial 10 - Decision Tree

  Рет қаралды 56,815

codebasics Hindi

codebasics Hindi

Күн бұрын

Decision tree algorithm is used to solve classification problem in machine learning domain. In this tutorial we will solve employee salary prediction problem using decision tree. First we will go over some theory and then do coding practice. In the end I've a very interesting exercise for you to solve.
#MachineLearningHindi #PythonMachineLearning #MachineLearningTutorial #Python #PythonTutorial #PythonTraining #MachineLearningCource #DecisionTree
Code: github.com/cod...
To download csv and code for all tutorials: go to github.com/cod..., click on a green button to clone or download the entire repository and then go to relevant folder to get access to that specific file.
Website: codebasicshub.com/
Facebook: / codebasicshub
Twitter: / codebasicshub

Пікірлер: 83
@poojabehera8675
@poojabehera8675 4 жыл бұрын
sir, please make separate videos on Info gini, Gini Impurity & Information gain
@muhammadjaffarrazadar967
@muhammadjaffarrazadar967 4 жыл бұрын
waiting for you to upload Deep learning tutorials in hindi
@codebasicsHindi
@codebasicsHindi 4 жыл бұрын
Sure Muhammad, I have noted this down in my Todo list.
@muhammadjaffarrazadar967
@muhammadjaffarrazadar967 4 жыл бұрын
​@@codebasicsHindi Desperately waiting.
@ajaykushwaha4233
@ajaykushwaha4233 3 жыл бұрын
Sir aap itna accha explain kerte hain ki kya batayein. Sab kuch samajh aa jata hai.
@codebasicsHindi
@codebasicsHindi 3 жыл бұрын
ajay bhai, dhanyavad.
@DEV-un3ok
@DEV-un3ok 4 ай бұрын
For NaN(missing values) values I calculated Mean of Age column and replaced the NaN values in the Age column with the mean of Age column and when I split (train_test_split method)the data I got 80.44 % accuracy but when I did not split the data I got 97.41 % accuracy. But when I visualise my Decision tree then the one with the train_test_split method looks more accurate so which one is correct?
@Gemreviewer
@Gemreviewer 5 ай бұрын
Hello sir, just wanted to confirm that, in the 4th step in the Jupiter notebook, instead of creating 3 objects for Label Encoder, we could have created just one object and the used it for the 3 columns. Was there any particular reason to use it 3 times? Please let me know
@niku237
@niku237 Жыл бұрын
Sir... ANN algorithm bhi kra dijiye hindi mei... hmare liye kaafi benefit hoga..
@rushikeshdargeai5154
@rushikeshdargeai5154 4 жыл бұрын
97.97% Thank you!!!
@ashishmachcha8626
@ashishmachcha8626 4 жыл бұрын
Can you make more video of machine learning in python in hindi....it is very helping me.
@ashkraze
@ashkraze 2 жыл бұрын
i read that LabelEncoder should only be used for dependent variables only , can we use it for independent values also??
@farman4164
@farman4164 Жыл бұрын
Make a video on Deep learning pls..
@nidhivishwakarma1339
@nidhivishwakarma1339 Жыл бұрын
bro you are just amazing..
@kainaatmakhani6550
@kainaatmakhani6550 Ай бұрын
good job.
@Ishowmeat8
@Ishowmeat8 3 жыл бұрын
Everyday i visit this channel and i watched videos after liked it .
@abhasharma9455
@abhasharma9455 3 жыл бұрын
Sir according to initial 1st minute how can we check data is distributed like this so that we will apply DT. Thank u very much Sir for great lectures.
@govind1937
@govind1937 4 жыл бұрын
sir firstly i did without splitting into train and test, then it gave score 0.9777 and after splitting it gives 0.822222 taking .05 as test_size. and there is a strong correlation between Pclass and Fare, so we can use only one.
@varshaasabe1192
@varshaasabe1192 Жыл бұрын
Hello sir, i am seen all your vedio and thanks for that , I understand but I am not understand how import csv file in R. Can me explain I am doing data science. T
@mazharbukhari786
@mazharbukhari786 3 жыл бұрын
Sir, one question, from where you learnt?
@VishalMalik2023
@VishalMalik2023 Жыл бұрын
my model predict without using train_test_split so they predict score is 97% , then i using train_test_split with test_size = 0.2 then they predict score is 82%
@abhasharma9455
@abhasharma9455 3 жыл бұрын
Sir u explain great.
@codebasicsHindi
@codebasicsHindi 3 жыл бұрын
Glad it was helpful!
@ashwiniawati2297
@ashwiniawati2297 2 жыл бұрын
Hi Sir, you are doing really a great job. Kudos to you 🙌. I just solved the Titanic exercise with inputs Age, sex, Pclass and Fare. I got my score 0.75. Please advise about the score. Is it recommended to choose single column as an input and then check the score?
@owais20017
@owais20017 2 жыл бұрын
please send source code
@sagar8460830871
@sagar8460830871 4 жыл бұрын
how to drop variable please explain correlation and p-value. i am totaly confuse
@debdyutimohanta9007
@debdyutimohanta9007 2 жыл бұрын
There are lots of Nan values in age column
@apnadesicricket
@apnadesicricket 4 жыл бұрын
Titanic program ka score aaya 0.98765 is it correct please tell first time i done the program from my own
@codebasicsHindi
@codebasicsHindi 3 жыл бұрын
IT is correct
@apnadesicricket
@apnadesicricket 3 жыл бұрын
@@codebasicsHindi thanks so much Sir, watching your videos great help for me, then i understand the ML in simpler way. Thanks again.
@neerajrokaya1132
@neerajrokaya1132 2 жыл бұрын
Sir, I got 81.6% with test_size = 0.25 and random_state = 10 . I checked with your github repo and found that you have taken mean to fill the nan value in age column and got ans as 79.3%. I did the box plot and found out that there are many outliers. So, will you please explain in short about the mean and median selection.
@data_science_06
@data_science_06 11 ай бұрын
mean is used for when data is normally distributed... median is used for when some outliers are existing in data..... mod is used .... when your data has category..... like:- 0,1,2,0,1,1 and male , female and so on....
@akshatsingh6036
@akshatsingh6036 4 жыл бұрын
sir tensorflow kb aaegi
@ashmitsingh5473
@ashmitsingh5473 4 жыл бұрын
I got 1.0 is it fine..??
@me_debankan4178
@me_debankan4178 2 жыл бұрын
I have tried the exercise and each time i am getting more accuracy using Logistic regression than decision trees
@as-it-is6627
@as-it-is6627 14 күн бұрын
Wow, what an explanation.
@poojabehera8675
@poojabehera8675 4 жыл бұрын
why my fit.transform() throwing error as: transform() missing 1 required positional argument: 'y' I have written corect code
@fooglefirst9738
@fooglefirst9738 3 жыл бұрын
fit_transform() ye wala correct karloo aap ko aagayegaa
@oldisgold5803
@oldisgold5803 2 жыл бұрын
mine too is throwing the same error..... Did you find way out of it?
@DEV-un3ok
@DEV-un3ok 4 ай бұрын
df['column'] = column.fit_transform(df['column'])
@anantsharma5330
@anantsharma5330 4 жыл бұрын
I am getting 98.59% Model Score. I have Label Encoded all columns of inputs. Is it wrong?
@manu-prakash-choudhary
@manu-prakash-choudhary 3 жыл бұрын
you have probably not used train test split method
@prathameshmore5262
@prathameshmore5262 2 жыл бұрын
Got 0.97 score on titanic dataset using decision tree
@DopeGaming719
@DopeGaming719 4 жыл бұрын
got a score of 79.10 is it ok??
@preetiranichaudhary9758
@preetiranichaudhary9758 3 жыл бұрын
my score is 0.7867(input column is pclass and sex and target variable is survived) then i got score is 0.7867..is it correct or not???
@codebasicsHindi
@codebasicsHindi 3 жыл бұрын
yes it is correct. 78% is a pretty good score
@preetiranichaudhary9758
@preetiranichaudhary9758 3 жыл бұрын
@@codebasicsHindi thanku sir 4 answering
@dheerajyadav3675
@dheerajyadav3675 3 жыл бұрын
@@codebasicsHindi can i have it's code?
@mashalamjadkhan9351
@mashalamjadkhan9351 2 жыл бұрын
Sir please share csv file argent
@DharmendraKumar-pf4fs
@DharmendraKumar-pf4fs 4 жыл бұрын
My score is 0.977 is this correct for Titanic exercise .???
@codebasicsHindi
@codebasicsHindi 4 жыл бұрын
Yup.97.7 is a pretty good score. Good job 👏👍
@karmakarishan56
@karmakarishan56 2 жыл бұрын
result is 1
@anilkumarsharma8901
@anilkumarsharma8901 2 жыл бұрын
Apney subscribe waley ko artificially intelligent ka support dilva do phir Duniya following karegi
@ajmeershahtricks6666
@ajmeershahtricks6666 2 жыл бұрын
sir i got 1.0 score (accuracy) but im confused that what does it means ? is this the accuracy in %?
@mohammedmouizuddin4403
@mohammedmouizuddin4403 2 жыл бұрын
1.0 means 100% accuracy rate and 0.98... , 0.95 ... means 98% , 95% accuracy rate ....
@ajmeershahtricks6666
@ajmeershahtricks6666 2 жыл бұрын
@@mohammedmouizuddin4403 thank you bro❤
@owais20017
@owais20017 2 жыл бұрын
please @Ajmeer Shah tricks send me sorce code
@RahulVerma-ud1rr
@RahulVerma-ud1rr 2 жыл бұрын
:D 97.75% accuracy
@kamilazim5498
@kamilazim5498 4 жыл бұрын
For Titanic Dataset Test_size = 0.3 model.score = 0.8059701492537313
@darshankachhadiya8014
@darshankachhadiya8014 3 жыл бұрын
My model score is 0.81
@gaurangsharma5428
@gaurangsharma5428 3 жыл бұрын
My score is 0.997755310886644
@uddiptasaikia3121
@uddiptasaikia3121 Жыл бұрын
Sir , after coverting "male" and "female" to 1 & 2 respectively , it is not showing 1 and 2 while i type inputs.head()...it is showing NaN....please help.
@uddiptasaikia3121
@uddiptasaikia3121 Жыл бұрын
@codebasicsHindi
@NationalistNewsNetwork
@NationalistNewsNetwork Жыл бұрын
How it behave on irrational input. pls reply
@waidapapa1514
@waidapapa1514 11 ай бұрын
If we see the actual documentation of scitkit learn of label encoder we will know that label encoder ia used for target or output variable only.
@izainonline
@izainonline Жыл бұрын
Many Thanks finally I found the clear guidance for example how and why transform the data into numeric and how to train,Inplace?,drop or add column reason. Thanks again
@sanjayrajbanshi7698
@sanjayrajbanshi7698 2 жыл бұрын
I got 80% accuracy by doing train_test_split method (Changed null value in 'Age' column and Converted 'Sex' column into dummy variable column).
@musakhan9779
@musakhan9779 2 жыл бұрын
Why is Decision Tree a regression model if it classifies. Isn't Regression model expected to have continuous data and prediction.
@PankajVerma-p8y
@PankajVerma-p8y 2 ай бұрын
my mode score is 98% .. Really amazing videos sir. Aap jo samjhate ho an maza aa jata hai and concept clear ho jata hai...
@subhamsaha2235
@subhamsaha2235 3 жыл бұрын
Sir aapne to isme decision tree banaya hi nhi????????????????
@shivam7164
@shivam7164 Жыл бұрын
some values in age are missing.
@sudarshandev6369
@sudarshandev6369 3 жыл бұрын
lajwaab sir awesome explanation
@mashalamjadkhan9351
@mashalamjadkhan9351 2 жыл бұрын
please share the csv file
@rockfighter9974
@rockfighter9974 Жыл бұрын
Really very nice video
@shahidmalik6107
@shahidmalik6107 Жыл бұрын
very well explaind
@soumyaprakashdasmohapatra7717
@soumyaprakashdasmohapatra7717 4 жыл бұрын
got a score of 74.8% is it correct?
@muhammadjaffarrazadar967
@muhammadjaffarrazadar967 4 жыл бұрын
I tried to do it by my own and did it with 0.9797979797979798 test scores. but I didn't bread the data into test and train. maybe that's why.
@codebasicsHindi
@codebasicsHindi 4 жыл бұрын
Yes splitting data in test and train is important otherwise your model is biased and you get a high score
@shivanshsingh5555
@shivanshsingh5555 4 жыл бұрын
0.82
Hindi Machine Learning Tutorial 11 - Support Vector Machine
24:32
codebasics Hindi
Рет қаралды 36 М.
Part 1-Decision Tree Classifier Indepth Intuition In Hindi| Krish Naik
34:17
Ozoda - Lada (Official Music Video)
06:07
Ozoda
Рет қаралды 11 МЛН
How To Get Married:   #short
00:22
Jin and Hattie
Рет қаралды 20 МЛН
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 254 М.
Decision Tree Classification Clearly Explained!
10:33
Normalized Nerd
Рет қаралды 670 М.
Hindi Machine Learning Tutorial 12 - Random Forest
19:40
codebasics Hindi
Рет қаралды 27 М.
Decision and Classification Trees, Clearly Explained!!!
18:08
StatQuest with Josh Starmer
Рет қаралды 748 М.
How To Self Study AI FAST
12:54
Tina Huang
Рет қаралды 549 М.