Machine Learning Tutorial Python - 14: Naive Bayes Classifier Algorithm Part 1

  Рет қаралды 241,910

codebasics

codebasics

Күн бұрын

Пікірлер: 152
@codebasics
@codebasics 2 жыл бұрын
Check out our premium machine learning course with 2 Industry projects: codebasics.io/courses/machine-learning-for-data-science-beginners-to-advanced
@codebasics
@codebasics 5 жыл бұрын
Part 2 of this naive bayes tutorial. Email spam detection: kzbin.info/www/bejne/pHmshoytg5JoqK8
@agvlogs5773
@agvlogs5773 3 жыл бұрын
Great things is that - you recommend other people's channel as well. It seems u r just trying to make people learn something no matter whose channel. Respect🙏🙏
@Marshall_Mohammed
@Marshall_Mohammed 9 ай бұрын
This is the first time I am exploring machine learning and Python, I have never tried to learn Python. But your tutorials are just awesome, it is much easier to learn and understand the concepts. Great Work!❤👏
@ZEHRAİLKAYASLAN
@ZEHRAİLKAYASLAN 2 ай бұрын
I think you are the best teacher. Thanks to you, I am learning data science. I hope I can find a job in this field. Greetings from Turkey.❤🤩
@prakharmishra2977
@prakharmishra2977 5 жыл бұрын
thanks a lot sir,for your great support,I started my data science path through your videos really great mentor,altruistic human being i am proud of you sir!!
@codebasics
@codebasics 5 жыл бұрын
hey Prakhar, thanks for your kind words and I wish you all the best. I am sure you will become a successful data scientist one day. good luck :)
@vibestouchessoul2602
@vibestouchessoul2602 4 жыл бұрын
@@codebasics sir we waant full play list of deep lerning and real world data science and machine learning projects
@ashish31416
@ashish31416 2 жыл бұрын
Simplicity is the ultimate sophistication. You are amazing!
@flamboyantperson5936
@flamboyantperson5936 5 жыл бұрын
Great to see you back with a new tutorial. Your idea of first explaining theory then going to practical is awesome. That's awesome.
@codebasics
@codebasics 5 жыл бұрын
hey flaboyant person. I was expecting to see your comment. How are you ?
@flamboyantperson5936
@flamboyantperson5936 5 жыл бұрын
@@codebasics It's impossible I won't like and reply to your video. I am great fantastic. How are you and how is your health now?
@aryanverma7506
@aryanverma7506 5 ай бұрын
Your Playlists worked as Revision before my interview. Thank You for your support
@codebasics
@codebasics 4 жыл бұрын
Exercise solution: github.com/codebasics/py/blob/master/ML/14_naive_bayes/Exercise/14_naive_bayes_exercise.ipynb Step by step guide on how to learn data science for free: kzbin.info/www/bejne/jJ_CnqCFqraeiaM Machine learning tutorials with exercises: kzbin.info/www/bejne/nZ7Zp5Sll9Jqm7M
@ayushibansal7947
@ayushibansal7947 3 жыл бұрын
This is the best tutorial on you tube. I understand concepts easily.
@codebasics
@codebasics 3 жыл бұрын
I am happy this was helpful to you.
@midhunskani
@midhunskani 4 жыл бұрын
We need more tutorials on deep learning and start a new AI tutorials. Your machine learning tutorials are really good
@LS15QD
@LS15QD 3 жыл бұрын
You are fantastic! If you were a lecturer you would be the one everyone likes!
@sreesai4729
@sreesai4729 Жыл бұрын
That was the best ever tutorial I watched about naive bayes.... Thank you so much ❤
@levyax1964
@levyax1964 7 ай бұрын
Great intro. For the last line of code in notebook, I think we shall use X, y instead bc now it's time for full model evaluation: np.mean(cross_val_score(GaussianNB(), X, y, cv = 5))
@geeksfornongeeks4546
@geeksfornongeeks4546 3 жыл бұрын
You really know how to explain jargons in simple language. Thanks a lot
@zananpech1522
@zananpech1522 5 ай бұрын
I found that RandomForest classifier performs slightly better than Naive bayes model. anw, love your tutorials, thank u for your hard works :)
@deepanshuaggarwal7042
@deepanshuaggarwal7042 5 жыл бұрын
Your teaching skills are best. Please continue this series and covers all topics of ML. If not possible, then plz provide link so that we can study. There is no channel which teaches ML the way you are. Hope, you will come to INDIA and do your dream job "Organic farming "
@codebasics
@codebasics 5 жыл бұрын
Oh deepanshu.. I want to do that and spread awareness of eating right. Anyways but yea I have plan to cover many more topics in ML, stay tuned.
@shaiksuleman3191
@shaiksuleman3191 4 жыл бұрын
Super B SIr Cyrstal clear Explanation.There are some many videos on machine learning but no one cann't explain as you.
@codebasics
@codebasics 4 жыл бұрын
Thanks and welcome
@AmanSingh-bk1um
@AmanSingh-bk1um 4 жыл бұрын
Good keep doing these AI videos, i liked it good to see the flow of functions in single video.
@milindprakash8855
@milindprakash8855 4 жыл бұрын
You are doing wonderful job ...really learnt a lot from your videos
@user-wr4yl7tx3w
@user-wr4yl7tx3w 2 жыл бұрын
this is a really good example explanation
@MDNAZMUNHASANNAFEES-yz7vq
@MDNAZMUNHASANNAFEES-yz7vq 8 ай бұрын
you are the best man. I am being not Naive.
@aniketjha5919
@aniketjha5919 4 жыл бұрын
Sir these are the best videos with best explanation,Thanks alot for these sources. Please try to upload more projects and please help and explain more detail about when to use which classifiers. Thankyou
@ravikumarrai7325
@ravikumarrai7325 3 жыл бұрын
Sir, I got one doubt here that, since you have created dummies in this project, you should drop the first dummy column, in order to avoid Multi-colinearity. Please revert back with your comments if I am wrong
@r0cketRacoon
@r0cketRacoon 9 ай бұрын
I agree, have u figured out the answer?
@dragnar4743
@dragnar4743 3 ай бұрын
oh yeah, dummy variable trap
@supersql8406
@supersql8406 3 жыл бұрын
Great video, great teaching, great speed, great other misc stuff like fillna, drop, concat. Please make more of these types of videos! Subscribed!
@codebasics
@codebasics 3 жыл бұрын
I am happy this was helpful to you.
@spicytuna08
@spicytuna08 2 жыл бұрын
what a gift of explanation!!!
@sidharthkumthekar3179
@sidharthkumthekar3179 2 жыл бұрын
Very well explained sir! Thanks :-)
@Spiray
@Spiray 4 жыл бұрын
Wonderful video, thank you. Simple but well-explained! It has helped me a lot. =)
@codebasics
@codebasics 3 жыл бұрын
Glad it helped!
@msctube45
@msctube45 4 жыл бұрын
Thank you for this very well explained Tutorial.
@codebasics
@codebasics 4 жыл бұрын
Glad it was helpful mario!
@channel-lz5og
@channel-lz5og 3 жыл бұрын
Always love the u teach. ..u are amazing
@codebasics
@codebasics 3 жыл бұрын
Glad it was helpful!
@jepsmyt
@jepsmyt 5 жыл бұрын
Hello, Thanks for the explanation. I was wondering dont we need to normalize the data? Let me know your thoughts on this.
@mohannads2757
@mohannads2757 2 жыл бұрын
Naive can deal without any Feature Scaling
@devanshgoel9070
@devanshgoel9070 2 жыл бұрын
thank you bhai for the explanation
@pranjalgupta9427
@pranjalgupta9427 4 жыл бұрын
Very nice video sir can I used logistics regression also because I think it also give same result as naive bayes
@dragnar4743
@dragnar4743 3 ай бұрын
Generally, people suggest for using GaussianNB if we are dealing with continuous feature like age & fare. And BernoulliNB for Binary type of data like female & male. categoricalNB for discrete features like Pclass. So, can GaussianNB can take all types of features??
@GlobalDee_
@GlobalDee_ 3 жыл бұрын
Great, thanks for this series...Pls can u do series on evaluation metrics,I will love to see explicit explanation on it.
@mohammadkassir3545
@mohammadkassir3545 5 жыл бұрын
Please complete tutorials for deep learning
@sulimanallahgabo1080
@sulimanallahgabo1080 5 жыл бұрын
just i was looking for , thanks sir oh u re great great .....
@codebasics
@codebasics 5 жыл бұрын
Thanks man. I really appreciate love from all of you :)
@sulimanallahgabo1080
@sulimanallahgabo1080 5 жыл бұрын
@@codebasicsas usual many thank to u pls if u have material for data science of don't mind send to me ,suliman_allahgabo@yahoo.com
@nastaran1010
@nastaran1010 10 ай бұрын
Hi. Thanks so much.
@ashishzine14
@ashishzine14 2 жыл бұрын
Great job sir thank u
@codebasics
@codebasics 2 жыл бұрын
Glad it was helpful!
@tagoreji2143
@tagoreji2143 3 жыл бұрын
thanks a lot SIR
@pythonenthusiast9292
@pythonenthusiast9292 4 жыл бұрын
sir , how did you come to know that the data is a bell curve (gaussian distribution).
@mohammedmunavarbsa573
@mohammedmunavarbsa573 4 жыл бұрын
super video
@yes_i_am8378
@yes_i_am8378 4 жыл бұрын
Hey @codebasic you are teaching way is awesome. i hv question here. why is there a target variable in unsupervised learning?
@dhainik.suthar
@dhainik.suthar 3 жыл бұрын
why my predict_proba return >1 and
@azus9819
@azus9819 3 жыл бұрын
same issue
@manavgora
@manavgora 8 ай бұрын
can we also do "One Hot Encoding" instead of dummy variables.
@codingmadesimplified
@codingmadesimplified 4 жыл бұрын
Awsome!But I Have A doubt why we have not normalize out dataset certain columns?
@Aniquekhan
@Aniquekhan Жыл бұрын
thanks
@aaditstudent
@aaditstudent 2 жыл бұрын
I have a small query here. Why did we not drop the either female/male column after one hot encoding to avoid dummy variable trap?
@JatinSharma-tu2zg
@JatinSharma-tu2zg 4 жыл бұрын
Sir Hindi main bhi bana dijiye pls aap bohut acha samjha rahe hain
@codebasics
@codebasics 4 жыл бұрын
sure jatin. KZbin me codebasics hindi search karo, maine already those ML ke video hindi me upload kiye hai.
@mariav1234
@mariav1234 3 жыл бұрын
Do you not need to remove one of the dummies "male" or "female"? It does not make sense to have both of them, since in that dataset, who is not male is female. To my knowledge that is an essential step.
@zutubee
@zutubee Жыл бұрын
Yes your right, these are negatively correlated. Including both of the features would make gender have higher influence on inference
@flamboyantperson5936
@flamboyantperson5936 5 жыл бұрын
I have a request if you could make one video on it that would be very helpful. I want to know when we make a UDF in python how can I check it at each and every step function is working or not before completing the whole UDF
@kamilazim5498
@kamilazim5498 4 жыл бұрын
sir make tutorials on Natural language processing(NLP)
@विशालकुमार-छ7त
@विशालकुमार-छ7त 3 жыл бұрын
From where you have learned NLP???? ( I assume that you have done something for learning NLP)
@cedricvumisa7416
@cedricvumisa7416 3 жыл бұрын
thank you
@cnuvadali
@cnuvadali 4 жыл бұрын
super
@debatradas9268
@debatradas9268 3 жыл бұрын
thank you so much
@codebasics
@codebasics 3 жыл бұрын
Glad it was helpful!
@sayantandas7054
@sayantandas7054 4 жыл бұрын
Sir, why there is a target variable????? like it's a clustering algorithm i.e unsupervised...and target variable is used in supervised
@nitishkeshri2378
@nitishkeshri2378 3 жыл бұрын
"target" is a variable, you can take anything as a variable
@SaiChand4492
@SaiChand4492 3 жыл бұрын
Good explanation but the problem is, in the dataset I'm not able to find jack and rose :/
@ramandeepbains862
@ramandeepbains862 2 жыл бұрын
Sir plz explain when to take the mean median mode for null values .....
@anasahmed4682
@anasahmed4682 4 жыл бұрын
shouldn't we split the data first and then perform preprocessing or does it not matter?
@laxmandesai573
@laxmandesai573 3 жыл бұрын
Afaik, the order dosen't really matter. If you split first, you'd need to preprocess all the parts individually.
@jaiprathapgv2273
@jaiprathapgv2273 4 жыл бұрын
Should I use mean method or median method for Nan because in one video you told us to use median method in this video it is mean method. Which one is the best right to use?
@zutubee
@zutubee Жыл бұрын
Median is more robust to outlier so generally a better idea. Features like age are normally distributed hence mean can also be safely used
@michaelrall3916
@michaelrall3916 3 жыл бұрын
Just for the ones who might be as stupid as me and were missing the "survived" column: On kaggle there are two files. One test and one train file. Take the train file instead :)
@OnlineGreg
@OnlineGreg 2 жыл бұрын
thanks for this video! one thing i dont understand: at 2:33 P(queen) should be 1/4 or? and P(diamond) 1/13. as there are 4 queens and 13 diamonds in the deck
@ivanherrera10
@ivanherrera10 2 жыл бұрын
You are right, I think there is an error in the presentation.
@HerlonCosta
@HerlonCosta 5 жыл бұрын
Waw, i am a first! 😃😃 #LovePython
@codebasics
@codebasics 5 жыл бұрын
oh yup. Hanzo.. you got the "first commenter" award :) ha ha...
@rahmaaja6600
@rahmaaja6600 Жыл бұрын
thanks a lot.. can i ask to you, what if i want to show xtest result after tf-idf sir? I have tried only with the xtest code but the results are not as desired
@Gauravkr0071
@Gauravkr0071 5 жыл бұрын
Hii , the lectures are just amazing , cn u plz make a tutorial on how to write custom layers in keras like we make in variational autoencoder . Plz man therre is almost no resource in internet explaining it properly
@codebasics
@codebasics 5 жыл бұрын
Thanks gaurav for appreciation. I have noted down the topic you suggested and will get to it in future 👍
@mvs69
@mvs69 3 жыл бұрын
why we use train_test _split, we can use cross_validation fro better results, cant we?
@nikhilgaikwad9954
@nikhilgaikwad9954 4 жыл бұрын
In a categorical dataset , how can we decide whether the problem can be solved by using Naive Bayes algorithm or no? Or which algorithm will give high accuracy?
@codebasics
@codebasics 4 жыл бұрын
Based on type of problem you might end up using one or the other algorithm. You can use gridsearchCV to evaluate performance of different alogs with different parameters. Please watch my video in this same series "it is called hypertunning parameters using gridsearchcv"
@MohamedAshraf-zs6nv
@MohamedAshraf-zs6nv 4 жыл бұрын
how you decide which feature to keep and use in the model and which to drop? I mean is there any strategy to handle this situation?
@Star_z_07
@Star_z_07 20 күн бұрын
may i know the reason of using one hot encoding but not label encoding ??...can anyone clear my doubt
@worldcomingtoanend
@worldcomingtoanend Жыл бұрын
by saying male as a feature are u sure u did not confuse it with sex. I thought sex would be the feature with values such as male, female. Unless you meant male as a feature taking on values yes or no?
@JainmiahSk
@JainmiahSk 5 жыл бұрын
Sir how is your health? I'm waiting for your videos. Sir, how to encode multiple variables at once?
@codebasics
@codebasics 5 жыл бұрын
hey Jainmiah, my health is improving. The full recovery might still take one complete year but at least I am in a position to upload videos now.
@JainmiahSk
@JainmiahSk 5 жыл бұрын
@@codebasics I Pray God to recover you fast to your GOOD Health. #LoveCodeBasics and #LoveYouSir.
@sukantithakur4225
@sukantithakur4225 4 жыл бұрын
hi,the titanic data we used earlier in DecisionTreeClassifier model. i campared the score is higher in DTC than Naive Bayes and we get probability in DTC also,so just wanted to ask how to know which model is the best to use in realtime?please suggest.
@Yang11235
@Yang11235 Жыл бұрын
Does we don't need to drop one dummy column? Dose the dummy variable trap only for linead_mode?
@r0cketRacoon
@r0cketRacoon 9 ай бұрын
I agree, have u figured out the answer?
@akshaybanaye
@akshaybanaye 4 жыл бұрын
Hi, I am getting following error: ValueError: Input contains NaN, infinity or a value too large for dtype('float64'). But I checked and there is no NA value in my dataframe. inputs.info() Gives me this output: RangeIndex: 891 entries, 0 to 890 Data columns (total 5 columns): # Column Non-Null Count Dtype --- ------ -------------- ----- 0 Pclass 891 non-null int64 1 Age 891 non-null float64 2 Fare 891 non-null float64 3 female 891 non-null uint8 4 male 891 non-null uint8 dtypes: float64(2), int64(1), uint8(2) memory usage: 22.7 KB
@onatmufasa0522
@onatmufasa0522 2 жыл бұрын
I have question, why you did not drop either fenale or male column? In your previous tutorials, you said one column should be dropped if converting using dummy. Thanks...
@r0cketRacoon
@r0cketRacoon 9 ай бұрын
I agree, have u figured out the answer?
@aryanverma7506
@aryanverma7506 5 ай бұрын
Better than Andrew NG
@jyrust1713
@jyrust1713 4 жыл бұрын
hey u know parameters prior_fit in naive bayes for what? i dont understand in documentation thx
@karannchew2534
@karannchew2534 3 жыл бұрын
Hi, A couple of questions, hope someone could help please: 1) I thought Gaussian NB only take continuous features value. But here, there are continouse (e.g. Age) and discrete (e.g. Gender) value. Can I use Gaussian NB if all features are discrete value? 2) One hot encoding split the Gender data into two data: Male and Female. These features are related i.e. mutually exclusive. Does Gaussian NB algorithm jointly 'process' these two data as one feature or two separate feature? Hope someone could enlighten. Thanks.
@thebrainfeed402
@thebrainfeed402 2 жыл бұрын
hey i have the same doubts regarding Gaussian NB. Did you figure it out? Would be really helpful for me:)
@isaakimaliev5584
@isaakimaliev5584 3 жыл бұрын
y should be a 1d array, got an array of shape (179, 5) instead.
@sougandhhm
@sougandhhm 5 жыл бұрын
hello sir, Please do some videos on Natural Language processing, I am waiting for this badly
@codebasics
@codebasics 5 жыл бұрын
wow....actually thats the topic I am going to cover next. You read my mind almost. I will start that series soon.
@245uday
@245uday 4 жыл бұрын
@@codebasics yes...i am also waiting...I started studying data science a few days back...your way of teaching are simply awesome...
@maresh_vlsi
@maresh_vlsi 4 жыл бұрын
could not convert string to float: 'Birnbaum, Mr Jakob' how to eliminate this error
@Lifebw9to5
@Lifebw9to5 4 жыл бұрын
sir why you have used GaussianNB model instead of using Logistic model or any other
@codebasics
@codebasics 4 жыл бұрын
Because this tutorial is on naive Bayes 😊
@Lifebw9to5
@Lifebw9to5 4 жыл бұрын
@@codebasics so can be use any other model to perform this task of mail classification or this model is best suited for this task
@ayazahmed9611
@ayazahmed9611 5 жыл бұрын
Sir can u upload more videos on ML ALgorithms like this??
@codebasics
@codebasics 5 жыл бұрын
yes Ayaz. sure. I have the plan to upload more videos on this topic.
@asheeshyadav5519
@asheeshyadav5519 4 жыл бұрын
Sir Is it necessary to learn behind mathematics of machine learning algorithm or some overview of mathematics of machine learning algorithm Sir please tell me please Because I am very confused
@codebasics
@codebasics 4 жыл бұрын
You need to know some math. Not very much in depth. So don't worry too much about it. If you want to become machine learning engineer or data scientist who solves complex problem than of course advanced math knowledge is always useful
@asheeshyadav5519
@asheeshyadav5519 4 жыл бұрын
@@codebasics Sir if I want machine learning Engineer than if I have some knowledge of math Please reply I wait
@nilanjanbanik7509
@nilanjanbanik7509 3 жыл бұрын
@6:20 Why can't we use LabelEncoder instead of panda's dummy variables?
@abhinavkale4632
@abhinavkale4632 3 жыл бұрын
both can be used, it is just that he is more used to one hot encoder.. that is the same pd.get_dummies(" " ). Otherwise, the results will be the same for both.
@abhinavkale4632
@abhinavkale4632 3 жыл бұрын
pd.get_dummies(df,columns=['sex']) could have just done this.. no need to perform concati. @6:45
@fahadabdullah510
@fahadabdullah510 2 жыл бұрын
How P(diamond/queen) is 1/4 Can somebody explain to me?
@tagoreji2143
@tagoreji2143 3 жыл бұрын
SIR ,HOW TO IMPROVE ITS ACCURACY?
@YYZ-e1w
@YYZ-e1w 4 жыл бұрын
isn't it female and male column 'highly related'?
@tanyabhalla5083
@tanyabhalla5083 4 жыл бұрын
Input contains NaN, infinity or a value too large for dtype('float64'). - Can you help with the errror?
@nitishkeshri2378
@nitishkeshri2378 3 жыл бұрын
use fillna() method to fill that "NAN" value as shown in video
@TheEngineerpodcast
@TheEngineerpodcast Жыл бұрын
can't the sex column be simplified by LabelEncoding??
@harmainiwilliam5539
@harmainiwilliam5539 3 жыл бұрын
We can't able to download ur code... It's coming invalid
@codebasics
@codebasics 3 жыл бұрын
I have checked all URL's working perfectly. Please check URL in description.
@rinsyarifuddin8696
@rinsyarifuddin8696 4 жыл бұрын
Please help me, i got this error message "could not convert string to float : 'male' " someone can explain me why it's happen to me?
@vatsal_gamit
@vatsal_gamit 4 жыл бұрын
Use sklearn encoding
@RishabhSingh-bh7fu
@RishabhSingh-bh7fu 4 жыл бұрын
I am getting file not found error as I import data set using the same code
@nmana9759
@nmana9759 4 жыл бұрын
if you're using IDE you should import the file to the workspace you're working in
@rudrarajput4764
@rudrarajput4764 5 жыл бұрын
Boss, can you please make a lecture on reinforcement learning and also one lecture on Q learning??
@muhammadsaqibsangi1495
@muhammadsaqibsangi1495 3 жыл бұрын
where is code of this tutorial??
@codebasics
@codebasics 3 жыл бұрын
check video description for github link
@ashutoshpurohit8603
@ashutoshpurohit8603 4 жыл бұрын
Where to get the dataset?
@akshaybanaye
@akshaybanaye 4 жыл бұрын
github.com/codebasics/py/tree/master/ML/14_naive_bayes
@Kaafirpeado54-6ayesha
@Kaafirpeado54-6ayesha 2 күн бұрын
Islam free world ❤
@adamassrkfan
@adamassrkfan 5 ай бұрын
7:04
Machine Learning Tutorial Python 12 - K Fold Cross Validation
25:20
Как Я Брата ОБМАНУЛ (смешное видео, прикол, юмор, поржать)
00:59
СКОЛЬКО ПАЛЬЦЕВ ТУТ?
00:16
Masomka
Рет қаралды 3,5 МЛН
Увеличили моцареллу для @Lorenzo.bagnati
00:48
Кушать Хочу
Рет қаралды 8 МЛН
Naive Bayes, Clearly Explained!!!
15:12
StatQuest with Josh Starmer
Рет қаралды 1,1 МЛН
Naive Bayes classifier: A friendly approach
20:29
Serrano.Academy
Рет қаралды 146 М.
The Math Behind Bayesian Classifiers Clearly Explained!
11:53
Normalized Nerd
Рет қаралды 94 М.
Bayes theorem, the geometry of changing beliefs
15:11
3Blue1Brown
Рет қаралды 4,5 МЛН
Naïve Bayes Classifier -  Fun and Easy Machine Learning
11:59
Augmented AI
Рет қаралды 441 М.
ML Was Hard Until I Learned These 5 Secrets!
13:11
Boris Meinardus
Рет қаралды 341 М.
Math for AI and ML: This strategy makes learning so much easy!
11:20
Как Я Брата ОБМАНУЛ (смешное видео, прикол, юмор, поржать)
00:59