Tutorial 44-Balanced vs Imbalanced Dataset and how to handle Imbalanced Dataset

Views: 42,608

Krish Naik

Comments: 61
@shahnawazkhan1636 4 years ago
Great, sir. There is no need to join any institute to learn data science; just follow Krish Naik sir's playlist.
@sushantapanda4589 5 years ago
You are a great tutor. The way you explain shows your hold on the subject. Awesome.
@mayurkhandeshe4813 4 years ago
Your teaching is very effective, sir. Very easy to understand.
@abhijitsarkar5946 5 years ago
Nice series. Get going. The numbers should be 630, 270 and the accuracy exactly 90%. This is the same as your original imbalance.
@SandeepSingh-tf7ni 5 years ago
A simple approach for beginners. I would really appreciate it if you could do a demo with a dataset (1000 rows) of 4-5 features, and please also explain XGBoost. Thanks in advance. Looking forward to your response.
@arjyabasu1311 5 years ago
Up to what ratio should we consider a dataset balanced?
@arjyabasu1311 4 years ago
@Kushal Hu what ratio is that?
@louerleseigneur4532 3 years ago
Thanks Krish
@tusharbhatnagar3146 4 years ago
Can you make a video/tutorial on hyperparameter tuning in classification algorithms? It has been coming up in many interviews.
@venkataraomannem6585 5 years ago
Well done, sir. Thanks for sharing; very easy for everyone to understand.
@praveensingh1234 4 years ago
Very nicely explained. Thanks a lot.
@manishshukla125 5 years ago
Thanks, sir. Please make a video on overfitting and underfitting.
@neelpatel3844 4 years ago
Very informative, thank you.
@aayushijain2160 4 years ago
Sir, I have a doubt: should an imbalanced dataset be handled by using the right evaluation metrics or by these sampling techniques? Please let me know, I'm very confused.
@cinemascope8847 4 years ago
@aayushi jain SMOTE can be used when we want to increase the minority-class data. Safest technique.
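For readers who have not seen SMOTE, the core idea can be sketched in a few lines: pick a minority point, pick one of its nearest minority neighbours, and create a new point somewhere on the line between them. This is only an illustration in plain Python, not the reference implementation; the function name `smote_sketch`, the toy points, and the defaults `k=3` and `seed=0` are all assumptions made for the example.

```python
import random

def smote_sketch(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    point and one of its k nearest minority neighbours (SMOTE-style idea)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic

# Toy minority class with two numeric features (made-up values).
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
new_points = smote_sketch(minority, n_new=6)
```

Because every synthetic point lies between two real minority points, this only makes sense for numeric features, and it should be applied to the training split only, never to the test data.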
@vineetsansi 5 years ago
XGBoost will take care of the weights by itself and we don't need to do any weight adjustment manually... is that right? Great videos, thanks for sharing them. I am sure you will get a big number of followers very soon! I am also applying XGBoost to the data science YouTube channels that I am following, and your channel seems to be getting heavier and heavier weights ;)
@dr.bheemsainik4316 2 years ago
Sir, I have data with a binary classification output variable. The ratio of classes is 7.5:2.5. Is this balanced or imbalanced data?
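There is no single cutoff that makes a dataset "balanced", but a quick way to judge a 7.5:2.5 split is to measure it and compare model accuracy against the majority-class baseline. The sketch below assumes 1000 rows with that ratio; the helper name `class_summary` and the row count are made up for illustration.

```python
from collections import Counter

def class_summary(labels):
    """Return per-class shares, the majority/minority ratio, and the accuracy
    of a model that always predicts the majority class."""
    counts = Counter(labels)
    total = len(labels)
    shares = {cls: n / total for cls, n in counts.items()}
    majority = max(counts.values())
    minority = min(counts.values())
    return shares, majority / minority, majority / total

# Assumed example: 750 rows of class 0 and 250 rows of class 1 (7.5:2.5).
labels = [0] * 750 + [1] * 250
shares, imbalance_ratio, baseline_accuracy = class_summary(labels)
```

Here a model that ignores the features entirely already reaches 75% accuracy, so any reported accuracy has to be read against that number.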
@gopalakrishna9510 5 years ago
I am really happy with the explanation of imbalanced and balanced datasets.
@sashpatra88 4 years ago
Krish: Can you put this in the Machine Learning playlist? Unless I am missing something.
@azmathalisyed9114 5 years ago
Great information, good explanation. 👌👌
@shashankvashishtha9149 3 years ago
Can you please explain the two algorithms, XGBoost and AdaBoost?
@mandarpawar27 4 years ago
Hi Krish, please upload videos on interview questions.
@udanial 3 years ago
Which playlist is this video from?
@DatAcuity 3 years ago
Just asking: if we use the XGBoost algorithm for a classification problem, do we not need to bother about class imbalance? Am I right, sir?
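As far as the XGBoost documentation goes, class reweighting is not applied automatically: for binary problems the library exposes a `scale_pos_weight` parameter, and the commonly suggested starting value is the number of negative rows divided by the number of positive rows. The sketch below only computes that ratio; the 900/100 split and the helper name `scale_pos_weight` are assumptions for the example, and the commented-out line shows where the value would go if `xgboost` is installed.

```python
from collections import Counter

def scale_pos_weight(labels, positive=1):
    """Ratio of negative to positive rows, a common starting value
    for XGBoost's scale_pos_weight parameter."""
    counts = Counter(labels)
    n_pos = counts[positive]
    n_neg = len(labels) - n_pos
    return n_neg / n_pos

# Assumed example: 900 negative rows and 100 positive rows.
y_train = [0] * 900 + [1] * 100
weight = scale_pos_weight(y_train)

# Hypothetical usage (requires the xgboost package):
# model = xgboost.XGBClassifier(scale_pos_weight=weight)
```

Boosting tends to focus on hard examples, which often helps with minority classes, but it is still worth checking recall or F1 on the minority class rather than assuming the imbalance has been handled.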
@MsRAJDIP 5 years ago
Can you show techniques for handling missing data other than mean, median, and mode imputation? I read that you can use regression or classification to predict missing values, but I have never seen it implemented.
@prasanthkumar7328 5 years ago
When downsampling as mentioned, we reduce the points to 100. Which points should be removed? Simply picking at random is also not good practice, so how should we select those 100 points?
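Random selection is in fact the usual baseline for downsampling, with the known cost that useful majority-class rows can be thrown away. A minimal sketch of that baseline follows, assuming 900 majority rows and 100 minority rows; the function name `random_undersample`, the toy data, and the fixed seed are illustrative choices.

```python
import random
from collections import defaultdict

def random_undersample(rows, labels, seed=0):
    """Randomly drop rows from larger classes until every class
    has as many rows as the smallest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    target = min(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # sample() draws without replacement, so no row is duplicated.
        for row in rng.sample(members, target):
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels

# Assumed example: row ids 0..999, first 900 are class 0, last 100 are class 1.
rows = list(range(1000))
labels = [0] * 900 + [1] * 100
small_rows, small_labels = random_undersample(rows, labels)
```

Fixing the seed keeps the selection reproducible; repeating the draw with several seeds and comparing results is one simple way to check how sensitive the model is to which 100 points were kept.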
@NinjaAnkit 5 years ago
What I feel when I watch your videos: even though you communicate in English, it feels like you are speaking in a regional language, and that is why I understand more. You explain most of the difficult terms in a simple way. I love your explanations.
@NinjaAnkit 5 years ago
And I also remember most of the concepts for a long time.
@NinjaAnkit 5 years ago
And I also like the videos of yours that I have watched.
@surendranathify82 5 years ago
Very useful, thanks. Could you please post videos on PCA and LDA, and on regularization as well? Thanks.
@krishnaik06 5 years ago
Thanks, please check my playlist; a video on PCA is already there.
@umang8895 5 years ago
Great video, easy to understand.
@harshays2873 4 years ago
Sir, suppose I have too little data to train my model. What should I do?
@biswanandanpattanayak1938 4 years ago
How do we handle missing data if the data is 1 TB or more? Please explain.
@satyaranjanbehera5492 5 years ago
Good explanation. Thanks.
@HarpreetKaur-mn4we 5 years ago
Very helpful video
@kakarlanagajyothi4089 4 years ago
Are there any videos continuing these?
@joyeetamallik5063 5 years ago
Can you share Python code to implement these upsampling techniques? Is this concept also applicable to NLP datasets?
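The simplest upsampling technique is random oversampling: keep every original row and duplicate randomly chosen minority rows until the classes match. Since it only copies rows, it works on raw text as well as numeric features, which is why the sketch below uses short strings. The function name `random_oversample` and the toy reviews are made up for illustration.

```python
import random
from collections import defaultdict

def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of smaller classes until every
    class has as many rows as the largest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # Keep all originals, then top up with copies drawn with replacement.
        extras = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extras:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels

# Assumed example: 8 texts of class 0 and 2 texts of class 1.
rows = [f"review {i}" for i in range(10)]
labels = [0] * 8 + [1] * 2
big_rows, big_labels = random_oversample(rows, labels)
```

Oversample only after splitting off the test set; otherwise copies of the same row can land in both train and test and inflate the score.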
@PhilippHusiA 4 years ago
If working with tf.keras, add the following code to your model: 1) import numpy as np and from sklearn.utils import class_weight 2) class_weights = class_weight.compute_class_weight(class_weight='balanced', classes=np.unique(y_train), y=y_train) 3) class_weights = dict(enumerate(class_weights)) (Keras expects a dict; this assumes integer labels 0..n-1) 4) history = model.fit(x_train, y_train, batch_size=x, class_weight=class_weights)
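To see what the `'balanced'` option computes, the heuristic is `n_samples / (n_classes * count_of_class)`, so rare classes get proportionally larger weights. The sketch below reproduces that formula in plain Python and returns a dict keyed by class label, the shape Keras expects for `class_weight`. The helper name `balanced_class_weights` and the 900/100 split are assumptions for the example.

```python
from collections import Counter

def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count),
    the same heuristic as scikit-learn's 'balanced' mode."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * n) for cls, n in counts.items()}

# Assumed example: 900 rows of class 0 and 100 rows of class 1.
y_train = [0] * 900 + [1] * 100
weights = balanced_class_weights(y_train)
```

With this split the minority class is weighted 5.0 and the majority class roughly 0.56, so each minority error costs about nine times as much in the loss.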
@gopalakrishna9510 5 years ago
Can you explain with Python code?
@victorxu9634 5 years ago
Great content. Would be nice if it went deeper.
@rahulmahajan6391 4 years ago
Can we do downsampling on a credit card fraud detection dataset?
@kakarlanagajyothi4089 4 years ago
Small doubt: are misclassification and imbalance the same or different?
@kakarlanagajyothi4089 4 years ago
Please answer me.
@galisurendra6397 4 years ago
Please reply to this question.
@prithviraj25 4 years ago
Thank you, sir.
@venkataraomannem6585 5 years ago
Sir, can you please do the same thing practically? Thank you, sir.
@pruthvigirijala8146 5 years ago
Discuss everything in upcoming video..? :p
@niketanjha 5 years ago
Really helpful 🙏
@Shylajakarthick 5 years ago
Thank you so much
@karndeepsingh 5 years ago
How do we deal with an imbalanced dataset when the target variable is multiclass?
@MasterofPlay7 4 years ago
Use other metrics such as F1 score instead of accuracy...
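A small worked example shows why F1 is more informative than accuracy here. Assume a three-class problem with 90, 7, and 3 rows, and a model that always predicts the majority class: accuracy looks fine, while macro-averaged F1 (the plain mean of per-class F1) exposes the two classes that are never predicted. The data and the function names are made up for illustration.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f1_per_class(y_true, y_pred):
    """F1 = 2 * precision * recall / (precision + recall), per class."""
    scores = {}
    for cls in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == cls and p == cls for t, p in zip(y_true, y_pred))
        fp = sum(t != cls and p == cls for t, p in zip(y_true, y_pred))
        fn = sum(t == cls and p != cls for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # A class with no true positives, false positives, or false
        # negatives gets 0.0 rather than dividing by zero.
        scores[cls] = 2 * tp / denom if denom else 0.0
    return scores

def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, so small classes count equally."""
    scores = f1_per_class(y_true, y_pred)
    return sum(scores.values()) / len(scores)

# Assumed example: 90/7/3 rows per class, model always predicts class 0.
y_true = [0] * 90 + [1] * 7 + [2] * 3
y_pred = [0] * 100

acc = accuracy(y_true, y_pred)
per_class = f1_per_class(y_true, y_pred)
macro = macro_f1(y_true, y_pred)
```

Accuracy comes out at 0.9, but the per-class F1 for classes 1 and 2 is 0, which pulls macro F1 down to about 0.32.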
@sushedbubai 5 years ago
Waiting for more interview questions
@krishnaik06 5 years ago
You can find the complete playlist at the YouTube URL below: kzbin.info/aero/PLZoTAELRMXVPkl7oRvzyNnyj1HS4wt2K- I will be updating this with all the questions.
@davinderc 5 years ago
Consider using better whiteboard software. Your written words and numbers are nearly impossible to read in Paint.
@krishnaik06 5 years ago
Hi Davinder, feedback taken
@azad8upt 5 years ago
It should be 250+ in test, not 350+.
@Beyond90Days 4 years ago
How is the accuracy 350/30?
@Beyond90Days 4 years ago
Got it :) 350/380
@pruthvigirijala8146 5 years ago
You know..? :p
@moulidinavahi1498 4 years ago
How can we downsample data points?