Live Discussion On Handling Imbalanced Dataset- Machine Learning

  Рет қаралды 81,770

Krish Naik

Krish Naik

Күн бұрын

Пікірлер: 91
@nothing_to_love
@nothing_to_love 9 ай бұрын
Appreciate your sharing, sir! 2024 but your VDO is extremely helpful. Thanks, sir.
@darant
@darant 4 жыл бұрын
Hi Krish, Thank you so much for everything that you are offering us at free of cost!
@yogenderkushwaha5523
@yogenderkushwaha5523 4 жыл бұрын
Thank you sir, always feel motivated after seeing your enthusiasm for data science. Learning a lot from you ❤️
@eduhomebyshubh5445
@eduhomebyshubh5445 4 жыл бұрын
Most welcome bhai
@mahmoudbabiker75
@mahmoudbabiker75 Жыл бұрын
We are ...greatly indebted to you
@oo_wais
@oo_wais 2 жыл бұрын
thank you Krish.. i am working on a project and it had imbalanced target variable. this video really helped me out.
@eduhomebyshubh5445
@eduhomebyshubh5445 4 жыл бұрын
Best machine learning tutorials sir
@abhinavmahajan448
@abhinavmahajan448 2 жыл бұрын
Thanks for the informative video
@heyrobined
@heyrobined 3 жыл бұрын
random-forest class weights example: 40:00 undersampling : 43:25
@DeadTalkLive
@DeadTalkLive 4 жыл бұрын
Good video ♥♥! As a current KZbinr, I am on the lookout for creative ideas! Nice Job!
@talhasaleem8333
@talhasaleem8333 2 жыл бұрын
The precision is gone #UnderSampling 🤣 That laugh
@ankitg200
@ankitg200 4 жыл бұрын
Very Nice video..the greatest thing is we get to know what is currently used in industry not what is bookish
@bhavindedhia9968
@bhavindedhia9968 4 жыл бұрын
Always motivated and encourage me when new video comes
@lovejazzbass
@lovejazzbass 4 жыл бұрын
Thank you so much Krish. I have two teachers on KZbin. Krish and Harshit!
@niranjannahak89
@niranjannahak89 2 жыл бұрын
Due to an update in imblearn version fit_sample is throwing error. So i used-- X_train_ns,Y_train_ns = ns.fit_resample(X_train,Y_train)---and it works fine for me..
@ammar46
@ammar46 2 жыл бұрын
Hey, is SMOTETomek taking long time?
@sucharitha9365
@sucharitha9365 4 жыл бұрын
Nice explained sir
@str7749
@str7749 2 жыл бұрын
Thank you v.much !
@sandipansarkar9211
@sandipansarkar9211 3 жыл бұрын
finished practicing code
@SUPRIYASUMAN-qg7qk
@SUPRIYASUMAN-qg7qk 2 жыл бұрын
Hello Krish, Could you please share your video on handling data imbalance in deep learning models? It would indeed be a great help.
@sambitnath9853
@sambitnath9853 3 жыл бұрын
Always a pleasure to watch your videos sir 👍
@sandipansarkar9211
@sandipansarkar9211 3 жыл бұрын
finished watching
@cliffordtarimo1511
@cliffordtarimo1511 3 жыл бұрын
Learned a lot from this!! Thanks man
@InovateTechVerse
@InovateTechVerse 3 жыл бұрын
Thank you so much for the session.
@seeutube8860
@seeutube8860 2 жыл бұрын
After applying 'ns', is resplitting of undersampled dataset necessary or not? Here, after balancing, model was used on original train and test dataset.
@shylashreedev2685
@shylashreedev2685 2 жыл бұрын
Dont we have the Gpay facility for joining the membership,since it is asking for card number
@natarajanlalgudi
@natarajanlalgudi 4 жыл бұрын
Great lecture thanks as always Krish
@krishnaik06
@krishnaik06 4 жыл бұрын
Thank u :)
@lujeinalsheikh8316
@lujeinalsheikh8316 2 жыл бұрын
Hi Krish, would it be possible to make a video about how class weights are used to perform a node split in a weighted decision tree?
@tejaswinimanne7539
@tejaswinimanne7539 Жыл бұрын
Sir can you upload video about how to predict earthquake using Naive bayes
@diycollection96
@diycollection96 2 жыл бұрын
Really helpful vdo ... Sir i just want to how u select the arange(-2,3)?
@rambaldotra2221
@rambaldotra2221 3 жыл бұрын
Even after installing imblearn its giving me error Module not found. Can someone help me please, due to this error I am stuck.
@tirthadatta7368
@tirthadatta7368 3 жыл бұрын
Sir, is there any video of imbalanced image dataset handling in CNN? or can we use the basic of this live tutorial for image classification purpose??
@new_edition3149
@new_edition3149 2 жыл бұрын
Did u get good accuracy (precison and recall) on image imbalnaced dataset?
@new_edition3149
@new_edition3149 2 жыл бұрын
Actually i'm also looking the answer for same question
@jupinsgil
@jupinsgil 2 жыл бұрын
godlike!
@simranade1106
@simranade1106 3 жыл бұрын
if data are unbalanced in regression problem then how to handle??
@AmitSingh-co7ci
@AmitSingh-co7ci 4 жыл бұрын
Hi Sir, Similar kind of error i am getting while using over sampling method.. from imblearn.over_sampling import RandomOverSampler os = RandomOverSampler(0.5) x_train_ns, y_train_ns = os.fit_sample(x_train, y_train) print("The number of classes before fit{}".format(Counter(y_train))) print("The number of classes after fit{}".format(Counter(y_train_ns))) error: --------------------------------------------------------------------------- AttributeError Traceback (most recent call last) in 2 3 os = RandomOverSampler(0.5) ----> 4 x_train_ns, y_train_ns = os.fit_sample(x_train, y_train) 5 print("The number of classes before fit{}".format(Counter(y_train))) 6 print("The number of classes after fit{}".format(Counter(y_train_ns))) c:\users\asing053\appdata\local\programs\python\python38-32\lib\site-packages\imblearn\base.py in fit_resample(self, X, y) 75 check_classification_targets(y) 76 arrays_transformer = ArraysTransformer(X, y) ---> 77 X, y, binarize_y = self._check_X_y(X, y) 78 79 self.sampling_strategy_ = check_sampling_strategy( c:\users\asing053\appdata\local\programs\python\python38-32\lib\site-packages\imblearn\over_sampling\_random_over_sampler.py in _check_X_y(self, X, y) 77 def _check_X_y(self, X, y): 78 y, binarize_y = check_target_type(y, indicate_one_vs_all=True) ---> 79 X, y = self._validate_data( 80 X, y, reset=True, accept_sparse=["csr", "csc"], dtype=None, 81 force_all_finite=False, AttributeError: 'RandomOverSampler' object has no attribute '_validate_data'
@Thedark.i
@Thedark.i 4 жыл бұрын
Hey sir, the one question you asked in your Virtual interview "Case study which has 0 or 1 dependent variable and they are further dividied into subcatergory. Can you please answer that question like how we can do it?
@netrachinival6249
@netrachinival6249 3 жыл бұрын
hello sir, I am new one machine learning .I didn't get from where should I start .Can you please give any list or syllabus.I was seen all your vt classes ,things could great sir .Thank you so much
@adipurnomo5683
@adipurnomo5683 3 жыл бұрын
Starting learn statistic and probability first
@sameerkumar6431
@sameerkumar6431 4 жыл бұрын
Hi Krish, can you please make video on multi variant time series forecasting model
@nagnathsatav9978
@nagnathsatav9978 4 жыл бұрын
Hi krish want to know if more than 2 classes imbalanced problem?
@riteshmukhopadhyay6922
@riteshmukhopadhyay6922 2 жыл бұрын
Hello Krish, I have been following your data analytics videos throughout. I have completed Live EDA And Feature Engineering Playlists, then I started following this playlist. I am quite over whelmed with sudden introduction to ML and other models which I have no clue about. Can you please tell me the playlist I should follow first to get the basic understanding of what you are teaching here? Thanks for your effort,
@ammarkhan2611
@ammarkhan2611 4 жыл бұрын
What are the default parameters used by a Random Forest Classifer ( Tree Depth, No of Trees Used, No of variables used at each step) in Python ?
@ARUNADEVIRUIT
@ARUNADEVIRUIT 4 жыл бұрын
Hi sir.. my dataset has 13l records for class a and 2k records for class b.. I tried the same smote, randomundersample,gridsearch,randomforest,decisiontree.. everything but column has less correlation wit the target and I'm getting very less score for class b
@ahiyaahammed3643
@ahiyaahammed3643 2 жыл бұрын
Hi Krish, Thank you for your videos can you please do a session on Handling Imbalanced Image Dataset (Medical dataset if possible)?
@virtuous_views
@virtuous_views 4 жыл бұрын
In fraud classification, false negatives should be more important right and that means we should focus on our recall score. Am I correct??
@md.muntasirulhoque8563
@md.muntasirulhoque8563 3 жыл бұрын
best sir
@AlgoTribes
@AlgoTribes 4 жыл бұрын
Krish bhai random forest model ke saath isko kaggle pe upload kijiye naa..kaggle score mein toh bahut upar rahegaa yeh model..kar ke dikhayega..request h please.
@krishnaik06
@krishnaik06 4 жыл бұрын
Sure karke batata hoon :)
@taniyabanerjee2609
@taniyabanerjee2609 3 жыл бұрын
I really appreciate what you are trying to do. But it would have been much better if you actually answer the questions raised and also if you could explain why and how something is happening. You ask if it's clear, and I see people asking questions but sadly you avoid all the questions. And sometimes I actually have the same questions raised by others but we have to go to some other tutor and learn about it. But nevertheless you give us content to follow through, thankyou :)
@joelbraganza3819
@joelbraganza3819 4 жыл бұрын
When exactly in the pipeline should the imbalanced data be balanced? Is it before we begin any feature analysis, feature selection and other pre-processing techniques? Because many time Outlier analysis and removal methods will rule out some good data points in a variable, counting them as outliers when in fact they are just unbalanced data-points associated with the minority class. OR, should we balance the data just before predictive modelling for the sake of getting unbiased models & result? Let me know, please.
@nischalneupane3
@nischalneupane3 3 жыл бұрын
By “handling” imbalanced dataset, you are not really “transforming” the dataset as you would in Feature Engineering pipeline. It is part of Model Training/Tuning. You do not “clean” an imbalanced dataset, and it’s perfectly natural for datasets to be imbalanced.
@sajidchoudhary1165
@sajidchoudhary1165 4 жыл бұрын
Sir Please makes video on Mathematics behind on SVM Regression, AdaBoost Regression, Gradient Boost Classification
@ammumammu6677
@ammumammu6677 3 жыл бұрын
Hi sir ,please do data processing using CLI in machine learning
@2728jay
@2728jay 3 жыл бұрын
How to handle imbalance data for multi class classification problem which has only text column as feature
@priyankamehta2827
@priyankamehta2827 4 жыл бұрын
I have 1244 obs in class a and 244 obs in class b. My algorithm is classifying everything in one class. How should i rectify it? I tried logistic regression, svm, random forest.. Same problem
@rajputjay9856
@rajputjay9856 4 жыл бұрын
AttributeError: 'NearMiss' object has no attribute '_validate_data' .... Due to the version differnce the error comes sir
@rajputjay9856
@rajputjay9856 4 жыл бұрын
This error generally comes when we have freshly installed the library and you need to shut everything off star jupyter notebook once again and you can solve that error
@MechiShaky
@MechiShaky 4 жыл бұрын
Krish sir ,If our targets are regression values instead of classification problem . Then how to examine the data is imbalanced ?
@vineet094
@vineet094 4 жыл бұрын
Why will a regression problem have imbalance ? Give me an example? Regression problems do not have imbalance!!!! They are real values.
@Naveenkumar-xc9ms
@Naveenkumar-xc9ms 4 жыл бұрын
Hello sir where can i find videos fo spam detection project.
@SnehaSingh-ts7oi
@SnehaSingh-ts7oi 3 жыл бұрын
Hi Sir. Why have you not use SVM. I have read its very popular algorithm.
@gauravverma-dl9yl
@gauravverma-dl9yl 3 жыл бұрын
Why are looking left every other minute ?
@Bunny2.O
@Bunny2.O 3 жыл бұрын
For me my model is predicting hight false alrets and low true alrets , I am new to ML, my data is imbalanced can u please suggest which models are good for my data, when I check the model in live .. there also positive class is low num like 15 , negative class is high like 3665.
@priyankamehta2827
@priyankamehta2827 4 жыл бұрын
I have 1244 obs in class a and 244 obs in class b. My algorithm is classifying everything in one class. How should i rectify it?
@gayatrichopade7674
@gayatrichopade7674 3 жыл бұрын
Try using kfold.
@SatyamKumar-dj3jo
@SatyamKumar-dj3jo 4 жыл бұрын
ValueError: Logistic Regression supports only penalties in ['l1', 'l2', 'elasticnet', 'none'], got 11.
@nandlalmishra4435
@nandlalmishra4435 4 жыл бұрын
Must have mistyped l1 as 11 maybe
@etsutina9594
@etsutina9594 3 жыл бұрын
I have a different row for my 17 years data which means, one year have 800 row and other year I have 300 so how can I make the rows similar for my time sires prediction
@hailayteklehaymanot2658
@hailayteklehaymanot2658 3 жыл бұрын
If the training data has only “negative class”, whereas testing data has both the classes; “negative” and “positive”. what kind of algorithm shall we apply ?
@bdrcmym
@bdrcmym 3 жыл бұрын
Sir can you explain,how to create our own model
@rich007p
@rich007p 3 жыл бұрын
(y) Great
@deutschvalley3574
@deutschvalley3574 3 жыл бұрын
How We can save the data after under sampling or over sample ?? If possible kindly sir give me code 👩‍💻
@devarakondahimaja8423
@devarakondahimaja8423 3 жыл бұрын
Difference between smote sampling and adasyn sampling?
@vijaykumar-yq7sf
@vijaykumar-yq7sf 4 жыл бұрын
Don’t go to usa on H1 or F1. Reason 1. Forget about green card 2. Conceltency , and tax will take 50% of ur salary. 3. Very low savings and expensive. 4. I have seen people working in software and being homeless for years. 5. Big problem u cannot bring ur parents for sure. 6. Parents health insurance is big problem. 7. Do any small business in India don’t come to usa. Sorry for hard words, but it’s truth.
@avibitm
@avibitm 3 жыл бұрын
@krish: what if there is imbalance in data in one single attributes rather class attributes
@tahirullah4786
@tahirullah4786 3 жыл бұрын
I got this Error how can I solve it.....ValueError: Solver lbfgs supports only 'l2' or 'none' penalties, got l1 penalty.
@srikeshhp7541
@srikeshhp7541 4 жыл бұрын
Hi Sir,Please upload a video on detailed explanation to crack Google Summer of Code,Please Sir...Thank You..
@omkarsutar578
@omkarsutar578 4 жыл бұрын
I don't understand, Why this video in feature engineering playlist
@samarendrakumarsinha8898
@samarendrakumarsinha8898 4 жыл бұрын
LOGIC FOR 1000 MILES /HOUR RAILWAY ENGINE
@utkarshupadhyay1485
@utkarshupadhyay1485 3 жыл бұрын
why you have uploaded this video on Feature Engineering ->
@yajnabopaiah8616
@yajnabopaiah8616 3 жыл бұрын
The concepts could have been explained better rather just focusing on hands on session with the technique
Wall Rebound Challenge 🙈😱
00:34
Celine Dept
Рет қаралды 20 МЛН
Подсадим людей на ставки | ЖБ | 3 серия | Сериал 2024
20:00
ПАЦАНСКИЕ ИСТОРИИ
Рет қаралды 584 М.
All Machine Learning algorithms explained in 17 min
16:30
Infinite Codes
Рет қаралды 421 М.
Live Q&A With Nitish Singh @CampusX- Ask Any Questions-Data Science
1:05:00
Israel Has The Right To Defend Itself | Stand-up Comedy by Daniel Fernandes
15:07
Processing 100+ GBs Of Data In Seconds Using Polars GPU Engine
19:09
3 Years of VFX - From Learning to Creating | 222 VFX Breakdown
4:46
Stanley Xavier
Рет қаралды 27 М.
Wall Rebound Challenge 🙈😱
00:34
Celine Dept
Рет қаралды 20 МЛН