SMOTE - Handle imbalanced dataset | Synthetic Minority Oversampling Technique | Machine Learning

  Рет қаралды 18,496

Data Magic (by Sunny Kusawa)

Data Magic (by Sunny Kusawa)

Күн бұрын

Пікірлер: 14
@mahipatil369
@mahipatil369 8 ай бұрын
Can i use smote technique for handling imbalanced data where classes are 3. While doing sentiment analysis using basebert model ?
@hafizhaaghniahasya358
@hafizhaaghniahasya358 3 ай бұрын
Is it okay to use smote after feature encoding using OHE?
@akshaypatil8155
@akshaypatil8155 Жыл бұрын
4:05 if it's a high dimensional data what technique should be used both for undersampling and oversampling?
@DataMagicAI
@DataMagicAI Жыл бұрын
SMOTE is good choice for over sampling. For under sampling you can go with random under sampling.
@Bumbazz
@Bumbazz 5 ай бұрын
how can we balance a multiclassed dataset?
@akshaypatil8155
@akshaypatil8155 Жыл бұрын
for SMOTE technique is it necessary to have all the input columns and target variable in numerical format? or is it okay if some of the input columns are categorical?
@DataMagicAI
@DataMagicAI Жыл бұрын
You can use Synthetic Minority Over- sampling Technique-Nominal Continuous (SMOTE-NE) instead of just SMOTE when you have numerical as well as categorical features.
@MargaritaHantke
@MargaritaHantke 8 ай бұрын
Why can't you oversample or undersample just a bit more? Why do they have to be equivalent in 50-50% proportion?
@DataMagicAI
@DataMagicAI 8 ай бұрын
Its is to reduce the bias. If you provide more sample from class A then there is high chance like trined model will learn more about class A and get biased towards it. Thats the reason we want to make all class sample size equivalent to avoid such bias Trined models. There also some models which are capable to handle the unbalanced dataset. So in that case we dont need to do balancing.
@shrikant136661
@shrikant136661 Жыл бұрын
If this imbalance data set is sparse then SMOTE will not be usefull as it works on base of KN theory,then what to do in that case?
@DataMagicAI
@DataMagicAI Жыл бұрын
Try for the data augmentation techniques with GAN.
@tornyu
@tornyu 2 ай бұрын
Checking my understanding: so SMOTE uses KNN and _linear_ interpolation to generate synthetic data, is that right? But linear is a big assumption, so by using a GAN you hope to synthesise samples that are closer to being in-distribution?
@Prasannashinde365
@Prasannashinde365 2 жыл бұрын
how to buy google colab pro in india can you help me?
@DataMagicAI
@DataMagicAI 2 жыл бұрын
colab.research.google.com/signup visit this link select plan free/ Pro/ Pro plus and sign up.
Handling Imbalanced Datasets   SMOTE Technique
24:32
DataMites
Рет қаралды 50 М.
Magic or …? 😱 reveal video on profile 🫢
00:14
Andrey Grechka
Рет қаралды 61 МЛН
Nurse's Mission: Bringing Joy to Young Lives #shorts
00:17
Fabiosa Stories
Рет қаралды 16 МЛН
Whoa
01:00
Justin Flom
Рет қаралды 56 МЛН
Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn
00:20
佐助与鸣人
Рет қаралды 48 МЛН
SMOTE - Synthetic Minority Oversampling Technique
8:34
Jitesh Khurkhuriya
Рет қаралды 64 М.
How to handle imbalanced datasets in Python
11:48
Data Professor
Рет қаралды 50 М.
Magic or …? 😱 reveal video on profile 🫢
00:14
Andrey Grechka
Рет қаралды 61 МЛН