Missing Value - kNN imputation in R

  Рет қаралды 45,728

Gourab Nath

Gourab Nath

Күн бұрын

This video discusses how to do kNN imputation in R for both numerical and categorical variables.
#MissingValue Imputation
#KNNimputation
#MachineLearning

Пікірлер: 51
@malvarez8588
@malvarez8588 3 жыл бұрын
Nice explanation! Just an advice regarding the last step: You can also use the argument "imp_var = FALSE" in "kNN()" to skip the additional logical variables.
@MrGozobozo
@MrGozobozo 3 жыл бұрын
You are awesome Mr. Gourab! Thank you very much for the simple yet so helpful tutorial!
@nhosurkar
@nhosurkar 8 жыл бұрын
Excellent explanation. Very simple and methodical
@abrammakgalemele3491
@abrammakgalemele3491 5 жыл бұрын
Thanks. Great demo!
@niveditapaul2921
@niveditapaul2921 3 жыл бұрын
Very informative video.
@preeyank5
@preeyank5 6 жыл бұрын
Thanks for the videos dude, very informative.
@RobertoSolanoM
@RobertoSolanoM 5 жыл бұрын
nice, short video! thanks for sharing
@kukulemon
@kukulemon 5 жыл бұрын
Why did you use k=6 and how do you identify the most applicable value for this parameter?
@TM-ip7vm
@TM-ip7vm 4 жыл бұрын
Thanks. धन्यवाद
@parijatkumar339
@parijatkumar339 6 жыл бұрын
Excellent!!
@jindai8827
@jindai8827 7 жыл бұрын
Very Helpful! Thx
@Protect_the
@Protect_the 8 жыл бұрын
Thanks sir! It was quite helpful. Just wanted to know for categorical variable imputation is there any other option available in R? EM or MI won't work because they treat even categorical variable as continuous ones which further distort the categorical variable values
@meghalgandhi4357
@meghalgandhi4357 6 жыл бұрын
Great. That helps. Thanks a ton
@larrylai4080
@larrylai4080 6 жыл бұрын
Thanks a lot! In addition, how to deal with the following warning messages? "In [
@APPUIT2KUTTU
@APPUIT2KUTTU 5 жыл бұрын
Thanks for the video. please explain how missing values for POSIXct format variables can be updated.
@elmonovagales2929
@elmonovagales2929 5 жыл бұрын
I dont know if the error I am getting is result of a lack of maintenance of the package. What it does when I impute, is to create a new column with a Boolean output (True for NA's and False for real values). Any idea?
@ravindarmadishetty736
@ravindarmadishetty736 6 жыл бұрын
Useful video
@harishdhanarajan3297
@harishdhanarajan3297 5 жыл бұрын
Under what assumption K was taken as 6? Basically, K has to be selected based on the elbow criterion / Sum of Squares and in order to plot it we require a data-set without NA's, I might be wrong on how to find k value but is there a way to determine the K value? Thanks in advance and great video BTW.
@AliNavarro-tq6zy
@AliNavarro-tq6zy 2 ай бұрын
how do you identify what is the best value for the hyper parameter K?
@ayushdudedon
@ayushdudedon 8 жыл бұрын
Should we treat missing values before removing outliers or after ?
@pratikshah305
@pratikshah305 7 жыл бұрын
after, as outliers will skew the treatment to a higher/lower value thereby changing the average/median or any other treatment that you are trying out
@vedantsahay8750
@vedantsahay8750 4 жыл бұрын
Hello Gourab, thanks for the video! I tried to impute categorical data using kNN as mentioned but after running the code, I still have the same number of blanks values in the column. Could you please provide some inputs on the same? Thanks!
@juandavidlozadacalderon1356
@juandavidlozadacalderon1356 5 жыл бұрын
thanks.
@petax004
@petax004 8 жыл бұрын
Can you make video for VIM package(GUI preferably)? I like the ability to visualize the missing values using VIM package along with imputations.
@Protect_the
@Protect_the 8 жыл бұрын
start with install.packages(VIM) on R 3.3.0 version, it will keep on guiding you which additional package to add. Keep on adding all supporting packages as per pop up messages in R. It will work.
@sainaveenbali
@sainaveenbali 8 жыл бұрын
can i impute this in panel data and then proceed to normally i.e. calculate fixed and random effect model and then apply hausmen etc. Please help
@antoniorodriguezandres161
@antoniorodriguezandres161 5 жыл бұрын
Did you find out for panel data?
@chonthanyayosbuth4648
@chonthanyayosbuth4648 6 жыл бұрын
Thank you very much. Do you have other method ???
@Mythical_motion
@Mythical_motion 6 жыл бұрын
Thanks
@maticslearning_science
@maticslearning_science 5 жыл бұрын
hi prof.... how do you solve nominal and numeric attribut data with missing values data?
@akrsrivastava
@akrsrivastava 8 жыл бұрын
The extra columns that kNN created were to keep track of imputed values. Isnt that so?
@SA-gt8lx
@SA-gt8lx 5 жыл бұрын
i need imputation for a data with 1.7 million rows and 27 columns. my pc cant handle this. what to do?
@rookiedrummer6838
@rookiedrummer6838 6 жыл бұрын
Hi Gourab thank for the explanation, i need small information ... how do we decide the value of k?
@TheFreezwater
@TheFreezwater 5 жыл бұрын
There is no standard as such. You need to give a couple odd choices and see how each model performs and try to choose the one the best fits your need. This could change as source data changes.
@rajdipsaha
@rajdipsaha 3 жыл бұрын
Here we can take k = round(sqrt(n)), where n is the number of observations.
@ayeledesalegn5367
@ayeledesalegn5367 4 жыл бұрын
For further any body can see the vedio kzbin.info/www/bejne/fWPVkmqBqpiYfa8
@fardazay
@fardazay 4 жыл бұрын
there is no library(VIM) on my R, can you help me please?
@gourabnath9789
@gourabnath9789 4 жыл бұрын
Did you install the package? If not run: install.packages("VIM") Then load the package library(VIM)
@khachiakhadija7738
@khachiakhadija7738 6 жыл бұрын
I get this error :Error in `[.data.table`(data, , sapply(.SD, function(x) all(is.na(x))), : Some items of .SDcols are not column names (or are NA)
@rookiedrummer6838
@rookiedrummer6838 6 жыл бұрын
I a also getting the same error
@vaibhavkabdwal6275
@vaibhavkabdwal6275 5 жыл бұрын
use variable=c("NAME OF COLUMN)"
@miguelescobedorivas9763
@miguelescobedorivas9763 6 жыл бұрын
I already installed VIM package but its stil showing me this message. > library(VIM) Error: package or namespace load failed for ‘VIM’ in loadNamespace(j
@hans4223
@hans4223 5 жыл бұрын
ok
@nhosurkar
@nhosurkar 8 жыл бұрын
Excellent explanation. Very simple and methodical
@kennetholelewe9932
@kennetholelewe9932 8 жыл бұрын
library(VIM) can't run on my computer. What could the problem be?
@gourabnath5076
@gourabnath5076 8 жыл бұрын
+Kenneth Olelewe Probably the VIM package is not installed in your system. Try installing the package VIM using install.packages("VIM")
@rishabharora5072
@rishabharora5072 5 жыл бұрын
@@gourabnath5076 Hi, even i am facing this issue. When i try to install this package it gives an error. Is it this VIM is under some other package?
@gourabnath9789
@gourabnath9789 5 жыл бұрын
@@rishabharora5072 Hi Rishabh! No, VIM is an independent package. If install.packages('VIM') does not work check if ur version of the R software is compatible with this package or not. In that case upgrade R and try again.
@rishabharora5072
@rishabharora5072 5 жыл бұрын
Gourab Nath could you please tell me any other package that can run on 3.5.1 version.. need to perform imputation for a project. Will be very helpful. I have some 30+ column and 6K + rows
@gourabnath9789
@gourabnath9789 5 жыл бұрын
@@rishabharora5072 For your version VIM would have supported. However you can try using other imputation methods like MICE using the mice package in R.
One Sample t test 1
12:39
Gourab Nath
Рет қаралды 748
Handling Missing Values using R
16:07
Dr. Bharatendra Rai
Рет қаралды 44 М.
Angry Sigma Dog 🤣🤣 Aayush #momson #memes #funny #comedy
00:16
ASquare Crew
Рет қаралды 50 МЛН
Пришёл к другу на ночёвку 😂
01:00
Cadrol&Fatich
Рет қаралды 10 МЛН
HAH Chaos in the Bathroom 🚽✨ Smart Tools for the Throne 😜
00:49
123 GO! Kevin
Рет қаралды 12 МЛН
Applying and Understanding K-Nearest Neighbors (KNN) in R
12:20
Spencer Pao
Рет қаралды 25 М.
Missing Value - Imputation uing Simple Linear Regression using R
12:03
How to impute missing data using mice package in R programming
11:26
Rajendra Choure
Рет қаралды 5 М.
Robert Greene: How To Seduce Anyone, Build Confidence & Become Powerful | E232
1:54:48
Handle Missing Values: Imputation using R ("mice") Explained
22:47
DataExplained
Рет қаралды 63 М.
Dealing With Missing Data - Multiple Imputation
11:02
ritvikmath
Рет қаралды 46 М.
R: Regression With Multiple Imputation (missing data handling)
12:34
Regorz Statistik
Рет қаралды 13 М.
Nearest Neighbour Classifier - k Nearest Neighbour (kNN)
11:16
Gourab Nath
Рет қаралды 17 М.
Dealing with Missing Data in R
33:34
LiquidBrain Bioinformatics
Рет қаралды 5 М.