This video discusses how to do kNN imputation in R for both numerical and categorical variables. #MissingValue Imputation #KNNimputation #MachineLearning
Comments: 51
@malvarez8588 3 years ago
Nice explanation! Just a piece of advice regarding the last step: you can also pass the argument "imp_var = FALSE" to "kNN()" to skip the additional logical variables.
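A minimal sketch of the tip above, assuming the VIM package is installed and using the `sleep` example dataset that ships with VIM (the dataset choice and `k = 6` are illustrative, not necessarily what the video used):

```r
library(VIM)  # assumes VIM is already installed

# Example dataset shipped with VIM; several columns contain NAs.
# Loaded explicitly because base R also has a different dataset named sleep.
data(sleep, package = "VIM")

# Default behaviour: extra logical columns (suffix "_imp") flag imputed cells
with_flags <- kNN(sleep, k = 6)

# imp_var = FALSE: only the original columns come back, with NAs filled in
no_flags <- kNN(sleep, k = 6, imp_var = FALSE)

names(with_flags)  # original columns plus the "_imp" indicator columns
names(no_flags)    # original columns only
```

The indicator columns are useful for auditing which cells were imputed, so dropping them is a convenience rather than a requirement.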
@MrGozobozo 3 years ago
You are awesome Mr. Gourab! Thank you very much for the simple yet so helpful tutorial!
@nhosurkar 8 years ago
Excellent explanation. Very simple and methodical
@abrammakgalemele3491 5 years ago
Thanks. Great demo!
@niveditapaul2921 3 years ago
Very informative video.
@preeyank5 6 years ago
Thanks for the videos dude, very informative.
@RobertoSolanoM 5 years ago
nice, short video! thanks for sharing
@kukulemon 5 years ago
Why did you use k=6 and how do you identify the most applicable value for this parameter?
@TM-ip7vm 4 years ago
Thanks. धन्यवाद
@parijatkumar339 6 years ago
Excellent!!
@jindai8827 7 years ago
Very Helpful! Thx
@Protect_the 8 years ago
Thanks sir! It was quite helpful. Just wanted to know: for categorical variable imputation, is there any other option available in R? EM or MI won't work because they treat even categorical variables as continuous ones, which further distorts the categorical values.
@meghalgandhi4357 6 years ago
Great. That helps. Thanks a ton
@larrylai4080 6 years ago
Thanks a lot! In addition, how to deal with the following warning messages? "In [
@APPUIT2KUTTU 5 years ago
Thanks for the video. Please explain how missing values for POSIXct format variables can be updated.
@elmonovagales2929 5 years ago
I don't know if the error I am getting is the result of a lack of maintenance of the package. When I impute, it creates a new column with a Boolean output (TRUE for NAs and FALSE for real values). Any idea?
@ravindarmadishetty736 6 years ago
Useful video
@harishdhanarajan3297 5 years ago
Under what assumption was K taken as 6? Basically, K has to be selected based on the elbow criterion / sum of squares, and to plot it we require a dataset without NAs. I might be wrong about how to find the K value, but is there a way to determine it? Thanks in advance and great video BTW.
@AliNavarro-tq6zy 2 months ago
how do you identify what is the best value for the hyper parameter K?
@ayushdudedon 8 years ago
Should we treat missing values before removing outliers or after ?
@pratikshah305 7 years ago
after, as outliers will skew the treatment to a higher/lower value thereby changing the average/median or any other treatment that you are trying out
@vedantsahay8750 4 years ago
Hello Gourab, thanks for the video! I tried to impute categorical data using kNN as mentioned, but after running the code I still have the same number of blank values in the column. Could you please provide some input on this? Thanks!
@juandavidlozadacalderon1356 5 years ago
thanks.
@petax004 8 years ago
Can you make a video on the VIM package (GUI preferably)? I like the ability to visualize the missing values using the VIM package along with imputations.
@Protect_the 8 years ago
Start with install.packages("VIM") on R version 3.3.0; it will keep telling you which additional packages to add. Keep adding the supporting packages the pop-up messages in R ask for. It will work.
@sainaveenbali 8 years ago
Can I impute like this in panel data and then proceed as normal, i.e. estimate fixed- and random-effects models and then apply the Hausman test, etc.? Please help.
@antoniorodriguezandres161 5 years ago
Did you find out for panel data?
@chonthanyayosbuth4648 6 years ago
Thank you very much. Do you have any other methods?
@Mythical_motion 6 years ago
Thanks
@maticslearning_science 5 years ago
Hi prof, how do you handle missing values in data that has both nominal and numeric attributes?
@akrsrivastava 8 years ago
The extra columns that kNN created were to keep track of imputed values. Isn't that so?
@SA-gt8lx 5 years ago
I need imputation for a dataset with 1.7 million rows and 27 columns. My PC can't handle this. What should I do?
@rookiedrummer6838 6 years ago
Hi Gourab, thanks for the explanation. I need one small piece of information: how do we decide the value of k?
@TheFreezwater 5 years ago
There is no standard as such. You need to try a couple of odd values of k, see how each model performs, and choose the one that best fits your need. This could change as the source data changes.
@rajdipsaha 3 years ago
Here we can take k = round(sqrt(n)), where n is the number of observations.
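One way to act on the "try a few values of k" advice in this thread can be sketched as follows: hide some known values, impute them with several candidate k, and compare the error against the truth. This is only an illustration under stated assumptions, not the video's method: it uses VIM's bundled `sleep` dataset, the numeric column `Sleep` as the evaluation target, ten held-out cells, and RMSE as the score. All of these choices, and the names `holdout`, `candidate_k`, and `best_k`, are hypothetical.

```r
library(VIM)  # assumes VIM is already installed
data(sleep, package = "VIM")

set.seed(1)
target   <- "Sleep"                          # numeric column to evaluate on
observed <- which(!is.na(sleep[[target]]))   # rows where the true value is known
holdout  <- sample(observed, 10)             # cells we hide on purpose

masked <- sleep
masked[holdout, target] <- NA

candidate_k <- c(1, 3, 5, 7, 9)

# For each k, impute the masked column and score against the hidden truth
rmse <- sapply(candidate_k, function(k) {
  imputed <- kNN(masked, variable = target, k = k, imp_var = FALSE)
  sqrt(mean((imputed[holdout, target] - sleep[holdout, target])^2))
})

data.frame(k = candidate_k, rmse = rmse)
best_k <- candidate_k[which.min(rmse)]
```

The result depends on the random holdout, so repeating the masking a few times with different seeds gives a steadier picture. The `round(sqrt(n))` rule mentioned above can simply be added to `candidate_k` as one more value to compare.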
@ayeledesalegn5367 4 years ago
For more, anybody can watch the video kzbin.info/www/bejne/fWPVkmqBqpiYfa8
@fardazay 4 years ago
There is no library(VIM) in my R. Can you help me please?
@gourabnath9789 4 years ago
Did you install the package? If not, run install.packages("VIM"), then load the package with library(VIM).
@khachiakhadija7738 6 years ago
I get this error :Error in `[.data.table`(data, , sapply(.SD, function(x) all(is.na(x))), : Some items of .SDcols are not column names (or are NA)
@rookiedrummer6838 6 years ago
I am also getting the same error
@vaibhavkabdwal6275 5 years ago
use variable = c("NAME OF COLUMN")
@miguelescobedorivas9763 6 years ago
I already installed the VIM package but it's still showing me this message. > library(VIM) Error: package or namespace load failed for ‘VIM’ in loadNamespace(j
@hans4223 5 years ago
ok
@kennetholelewe9932 8 years ago
library(VIM) can't run on my computer. What could the problem be?
@gourabnath5076 8 years ago
+Kenneth Olelewe Probably the VIM package is not installed in your system. Try installing the package VIM using install.packages("VIM")
@rishabharora5072 5 years ago
@@gourabnath5076 Hi, even I am facing this issue. When I try to install this package it gives an error. Is VIM part of some other package?
@gourabnath9789 5 years ago
@@rishabharora5072 Hi Rishabh! No, VIM is an independent package. If install.packages('VIM') does not work, check whether your version of R is compatible with this package. If it is not, upgrade R and try again.
@rishabharora5072 5 years ago
Gourab Nath, could you please tell me any other package that can run on version 3.5.1? I need to perform imputation for a project. It will be very helpful. I have some 30+ columns and 6K+ rows.
@gourabnath9789 5 years ago
@@rishabharora5072 VIM should be supported on your version. However, you can try other imputation methods like MICE, using the mice package in R.
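For readers who want to try the mice route suggested above, a minimal sketch follows. It assumes the mice package is installed and uses its bundled `nhanes` example dataset. The settings `m = 5`, `method = "pmm"` (predictive mean matching), and `seed = 123` are illustrative choices, not recommendations from the thread.

```r
library(mice)  # assumes mice is already installed
data(nhanes, package = "mice")   # small example dataset containing NAs

# Build 5 imputed versions of the data with predictive mean matching
imp <- mice(nhanes, m = 5, method = "pmm", seed = 123, printFlag = FALSE)

# Extract the first completed dataset (all NAs filled in)
completed <- mice::complete(imp, 1)

colSums(is.na(completed))  # remaining missing values per column
```

If `method` is left unspecified, mice chooses a default per column based on its type, which may be a better fit for data that mixes numeric and factor columns.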