Log Transformation for Outliers | Convert Skewed data to Normal Distribution

  Рет қаралды 29,151

The AI University

The AI University

Күн бұрын

Пікірлер: 33
@TheAIUniversity
@TheAIUniversity 5 жыл бұрын
What are the other techniques I can use to treat outliers or convert negative or positive skewed data into normal distribution form?
@nabeelnaseer5592
@nabeelnaseer5592 5 жыл бұрын
roots, exponents, inverse methods..
@nizamlootera3163
@nizamlootera3163 4 жыл бұрын
In Linear Regression suppose both the variables or features are positively skewed, then we should apply log10 to both of them
@XuanTran-ri1hn
@XuanTran-ri1hn 2 жыл бұрын
How about log 1 plus?
@nabeelnaseer5592
@nabeelnaseer5592 5 жыл бұрын
What can we do if even after the transformation, there are outliers..am kinda puzzled over this notion of natural outliers. Like we are supposed to treat them separately.. can you give some pointers..
@mansirawat1052
@mansirawat1052 Жыл бұрын
Suppose in one of my outcome measures pre is normal but post is not normal, so should I log transform only the post recording or should I transform both the pre and post values for further analysis?
@aakashv4594
@aakashv4594 5 жыл бұрын
What are the functions to be applied for negative skews and also if the data has zero
@ajaykushwaha-je6mw
@ajaykushwaha-je6mw 3 жыл бұрын
Sir what is the correct sequence of variable transformation. First we need to do feature scaling then Gaussian transformation or First Gaussian transformation then feature scaling ?
@pallavijagtap8140
@pallavijagtap8140 3 жыл бұрын
Sir, Once you transform the variables, do we have to use same transformed columns in further process of melling?
@pallavijagtap8140
@pallavijagtap8140 3 жыл бұрын
Pallavi Jagtap 1 second ago Sir, Once you transform the variables, do we have to use same transformed columns in further process of modelling?
@balamurali75
@balamurali75 2 жыл бұрын
Sir small dout I have two variables(independent and Dependent) represented in percentage. If I apply log for only one variable. Will result differs. Is it the correct way of transformation/analysis
@durgadeviarulrajan4560
@durgadeviarulrajan4560 2 жыл бұрын
Hi, Thanks for the great video. Is it necessary to convert all features into normally distributed, before modeling? Is it a compulsory step to follow in feature engineering?
@usmanriaz6241
@usmanriaz6241 Жыл бұрын
It confuses me too. tell me if you know now
@vinayvvalaboju
@vinayvvalaboju 3 жыл бұрын
Can you fix a custom bin And filter data til upper quartile.
@MrNabiwishes
@MrNabiwishes 4 жыл бұрын
Log transformation applied to train set, and when out of sample data comes in do we apply same transformation...
@edphi
@edphi 3 жыл бұрын
Excellent
@elyasmohammadi8409
@elyasmohammadi8409 4 жыл бұрын
Hello and thank you for this nice video. Could you please clarify that what are the axis X and Y before and after log transformation. Thank you in advance
@edphi
@edphi 3 жыл бұрын
Frequency distribution graph
@nikhilgaikwad9954
@nikhilgaikwad9954 4 жыл бұрын
after we transformed the column values using log10. if we build a app using flask what values we should pass for that column to predict the output?? the original value or first we need to transform that value using log 10 and then insert??
@prathameshmistry3868
@prathameshmistry3868 4 жыл бұрын
no,the values are inserted and then transformed in the code
@vineethp8925
@vineethp8925 4 жыл бұрын
@Prathamesh Mistry can u please explain more clearly because iam also having the same doubt
@amitbudhiraja7498
@amitbudhiraja7498 3 жыл бұрын
I have a doubt like what is the optimal method to do remove the outliers [Z-score , IQR method] or use transformation methods like log normal or inverse Can someone tell ?
@creativesurgeinfidel
@creativesurgeinfidel 3 жыл бұрын
Thank you.. Could you please let me know how to convert natural log back to the original value
@shrutimadan4451
@shrutimadan4451 3 жыл бұрын
using log10 transformation, it didnt give normal distribution. How to deal with this?
@aniketsultan9497
@aniketsultan9497 5 жыл бұрын
other methods square root, cube root , binning
@Karthik_info_vlogs
@Karthik_info_vlogs 4 жыл бұрын
Good info
@independent7212
@independent7212 4 жыл бұрын
negatively skewed data to normal distribution?
@rohitjaiswal6102
@rohitjaiswal6102 4 жыл бұрын
Can u share your github link about this codes....
@TheAIUniversity
@TheAIUniversity 4 жыл бұрын
Here you go... github.com/nitinkaushik01/Machine_Learning_Data_Preprocessing_Python/find/master?q=
@ankurkamthan5854
@ankurkamthan5854 4 жыл бұрын
Why should not taken log with base e and y base 10
@nicholaslipanovich827
@nicholaslipanovich827 3 жыл бұрын
The information you communicated to us was fine but your delivery could use some work. Trying to repeat yourself less might help.
@vuminhquanle1426
@vuminhquanle1426 4 жыл бұрын
I listened very carefully, cause I can't understand anything at 1.5x Speed
@fakhrik
@fakhrik 4 жыл бұрын
Why do Indians have to use the word OK so much?
It’s all not real
00:15
V.A. show / Магика
Рет қаралды 13 МЛН
小丑女COCO的审判。#天使 #小丑 #超人不会飞
00:53
超人不会飞
Рет қаралды 9 МЛН
Players push long pins through a cardboard box attempting to pop the balloon!
00:31
Chain Game Strong ⛓️
00:21
Anwar Jibawi
Рет қаралды 29 МЛН
LogTransformations.1.Why Log Transformations for Parametric
10:12
Quantitative Analysis Institute
Рет қаралды 68 М.
Normalization Vs. Standardization (Feature Scaling in Machine Learning)
19:48
Correcting Skewed Data with Scipy and Numpy
11:36
AnalytiCode
Рет қаралды 9 М.
How to Correct Data that Violates the Parametric Assumption of Normality
24:58
Python Feature Scaling in SciKit-Learn (Normalization vs Standardization)
11:59
Ryan & Matt Data Science
Рет қаралды 15 М.
Fitting Probability Distributions to Data with SciPy (Python)
16:23
Mike Saint-Antoine
Рет қаралды 16 М.
Statistics 101: Variable Transformations, LOG Transform in Excel
18:32
It’s all not real
00:15
V.A. show / Магика
Рет қаралды 13 МЛН