Normalization Vs. Standardization (Feature Scaling in Machine Learning)

  Рет қаралды 63,191

Prof. Ryan Ahmed

Prof. Ryan Ahmed

Күн бұрын

In this video, we will cover the difference between normalization and standardization.
Feature Scaling is an important step to take prior to training of machine learning models to ensure that features are within the same scale.
Normalization is conducted to make feature values range from 0 to 1.
Standardization is conducted to transform the data to have a mean of zero and standard deviation of 1.
Standardization is also known as Z-score normalization in which properties will have the behavior of a standard normal distribution.
Check top-rated Udemy courses below:
10 days of No Code AI Bootcamp
www.udemy.com/course/10-code-...
Modern Artificial Intelligence with Zero Coding
www.udemy.com/course/modern-a...
Python & Machine Learning for Financial Analysis
www.udemy.com/course/ml-and-p...
Modern Artificial Intelligence Masterclass: Build 6 Projects
www.udemy.com/course/modern-a...
AWS SageMaker Practical for Beginners | Build 6 Projects
www.udemy.com/course/practica...
Data Science for Business | 6 Real-world Case Studies
www.udemy.com/course/data-sci...
AWS Machine Learning Certification Exam | Complete Guide
www.udemy.com/course/amazon-w...
TensorFlow 2.0 Practical
www.udemy.com/course/tensorfl...
TensorFlow 2.0 Practical Advanced
www.udemy.com/course/tensorfl...
Machine Learning Regression Masterclass in Python
www.udemy.com/course/machine-...
Machine Learning Practical Workout | 8 Real-World Projects
www.udemy.com/course/deep-lea...
Machine Learning Classification Bootcamp in Python
www.udemy.com/course/machine-...
MATLAB/SIMULINK Bible|Go From Zero to Hero!
www.udemy.com/course/matlabsi...
Python 3 Programming: Beginner to Pro Masterclass
www.udemy.com/course/python-3...
Autonomous Cars: Deep Learning and Computer Vision in Python
www.udemy.com/course/autonomo...
Control Systems Made Simple | Beginner's Guide
www.udemy.com/course/control-...
Artificial Intelligence in Arabicالذكاء الصناعي مبتدئ لمحترف
www.udemy.com/course/artifici...
The Complete MATLAB Computer Programming Bootcamp
www.udemy.com/course/the-comp...
Thanks and see you in future videos!
#featurescaling #normalization

Пікірлер: 108
@samuelkoramoah3552
@samuelkoramoah3552 Жыл бұрын
this is by far the best explanation I've come across. So simple to understand. Thank you Prof. You just earned a follower!!
@jingyiwang5113
@jingyiwang5113 11 ай бұрын
I am really grateful for your detailed explanation! I am self studying machine learning this summer holiday. And I am at this point now. I am so confused before watching your video. Now I finally understand this point. Thank you so much!
@alexismachado2262
@alexismachado2262 2 жыл бұрын
Great explanation however i think saying scaling is not required for distance based algorithm is wrong as these algorithm are most affected by the range of features. Can you comment on this.
@rafaelposadas2341
@rafaelposadas2341 7 ай бұрын
I think the same
@shahzarhusain3662
@shahzarhusain3662 3 ай бұрын
Exactly! Scaling is crucial for distance based algorithm.
@bernardesp_
@bernardesp_ Ай бұрын
I believe that such as in the case of k-means, the algorithm calculates distances based on column versus same column as opposed to a neural network were each column can have a impact on target output. As distances are measured in the same scale (column x column), of course one feature is going to affect more clusterization {for instance}, but that's the point of k-means, we want to see which features describe data distribution across dimensions.
@bogdancristurean73
@bogdancristurean73 Жыл бұрын
This was pretty clearly explained. For anyone else looking for this, the standardization chapter begins at 6:49.
@twanwolthaus
@twanwolthaus 4 ай бұрын
Your explanation is as amazing as a rainbow cloud after a thunderstorm!!! I'm so glad I found this visual explanation!
@1littlehelper
@1littlehelper 7 ай бұрын
Hi Professor, thank you so much for this video! Clear and concise you have no idea how much I needed this. Keep up the great work, I will be sure to check out your other videos as well 😊
@GreatIndiaTrips
@GreatIndiaTrips Жыл бұрын
Well explained about standardization and normalization.Now i got full clarity on these topics.Thanks for taking this effort and explaining in this way.
@vskraiml2032
@vskraiml2032 2 жыл бұрын
Impressed with your way of teaching. You are explaining very well with the right examples... awesome work of you... One small request is that in your playlist sequence of 'Artificial Intelligence, Machine Learning, and Deep Learning' is jumbled, please keep the playlist in order for easy learning.
@ifeanyiedward2789
@ifeanyiedward2789 Жыл бұрын
Thank you so much Professor Ryan. You just made my life easy. best explanation. so simple to understand even for someone who doesnt have a background knowledge in machine learning.
@sukhwinder101
@sukhwinder101 4 ай бұрын
For ML context : if data is following gaussian distribution ( bell shape) follow standard deviation else go with normalisation ( improves cluster scaling as well).
@57_faizalabdillah99
@57_faizalabdillah99 Жыл бұрын
Amazing Explanation.. Just in one run, i get your whole point in an easy way. Big Thanks
@beloaded3736
@beloaded3736 Ай бұрын
This professor is so pleasant for all senses. Thanks for sharing knowledge selflessly :)
@Sickkkkiddddd
@Sickkkkiddddd 2 жыл бұрын
Came here from your udemy course. You are a life saver, prof!
@yasmineelezaby5197
@yasmineelezaby5197 9 ай бұрын
Thank you so much! I couldn't wait to end this video before thanking you ! you made it super clear.
@fiqrifirdaus
@fiqrifirdaus 8 күн бұрын
clear as a crystal, thankyou
@lethalgaming7087
@lethalgaming7087 Ай бұрын
Thank You Leonard Hofstadder..🙂
@professor-ryanahmed
@professor-ryanahmed Ай бұрын
Hahaha thanks ❤️😂
@atharvambokar573
@atharvambokar573 Жыл бұрын
This was such a crystal clear explanation! Thank you so much sir!
@yosefasefaw4207
@yosefasefaw4207 Жыл бұрын
amazing video! clearly explained! Congratulation Professor !
@PJ-od9ev
@PJ-od9ev Жыл бұрын
A great scientist and teacher. keep it up, sir. thank you.
@albertoavendano7196
@albertoavendano7196 Жыл бұрын
Many thanks for this video... One of the best explanations ever seen by me
@anuradhabalasubramanian9845
@anuradhabalasubramanian9845 2 жыл бұрын
Fantastic Explanation Sir ! Thanks so much !
@nutanaigal9761
@nutanaigal9761 Жыл бұрын
thanks a lot ...worth watching..u explanined each concept in a simple way...
@louisCS502
@louisCS502 23 күн бұрын
the outlier thing is so crucial actually damn, i havent seen this is in a machine learning course before, banger
@catulopsae
@catulopsae 10 ай бұрын
Awesome. I understand finally. Very good explanation. Easy to follow
@AndromedHH
@AndromedHH Жыл бұрын
Fantastic explanation ! Thank you so much.
@shadyshawky6737
@shadyshawky6737 2 жыл бұрын
Very Clear Explanation. Thank you :)
@AbrahamStrange-tt4fv
@AbrahamStrange-tt4fv Жыл бұрын
Great explanation. Thank you very much, Sir!
@user-yk3ec4fl5v
@user-yk3ec4fl5v Жыл бұрын
This is my first time that I am watching your video.. You look very ..very much similar to Saif Ali Khan.. In fact the smile is also same. One like vote from me. A gentle smile on face make you different from all the others.
@amrittiwary080689
@amrittiwary080689 Жыл бұрын
Great video, would say we need scaling for distance-based as it will get wrong results if features are on different scales. We don't need scaling for tree-based as they are not susceptible to variance.
@sanjeevjangra84
@sanjeevjangra84 3 ай бұрын
Awesome explanation. Thank you!
@vijayarana2087
@vijayarana2087 Жыл бұрын
Many thanks for this video... One of the best explanations
@tasnimsart3430
@tasnimsart3430 Жыл бұрын
Such a great explanation. Thank you very much
@zanyatta1
@zanyatta1 4 ай бұрын
The best simple explanation ever
@algosavage7057
@algosavage7057 2 жыл бұрын
good. clearly explained. thanks
@louisCS502
@louisCS502 23 күн бұрын
thank you boss man, just used normalization instead of standardization, life saver
@mahamadounouridinemamoudou9875
@mahamadounouridinemamoudou9875 Жыл бұрын
thank you very much, I can't pass without thanking you and subscribe for the clarity you gave me on that topic
@leixiao169
@leixiao169 6 ай бұрын
Thank you for the clear explanation!
@muhammadabdurrazaq2069
@muhammadabdurrazaq2069 9 ай бұрын
Thank you for your best explanation as easy to understand
@KarinaRodriguez-tb6ol
@KarinaRodriguez-tb6ol 2 жыл бұрын
Amazing explanation!
@jyothsnaraajjj
@jyothsnaraajjj Жыл бұрын
Excellent explanation.
@ItsTheGameDude
@ItsTheGameDude 5 ай бұрын
Thank you so much, Prof!
@EvaPev
@EvaPev 7 ай бұрын
Outstanding content.
@MariaDonayreJackson
@MariaDonayreJackson 2 ай бұрын
Excellent thanks!!!
@user-zb5zi3ll3g
@user-zb5zi3ll3g 4 ай бұрын
Informative!
@4abdoulaye
@4abdoulaye 2 жыл бұрын
Appreciated it, Thanks.
@saremish
@saremish 11 ай бұрын
Excellent!
@jimherebarbershop8188
@jimherebarbershop8188 2 жыл бұрын
Gr8 explanation!!!
@remmaria
@remmaria 2 жыл бұрын
Great explanation!! Could you say more about when the input is image datasets - like CNNs?
@deepakkumar-ej1je
@deepakkumar-ej1je 5 ай бұрын
Hello Professor, Video was able to explain the concepts and its practical implementation in a concise manner. Awesome work
@professor-ryanahmed
@professor-ryanahmed Ай бұрын
Many thanks!
@caliguy1260
@caliguy1260 3 ай бұрын
Awesome explanation for a beginner like me. Wish I had access to the S&P 500 dataset.
@sololife9403
@sololife9403 Жыл бұрын
Thank you Prof!
@poizn5851
@poizn5851 2 жыл бұрын
Thank you it is helpful
@FRANKWHITE1996
@FRANKWHITE1996 Жыл бұрын
Thanks for sharing ❤
@gaberhassan3972
@gaberhassan3972 9 ай бұрын
Great job 👏👏❤
@sanumioluwafemi7247
@sanumioluwafemi7247 Жыл бұрын
Thank you for this video
@professor-ryanahmed
@professor-ryanahmed Жыл бұрын
My pleasure
@joguns8257
@joguns8257 Жыл бұрын
Superb illustration.
@professor-ryanahmed
@professor-ryanahmed Жыл бұрын
Thank you so much 😀
@joguns8257
@joguns8257 Жыл бұрын
@@professor-ryanahmed You're welcome, Prof. Please, the link to the dataset?
@noonereally0007
@noonereally0007 6 ай бұрын
hey professor, that was a very cool and simple video to follow and understand, could i ask for where i cold find the notebook you used at the end to use?
@peaceadesina
@peaceadesina Жыл бұрын
Thank you!
@muralidhargrao
@muralidhargrao Жыл бұрын
Hi Prof. Ryan, Thank you for explaining the subject in a simple manner. I have a Human Resources situation at hand. We have an employee appraisal system and the rating is on a 6 point scale (ranging from Poor performer to Outstanding performer). We have 15 departmental heads who rate their respective team members on this 6 point rating scale. However, there are immense biases that creep in during evaluation. Also, some evaluators are tougher/lenient than others. Consequently, we end up with different ranges/averages. As the ratings are linked to incentives, sometimes, good performers lose out against their peers in other departments. I intend to eliminate this bias/lack of neutrality which have been rated by 15 different departments (for 1000 employees). Can you suggest how I should go about this situation please. Regards...Muralidhar
@NickMaverick4
@NickMaverick4 2 ай бұрын
Good theoretical explanation.. but I think scaling is used for k means, knn
@zahra-pl1sk
@zahra-pl1sk 2 ай бұрын
SUPEEEEEEEER clair. thanks
@odosmatthews664
@odosmatthews664 Жыл бұрын
Can you show an example of scaling with train test split? Do you scale the train and test data with the same scaler?
@dunwally2433
@dunwally2433 Жыл бұрын
Can you share the dataset you used for this demo pls?
@samws_4
@samws_4 10 ай бұрын
Helpful!
@mamounarakza5951
@mamounarakza5951 7 ай бұрын
حبيبي يا بروف
@patientmuke7008
@patientmuke7008 Жыл бұрын
For supervised algorithms, can we used both as data input ?
@hasszhao
@hasszhao 2 жыл бұрын
thx Prof
@user-le2cc1yt8n
@user-le2cc1yt8n 6 ай бұрын
رائع .. متميز
@anp9929
@anp9929 Жыл бұрын
you've not missed a single base brother. what an explain
@arjundev4908
@arjundev4908 2 жыл бұрын
He used to be on Stemplicity as well.
@zaldi19
@zaldi19 Жыл бұрын
Question, what if our model encounters bigger value than what we had in training data? How do we handle that
@asyakatanani8181
@asyakatanani8181 11 ай бұрын
as always: outstanding! Your enthusiasm is inspiring... On the other hand, it is clear why tree-based algorithms do not require feature scaling. However, distance-based algorithms such as K nearest Neighbors and K-means require Euclidean Distance calculation which means that feature scaling is necessary with them. Am I wrong?
@whynot13
@whynot13 8 ай бұрын
I think you should scale features for K-means and K-nn. Think about it intuitively. If you are looking at two points and their x y (feature) distances, how would you want to define their closeness? Do you want their features to be considered equally when calculating your distance or is one feature more important then the other ? If you want both x and y to be considered on equal playing fields, then you should scale them so that the distance computed reflects their importance. Scale each feature by the method that makes more since to that feature. This is most likely [0 to 1] across samples.
@believer8754
@believer8754 13 күн бұрын
top explanation along with code, can you upload the notebook file with each video u explain . thanks
@user-lk1fd7lz3c
@user-lk1fd7lz3c Жыл бұрын
thx
@liviumircea6905
@liviumircea6905 Жыл бұрын
Good
@amineazemour2476
@amineazemour2476 Жыл бұрын
top of the top
@lucasgonzalezsonnenberg3204
@lucasgonzalezsonnenberg3204 9 ай бұрын
Firstly, I like very much your explination. Secondly, I would like to know, how do you plot the row and rescalled data? Do you use the histograms function from pandas? Thank you very much and keep working so on!
@lucasgonzalezsonnenberg3204
@lucasgonzalezsonnenberg3204 9 ай бұрын
I have all ready founded. :D import seaborn as sns sns.pairplot(df)
@mahmodi5timetolearn
@mahmodi5timetolearn 7 ай бұрын
The best, marhaba
@TheOraware
@TheOraware Жыл бұрын
At 11:27 you mentioned in last bullets that scaling is not required for K-NN and SVM is not correct. K-NN and SVM exploits distances or similarities they do require scaling.
@floriant9104
@floriant9104 Жыл бұрын
very true!!
@plowface
@plowface 3 ай бұрын
I'm finding a lot of sources are saying feature scaling is advised when using k nearest neighbours. Is there more nuance to this point? Is scaling required after all?
@joguns8257
@joguns8257 Жыл бұрын
Please, where's the link to the dataset? I'd really appreciate if you can paste it here, Prof. Thanks a lot.
@ArvindKumar-vr4gf
@ArvindKumar-vr4gf Жыл бұрын
How to apply z score normalisation in live data ??? 🙏🙏🙏
@andyh3970
@andyh3970 3 ай бұрын
could you put a link to the csv file so we can download and try the exercise ourselves please?
@ARCsGARDEN
@ARCsGARDEN Жыл бұрын
Can you please share the github repo link for accessing the data files used in the video
@alhelalyhossam
@alhelalyhossam Жыл бұрын
I really liked your explanation, thanks P.S. Are you Egyptian? I mean your accent is perfect, but your pauses while speaking give the intuition that you're from the Great Egypt.
@yamanarslanca8325
@yamanarslanca8325 11 ай бұрын
11:40 wait I am confused now, because I thought that since the distance of the data is so important in algorithms such as kNN, SVM etc. scaling is a MUST pre-process step, but now you are saying that it is not required ? Could you please clarify this ?
@SaFFire123x
@SaFFire123x 11 күн бұрын
Just came from a KMeans clustering course that demonstrates how normalization results in better clusters. But at 11:40, you say KMeans clustering doesn't require standardization or normalization. I'm confused.
@yossryasser2646
@yossryasser2646 10 ай бұрын
where can I get the dataset?
@jiberuba8856
@jiberuba8856 2 жыл бұрын
Thank you. Where I can download the notebook code?
@ShawnBecker11
@ShawnBecker11 2 жыл бұрын
I also have this question
@cvino0618
@cvino0618 10 ай бұрын
Could've added this into your udemy course
@jeffkamuthu3276
@jeffkamuthu3276 2 ай бұрын
A whole semester in 20 minutes
@chandrasekharnettem1537
@chandrasekharnettem1537 Жыл бұрын
distance-based methods assume that features are normalized?. feature scaling is required?. please confirm that?. tree-based does not need scaling
@things-tz8dj
@things-tz8dj 5 ай бұрын
dataset please
@mariwanahmad9362
@mariwanahmad9362 Жыл бұрын
Dear Rayan, how to test a scaled data model. i used this way the predict value is very different X_testing=np.array([[550,440,110,0,0,0,0.33,400,8.8,0,863,771]]) #78.6 ypred=model.predict(scaler.fit_transform( X_testing)) # predicted should be 78 , but i got [[0.17291696]] also without fit_transform also the value is different. many thanks for you replay.
Standardization Vs Normalization- Feature Scaling
12:52
Krish Naik
Рет қаралды 290 М.
I wish I could change THIS fast! 🤣
00:33
America's Got Talent
Рет қаралды 70 МЛН
Khó thế mà cũng làm được || How did the police do that? #shorts
01:00
World’s Deadliest Obstacle Course!
28:25
MrBeast
Рет қаралды 131 МЛН
🤖🧠AI vs ML vs DL vs Data Science for absolute beginners
17:10
Prof. Ryan Ahmed
Рет қаралды 12 М.
Z-Scores, Standardization, and the Standard Normal Distribution (5.3)
6:57
Simple Learning Pro
Рет қаралды 1,2 МЛН
Python Feature Scaling in SciKit-Learn (Normalization vs Standardization)
11:59
Standardization vs Normalization Clearly Explained!
5:48
Normalized Nerd
Рет қаралды 120 М.
Why Neural Networks can learn (almost) anything
10:30
Emergent Garden
Рет қаралды 1,2 МЛН
Support Vector Machines: All you need to know!
14:58
Intuitive Machine Learning
Рет қаралды 134 М.
Feature Engineering Techniques For Machine Learning in Python
47:58
Normalization & Standardization
15:36
DataR Labs
Рет қаралды 8 М.
All Learning Algorithms Explained in 14 Minutes
14:10
CinemaGuess
Рет қаралды 182 М.
Secret Wireless charger 😱 #shorts
0:28
Mr DegrEE
Рет қаралды 1,8 МЛН
SSD с кулером и скоростью 1 ГБ/с
0:47
Rozetked
Рет қаралды 219 М.
Собери ПК и Получи 10,000₽
1:00
build monsters
Рет қаралды 1,4 МЛН