Understanding Pipeline in Machine Learning with Scikit-learn (sklearn pipeline)

39,607 views

Dr. Data Science

1 day ago

In machine learning and data science, you often need to apply a sequence of transformations to the input data (such as selecting a subset of features or generating new features) before fitting a final estimator. Scikit-learn's Pipeline gives you a single interface that encapsulates the transformers and the final predictor, which simplifies this process and is especially useful because transformers are usually combined with estimators for preprocessing. To be precise, a Pipeline sequentially applies a list of transformers followed by a final estimator. The purpose of the pipeline is therefore to assemble several steps that can be cross-validated together while setting different parameters. In this video, we discuss the Python implementation of Pipeline using a polynomial regression example.
#Transformer #Estimator #Pipeline
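
As a minimal sketch of the idea described above (not the code from the video), the following builds a polynomial regression pipeline and cross-validates it over different polynomial degrees. The synthetic data, the step names `poly` and `reg`, and the degree grid are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic data (an assumption for illustration): a noisy cubic in one feature.
rng = np.random.default_rng(0)
X = rng.uniform(-2, 2, size=(200, 1))
y = 0.5 * X[:, 0] ** 3 - X[:, 0] + rng.normal(scale=0.1, size=200)

# Each step is a (name, object) tuple. Every step except the last must be a
# transformer; the last step is the final estimator.
pipe = Pipeline([
    ("poly", PolynomialFeatures(include_bias=False)),
    ("reg", LinearRegression()),
])

# Parameters of a step are addressed as "<step name>__<parameter name>",
# so the whole pipeline can be cross-validated while varying the degree.
search = GridSearchCV(pipe, param_grid={"poly__degree": [1, 2, 3, 4, 5]}, cv=5)
search.fit(X, y)

best_degree = search.best_params_["poly__degree"]
r2 = search.best_estimator_.score(X, y)
print("best degree:", best_degree, "R^2 on the training data:", round(r2, 3))
```

Because the transformer is refit inside every cross-validation fold, the preprocessing never sees the held-out data, which is one practical reason to wrap the steps in a single Pipeline object.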

Comments: 26
@AnkitGupta005 2 years ago
Short and crisp. Thank you!
@fabianaltendorfer11 1 year ago
That's a great introduction to pipelines! Thanks
@DrDataScience 1 year ago
Thank you!
@kianaliaghat7740 2 years ago
Thanks for your short, useful introduction! It helped me a lot.
@DrDataScience 2 years ago
Thanks for the comment.
@Hajar1992ful 1 year ago
Thank you for this useful video!
@gabrielmarchioli4669 2 years ago
Great video. Helped me a lot
@aszx-tv4pq 19 days ago
Hi there, I'm very happy with this channel. Could you explain the pipeline part a bit more simply?
@maxwellpatten9227 5 months ago
This is excellent. Thank you
@DrDataScience 5 months ago
Thanks!
@muhammadjamalahmed8664 2 years ago
Love your tutorials.
@DrDataScience 2 years ago
Thanks!
@sebacortes8812 1 year ago
Thank you very much, greetings from Chile!!
@DrDataScience 1 year ago
Thank you!
@hiba8484 1 year ago
Thanks, it's really helpful.
@DrDataScience 1 year ago
Thanks for watching this video!
@nachoeigu 1 year ago
I have one big question: what is the difference between building a machine learning application with Pipeline and building one with an OOP technique? To me they look the same.
@DrDataScience 1 year ago
Everything in Python is an object, so we use OOP all the time. Pipeline is itself a class, and it provides a nice, flexible way to combine multiple transformers and an estimator.
@burakakay6632 1 year ago
Thank you :=}
@DrDataScience 1 year ago
You are welcome!
@adiver_ 6 months ago
Hello. Since you imported PolynomialFeatures and transformed the independent variable (X_train) so it can be fitted with a polynomial regression, why did you put LinearRegression() as the estimator in the last tuple of the list? Shouldn't you have used the polyfit function or something else? NOTE: I am a beginner here, so my doubts may be silly.
@DrDataScience 6 months ago
Good question! We have already created all the polynomial terms that we need, i.e., x, x^2, x^3, etc. Thus, we can now view this as a linear regression problem with respect to the "new/artificial" features.
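
The reply above can be checked numerically. The sketch below is an illustration on assumed synthetic data (not the data from the video): it fits the pipeline, fits a plain linear regression on hand-built columns x and x^2, and compares both with `np.polyfit`.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic quadratic data (an assumption for illustration).
rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, size=50)
y = 2.0 * x**2 - x + 0.5 + rng.normal(scale=0.05, size=50)
X = x.reshape(-1, 1)

# Pipeline: generate the columns [x, x^2], then fit ordinary linear regression.
pipe = Pipeline([
    ("poly", PolynomialFeatures(degree=2, include_bias=False)),
    ("reg", LinearRegression()),
]).fit(X, y)

# The same model built by hand: linear regression on the "new" features.
X_manual = np.column_stack([x, x**2])
manual = LinearRegression().fit(X_manual, y)
same_coefs = np.allclose(pipe.named_steps["reg"].coef_, manual.coef_)

# np.polyfit returns coefficients from the highest power down: [a2, a1, a0].
a2, a1, a0 = np.polyfit(x, y, deg=2)
matches_polyfit = np.allclose(
    [a1, a2, a0], [manual.coef_[0], manual.coef_[1], manual.intercept_]
)
print(same_coefs, matches_polyfit)
```

All three approaches solve the same least-squares problem, which is why a linear estimator is the right final step once the polynomial terms exist as features.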
@adiver_ 6 months ago
I appreciate your reply; it cleared up exactly what I was asking. Thanks @DrDataScience
@adiver_ 6 months ago
@DrDataScience One more thing I'd like to ask, if you can spare some time: I have seen people scale the features with StandardScaler() before the polynomial features and the estimator in a Pipeline. Is the scaling a necessary step, or can we skip it?
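
One way to explore the scaling question is to try it. The sketch below is not an answer from the channel; it uses assumed synthetic data and compares a pipeline with and without a `StandardScaler` step. For ordinary least squares with an intercept, the two pipelines span the same polynomial space, so their predictions should agree up to floating-point error. That equivalence does not carry over to regularized estimators such as Ridge or Lasso, or to estimators trained by gradient descent, where scaling generally changes the result.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

# Synthetic data (an assumption for illustration).
rng = np.random.default_rng(2)
X = rng.uniform(0, 10, size=(100, 1))
y = 0.3 * X[:, 0] ** 2 - 2.0 * X[:, 0] + rng.normal(scale=0.5, size=100)

# Pipeline without scaling.
plain = Pipeline([
    ("poly", PolynomialFeatures(degree=2, include_bias=False)),
    ("reg", LinearRegression()),
]).fit(X, y)

# Same pipeline with a scaling step in front.
scaled = Pipeline([
    ("scale", StandardScaler()),
    ("poly", PolynomialFeatures(degree=2, include_bias=False)),
    ("reg", LinearRegression()),
]).fit(X, y)

# Compare the predictions of both pipelines on a few new points.
X_new = np.linspace(0, 10, 5).reshape(-1, 1)
same_predictions = np.allclose(plain.predict(X_new), scaled.predict(X_new))
print(same_predictions)
```

Scaling can still be worth keeping for high polynomial degrees, since powers of large raw values make the least-squares problem poorly conditioned.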
@rishidixit7939 1 month ago
Why are all arrays converted to column matrices when using sklearn?
@DrDataScience 1 month ago
Because each column corresponds to a feature or attribute of your data set. Thus, the number of elements in that column vector is equal to the number of samples.
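
A small sketch of what this means in practice, using toy numbers assumed for illustration: scikit-learn estimators expect the input as a 2D array of shape (n_samples, n_features), so a single feature stored as a 1D array is reshaped into one column.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# One feature measured on four samples, stored as a 1D array of shape (4,).
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 4.0, 6.0, 8.0])

# Reshape to a column: 4 rows (samples) by 1 column (feature).
X = x.reshape(-1, 1)
model = LinearRegression().fit(X, y)
print(X.shape, model.coef_)

# Passing the 1D array directly is rejected, because scikit-learn cannot tell
# whether it holds 4 samples of 1 feature or 1 sample of 4 features.
try:
    LinearRegression().fit(x, y)
    raised = False
except ValueError:
    raised = True
print("1D input rejected:", raised)
```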