Building and Deploying Reproducible Machine Learning Pipelines - Data Science Festival

  Рет қаралды 6,984

Data Science Festival

Data Science Festival

Күн бұрын

Title: Building and Deploying Reproducible Machine Learning Pipelines
Speaker: Soledad and Chris Samiullah - Train In Data
Abstract: Deployment of machine learning (ML) models, or simply, putting ML models into production, is fundamentally about bridging the gap between the research environment and live systems. Successful deployments make our models available so they can be easily accessed by both internal and external systems, depending on business requirements. Once our ML models are deployed, other systems can send input data to these models and receive back predictions. Only through effective machine learning model deployment can we maximize the business value of the models we build. When we think about data science, we think about how to build machine learning models. We think about which algorithm will be more predictive, how to engineer our features and which variables to use to make the models more accurate. However, the “last mile” of planning how to use the models in production is often neglected, despite its critical importance. Machine learning systems have all the usual challenges of software development, combined with additional data science-specific challenges, which means that deployments and system architecture require careful planning. This is a realisation that many individuals and organisations make when it is too late. In this talk, we will discuss the steps and challenges involved in putting a machine learning model into production. We will cover setting up an effective machine learning pipeline for feature engineering, feature selection and model building. We will describe the architecture of the research and production environments and how they can be connected. We will highlight the challenges to obtaining reproducible models between the two environments and how to ensure reproducibility. Finally we will present a machine learning pipeline solution that tackles these problems.
Subscribe to our channel: kzbin.info/door/tas...
Website: datasciencefestival.com/
LinkedIn: / data-science-festival
Twitter: / datasciencefest

Пікірлер: 5
@1skeeta4u
@1skeeta4u 3 жыл бұрын
I came here from your udemy class and I appreciate all the knowledge that you are passing alone. Can't wait to absorb more.
@gerardorosiles8918
@gerardorosiles8918 3 жыл бұрын
same here
@vijayotnm
@vijayotnm 4 жыл бұрын
Thank you guys for the very useful presentation!
@gerardorosiles8918
@gerardorosiles8918 3 жыл бұрын
How about hybrid tech stacks where all ML/DS pipelines all the way to production are done in Python and the rest of the application is in another language. It seems a microservice architecture would be an obvious fit
@1skeeta4u
@1skeeta4u 3 жыл бұрын
What do you mean?
Advanced Feature Engineering Tips and Tricks - Data Science Festival
53:59
Data Science Festival
Рет қаралды 11 М.
Data Science Workflows using Docker Containers
32:10
Chicago Python Users Group
Рет қаралды 38 М.
아이스크림으로 체감되는 요즘 물가
00:16
진영민yeongmin
Рет қаралды 56 МЛН
БОЛЬШОЙ ПЕТУШОК #shorts
00:21
Паша Осадчий
Рет қаралды 10 МЛН
Was ist im Eis versteckt? 🧊 Coole Winter-Gadgets von Amazon
00:37
SMOL German
Рет қаралды 39 МЛН
What is MLOps?
6:55
IBM Technology
Рет қаралды 61 М.
Machine learning: from black boxes to white boxes - Mihaela van der Schaar
1:15:51
The Alan Turing Institute
Рет қаралды 8 М.
How to deploy machine learning models into production
35:39
DataWorks Summit
Рет қаралды 133 М.
It’s Bigger on the Inside - the story of BBC+ - Data Science Festival
36:26
How I Would Learn Data Science (If I Had to Start Over)
8:36
Ken Jee
Рет қаралды 1,4 МЛН
Wayfair Data Science Explains It All: Evaluating Recommender Systems
9:26
Wayfair Data Science
Рет қаралды 12 М.
Complete Dockers For Data Science Tutorial In One Shot
1:19:29
Krish Naik
Рет қаралды 110 М.
Todos os modelos de smartphone
0:20
Spider Slack
Рет қаралды 25 МЛН
Battery  low 🔋 🪫
0:10
dednahype
Рет қаралды 5 МЛН
Samsung Galaxy 🔥 #shorts  #trending #youtubeshorts  #shortvideo ujjawal4u
0:10
Ujjawal4u. 120k Views . 4 hours ago
Рет қаралды 6 МЛН
⚡️Супер БЫСТРАЯ Зарядка | Проверка
1:00