Feature Engineering for Time Series Forecasting - Kishan Manani

24,945 views

DataTalksClub

In this podcast episode, we talked with Kishan Manani about feature engineering for time series forecasting.
0:00 Introduction and Welcome
2:16 Speaker Introduction
2:54 Topic Introduction: Feature Engineering for Time Series Forecasting
4:23 Motivating Example: M5 Forecasting Competition
6:25 Machine Learning for Time Series Forecasting
8:50 Direct Forecasting vs. Recursive Forecasting
10:50 Creating Lag Features
11:45 Handling Exogenous Variables
15:55 Static Features
18:00 Time Series Cross Validation
20:00 Key Differences in Machine Learning Workflow
21:35 Feature Engineering Overview
23:00 Lag Features and Correlation Methods
29:20 Window Features
32:25 Static Features and Encoding
37:25 Avoiding Data Leakage
39:30 Useful Libraries and Tools
40:30 Example with Darts Library
45:00 Conclusions and Q&A
🔗 USEFUL LINKS
- Repo and slides: github.com/KishManani/DataTal...
- Forecasting: Principles and Practice: otexts.com/fpp2/
- International Journal of Forecasting: reader.elsevier.com/reader/sd...
- Temporal Fusion Transformers for interpretable multi-horizon time series forecasting: www.sciencedirect.com/science...
- Interpretable Deep Learning for Time Series Forecasting (blog post): ai.googleblog.com/2021/12/int...
🎙 ABOUT THE PODCAST
At DataTalksClub, we organize live podcasts that feature a diverse range of guests from the data field. Each podcast is a free-form conversation guided by a prepared set of questions, designed to learn about the guests’ career trajectories, life experiences, and practical advice. These insightful discussions draw on the expertise of data practitioners from various backgrounds.
We stream the podcasts on YouTube, where each session is also recorded and published on our channel, complete with timestamps, a transcript, and important links.
You can access all the podcast episodes here - datatalks.club/podcast.html
📚Check our free online courses
ML Engineering course - mlzoomcamp.com
Data Engineering course - github.com/DataTalksClub/data...
MLOps course - github.com/DataTalksClub/mlop...
Analytics in Stock Markets - github.com/DataTalksClub/stoc...
LLM course - github.com/DataTalksClub/llm-...
Read about all our courses in one place - datatalks.club/blog/guide-to-...
👋🏼 GET IN TOUCH
If you want to support our community, use this link - github.com/sponsors/alexeygri...
If you’re a company, support us at alexey@datatalks.club

Comments: 19
@iftikhar58 1 year ago
It was a great talk about data. Thank you so much. I hope you can share similar talks in the future as well.
@jossec1344 1 year ago
Magnificent work Bravo!!!
@ivanliu1173 4 months ago
Thanks for this informative video! 👏👏👏
@user-uj9sw3ze2d 1 year ago
Great talk!
@MinhVu-ym4tk 2 years ago
Good to know :D I am working on RUL estimation and prognosis using time series data.
@jacobschultz3168 1 year ago
Great presentation. To clarify, is overfitting always an issue? I'm assuming it always is. In the scenario where you compute the window values, ensuring you're only using the available data, there will be no leakage at the row level. But when you consider all training values, for example at Time = 1 vs. Time = 8, the relationships being built by the forecasting algorithm when predicting Time = 1 will still use Time = 8 values.
@anoubhav 1 year ago
For two different time series, does it make sense to build two separate models instead of having the targets of both series in a single model (as shown at 24:40)?
@piotrbjastrzebski 11 months ago
It is great, but something is wrong with time_col in the definition of the procedure. It seems to work if that column is an index and is not mentioned in the function call.
@RDarrylR 2 years ago
What is the name/link of the “chunky” review paper you mentioned at the end of the presentation?
@kishanmanani1466 2 years ago
The paper was indeed in the references slide. It is: Petropoulos, Fotios, Daniele Apiletti, Vassilios Assimakopoulos, Mohamed Zied Babai, Devon K. Barrow, Souhaib Ben Taieb, Christoph Bergmeir et al. "Forecasting: theory and practice." International Journal of Forecasting (2022). It's also free to access online.
@RDarrylR 2 years ago
@kishanmanani1466 Thanks! I must have been looking in the wrong place!
@oneforallah 1 year ago
@kishanmanani1466 Thanks!
@AhmedThahir2002 10 months ago
Hi @kishanmanani1466, it was a lovely talk. I was wondering if you could point me in the direction of how to implement the recursive forecasting that you did in Darts, but using sktime. I couldn't really find an intuitive explanation online.
@AhmedThahir2002 10 months ago
Hi, does anyone know how to implement the recursive forecasting that he did in Darts, but using sktime? I couldn't really find an intuitive explanation online.
@gurjinderkaur5007 4 months ago
In the target encoding section, when the product ID is encoded dynamically, how will the model distinguish between data points belonging to the same time series and those belonging to different time series?
@mamyrak1114 3 months ago
Can someone help me deal with categorical features for time series forecasting in ML?
@pranavkhatri9564 11 months ago
Can you explain something about stock prediction?
@b1ueocean 5 months ago
What tools are folks using to expose/extract/generate features? Tsfresh? getML? I work in Java for my ML tasks but will happily integrate Python or C/C++ based tools into the pipeline. I'm not a statistics guy, so I can't write these feature generation algos myself.
@dariozoric7181 1 year ago
Great talk!