Feeding ML models with the data from the databases in real-time - DevConf.CZ 2024

  Рет қаралды 54

DevConf

DevConf

Күн бұрын

Speaker(s): Vojtech Juranek
---
In today's fast-paced business environment, and especially with the advent of machine learning (ML), organizations are seeking ways to derive better insights from their data as quickly as possible. However, implementing a complete ML pipeline can be quite challenging. It’s even harder if you want to process newly arrived data immediately or you have a legacy system which is not easy to connect with your modern infrastructure . Change Data Capture (CDC) has emerged as a technology for delivering real-time data changes from various sources, especially from the databases. In this talk we will introduce [Debezium](debezium.io/), a leading open source framework for CDC. We will discuss how it can be leveraged for ingesting data from the various databases into ML frameworks like TensorFlow and what the pitfalls are if you go this route. We will also briefly discuss possible future improvements in this area, especially possible integration with emerging ML feature store technology.
The talk will be accompanied by a demo in which well-known example of recognizing handwritten digits using the TensorFlow model and images stored in a Postgres database will be shown. All in real-time.
Attendees will gain an understanding of how Debezium CDC works, how it can help them to ingest data from the source database into the ML framework in real time and also what are the possible challenges with this approach.
---
Full schedule, including slides and other resources:
pretalx.com/de...

Пікірлер
An Unknown Ending💪
00:49
ISSEI / いっせい
Рет қаралды 57 МЛН
How do Cats Eat Watermelon? 🍉
00:21
One More
Рет қаралды 11 МЛН
Win This Dodgeball Game or DIE…
00:36
Alan Chikin Chow
Рет қаралды 40 МЛН
AusNOG 2024 - Tim Raphael - Nokia
27:45
AusNOG
Рет қаралды 458
An Unknown Ending💪
00:49
ISSEI / いっせい
Рет қаралды 57 МЛН