

The Linux Foundation


Unveil the Magic Without Hoodini: Transform Your Machine Learning Pipelines with Apache Hudi - Nadine Farah, Onehouse
ML pipelines integrated with data lakes have emerged as a potent combination, enabling organizations to derive actionable insights from vast reservoirs of raw data. However, this integration presents distinct challenges. ML workloads need data that is consistently fresh, accurate, and available in near real time. Traditional data lakes, while scalable, treat stored data as immutable, which makes it hard to deal with data latency, incremental updates, and timely data availability for ML models.
Apache Hudi introduces features and services for upserts, incremental processing, and near real-time access for data lakes. Hudi natively supports efficient upserts, record-level updates, and deletions, so that ML models have access to the latest data. Furthermore, Hudi’s time-travel queries and incremental data pulls let ML practitioners work with historical versions of the data and detect potential model drift. In this talk, attendees will learn:
Challenges with building ML pipelines on data lakes
How Hudi unlocks analytics on the data lake
How to build efficient ML pipelines with incremental processing on the data lake
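
As a rough illustration of the three Hudi capabilities named in the abstract (upserts, incremental pulls, and time-travel queries), here is a minimal sketch of how they are commonly configured through the Hudi Spark datasource from Python. It is not taken from the talk. The option keys follow the Hudi Spark datasource configuration as documented for recent releases and should be checked against the version in use. The helper function names, and the table, column, and instant values in the usage note, are hypothetical.

```python
from typing import Dict


def hudi_upsert_options(table_name: str, record_key: str,
                        precombine_field: str, partition_field: str) -> Dict[str, str]:
    """Write options for a record-level upsert into a Hudi table."""
    return {
        "hoodie.table.name": table_name,
        # "upsert" updates rows whose record key already exists, inserts the rest.
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": record_key,
        # When two incoming rows share a key, the larger precombine value wins.
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.partitionpath.field": partition_field,
    }


def hudi_incremental_options(begin_instant: str) -> Dict[str, str]:
    """Read options that return only records committed after begin_instant."""
    return {
        "hoodie.datasource.query.type": "incremental",
        "hoodie.datasource.read.begin.instanttime": begin_instant,
    }


def hudi_time_travel_options(as_of_instant: str) -> Dict[str, str]:
    """Read options that return the table as it was at as_of_instant."""
    return {"as.of.instant": as_of_instant}


def write_hudi(df, base_path: str, options: Dict[str, str]) -> None:
    """Append-mode write; expects a Spark DataFrame (assumed, not imported here)."""
    df.write.format("hudi").options(**options).mode("append").save(base_path)


def read_hudi(spark, base_path: str, options: Dict[str, str]):
    """Read a Hudi table; expects a SparkSession started with the Hudi bundle."""
    return spark.read.format("hudi").options(**options).load(base_path)
```

With a Spark session that has the Hudi bundle on its classpath, a feature-refresh job might call `write_hudi(features_df, path, hudi_upsert_options("user_features", "user_id", "updated_at", "event_date"))`, and a retraining job might then pull only the changed rows with `read_hudi(spark, path, hudi_incremental_options("20240101000000"))`. All of these names and values are placeholders.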
