Delta Lake Deep Dive: Liquid Clustering

3,417 views

Delta Lake

5 months ago

Join us on Thursday, December 7 at 10AM PST for an enlightening session on Delta Lake's Liquid Clustering, a transformative approach in data management and optimization with Vítor Teixeira, Senior Data Engineer at Veeva Systems.
Liquid Clustering is Delta Lake's answer to the complex challenges of Big Data. Traditionally, partitioning and Z-Order clustering have been used to improve query performance by managing large datasets effectively. However, these methods come with limitations such as complexity in implementation, rigidity in data layout, and the need for frequent data rewrites. Delta Lake’s Liquid Clustering offers a dynamic solution. It allows for flexible redefinition of clustering keys without the need to rewrite existing data, adapting effortlessly to evolving analytic needs.
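As a rough illustration of what redefining clustering keys looks like in practice, here is a minimal sketch that only builds the SQL statements as strings. It assumes the `CLUSTER BY` syntax described in the Delta Lake documentation; whether `ALTER TABLE ... CLUSTER BY` is available depends on your Delta Lake or Databricks runtime version. The table name `events`, its columns, and the helper function names are hypothetical.

```python
from typing import Sequence


def cluster_by_clause(columns: Sequence[str]) -> str:
    """Build a CLUSTER BY clause from a list of column names."""
    if not columns:
        raise ValueError("at least one clustering column is required")
    return "CLUSTER BY (" + ", ".join(columns) + ")"


def create_table_sql(table: str, schema: str, columns: Sequence[str]) -> str:
    """SQL to create a Delta table with liquid clustering enabled."""
    return f"CREATE TABLE {table} ({schema}) USING DELTA {cluster_by_clause(columns)}"


def alter_clustering_sql(table: str, columns: Sequence[str]) -> str:
    """SQL to change the clustering keys of an existing table."""
    return f"ALTER TABLE {table} {cluster_by_clause(columns)}"


def optimize_sql(table: str) -> str:
    """SQL to trigger clustering of the table's data."""
    return f"OPTIMIZE {table}"


# Hypothetical table: start clustered by timestamp, later switch keys.
statements = [
    create_table_sql("events", "id BIGINT, user_id BIGINT, ts TIMESTAMP", ["ts"]),
    alter_clustering_sql("events", ["user_id", "ts"]),
    optimize_sql("events"),
]
for stmt in statements:
    print(stmt)
```

With a Spark session configured for Delta, each string could be passed to `spark.sql(...)`. Note that no partition columns appear anywhere: the clustering keys are the only layout decision.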
This session will cover how Liquid Clustering simplifies data layout decisions and optimizes query performance, marking a significant advancement over traditional partitioning and Z-Order clustering methods. Don’t miss this opportunity to learn about Liquid Clustering and how it can revolutionize your data management strategy.
Quick Links
Join us on Slack: go.delta.io/slack
GitHub: github.com/delta-io
Join Google Groups: groups.google.com/forum/#!for...

Comments: 7
@alexischicoine2072 1 month ago
It's a great combo with deletion vectors, as you don't have to rewrite the data. Without deletion vectors, it could make deletes more expensive, as the data would be spread and mixed across files.
@alexischicoine2072 1 month ago
Very interesting. For Z-ordering, you can store the columns in table properties at table creation and then retrieve them when optimizing; it's not that much code.
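The approach described in this comment could look roughly like the sketch below. It assumes the Z-order columns were saved as a comma-separated string under a custom table property; the key `zorder.columns` and the function name are hypothetical, not a Delta Lake convention. In a real job the properties would be read from the table first (for example with `SHOW TBLPROPERTIES`); here they are passed in as a plain dictionary.

```python
from typing import Mapping

# Hypothetical custom property key chosen at table creation time.
ZORDER_PROPERTY = "zorder.columns"


def zorder_optimize_sql(table: str, properties: Mapping[str, str]) -> str:
    """Build an OPTIMIZE statement from Z-order columns stored in table properties.

    Falls back to a plain OPTIMIZE when the property is missing or empty.
    """
    raw = properties.get(ZORDER_PROPERTY, "")
    columns = [c.strip() for c in raw.split(",") if c.strip()]
    if not columns:
        return f"OPTIMIZE {table}"
    return f"OPTIMIZE {table} ZORDER BY ({', '.join(columns)})"


# Example with a hypothetical table and property value.
print(zorder_optimize_sql("events", {ZORDER_PROPERTY: "user_id, ts"}))
```

With liquid clustering this bookkeeping is unnecessary, since the clustering keys are part of the table definition and a plain `OPTIMIZE` picks them up.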
@chrisstephenson9890 4 months ago
Thanks for sharing this talk. Would you be so kind as to share a link to the slide deck presented by Vítor?
@luisriveros1119 5 months ago
Hi! I have a question: is it possible to implement liquid clustering for DataFrames saved directly to Delta files (df.write.format("delta").save("path")), instead of the conventional approach involving table creation?
@k.saibhargav8072 2 months ago
What is the difference between bucketBy and Liquid Clustering?
@raviv5109 3 months ago
One question: is it a wise decision to apply partitioning to a liquid clustering table?
@paulfunigga 3 months ago
Partitioning is not compatible with liquid clustering.