very insightful, nice explanation of highlighting the limitations of hive-style partitioning, z-order optimization techniques & how liquid clustering provides a neat solution though its internals look quite complex to understand, thank you for sharing :)
@alexischicoine20728 ай бұрын
It's a great combo with vector deletions as you don't have to rewrite the data. Without vector deletions it could make deletes more expensive as the data would be spread and mixed across files.
@chrisstephenson989011 ай бұрын
Thank for sharing this talk. Would you be so kind to share a link to the slide deck presented by Vitor?
@alexischicoine20728 ай бұрын
Very interesting. For zordering you can store the columns in table properties at table creation and then retrieve them when optimizing it's not that much code.
@luisriveros1119 Жыл бұрын
Hi !! I have a question is it possible to implementing liquid clustering for DataFrames directly saved to delta files (df.write.format("delta").save("path")), The conventional approach involving table creation
@raviv510911 ай бұрын
One question, is it wise decision to apply partition to liquid clustering table?
@paulfunigga10 ай бұрын
partitioning is not compatible with liquid clustering
@k.saibhargav80729 ай бұрын
what is difference between bucket By vs Liquid Clustering