129. Databricks | Pyspark| Delta Lake: Deletion Vectors

1,108 views

Raja's Data Engineering

day(s) ago

Comments: 11
@venkatasai4293 month(s) ago
Good video, Raja. Could you please make a video on liquid clustering, with an example illustrating how it differs from normal partitioning?
@rajasdataengineering7585 month(s) ago
Hi Venkat, good suggestion. Sure, I will create a video on liquid clustering soon.
@venkatasai4293 month(s) ago
Thanks, Raja.
@sravankumar1767 month(s) ago
Superb explanation, Raja 👌 👏 👍
@rajasdataengineering7585 month(s) ago
Thank you so much 🙂
@akashghadage5377 month(s) ago
Thanks!
@rajasdataengineering7585 month(s) ago
Welcome!
@RaviY-o6r month(s) ago
After running the OPTIMIZE command, the behaviour is the same as without deletion vectors enabled. In which use cases should we use deletion vectors?
@rajasdataengineering7585 month(s) ago
Yes, that's right. But we usually don't run the OPTIMIZE command frequently, and doing so is not recommended either. So the use cases are those where we need to modify the data frequently. There, deletion vectors give a big boost to performance and storage.
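[Editor's sketch] The reply above can be illustrated with a toy model in plain Python. It is not Delta Lake code: `DataFile`, `delete_with_rewrite` and `delete_with_vector` are hypothetical names invented for this illustration, and a data file is modelled as a simple list of rows. It only aims to show why frequent deletes are cheaper when rows are marked as removed instead of the whole file being rewritten.

```python
from dataclasses import dataclass, field


@dataclass
class DataFile:
    """Toy stand-in for one immutable data file (hypothetical, not a Delta API)."""
    rows: list
    # Toy "deletion vector": positions of rows that are logically removed.
    deleted: set = field(default_factory=set)

    def live_rows(self):
        """Rows a reader would see after applying the deletion markers."""
        return [r for i, r in enumerate(self.rows) if i not in self.deleted]


def delete_with_rewrite(file, predicate):
    """Without deletion vectors: write a new file holding every surviving row."""
    survivors = [r for r in file.live_rows() if not predicate(r)]
    return DataFile(survivors), len(survivors)  # second value: rows physically written


def delete_with_vector(file, predicate):
    """With deletion vectors: only record the positions of the matching rows."""
    hits = {i for i, r in enumerate(file.rows)
            if i not in file.deleted and predicate(r)}
    file.deleted |= hits
    return file, 0  # no data rows are rewritten


rows = list(range(1000))
rewritten, written_cow = delete_with_rewrite(DataFile(list(rows)), lambda r: r == 7)
marked, written_dv = delete_with_vector(DataFile(list(rows)), lambda r: r == 7)

print(written_cow, written_dv)                       # 999 0
print(rewritten.live_rows() == marked.live_rows())   # True
```

Both approaches return the same visible rows, but deleting one row out of 1,000 rewrites 999 rows in the first case and none in the second. On a real Delta table the feature is controlled by the `delta.enableDeletionVectors` table property; the cost figures here belong to the toy model only.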
@venkatasai4293 month(s) ago
@rajasdataengineering7585 Why is it not recommended to run the OPTIMIZE command frequently? Is there any overhead?
@rajasdataengineering7585 month(s) ago
The OPTIMIZE command rearranges the data files, which is a costly operation, so it's not recommended to run it too frequently. Once a significant amount of data has accumulated since the previous run, we can run OPTIMIZE.
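[Editor's sketch] The trade-off in this reply can also be shown with a toy model in plain Python. It does not call Delta Lake: `compact` and `should_optimize` are hypothetical helpers, each file is modelled as a `(rows, deleted_positions)` tuple, and the 20% threshold is an arbitrary assumption chosen for the example, not a Databricks recommendation.

```python
def compact(files):
    """Toy OPTIMIZE: rewrite every live row from all files into one new file.

    Each input file is a (rows, deleted_positions) tuple. The cost is the
    number of rows physically rewritten, which grows with table size.
    """
    merged = [row
              for rows, deleted in files
              for i, row in enumerate(rows)
              if i not in deleted]
    return (merged, set()), len(merged)


def should_optimize(files, max_deleted_fraction=0.2):
    """Hypothetical policy: compact only once enough rows are soft-deleted."""
    total = sum(len(rows) for rows, _ in files)
    deleted = sum(len(d) for _, d in files)
    return total > 0 and deleted / total >= max_deleted_fraction


# Two files of 100 rows each; only 3 rows are soft-deleted so far.
files = [(list(range(0, 100)), {1, 2, 3}), (list(range(100, 200)), set())]
print(should_optimize(files))          # False: 3 / 200 is below the threshold

# After many more deletes, 50 of 200 rows are marked as removed.
files[0] = (files[0][0], set(range(50)))
print(should_optimize(files))          # True: 50 / 200 reaches the threshold

new_file, rows_rewritten = compact(files)
print(rows_rewritten)                  # 150
```

The point of the sketch is that compaction rewrites all surviving rows (150 here) no matter how few changed, so it pays off only after changes have accumulated, which is the behaviour described in the reply.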