129. Databricks | Pyspark| Delta Lake: Deletion Vectors

Views: 3,480

Raja's Data Engineering

1 day ago

Comments: 12
@sravankumar1767 7 months ago
Superb explanation Raja 👌 👏 👍
@rajasdataengineering7585 7 months ago
Thank you so much 🙂
@piyushagarwal79 1 month ago
What is the difference between REORG and OPTIMIZE, then?
@venkatasai4293 7 months ago
Good video, Raja. Could you please make a video on liquid clustering, with an example illustrating how it differs from normal partitioning?
@rajasdataengineering7585 7 months ago
Hi Venkat, good suggestion. Sure, I will create a video on liquid clustering soon.
@venkatasai4293 6 months ago
Thanks, Raja.
@akashghadage5377 7 months ago
Thanks!
@rajasdataengineering7585 7 months ago
Welcome!
@RaviY-o6r 7 months ago
After applying the OPTIMIZE command, the behaviour is the same as without deletion vectors enabled. In what use case would we use deletion vectors?
@rajasdataengineering7585 7 months ago
Yes, that's right. But we usually don't run the OPTIMIZE command frequently, and doing so is not recommended either. So the use cases are those where we need to modify the data frequently. There, deletion vectors give a big boost to performance and storage.
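To illustrate the reply above, here is a minimal toy sketch in plain Python of why frequent deletes can be cheaper with deletion vectors. It is not Delta Lake's actual implementation: `DataFile`, `delete_copy_on_write`, and `delete_with_deletion_vectors` are hypothetical names, and the "files" are just in-memory lists.

```python
from dataclasses import dataclass, field


@dataclass
class DataFile:
    """Toy stand-in for a data file (hypothetical, not a Delta Lake class)."""
    rows: list
    # Toy "deletion vector": positions of rows that are logically deleted.
    deleted_positions: set = field(default_factory=set)

    def live_rows(self):
        # Readers skip the positions marked as deleted.
        return [r for i, r in enumerate(self.rows) if i not in self.deleted_positions]


def delete_copy_on_write(files, predicate):
    """Without deletion vectors: every file containing a match is rewritten."""
    new_files, rewritten = [], 0
    for f in files:
        if any(predicate(r) for r in f.rows):
            new_files.append(DataFile([r for r in f.rows if not predicate(r)]))
            rewritten += 1
        else:
            new_files.append(f)
    return new_files, rewritten


def delete_with_deletion_vectors(files, predicate):
    """With deletion vectors: matching positions are marked, no file is rewritten."""
    for f in files:
        for i, r in enumerate(f.rows):
            if predicate(r):
                f.deleted_positions.add(i)
    return files, 0


def make_files():
    # Two toy files of five rows each.
    return [DataFile(list(range(0, 5))), DataFile(list(range(5, 10)))]


cow_files, cow_rewrites = delete_copy_on_write(make_files(), lambda r: r == 7)
dv_files, dv_rewrites = delete_with_deletion_vectors(make_files(), lambda r: r == 7)

print("files rewritten without deletion vectors:", cow_rewrites)
print("files rewritten with deletion vectors:", dv_rewrites)
```

Both approaches return the same live rows to a reader. The difference in this sketch is only how many files had to be rewritten to delete a single row.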
@venkatasai4293 7 months ago
@rajasdataengineering7585 Why is it not recommended to run the OPTIMIZE command frequently? Is there any overhead?
@rajasdataengineering7585 7 months ago
The OPTIMIZE command rearranges the data files, which is a costly operation, so it's not recommended to run it too frequently. Once a significant amount of data has accumulated since the previous run, we can run OPTIMIZE.
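For reference, the commands discussed in this thread might look roughly like the sketch below. The table name `sales_delta` and the helper `maintenance_statements` are hypothetical. The helper only builds the SQL strings, so it runs without a cluster. In a Databricks notebook, where a `spark` session is predefined, each string could be passed to `spark.sql(...)`. As far as I understand the Databricks documentation, `OPTIMIZE` compacts small files, while `REORG TABLE ... APPLY (PURGE)` rewrites files to physically remove rows that were only soft-deleted, which may be a starting point for the REORG vs OPTIMIZE question above.

```python
def maintenance_statements(table_name: str) -> dict:
    """Build the SQL statements discussed in the thread (hypothetical helper)."""
    return {
        # Turn on deletion vectors for an existing Delta table.
        "enable_deletion_vectors": (
            f"ALTER TABLE {table_name} "
            "SET TBLPROPERTIES ('delta.enableDeletionVectors' = true)"
        ),
        # Compact small data files; costly, so run it occasionally.
        "compact_files": f"OPTIMIZE {table_name}",
        # Rewrite files to physically drop soft-deleted rows.
        "purge_soft_deletes": f"REORG TABLE {table_name} APPLY (PURGE)",
    }


# "sales_delta" is an assumed table name used only for illustration.
statements = maintenance_statements("sales_delta")
for name, sql in statements.items():
    print(f"{name}: {sql}")
    # In a Databricks notebook you could run: spark.sql(sql)
```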