66. Databricks | Pyspark | Delta: Z-Order Command

22,573 views

Raja's Data Engineering

1 day ago

Azure Databricks Learning: Delta Lake - Z-Order Command
========================================================
What is the Z-order command for Delta tables, and how is it applied in Delta Lake development?
Z-ordering is one of the performance optimization techniques used in Delta Lake. It is used together with the OPTIMIZE command: OPTIMIZE compacts small files into files of an optimal size, and Z-ordering co-locates related data in the same set of files at the same time, which improves query performance.
This video gives a complete walkthrough of the Z-order command.
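As a rough illustration of how the command is written, the sketch below builds an OPTIMIZE ... ZORDER BY statement as a string. The table name `employees` and the helper `build_optimize_zorder` are assumptions made up for this example; `emp_id` is the column discussed in the comments. The call to `spark.sql` is left as a comment because it needs a Databricks notebook or another Spark session with Delta Lake enabled.

```python
def build_optimize_zorder(table, columns):
    """Return an OPTIMIZE ... ZORDER BY statement for a Delta table.

    `table` and `columns` are illustrative inputs, not names from the video.
    """
    if not columns:
        raise ValueError("ZORDER BY needs at least one column")
    # Columns go inside parentheses, separated by commas.
    return f"OPTIMIZE {table} ZORDER BY ({', '.join(columns)})"


stmt = build_optimize_zorder("employees", ["emp_id"])
print(stmt)

# In a Databricks notebook, where `spark` is already defined,
# the statement would be executed with:
# spark.sql(stmt)
```

Z-ordering tends to pay off on columns that appear often in query filters, so the column list is usually kept short.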

Comments: 88
@shreeyashransubhe2537 2 years ago
Sir, I have gone through lots of videos but never found the concepts explained in such a simple yet detailed manner. Thank you very much. I have shared your playlist with my colleagues too. They also liked it very much.
@rajasdataengineering7585 2 years ago
Thank you for your valuable comments. Really appreciated
@sundarkannan1856 2 days ago
Thank you, very clear, and it helps to understand what the Z-order method in a Delta table is
@rajasdataengineering7585 2 days ago
Glad it was helpful! Keep watching
@pratikparbhane8677 8 months ago
Great explanation. Understood OPTIMIZE, VACUUM and Z-ORDERING in one video
@rajasdataengineering7585 8 months ago
Glad it was helpful!
@rohitdanda 1 year ago
Your videos are so simple that even a kid can understand them. Thanks and salute, sir 🖖, for putting in so much effort, making these videos and helping us!
@rajasdataengineering7585 1 year ago
Thanks for your comment. Glad to know it helps data engineers
@navdeepjha2739 3 months ago
Invaluable explanation, sir! I went through many blogs but couldn't get it. You made it crystal clear 😊
@rajasdataengineering7585 3 months ago
Glad to hear that! Thanks for your comment
@AnupGupta-05 2 months ago
Hi brother, you are the best teacher. The way you explain is the best. Keep up the good work
@rajasdataengineering7585 2 months ago
Thank you!
@annaduraip3182 2 months ago
Great, thank you. You have explained it in a simple way that anyone can understand.
@rajasdataengineering7585 2 months ago
Thank you
@ravisunkara6664 2 months ago
Awesome explanation of Z-ordering. I greatly appreciate your efforts in making this video.
@rajasdataengineering7585 2 months ago
Thank you
@mukilanlakshmanan8968 11 months ago
Sir, I love your teaching method, you have explained it in detail.
@rajasdataengineering7585 11 months ago
Thanks Mukilan! Glad to hear that
@NileshPatil-b3u 8 months ago
Sir, thanks for explaining in a very simple manner.
@rajasdataengineering7585 8 months ago
Thanks and welcome
@SumitAmbatkar 5 months ago
I watched nearly all of your playlist and loved your teaching style and how you get a grip on each concept. Your explanations are fabulous. Keep going, sir, best of luck. We are always here for you. Thank you :)
@rajasdataengineering7585 5 months ago
Thank you, Sumit! Keep watching
@ajaykiranchundi9979 1 year ago
Very well explained. The way you broke down the data to explain it is amazing. I am sure it took a good amount of time to put together. Indebted to you, brother.
@rajasdataengineering7585 1 year ago
Thanks Ajay👍🏻
@terrificmenace 1 year ago
Thank you sir 🙏🏻 I went through many Udemy courses but never understood these concepts. Your explanation is very good and easy to understand. Many, many thanks sir 🙏🏻 🙏🏻
@rajasdataengineering7585 1 year ago
Thank you 👍🏻
@shankar1556 1 year ago
Hi Azar, thank you for the explanation. I have a doubt. In this example it looks as if Z-order creates new partitions sorted by emp_id. Does Z-order really create new partitions?
@rajasdataengineering7585 1 year ago
Hi Shankar, this is Raja. When we perform Z-order, related data is co-located within the same set of files. It does not shuffle the data, nor does it create new partitions
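The effect of co-location can be sketched with a small pure-Python simulation. It is a simplification, not Delta Lake's actual implementation: each "file" is a list of emp_id values with its min and max recorded, and Z-ordering on a single column is approximated by a plain sort. The helper names, the 100 rows and the 25 rows per file are all assumptions made for this example. The files are rewritten with a different row arrangement, while the table's partition layout stays the same.

```python
import random


def write_files(rows, rows_per_file):
    """Split rows into 'files' and record min/max emp_id per file."""
    files = []
    for start in range(0, len(rows), rows_per_file):
        chunk = rows[start:start + rows_per_file]
        files.append({"min": min(chunk), "max": max(chunk), "rows": chunk})
    return files


def files_to_scan(files, emp_id):
    """Count files whose [min, max] range could contain emp_id."""
    return sum(1 for f in files if f["min"] <= emp_id <= f["max"])


random.seed(0)
emp_ids = list(range(1, 101))
random.shuffle(emp_ids)

# Same data, same number of files; only the row arrangement differs.
unordered = write_files(emp_ids, 25)
zordered = write_files(sorted(emp_ids), 25)

print("files scanned without ordering:", files_to_scan(unordered, 42))
print("files scanned after ordering:  ", files_to_scan(zordered, 42))
```

After ordering, each file covers a narrow, non-overlapping range of emp_id, so a filter such as emp_id = 42 can skip every file whose range does not include 42.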
@YogeshBiguvu2208 11 months ago
Excellent explanation with examples. Thank you so much, sir.
@rajasdataengineering7585 11 months ago
You are most welcome! Glad it helps
@satheeshkumarak6708 4 months ago
Hello Sir, how do you determine the number of columns to use in Z-order, and whether or not to use a particular column for Z-order, given that you have calculated the cardinality percentage of all the columns?
@RanjeetkumarYadav 6 months ago
Amazing and very intuitive example. Thank You!!
@rajasdataengineering7585 6 months ago
You're very welcome! Keep watching
@shwetac2929 1 year ago
Your teaching method is very good. This video cleared all my doubts
@rajasdataengineering7585 1 year ago
Glad to hear that
@venkatasai4293 2 years ago
Thanks for the great explanation, Raja. So are the statistics collected on all the columns? What if we want to query on other columns? Will it work?
@rajasdataengineering7585 2 years ago
Yes Venkata, it will work for the first 32 columns. If your table contains more than 32 columns and you want to collect statistics for those additional columns, that can be configured separately
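One way to configure this is the Delta table property `delta.dataSkippingNumIndexedCols`, which controls how many leading columns get statistics. The sketch below only builds the ALTER TABLE statement as a string. The table name `employees`, the value 40 and the helper `build_stats_columns_stmt` are assumptions for illustration, and the statement would need a Spark session with Delta Lake to run.

```python
def build_stats_columns_stmt(table, num_cols):
    """Return an ALTER TABLE statement that sets how many leading
    columns Delta Lake collects data-skipping statistics for.

    A value of -1 asks for statistics on all columns.
    """
    if num_cols < -1:
        raise ValueError("num_cols must be -1 or a non-negative integer")
    return (
        f"ALTER TABLE {table} SET TBLPROPERTIES "
        f"('delta.dataSkippingNumIndexedCols' = '{num_cols}')"
    )


stmt = build_stats_columns_stmt("employees", 40)
print(stmt)

# In a Databricks notebook, where `spark` is already defined:
# spark.sql(stmt)
```

Collecting statistics on more columns adds write overhead, so moving frequently filtered columns toward the front of the schema may be a cheaper alternative.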
@venkatasai4293 2 years ago
@rajasdataengineering7585 OK. So Z-order is similar to bucketing, right? Co-locating the data into the same set of files? If two tables contain the same key and we Z-order both on that key, will a join fetch only the required files into the executor?
@purnimasharma9734 2 years ago
Hi Raja, how is the partition column determined, e.g. how does it know that it has to use emp_id here? Is it based on the predicate column?
@purnimasharma9734 2 years ago
Never mind, when I watched your video completely, I found out.
@rajasdataengineering7585 2 years ago
Great
@omprakashreddy4230 2 years ago
Your videos are definitely creating great impact. Thank you for that. Can you also please explain the df.explain() command in great detail with examples?
@rajasdataengineering7585 2 years ago
Happy to hear that it's creating an impact on data engineers. Thank you. Sure, will post a video on explain plan
@rajasdataengineering7585 2 years ago
Hi Omprakash, created a video on explain plan as per your request. Hope it helps you - kzbin.info/www/bejne/bH_Vh4SKl9Chmrc
@arabajshaikh8411 1 month ago
Excellent, Thank you so much.
@rajasdataengineering7585 1 month ago
Glad it was helpful! You are welcome
@sathyahisto 1 year ago
Good explanation, liked it when you demonstrated with Excel. Just one suggestion: the syntax for Z-order seems to have changed to "ZORDER BY ()"
@rajasdataengineering7585 1 year ago
Yes, you are right. Thanks
@sraoarjun 6 months ago
Indeed an awesome video !! Great explanation !!
@rajasdataengineering7585 6 months ago
Glad you liked it! Thank you
@tanushreenagar3116 2 years ago
Very nice sir 👌 cleared my concept now
@rajasdataengineering7585 2 years ago
Thank you
@saurav0777 2 years ago
Thanks for uploading. Very nice explanation
@rajasdataengineering7585 2 years ago
Thanks
@TheDataArchitect 5 months ago
What about using multiple columns in Z-order?
@AFSARAHMED4 1 year ago
Excellent explanation, sir
@rajasdataengineering7585 1 year ago
Thanks
@rahul_chilukamari 3 months ago
One such good video with a neat explanation.
@rajasdataengineering7585 3 months ago
Thank you
@vivek05117gece 1 year ago
Very well explained. Kudos to you.
@rajasdataengineering7585 1 year ago
Glad it was helpful!
@mohitupadhayay1439 5 months ago
Raja, please try to create a full project where all these optimizations can be shown at full scale.
@rajasdataengineering7585 5 months ago
Sure Mohit, will do!
@FreakONcW1 11 months ago
Extremely helpful video.
@rajasdataengineering7585 11 months ago
Thanks Kinjal! Glad to know it was helpful!
@dineshwaditake5248 1 year ago
Nicely explained !!
@rajasdataengineering7585 1 year ago
Glad it was helpful!
@gil.0007 11 months ago
Very nicely explained 🎉
@rajasdataengineering7585 11 months ago
Thanks, glad it was helpful!
@ravulapallivenkatagurnadha9605 2 years ago
Please continue these videos
@tanushreenagar3116 1 year ago
PERFECT CONTENT SIR
@rajasdataengineering7585 1 year ago
Thanks Tanu!
@3a8saisamireddi61 6 months ago
Detailed explanation 👏
@rajasdataengineering7585 6 months ago
Thank you 🙂
@manjit_singhh 2 years ago
Very nice explanation 🙂
@rajasdataengineering7585 2 years ago
Thanks
@viniciusguimaraessantana5455 1 year ago
Thank you very much.
@rajasdataengineering7585 1 year ago
You are welcome!
@anuragpaudyal3297 2 months ago
Awesome 😃
@rajasdataengineering7585 2 months ago
Thank you! Cheers!
@TotuBabyBird 1 year ago
Great!
@rajasdataengineering7585 1 year ago
Thanks
@aswaniyettapu9992 2 years ago
Can you do a video on lead and lag in PySpark?
@rajasdataengineering7585 2 years ago
Sure, will post a video on lead and lag very soon
@rajasdataengineering7585 2 years ago
Hi Aswani, have posted a video on the lead and lag functions today as per your request
@aswaniyettapu9992 2 years ago
Thank you so much
@ravulapallivenkatagurnadha9605 2 years ago
Please do a video on how to convert pandas data pipelines to Spark data pipelines
@rajasdataengineering7585 2 years ago
Sure will do