66. Databricks | Pyspark | Delta: Z-Order Command

  Рет қаралды 19,211

Raja's Data Engineering

Raja's Data Engineering

2 жыл бұрын

Azure Databricks Learning: Delta Lake - Z-Order Command
========================================================
What is Z-order Command in delta table and how to apply in delta lake development?
Z-order one of the performance optimization techinique used in delta lake. It is used along with optimize command and used to compact small files into optimal size and at the same time relevant data is co-located to improve the performance.
This video gives complete understanding of Z-order command
#DeltaZorder, #DatabricksZorder, #PerformanceOptimization, #Zorder,#Z-order, #Z-Ordering, #DeltaOptimize, #DeltaOptimizeZorder #DeltaCompactFiles, #DeltaSmallFileIssue, #DeltalakePerformance, #DeltaPerformanceImprovement ,#DeltalakeIntro, #IntroductionToDeltaLake, #Deltalake, #DeltaTable, #DatabricksDelta, #DeltaTableCreate, #DatawarehouseVsDataLakevsDeltaLake, #PysparkDeltaLake, #DeltalakevsDatalake, #SQLDeltaTable, #DataframeDeltaTable,#DeltaFormat ,#DatabricksRealtime, #SparkRealTime, #DatabricksInterviewQuestion, #DatabricksInterview, #SparkInterviewQuestion, #SparkInterview, #PysparkInterviewQuestion, #PysparkInterview, #BigdataInterviewQuestion, #BigdataInterviewQuestion, #BigDataInterview, #PysparkPerformanceTuning, #PysparkPerformanceOptimization, #PysparkPerformance, #PysparkOptimization, #PysparkTuning, #DatabricksTutorial, #AzureDatabricks, #Databricks, #Pyspark, #Spark, #AzureDatabricks, #AzureADF, #Databricks, #LearnPyspark, #LearnDataBRicks, #DataBricksTutorial, #azuredatabricks, #notebook, #Databricksforbeginners

Пікірлер: 78
@ravisunkara6664
@ravisunkara6664 7 сағат бұрын
Awesome explanation on Z-ordering. Greatly appreciated your efforts making this video.
@rajasdataengineering7585
@rajasdataengineering7585 6 сағат бұрын
Thank you
@pratikparbhane8677
@pratikparbhane8677 6 ай бұрын
Great Explain , Understood OPTIMISE , VACCUM() AND Z-ORDERING in One Video
@rajasdataengineering7585
@rajasdataengineering7585 6 ай бұрын
Glad it was helpful!
@navdeepjha2739
@navdeepjha2739 9 күн бұрын
Invaluable explanation sir! I went through many blogs but couldn't get it. You made it crystal clear😊
@rajasdataengineering7585
@rajasdataengineering7585 8 күн бұрын
Glad to hear that! Thanks for your comment
@shreeyashransubhe2537
@shreeyashransubhe2537 Жыл бұрын
Sir, I have gone through lots of videos but never understood the concepts so simple yet very detailed manner. Thank you very much. I have shared your playlist with my colleagues too. They also liked it very much.
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thank you for your valuable comments. Really appreciated
@rohitdanda
@rohitdanda Жыл бұрын
Your videos are so simple that a kid can also understand. Thanks and salute sir🖖 for putting so much effort and making videos and helping us!
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks for your comment. Glad to know it helps data engineers
@user-wy5tl2ev2r
@user-wy5tl2ev2r 23 күн бұрын
one such good video with neat explanation.
@rajasdataengineering7585
@rajasdataengineering7585 23 күн бұрын
Thank you
@YogeshBiguvu2208
@YogeshBiguvu2208 9 ай бұрын
Excellent explanation with Examples.....Thank you so mcuh sir..
@rajasdataengineering7585
@rajasdataengineering7585 9 ай бұрын
You are most welcome! Glad it helps
@RanjeetkumarYadav
@RanjeetkumarYadav 3 ай бұрын
Amazing and very intuitive example. Thank You!!
@rajasdataengineering7585
@rajasdataengineering7585 3 ай бұрын
You're very welcome! Keep watching
@mukilanlakshmanan8968
@mukilanlakshmanan8968 9 ай бұрын
Sir, I love your teaching method, you have explained it in detail.
@rajasdataengineering7585
@rajasdataengineering7585 9 ай бұрын
Thanks Mukilan! Glad to hear that
@user-hj2nv8gt4o
@user-hj2nv8gt4o 5 ай бұрын
Sir, Thanks for explaining in a very simple manner.
@rajasdataengineering7585
@rajasdataengineering7585 5 ай бұрын
Thanks and welcome
@ajaykiranchundi9979
@ajaykiranchundi9979 Жыл бұрын
A very well explained . The way you broke down the data to explain the same is amazing. I am sure it would have taken good time to put it together. Indebted to you brother.
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks Ajay👍🏻
@SumitAmbatkar
@SumitAmbatkar 3 ай бұрын
i watched your nearly all playlist i loved your teching style, how to ogrip on concept, your explaination are fabulous keeping doing sir, best of luck. we are always here for you Thank you..:)
@rajasdataengineering7585
@rajasdataengineering7585 3 ай бұрын
Thank you,Sumit! Keep watching
@saurav0777
@saurav0777 Жыл бұрын
Thanks for uploading . Very nice explanation
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks
@sraoarjun
@sraoarjun 3 ай бұрын
Indeed an awesome video !! Great explanation !!
@rajasdataengineering7585
@rajasdataengineering7585 3 ай бұрын
Glad you liked it! Thank you
@terrificmenace
@terrificmenace Жыл бұрын
Thank you sir 🙏🏻 I went through many udemy courses but never understood these concepts. Ur explanation is very good and easy to understand many many thanks sir 🙏🏻 🙏🏻
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thank you 👍🏻
@vivek05117gece
@vivek05117gece Жыл бұрын
very well explained. Kudos to you.
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Glad it was helpful!
@shwetac2929
@shwetac2929 Жыл бұрын
you teaching methos is very good ....this video clear my all doubt
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Glad to hear that
@FreakONcW1
@FreakONcW1 8 ай бұрын
Extremely helpful video.
@rajasdataengineering7585
@rajasdataengineering7585 8 ай бұрын
Thanks Kinjal! Glad to know it was helpful!
@3a8saisamireddi61
@3a8saisamireddi61 3 ай бұрын
detailed explanation👏
@rajasdataengineering7585
@rajasdataengineering7585 3 ай бұрын
Thank you 🙂
@dineshwaditake5248
@dineshwaditake5248 11 ай бұрын
Nicely explained !!
@rajasdataengineering7585
@rajasdataengineering7585 11 ай бұрын
Glad it was helpful!
@omprakashreddy4230
@omprakashreddy4230 2 жыл бұрын
Your videos are definitely creating great impact. Thank you for that. Can you also please explain df.explain() command in great detail with examples.
@rajasdataengineering7585
@rajasdataengineering7585 2 жыл бұрын
Happy to hear that it's creating impact on data engineers. Thank you Sure, will post a video on explain plan
@rajasdataengineering7585
@rajasdataengineering7585 2 жыл бұрын
Hi Omprakash, created a video on explain plan as per your request. Hope it helps you - kzbin.info/www/bejne/bH_Vh4SKl9Chmrc
@ravulapallivenkatagurnadha9605
@ravulapallivenkatagurnadha9605 Жыл бұрын
Please continue this videos
@tanushreenagar3116
@tanushreenagar3116 Жыл бұрын
Very nice sir 👌 cleared my concept now
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thank you
@mohitupadhayay1439
@mohitupadhayay1439 2 ай бұрын
Raja please try to create a full project where all these optimizations can be shown at full scale.
@rajasdataengineering7585
@rajasdataengineering7585 2 ай бұрын
Sure Mohit, will do!
@viniciusguimaraessantana5455
@viniciusguimaraessantana5455 Жыл бұрын
thank you very much.
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
You are welcome!
@gil.0007
@gil.0007 8 ай бұрын
Very nicely explained 🎉
@rajasdataengineering7585
@rajasdataengineering7585 8 ай бұрын
Thanks, glad it was helpful!
@AFSARAHMED4
@AFSARAHMED4 Жыл бұрын
Excellent Explaination Sir
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks
@manjit_singhh
@manjit_singhh Жыл бұрын
Very nice explanation 🙂
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks
@tanushreenagar3116
@tanushreenagar3116 Жыл бұрын
PERFECT CONTENT SIR
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks Tanu!
@sathyahisto
@sathyahisto Жыл бұрын
good Explaination, liked it when you demonstrated with excel. Just one suggestion syntax for zorder seems to be changed to "Zorder by ()"
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Yes, you are right. Thanks
@TotuBabyBird
@TotuBabyBird Жыл бұрын
Great!
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Thanks
@satheeshkumarak6708
@satheeshkumarak6708 Ай бұрын
Hello Sir, How do you determine the number of columns to be used in Z order and whether or not to use a particular column for Z order provided that you have calculated the cardinality percentage of all the columns?
@venkatasai4293
@venkatasai4293 2 жыл бұрын
Thanks for the great explanation Raja. So are the statistics collected on all the columns ? What if we want to query on other columns ? Will it work ?
@rajasdataengineering7585
@rajasdataengineering7585 2 жыл бұрын
Yes Venkata, it will work first 32 columns. If your table contains more than 32 columns and you want to collect statistics for those columns, we can configure that separately
@venkatasai4293
@venkatasai4293 2 жыл бұрын
@@rajasdataengineering7585 ok . So zorder is similar to bucketing right ? Colocating the data into same set of files ? If two tables contains same key and if we zorder them on the key While joining the data it will fetch only required files into the executor ?
@shankar1556
@shankar1556 Жыл бұрын
Hi Azar, Thank you for explanation. I have a dought. in this example it shows that z-order create new partitions with sorting emp_id. Does z-order really create new partitions?
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Hi Shankar, this is Raja. When we perform z-order, data is being co-located within same set of files. It is not shuffling the data, nor creating new partitions
@TheDataArchitect
@TheDataArchitect 3 ай бұрын
What about using multiple columns in z-order?
@purnimasharma9734
@purnimasharma9734 Жыл бұрын
Hi Raja, how is the partition column determined e.g. how does it know that you have to use emp_id here? Is it based on the predicate column?
@purnimasharma9734
@purnimasharma9734 Жыл бұрын
Never mind, when I watched your video completely, I found out.
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Great
@aswaniyettapu9992
@aswaniyettapu9992 2 жыл бұрын
Can u do one video on lead and lag in pyspark..?
@rajasdataengineering7585
@rajasdataengineering7585 2 жыл бұрын
Sure, will post a video on lead and lag very soon
@rajasdataengineering7585
@rajasdataengineering7585 2 жыл бұрын
Hi Aswani, have posted a video on lead and lag function today as per your request
@aswaniyettapu9992
@aswaniyettapu9992 2 жыл бұрын
Tq so much
@ravulapallivenkatagurnadha9605
@ravulapallivenkatagurnadha9605 Жыл бұрын
Please do video on how to convert pandas data pipilines to spark data pipiy
@rajasdataengineering7585
@rajasdataengineering7585 Жыл бұрын
Sure will do
67. Databricks | Pypark | Delta: Schema Evolution - MergeSchema
7:53
Raja's Data Engineering
Рет қаралды 12 М.
Z-Order Visualized
10:24
Lester Martin
Рет қаралды 699
Задержи дыхание дольше всех!
00:42
Аришнев
Рет қаралды 3,3 МЛН
Amazing weight loss transformation !! 😱😱
00:24
Tibo InShape
Рет қаралды 58 МЛН
65. Databricks | Pyspark | Delta Lake: Vacuum Command
15:32
Raja's Data Engineering
Рет қаралды 14 М.
64. Databricks | Pyspark | Delta Lake: Optimize Command - File Compaction
13:16
Raja's Data Engineering
Рет қаралды 16 М.
Performance Tuning in Spark
14:13
CloudFitness
Рет қаралды 7 М.
Advancing Spark - Give your Delta Lake a boost with Z-Ordering
20:31
Advancing Analytics
Рет қаралды 27 М.
Diving into Delta Lake 2.0
29:37
Databricks
Рет қаралды 4,3 М.
Delta Lake Deep Dive: Liquid Clustering
40:54
Delta Lake
Рет қаралды 4,7 М.
75. Databricks | Pyspark | Performance Optimization - Bucketing
22:03
Raja's Data Engineering
Рет қаралды 16 М.
Stanford's FREE data science book and course are the best yet
4:52
Python Programmer
Рет қаралды 682 М.
Liquid Clustering 101: What every Databricks Developer should know
19:13
Rajaniesh Kaushikk
Рет қаралды 1,7 М.
Красиво, но телефон жаль
0:32
Бесполезные Новости
Рет қаралды 1,5 МЛН
Здесь упор в процессор
18:02
Рома, Просто Рома
Рет қаралды 415 М.
Tag her 🤭💞 #miniphone #smartphone #iphone #samsung #fyp
0:11
Pockify™
Рет қаралды 71 МЛН
iPhone 15 Pro Max vs IPhone Xs Max  troll face speed test
0:33