Advancing Spark - Understanding Low Shuffle Merge

  Рет қаралды 5,249

Advancing Analytics

Advancing Analytics

Күн бұрын

Пікірлер: 11
@WastedFury
@WastedFury 2 жыл бұрын
Really useful. New to Databricks and you're last couple of videos have really helped me understand how it will support some of the key concepts needed and some of the gotchas that are actually being resolved in the new releases. Thank you.
@YoussefMrini
@YoussefMrini 2 жыл бұрын
I've been using it since the day 1. It has improved my merges :D
@fb-gu2er
@fb-gu2er 2 жыл бұрын
Kudos for the whiteboard. You should do it more often
@drummerboi4eva
@drummerboi4eva 2 жыл бұрын
Useful feature , very well explained
@AdvancingAnalytics
@AdvancingAnalytics 2 жыл бұрын
Apologies - looks like I wiped out comments when clearing some initial spam. Apologies if anyone's actual comments got dropped! Simon
@Monsalvo888
@Monsalvo888 2 жыл бұрын
really clear! thank!
@dylanmccullough2679
@dylanmccullough2679 2 жыл бұрын
So cool!
@ArcaLuiNeo
@ArcaLuiNeo 2 жыл бұрын
Thanks for the explanation. What device are you using for the whiteboarding part?
@AdvancingAnalytics
@AdvancingAnalytics 2 жыл бұрын
Microsoft Whiteboard on a separate tablet, coming through a HDMI capture card! Probably a fairly overengineered approach!!
@Vikasptl07
@Vikasptl07 Жыл бұрын
Thanks for explanation. I am working on one such scenario where table (no efficient column for partition in table , not able to use predicate pushdown in merge )has 2 bn rows and my batch job run every 1 hour for loading(1mn rows every hour). Now merge is taking more time upwards of 50mins. I will try to implement low shuffle merge and also optimize z order by (once daily). Can you suggest any other optimization techniques?
@tarun080311
@tarun080311 Жыл бұрын
very time consuming explanation method.
Advancing Spark - Understanding the Spark UI
30:19
Advancing Analytics
Рет қаралды 55 М.
Advancing Spark - Delta Deletion Vectors
17:02
Advancing Analytics
Рет қаралды 3,6 М.
Чистка воды совком от денег
00:32
FD Vasya
Рет қаралды 4,9 МЛН
Farmer narrowly escapes tiger attack
00:20
CTV News
Рет қаралды 13 МЛН
Noodles Eating Challenge, So Magical! So Much Fun#Funnyfamily #Partygames #Funny
00:33
Databricks Apps First Look - Advancing Spark
22:44
Advancing Analytics
Рет қаралды 3 М.
Advancing Spark - Azure Databricks News April 2022
29:18
Advancing Analytics
Рет қаралды 1,6 М.
Shuffling: What it is and why it's important
14:06
Big Data Analysis with Scala and Spark
Рет қаралды 26 М.
Advancing Spark - Bloom Filter Indexes in Databricks Delta
24:41
Advancing Analytics
Рет қаралды 9 М.
Advancing Spark - Give your Delta Lake a boost with Z-Ordering
20:31
Advancing Analytics
Рет қаралды 29 М.
Advancing Spark - Identity Columns in Delta
20:00
Advancing Analytics
Рет қаралды 10 М.
Shuffle Partition Spark Optimization: 10x Faster!
19:03
Afaque Ahmad
Рет қаралды 11 М.
Advancing Spark - Exploring DLT Event Metrics
27:33
Advancing Analytics
Рет қаралды 4,7 М.
Чистка воды совком от денег
00:32
FD Vasya
Рет қаралды 4,9 МЛН