Dynamic Partition Pruning in Apache Spark

  Рет қаралды 14,079

Learning Journal

Learning Journal

Күн бұрын

Пікірлер: 17
@EugenePetrash
@EugenePetrash 2 жыл бұрын
Genious explanation. Not only on that certain topic, but all of the author's videos and articles are also totally clear. Thanks a lot. Subscribed!
@skywalker66ful
@skywalker66ful 3 жыл бұрын
Best Explanation I have found till date about Dynamic Partition Pruning and infact about Predicate Pushdown and Partition Pruning as well
@anikethdeshpande8336
@anikethdeshpande8336 Жыл бұрын
super explanation! simple to understand, thanks for showing the execution plans!
@andre__luiz__
@andre__luiz__ Жыл бұрын
Amazing explanation!!!
@akshaygupta013
@akshaygupta013 3 жыл бұрын
Nice explanation. I do have a doubts what's the need for broadcast if the filter condition is already being applied to dimensions table and if it is required than tables which are greater than broadcast threshold in those case will this technique not work or just join type will be different.
@feelings__flicks
@feelings__flicks 2 жыл бұрын
Same doubt brother. If u get the answer can u please share it.
@mohammedsafiahmed1639
@mohammedsafiahmed1639 2 жыл бұрын
from what I understand, filter condition, or predicate pushdown as Databricks calls it, works only when querying single table. When you join two tables, you need to 'broadcast' the filter to the other table being joined.
@artemvolkov5682
@artemvolkov5682 Жыл бұрын
What if I just add year and month to the 'ON' statement ? I believe partition pruning will work, but AQE should be enabled.
@ylchen5975
@ylchen5975 3 жыл бұрын
Very useful and expiation is pretty clear, thank you!
@babyscookbook2751
@babyscookbook2751 2 жыл бұрын
Hi sir., can we use apache kafka for sending emails? Please sir, I need it help
@hierfnhg
@hierfnhg 3 жыл бұрын
Very informative thanks for deep diving.
@lancequin5209
@lancequin5209 2 жыл бұрын
Him: Make Sense? Me: Nope Him: Great
@artemvolkov5682
@artemvolkov5682 Жыл бұрын
hahah, feel exactly the same
@bhomiktakhar8226
@bhomiktakhar8226 3 жыл бұрын
Nicely explained !....but how does the filter is transferred to order table..since where condition is on year and month , query on fact table would still have to figure out what is full_date (of dimension table)column values on for 2021 Feb...could be multiple full_dates for month,year .?
@kolketzz
@kolketzz 2 жыл бұрын
probably it would do date like '2021-02%'
@octo3010
@octo3010 3 жыл бұрын
Neat feature
@trainsam22
@trainsam22 2 жыл бұрын
Hi Prashant, you know your concepts. but stop saying : makes sense.. or simple etc.. that is too uncle like.. b cool
Delta Lake for Apache Spark - Why do we need Delta Lake for Spark?
18:57
Learning Journal
Рет қаралды 46 М.
REAL or FAKE? #beatbox #tiktok
01:03
BeatboxJCOP
Рет қаралды 18 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
Try this prank with your friends 😂 @karina-kola
00:18
Andrey Grechka
Рет қаралды 9 МЛН
Partitioning
14:32
Big Data Analysis with Scala and Spark
Рет қаралды 21 М.
Spark Interview Question | Partition Pruning | Predicate Pushdown
8:17
Dynamic Partition Pruning | Spark Performance Tuning
6:32
Data Savvy
Рет қаралды 42 М.
Dynamic Partition Pruning: How It Works (And When It Doesn’t)
20:33
dynamic partition pruning in spark | Lec-22
18:42
MANISH KUMAR
Рет қаралды 11 М.
10 recently asked Pyspark Interview Questions | Big Data Interview
28:36
REAL or FAKE? #beatbox #tiktok
01:03
BeatboxJCOP
Рет қаралды 18 МЛН