Dynamic Partition Pruning in Apache Spark

  Рет қаралды 13,059

Learning Journal

Learning Journal

2 жыл бұрын

Spark Programming and Azure Databricks ILT Master Class by Prashant Kumar Pandey - Fill out the google form for Course inquiry.
forms.gle/Nxk8dQUPq4o4XsA47
-------------------------------------------------------------------
Data Engineering using is one of the highest-paid jobs of today.
It is going to remain in the top IT skills forever.
Are you in database development, data warehousing, ETL tools, data analysis, SQL, PL/QL development?
I have a well-crafted success path for you.
I will help you get prepared for the data engineer and solution architect role depending on your profile and experience.
We created a course that takes you deep into core data engineering technology and masters it.
If you are a working professional:
1. Aspiring to become a data engineer.
2. Change your career to data engineering.
3. Grow your data engineering career.
4. Get Databricks Spark Certification.
5. Crack the Spark Data Engineering interviews.
ScholarNest is offering a one-stop integrated Learning Path.
The course is open for registration.
The course delivers an example-driven approach and project-based learning.
You will be practicing the skills using MCQ, Coding Exercises, and Capstone Projects.
The course comes with the following integrated services.
1. Technical support and Doubt Clarification
2. Live Project Discussion
3. Resume Building
4. Interview Preparation
5. Mock Interviews
Course Duration: 6 Months
Course Prerequisite: Programming and SQL Knowledge
Target Audience: Working Professionals
Batch start: Registration Started
Fill out the below form for more details and course inquiries.
forms.gle/Nxk8dQUPq4o4XsA47
--------------------------------------------------------------------------
Learn more at www.scholarnest.com/
Best place to learn Data engineering, Bigdata, Apache Spark, Databricks, Apache Kafka, Confluent Cloud, AWS Cloud Computing, Azure Cloud, Google Cloud - Self-paced, Instructor-led, Certification courses, and practice tests.
========================================================
SPARK COURSES
-----------------------------
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/d...
KAFKA COURSES
--------------------------------
www.scholarnest.com/courses/a...
www.scholarnest.com/courses/k...
www.scholarnest.com/courses/s...
AWS CLOUD
------------------------
www.scholarnest.com/courses/a...
www.scholarnest.com/courses/a...
PYTHON
------------------
www.scholarnest.com/courses/p...
========================================
We are also available on the Udemy Platform
Check out the below link for our Courses on Udemy
www.learningjournal.guru/cour...
=======================================
You can also find us on Oreilly Learning
www.oreilly.com/library/view/...
www.oreilly.com/videos/apache...
www.oreilly.com/videos/kafka-...
www.oreilly.com/videos/spark-...
www.oreilly.com/videos/spark-...
www.oreilly.com/videos/apache...
www.oreilly.com/videos/real-t...
www.oreilly.com/videos/real-t...
=========================================
Follow us on Social Media
/ scholarnest
/ scholarnesttechnologies
/ scholarnest
/ scholarnest
github.com/ScholarNest
github.com/learningJournal/
========================================

Пікірлер: 17
@anikethdeshpande8336
@anikethdeshpande8336 9 ай бұрын
super explanation! simple to understand, thanks for showing the execution plans!
@EugenePetrash
@EugenePetrash 2 жыл бұрын
Genious explanation. Not only on that certain topic, but all of the author's videos and articles are also totally clear. Thanks a lot. Subscribed!
@ylchen5975
@ylchen5975 2 жыл бұрын
Very useful and expiation is pretty clear, thank you!
@hierfnhg
@hierfnhg 2 жыл бұрын
Very informative thanks for deep diving.
@skywalker66ful
@skywalker66ful 2 жыл бұрын
Best Explanation I have found till date about Dynamic Partition Pruning and infact about Predicate Pushdown and Partition Pruning as well
@andre__luiz__
@andre__luiz__ 9 ай бұрын
Amazing explanation!!!
@octo3010
@octo3010 2 жыл бұрын
Neat feature
@akshaygupta013
@akshaygupta013 2 жыл бұрын
Nice explanation. I do have a doubts what's the need for broadcast if the filter condition is already being applied to dimensions table and if it is required than tables which are greater than broadcast threshold in those case will this technique not work or just join type will be different.
@feelings__flicks
@feelings__flicks 2 жыл бұрын
Same doubt brother. If u get the answer can u please share it.
@mohammedsafiahmed1639
@mohammedsafiahmed1639 2 жыл бұрын
from what I understand, filter condition, or predicate pushdown as Databricks calls it, works only when querying single table. When you join two tables, you need to 'broadcast' the filter to the other table being joined.
@artemvolkov5682
@artemvolkov5682 11 ай бұрын
What if I just add year and month to the 'ON' statement ? I believe partition pruning will work, but AQE should be enabled.
@babyscookbook2751
@babyscookbook2751 2 жыл бұрын
Hi sir., can we use apache kafka for sending emails? Please sir, I need it help
@bhomiktakhar8226
@bhomiktakhar8226 2 жыл бұрын
Nicely explained !....but how does the filter is transferred to order table..since where condition is on year and month , query on fact table would still have to figure out what is full_date (of dimension table)column values on for 2021 Feb...could be multiple full_dates for month,year .?
@kolketzz
@kolketzz Жыл бұрын
probably it would do date like '2021-02%'
@lancequin5209
@lancequin5209 Жыл бұрын
Him: Make Sense? Me: Nope Him: Great
@artemvolkov5682
@artemvolkov5682 11 ай бұрын
hahah, feel exactly the same
@trainsam22
@trainsam22 Жыл бұрын
Hi Prashant, you know your concepts. but stop saying : makes sense.. or simple etc.. that is too uncle like.. b cool
Delta Lake for Apache Spark - Why do we need Delta Lake for Spark?
18:57
Learning Journal
Рет қаралды 45 М.
Partitioning
14:32
Big Data Analysis with Scala and Spark
Рет қаралды 21 М.
Sigma Kid Hair #funny #sigma #comedy
00:33
CRAZY GREAPA
Рет қаралды 35 МЛН
Cool Items! New Gadgets, Smart Appliances 🌟 By 123 GO! House
00:18
123 GO! HOUSE
Рет қаралды 17 МЛН
WHAT’S THAT?
00:27
Natan por Aí
Рет қаралды 14 МЛН
Advancing Spark - Data Lakehouse Star Schemas with Dynamic Partition Pruning!
18:00
Apache Spark Internal architecture jobs stages and tasks
9:40
Learning Journal
Рет қаралды 43 М.
Dynamic Partition Pruning | Spark Performance Tuning
6:32
Data Savvy
Рет қаралды 40 М.
Spark Logical & Physical Plan
8:24
Data Engineering
Рет қаралды 2,7 М.
Зарядка-брелок для Apple Watch
0:39
Rozetked
Рет қаралды 315 М.
Лазер против камеры смартфона
1:01
Newtonlabs
Рет қаралды 714 М.