Top Big Data Interview Questions asked in 2024 | Cloud Data Engineer | Azure | Spark | SQL

13,461 views

Sumit Mittal

2 months ago

To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
I have trained 20,000+ professionals in the field of Data Engineering in the last 5 years.
Want to master SQL? Learn SQL the right way through the most sought-after course - SQL Champions Program!
"An 8-week program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solve an unseen problem."
Here is how you can register for the program -
Registration Link (Course Access from India) : rzp.io/l/SQLINR
Registration Link (Course Access from outside India) : rzp.io/l/SQLUSD
BIG DATA INTERVIEW SERIES
This mock interview series is launched as a community initiative under the Data Engineers Club, aimed at aiding the community's growth and development.
Our highly experienced guest interviewer, Ganesh Ramdas Kudale (/ ganesh-kudale-50bb14ab), shares invaluable insights and practical guidance drawn from his extensive expertise in the Big Data domain.
Our expert guest interviewee, Prithvi Salve (/ prithvi-salve-45545a1ba), has an interesting approach to answering the interview questions on Apache Spark, SQL and Azure Cloud Services.
Links to the free SQL and Python series developed by me are given below -
SQL Playlist - • SQL tutorial for every...
Python Playlist - • Complete Python By Sum...
Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
Social Media Links :
LinkedIn - / bigdatabysumit
Twitter - / bigdatasumit
Instagram - / bigdatabysumit
Student Testimonials - trendytech.in/#testimonials
TIMESTAMPS : Questions Discussed
01:00 Introduction
01:47 What is Hadoop and how does it work?
03:09 Why move from MapReduce to Spark?
05:07 Does Spark provide storage?
05:47 Give a high-level explanation of Spark.
06:50 Why switch from RDDs to DataFrames in Spark?
07:53 Which languages does Spark support?
08:27 What are RDDs and their importance?
09:47 What happens during actions/transformations in Spark?
11:15 Explain Spark architecture.
13:06 What are deployment modes and their use cases?
14:30 Describe the plans created when executing a Spark job.
16:00 What is a predicate push down?
18:10 Explain jobs, stages, and tasks in Spark.
19:10 What are the types of transformations in Spark?
20:38 Difference between repartition and coalesce?
23:30 Should you infer schema or specify it when creating a DataFrame?
24:19 What are the ways to enforce schema? Provide an example.
24:54 SQL coding questions
41:09 Which Azure cloud services have you used?
41:35 Explain Databricks architecture at a high level.
42:40 How do you run SQL queries in Databricks?
44:10 How can one notebook run another in Databricks?
45:35 Can you use parameters when running Databricks notebooks?
46:07 Difference between Data Lake and Delta Lake? Pros and cons of each.
48:11 What activities are available in ADF?
49:09 Scenario-Based question
Music track: Retro by Chill Pulse
Source: freetouse.com/music
Background Music for Video (Free)
Tags
#mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments: 16
@rishabhkesarwani-br2rx 2 months ago
The guy answered very well! Got a good idea of what to say and what to avoid during an interview.
@lazzybirdflying3225 6 days ago
Though it is a mock interview, I appreciate his calm and pleasant responses to all the questions!
@shaileshchile329 2 months ago
Thanks for the videos. It's very helpful!
@gudiatoka 2 months ago
16:53 A broadcast join is decided on the go, at run time, by Adaptive Query Execution, not by the Spark SQL engine or the Catalyst optimizer as said.
@axatdewangan 1 month ago
Great answers!
@gudiatoka 2 months ago
Whenever a transformation is applied, it does not create a DAG; rather, it creates a lineage between RDDs, and an action creates the DAG.
@shrikantkorate5933 1 month ago
He answered most of the questions to the point. Very good.
@Nalaka-Wanniarachchi 2 months ago
Well scored.
@ravulapallivenkatagurnadha9605 2 months ago
Continue this series
@jithindev9185 1 month ago
๐Ÿ‘๐Ÿ‘๐Ÿ‘๐Ÿ‘
@user-nc9nt9nw5w 20 days ago
The million dollar question is...."Is he selected"..??? and how did he do in the 2nd round..??..2nd round questions please..
@RohitSharma-ny1oq 2 months ago
Good explanation man 😅
@hdr-tech4350 28 days ago
Java is used in Hadoop. Bound to work on MapReduce. MapReduce can only do batch processing, not real time.
@suvenduku2 23 days ago
Sir pls provide the questions in description
@hdr-tech4350 28 days ago
Spark core - RDD (flexible); high-level APIs - DataFrame and Spark SQL (easy to write queries); transformations and actions; spark-submit process; deployment modes; types of transformations; repartition and coalesce; methods for schema enforcement - DDL, struct; consecutive wins in SQL
@rushirajkadge3995 24 days ago
The row_number values for marks are not correct (35:16). The correct output is:
Marks  Row_number
100    1
100    2
99     1
98     1
98     2
98     3
97     1
96     1
95     1
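A minimal sketch for checking the row_number output listed in the comment above. It assumes the numbering restarts within each group of equal marks (ROW_NUMBER() partitioned by marks), which is what the listed output implies; the query used in the video is not shown on this page. The table name `scores`, the column name `marks`, the helper `row_numbers_per_mark`, and the use of Python's built-in sqlite3 module (which needs SQLite 3.25 or newer for window functions) are illustrative assumptions.

```python
import sqlite3

# Hypothetical sample data: the marks listed in the comment.
MARKS = [100, 100, 99, 98, 98, 98, 97, 96, 95]


def row_numbers_per_mark(marks):
    """Return (marks, row_number) pairs; numbering restarts for each distinct mark."""
    conn = sqlite3.connect(":memory:")
    try:
        # Hypothetical table and column names, used only for this sketch.
        conn.execute("CREATE TABLE scores (marks INTEGER)")
        conn.executemany(
            "INSERT INTO scores (marks) VALUES (?)", [(m,) for m in marks]
        )
        query = """
            SELECT marks,
                   ROW_NUMBER() OVER (PARTITION BY marks ORDER BY marks) AS rn
            FROM scores
            ORDER BY marks DESC, rn
        """
        return conn.execute(query).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for marks, rn in row_numbers_per_mark(MARKS):
        print(marks, rn)
```

Dropping the PARTITION BY clause would instead number the rows 1 through 9 across the whole table, so whether the comment's output is the expected one depends on how the interview question was phrased.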