Data Engineering Mock Interview | Spark Optimization Interview Questions | Best Coding Practices

  ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 9,446

Sumit Mittal

Sumit Mittal

4 ะฐะน ะฑาฑั€ั‹ะฝ

๐“๐จ ๐ž๐ง๐ก๐š๐ง๐œ๐ž ๐ฒ๐จ๐ฎ๐ซ ๐œ๐š๐ซ๐ž๐ž๐ซ ๐š๐ฌ ๐š ๐‚๐ฅ๐จ๐ฎ๐ ๐ƒ๐š๐ญ๐š ๐„๐ง๐ ๐ข๐ง๐ž๐ž๐ซ, ๐‚๐ก๐ž๐œ๐ค trendytech.in/?src=youtube&su... for curated courses developed by me.
I have trained over 20,000+ professionals in the field of Data Engineering in the last 5 years.
๐–๐š๐ง๐ญ ๐ญ๐จ ๐Œ๐š๐ฌ๐ญ๐ž๐ซ ๐’๐๐‹? ๐‹๐ž๐š๐ซ๐ง ๐’๐๐‹ ๐ญ๐ก๐ž ๐ซ๐ข๐ ๐ก๐ญ ๐ฐ๐š๐ฒ ๐ญ๐ก๐ซ๐จ๐ฎ๐ ๐ก ๐ญ๐ก๐ž ๐ฆ๐จ๐ฌ๐ญ ๐ฌ๐จ๐ฎ๐ ๐ก๐ญ ๐š๐Ÿ๐ญ๐ž๐ซ ๐œ๐จ๐ฎ๐ซ๐ฌ๐ž - ๐’๐๐‹ ๐‚๐ก๐š๐ฆ๐ฉ๐ข๐จ๐ง๐ฌ ๐๐ซ๐จ๐ ๐ซ๐š๐ฆ!
"๐€ 8 ๐ฐ๐ž๐ž๐ค ๐๐ซ๐จ๐ ๐ซ๐š๐ฆ ๐๐ž๐ฌ๐ข๐ ๐ง๐ž๐ ๐ญ๐จ ๐ก๐ž๐ฅ๐ฉ ๐ฒ๐จ๐ฎ ๐œ๐ซ๐š๐œ๐ค ๐ญ๐ก๐ž ๐ข๐ง๐ญ๐ž๐ซ๐ฏ๐ข๐ž๐ฐ๐ฌ ๐จ๐Ÿ ๐ญ๐จ๐ฉ ๐ฉ๐ซ๐จ๐๐ฎ๐œ๐ญ ๐›๐š๐ฌ๐ž๐ ๐œ๐จ๐ฆ๐ฉ๐š๐ง๐ข๐ž๐ฌ ๐›๐ฒ ๐๐ž๐ฏ๐ž๐ฅ๐จ๐ฉ๐ข๐ง๐  ๐š ๐ญ๐ก๐จ๐ฎ๐ ๐ก๐ญ ๐ฉ๐ซ๐จ๐œ๐ž๐ฌ๐ฌ ๐š๐ง๐ ๐š๐ง ๐š๐ฉ๐ฉ๐ซ๐จ๐š๐œ๐ก ๐ญ๐จ ๐ฌ๐จ๐ฅ๐ฏ๐ž ๐š๐ง ๐ฎ๐ง๐ฌ๐ž๐ž๐ง ๐๐ซ๐จ๐›๐ฅ๐ž๐ฆ."
๐‡๐ž๐ซ๐ž ๐ข๐ฌ ๐ก๐จ๐ฐ ๐ฒ๐จ๐ฎ ๐œ๐š๐ง ๐ซ๐ž๐ ๐ข๐ฌ๐ญ๐ž๐ซ ๐Ÿ๐จ๐ซ ๐ญ๐ก๐ž ๐๐ซ๐จ๐ ๐ซ๐š๐ฆ -
๐‘๐ž๐ ๐ข๐ฌ๐ญ๐ซ๐š๐ญ๐ข๐จ๐ง ๐‹๐ข๐ง๐ค (๐‚๐จ๐ฎ๐ซ๐ฌ๐ž ๐€๐œ๐œ๐ž๐ฌ๐ฌ ๐Ÿ๐ซ๐จ๐ฆ ๐ˆ๐ง๐๐ข๐š) : rzp.io/l/SQLINR
๐‘๐ž๐ ๐ข๐ฌ๐ญ๐ซ๐š๐ญ๐ข๐จ๐ง ๐‹๐ข๐ง๐ค (๐‚๐จ๐ฎ๐ซ๐ฌ๐ž ๐€๐œ๐œ๐ž๐ฌ๐ฌ ๐Ÿ๐ซ๐จ๐ฆ ๐จ๐ฎ๐ญ๐ฌ๐ข๐๐ž ๐ˆ๐ง๐๐ข๐š) : rzp.io/l/SQLUSD
30 INTERVIEWS IN 30 DAYS- BIG DATA INTERVIEW SERIES
This mock interview series is launched as a community initiative under Data Engineers Club aimed at aiding the community's growth and development
Expert guest interviewer, Sachin R, / sachin-r27 imparts invaluable insights and practical advice derived from extensive experience.
Suman Basu, / basusuman23 skilled guest interviewee, showcases an exceptional approach in answering interview questions.
Link of Free SQL & Python series developed by me are given below -
SQL Playlist - โ€ข SQL tutorial for every...
Python Playlist - โ€ข Complete Python By Sum...
Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
Social Media Links :
LinkedIn - / bigdatabysumit
Twitter - / bigdatasumit
Instagram - / bigdatabysumit
Student Testimonials - trendytech.in/#testimonials
Discussed Questions : Timestamp
1:37 Introduction
2:50 Brief about your project responsibilities
5:26 Discuss SQL code documentation best practices for ensuring query efficiency.
9:56 What are transformations and actions in PySpark DataFrames?
10:35 What are the best practices you have followed specific to PySpark?
12:39 What is the difference between cache and persist?
13:33 Explain the concept of partitioning.
14:58 When allocating multiple worker nodes/executors, how to increase or decrease the number of partitions?
16:38 Which is more effective in avoiding data skewness. Repartitioning or coalesce? what is data skewness?
18:07 Coding questions
36:20 Dealing with data quality issues
38:30 After fetching data from CSV files, how would you define the schema?
41:00 Preferred file format for data loading.
Tags
#mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

ะŸั–ะบั–ั€ะปะตั€: 19
@Vlogs..573
@Vlogs..573 4 ะฐะน ะฑาฑั€ั‹ะฝ
Sachin is really knowledgeable, and he is helping to answer the questions as well with Suman.
@sumitmittal07
@sumitmittal07 4 ะฐะน ะฑาฑั€ั‹ะฝ
yes both have been great. Kudos to Sachin & Suman.
@isenhiem
@isenhiem ะะน ะฑาฑั€ั‹ะฝ
This is such an amazing initiative...While watching the video I felt like as if I was being interviewed...I cant stress on how helpful this will be for so many people. It gave me a very good idea of the level of my preparation. Thanks a lot and I hope you will create more videos like this.
@sharankarthick3364
@sharankarthick3364 3 ะฐะน ะฑาฑั€ั‹ะฝ
Informative!
@prannay19
@prannay19 4 ะฐะน ะฑาฑั€ั‹ะฝ
Great initiative. Thank you Sumit Sir ๐Ÿ™. Looking forward to more such videos. Keep up the good work ๐Ÿ‘
@sumitmittal07
@sumitmittal07 4 ะฐะน ะฑาฑั€ั‹ะฝ
Thanks a ton
@user-ji9ke8yb2d
@user-ji9ke8yb2d 4 ะฐะน ะฑาฑั€ั‹ะฝ
Thank you so much Sumit sir.Really a great initiative
@sumitmittal07
@sumitmittal07 4 ะฐะน ะฑาฑั€ั‹ะฝ
thank you very much
@DataJourneyHuub
@DataJourneyHuub 4 ะฐะน ะฑาฑั€ั‹ะฝ
Thank you Sumit Sir
@sumitmittal07
@sumitmittal07 4 ะฐะน ะฑาฑั€ั‹ะฝ
you are welcome
@crunchyworks6374
@crunchyworks6374 4 ะฐะน ะฑาฑั€ั‹ะฝ
Sir as I see from last 3 days everytime cloud tech you use is Azure only , please make it on AWS too itโ€™s very helpful
@sumitmittal07
@sumitmittal07 4 ะฐะน ะฑาฑั€ั‹ะฝ
definitely, you will see a lot of variety
@AliKhanLuckky
@AliKhanLuckky 4 ะฐะน ะฑาฑั€ั‹ะฝ
36:03 1.he is asking only highest 2. Dept vise highest Use sql code as follow 1.select max(salary) from emp; 2 select dept,max(salary) from emp group by dept; As simple as that he did not asked you to write window function if he ask you then do it ๐Ÿ˜Š
@sriharidhanakshirur9245
@sriharidhanakshirur9245 4 ะฐะน ะฑาฑั€ั‹ะฝ
In case 1 , we should use WinDow function bcoz, we need to print id and name as well
@AliKhanLuckky
@AliKhanLuckky 4 ะฐะน ะฑาฑั€ั‹ะฝ
@@sriharidhanakshirur9245 in this case u can use sub query as well if anyone explicitly ask you is there any other way or do it using windows then at that time interviewer will get impress ๐Ÿ˜Š
@user-oy9cc8dv8i
@user-oy9cc8dv8i ะะน ะฑาฑั€ั‹ะฝ
if possible mention the experience also , to which experience level these interview are targeting (like this is for 1 year, fresher or for 3 year experience )
@RohitSharma-ny1oq
@RohitSharma-ny1oq 4 ะฐะน ะฑาฑั€ั‹ะฝ
Plz increase little bit complexity of interview because in actual its more complex ๐Ÿ˜Š
@sumitmittal07
@sumitmittal07 4 ะฐะน ะฑาฑั€ั‹ะฝ
candidates mostly get stuck in basic fundamentals. These are actual people who conduct interviews in companies.
@IsmailKhan-jy9ew
@IsmailKhan-jy9ew 4 ะฐะน ะฑาฑั€ั‹ะฝ
Thankyou sumit sir for this initiative.
Data Engineer Mock Interview | ADF | Medallion Architecture | BRONZE, SILVER & GOLD Layer| ADLS GEN2
41:04
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 10 ะœ.
Mock Interview for Data Engineers | Spark Optimizations | Real-time Project Challenges and Scenarios
45:21
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 11 ะœ.
ะšะฐะบ ะฑะตัะฟะปะฐั‚ะฝะพ ะทะฐะผัƒั‚ะธั‚ัŒ iphone 15 pro max
00:59
ะ–ะ•ะ›ะ•ะ—ะะซะ™ ะšะžะ ะžะ›ะฌ
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 7 ะœะ›ะ
Stay on your way ๐Ÿ›ค๏ธโœจ
00:34
A4
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 13 ะœะ›ะ
Heartwarming Unity at School Event #shorts
00:19
Fabiosa Stories
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 22 ะœะ›ะ
EVOLUTION OF ICE CREAM ๐Ÿ˜ฑ #shorts
00:11
Savage Vlogs
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 4,2 ะœะ›ะ
Big Data Engineering Mock Interview | Big Data Pipeline | AWS Cloud Services | Project Architecture
31:41
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 9 ะœ.
Big Data Engineer Mock Interview | Real-time Project Questions | Amount of Data | Cluster Size
25:35
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 4 ะœ.
How i Cracked 5 offers in 30 days
14:17
TechDataEngineer
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 338
Cloud Data Engineer Mock Interview | PySpark Coding Interview Questions |Azure Databricks #question
31:45
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 16 ะœ.
Top 10 Data Analyst Interview Questions (with answers)
15:58
Chandoo
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 81 ะœ.
Asking Google Engineers How To Get Hired and Their Salaries
11:05
The Code Skool
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 1,6 ะœะ›ะ
Data Engineering Interview at top product based company | First Round
40:07
The Big Data Show
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 6 ะœ.
10 PySpark Product Based Interview Questions
39:46
The Data Tech
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 15 ะœ.
Big Data Engineer Mock Interview | Big Data Project Pipeline | Managerial #interview #question
31:19
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 5 ะœ.
Cloud Data Engineer Mock Interview | Focusing on SQL, PySpark, Project & Cloud Questions.
34:57
Sumit Mittal
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 10 ะœ.
ะšะฐะบ ะฑะตัะฟะปะฐั‚ะฝะพ ะทะฐะผัƒั‚ะธั‚ัŒ iphone 15 pro max
00:59
ะ–ะ•ะ›ะ•ะ—ะะซะ™ ะšะžะ ะžะ›ะฌ
ะ ะตั‚ า›ะฐั€ะฐะปะดั‹ 7 ะœะ›ะ