Mock Interview for Data Engineers | Spark Optimizations | Real-time Project Challenges and Scenarios

Views: 11,802

Sumit Mittal


To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
I have trained over 20,000 professionals in the field of Data Engineering in the last 5 years.
Want to master SQL? Learn SQL the right way through the most sought-after course - SQL Champions Program!
"An 8-week program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solve an unseen problem."
Here is how you can register for the program -
Registration Link (Course Access from India) : rzp.io/l/SQLINR
Registration Link (Course Access from outside India) : rzp.io/l/SQLUSD
30 INTERVIEWS IN 30 DAYS - BIG DATA INTERVIEW SERIES
This mock interview series is launched as a community initiative under Data Engineers Club, aimed at aiding the community's growth and development.
Our highly experienced guest interviewer, Himanshu Mishra ( / himanshu-mishra-4796014b ), conducts an engaging interview covering all the important topics that a Data Engineer should be aware of.
Our talented guest interviewee, Hamida Bano ( / hamida-bano-793804208 ), answers the interview questions in a simple way with good examples.
Links to the free SQL & Python series developed by me are given below -
SQL Playlist - • SQL tutorial for every...
Python Playlist - • Complete Python By Sum...
Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
Social Media Links :
LinkedIn - / bigdatabysumit
Twitter - / bigdatasumit
Instagram - / bigdatabysumit
Student Testimonials - trendytech.in/#testimonials
Discussed Questions : Timestamp
1:40 Introduction
2:21 Challenges you faced in your project
4:40 What's your contribution to your project?
6:20 File formats you have worked on in your project?
7:53 What are wide and narrow transformations?
9:38 Lazy evaluation in Spark?
11:25 What is fault tolerance in Spark and MapReduce, and how does it work?
13:32 Client mode and cluster mode in Spark?
14:15 Broadcast joins in Spark?
15:18 Memory management in Spark?
18:12 In live production, if you are facing an out-of-memory error, what approach do you follow to debug it?
19:51 What is data skewness?
20:16 What is caching?
21:38 How do you test your Spark code?
22:17 What performance tuning techniques do you use to tune your Spark job?
23:18 What is coalesce and when should we use it?
24:54 Managed and external tables with a use case
26:28 How do you deploy your Spark code?
27:29 How did you schedule your workflow?
28:14 What version control tools have you used?
28:49 What is shuffling and why do we need to think about minimising it?
29:50 One of the Spark jobs you've developed is experiencing slow performance. How would you go about resolving this issue?
31:00 What transformations and actions have you performed in the current project?
32:03 How does Spark work? Explain the Spark architecture.
33:05 What is lineage in Spark?
33:50 Different types of joins in Spark? A use case for any one of those joins?
35:25 What is a Spark session and how do we initialise it?
36:33 How to read a Parquet file into a DataFrame?
37:37 How can you perform filters on a DataFrame?
39:20 How to remove duplicates in a DataFrame?
39:56 Consider a scenario where we want to rename a column in a DataFrame. How will you do this?
40:40 Usage of withColumn?
41:27 How to remove a column from a DataFrame?
41:50 Have you handled any null values in your DataFrame?
42:37 SQL coding question
Tags
#mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments: 26
@chetankakkireni8870 4 months ago
she spoke about user memory, executor memory, cache memory which uses off heap memory which does not use garbage collector, which I felt very useful.
@himanshusrivastava64 6 days ago
Her knowledge !! Commendable
@akshaythengane4302 2 months ago
This series is too good! Keep em coming!
@poojabarawkar1808 4 months ago
Thanks
@PraveenSingh-no8ol 4 months ago
Sumit Sir kindly make a video on a person who has transition from non-It to Data Engineering profile it will be really helpful
@prannay19 4 months ago
Thanks again. I am following these closely and feel that these would be immensely helpful in cracking the interviews. Appreciate it. 👍
@sumitmittal07 4 months ago
definitely
@zaffer2024 4 months ago
🙏
@_-_Abhinav_-_33 4 months ago
This interview is really very helpful. Thank you so much Sir for this entire series.
@sumitmittal07 4 months ago
Pleasure to share more such content for all my supportive followers!
@swapnildande4706 4 months ago
Really thanks sir for mock interview playlist 🙏🏻
@sumitmittal07 4 months ago
Most welcome
@sadiqueahmad6781 4 months ago
Insightful interview 👍
@sumitmittal07 4 months ago
thank you
@karthikeyanudayakumar9553 4 months ago
Excellent mock interview 👍
@sumitmittal07 4 months ago
Glad you enjoyed it!
@user-rx3vl2en5i 4 months ago
Hi sir good morning it was helpful to us please do make some AWS data engineering interview also instead of azure..
@sumitmittal07 4 months ago
Noted
@user-rx3vl2en5i 4 months ago
Yeah please we facing the end to end data pipeline AWS side explanation where use etl used nd which transfer that used and so on.
@pritamkabiraj7691 2 months ago
Hi Sumit Sir I also want to appear for Mock Interview. Is there any process involved or Can you help me with the process to appear?
@umeshpagoti1017 4 months ago
Sir continue the python videos
@sumitmittal07 4 months ago
yes
@shiprasarwada 4 months ago
Sir keep mock interviews for gcp data engineer
@sumitmittal07 4 months ago
sure
@karthikeyanr1171 4 months ago
too many questions
@telugoons2292 4 months ago
Thanks