Azure Data Engineer Mock Interview | PySpark | Delta Live Tables| Managerial

Views: 5,428

Sumit Mittal

To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
I have trained over 20,000 professionals in the field of Data Engineering in the last 5 years.
Want to master SQL? Learn SQL the right way through the most sought-after course - SQL Champions Program!
"An 8-week program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solve an unseen problem."
Here is how you can register for the program -
Registration Link (Course Access from India) : rzp.io/l/SQLINR
Registration Link (Course Access from outside India) : rzp.io/l/SQLUSD
30 INTERVIEWS IN 30 DAYS- BIG DATA INTERVIEW SERIES
This mock interview series is launched as a community initiative under the Data Engineers Club, aimed at aiding the community's growth and development.
Our highly experienced guest interviewer, Ankur Bhattacharya (/ ankur-bhattacharya-100...), shares invaluable insights and practical advice from his extensive experience in the Big Data domain.
Our expert guest interviewee, Srinivaasan AK (/ srinivaasan-ak-609b8a195), has a remarkable approach to answering the coding-round interview questions on Azure cloud services.
Links to the free SQL and Python series developed by me are given below -
SQL Playlist - SQL tutorial for every...
Python Playlist - Complete Python By Sum...
Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
Social Media Links :
LinkedIn - / bigdatabysumit
Twitter - / bigdatasumit
Instagram - / bigdatabysumit
Student Testimonials - trendytech.in/#testimonials
Discussed Questions with Timestamp
1:16 Data engineering experience, current project, tech stack?
3:11 Explain Data pipeline architecture of current project?
5:45 What are your Roles and responsibilities?
7:14 How do you manage late-arriving records in Delta Live Tables?
7:58 Explain cluster configuration for handling 1000 GB of data?
9:47 Experience with latency bottlenecks and resolution?
11:16 How much data do you handle?
11:33 How do you handle slowness or out of memory issues due to increased load?
13:52 What is the OPTIMIZE command in PySpark (Delta Lake) and how does it work?
14:47 What are the join strategies in PySpark?
16:00 How would you join two large tables to avoid shuffling?
17:53 What components would you use to design a pipeline for on-premise CDC and live dashboard updates?
21:27 How would you implement CDC from on-prem to streaming data?
24:10 SQL Coding question
Tags
#mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments: 13
@imranhossain1660 4 months ago
OPTIMIZE is a performance optimization technique available for Delta Lake tables. Every DML operation on a Delta table writes new data files. Over time this produces a huge number of small files, which is overhead for the Delta engine: queries run less efficiently because resource usage such as I/O reads/writes and computation increases. The OPTIMIZE command combines these small files into larger files, which improves the performance of the Delta table. After the OPTIMIZE operation, queries read the latest snapshot of the table, which references the compacted files. The obsolete small files can then be deleted to free up space with the VACUUM command.
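The compaction idea in this comment can be sketched with a toy model. This is not Delta Lake's actual algorithm; it is a minimal greedy grouping of file sizes, and the function name `compact`, the `target_mb` parameter, and the sample sizes are all hypothetical. On a real Delta table the corresponding SQL commands are `OPTIMIZE <table>` followed later by `VACUUM <table>`.

```python
from typing import List


def compact(file_sizes_mb: List[int], target_mb: int = 1024) -> List[List[int]]:
    """Greedily group files into bins whose total size stays within target_mb.

    Toy illustration only: each returned bin stands for one larger file
    that would replace the small files listed inside it.
    """
    bins: List[List[int]] = []
    current: List[int] = []
    current_size = 0
    # Largest files first, so big files claim a bin before small ones fill gaps.
    for size in sorted(file_sizes_mb, reverse=True):
        # Close the current bin if adding this file would exceed the target.
        if current and current_size + size > target_mb:
            bins.append(current)
            current, current_size = [], 0
        current.append(size)
        current_size += size
    if current:
        bins.append(current)
    return bins


# Hypothetical sizes (in MB) of small files left behind by repeated DML.
small_files = [8, 16, 4, 32, 64, 8, 128, 256, 512, 16]
compacted = compact(small_files, target_mb=512)

print(f"{len(small_files)} files before, {len(compacted)} files after")
for group in compacted:
    print(sum(group), "MB from", group)
```

Fewer, larger files mean fewer file opens and less metadata to process per query, which is the benefit the comment describes. The old small files still occupy storage until they are cleaned up, which is the role of VACUUM.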
@user-jg2tn1wb3d 4 months ago
Thank you, Sumit sir.
@prakashtripathi270 4 months ago
Thank you, Sumit sir, for arranging such an insightful session.
@sumitmittal07 3 months ago
Always happy to help the community!
@MsMohanj 4 months ago
Is the join correct, or can we go for a left join?
@maheshtiwari2297 4 months ago
Hello sir, I have an interview for a Big Data/ETL developer role at Amazon. Please guide me for that.
@codinggeek9992 2 months ago
Fewer questions on Azure Synapse...
@user-jg2tn1wb3d 4 months ago
Sir, we need a video on what business requirements look like, how data engineers receive them, and the working strategy.
@sumitmittal07 3 months ago
Noted. Will have a session around this aspect!
@tahiliani22 3 months ago
I would like to add to this. If I am understanding correctly, @user_j% is talking about questions like Design Yelp but from a Database perspective.
@zaffer2024 3 months ago
Tough SQL question 😭
@user-nv6ho7uk8b 3 months ago
Check this.
create table oldest_youngest(person varchar(10), type varchar(20), age int);
insert into oldest_youngest values
('A1','ADULT',54), ('A2','ADULT',53), ('A3','ADULT',52), ('A4','ADULT',58), ('A5','ADULT',54),
('C1','CHILD',20), ('C2','CHILD',19), ('C3','CHILD',22), ('C4','CHILD',15);
-- Rank adults from oldest to youngest.
WITH ranked_adult AS (
  SELECT person AS adult, ROW_NUMBER() OVER (ORDER BY age DESC) AS r_a
  FROM oldest_youngest WHERE type = 'ADULT'
),
-- Rank children from youngest to oldest; 'CHILD' must match the case of the inserted values.
ranked_child AS (
  SELECT person AS child, ROW_NUMBER() OVER (ORDER BY age ASC) AS r_c
  FROM oldest_youngest WHERE type = 'CHILD'
)
-- LEFT JOIN keeps adults that have no child left to pair with.
SELECT adult, child
FROM ranked_adult a
LEFT JOIN ranked_child c ON a.r_a = c.r_c;
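The query in this comment can be tried out with Python's built-in `sqlite3` module. This is a sketch under some assumptions, not the solution given in the video: the task appears to be pairing the oldest adult with the youngest child, the filter uses 'CHILD' in upper case to match the inserted rows (SQLite compares strings case-sensitively), and `person` is added as a tie-breaker so that the two adults aged 54 are ranked deterministically. It needs an SQLite build with window-function support (3.25 or later).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE oldest_youngest (person VARCHAR(10), type VARCHAR(20), age INT);
INSERT INTO oldest_youngest VALUES
  ('A1','ADULT',54), ('A2','ADULT',53), ('A3','ADULT',52),
  ('A4','ADULT',58), ('A5','ADULT',54),
  ('C1','CHILD',20), ('C2','CHILD',19), ('C3','CHILD',22), ('C4','CHILD',15);
""")

QUERY = """
WITH ranked_adult AS (
  -- Oldest adult gets rank 1; person breaks ties (assumption, not in the comment).
  SELECT person AS adult,
         ROW_NUMBER() OVER (ORDER BY age DESC, person) AS r_a
  FROM oldest_youngest
  WHERE type = 'ADULT'
),
ranked_child AS (
  -- Youngest child gets rank 1.
  SELECT person AS child,
         ROW_NUMBER() OVER (ORDER BY age ASC, person) AS r_c
  FROM oldest_youngest
  WHERE type = 'CHILD'
)
SELECT a.adult, c.child
FROM ranked_adult a
LEFT JOIN ranked_child c ON a.r_a = c.r_c
ORDER BY a.r_a
"""

pairs = conn.execute(QUERY).fetchall()
for adult, child in pairs:
    print(adult, child)
# A4 C4
# A1 C2
# A5 C1
# A2 C3
# A3 None
```

There are five adults and only four children, so the LEFT JOIN leaves the last adult (A3) unpaired with a NULL child. An inner join would drop that row, which is the trade-off behind the left-join question raised in an earlier comment.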