Big Data Engineer Mock Interview | AWS | Kafka Streaming | SQL | PySpark Optimization

Views: 20,946

Sumit Mittal

Day ago

Comments: 9
@arunsundar3739 10 months ago
Very insightful on SQL, AWS, and data modeling concepts and their applications. It helps to recall and better understand the concepts learnt in the big data master course and the SQL LeetCode playlist :)
@sonuparmar5836 10 months ago
@sumitmittal07 The SQL aggregate question that asks for cumulative profit doesn't need ROWS BETWEEN, since that is used for a rolling profit over a range of rows. Instead, it should simply be: CUMULATIVE_PROFIT = SUM(profit) OVER(ORDER BY transaction_id, transaction_date). Let me know whether I understood the question correctly. Also, in the partitioning and bucketing question, the interviewee explained the two the other way around.
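To make the distinction in the comment above concrete, here is a minimal sketch using Python's built-in `sqlite3` module (assuming the bundled SQLite is 3.25 or newer, which is when window functions were added). The `transactions` table and its four rows are hypothetical, made up for illustration. The sketch compares a running sum with no explicit frame, the same sum with an explicit `ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW` frame, and a two-row rolling sum.

```python
import sqlite3

# Hypothetical sample data; the table and column names follow the comment above.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE transactions ("
    "transaction_id INTEGER, transaction_date TEXT, profit INTEGER)"
)
conn.executemany(
    "INSERT INTO transactions VALUES (?, ?, ?)",
    [
        (1, "2024-01-01", 100),
        (2, "2024-01-02", -30),
        (3, "2024-01-03", 50),
        (4, "2024-01-04", 20),
    ],
)

query = """
SELECT transaction_id,
       -- No explicit frame: with ORDER BY, the default frame runs from the
       -- start of the partition to the current row, i.e. a cumulative sum.
       SUM(profit) OVER (ORDER BY transaction_id, transaction_date)
           AS cumulative_profit,
       -- The same result with the frame spelled out.
       SUM(profit) OVER (ORDER BY transaction_id, transaction_date
                         ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
           AS cumulative_explicit,
       -- A bounded frame gives a rolling sum over the last two rows.
       SUM(profit) OVER (ORDER BY transaction_id, transaction_date
                         ROWS BETWEEN 1 PRECEDING AND CURRENT ROW)
           AS rolling_two_rows
FROM transactions
ORDER BY transaction_id
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

One caveat: the default frame is a RANGE frame, so rows that tie on the ORDER BY columns are treated as peers and share the same running total. Ordering by a unique column, as here, avoids that difference from the explicit ROWS frame.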
@aniruths9900 8 months ago
You are right - Buckets are stored as files. Partitions are stored as directories.
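The directory-versus-file distinction can be sketched in plain Python, without Hive or Spark. This is only an illustration of the on-disk layout: the column names (`country`, `user_id`, `amount`), the bucket count, and the file naming are assumptions, and `zlib.crc32` is a stand-in for the engine's own hash function, which differs in Hive and Spark.

```python
import tempfile
import zlib
from pathlib import Path

NUM_BUCKETS = 4  # hypothetical bucket count


def bucket_for(key: str, num_buckets: int = NUM_BUCKETS) -> int:
    """Map a bucketing key to a bucket number (stand-in hash, not Hive's)."""
    return zlib.crc32(key.encode("utf-8")) % num_buckets


def write_layout(rows, root: Path) -> None:
    """Write rows as one directory per partition value, one file per bucket."""
    for row in rows:
        # Partition column value becomes a directory name.
        part_dir = root / f"country={row['country']}"
        part_dir.mkdir(parents=True, exist_ok=True)
        # Bucket number selects a file inside that directory.
        bucket_file = part_dir / f"bucket_{bucket_for(row['user_id']):05d}.csv"
        with bucket_file.open("a", encoding="utf-8") as fh:
            fh.write(f"{row['user_id']},{row['amount']}\n")


# Hypothetical sample rows.
rows = [
    {"country": "IN", "user_id": "u1", "amount": 10},
    {"country": "IN", "user_id": "u2", "amount": 20},
    {"country": "US", "user_id": "u1", "amount": 30},
    {"country": "US", "user_id": "u3", "amount": 40},
]

root = Path(tempfile.mkdtemp())
write_layout(rows, root)

partitions = sorted(p.name for p in root.iterdir() if p.is_dir())
print(partitions)
for part in partitions:
    print(part, sorted(f.name for f in (root / part).iterdir()))
```

The number of directories grows with the number of distinct partition values, while the number of files per directory is capped by the bucket count, which is one reason bucketing suits high-cardinality columns.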
@avicool08 2 months ago
Good 👍
@ankandatta4352 10 months ago
When a primary key is unavailable, we can create one: select any attribute and check whether it has a one-to-one relationship with the other composite values (in Excel, use a pivot table to check distinct values), then use SHA-2 or MD5 in ADF to form the surrogate key. Correct me if I'm wrong.
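The idea in the comment above can be sketched in Python rather than in ADF or Excel. This is an assumption-laden illustration, not the ADF expression itself: the sample rows, the column names, and the helpers `is_unique_key` and `surrogate_key` are all hypothetical. It first checks whether a candidate set of columns identifies each row uniquely, then hashes those columns with SHA-256 (one member of the SHA-2 family) to form the key.

```python
import hashlib


def is_unique_key(rows, columns):
    """Return True if the given columns identify every row uniquely."""
    seen = set()
    for row in rows:
        key = tuple(row[c] for c in columns)
        if key in seen:
            return False
        seen.add(key)
    return True


def surrogate_key(row, columns, sep="|"):
    """Hash the chosen columns into a hex digest.

    The separator keeps ("ab", "c") and ("a", "bc") from producing
    the same input string.
    """
    raw = sep.join(str(row[c]) for c in columns)
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()


# Hypothetical sample rows.
rows = [
    {"customer_id": "c1", "order_date": "2024-01-01", "amount": 10},
    {"customer_id": "c1", "order_date": "2024-01-02", "amount": 20},
    {"customer_id": "c2", "order_date": "2024-01-01", "amount": 30},
]

print(is_unique_key(rows, ["customer_id"]))                # one column alone
print(is_unique_key(rows, ["customer_id", "order_date"]))  # composite candidate

key_columns = ["customer_id", "order_date"]
keys = [surrogate_key(r, key_columns) for r in rows]
for k in keys:
    print(k)
```

Swapping in `hashlib.md5` would give a shorter 32-character key. Either way the hash is deterministic, so reloading the same source rows reproduces the same keys.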
@rajeshvijayakumar 10 months ago
Yes, I was also thinking about MD5.
@dattabandi9226 10 months ago
👌👌
@akashprabhakar6353 4 months ago
The interviewer looks like Tarun Gill :) BTW, nice interview.
@harirk3239 2 months ago
It's actually not good that the candidate has to share their screen and write the query.