Big Data Engineering Mock Interview | Big Data Pipeline | AWS Cloud Services | Project Architecture

  Рет қаралды 12,615

Sumit Mittal

Sumit Mittal

Күн бұрын

Пікірлер: 58
@thetransformer2217
@thetransformer2217 10 күн бұрын
Satinder has in-depth understanding and a lot of experience and he explains things in precise and easy to understand language. Kudos to him. Thanks a lot Sumit Sir.
@MANGESHpawarsm42
@MANGESHpawarsm42 Ай бұрын
One of the most Informative interview I ever Watched. Big Shout Out to Satinder Singh as he explained topic clear and most understandable way. Thank You.
@imranhossain1660
@imranhossain1660 9 ай бұрын
parquet is a columnar based storage format, so it is a very good file format in terms of retrieving the data through the query. It definitely reduces the usage of i/o read and network bandwidth. Besides that it has built in support for compression in the form of snappy format. So it reduces the space usgae. Another one I can think of is, parquet files comes up a structure with 3 components, they are header, body and footer. Heder actually the name of the file(part001,part002). Body is actual data content which it is storing and footer is basically for the metadata. This metadata includes the minimum and maximum values of the columns. So whenever we try to query the data which is stored in parquet format this metadata helps us for the data skipping which in turn fast our query execution. Hope it helps.
@pallavigosavi6851
@pallavigosavi6851 9 ай бұрын
Thank you!! 👍
@sruthiselvakumar9817
@sruthiselvakumar9817 9 ай бұрын
This interview is really great as Satinder explained some concepts like property for broadcast etc more clearly. Thanks Sumit Sir!! Expecting more videos like this..
@sumitmittal07
@sumitmittal07 9 ай бұрын
satinder will be conducting more interviews
@KiyanshLife
@KiyanshLife 9 ай бұрын
Best Interview I ever seen. Both of you too good at your level.
@sumitmittal07
@sumitmittal07 9 ай бұрын
yes this interview was next level
@mohammedalikhan9819
@mohammedalikhan9819 9 ай бұрын
The interview was more focused on pyspark, sql we expect interviewer to ask more qns on AWS cloud as well. Because in most of the interview videos posted pyspark has been asked a lot.If qns on AWS would have been asked it would have been very helpful.
@sumitmittal07
@sumitmittal07 9 ай бұрын
Hi Mohammed, will definitely have some interviews planned specifically for AWS in the upcoming days.
@mohammedalikhan9819
@mohammedalikhan9819 9 ай бұрын
Thank you sir😊
@avinash7003
@avinash7003 9 ай бұрын
I see mostly asked 70% in Pyspark SQL rest cloud ​@@mohammedalikhan9819
@Hope-xb5jv
@Hope-xb5jv 5 ай бұрын
23:05 use dense rank instead of row number because may be more than one student have same highest number in same subject
@grim_rreaperr
@grim_rreaperr 9 ай бұрын
Hi Sumit Sir, In the first sql problem where we are required to find subject wise toppers, one case where row_number() will fail is when we have two top-scorers with the same marks in a specific subject. Please check the example below: student_name, subject, marks (-- derived column) stud_1, maths, 90 -- 1 stud_2, maths, 90 -- 1 stud_1,economics, 95 --1 stud_2, economics, 90 -- 2 stud_3, economics, 88 -- 3 Instead of row_number(), we can choose any one from rank or dense_rank as we just need the first rankers(based on highest marks scored in each subject). My approach will be as follows: WITH top_scorers AS ( SELECT student_name, subject, marks, DENSE_RANK() OVER(PARTITION BY subject ORDER BY marks DESC) AS rnk FROM student_marks ) SELECT student_name, subject, marks FROM top_scorers WHERE rnk = 1;
@SreemantaKesh
@SreemantaKesh 9 ай бұрын
This was a good interview. Different from the earlier one's. Satinder's question and advice was very good.
@sumitmittal07
@sumitmittal07 9 ай бұрын
this interview has really gone well
@sunitasolankar5161
@sunitasolankar5161 Ай бұрын
Thank you so much satindar sir its very informative and useful while giving interview excellent.
@abhishekmodak8496
@abhishekmodak8496 9 ай бұрын
This was a good interview and Satinder has good experience as an interviewer.
@safarnama65
@safarnama65 9 ай бұрын
Very Informative one of the best mock interview with proper answering and details
@sumitmittal07
@sumitmittal07 9 ай бұрын
Keep watching for more such insightful interviews
@goldykarn5922
@goldykarn5922 9 ай бұрын
Best interview session so far.
@tanujarora4906
@tanujarora4906 7 ай бұрын
Satinder sir is awesome, always something to learn from his questions.
@mojibshaikh4092
@mojibshaikh4092 8 ай бұрын
Informative and Excellent interview.
@Sagar0155
@Sagar0155 9 ай бұрын
Interview was insightful. Learnt core concepts of spark from Satinder
@sumitmittal07
@sumitmittal07 9 ай бұрын
glad that it helped you
@abhishekkmalik4399
@abhishekkmalik4399 9 ай бұрын
Very informative video, liked the point of view by Satinder Sir.
@sumitmittal07
@sumitmittal07 9 ай бұрын
satinder is a very knowledgeable person
@DesireIsIrrelevant
@DesireIsIrrelevant 9 ай бұрын
Thanks for uploading such a great Interview video Sir!
@sumitmittal07
@sumitmittal07 9 ай бұрын
Glad you found the interview informative!
@AliKhanLuckky
@AliKhanLuckky 9 ай бұрын
Sir i personaly want to see satinder sirs more interviews 😊
@sumitmittal07
@sumitmittal07 9 ай бұрын
yes definitely, he will be conducting more interviews
@akshaykumarverma8644
@akshaykumarverma8644 9 ай бұрын
This was a very good video
@sauravroy9889
@sauravroy9889 9 ай бұрын
Really nice interview sir.❤
@DataJourneyHuub
@DataJourneyHuub 9 ай бұрын
It’s really helpful sir. Thank you so much
@sumitmittal07
@sumitmittal07 9 ай бұрын
Most welcome
@sabyspeaksonline
@sabyspeaksonline 9 ай бұрын
What's the difference between parquet and delta format?
@ashwenkumar
@ashwenkumar 8 ай бұрын
Aditya - u need to be strong in the basics and always answer straight forward and crisply on points . Don’t beat the bush
@akashprabhakar6353
@akashprabhakar6353 3 ай бұрын
i felt the same.
@ameygoesgaming8793
@ameygoesgaming8793 9 ай бұрын
My SQL would be: SELECT student_id, max(marks) FROM class GROUP BY subject
@grim_rreaperr
@grim_rreaperr 9 ай бұрын
every non-aggregated column in your select statement must be included in the group by statement.( here student_id is a non aggregated column and it should be in your group by clause and same applies for the subject column too which is not being called in the select statement)
@ameygoesgaming8793
@ameygoesgaming8793 9 ай бұрын
@@grim_rreaperr Oh yes, its a typing bug. It should be: SELECT subject, max(marks) FROM class GROUP BY subject
@Abhishek-14
@Abhishek-14 9 ай бұрын
Sir please continue python course along with this 🙏
@sumitmittal07
@sumitmittal07 9 ай бұрын
yes, one video coming tomorrow at 7 pm
@Abhishek-14
@Abhishek-14 9 ай бұрын
@@sumitmittal07 thank you so much sir that's a relief to hear this.
@Amrit-Manash
@Amrit-Manash 9 ай бұрын
Very nice interview
@sumitmittal07
@sumitmittal07 9 ай бұрын
glad that you liked it
@zaffer2024
@zaffer2024 9 ай бұрын
Excellent
@sumitmittal07
@sumitmittal07 9 ай бұрын
Thanks
@ameygoesgaming8793
@ameygoesgaming8793 9 ай бұрын
what is NC SQL way?
@SB-ix7db
@SB-ix7db 9 ай бұрын
ANSI
@ameygoesgaming8793
@ameygoesgaming8793 9 ай бұрын
so ANSI SQL is normal SQL syntax which we write right?@@SB-ix7db
@doyouwanttoknow3366
@doyouwanttoknow3366 9 ай бұрын
Please upload a gcp data engineer interview video sir
@sumitmittal07
@sumitmittal07 9 ай бұрын
very soon
@mohitbutola1140
@mohitbutola1140 9 ай бұрын
have anyone have taken the course ?
@sumitmittal07
@sumitmittal07 9 ай бұрын
Please share your contact number if you would like to know more about the courses that I offer
@zaffer2024
@zaffer2024 9 ай бұрын
Why data engineer roles have very easy questions
@sumitmittal07
@sumitmittal07 9 ай бұрын
we make it look easy, else its complex.. haha
@akhilsingh3801
@akhilsingh3801 6 ай бұрын
Bro is cheating on mock interview with zero fundamental knowledge of Spark or Hadoop 😂😂😂. At least interviewer has asked questions to get something out of this video.
Beat Ronaldo, Win $1,000,000
22:45
MrBeast
Рет қаралды 158 МЛН
Гениальное изобретение из обычного стаканчика!
00:31
Лютая физика | Олимпиадная физика
Рет қаралды 4,8 МЛН
Леон киллер и Оля Полякова 😹
00:42
Канал Смеха
Рет қаралды 4,7 МЛН
GCP Data Engineer Mock  interview
15:22
Grow With Google Cloud
Рет қаралды 6 М.
Big Data Mock Interview
39:34
The Big Data Show
Рет қаралды 9 М.
The only Cloud services you actually need to know
17:17
NeetCodeIO
Рет қаралды 209 М.
Beat Ronaldo, Win $1,000,000
22:45
MrBeast
Рет қаралды 158 МЛН