01. Databricks: Spark Architecture & Internal Working Mechanism

272,068 views

Raja's Data Engineering

A day ago

Comments: 273
@souravchoudhury3698 1 year ago
Not sure why your channel does not show up when searching for a PySpark tutorial. I spoke to a developer on LinkedIn and he suggested your channel. Great work, thank you Sir!
@rajasdataengineering7585 1 year ago
Glad to hear it helps you! Thanks for visiting my channel
@jatinsethi6415 1 month ago
Thanks for the videos. Your videos really helped me to switch my job. Thanks for the great content. Your explanation is awesome. Thanks again.
@rajasdataengineering7585 1 month ago
Great to hear! Thank you so much!
@abhinavsingh2894 2 years ago
This is an absolute masterpiece on the introduction of Spark and all its internal structure. Thank you for such a detailed video.
@rajasdataengineering7585 2 years ago
Thank you Abhinav👍🏻
@abhinavsingh1173 1 year ago
@@rajasdataengineering7585 Your course is the best. But the problem with your course is that you are not attaching the GitHub link for your sample data and code. I request you, as your audience, please do this. Thanks
@adithyabharadwaj5608 9 months ago
Can beginners learn these?
@hitvlogz 11 months ago
Simple. Clear . To the point stuff. Thanks. Love your series.
@rajasdataengineering7585 11 months ago
Glad you like them! Thanks for your comment
@figh761 4 months ago
@@rajasdataengineering7585 Sir, I would like to learn Databricks fully. Please guide me.
@rajasdataengineering7585 4 months ago
Pls go through all videos in this channel. You can learn databricks thoroughly
@merazshaik3504 2 years ago
This series and explanation are better than other channels, and I still don't know why this channel is not showing up in recommendations when we search for Databricks videos.
@rajasdataengineering7585 2 years ago
Thank you 👍🏻
@ravisaxena1599 2 years ago
I really appreciate the way you have explained the difference between in memory computation and using external system.
@rajasdataengineering7585 2 years ago
Thank you Ravi
@rudraganesh1507 1 year ago
Yes great explanation
@kushalthakkar5988 1 year ago
@@rajasdataengineering7585 Hi, could you share the PPT and Databricks files for the course?
@shivanaga3302 2 months ago
Best intro of spark which i've seen till now.......
@rajasdataengineering7585 2 months ago
Thank you
@vutv5742 1 year ago
Superb , Fantastic , Marvellous.....What a great teacher you are .
@rajasdataengineering7585 1 year ago
Thank you so much! Glad it helps
@vutv5742 1 year ago
Yes with clarity you have explained architecture and specially the partitioning with diagram was really helpful.@@rajasdataengineering7585
@charangowdamn8661 1 year ago
Hi sir, do you conduct any online courses?
@PrashanthPatil-br8vj 7 months ago
simple straight to the point absolute master class i was searching for this for long time no one taught it this easily thank you for this
@rajasdataengineering7585 7 months ago
Glad it helps! Thanks Prashanth, for your comment
@kumarashirwadmishra7414 1 year ago
Thanks Sir for the wonderful explanation and the in-depth knowledge of Spark architecture. A wonderful resource to start the Spark journey.
@rajasdataengineering7585 1 year ago
Thanks and welcome
@ankitachaturvedi1138 1 month ago
This interview series is really helpful. I haven't worked much on databricks but these videos are giving great insight of internal working & concepts. I am able to crack interviews. Thanks a lot for such informative videos!!
@rajasdataengineering7585 1 month ago
Glad to hear this! Keep watching
@prathapganesh7021 1 year ago
I have searched lots of videos regarding Spark architecture and working, but this video is awesome. I really appreciate this video; nice presentation, and I understand the complete concepts very clearly. Thank you so much🙏🙏
@rajasdataengineering7585 1 year ago
Glad it was helpful! Thanks for your comment
@sunilt1739 9 months ago
Thank you so much for putting such a great effort. I haven't gone thru all videos yet, but i can definitely imagine the hard work that you must have put behind this playlist.
@rajasdataengineering7585 9 months ago
Thank you so much!
@boseashish 8 months ago
you have put in a lots of effort and tried to cover all important points. thank you very much for your immense contributions
@rajasdataengineering7585 8 months ago
My pleasure! Thank you for your comment
@KarthikChavan-zs7iz 8 months ago
Just started but I love clear and simple explanation, thanks a lot for your efforts
@rajasdataengineering7585 8 months ago
You're very welcome! Glad it helped
@sangeethaezhumalai168 11 months ago
Appreciate your detailed explanation Sir... Really helpful
@rajasdataengineering7585 11 months ago
Glad it is helpful. Thanks for your comment
@vydudraksharam5960 1 year ago
Raja, this is excellent way of explanation .
@rajasdataengineering7585 1 year ago
Thank you, Vydu!
@Sreenivasan-cn5qv 1 year ago
for sure best video ever seen before... Raja Great Presentation
@rajasdataengineering7585 1 year ago
Glad you liked it! Thanks for your comment
@kasmitharam982 6 months ago
To the point and crisp yet detailed explanation, I've seen in a while, thank you so much!
@rajasdataengineering7585 6 months ago
Glad it was helpful!
@mayank_om 10 months ago
Excellent explanation across all the YouTube channels, thanks
@rajasdataengineering7585 10 months ago
Much appreciated! Thanks for your comment
@rk-ej9ep 23 days ago
Such a great info. Awesome..
@rajasdataengineering7585 23 days ago
Glad it was helpful! Keep watching
@debasishkalia135 3 months ago
this explanation is great , very detailed
@rajasdataengineering7585 3 months ago
Thank you!
@Moeistic 2 years ago
Very well explained Raja, thanks for making this series brother.
@rajasdataengineering7585 2 years ago
Thanks Nasser
@ashswinsubbiah3752 1 year ago
What an explanation, thank you so much sir.
@rajasdataengineering7585 1 year ago
You are most welcome
@harikareddy579 1 year ago
Amazing explanation sir, I am able to understand it very clearly
@rajasdataengineering7585 1 year ago
Thanks. Glad you enjoyed this content!
@bashaali1685 1 year ago
​@@rajasdataengineering7585 hi sir can i talk to you ..can i get ur contact num plzzz
@RamiReddy-y8r 1 year ago
your explanation very excellent
@rajasdataengineering7585 1 year ago
Glad it was helpful!
@ranyasri1092 4 months ago
Thanks a lot for the in-depth explanation😊
@rajasdataengineering7585 4 months ago
Hope it helps! Thanks and welcome
@karthikeyana6490 1 year ago
Just starting to watch your playlist with the hope to learn Spark, let's see how it goes. BTW thanks for the complete playlist mate!
@rajasdataengineering7585 1 year ago
Hope you enjoy it!
@MindBodyEvolutionTV 1 year ago
Thank you for great and fantastic master pieces
@rajasdataengineering7585 1 year ago
Thanks for listening!
@ShivamGupta-wn9mo 25 days ago
great playlist
@rajasdataengineering7585 25 days ago
Thank you
@karthiknani1503 1 year ago
Thankyou very much for the content Sir.
@rajasdataengineering7585 1 year ago
Glad it helps you gaining insight about spark internals!
@varun8952 2 years ago
Hi Raja, This is a great explanation. Appreciate your hard work.
@rajasdataengineering7585 2 years ago
Thank you!
@sowjanyagvs7780 3 months ago
when you mention referring the other videos, can you also keep mentioning those links in description. Thanks a lot for your explanation!!
@rajasdataengineering7585 3 months ago
Sure thing! Will add links
@saimounika6475 1 year ago
Great explanation sir
@rajasdataengineering7585 1 year ago
Thanks! Hope you find it helpful
@SystemTinu 2 years ago
Great video Raja!! Explained very well..Thanks
@rajasdataengineering7585 2 years ago
Thank you
@rahulmittal116 9 months ago
Excellent video
@rajasdataengineering7585 9 months ago
Thank you very much!
@HarshaVardhan-ox2zh 2 years ago
Thank you for making videos. Actually helped a lot.....
@rajasdataengineering7585 2 years ago
Thanks Harsha, for your comment!
@shusants 1 year ago
Could you please provide the slides used in all the lectures. This will be super useful. Thank you for this master pieces!!.
@adig8881 8 months ago
Watch in full screen, and take screenshots bro..
@dineshbvbv6479 1 year ago
Good explanation! keep up the good work.
@rajasdataengineering7585 1 year ago
Thank you
@sunitachoudhary6348 11 months ago
Very good course🎉
@rajasdataengineering7585 11 months ago
Glad you think so! Thanks
@naveenraj9977 1 year ago
Very good explanation. I watched all your playlists to gain knowledge about Spark and writing code as well. I hope you do more videos on Spark. I'm requesting you to upload videos with subtitles too so we can make notes of the entire session; please add subtitles to your old videos as well.
@rajasdataengineering7585 1 year ago
Thanks Naveen! Sure, will try to add subtitles
@naveenraj9977 1 year ago
@@rajasdataengineering7585 Thanks,that will be great
@vidhikumar1664 6 months ago
Great explanation.
@rajasdataengineering7585 6 months ago
Glad it was helpful!
@shekarsubramani9861 2 years ago
Hi Raja, I was very much confused with the architecture, once I saw your video ,now its clear, Keep up the good work
@rajasdataengineering7585 2 years ago
Thanks Shekar!
@AnandKumar-dc2bf 3 years ago
nice pictorial representations bro keep gng
@rajasdataengineering7585 3 years ago
Thanks Anand
@JindamSrilekha 10 months ago
Well Explained
@rajasdataengineering7585 10 months ago
Thanks
@vishalaaa1 1 year ago
Excellent
@rajasdataengineering7585 1 year ago
Thank you! Cheers!
@terrificmenace 2 years ago
Excellent video 😀 thank you
@alex45688 1 year ago
good explanation
@rajasdataengineering7585 1 year ago
Thanks for liking
@AnandKumar-dc2bf 3 years ago
Nice explanation...
@subhashyadav9262 4 months ago
Very Nice
@rajasdataengineering7585 4 months ago
Thanks
@tanushreenagar3116 5 months ago
perfect video sir
@rajasdataengineering7585 5 months ago
Thank you!
@shyamkumardhamode4475 1 year ago
Soooo good explanation
@rajasdataengineering7585 1 year ago
Glad it was helpful!
@arunshankar1987 5 months ago
Exactly what am looking for. Please let me know where I can find the datasets to practice.
@pridename2858 1 year ago
Yes, this is master piece. Thanks
@rajasdataengineering7585 1 year ago
Welcome, Glad you like it!
@pralgs628 9 months ago
Awesome!! Could you please attach the PPT for Each Video.. Thanks
@siddavatamvenugopalreddy9686 5 months ago
Are all these tutorials related to Spark only? Or do they include Databricks as well? Please confirm.
@rajasdataengineering7585 5 months ago
It's more on databricks which is internally using apache Spark
@divyamariyameldo6495 4 months ago
Thanks for the content!
@rajasdataengineering7585 4 months ago
My pleasure! Welcome
@abinashsenapati1880 10 months ago
It is really helpful. Thank you.. Where will I get the complete PPT of this playlist?
@sharunkumar4806 1 month ago
Hi Sir, Thank you for such excellent material. I do not have access to Azure Databricks at the moment. Can I still follow the complete playlist by installing PySpark on my local PC and practising in a Jupyter notebook?
@rajasdataengineering7585 1 month ago
Thank you. Yes you can practice in your local installation
@saravninja 2 years ago
Started your videos!! All are great
@rajasdataengineering7585 2 years ago
Thank you Ninja
@saravninja 2 years ago
@@rajasdataengineering7585 I went through complete video second by second. Video has lot of clarity than any other KZbin channel. Keep up good work!! Have you experienced data skew issue, if yes can you point video or do video for us.
@rajasdataengineering7585 2 years ago
Thank you for your kind words. It gives a motivation to create more videos which can help genuine knowledge seekers like you. For data skew, have posted one video (though it does not cover advanced concepts) kzbin.info/www/bejne/e4LLnZevgbyDras
@saravninja 2 years ago
@@rajasdataengineering7585 thanks a lot again Raja!! Will go through it. I am looking for airflow training, I have sent mail to you, kindly respond.
@krishnamurthy1243 4 months ago
Hi Raja ,please do azure synapse analytics,eagerly waiting
@rajasdataengineering7585 4 months ago
Sure Krishna, will create videos on synapse analytics
@mohammedmujahiduddin4715 1 year ago
Thank you Raja for the detailed explanation. Do we have any video which is focusing on Worker Node and its details ? And as you were about to make a video regarding the memory management details, please also share that or the video title if already present. Thank you so much in advance!
@rajasdataengineering7585 1 year ago
Please watch videos kzbin.info/www/bejne/mYXNeaKhpN1of9U kzbin.info/www/bejne/d2mToGyNfL1-las
@CoopmanGreg 2 years ago
Great Video! 👍
@rajasdataengineering7585 2 years ago
Thanks 👍🏻
@MichaelAdebayo-y4n 24 days ago
Would watching these videos be enough to help pass the Databricks Certified Associate Developer for Apache Spark 3.0 - Scala exam?
@rajasdataengineering7585 24 days ago
Yes most of the concepts are covered in this channel.
@ayanjit9196 1 year ago
Sir can you please give us the links of the notebooks used in this series. This has helped me and a lot of other people. Giving this link would be even more helpful 🙏🙏🙏
@100useful7 1 year ago
Yes, please
@manojru1 1 year ago
I have installed Jupyter with Pyspark...where should I run my command to see the Spark job like you are showing on 38:21sec? or should I install some other IDE for that?
@ArupSankarRoy 2 months ago
Mistake 14:15 Partition Size should be 100 mb
@vinothloganathan2623 8 months ago
Hi Raja, One of the most amazing explanations. I couldn't find this level of detail in any other source like books, Medium and other YouTube channels. Amazing work!! Could you share any resources that helped you with Spark?
@rajasdataengineering7585 8 months ago
Thank you! I don't have any other resources. I summarised these concepts based on my working experience
@sudippandit9855 3 years ago
excellent explanation!!
@rajasdataengineering7585 3 years ago
Thank you Sudip
@lanyofrancis1195 22 days ago
Thanks for the videos and nice explanation. I have a question. The default partition size is 128 MB, so for a 2 GB RDD file, the number of partitions created will be 16. Here in your example, are you changing the partition size to 10 MB instead of the default 128 MB? Please correct me if I am missing anything.
@lakshmidvs3258 1 month ago
Hi Raja, very nice explanation ... Are you taking classes
@rajasdataengineering7585 1 month ago
Hi Lakshmi, thanks for your comment! I'm not taking classes
@lakshmidvs3258 1 month ago
Hi Raja , Thank you so much for your quick reply . I want to write data bricks certification . It's show two exams 1.azure data bricks 2 . Databricks . For which certification I should go is there any difference in those two in getting job opportunities. I have experience as Data Engineer, now I want to do certification .
@rajasdataengineering7585 1 month ago
Both are same. You can go with databricks certified data engineering professional
@oluakano6497 2 years ago
Hi, I am new to Spark and your videos seem like a great resource to learn. I am wondering what is the best order to watch them? Through the playlist or just use the numbers like 1, 2, 3...
@rajasdataengineering7585 2 years ago
Hi, for all videos, I have given serial number. You can follow the order based on that serial number
@jayaprakashm2849 2 years ago
Nice info
@kneelakanta8137 2 years ago
can I have document for reference of this playlist
@ElhamMirshekari 2 years ago
Raja could you kindly make a video on these three functions and compare them: Join, Union, Concat
@rajasdataengineering7585 2 years ago
Hi Ellie, I have already created videos for join and union. Will make a video for concat as per your request. Join : kzbin.info/www/bejne/pHuqm3mDhaefisk Union: kzbin.info/www/bejne/fIW3fYB4gc6tjJo hope it helps you
@ranganathhittanagi3315 1 month ago
11:40 Why is the 2 GB file divided into 200 partitions by default? Shouldn't the 2 GB file size be divided by 128 MB (block size), which gives 16 partitions?
@RAVINDRAsap 1 year ago
could you create CI CD for databricks please
@codeslayer4713 1 year ago
Hi Raja, I have a question here. In terms of partitions, when we load a file of 2 GB, the 128 MB partition-bytes setting makes the initial partition count 16, with the logic 2*1024 / 128, right? And the partition property with the value 200: isn't it that the 200 partitions apply only if there is a shuffle operation, not while reading?
@rajasdataengineering7585 1 year ago
Yes, that's shuffling partition parameter which is not applicable for reading partition
@codeslayer4713 1 year ago
So you mentioned 200 in your example that's why my doubt arose
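Editor's note: the arithmetic discussed in this thread can be sketched in a few lines of plain Python. This is a rough illustration, not how Spark computes partitions internally. It assumes the two defaults the commenters appear to mean, `spark.sql.files.maxPartitionBytes` (128 MB) for reads and `spark.sql.shuffle.partitions` (200) for shuffles. The helper name `estimate_read_partitions` is hypothetical, and real partition counts may differ because Spark also accounts for file open cost, cluster parallelism, and whether the file is splittable.

```python
MB = 1024 * 1024
GB = 1024 * MB

# Assumed defaults, named after the Spark settings they mirror.
MAX_PARTITION_BYTES = 128 * MB   # spark.sql.files.maxPartitionBytes
SHUFFLE_PARTITIONS = 200         # spark.sql.shuffle.partitions


def estimate_read_partitions(file_size_bytes, max_partition_bytes=MAX_PARTITION_BYTES):
    """Rough count of input partitions: ceiling of file size / partition size."""
    if file_size_bytes <= 0:
        return 0
    # Integer ceiling division avoids floating-point rounding.
    return -(-file_size_bytes // max_partition_bytes)


read_partitions = estimate_read_partitions(2 * GB)
print("partitions when reading:", read_partitions)
print("partitions after a shuffle:", SHUFFLE_PARTITIONS)
```

Under these assumptions a 2 GB file gives 2 * 1024 / 128 = 16 read partitions, while the figure of 200 only comes into play once a wide transformation triggers a shuffle.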
@neelbanerjee7875 2 years ago
Thank you very much for such content. One request: can you please make a video on real-time executor number, core, and memory allocation based on input data size, like 1. 1-5 GB 2. 5-15 GB 3. 15-25 GB 4. 25-50 GB 5. > 50 GB = 1 TB
@rajasdataengineering7585 2 years ago
Sure Neel, will make a video on this requirement
@mohitupadhayay1439 1 year ago
@@rajasdataengineering7585 Waiting for this!
@dipanjanpan1 5 months ago
Can we create multiple executor node on a worker node?
@rajasdataengineering7585 5 months ago
Yes we can. Executor is logical division of computing resources
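Editor's note: since an executor is a logical slice of a worker node's resources, the number that fits on one node is bounded by whichever resource runs out first. The sketch below is a simplified illustration in plain Python. The function name `executors_per_node` and the node sizes are hypothetical, and it ignores memory overhead and the cores and memory usually reserved for the operating system. Managed platforms may also fix the executor layout for you.

```python
def executors_per_node(node_cores, node_memory_gb,
                       cores_per_executor, memory_per_executor_gb):
    """How many executors of a given size fit on one worker node."""
    if cores_per_executor <= 0 or memory_per_executor_gb <= 0:
        raise ValueError("executor size must be positive")
    by_cores = node_cores // cores_per_executor
    by_memory = node_memory_gb // memory_per_executor_gb
    # The scarcer resource decides the count.
    return min(by_cores, by_memory)


# Hypothetical worker: 16 cores and 64 GB, split into 4-core / 16 GB executors.
count = executors_per_node(16, 64, 4, 16)
print("executors per node:", count)
```

With these example numbers both cores and memory allow four executors; halving the node memory to 32 GB would drop the count to two even though the cores are unchanged.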
@BlingKing321 1 year ago
In order to store data in JVM memory we need to do serialization and deserialization. Why ?
@shamsmalek 4 months ago
Excellent job. Can you please provide me the data set and code? Or please give me the Git link to download the dataset and code for your tutorials. Thanks.
@srikanthbachina7764 1 year ago
HI Raj, Videos are Missing from 27 to 30 Could you Please Upload them.
@rajasdataengineering7585 1 year ago
Hi Srikanth, those 4 videos are related to Azure Synapse analytics. Its still available under all videos section
@arupnaskar3818 2 years ago
Hi Raja u r teaching is awesome .... really help .. sir, just wanted to know .. for "STAGING" here u mentioned about "Nodes" .. here "Nodes" means No of worker Nodes or Partitions ??
@rajasdataengineering7585 2 years ago
Thank you Arup. Node means no of worker nodes in the cluster
@arupnaskar3818 2 years ago
@@rajasdataengineering7585 thanks raja ..💐🌸
@rudraganesh1507 1 year ago
This is the masterpiece
@rajasdataengineering7585 1 year ago
Thanks Ajay
@البداية-ذ1ذ 2 years ago
please could you make videos in examples about pyspark in real projects
@rajasdataengineering7585 2 years ago
Sure, will make videos on real time projects
@البداية-ذ1ذ 2 years ago
Thanks alot
@saifahmed7843 12 days ago
Greetings, Is this sufficient for Databricks Certified Associate Developer exam? Appreciate any clarity.
@rajasdataengineering7585 12 days ago
Yes, I have covered almost all the topics. If you understand all the concepts I explained in this channel, that's more than sufficient
@saifahmed7843 11 days ago
@@rajasdataengineering7585 Thank you so much for the info.
@rajasdataengineering7585 11 days ago
Welcome
@dinesh_tadepalli 4 months ago
Are any prerequisites required to this pyspark series?
@rajasdataengineering7585 4 months ago
No, nothing needed. I have covered from basic
@TelugupodcasterDT 4 months ago
Thank you!
@saikrishna1939 2 years ago
Can you please guide me how to start your videos I mean the order I can see many playlists in the channel. I want to learn spark and data bricks
@rajasdataengineering7585 2 years ago
Sure bro, let me give serial number to my videos so that you can follow the structured learning list
@saikrishna1939 2 years ago
@@rajasdataengineering7585 yeah thanks, also please comment here that which playlists we shld follow for the order to learn spark and data bricks 🙂
@rajasdataengineering7585 2 years ago
Sure
@rajasdataengineering7585 2 years ago
@@saikrishna1939 The videos are given with serial number. You can follow with that sequence
@saikrishna1939 2 years ago
@@rajasdataengineering7585 yes but which playlist to follow as there are 7 playlists in the channel so it's Lil confusion. Say for example a playlist has 5 videos but when I open it I can see 17, 18, 19, 20 in the videos. In interview series it starts with 1 again
@ElhamMirshekari 2 years ago
Is the master node same as cluster manager? or they are two different concepts?
@rajasdataengineering7585 2 years ago
Master node is driver and different from cluster manager
@ElhamMirshekari 2 years ago
@@rajasdataengineering7585 Thanks for the prompt response . Master node = Driver
@ParthKhambhayta-dj9te 6 months ago
Sir, you said that when reading a CSV file it is divided into 200 partitions by default, but the default block size is 128 MB, so it should be divided into 16 partitions. Please let me know whether I am correct.
@anilmnt82 3 months ago
I do see it as a more detailed explanation of Spark, but not really of Databricks; it is missing many Databricks features like Unity Catalog, DBFS, VACUUM, Liquid Clustering, etc.
@aashishmalhotra 1 year ago
amazing
@rajasdataengineering7585 1 year ago
Thank you! Cheers!
@morgann4276 2 years ago
Great video raja!! Wanted to know how you have such in depth knowledge.. did you learn from spark docs ?
@rajasdataengineering7585 2 years ago
Thanks Morgan. Yes spark documents and working experience helped me to understand concepts
@itsallinyourhead3593 2 years ago
Hi Raja, How do we set the number of executors in azure databricks ? like in this example the worker node is divided into 4 executors. Thanks in advance!
@rajasdataengineering7585 2 years ago
Hi, number of executors can be controlled using spark config parameter "spark.executor.instances". Number of cores per executor can be set by spark.executor.cores. Hope it helps
@itsallinyourhead3593 2 years ago
@@rajasdataengineering7585 thank you, so these parameters have to set while cluster creation meaning are these parameters at cluster level or can be changed/set by developers during etl/data processing?
@rajasdataengineering7585 2 years ago
It can be set at cluster level using init scripts or at notebook level using the syntax spark.conf.set()
@itsallinyourhead3593 2 years ago
@@rajasdataengineering7585 thank you 🙏
@rajasdataengineering7585 2 years ago
Welcome
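Editor's note: the settings named in this thread (`spark.executor.instances`, `spark.executor.cores`) are ordinary Spark configuration keys, and `spark.executor.memory` is the matching key for executor memory. As far as I understand, executor sizing is generally read when the application or cluster starts, so it is safest to supply these values up front instead of changing them mid-session. The sketch below is plain Python and assumes a `spark-submit` style launch. The helper `to_submit_flags` and the values 4, 4, and 16g are hypothetical and only show how such settings could be rendered as `--conf key=value` arguments.

```python
def to_submit_flags(conf):
    """Render a config dict as a flat list of --conf key=value arguments."""
    flags = []
    for key in sorted(conf):
        flags.extend(["--conf", f"{key}={conf[key]}"])
    return flags


# Example values only; choose sizes that match your own cluster.
executor_conf = {
    "spark.executor.instances": "4",
    "spark.executor.cores": "4",
    "spark.executor.memory": "16g",
}

flags = to_submit_flags(executor_conf)
print(" ".join(["spark-submit"] + flags + ["my_job.py"]))  # my_job.py is a placeholder
```

On a managed cluster the same key/value pairs would typically go into the cluster's Spark configuration instead of a command line.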
@VipinYadav-ii1ow 5 months ago
Just starting to learn Spark and Databricks. Is this resource enough to crack an entry-level data engineering job?
@rajasdataengineering7585 5 months ago
Yes definitely, these videos are more than enough to crack entry level job
@Arvind-sr6ze 2 years ago
I need to prepare for the Apache Spark programming with Databricks certification. Will these videos help me?
@rajasdataengineering7585 2 years ago
Yes, it will help
@Arun-uw1hy 1 year ago
Hi, here the executor mean processor (CPU) , Right? Because each node may have multiple CPU, and also each CPU can have multiple cores. So each cores has a separate executor.
@rajasdataengineering7585 1 year ago
Executor means logical division of nodes. That means combination of processor + memory + network
@mannykhan7752 10 months ago
Yun number of worker nodes?? What is Yun or yum??
@shivayogihiremath4785 1 year ago
I'm following this channel from couple of days now. The content and way of explanation is awesome. Good job my friend. keep up the good work. wishing you all the very best. one small suggestion, if possible, please try to avoid the initial music (which is played at the beginning of the video) at times it is annoying. thank you!
@rajasdataengineering7585 1 year ago
Hi Shiv, thank you for your valuable comments. I already removed this initial music. May be it is still there for only few initial videos.