Not sure why your channel does not show up when searching for PySpark tutorials. I spoke to a developer on LinkedIn and he suggested your channel. Great work, thank you Sir!
@rajasdataengineering7585 1 year ago
Glad to hear it helps you! Thanks for visiting my channel
@jatinsethi6415 1 month ago
Thanks for the videos. Your videos really helped me to switch my job. Thanks for the great content. Your explanation is awesome. Thanks again.
@rajasdataengineering7585 1 month ago
Great to hear! Thank you so much!
@abhinavsingh2894 2 years ago
This is an absolute masterpiece on the introduction of Spark and all its internal structure. Thank you for such a detailed video.
@rajasdataengineering7585 2 years ago
Thank you Abhinav👍🏻
@abhinavsingh1173 1 year ago
@@rajasdataengineering7585 Your course is the best. But the problem with your course is that you are not attaching the GitHub link for your sample data and code. I request you, as your audience, please do this. Thanks
@adithyabharadwaj5608 9 months ago
Can beginners learn these?
@hitvlogz 11 months ago
Simple. Clear. To the point stuff. Thanks. Love your series.
@rajasdataengineering7585 11 months ago
Glad you like them! Thanks for your comment
@figh761 4 months ago
@@rajasdataengineering7585 Sir, I would like to learn Databricks fully. Please guide me
@rajasdataengineering7585 4 months ago
Please go through all the videos in this channel. You can learn Databricks thoroughly
@merazshaik3504 2 years ago
This series and its explanations are better than other channels, and I still don't know why this channel is not showing up in recommendations when we search for Databricks videos.
@rajasdataengineering7585 2 years ago
Thank you 👍🏻
@ravisaxena1599 2 years ago
I really appreciate the way you have explained the difference between in-memory computation and using an external system.
@rajasdataengineering7585 2 years ago
Thank you Ravi
@rudraganesh1507 1 year ago
Yes great explanation
@kushalthakkar5988 1 year ago
@@rajasdataengineering7585 Hi, could you share the PPT and Databricks files of the course?
@shivanaga3302 2 months ago
Best intro to Spark that I've seen till now.......
@rajasdataengineering7585 2 months ago
Thank you
@vutv5742 1 year ago
Superb, Fantastic, Marvellous..... What a great teacher you are.
@rajasdataengineering7585 1 year ago
Thank you so much! Glad it helps
@vutv5742 1 year ago
Yes, you have explained the architecture with clarity, and the partitioning with the diagram was especially helpful. @@rajasdataengineering7585
@charangowdamn8661 1 year ago
Hi sir, do you conduct any online courses?
@PrashanthPatil-br8vj 7 months ago
Simple, straight to the point, absolute master class. I was searching for this for a long time and no one taught it this easily. Thank you for this
@rajasdataengineering7585 7 months ago
Glad it helps! Thanks Prashanth, for your comment
@kumarashirwadmishra7414 1 year ago
Thanks Sir for the wonderful explanation and the in-depth knowledge of Spark architecture. A wonderful resource to start the Spark journey.
@rajasdataengineering7585 1 year ago
Thanks and welcome
@ankitachaturvedi1138 1 month ago
This interview series is really helpful. I haven't worked much on Databricks but these videos are giving great insight into the internal workings & concepts. I am able to crack interviews. Thanks a lot for such informative videos!!
@rajasdataengineering7585 1 month ago
Glad to hear this! Keep watching
@prathapganesh7021 1 year ago
I have searched lots of videos regarding Spark architecture and how it works, but this video is awesome. I really appreciate this video, nice presentation, and I understood the complete concepts very clearly. Thank you so much🙏🙏
@rajasdataengineering7585 1 year ago
Glad it was helpful! Thanks for your comment
@sunilt1739 9 months ago
Thank you so much for putting in such a great effort. I haven't gone through all the videos yet, but I can definitely imagine the hard work that you must have put behind this playlist.
@rajasdataengineering7585 9 months ago
Thank you so much!
@boseashish 8 months ago
You have put in a lot of effort and tried to cover all the important points. Thank you very much for your immense contributions
@rajasdataengineering7585 8 months ago
My pleasure! Thank you for your comment
@KarthikChavan-zs7iz 8 months ago
Just started but I love the clear and simple explanation, thanks a lot for your efforts
@rajasdataengineering7585 8 months ago
You're very welcome! Glad it helped
@sangeethaezhumalai168 11 months ago
Appreciate your detailed explanation Sir... Really helpful
@rajasdataengineering7585 11 months ago
Glad it is helpful. Thanks for your comment
@vydudraksharam5960 1 year ago
Raja, this is an excellent way of explaining.
@rajasdataengineering7585 1 year ago
Thank you, Vydu!
@Sreenivasan-cn5qv 1 year ago
For sure the best video I have ever seen... Raja, great presentation
@rajasdataengineering7585 1 year ago
Glad you liked it! Thanks for your comment
@kasmitharam982 6 months ago
The most to-the-point and crisp yet detailed explanation I've seen in a while, thank you so much!
@rajasdataengineering7585 6 months ago
Glad it was helpful!
@mayank_om 10 months ago
Excellent explanation, the best across all the YouTube channels. Thanks
@rajasdataengineering7585 10 months ago
Much appreciated! Thanks for your comment
@rk-ej9ep 23 days ago
Such great info. Awesome..
@rajasdataengineering7585 23 days ago
Glad it was helpful! Keep watching
@debasishkalia135 3 months ago
This explanation is great, very detailed
@rajasdataengineering7585 3 months ago
Thank you!
@Moeistic 2 years ago
Very well explained Raja, thanks for making this series brother.
@rajasdataengineering7585 2 years ago
Thanks Nasser
@ashswinsubbiah3752 1 year ago
What an explanation, thank you so much sir.
@rajasdataengineering7585 1 year ago
You are most welcome
@harikareddy579 1 year ago
Amazing explanation sir, I am able to understand it very clearly
@rajasdataengineering7585 1 year ago
Thanks. Glad you enjoyed this content!
@bashaali1685 1 year ago
@@rajasdataengineering7585 Hi sir, can I talk to you? Can I get your contact number please?
@RamiReddy-y8r 1 year ago
Your explanation is excellent
@rajasdataengineering7585 1 year ago
Glad it was helpful!
@ranyasri1092 4 months ago
Thanks a lot for the in-depth explanation😊
@rajasdataengineering7585 4 months ago
Hope it helps! Thanks and welcome
@karthikeyana6490 1 year ago
Just starting to watch your playlist with the hope to learn Spark, let's see how it goes. BTW thanks for the complete playlist mate!
@rajasdataengineering7585 1 year ago
Hope you enjoy it!
@MindBodyEvolutionTV 1 year ago
Thank you for the great and fantastic masterpieces
@rajasdataengineering7585 1 year ago
Thanks for listening!
@ShivamGupta-wn9mo 25 days ago
great playlist
@rajasdataengineering7585 25 days ago
Thank you
@karthiknani1503 1 year ago
Thank you very much for the content Sir.
@rajasdataengineering7585 1 year ago
Glad it helps you gain insight into Spark internals!
@varun8952 2 years ago
Hi Raja, This is a great explanation. Appreciate your hard work.
@rajasdataengineering7585 2 years ago
Thank you!
@sowjanyagvs7780 3 months ago
When you mention referring to the other videos, can you also add those links in the description? Thanks a lot for your explanation!!
@rajasdataengineering7585 3 months ago
Sure thing! Will add links
@saimounika6475 1 year ago
Great explanation sir
@rajasdataengineering7585 1 year ago
Thanks! Hope you find it helpful
@SystemTinu 2 years ago
Great video Raja!! Explained very well..Thanks
@rajasdataengineering7585 2 years ago
Thank you
@rahulmittal116 9 months ago
Excellent video
@rajasdataengineering7585 9 months ago
Thank you very much!
@HarshaVardhan-ox2zh 2 years ago
Thank you for making these videos. They actually helped a lot.....
@rajasdataengineering7585 2 years ago
Thanks Harsha, for your comment!
@shusants 1 year ago
Could you please provide the slides used in all the lectures? This will be super useful. Thank you for these masterpieces!!
@adig8881 8 months ago
Watch in full screen, and take screenshots bro..
@dineshbvbv6479 1 year ago
Good explanation! Keep up the good work.
@rajasdataengineering7585 1 year ago
Thank you
@sunitachoudhary6348 11 months ago
Very good course🎉
@rajasdataengineering7585 11 months ago
Glad you think so! Thanks
@naveenraj9977 1 year ago
Very good explanation. I watched your whole playlist and gained knowledge about Spark and writing code as well. I hope you do more videos on Spark. I'm requesting you to upload videos with subtitles too so we can make notes of the entire session. Please add subtitles to your old videos too.
@rajasdataengineering7585 1 year ago
Thanks Naveen! Sure, will try to add subtitles
@naveenraj9977 1 year ago
@@rajasdataengineering7585 Thanks, that will be great
@vidhikumar1664 6 months ago
Great explanation.
@rajasdataengineering7585 6 months ago
Glad it was helpful!
@shekarsubramani9861 2 years ago
Hi Raja, I was very much confused about the architecture. Once I saw your video, it's now clear. Keep up the good work
@rajasdataengineering7585 2 years ago
Thanks Shekar!
@AnandKumar-dc2bf 3 years ago
Nice pictorial representations bro, keep going
@rajasdataengineering7585 3 years ago
Thanks Anand
@JindamSrilekha 10 months ago
Well Explained
@rajasdataengineering7585 10 months ago
Thanks
@vishalaaa1 1 year ago
Excellent
@rajasdataengineering7585 1 year ago
Thank you! Cheers!
@terrificmenace 2 years ago
Excellent video 😀 thank you
@alex45688 1 year ago
good explanation
@rajasdataengineering7585 1 year ago
Thanks for liking
@AnandKumar-dc2bf 3 years ago
Nice explanation...
@subhashyadav9262 4 months ago
Very Nice
@rajasdataengineering7585 4 months ago
Thanks
@tanushreenagar3116 5 months ago
perfect video sir
@rajasdataengineering7585 5 months ago
Thank you!
@shyamkumardhamode4475 1 year ago
Soooo good explanation
@rajasdataengineering7585 1 year ago
Glad it was helpful!
@arunshankar1987 5 months ago
Exactly what I am looking for. Please let me know where I can find the datasets to practice.
@pridename2858 1 year ago
Yes, this is a masterpiece. Thanks
@rajasdataengineering7585 1 year ago
Welcome, Glad you like it!
@pralgs628 9 months ago
Awesome!! Could you please attach the PPT for each video? Thanks
@siddavatamvenugopalreddy9686 5 months ago
Are all these tutorials related to Spark only? Or do they include Databricks as well? Please confirm
@rajasdataengineering7585 5 months ago
It's more on Databricks, which internally uses Apache Spark
@divyamariyameldo6495 4 months ago
Thanks for the content!
@rajasdataengineering7585 4 months ago
My pleasure! Welcome
@abinashsenapati1880 10 months ago
It is really helpful. Thank you.. Where can I get the complete PPT for this playlist?
@sharunkumar4806 1 month ago
Hi Sir, thank you for such excellent material. I do not have access to Azure Databricks at the moment. Can I still follow the complete playlist by installing PySpark on my local PC and practising in a Jupyter notebook?
@rajasdataengineering7585 1 month ago
Thank you. Yes you can practice in your local installation
@saravninja 2 years ago
Started your videos!! All are great
@rajasdataengineering7585 2 years ago
Thank you Ninja
@saravninja 2 years ago
@@rajasdataengineering7585 I went through the complete video second by second. The video has a lot more clarity than any other YouTube channel. Keep up the good work!! Have you experienced a data skew issue? If yes, can you point to a video or do a video for us?
@rajasdataengineering7585 2 years ago
Thank you for your kind words. It gives me motivation to create more videos which can help genuine knowledge seekers like you. For data skew, I have posted one video (though it does not cover advanced concepts) kzbin.info/www/bejne/e4LLnZevgbyDras
@saravninja 2 years ago
@@rajasdataengineering7585 thanks a lot again Raja!! Will go through it. I am looking for airflow training, I have sent mail to you, kindly respond.
@krishnamurthy1243 4 months ago
Hi Raja, please do Azure Synapse Analytics, eagerly waiting
@rajasdataengineering7585 4 months ago
Sure Krishna, will create videos on Synapse Analytics
@mohammedmujahiduddin4715 1 year ago
Thank you Raja for the detailed explanation. Do we have any video which focuses on the worker node and its details? And as you were about to make a video regarding the memory management details, please also share that or the video title if already present. Thank you so much in advance!
Would watching these videos be enough to help pass the Databricks Certified Associate Developer for Apache Spark 3.0 - Scala exam?
@rajasdataengineering7585 24 days ago
Yes most of the concepts are covered in this channel.
@ayanjit9196 1 year ago
Sir, can you please give us the links to the notebooks used in this series? This has helped me and a lot of other people. Giving this link would be even more helpful 🙏🙏🙏
@100useful7 1 year ago
Yes, please
@manojru1 1 year ago
I have installed Jupyter with PySpark... where should I run my command to see the Spark job like you are showing at 38:21? Or should I install some other IDE for that?
@ArupSankarRoy 2 months ago
Mistake at 14:15: partition size should be 100 MB
@vinothloganathan2623 8 months ago
Hi Raja, one of the most amazing explanations. I couldn't find this level of detail in any other source like books, Medium and other YouTube channels. Amazing work!! Could you share any resources that helped you with Spark?
@rajasdataengineering7585 8 months ago
Thank you! I don't have any other resources. I summarised these concepts based on my working experience
@sudippandit9855 3 years ago
excellent explanation!!
@rajasdataengineering7585 3 years ago
Thank you Sudip
@lanyofrancis1195 22 days ago
Thanks for the videos and the nice explanation. I have a question. The default partition size is 128 MB, so for a 2 GB RDD file, the number of partitions created will be 16. In your example, are you changing the partition size to 10 MB instead of the default 128 MB? Please correct me if I am missing anything.
@lakshmidvs3258 1 month ago
Hi Raja, very nice explanation... Are you taking classes?
@rajasdataengineering7585 1 month ago
Hi Lakshmi, thanks for your comment! I'm not taking classes
@lakshmidvs3258 1 month ago
Hi Raja, thank you so much for your quick reply. I want to take a Databricks certification. It shows two exams: 1. Azure Databricks 2. Databricks. Which certification should I go for? Is there any difference between the two in terms of job opportunities? I have experience as a Data Engineer, and now I want to do a certification.
@rajasdataengineering7585 1 month ago
Both are the same. You can go with the Databricks Certified Data Engineer Professional
@oluakano6497 2 years ago
Hi, I am new to Spark and your videos seem like a great resource to learn. I am wondering what is the best order to watch them? Through the playlist or just use the numbers like 1, 2, 3...
@rajasdataengineering7585 2 years ago
Hi, I have given a serial number to all videos. You can follow the order based on that serial number
@jayaprakashm2849 2 years ago
Nice info
@kneelakanta8137 2 years ago
Can I have a document for reference for this playlist?
@ElhamMirshekari 2 years ago
Raja, could you kindly make a video on these three functions and compare them: Join, Union, Concat
@rajasdataengineering7585 2 years ago
Hi Ellie, I have already created videos for join and union. Will make a video for concat as per your request. Join : kzbin.info/www/bejne/pHuqm3mDhaefisk Union: kzbin.info/www/bejne/fIW3fYB4gc6tjJo Hope it helps you
@ranganathhittanagi3315 1 month ago
11:40 Why is the 2 GB file divided into 200 partitions by default? Shouldn't the 2 GB file size be divided by 128 MB (block size), which gives 16 partitions?
@RAVINDRAsap 1 year ago
Could you create CI/CD for Databricks please
@codeslayer4713 1 year ago
Hi Raja, I have a question here. In terms of partitions, when we load a file of 2 GB, the maxPartitionBytes of 128 MB makes the initial partitions 16, with the logic 2*1024 / 128, right? And the other partition property has the number 200, but isn't it that 200 partitions will be there only if there is a shuffle operation, and not while reading?
@rajasdataengineering7585 1 year ago
Yes, that's the shuffle partition parameter, which is not applicable to read partitions
@codeslayer4713 1 year ago
So you mentioned 200 in your example, that's why my doubt arose
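For anyone following the arithmetic in this thread, here is a rough sketch in plain Python (no Spark needed). It assumes a single splittable 2 GB file and the default `spark.sql.files.maxPartitionBytes` of 128 MB. The helper name `estimate_read_partitions` is made up for illustration, and real Spark also factors in file open cost and the number of available cores, so treat the result as an estimate only. The 200 figure is the default of `spark.sql.shuffle.partitions`, which applies after a shuffle and not while reading.

```python
MB = 1024 * 1024
DEFAULT_MAX_PARTITION_BYTES = 128 * MB  # default of spark.sql.files.maxPartitionBytes
DEFAULT_SHUFFLE_PARTITIONS = 200        # default of spark.sql.shuffle.partitions (shuffle only)


def estimate_read_partitions(file_size_bytes, max_partition_bytes=DEFAULT_MAX_PARTITION_BYTES):
    """Hypothetical helper: rough partition count when reading one splittable file."""
    if file_size_bytes <= 0:
        return 0
    # Integer ceiling division: a partial last chunk still needs its own partition.
    return -(-file_size_bytes // max_partition_bytes)


two_gb = 2 * 1024 * MB
print(estimate_read_partitions(two_gb))           # 2048 MB split into 128 MB chunks
print(estimate_read_partitions(two_gb, 10 * MB))  # same file with an assumed 10 MB partition size
```

With a 10 MB partition size the count lands close to 200, which may be where the number in the example comes from, but that is only a guess.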
@neelbanerjee7875 2 years ago
Thank you very much for such content.. One request: can you please make a video on real-time executor number, core, and memory allocation based on input data size, like 1. 1-5 GB 2. 5-15 GB 3. 15-25 GB 4. 25-50 GB 5. > 50 GB = 1 TB
@rajasdataengineering7585 2 years ago
Sure Neel, will make a video on this requirement
@mohitupadhayay1439 1 year ago
@@rajasdataengineering7585 Waiting for this!
@dipanjanpan1 5 months ago
Can we create multiple executors on a worker node?
@rajasdataengineering7585 5 months ago
Yes we can. An executor is a logical division of computing resources
@BlingKing321 1 year ago
In order to store data in JVM memory we need to do serialization and deserialization. Why?
@shamsmalek 4 months ago
Excellent job. Can you please provide me the data set and code? Or please give me the Git link to download the dataset and code for your tutorials. Thanks.
@srikanthbachina7764 1 year ago
Hi Raj, videos 27 to 30 are missing. Could you please upload them?
@rajasdataengineering7585 1 year ago
Hi Srikanth, those 4 videos are related to Azure Synapse Analytics. They are still available under the all videos section
@arupnaskar3818 2 years ago
Hi Raja, your teaching is awesome.... really helps.. Sir, just wanted to know.. for "STAGING" here you mentioned "Nodes".. here does "Nodes" mean the number of worker nodes or partitions??
@rajasdataengineering7585 2 years ago
Thank you Arup. Node means the number of worker nodes in the cluster
@arupnaskar3818 2 years ago
@@rajasdataengineering7585 thanks raja ..💐🌸
@rudraganesh1507 1 year ago
This is the masterpiece
@rajasdataengineering7585 1 year ago
Thanks Ajay
@البداية-ذ1ذ 2 years ago
Please could you make videos with examples of PySpark in real projects
@rajasdataengineering7585 2 years ago
Sure, will make videos on real-time projects
@البداية-ذ1ذ 2 years ago
Thanks a lot
@saifahmed7843 12 days ago
Greetings, Is this sufficient for Databricks Certified Associate Developer exam? Appreciate any clarity.
@rajasdataengineering7585 12 days ago
Yes, I have covered almost all the topics. If you understand all the concepts I explained in this channel, that's more than sufficient
@saifahmed7843 11 days ago
@@rajasdataengineering7585 Thank you so much for the info.
@rajasdataengineering7585 11 days ago
Welcome
@dinesh_tadepalli 4 months ago
Are any prerequisites required for this PySpark series?
@rajasdataengineering7585 4 months ago
No, nothing needed. I have covered it from the basics
@TelugupodcasterDT 4 months ago
Thank you!
@saikrishna1939 2 years ago
Can you please guide me on how to start your videos? I mean the order, as I can see many playlists in the channel. I want to learn Spark and Databricks
@rajasdataengineering7585 2 years ago
Sure bro, let me give serial numbers to my videos so that you can follow the structured learning list
@saikrishna1939 2 years ago
@@rajasdataengineering7585 yeah thanks, also please comment here which playlists we should follow, in order, to learn Spark and Databricks 🙂
@rajasdataengineering7585 2 years ago
Sure
@rajasdataengineering7585 2 years ago
@@saikrishna1939 The videos are given with serial numbers. You can follow that sequence
@saikrishna1939 2 years ago
@@rajasdataengineering7585 yes, but which playlist to follow? There are 7 playlists in the channel so it's a little confusing. Say for example a playlist has 5 videos, but when I open it I can see 17, 18, 19, 20 in the videos. In the interview series it starts with 1 again
@ElhamMirshekari 2 years ago
Is the master node the same as the cluster manager? Or are they two different concepts?
@rajasdataengineering7585 2 years ago
The master node is the driver and is different from the cluster manager
@ElhamMirshekari 2 years ago
@@rajasdataengineering7585 Thanks for the prompt response. Master node = Driver
@ParthKhambhayta-dj9te 6 months ago
Sir, you said that when reading a CSV file it is divided into 200 partitions by default, but the default block size is 128 MB, so it should be divided into 16 partitions. Please let me know if I am correct or not.
@anilmnt82 3 months ago
I do see it as a more detailed explanation of Spark, but not really of Databricks. It is missing many Databricks features like Unity Catalog, DBFS, Vacuum, Liquid Clustering etc.
@aashishmalhotra 1 year ago
amazing
@rajasdataengineering7585 1 year ago
Thank you! Cheers!
@morgann4276 2 years ago
Great video Raja!! Wanted to know how you have such in-depth knowledge.. Did you learn from the Spark docs?
@rajasdataengineering7585 2 years ago
Thanks Morgan. Yes, the Spark documentation and working experience helped me to understand the concepts
@itsallinyourhead3593 2 years ago
Hi Raja, how do we set the number of executors in Azure Databricks? Like in this example the worker node is divided into 4 executors. Thanks in advance!
@rajasdataengineering7585 2 years ago
Hi, the number of executors can be controlled using the Spark config parameter "spark.executor.instances". The number of cores per executor can be set by spark.executor.cores. Hope it helps
@itsallinyourhead3593 2 years ago
@@rajasdataengineering7585 Thank you. So do these parameters have to be set during cluster creation, meaning are they cluster-level parameters, or can they be changed/set by developers during ETL/data processing?
@rajasdataengineering7585 2 years ago
It can be set at cluster level using init scripts or at notebook level using the syntax spark.conf.set()
@itsallinyourhead3593 2 years ago
@@rajasdataengineering7585 thank you 🙏
@rajasdataengineering7585 2 years ago
Welcome
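To make the idea of dividing a worker node into executors concrete, here is a small plain-Python sketch of the arithmetic. The node size (16 cores, 64 GB) and the helper name `plan_executors` are assumptions for illustration only. `spark.executor.instances` and `spark.executor.cores` are the settings named in this thread, and `spark.executor.memory` is the companion memory setting. The sketch ignores memory overhead and cores reserved for the operating system, which a real sizing exercise would need to account for.

```python
def plan_executors(node_cores, node_memory_gb, cores_per_executor, worker_nodes=1):
    """Hypothetical helper: split each worker node evenly into executors."""
    if cores_per_executor <= 0 or cores_per_executor > node_cores:
        raise ValueError("cores_per_executor must be between 1 and node_cores")
    executors_per_node = node_cores // cores_per_executor
    memory_per_executor_gb = node_memory_gb // executors_per_node
    return {
        "spark.executor.instances": str(executors_per_node * worker_nodes),
        "spark.executor.cores": str(cores_per_executor),
        "spark.executor.memory": f"{memory_per_executor_gb}g",
    }


# Assumed example: one 16-core, 64 GB worker node split into 4-core executors.
conf = plan_executors(node_cores=16, node_memory_gb=64, cores_per_executor=4)
for key, value in conf.items():
    print(f"{key}={value}")
```

Values like these would typically go into the cluster's Spark config (or `--conf` flags for `spark-submit`) before the application starts, since executor sizing is generally read at startup and not changed mid-job.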
@VipinYadav-ii1ow 5 months ago
Just starting to learn Spark and Databricks. Is this resource enough to crack an entry-level data engineering job?
@rajasdataengineering7585 5 months ago
Yes definitely, these videos are more than enough to crack an entry-level job
@Arvind-sr6ze 2 years ago
I need to prepare for the Apache Spark Programming with Databricks certification. Will these videos help me?
@rajasdataengineering7585 2 years ago
Yes, it will help
@Arun-uw1hy 1 year ago
Hi, here the executor means processor (CPU), right? Because each node may have multiple CPUs, and each CPU can have multiple cores. So each core has a separate executor.
@rajasdataengineering7585 1 year ago
Executor means a logical division of a node. That means a combination of processor + memory + network
@mannykhan7752 10 months ago
Yun number of worker nodes?? What is Yun or yum??
@shivayogihiremath4785 1 year ago
I've been following this channel for a couple of days now. The content and way of explanation are awesome. Good job my friend, keep up the good work. Wishing you all the very best. One small suggestion: if possible, please try to avoid the initial music (which is played at the beginning of the video), at times it is annoying. Thank you!
@rajasdataengineering7585 1 year ago
Hi Shiv, thank you for your valuable comments. I already removed this initial music. Maybe it is still there for only a few initial videos.