spark ecosystem | Lec - 4

  Рет қаралды 30,914

MANISH KUMAR

MANISH KUMAR

Күн бұрын

In this video, I have talked about all the component of saprk. And how it stich to all of the component to run an spark application.
Directly connect with me on:- topmate.io/man...
For more queries reach out to me on my below social media handle.
Follow me on LinkedIn:- / manish-kumar-373b86176
Follow Me On Instagram:- / competitive_gyan1
Follow me on Facebook:- / manish12340
My Second Channel -- / @competitivegyan1
Interview series Playlist:- • Interview Questions an...
My Gear:-
Rode Mic:-- amzn.to/3RekC7a
Boya M1 Mic-- amzn.to/3uW0nnn
Wireless Mic:-- amzn.to/3TqLRhE
Tripod1 -- amzn.to/4avjyF4
Tripod2:-- amzn.to/46Y3QPu
camera1:-- amzn.to/3GIQlsE
camera2:-- amzn.to/46X190P
Pentab (Medium size):-- amzn.to/3RgMszQ (Recommended)
Pentab (Small size):-- amzn.to/3RpmIS0
Mobile:-- amzn.to/47Y8oa4 ( Aapko ye bilkul nahi lena hai)
Laptop -- amzn.to/3Ns5Okj
Mouse+keyboard combo -- amzn.to/3Ro6GYl
21 inch Monitor-- amzn.to/3TvCE7E
27 inch Monitor-- amzn.to/47QzXlA
iPad Pencil:-- amzn.to/4aiJxiG
iPad 9th Generation:-- amzn.to/470I11X
Boom Arm/Swing Arm:-- amzn.to/48eH2we
My PC Components:-
intel i7 Processor:-- amzn.to/47Svdfe
G.Skill RAM:-- amzn.to/47VFffI
Samsung SSD:-- amzn.to/3uVSE8W
WD blue HDD:-- amzn.to/47Y91QY
RTX 3060Ti Graphic card:- amzn.to/3tdLDjn
Gigabyte Motherboard:-- amzn.to/3RFUTGl
O11 Dynamic Cabinet:-- amzn.to/4avkgSK
Liquid cooler:-- amzn.to/472S8mS
Antec Prizm FAN:-- amzn.to/48ey4Pj

Пікірлер: 32
@bhavyamalviya8364
@bhavyamalviya8364 11 ай бұрын
Wonderful theoretical explanation
@nilavnayan4521
@nilavnayan4521 Жыл бұрын
Manish bhai, watched all your last 4 videos (this series) in one sitting and loved it - waiting for more :) Edit - Not just watched laet ke - but copy kalam ke saath :D
@HanuamnthReddy
@HanuamnthReddy 10 ай бұрын
Bhai can you do databricks
@andekhiaawaz
@andekhiaawaz 5 ай бұрын
Zabardast Series
@kavyabhatnagar716
@kavyabhatnagar716 Жыл бұрын
Great series ❤
@sairamguptha9988
@sairamguptha9988 Жыл бұрын
Hi Manish, you are doing an awesome job... Also pls try to do more vidoes on spark practicals like how to practice spark on data bricks account.
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
Already doing that
@sanooosai
@sanooosai 8 ай бұрын
thank you sir
@Mitra-xb7tz
@Mitra-xb7tz 5 ай бұрын
So, can we use HDFS cluster upon which Spark will act for data processing ?
@Watson22j
@Watson22j 5 ай бұрын
yes we can
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
Directly connect with me on:- topmate.io/manish_kumar25
@kartikmod7952
@kartikmod7952 Жыл бұрын
Hi, As per the video if the situation occurs when the cluster manager is down then the request will not go further, is this a possible case?
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
Yes. If your cluster is down then your request won't reach till the server. Master machine(on which yarn runs) are more powerful than the workers node and also it has a standby machine. So it is highly unlikely that your master will be down
@gchanakya2979
@gchanakya2979 Жыл бұрын
Bhaiya dsa python me hi karu ya fir muje java ya c++ me karna chahiya. Apne kya kiya.
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
You should do either in python or java. Because DE me ye 2 language hi use hoti hai
@alakanandade3883
@alakanandade3883 8 ай бұрын
@manish_kumar_1 does hadoop create replication of data for intermediate data as well? And how are these deleted later on?
@ankitsrivastava6218
@ankitsrivastava6218 7 ай бұрын
Hadoop intermediate data ka replication nahi karta across multiple nodes, jaise ki wo stored data (HDFS data) ke saath karta hai. Intermediate data sirf temporarily local disk pe store rehta hai jahaan Mapper run ho raha hota hai. Agar koi Mapper fail ho gaya toh Hadoop us Map task ko automatically doosre node pe re-execute karta hai, aur phir se wo intermediate data generate hota hai.
@praveenkumarrai101
@praveenkumarrai101 Жыл бұрын
bro you said we will use dataframe and spark sql then what is the use of python ? I am really confused.
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
To stich all your components together you need python. Let say I ask you to get the data from s3 bucket and make some changes on path and then transform it. So in this example you will use programming language I.e python in our case to access the file and make adjustment in the path. And then that path will be passed as a parameters to spark. In this playlist we are just learning spark
@praveenkumarrai101
@praveenkumarrai101 Жыл бұрын
@@manish_kumar_1 ok brother thanks
@saumyasingh9620
@saumyasingh9620 Жыл бұрын
Your videos are great...Can you pls make these little longer, capturing more concepts in one videos.
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
Yes I will
@nitilpoddar
@nitilpoddar 11 ай бұрын
done
@navjotsingh-hl1jg
@navjotsingh-hl1jg Жыл бұрын
manish bhai what are the roles of executor? executor means execute the data am i right ? or what is driver bro?
@younevano
@younevano Ай бұрын
Driver orchestrates the job and Executors do the data processing tasks!
@explorewithaAB
@explorewithaAB Жыл бұрын
Since spark is open source, so who does pay for this hardware. Since I think that hardware is nothing but a physical machine so there has to be someone who is paying right?
@manish_kumar_1
@manish_kumar_1 Жыл бұрын
Spark is a framework that is nothing but a piece of code which helps you to run your Data processing work in distributed manner. To run your code you need a cluster that is nothing but a physical machine
@explorewithaAB
@explorewithaAB Жыл бұрын
@@manish_kumar_1 thanks a lot for the reply. And who does pay for this cluster as it is yet to deploy on cloud service
@sankuM
@sankuM Жыл бұрын
@@explorewithaAB , I believe spark is just a frame work developed in Open source manner which will allow you to utilize distributed processing over cluster of compute(rs)! The cost of any such cluster is paid by the users only. e.g., if you run spark on AWS EMR or Databricks, you're charged money as per your usage of compute resources! Hope this helps! I'm exploring databricks community edition for spark practice, and am unsure as to how those clusters are managed & paid for internally! @Manish Kumar
@bangalibangalore2404
@bangalibangalore2404 Жыл бұрын
​@@explorewithaABif you install on laptop then your laptop you are buying, companies initially used on premise servers which they had to pay and purchase but now companies go to Azure, Azure gives them the computer on rent remember this way, the more powerful processor and hard disk you need the more you have to pay, at the end of month you will receive a bill for the usage that you did
@ordinary_indian
@ordinary_indian 10 ай бұрын
Well explained
@nitilpoddar
@nitilpoddar 11 ай бұрын
done
spark architecture | Lec-5
21:03
MANISH KUMAR
Рет қаралды 56 М.
Hadoop vs Spark | Lec-3 | In depth explanation
22:42
MANISH KUMAR
Рет қаралды 36 М.
Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts
00:18
Fabiosa Best Lifehacks
Рет қаралды 34 МЛН
Don’t Choose The Wrong Box 😱
00:41
Topper Guild
Рет қаралды 39 МЛН
Long Nails 💅🏻 #shorts
00:50
Mr DegrEE
Рет қаралды 20 МЛН
Мама у нас строгая
00:20
VAVAN
Рет қаралды 12 МЛН
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 3,9 МЛН
What is Spark in Hindi? #spark #dataengineering #data
16:53
CloudFitness
Рет қаралды 2,5 М.
Beat Ronaldo, Win $1,000,000
22:45
MrBeast
Рет қаралды 103 МЛН
Learn Apache Spark in 10 Minutes | Step by Step Guide
10:47
Darshil Parmar
Рет қаралды 360 М.
Spark RDD | Big Data Analytics | Big Data Tutorial in Hindi
7:51
learnTechWithPriya
Рет қаралды 3,5 М.
Part 1 - Big Data Processing   Introduction au Big Data
58:28
Professeur Mohamed YOUSSFI
Рет қаралды 44 М.
Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts
00:18
Fabiosa Best Lifehacks
Рет қаралды 34 МЛН