Intro to Amazon EMR - Big Data Tutorial using Spark

  Рет қаралды 34,575

jayzern

jayzern

Күн бұрын

Пікірлер: 89
@jovelynobias5422
@jovelynobias5422 9 ай бұрын
I hope you create more videos about AWS services. Loved the way you explain things, perfect for beginners.
@JeffSylvan-y8j
@JeffSylvan-y8j 10 ай бұрын
This is an outstanding tutorial. Thank you for making this!
@brijeshhota550
@brijeshhota550 2 ай бұрын
Best tutorial I've seen so far. Was confused between Glue and EMR for a future projects requiring big compute power with control over each node.
@Munk-tt6tz
@Munk-tt6tz 7 ай бұрын
So sad your channel doesn't have more tutorials like this :( thank you so much!
@MKtechnoverse
@MKtechnoverse 3 ай бұрын
Same thoughts me too
@DarshilParmar
@DarshilParmar 8 ай бұрын
Great work mate, very crisp!
@jayzern
@jayzern 8 ай бұрын
Thanks man!! Love ur content
@isaaclee3714
@isaaclee3714 9 ай бұрын
This is so goood :). Please keep making these kind of videos! Hello from Seattle
@jayzern
@jayzern 8 ай бұрын
Thanks Isaac from Seattle! Appreciate your support
@harishchitluri3137
@harishchitluri3137 6 ай бұрын
Absolutely enjoyed watching the entire video. I felt this video is gonna be great start to understand EMR. Thanks for making it jay
@miguelhermar
@miguelhermar 6 ай бұрын
We need more videos Jaaay 🙏🏻💪🏻 You're awesome dude!
@meghasingal7082
@meghasingal7082 Ай бұрын
Very well explained EMR video, thank you
@vineethdas4160
@vineethdas4160 6 ай бұрын
awesome explanation, simple , subtle and to the point!
@dianadai4616
@dianadai4616 2 ай бұрын
I can see you are brilliant.
@AleLopesYTube
@AleLopesYTube 2 ай бұрын
Excellent tutorial
@TheDataArchitect
@TheDataArchitect 2 ай бұрын
That was FAST, you are subscribed :D Any vids related to "Amazon Managed Workflows for Apache Airflow"???
@jeahyunkim3141
@jeahyunkim3141 3 ай бұрын
thank you!! I watched the KZbin demo and it was really helpful. I also want to study spark on eks
@yutao1982
@yutao1982 Жыл бұрын
Very clear! Thank you for sharing this excellent tutorial!
@szhongy
@szhongy Жыл бұрын
great tutorial! can’t wait to see more
@StartDataLate
@StartDataLate 10 ай бұрын
this is crazy ❤❤❤ wish i had seen this earlier ! is this how the whole amazon product in a actual work flow look like? and also could you maybe make another showing azure system? pleaaase
@lucashoww
@lucashoww 10 ай бұрын
gnarly stuff man! great content.
@carloshenriquekaphos8814
@carloshenriquekaphos8814 Жыл бұрын
Go ahead bro....CONGRATS TUTO
@prabhathkota107
@prabhathkota107 7 ай бұрын
Very well explained, kudos
@tatenda_mk
@tatenda_mk Жыл бұрын
Great tutorials! thanks for the headup! do you have a git repo or more notion notes? would like some guidance
@vmmismagic
@vmmismagic 7 ай бұрын
Hey, thank you so much!!.. you really explain very well!
@nellyoi9831
@nellyoi9831 2 ай бұрын
thank you, this is great tutorial
@thanhchien1602
@thanhchien1602 Жыл бұрын
Your video is very interesting! Hope you release many new videos :)
@jasonyuen105
@jasonyuen105 7 ай бұрын
nice job, great tutorial
@NhungNguyen-wh7uf
@NhungNguyen-wh7uf Жыл бұрын
Could you share more about project for data engineer beginners? I have start to learn to be a DE recently and I hope to know more about some personal project that help me to enhance my skills. Thank you so much for your sharing and waiting for your next video :> Have a good day
@Ярослав-ю9н8з
@Ярослав-ю9н8з Жыл бұрын
impressive and informative video, good job, go on doing tutorials plss :) Would be very interesting to see a video about spark and snowflake on your channel!
@hoangng16
@hoangng16 4 ай бұрын
I'm learning data engineering and I find it's hard to find end-to-end data engineering projects to learn building scalable B2B data infrastructure systems that process large amounts of data. Many examples only touch the surface or handle small datasets (well, they say example). Do you have a plan to make a tutorial about actual use cases of data engineering (large and complex data, scalable, cost efficiency system, etc.)?
@jayzern
@jayzern 4 ай бұрын
Yea for sure, I’ll consider making more end to end videos in the future. Hard to mimic actual use case data engineering with toy projects, but will try to bridge that gap
@hoangng16
@hoangng16 4 ай бұрын
@@jayzern, I'm doing a project to provide a management solution for small businesses. Data such as customer appointment schedules, staff's working hours, etc., will be stored in MongoDB. I'm thinking of a data engineering component: some data will be extracted, transformed and loaded to AWS S3 for visualization on a mobile device. I think it's still simple but, perhaps, good enough
@martinghiena5270
@martinghiena5270 9 ай бұрын
You killed it. Loved it! Extremely useful
@jayzern
@jayzern 8 ай бұрын
Thank you man! Hope to create more
@goumze
@goumze 8 ай бұрын
Great Article ! Thanks for sharing..
@markkevnjflores
@markkevnjflores 5 ай бұрын
this is amazing, thank you!
@mahmoudfadaly8074
@mahmoudfadaly8074 3 ай бұрын
the type of video that makes me wanna quit the field because of how bad i feel about the level I am in , but its a very helpful video though
@shafiq_ramli
@shafiq_ramli Ай бұрын
Are you working as data engineer right now?
@mahmoudfadaly8074
@mahmoudfadaly8074 Ай бұрын
@@shafiq_ramli I am looking for a job now as a Data Engineer
@mahmoudfadaly8074
@mahmoudfadaly8074 Ай бұрын
@@shafiq_ramliif u could help me find one 😢😅
@bishop9168
@bishop9168 8 ай бұрын
Fantastic tutorial indeed! I did as instructed and I got two fails in deploying the 'add step' part of the EMR Cluster stage, any insights would be appreciated.
@123Bankai123
@123Bankai123 3 ай бұрын
Hi bishop, I got this error too just now - did you manage to fix it? afaik I did everything the same
@KheireddineAzzez-l3g
@KheireddineAzzez-l3g 2 ай бұрын
nice, keep going
@hassanlaqrabti4036
@hassanlaqrabti4036 Жыл бұрын
More tutorials 🙏
@errrbrrr3821
@errrbrrr3821 Жыл бұрын
great video! can you make also for AWS Glue? Thank you!
@BhavyaJoshi-r4z
@BhavyaJoshi-r4z 5 ай бұрын
Great video
@pottamvivek
@pottamvivek 8 ай бұрын
Great job
@tatenda_mk
@tatenda_mk Жыл бұрын
when writing the spark script, does it ever change or the skeleton layout remains the same? i truly appreciate this and i cannot wait for more
@jazzypants4047
@jazzypants4047 11 ай бұрын
I am wondering if I only needed to do PySpark, is EMR the best tool or is it overkill and Glue serverless would be good enough with a lot less to manage and fewer configurations to worry about. Is it possible to enable better performance with all the options in EMR?
@jazzypants4047
@jazzypants4047 11 ай бұрын
And thank you for this video - I’m studying for AWS certification and it was helpful to see your demonstration
@_its_ck
@_its_ck Жыл бұрын
More videos on Streaming, Airflow and Spark
@syedmehdi5125
@syedmehdi5125 Жыл бұрын
I hav done masters of science in biotech, 38 yers of age, want to switch to data science...how shud i do it??? Plz reply.....
@CK30585
@CK30585 Жыл бұрын
Do projects and add them in your resume. Try upwork and do some projects as freelancers. Keep applying
@Ved3sten
@Ved3sten Жыл бұрын
Don’t
@syedmehdi5125
@syedmehdi5125 Жыл бұрын
@@Ved3sten y , plz reply...
@Ved3sten
@Ved3sten Жыл бұрын
@@syedmehdi5125 bc most companies want senior data analysts or graduate students when it comes to data science. You’ll waste more money chasing a data science job than you’ll make
@datexland
@datexland 11 ай бұрын
Thanks for sharing man 👌
@shivaramthallapally369
@shivaramthallapally369 Жыл бұрын
From where you learn that coding part 😢
@EstebanHenryG
@EstebanHenryG Жыл бұрын
Great!! Thank u so much!
@ZyklonB-88
@ZyklonB-88 3 ай бұрын
why do you need to create a VPC?
@etf_chach
@etf_chach 2 ай бұрын
VPC is for nodes. It allows them to communicate between each other and the master node.
@giovannimaia9652
@giovannimaia9652 6 ай бұрын
Please post more videos
@shakendra2011
@shakendra2011 5 ай бұрын
Hey why did you create vpc?
@jayzern
@jayzern 4 ай бұрын
Hey, we typically create VPCs over EMR clusters for more networking control and better security. If I rmb correctly, here we defined a public subnet and internet gateway which connects to S3. You could also use private subnets to avoid attaching internet gateway, but it's trickier to setup and can incur NAT gateway charges. The video is just an example
@mandata143
@mandata143 Жыл бұрын
is this free to use or do i need to have a licensed software in order to use? this is quite interesting.
@pradeepnim3689
@pradeepnim3689 Жыл бұрын
Thanks .. Good stuff
@jovelynobias5422
@jovelynobias5422 9 ай бұрын
Isnt using EMR notebook one of of the ways to trigger EMR job?
@jayzern
@jayzern 9 ай бұрын
Yes it is! Wanted to keep things simple in the video so didn't include it
@tbd4156
@tbd4156 9 күн бұрын
💗
@atreushouse8848
@atreushouse8848 5 ай бұрын
bro thank you i survive
@sisami2109
@sisami2109 Жыл бұрын
thanks for the video
@moverecursus1337
@moverecursus1337 2 ай бұрын
Great VIdeo
@carloshenriquekaphos8814
@carloshenriquekaphos8814 Жыл бұрын
Don't stop
@SaurabhKrPathak
@SaurabhKrPathak 4 ай бұрын
Just for the information to all the learners this is not how things to be done in tech industries....you need to understand Terra form scripts along with jenkins which deploys aws services....you will not get access to go on management console and play around and do stuff.
@hoangng16
@hoangng16 4 ай бұрын
Your content is valuable but I find that your presentation is too fast. Sometimes, you fastforward steps too quickly. It makes the video look not lengthy but it's challenging for audience to closely follow your steps.
@jayzern
@jayzern 4 ай бұрын
That’s super helpful feedback, I’ll try to slow down on the steps and talk more in future videos. Hopefully my latest video is slower. How the audience feels matters a lot
@DivakarJ-gk6op
@DivakarJ-gk6op Жыл бұрын
nice try but its not working
@jayzern
@jayzern Жыл бұрын
Let me know how I can help
@DivakarJ-gk6op
@DivakarJ-gk6op Жыл бұрын
I can add a step for the spark application@@jayzern
@jayzern
@jayzern Жыл бұрын
Check if 1. the Spark script is encrypted when you upload it inside S3 2. any typos (line 41 should be "add_argument")
@DivakarJ-gk6op
@DivakarJ-gk6op Жыл бұрын
I had tried. but it's not working for me @@jayzern
@jayzern
@jayzern Жыл бұрын
Send me a DM on instagram @jayzern or linkedin, happy to pair up
@koliux1
@koliux1 9 ай бұрын
eah good in EMR AWS but an absolute rookie in Videography and equipment use manual focus since you are stationary.... your autofocus keeps looking for something and change light set-up
@jayzern
@jayzern 9 ай бұрын
Fair point 👍 will work on lighting and camera setup more next time
@chulada03
@chulada03 Жыл бұрын
thanks so much
@christinachen9669
@christinachen9669 9 ай бұрын
Love the ways how you demonstrate! so clear and easy to understand! Thanks for sharing @jayzern
JISOO - ‘꽃(FLOWER)’ M/V
3:05
BLACKPINK
Рет қаралды 137 МЛН
КОНЦЕРТЫ:  2 сезон | 1 выпуск | Камызяки
46:36
ТНТ Смотри еще!
Рет қаралды 3,7 МЛН
AWS Glue ETL Vs EMR - Which one should I use?
8:05
Johnny Chivers
Рет қаралды 42 М.
The ONLY PySpark Tutorial You Will Ever Need.
17:21
Moran Reznik
Рет қаралды 146 М.
Amazon EMR Deep Dive and Best Practices - AWS Online Tech Talks
40:32
AWS Developers
Рет қаралды 58 М.
Running Spark jobs on Amazon EMR Serverless
25:12
dacort - Data Analytics
Рет қаралды 10 М.
AWS Tutorials - Absolute Beginners Tutorial for Amazon EMR
46:35
AWS Tutorials
Рет қаралды 30 М.
How I would learn Data Engineering (if I could start over)
11:21
Practical Projects to Learn Data Engineering On AWS
8:04
DataEng Uncomplicated
Рет қаралды 51 М.
What is Dagster? Asset Based Orchestration [2hr full course]
1:11:05
Snowflake Query Pruning: A deep dive
41:15
SELECT
Рет қаралды 253
AWS EMR Tutorial [FULL COURSE in 60mins]
1:01:06
Johnny Chivers
Рет қаралды 66 М.