Hi Folks - The much requested update to this video with the new AWS Console UI for AWS Glue is now available on the channel with a new GitHub repo containing everything you need to follow along. kzbin.info/www/bejne/kKethJSfpLWMr9E.
@sayakutube2 жыл бұрын
you were preparing the video when the entire west world was yelling "Happy New Year"! Great commitment and awesome result!
@TerminatorAyan Жыл бұрын
in 41 minutes, I never knew I can gain this much knowledge and at least be ready with AWS glue as we are in a transformation project. It really helped, thank you so much!
@farhodotaboev332411 ай бұрын
If you had watched with 1.25 playback speed, it would take you 33 minutes
@adeyemitunji3432 жыл бұрын
This the most detailed AWS glue video i have ever seen. Keep up the great work Johnny.
@danielboza5747 Жыл бұрын
For me the most amazing thing is that he was working in the morning of January 1st. My respect! 😅
@chamila.fernando.us2fernan6638 ай бұрын
You are Awesome. watching in 2024... ETL steps needs minor updating but I was still able to follow ! Keep up the great work !
@CaseWalker-j4g Жыл бұрын
I am enjoying the video, thanks for this resource! I wanted to note that it seems you blurred your S3 buckets early in the video, but at 19:22 and also 21:33 when configuring glue, you do not blur your other S3 buckets. Also 24:49.
@SungSam-wz2vk2 жыл бұрын
Awesome. I've never met people like you that you're so young with solid data science & aws skills.
@JohnnyChivers2 жыл бұрын
Thanks Sung!
@StephenGillie2 жыл бұрын
Troubleshooting tip for especially complex environments: If the user has access to ALL S3 locations in the table, then Glue will assemble the table for Athena to query. If even one S3 location can't be accessed by the user, then the table won't show up in Athena. Hope this helps complex schema lovers.
@JohnnyChivers2 жыл бұрын
Good tip.
@waeldimassi33552 жыл бұрын
Amazing content by far !! Please continue with Glue. Never seen such high quality tutorial :D
@JohnnyChivers2 жыл бұрын
Thanks for watching
@manasisingh294Ай бұрын
you speak my favorite accent that's such a plus for me. Thank you so much for the quality content! :)
@kakamoora78745 ай бұрын
simple and explain everything ... what we need .... thank you man ... i'm from sri lanka
@jimaustin3608 Жыл бұрын
At 28:00 (AWS Glue Jobs section) current screens are much different than video. Figure all the same information has to be entered, but order and screen flips are totally different.
@junaidraza6774 Жыл бұрын
I am not sure, does anyone said this before or not, but i wanted to say, You are a rock start.
@vedprakash9413 Жыл бұрын
41 minutes was sufficient for me to get hands-on AWS glue. Thanks, Johhny for this awesome tutorial.
@adalke22 жыл бұрын
As someone transitioning from a data scientist to DE role, I found this extraordinarily helpful! Subscribed! Thank you!
@arunsar78932 жыл бұрын
I am interested in understanding why did you decide to make the transition.
@JohnnyChivers2 жыл бұрын
Awesome! Thank you!
@augustocarrillo5927 Жыл бұрын
I don't like subscribing to people but lit I wouldn't be able to learn aws without you. Best and unique content
@EEdgerocks2 жыл бұрын
excellent Glue tutorial .. One of the best I have come across which teaches in a composed pace and easy to pickup for anybody
@JohnnyChivers2 жыл бұрын
Thanks for watching!!!
@ThanhTran-wf2jf2 жыл бұрын
This is one of the best tutorials i've seen in youtube videos. Very well-explained, very useful. Thank you for uploading this!
@JohnnyChivers2 жыл бұрын
Thanks for the comment and supporting the channel.
@JulioSerratos Жыл бұрын
This tutorial is pure gold. Thanks
@lyndsaymizen18812 жыл бұрын
Great intro to AWS Glue for someone who's never seen it before. Needed to know these basics as we're moving to Glue for our data transformation/warehousing and this video does just that! Thanks!
@JohnnyChivers2 жыл бұрын
Thanks for watching Lyndsay!
@Ajay-nc9yn Жыл бұрын
I have spent many hours on Udemy for Glue Job and Glue data Catalog, but after watching your video. I must say Damn Good Stuff, Sir!
@engenheironomade Жыл бұрын
Excelent tutorial. I made three others courses before that and only with this that I understead complete this tool
@guilhermejf86422 жыл бұрын
I just can't believe it's free here on KZbin! Great content, very well explained. Thanks!
@JohnnyChivers2 жыл бұрын
Thanks for watching the video and a supporting the channel!
@DataCraftBackbone2 жыл бұрын
As a skilled Ops Engineer / Developer i have search for infomartion about whats Glue and how its working in the basic and thanks so much for your video, its late me understanding the basic about what Glue are and whats it can be used for, :)
@femaledeer Жыл бұрын
Glad the tutorial took me through the console instead of running code.
@1209youandme2 жыл бұрын
Superb course, covering all features with examples
@JohnnyChivers2 жыл бұрын
Thanks for comment.
@jairo_55 Жыл бұрын
I am using this in 2023, the interface has changed, but this still working, thank you so much for the video.
@asifalimir304310 ай бұрын
Such an amazing video. Gives end to end idea about Glue in just 45 mins. Keep it up.
@victors9585 Жыл бұрын
Johnny, you ROCK, man!!! 🔥👏🔥👏🔥👏🔥 Thank you so much and please don't stop spreading the wisdom of the GURU!!!! 🙏🙏🙏👍👍👏👏🚀🌟
@hankbirkdale2154 Жыл бұрын
One of the best tutorials I've ever seen - let alone on a topic that is not easy.
@AbhirupSenguptaUK3 ай бұрын
your commitment shows in the quality!
@diegogiardini Жыл бұрын
It was very usefull for someone who is starting with AWS from scratch! Thank you :)
@shelleycurrie7644 ай бұрын
Johnny Chivers you rock. Really great intro to Glue.
@Pavan-kn5pg Жыл бұрын
One of THE BEST videos on AWS Glue. Thank you Johnny :)
@dbeckett Жыл бұрын
Finally an aws tutorial that uses "wee" and talks in a NI accent, made me laugh that I finally found a fellow NI person :) followed every word :D
@GauravSakhuja Жыл бұрын
Great tutorial in 40 mins you covered everything essential in Glue 👏
@Rk23able Жыл бұрын
Wonderful and neatly described in such a short video
@venkatrao78687 ай бұрын
You are amazing and a natural teacher !!
@javiermadriz783410 ай бұрын
Great video for beginners I hope to build some projects for keep learning
@mohamedyasser52852 жыл бұрын
Fantastic video, going to binge-watch your videos for the next few months!
@JohnnyChivers2 жыл бұрын
Thanks for watching.
@Josh-og9eo2 жыл бұрын
So excited to find this channel. Looking forward to watching all your videos!
@ryoutausami6817 Жыл бұрын
Thank you. This is an awesome introduction to AWS Glue for beginners.
@hemantmattoo Жыл бұрын
Very nice and easy to learn video especially for begineers. Thanks for posting this.
@sreddy5845 Жыл бұрын
Fantastic video: I have two questions 1. Can you create a partition key when importing data with a crawler? 2. The UI seems to have changed. The 'ETL job' has changed. Can you publish a refresher on that part?
@jeevangangavarapu86832 жыл бұрын
Excellent work Johnny. Keep continuing the work .
@JohnnyChivers2 жыл бұрын
Thanks, will do!
@akashdasgupta50882 жыл бұрын
This is one of the best AWS Glue tutorial that I've come across on KZbin. Totally worth a like and sub❤️
@JohnnyChivers2 жыл бұрын
Thanks for the sub!
@ibinabobob-manuel32311 ай бұрын
Towards the end of the course, I lost my way after we created the parquet file because my version is different from yours and I couldn't just get the visuals on my dashboard. Thank you so much and I will give this a try again some other time.
@shadabbigdel5017 Жыл бұрын
Thank you very much for you great explanation and hands-on! It was very useful for me.
@patriciacafundo1626 Жыл бұрын
Very good course, gave me a good view of glue AWS service
@danielmdubois Жыл бұрын
Do you have any resources or can direct me to any resources that can dig deeper into the topic skipped over @8:50, i.e., what someone would actually do with regards to IAM roles best practices?
@ToddCunningham2 жыл бұрын
right when i needed this, always amazing content
@JohnnyChivers2 жыл бұрын
Thanks for watching Todd
@davidlean86742 жыл бұрын
Great & comprehensive coverage of AWS Glue. I was hoping it did much more. Makes you appreciate how much easier, faster & cheaper it is to develop in Azure DataFactory. Where a single pipeline can consume multiple different CSV files, parse the filename, figure out the metadata on load, & redirect the output based on lookup parameters. All graphical or generated via script.
@DataEngUncomplicated2 жыл бұрын
Hi David, This video did not cover AWS Glue Studio which provides more of a compressive graphical interface that I think would be more comparable to Azure Data Factory.
@debarshiacharya3729 Жыл бұрын
This was a really nice video. Lot of learning within a very short time. Thanks
@martinvuong66522 жыл бұрын
Such an amazing tutorial! Thanks for getting this together. Looking forward to more content.
@vjsnapp8178 Жыл бұрын
Amazing Content and style of teaching, Thanks a lot for making these videos.
@vrlchebolu Жыл бұрын
Great intro on AWS Glue! Thanks!!
@gonzalea35 Жыл бұрын
This was awesome explanation of glue, thanks.
@guilhermmontealto2160 Жыл бұрын
huge fan from brasil, thank you for your great content, it helped me a loooot
@ambhat3953 Жыл бұрын
Very helpful for my studies
@bitashamsss Жыл бұрын
fantastic video, very helpful and useful. Looking forward to seeing more. keep up the great work and thank you so much.
@AviinashP2 жыл бұрын
excellent video on glue
@hazemzamalkawy145 ай бұрын
Thank you very much, I learnt a lot from this tutorial.
@ammar97002 жыл бұрын
Nicely done and explained. I have done my first AWS Glue job, without any issue... thumbs up bro 👍
@JohnnyChivers2 жыл бұрын
Fantastic news Ammar!!
@sriravi8152 жыл бұрын
Thanks for nice video, it helped me understand glue and setting up the pipeline in aws.
@KoEDeath2 жыл бұрын
Impressive free content. Thank you!
@JohnnyChivers2 жыл бұрын
Thanks for watching
@maxmax2746 Жыл бұрын
Thank you for your work! Could you please help few little questions: 1) i saw, that while you it said that "customer_csv" table is partitioned, but "customer_parquet" is not (there is not "partitioned" mark nearby). So how should i make that files partitioned? Because it seemed, that you did the same thing for csv ant parquet, but get different result. Thank you in advance! 2) What would be happened, when you add another csv data for different day? how job will work? i didn't get, how jobs determine, that previous day already transformed to parquet, and newest day - no. And what if . As i understand - in your particular example from that video, that Glue job will be transforming from csv to parquet all the fyles inside customer_csv folder. But how to make it more determined based on run date? for example: in general i want to transform only previous loaddate files. As i understand, it should be done only in code of job
@gabesusman4592 Жыл бұрын
this channels a great resource for learning data engineering on aws, it's been a big help. Keep up the good work Johnny!
@roarmarketingconceptsllc5074 Жыл бұрын
This was a great tutorial! I really learned a lot about AWS Glue and I plan to leverage this knowledge to be more effective in my job tasks. Thanks so much, Johnny!!
@obedrajugantala3489 Жыл бұрын
Good tutorial. I was able to follow and execute. Thanks Johnny!
@maryo11342 жыл бұрын
Amazing teacher..Many thanks for this
@JohnnyChivers2 жыл бұрын
Thanks for watching.
@PURAMSAIPRIYA2 жыл бұрын
Thank you for this video, it is very helpful gives a clear glimpse of AWS glue. This is a very important video for me for the interviews :)
@kjewelson8 ай бұрын
happy new year
@JasonZhang-se2jo2 жыл бұрын
You are definitely a hero
@JohnnyChivers2 жыл бұрын
Thanks for watching
@Matrix_Mayhem Жыл бұрын
Thankyou so much. Very informative Johnny!
@anarossetto27422 жыл бұрын
wonderful tutorial, great explanation, thank you! Unfortunatly aws glue job console has changed a lot and I could not finished the tuto :(
@machinimaaquinix3178 Жыл бұрын
Thank you and well done. Was able to do the course still in June of 2023, though the ETL chapter was a bit of a challenge as AWS has completely redone those screens.
@JohnnyChivers Жыл бұрын
Great to hear!
@michealdmouse Жыл бұрын
How did you get past the ETL job chapter? The interface is totally different.
@AkashBhosale-mr7kk9 ай бұрын
Great Video great efforts. Thanks a lot for detail explanation. When I ran the crawler for 1st time as per video, it did not create partition column, I again created a new crawler with same details using same s3 folders and now it created a partition. What might the possible reason that it failed to detect at 1st time? Any key point to remember during building or some miss that lands us into such situation?
@jriosfer2 жыл бұрын
It's very interesting your approach, I have a question, why does the glue's crawler from converts csv file to parquet format could not create the parquets table with a partition definition, which came from csv file?
@JohnnyChivers2 жыл бұрын
Hi Jorge. The crawler will only create partitions when there is a folder. So something like s3://table_name/parition_1/file.csv will result in partition being created. However, crawlers can be a bit temperamental. They are some sort of ML algorithm and do go AWOL at times. For that reason I usually just create the tables through code manually - terraform, cloud formation or even DDL via Athena.. when it comes to real life use cases.
@sagnikmukherjee5108 Жыл бұрын
Thoroughly enjoyed, Thanks.
@ishabisht36852 жыл бұрын
Thank you very much, it was very helpful video to learn end to end AWS Glue. 🙂
@qadiralidanish7529 Жыл бұрын
Awesome tutorial.
@daviluancarneiro6901Ай бұрын
Jesus bless you, my friend. Your explanations are very clear. My desire is that you continue making the difference, creating and publishing awesome contents that help people. Congratulations! Good job!
@PatrickMcDonoughVanWash Жыл бұрын
Highly recommend!
@balledachandrahas83262 жыл бұрын
Super helpful.... Thank you so much.
@JohnnyChivers2 жыл бұрын
Thanks for watching!
@benitinmagnate49379 ай бұрын
@25:00 "I'll talk about connections quickly", LOL! That's what AWS Glue, Azure Data Factory, SSIS, Informatica, are all about: CONNECTIONS! You are moving data from a source to a target, and to do that, you need to be connected to both, the source and the target. Basically, you are an S3 guy, LOL!
@YogeshSeemakurthi2 жыл бұрын
Thank you for the course!! I want to create a custom classifier for text files is that possible?
@JohnnyChivers2 жыл бұрын
Yes that possible. You’ll find it on the glue console under classifiers and ‘create’. I think there are some limitations on how customisable classifiers can be.
@nishaddheeraj2 Жыл бұрын
really amazing and fun learning with Johnny :)
@niranjanjamkhande37732 жыл бұрын
Thanks for the video. We need videos for real world problems with real world data such as nested json. Can you guide on that?
@JohnnyChivers2 жыл бұрын
Niranjan - something I can totally do and feel free to suggest ideas. For now there is an AWS Glue Library called Rationalize that can de-nest for you - aws.amazon.com/blogs/big-data/simplify-querying-nested-json-with-the-aws-glue-relationalize-transform/ There is also a pandas library that does exactly the same thing called json_normalize() and this available in glue out of the box - but don't use pandas in glue on bigger JSON loads.
@muadddib47342 жыл бұрын
Excellent content! Hope you keep it coming. I just subbed
@JohnnyChivers2 жыл бұрын
Thanks for watching.
@lukacindric80659 ай бұрын
What would be the video on your channel where you explain loading the data from s3 to dwh using Glue?
@tullez01 Жыл бұрын
Amigo, muito obrigado pelo vídeo... Ficou ótimo! Abraços! Dear friend, thanks for this video. It's really great, helped a lot... Hugs from Brazil :)
@JeffLentz Жыл бұрын
Fantastic! Thank you for putting this together. It helped me a lot.
@deetechbee Жыл бұрын
awesome video!
@claudioruz2 жыл бұрын
Good training course, thank for all
@JohnnyChivers2 жыл бұрын
Thanks for watching.
@HughMcBrideDonegalFlyer Жыл бұрын
Great video Johnny , my only complaint would be video resolution , It can be a bit of a strain to try to watch this on a laptop
@TheSimpGatsby Жыл бұрын
thanks mate, u made my day.
@BEBKomalTeke2 жыл бұрын
Hi, there's one option in ETL as jobs and jobs legacy. Could you explain what's the difference?
@JohnnyChivers2 жыл бұрын
The “jobs” tab is new where AWS look to be consolidating all functionality to do with jobs. The “legacy” tab is where jobs have been located for the last 4+ years. It looks like most of the legacy functionality has been migrated to the jobs tab - albeit the UI is slightly different. This change only happened about 2/3 months ago and is all pretty new. I suspect the legacy tab will be removed eventually, but only once all functionality has been migrated to the new ‘jobs’ section.
@Straight-Data-Science Жыл бұрын
Very well done! Thanks for sharing this!!
@augustogoldner4267 Жыл бұрын
That was fantastic, mate! Thank you very much for sharing your knowledge!