Python ETL Data Pipeline Tutorial with AWS, Terraform and Spotify

30,480 views

datadrivendecisions

Comments: 67
@datadrivendecisions0 3 years ago
Thanks for watching! Let me know if you've got any questions!
00:00 - Intro
01:41 - Pre-Requisites
04:59 - Extract and Transform with Python
14:20 - Loading with Lambda & AWS
18:38 - Terraform Deployment
29:13 - Summary
@manuelaguilar9679 2 years ago
8⁸
@redrum4486 2 years ago
I just want to say thank you my man! I followed this tutorial, which took me 2 full days to do, and it helped me get a job as I was able to talk about the process! God bless you homie
@datadrivendecisions0 2 years ago
This is why I started making YouTube videos. Love to hear it 🔥 Congratulations on the job and for putting in the work 💯
@teaspells9994 3 years ago
Thank you! I loved this video. I'm trying to get a job in the database dev field, and ETL is one of the main requirements for these kinds of jobs.
@datadrivendecisions0 3 years ago
I’m really glad that this helped you and I hope it helps with the job interviews! I’m definitely going to be doing more videos in the future around the skills required to get into the data/data engineering profession
@sameershaik9998 1 year ago
Successfully completed the project. This was my first ETL project. Thanks a lot, Liam. 😇
@k00a7 3 years ago
Thank you for taking the time to post this video. I love how you used TF, Python and AWS tools to build the application.
@datadrivendecisions0 3 years ago
You’re welcome Karthik. I’ll be using terraform, AWS and python to deploy my bitcoin bot in my next video!
@k00a7 3 years ago
@datadrivendecisions0 that would be great. It would also be a great addition if you could use a CI/CD pipeline like GitLab.
@datadrivendecisions0 3 years ago
@k00a7 I deploy some of my work with CodePipeline from AWS at the moment. But I also use Jenkins, CircleCI and GitHub Actions at work, so I can go over any of those in future content if you're interested!
@garcialn 1 year ago
Great content, mate. Hope to see more from data engineer Matthew McConaughey 😉
@justincooke5888 2 years ago
Dude!!! Please, please, please add more content. You make data engineering fun haha!!!
@datadrivendecisions0 2 years ago
thank you! check out new content on my new channel kzbin.info
@karolinasowinska 3 years ago
Thanks for so much valuable content in one video! I might finally get my hands on Terraform! P.S. you've just given me AWS instances OCD with that infinite loop story 😅
@datadrivendecisions0 3 years ago
Thank you Karolina! Happy to show you some of the basics sometime. I think you know the person who made the infinite loop, we both worked with him 😉
@aaronroethe1010 2 years ago
This video was high tier. Excellent Job!
@datadrivendecisions0 2 years ago
Thanks Aaron!
@onurbasar7840 3 years ago
Thank you. I guess I found a valuable activity to do this evening.
@datadrivendecisions0 3 years ago
You’re welcome! I hope you enjoyed your evening!
@shariqshahab5352 3 years ago
Great video! Very informative, yet entertaining content. Looking forward to more cloud computing videos!
@datadrivendecisions0 3 years ago
Thank you Shariq! I’ll be deploying my cryptocurrency trading bot onto AWS soon!
@carlosleotaud3255 2 years ago
Damn man, you have a great attitude and the content is really good, thanks!
@datadrivendecisions0 2 years ago
Thanks Carlos! Hope you found it useful
@redrum4486 2 years ago
You sir are the Goat.... big up man like Liam!
@mikelmenaba 1 year ago
you are a real G
@whateverimnot1137 3 years ago
Nice project. I just started to learn terraform, so it was nice to see an example. Would love to see a video about streaming data in AWS and applying some live analytics in the future.
@datadrivendecisions0 3 years ago
Glad you liked it! I’ll try and do some more terraform in the future. Could try and come up with a streaming solution in the future. Like using Kinesis for streaming?
@whateverimnot1137 3 years ago
@datadrivendecisions0 Yeah, Kinesis would be perfect!
@sase1017 3 years ago
Good stuff, bro, please make more
@datadrivendecisions0 3 years ago
They’re in the pipeline 😏
@raulgutierrez5862 3 years ago
Excellent demo!
@datadrivendecisions0 3 years ago
Thank you Raul!
@DivineAura1 1 year ago
Thanks for the video! Are you using PyCharm as the code editor, btw?
@abhi0603-c3b 2 years ago
Thanks for sharing this excellent video. It shows that you are highly skilled and have in-depth knowledge. I have a question for you as well: would saving the data in RDS have been better for querying with SQL later on? Are there any architectural reasons for choosing S3?
@datadrivendecisions0 2 years ago
Yeah, an RDS instance would definitely be the better way to store data for analysis. S3 is much quicker and cheaper to set up as it doesn’t require any DDL, which is why I chose it for this project
@cjoshmartin 3 years ago
Great video
@datadrivendecisions0 3 years ago
Thank you Josh!
@vert_sr 2 years ago
bro you got me dying throughout this whole vid 🤣🤣
@taoufikizem5129 1 year ago
Thank you for this demo. I just didn't get where I should insert the following credentials (Spotify_client_id, spotify_client_secret, S3_name, S3_key...) into the API?
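One common way to handle credentials like these, offered here as an assumption since the thread does not show how the video's code receives them, is to keep them out of the source and read them from environment variables at start-up. The variable names below are hypothetical, modelled on the ones in the question; a minimal sketch:

```python
import os

# Hypothetical variable names, modelled on the ones in the question above.
REQUIRED_VARS = ("SPOTIFY_CLIENT_ID", "SPOTIFY_CLIENT_SECRET", "S3_BUCKET_NAME")


def load_settings(env=None):
    """Return the required settings as a dict, failing loudly if any is missing."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_VARS}
```

Locally the values can be exported in the shell before running the script. On AWS Lambda, environment variables can be set in the function's configuration, so the same code would work unchanged.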
@niklausmikealson3115 1 year ago
I can't understand how the file structure was made. Since it was already created, I am confused about what comes first and what doesn't. What should I do?
@brandonltran 2 years ago
Fun project! Loved the article too. What IDE are you using in this video?
@datadrivendecisions0 2 years ago
Thank you Brandon! Glad you enjoyed it. I was using either PyCharm or IntelliJ. I'd recommend PyCharm for Python :)
@tuetran9212 3 years ago
Great channel. I like your enthusiasm. You got a sub from me.
@datadrivendecisions0 3 years ago
Thank you Tue! Stay tuned 🤟
@fayssalelaazouziai1573 2 years ago
Good stuff, bro. Please, I have a question: what "redirect uri" should I put for this project?
@janakasandun2346 2 years ago
Really helpful!..
@datadrivendecisions0 2 years ago
Glad it was helpful!
@brianboyle1363 2 years ago
Hey Liam, hope all is good! I'm doing a college project, basically using your code from the video 'Python ETL Data Pipeline Tutorial with AWS, Terraform and Spotify'. I'm just stuck on the artist Python code. For example, you have spotify:artist:5K4W6rqBFWDnAN6FQUkS6x, but I'm wondering where you are getting the 5K... part, as I'm doing it with my own playlist.
@datadrivendecisions0 2 years ago
Hey Brian, we chatted on Instagram and you found your answer by right clicking on the albums to get the artist links ☺️
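For anyone with the same question: the ID is the last part of the artist's Spotify URI or share link. The helper below is a hypothetical sketch, not code from the video, and it assumes share links take the form `https://open.spotify.com/artist/<id>`:

```python
from urllib.parse import urlparse


def spotify_artist_id(link_or_uri):
    """Return the artist ID from a 'spotify:artist:<id>' URI or an artist share link."""
    text = link_or_uri.strip()
    if text.startswith("spotify:artist:"):
        return text.split(":")[2]
    # Assumed link shape: https://open.spotify.com/artist/<id>?si=...
    parts = [part for part in urlparse(text).path.split("/") if part]
    if len(parts) >= 2 and parts[-2] == "artist":
        return parts[-1]
    raise ValueError("Not a Spotify artist link or URI: " + link_or_uri)
```

Passing the URI from the question, `spotify:artist:5K4W6rqBFWDnAN6FQUkS6x`, returns `5K4W6rqBFWDnAN6FQUkS6x`. Any query string on a copied link is ignored.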
@maybenew7293 1 year ago
damn... it took me 3 hours just only for installing all the packages... 😮‍💨
@dbseraph 3 years ago
The answer is contracts and attention spans.
@datadrivendecisions0 3 years ago
😂
@datadrivendecisions0 3 years ago
What’s new in the contracts these days?
@arunkumaracharya9641 2 years ago
This can be a 1-month crash course for a beginner even if he knows python and AWS.
1 year ago
Hello, thank you very much for this project! I made a project based on yours, but now I'm stuck. I created the project and used Terraform to deploy to AWS. But when trying to run the file with the collect_data function in AWS Lambda, I got this error: "[ERROR] Runtime.ImportModuleError: No module named 'redis' Traceback" Do you have any idea what could be causing this error? Thank you.
@SameerShaik-ic1oz 1 year ago
same problem
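A likely cause, though the commenters' setups are not shown: Lambda's Python runtime does not include third-party packages such as redis, so anything beyond the standard library and boto3 has to be shipped inside the deployment zip or a layer. The sketch below installs the requirements into a folder and zips it. The function names and the folder layout are hypothetical, not taken from the video:

```python
import subprocess
import sys
import zipfile
from pathlib import Path


def install_dependencies(requirements, target_dir):
    """Install third-party packages into target_dir so they ship with the function code."""
    subprocess.run(
        [sys.executable, "-m", "pip", "install",
         "-r", str(requirements), "--target", str(target_dir)],
        check=True,
    )


def zip_directory(source_dir, zip_path):
    """Zip source_dir with paths relative to its root and return the archived names."""
    source_dir = Path(source_dir)
    names = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(source_dir.rglob("*")):
            if path.is_file():
                name = path.relative_to(source_dir).as_posix()
                archive.write(path, name)
                names.append(name)
    return names
```

The handler file would be copied into the same folder before zipping, and the zip should be written outside that folder so it does not end up archiving itself. Packages with compiled parts may also need to be built for Lambda's Linux environment.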
@sase1017 3 years ago
What IDE are you using? IntelliJ?
@datadrivendecisions0 3 years ago
IntelliJ in this video but I would recommend using PyCharm! I use it in my more recent videos
@tello9504 3 years ago
Why keep it in a Lake and not in a Warehouse?
@datadrivendecisions0 3 years ago
Purely for simplicity and costs. A properly constructed data warehouse would definitely be the better end product!
@tello9504 3 years ago
@datadrivendecisions0 Thank you
@datadrivendecisions0 3 years ago
Thank you José!
@firefoxcarranza6432 2 years ago
Please make a tutorial on SageMaker, thanks.
@ChhatreMihir 3 years ago
Loved the video! Thanks for making this. Would love to see some more videos around building data pipelines :) Sub from me 👍
@datadrivendecisions0 3 years ago
Thank you Mihir! I'll be finishing off this crypto series and then I'll be talking about my football betting algorithm which is effectively a pipeline of data with some logic built on top of it! Stay tuned :)
@DaarioNeharis 2 years ago
implemented almost everything, but getting the following error when trying to run terraform apply: AccessDeniedException: User: arn:aws:iam::16003352######:user/spotify-data-analysis is not authorized to perform: events:DescribeRule on resource
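The error message says the IAM user running Terraform is not allowed to call `events:DescribeRule`, so that user probably needs EventBridge permissions added. The sketch below prints a policy document that could be attached to the user. The exact list of actions is an assumption about what a scheduled rule needs, and the wildcard resource should be narrowed where possible:

```python
import json

# Assumed set of EventBridge actions for managing a scheduled rule;
# trim or extend this list for your own setup.
EVENT_ACTIONS = [
    "events:DescribeRule",
    "events:PutRule",
    "events:DeleteRule",
    "events:PutTargets",
    "events:RemoveTargets",
    "events:ListTargetsByRule",
    "events:ListTagsForResource",
]


def events_policy(rule_arn="*"):
    """Build an IAM policy document allowing the actions above on rule_arn."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {"Effect": "Allow", "Action": EVENT_ACTIONS, "Resource": rule_arn}
        ],
    }


if __name__ == "__main__":
    print(json.dumps(events_policy(), indent=2))
```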