In this video, we'll go through how you can build your own ETL pipeline using Airflow! You can swap out any of these components for your specific use case, but the general concepts will still hold true! Code: registry.astro...
Comments: 6
@AlexZdanov 1 year ago
Hey George, great stuff! I run a couple of websites that use a lot of data from a bunch of different APIs, CSVs, and Excel sheets that are published online on a regular basis, and it's generally not very "clean", in that it requires a lot of transforming, testing, combining, etc. (so big emphasis on the T in ETL). I'm looking for a tool that can handle all of that transformation while also making it very easy to visualize and document where everything is coming from, how it's being manipulated, and where it's going. As for where it's going, pretty much everything is stored in a single Postgres DB, and I doubt there's any need for something bigger. Would you recommend Airflow for this? Are there any other tools you think would be appropriate?
@thedataguygeorge 1 year ago
I would highly recommend Airflow for this! Especially with the new Triggerer, it's really good for async consumption/transformation of that kind of data, and you can build DAG generators so that adding a new resource doesn't mean writing a new pipeline by hand.
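For anyone wondering what a "DAG generator" might look like, here's a rough sketch of the idea in plain Python: one factory function that stamps out an extract, transform, load pipeline per configured source. Everything in it is an assumption made up for illustration (the `SOURCES` dict, the `Pipeline` class, `make_pipeline`, the sample CSV text), and it deliberately leaves Airflow out so it runs with only the standard library. In a real Airflow project each generated pipeline would typically be built as a DAG object instead, and the load step would write to Postgres rather than return a dict.

```python
import csv
import io
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

# Hypothetical sources: name -> raw CSV text. In practice these would be
# URLs, file paths, or API endpoints fetched inside the extract step.
SOURCES: Dict[str, str] = {
    "prices_csv": "item,price\napple, 1.50 \nbanana,0.25\n",
    "stock_csv": "item,price\ncherry,3.00\n",
}


@dataclass
class Pipeline:
    """Stand-in for a DAG: an id plus an ordered list of steps."""

    pipeline_id: str
    steps: List[Callable[[Any], Any]]

    def run(self) -> Any:
        # Feed each step's output into the next one, in order.
        data: Any = None
        for step in self.steps:
            data = step(data)
        return data


def make_pipeline(name: str, raw_text: str) -> Pipeline:
    """Factory: build one extract -> transform -> load pipeline per source."""

    def extract(_: Any) -> List[Dict[str, str]]:
        # Parse the raw CSV text into a list of row dicts.
        return list(csv.DictReader(io.StringIO(raw_text)))

    def transform(rows: List[Dict[str, str]]) -> List[Dict[str, Any]]:
        # Clean up whitespace and cast the price column to a number.
        return [
            {"item": row["item"].strip(), "price": float(row["price"])}
            for row in rows
        ]

    def load(rows: List[Dict[str, Any]]) -> Dict[str, Any]:
        # Placeholder for a database insert; tags rows with their origin.
        return {"source": name, "rows": rows}

    return Pipeline(pipeline_id=f"etl_{name}", steps=[extract, transform, load])


# Adding a new resource means adding one entry to SOURCES; no new pipeline code.
PIPELINES: Dict[str, Pipeline] = {
    pipeline.pipeline_id: pipeline
    for pipeline in (make_pipeline(name, text) for name, text in SOURCES.items())
}

if __name__ == "__main__":
    for pipeline_id, pipeline in PIPELINES.items():
        print(pipeline_id, pipeline.run())
```

The point of the factory is that the transformation logic lives in one place, and keeping the source name on every loaded batch gives you a simple record of where each row came from.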
@harizizzudin 1 year ago
Hi, can I do this project using the AWS free tier?
@thedataguygeorge 1 year ago
100p!
@saket8431 1 year ago
Good stuff🤌
@thedataguygeorge 1 year ago
Thanks so much! Let me know if there's anything else you'd like to see!