Views: 16,656
A data lake is centralized cloud storage in which you can store all of your data, both structured and unstructured, at any scale. It is fast becoming the standard for users looking to store and process big data. In this video, we cover how to build an AWS S3 data lake from an on-premise SQL Server database. S3 is an easy-to-use data store, and we use it to load large amounts of data for later analysis.
Link to Medium article on this topic: blog.devgenius.io/how-to-build-a-s3-data-lake-with-python-from-on-premise-database-23d5d2cdd1da
Link to GitHub repo: github.com/hnawaz007/pythondataanalysis/tree/main/AWS%20Data%20Lake
Subscribe to our channel:
kzbin.info
---------------------------------------------
Follow me on social media!
GitHub: github.com/hnawaz007
Instagram: bi_insights_inc
LinkedIn: www.linkedin.com/in/haq-nawaz/
---------------------------------------------
#AWS #S3 #DataLake
Topics covered in this video:
0:00 - Intro: data lake from on-premise to AWS S3
1:03 - Create S3 user with programmatic access
2:37 - Create S3 bucket
3:04 - Python setup
3:56 - Read data from SQL Server
5:04 - Load Data to S3 Bucket
6:59 - Code Demo
7:36 - Review S3 Data Lake
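
For readers who want a feel for the read-then-load steps before watching, here is a minimal sketch. It is not the code from the video or the linked repo. It assumes a DB-API connection (for SQL Server that would typically come from pyodbc) and a boto3 S3 client, and it uses only the standard library to build the CSV. The function names, the `raw` prefix, the `customers` table, the `my-data-lake-bucket` bucket, and the connection string are all hypothetical placeholders.

```python
import csv
import io


def table_to_csv_bytes(conn, table_name):
    """Read every row of a table via a DB-API connection and return CSV bytes."""
    cursor = conn.cursor()
    # table_name is interpolated into the SQL, so pass only trusted, hard-coded names.
    cursor.execute(f"SELECT * FROM {table_name}")
    header = [column[0] for column in cursor.description]
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(header)
    writer.writerows(cursor.fetchall())
    return buffer.getvalue().encode("utf-8")


def upload_table(conn, s3_client, bucket, table_name, prefix="raw"):
    """Export one table to CSV and store it in the bucket under prefix/table.csv."""
    key = f"{prefix}/{table_name}.csv"
    body = table_to_csv_bytes(conn, table_name)
    s3_client.put_object(Bucket=bucket, Key=key, Body=body)
    return key


def run_example():
    """Hypothetical wiring; needs pyodbc, boto3, and configured AWS credentials."""
    import boto3
    import pyodbc

    # Placeholder connection string; adjust driver, server, and database.
    conn = pyodbc.connect(
        "DRIVER={ODBC Driver 17 for SQL Server};"
        "SERVER=localhost;DATABASE=MyDatabase;Trusted_Connection=yes;"
    )
    s3_client = boto3.client("s3")
    return upload_table(conn, s3_client, "my-data-lake-bucket", "customers")
```

Passing the connection and the S3 client in as arguments keeps the two steps independent of any particular driver, so the same functions can be tried against a local SQLite database and a stub client before pointing them at SQL Server and a real bucket. `run_example()` is deliberately not called here.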