Cloud Data Engineering Project : Migrating an On-Premises Data Pipeline to AWS

  Рет қаралды 226

Aymane Maghouti

Aymane Maghouti

Күн бұрын

In this video, you will see:
Introduction: Overview of the project and the motivation behind migrating from an on-premises setup to AWS.
On-Premises Pipeline Overview: Understanding the initial pipeline using HDFS, Spark, Kafka, PostgreSQL, and Power BI... ( details here : [ • Big Data engineering P... ])
AWS Pipeline Architecture:
Data Collection: Scraping data from Jumia using BeautifulSoup, transforming it with Python and pandas, and storing it in Amazon S3.
Data Processing and Cataloging: Using AWS Glue Crawler to automatically catalog the data stored in S3.
Data Analysis: Running SQL queries on the data using Amazon Athena.
Data Visualization: Creating insightful visualizations with Amazon QuickSight.
Results Storage: Storing SQL query results in an S3 bucket.
Detailed Steps Covered:
1. Setting up S3 buckets for raw and processed data.
2. Implementing web scraping with BeautifulSoup.
3. Configuring AWS Glue Crawler and Data Catalog.
4. Running data analysis queries with Amazon Athena.
5. Visualizing data with Amazon QuickSight.
#AWS #DataEngineering #CloudComputing #DataPipeline #BigData #WebScraping #AmazonS3 #AWSGlue #AmazonAthena #AmazonQuickSight #Python #BeautifulSoup

Пікірлер: 1
😜 #aminkavitaminka #aminokka #аминкавитаминка
00:14
Аминка Витаминка
Рет қаралды 726 М.
Kluster Duo #настольныеигры #boardgames #игры #games #настолки #настольные_игры
00:47
ДЕНЬ УЧИТЕЛЯ В ШКОЛЕ
01:00
SIDELNIKOVVV
Рет қаралды 4 МЛН
Basic Data Engineering Project - End-To-End From Web Scraping to Tableau
48:53
Introduction to AWS Services
38:54
AWS with Chetan
Рет қаралды 2,2 МЛН
Intro to AWS - The Most Important Services To Learn
50:07
Be A Better Dev
Рет қаралды 432 М.
😜 #aminkavitaminka #aminokka #аминкавитаминка
00:14
Аминка Витаминка
Рет қаралды 726 М.