Airflow Tutorial: Running Data Quality Checks with Snowflake and Soda

  Рет қаралды 8,109

Data with Marc

Data with Marc

Күн бұрын

Airflow Tutorial: Running Data Quality Checks with Snowflake and Soda
🏆 BECOME A PRO WITH AIRFLOW: www.udemy.com/...
In this project, you will learn:
✅ How to set up an Airflow environment with the Astro CLI
✅ How to set up and configure Snowflake with Airflow
✅ How to load data from an HTTP endpoint into a Snowflake table
✅ How to use python virtual environments to avoid dependency conflicts
✅ How to run data quality checks with Soda and the ExternalPythonOperator
Enjoy ❤️

Пікірлер: 16
@hizokadarkwolf
@hizokadarkwolf Жыл бұрын
I just added this video to my weekend KZbin playlist, but thanks, Mark, for consistently creating great tutorials and sharing tips for data people in simple human language.
@MarcLamberti
@MarcLamberti Жыл бұрын
Thank you ❤️
@rishirajtandon3849
@rishirajtandon3849 10 ай бұрын
@MarcLamberti can we do soda integration wthout storing Snowflake password in code?
@JoseR-ui9vn
@JoseR-ui9vn 7 ай бұрын
Do we need Docker Desktop to installed in the machine?
@MarcLamberti
@MarcLamberti 7 ай бұрын
Yes
@orafaelgf
@orafaelgf Жыл бұрын
Great video. This complete environment that you used can I to use in prod? (astro, dbt soda, etc) And eveything is free?
@MarcLamberti
@MarcLamberti Жыл бұрын
Yes except snowflake
@karantatariya1303
@karantatariya1303 4 ай бұрын
Kindly assist; Still facing connection issues: getting error snowflake.connector.errors.OperationalError: 250001: 250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting,
@JoseR-ui9vn
@JoseR-ui9vn 7 ай бұрын
How to enable Test connection in Airflow.
@O9W5I7Q
@O9W5I7Q Жыл бұрын
I have a problem with @task.external_python decorator. It seems that it also requires airflow package in soda_venv virtual environment which is kinda odd..
@MarcLamberti
@MarcLamberti Жыл бұрын
You can find the code in the link in description from this video kzbin.info/www/bejne/eqvbpXaunpmMl6M :)
@O9W5I7Q
@O9W5I7Q Жыл бұрын
@@MarcLamberti Particularly, I am getting an error "ModuleNotFoundError: No module named 'airflow'" and "ModuleNotFoundError: No module named 'pendulum'" when running "astro@f867522c2cb4:/usr/local/airflow$ airflow tasks test movie check_movie" (the same error is obtained also if I step into the virtual environment using "astro@f867522c2cb4:/usr/local/airflow$ source soda_venv/bin/activate")
@O9W5I7Q
@O9W5I7Q Жыл бұрын
Is this tutorial anywhere on Git?
@mikekenneth2339
@mikekenneth2339 Жыл бұрын
Great video Marc, Very well put together. 👏👏👏👏👏👏 PS: It would been nice if you add the link to the source movie file in the comment or description for easier follow up.
@MarcLamberti
@MarcLamberti Жыл бұрын
Thank you so much! will do
@kaycullen8707
@kaycullen8707 Жыл бұрын
*PromoSM*
Airflow with DBT tutorial - The best way!
17:54
Data with Marc
Рет қаралды 53 М.
We Attempted The Impossible 😱
00:54
Topper Guild
Рет қаралды 56 МЛН
1% vs 100% #beatbox #tiktok
01:10
BeatboxJCOP
Рет қаралды 67 МЛН
VIP ACCESS
00:47
Natan por Aí
Рет қаралды 30 МЛН
It’s all not real
00:15
V.A. show / Магика
Рет қаралды 20 МЛН
How to Get Started with Soda for Data Quality Checks!
11:48
The Data Guy
Рет қаралды 738
How to Use Great Expectations for Data Quality Checks with Airflow
10:39
Airflow Data Pipeline with AWS and Snowflake for Beginners | Project
24:09
Soda Core | Open-Source Data Reliability As-Code
10:48
Soda
Рет қаралды 2,6 М.
Soda Data Reliability Engineering
24:26
Data Council
Рет қаралды 2 М.
Elevating Data Quality: Great Expectations and Airflow at PepsiCo
23:54
Data Quality and Reliability with Soda Core - Vijay Kiran
1:30:59
DataTalksClub ⬛
Рет қаралды 4,3 М.
Apache Airflow Tutorial for Data Engineers
55:32
TechTalkSourav
Рет қаралды 17 М.
What's new in Apache Airflow 2.7?
12:55
Data with Marc
Рет қаралды 4,7 М.
We Attempted The Impossible 😱
00:54
Topper Guild
Рет қаралды 56 МЛН