How to Use Great Expectations for Data Quality Checks with Airflow

5,019 views

The Data Guy

Days ago

Comments: 25
@joaovitoralmeidaaraujobelc6993 6 months ago
Very simple video with excellent explanation and not overcomplicating things. Thanks for sharing it!
@thedataguygeorge 6 months ago
Thanks for watching!
@roopashastri9908 5 months ago
Great explanation! Any thoughts on how we can save the Great Expectations results in a database?
@thedataguygeorge 4 months ago
I would configure the expectation results storage location to be a bucket, and then have a pipeline that takes the expectation results and stores them in a database.
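A minimal sketch of the second half of that pipeline, assuming the validation result has already been read from the bucket as JSON. The field names (`success`, `results`, `expectation_config`, `expectation_type`, `kwargs`) follow the layout of validation results in Great Expectations 0.x releases and may differ in other versions. The table name `gx_results`, the helper `store_validation_result`, and the use of SQLite are placeholders for illustration only.

```python
import json
import sqlite3
from datetime import datetime, timezone


def store_validation_result(conn, suite_name, result):
    """Write one row per expectation from a validation result dict."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS gx_results (
            loaded_at TEXT,
            suite_name TEXT,
            expectation_type TEXT,
            kwargs TEXT,
            success INTEGER
        )
        """
    )
    loaded_at = datetime.now(timezone.utc).isoformat()
    rows = [
        (
            loaded_at,
            suite_name,
            item["expectation_config"]["expectation_type"],
            json.dumps(item["expectation_config"].get("kwargs", {}), sort_keys=True),
            int(bool(item["success"])),
        )
        for item in result.get("results", [])
    ]
    conn.executemany("INSERT INTO gx_results VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


# Hand-written sample standing in for a result file fetched from the bucket.
sample_result = {
    "success": False,
    "results": [
        {
            "expectation_config": {
                "expectation_type": "expect_column_values_to_not_be_null",
                "kwargs": {"column": "id"},
            },
            "success": True,
        },
        {
            "expectation_config": {
                "expectation_type": "expect_column_values_to_be_between",
                "kwargs": {"column": "weight_grams", "min_value": 0},
            },
            "success": False,
        },
    ],
}

conn = sqlite3.connect(":memory:")  # swap for a real database connection
inserted = store_validation_result(conn, "strawberry_suite", sample_result)
failed = conn.execute(
    "SELECT COUNT(*) FROM gx_results WHERE success = 0"
).fetchone()[0]
print(inserted, failed)
```

Storing one row per expectation, rather than one per run, makes it easy to query which checks fail most often.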
@VarunDeep-x5j 1 month ago
Hi - Thanks for sharing this video. When I tried running it, I kept getting errors like "raise gx_exceptions.DataContextError( great_expectations.exceptions.exceptions.DataContextError: expectation_suite strawberry_suite not found". During a deep dive into the code, I found there is a condition "self.expectations_store.has_key(key)". Am I missing something concerning the store?
@roopashastri9908 5 months ago
Also how can we automate the threshold changes with the changing business needs?
@thedataguygeorge 4 months ago
You'd want to have another helper pipeline that checks for changing business requirements and then either alerts you or makes adjustments.
@roopashastri9908 5 months ago
On failure of a Great Expectations validation, would this raise alerts?
@thedataguygeorge 4 months ago
Yes, as long as you have alerts configured for your Airflow DAG.
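For reference, one common way to configure such alerts is an `on_failure_callback`, which Airflow calls with the task context when a task fails. The sketch below is hedged: it assumes the validation task is set up to fail when validation fails, `format_alert` and `notify_failure` are hypothetical names, and the message is only printed where a real setup would call Slack, email, or a pager. The fake context stands in for what Airflow would pass.

```python
from types import SimpleNamespace


def format_alert(context):
    """Build a short alert message from an Airflow task context dict."""
    ti = context["task_instance"]
    error = context.get("exception")
    return (
        f"Data quality check failed: dag={ti.dag_id} "
        f"task={ti.task_id} error={error}"
    )


def notify_failure(context):
    """Callback with the single-argument signature on_failure_callback expects."""
    message = format_alert(context)
    print(message)  # placeholder: replace with a Slack, email, or pager call
    return message


# In a DAG file the callback would be attached roughly like this
# (shown as a comment so the sketch runs without Airflow installed):
#   default_args = {"on_failure_callback": notify_failure}
#   with DAG("gx_dag", default_args=default_args, ...) as dag:
#       ...

# Simulated context for a failed validation task (all names are made up).
fake_context = {
    "task_instance": SimpleNamespace(
        dag_id="gx_dag", task_id="validate_strawberries"
    ),
    "exception": "Validation failed",
}
alert = notify_failure(fake_context)
```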
@LucasGomes-q9t 4 months ago
At minute 3:57, how could I create the default great_expectations file? I created the JSON but got a blank one.
@thedataguygeorge 4 months ago
You then just fill out that JSON with all the expectation info you want!
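As a rough illustration of what a filled-out file can look like, the sketch below writes a suite containing two expectations. It assumes the JSON layout used by Great Expectations 0.x suites (`expectation_suite_name`, `expectations`, `expectation_type`, `kwargs`); newer GX versions may expect a different layout. The column names `id` and `weight_grams`, the thresholds, and the temporary output directory are made up for the example.

```python
import json
import tempfile
from pathlib import Path

# Suite contents: one entry in "expectations" per check you want to run.
suite = {
    "expectation_suite_name": "strawberry_suite",
    "expectations": [
        {
            "expectation_type": "expect_column_values_to_not_be_null",
            "kwargs": {"column": "id"},
            "meta": {},
        },
        {
            "expectation_type": "expect_column_values_to_be_between",
            "kwargs": {"column": "weight_grams", "min_value": 0, "max_value": 100},
            "meta": {},
        },
    ],
    "meta": {},
}

# A temporary directory is used here; in a real project the file would go
# wherever your expectations store is configured to look.
out_dir = Path(tempfile.mkdtemp())
suite_path = out_dir / "strawberry_suite.json"
suite_path.write_text(json.dumps(suite, indent=2))

# Read it back to confirm the file is valid JSON and not blank.
loaded = json.loads(suite_path.read_text())
print(len(loaded["expectations"]), "expectations written to", suite_path)
```

The file name matching the suite name is a convention assumed here, not something confirmed in the video.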
@BubbaB2323 1 year ago
Very useful bud. Thank you.
@thedataguygeorge 1 year ago
No problem, do it all for you!
@BubbaB2323 1 year ago
@thedataguygeorge will reach out on the side to talk shop if that's cool, loving your work.
@thedataguygeorge 1 year ago
Always cool!
@criistiina71 4 months ago
May I know if we can create our own expectations? If I need an expectation that is not in the script from the documentation, could I create my own? For example, if one column is computed from a formula and uses a different database, could I create an expectation that checks the math? Hi from Colombia :)
@thedataguygeorge 4 months ago
You can definitely create your own expectations, honestly one of the best features of Great Expectations!
@criistiina71 4 months ago
@thedataguygeorge Do you have a video tutorial on how to connect GX with Databricks? 😊
@roopashastri9908 5 months ago
Also can we include more than one expectation in the expectation file?
@thedataguygeorge 4 months ago
Yes!
@karangupta_DE 1 year ago
Hi, do you prefer Soda or Great Expectations?
@thedataguygeorge 1 year ago
I've only recently started using Soda, so I'm not sure I have enough experience to form a definitive opinion, but I have definitely enjoyed the UX much more so far. SodaCL is a lot more human-readable than Great Expectations' "expectations" imo.
@maheshbhatm9998 10 months ago
Thank You
@thedataguygeorge 10 months ago
No worries, let me know if there are any other videos you'd like to see!