Data Pipelines: How to make them better

  Рет қаралды 4,562

nullQueries

nullQueries

Күн бұрын

Пікірлер: 3
@pipicovers
@pipicovers 8 ай бұрын
Steps to make pipeline better 1. Good auditing and logging: error handling 2. Repeatable and identical 3. Self healing: finding a way to find the delta , log files and compare, add a data lake before data warehouse , add hash or water marks before compare 4. Decouple EL and T: Landon Rae formate, transform to Dwh, make reporting table clean, 5. Always available: trancate and load refresh faster than update. Or build semantic layer 6. CICD: coded, git connected, versioned , rollbacks
@MrHaste12
@MrHaste12 2 жыл бұрын
Thanks for the video. Do you have an example of a pipeline built from scratch following the best practices mentioned in the video? Text/book or course-based doesn't matter
@user-ho4bl8wo2p
@user-ho4bl8wo2p 2 жыл бұрын
great video thanks for your effort but could you make more videos about building pipelines with open source tools that would greatly benefits people who just started in that field before jumping directly in the world of cloud
Data Engineering Isn't That Complicated
3:07
nullQueries
Рет қаралды 1,6 М.
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 62 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
Avoid These Mistakes in Realistic Data Architectures
5:51
nullQueries
Рет қаралды 3,1 М.
Building a Data Pipeline
4:45
nullQueries
Рет қаралды 8 М.
Do you Need a Data Warehouse?
5:17
nullQueries
Рет қаралды 4,8 М.
Don't Pick the Wrong Data Career
7:41
nullQueries
Рет қаралды 4,3 М.
Frameworks of Data Governance
5:02
nullQueries
Рет қаралды 4,4 М.
What Does ETL Mean?  And How Does it Apply to Data Integration?
4:59
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 62 МЛН