Scalable Data Ingestion Architecture Using Airflow and Spark | Komodo Health

32,170 views

Data Council

A day ago

Comments: 8
@Barnabassteiniger 2 years ago
Wow. Great speaker. Learned a lot. Nice to see someone is dealing with the same problem.
@dsinghr 4 years ago
Composer vs. Airflow: upgrading the Airflow version is a nightmare, and you don't have to worry about that if you use Composer. Another advantage is IPv4 addresses. Because they are limited, you don't have to think much about them if you use Composer. Imagine you created multiple namespaces for different use cases, and each use case has 3-5 different environments; just think about how many IP addresses you would need. You could exhaust your quota pretty quickly that way. So Composer is great, but I think it is still in beta.
@MarioRugeles 2 years ago
I have a question: why not use AWS EMR's autoscaling for the Spark layer?
@supermousedd 5 years ago
Very Cooooool!
@sumitkumarsahoo 3 years ago
Can anyone tell me what that commonization tool is, the one used to bring in schemas or columns for transformation or joining? I'm curious about it; it seems to be built in-house at that organization.
@Funfina 2 years ago
What could be a common schema?
@atampanday6085 4 years ago
Why not use EKS?
@dsinghr 4 years ago
Why not use Cloud Dataflow on GCP instead of Spark? Then you won't have to worry about Kubernetes at all as far as ETL is concerned. Airflow itself should definitely run inside Kubernetes.