Wow. Great speaker. Learned a lot. Nice to see someone is dealing with the same problem.
@dsinghr4 жыл бұрын
Composer vs Airflow: Airflow version upgrade is a nightmare. you won't have to worry about that if you use Composer. Another advantage is ipv4 addresses. As they are limited, you don;t have to think too much about them if you use composer. Imagine you created multiple namespaces for different use cases and each use case has 3-5 different environments, just think about how many IP addresses you would need. You may exhaust you quota pretty quickly that way. So composer is great. But i think it is still in beta.
@MarioRugeles2 жыл бұрын
I got a question: Why not use AWS EMR's autoscaling for the spark layer?
@supermousedd5 жыл бұрын
Very Cooooool!
@sumitkumarsahoo3 жыл бұрын
Can anyone tell me what is that commonization tool to being in schema or columns for transformation or joining? Curious about it, seems it's inhouse built in that organization
@Funfina2 жыл бұрын
What could be a common schema ?
@atampanday60854 жыл бұрын
why not use EKS?
@dsinghr4 жыл бұрын
why won't you use cloud dataflow on GCP instead of Spark? You then won't have to worry about Kubernetes at all as far as etl is concerned. Airflow itself should definitely run inside Kubernetes.