Rewriting History: Migrating petabytes of data to Apache Iceberg using Trino

  Рет қаралды 3,547

Trino

Trino

Күн бұрын

Пікірлер: 2
@Mentaloow
@Mentaloow Жыл бұрын
Rather than considering writing your own task scheduler/runner, consider using the open-source HPC tools out there.. Slurm with auto-scaling is an absolute beast, as it was designed, and is used, to schedule millions of jobs daily for thousands of users against extremely busy/constrained super-computers around the world (over 60% of the supercomputers use it) - job runtimes ranging from sub-second to months. And you benefit from a massive set of other features such as user/team management, quotas, accounting/budgeting, flexible scheduler resources/constraints..
@stavetx
@stavetx Жыл бұрын
Hmmm. But may be such great difference between json+gzip vs iceberg+parquet is not point of the iceberg. Binary parquet (with metadata in it) vs text json...
How Data Engineering Works
14:14
AltexSoft
Рет қаралды 473 М.
Cat mode and a glass of water #family #humor #fun
00:22
Kotiki_Z
Рет қаралды 42 МЛН
人是不能做到吗?#火影忍者 #家人  #佐助
00:20
火影忍者一家
Рет қаралды 20 МЛН
Trino Demonstration With PostgreSQL and MySQL
6:26
Pandio
Рет қаралды 6 М.
Trino for Large Scale ETL at Lyft
40:59
Trino
Рет қаралды 2,7 М.
Why You Shouldn’t Care About Iceberg | Tabular
20:26
Data Council
Рет қаралды 14 М.
How Dremio implemented Materialized Views with Iceberg?
39:20
Apache Iceberg
Рет қаралды 528
Building an Open Data Lake House Using Trino and Apache Iceberg
47:06
Data Science Connect
Рет қаралды 8 М.
Journey to Iceberg with SK Telecom
29:30
Trino
Рет қаралды 1,1 М.