Rewriting History: Migrating petabytes of data to Apache Iceberg using Trino

Best practices and insights when migrating to Apache Iceberg for data engineers

How Data Engineering Works

ЧТО ЖЕ МЫ КУПИЛИ СОБАКЕ ВМЕСТО ТАБАЛАПОК😱#shorts

Cat mode and a glass of water #family #humor #fun

SHK TV - We have a new robot - Be nice to people around you #shorts #sadstory #SHK

人是不能做到吗？#火影忍者 #家人 #佐助

Rewriting History: Migrating petabytes of data to Apache Iceberg using Trino

Рет қаралды 3,547

Trino

Күн бұрын

Пікірлер: 2

@Mentaloow Жыл бұрын

Rather than considering writing your own task scheduler/runner, consider using the open-source HPC tools out there.. Slurm with auto-scaling is an absolute beast, as it was designed, and is used, to schedule millions of jobs daily for thousands of users against extremely busy/constrained super-computers around the world (over 60% of the supercomputers use it) - job runtimes ranging from sub-second to months. And you benefit from a massive set of other features such as user/team management, quotas, accounting/budgeting, flexible scheduler resources/constraints..

@stavetx Жыл бұрын

Hmmm. But may be such great difference between json+gzip vs iceberg+parquet is not point of the iceberg. Binary parquet (with metadata in it) vs text json...

Best practices and insights when migrating to Apache Iceberg for data engineers

32:57

Best practices and insights when migrating to Apache Iceberg for data engineers

Trino

Рет қаралды 1,3 М.

How Data Engineering Works

14:14

How Data Engineering Works

AltexSoft

Рет қаралды 473 М.

ЧТО ЖЕ МЫ КУПИЛИ СОБАКЕ ВМЕСТО ТАБАЛАПОК😱#shorts

00:34

ЧТО ЖЕ МЫ КУПИЛИ СОБАКЕ ВМЕСТО ТАБАЛАПОК😱#shorts

INNA SERG

Рет қаралды 7 МЛН

Cat mode and a glass of water #family #humor #fun

00:22

Cat mode and a glass of water #family #humor #fun

Kotiki_Z

Рет қаралды 42 МЛН

SHK TV - We have a new robot - Be nice to people around you #shorts #sadstory #SHK

00:46

SHK TV - We have a new robot - Be nice to people around you #shorts #sadstory #SHK

SHK TV

Рет қаралды 14 МЛН

人是不能做到吗？#火影忍者 #家人 #佐助

00:20

人是不能做到吗？#火影忍者 #家人 #佐助

火影忍者一家

Рет қаралды 20 МЛН

Trino Demonstration With PostgreSQL and MySQL

6:26

Trino Demonstration With PostgreSQL and MySQL

Pandio

Рет қаралды 6 М.

Trino for Large Scale ETL at Lyft

40:59

Trino for Large Scale ETL at Lyft

Trino

Рет қаралды 2,7 М.

What’s Next for Lakehouse in 2025 With Databricks and CelerData

59:46

What’s Next for Lakehouse in 2025 With Databricks and CelerData

CelerData

Рет қаралды 855

Apache Iceberg Tutorial: Learn the Problem & Solution Behind Iceberg's Origin Story

23:13

Apache Iceberg Tutorial: Learn the Problem & Solution Behind Iceberg's Origin Story

Dremio

Рет қаралды 46 М.

Demandbase Ditches Denormalization By Switching off ClickHouse

47:19

Demandbase Ditches Denormalization By Switching off ClickHouse

CelerData

Рет қаралды 369

Why You Shouldn’t Care About Iceberg | Tabular

20:26

Why You Shouldn’t Care About Iceberg | Tabular

Data Council

Рет қаралды 14 М.

Using Alluxio caching via the Iceberg connector over MinIO file storage

32:42

Using Alluxio caching via the Iceberg connector over MinIO file storage

Trino

Рет қаралды 2,1 М.

How Dremio implemented Materialized Views with Iceberg?

39:20

How Dremio implemented Materialized Views with Iceberg?

Apache Iceberg

Рет қаралды 528

Building an Open Data Lake House Using Trino and Apache Iceberg

47:06

Building an Open Data Lake House Using Trino and Apache Iceberg

Data Science Connect

Рет қаралды 8 М.

Journey to Iceberg with SK Telecom

29:30

Journey to Iceberg with SK Telecom

Trino

Рет қаралды 1,1 М.

ЧТО ЖЕ МЫ КУПИЛИ СОБАКЕ ВМЕСТО ТАБАЛАПОК😱#shorts

00:34

ЧТО ЖЕ МЫ КУПИЛИ СОБАКЕ ВМЕСТО ТАБАЛАПОК😱#shorts

INNA SERG

Рет қаралды 7 МЛН