Apache Arrow and Substrait, the secret foundations of Data Engineering - Alessandro Molina

  Views: 1,176

EuroPython Conference

9 months ago

[EuroPython 2023 - North Hall on 2023-07-19]
ep2023.europython.eu/session/...
Apache Arrow and its Python library, PyArrow, are becoming the de facto standard for transferring data and for interoperability between libraries and languages. As more compute engines, storage systems and databases start to speak Arrow, you might be relying on it without even knowing.
The same transformation is happening with Substrait, which is on track to become the standard representation of query plans themselves. This allows queries to be routed to different engines as long as they speak Substrait, or even decomposed and forwarded to several engines.
This talk will provide a quick introduction to the Arrow ecosystem, showing Python developers how libraries like Pandas, Polars and PyArrow itself leverage Arrow, and how compute engines like Velox, DataFusion and Acero are embracing Arrow and Substrait.
The talk will also show how a basic database system based on Arrow and Substrait can be built with a minimum amount of code thanks to all the foundations they provide.
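To make the idea of routing one query plan to different engines concrete, here is a toy sketch in plain Python. It is not taken from the talk and it does not use the real Substrait format or the PyArrow API: the `Plan` class, the two engines, and the sample table are all hypothetical names invented for illustration. The only point it tries to show is that when a plan is plain data rather than engine-specific code, any engine that understands the plan can run it over the same columnar data.

```python
from dataclasses import dataclass

# Toy stand-in for a query plan. This is NOT the Substrait format,
# just a plain, engine-neutral description of "filter, then project".
@dataclass(frozen=True)
class Plan:
    table: str       # name of the table to read
    column: str      # column to filter on
    min_value: int   # keep rows where column >= min_value
    project: str     # column to return

# Columnar storage in the spirit of Arrow: one list per column.
# The data is made up for the example.
TABLES = {
    "events": {
        "name": ["a", "b", "c", "d"],
        "count": [5, 12, 7, 20],
    }
}

def loop_engine(plan: Plan) -> list:
    """Hypothetical engine A: walks the filter column row by row."""
    cols = TABLES[plan.table]
    out = []
    for i, value in enumerate(cols[plan.column]):
        if value >= plan.min_value:
            out.append(cols[plan.project][i])
    return out

def mask_engine(plan: Plan) -> list:
    """Hypothetical engine B: builds a boolean mask, then applies it."""
    cols = TABLES[plan.table]
    mask = [value >= plan.min_value for value in cols[plan.column]]
    return [x for x, keep in zip(cols[plan.project], mask) if keep]

# Registry of engines that "speak" the toy plan format.
ENGINES = {"loop": loop_engine, "mask": mask_engine}

def run(plan: Plan, engine: str) -> list:
    """Route the same plan to whichever engine is requested."""
    return ENGINES[engine](plan)

plan = Plan(table="events", column="count", min_value=10, project="name")
result_loop = run(plan, "loop")
result_mask = run(plan, "mask")
```

Both engines receive the identical `plan` object and return the same rows, even though they evaluate it differently. In the real ecosystem, Substrait plays the role of the shared plan and Arrow the role of the shared columnar data.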
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License creativecommons.org/licenses/b...
