Sound Data Engineering in Rust-From Bits to DataFrames

  Рет қаралды 11,068

Databricks

Databricks

Күн бұрын

Spark applications often need to query external data sources such as file-based data sources or relational data sources. In order to do this, Spark provides Data Source APIs to access structured data through Spark SQL.
Data Source APIs have optimization rules such as filter push down and column pruning to reduce the amount of data that needs to be processed to improve query performance. As part of our ongoing project to provide generic Data Source V2 push down APIs, we have introduced partial aggregate push down, which significantly speeds up spark jobs by dramatically reducing the amount of data transferred between data sources and Spark. We have implemented aggregate push down in both JDBC and parquet.
Connect with us:
Website: databricks.com
Facebook: / databricksinc
Twitter: / databricks
LinkedIn: / data. .
Instagram: / databricksinc

Пікірлер: 10
@che5ari
@che5ari 8 ай бұрын
Great talk. Many thanks for this.
@rajdeeproychowdhury3450
@rajdeeproychowdhury3450 4 ай бұрын
Great talk, there is typo at 19:10 "graphana" -> "grafana"
@Rene-tu3fc
@Rene-tu3fc Жыл бұрын
interesting talk. the video description does not match the content, though
@gw1284
@gw1284 Жыл бұрын
Thank you
@konstantinrebrov675
@konstantinrebrov675 Жыл бұрын
I think that the IDE window is much too small. It should have been in the full screen, without the "Demo" title and without the face of the lecturer. In Visual Studio Code you can maximize the editor window by hitting F10, if I'm not wrong.
@budiardjo6610
@budiardjo6610 Жыл бұрын
this person is cool af
@mehdiyahiacherif2326
@mehdiyahiacherif2326 Жыл бұрын
19:25 graphana :/ , good talk
@talt6856
@talt6856 Жыл бұрын
Data.table is and R package… not Julia
@weouthere6902
@weouthere6902 Жыл бұрын
What's up with dask..? Slower than pandas??
@theLowestPointInMyLife
@theLowestPointInMyLife Жыл бұрын
Pandas probably uses c
How to Implement a Semantic Layer for Your Lakehouse
35:49
Databricks
Рет қаралды 12 М.
🍕Пиццерия FNAF в реальной жизни #shorts
00:41
They RUINED Everything! 😢
00:31
Carter Sharer
Рет қаралды 22 МЛН
$10,000 Every Day You Survive In The Wilderness
26:44
MrBeast
Рет қаралды 121 МЛН
Rust for Python data engineers - Karim Jedda
27:30
EuroPython Conference
Рет қаралды 4,9 М.
Understanding Ownership in Rust
25:31
Let's Get Rusty
Рет қаралды 239 М.
Apache Arrow DataFusion Architecture Part 1
30:53
Andrew Lamb
Рет қаралды 3,7 М.
Why is the JavaScript ecosystem switching to Rust?
48:08
chris biscardi
Рет қаралды 128 М.
The Rustvolution: How Rust Is the Future of Cloud Native - Flynn, Buoyant
33:51
CNCF [Cloud Native Computing Foundation]
Рет қаралды 2,4 М.
The columnar roadmap: Apache Parquet and Apache Arrow
41:39
DataWorks Summit
Рет қаралды 32 М.
i love you subscriber ♥️ #iphone #iphonefold #shortvideo
0:14
Задача APPLE сделать iPHONE НЕРЕМОНТОПРИГОДНЫМ
0:57