Ten years of building open source standards: From Parquet to Arrow to OpenLineage | Astronomer

  Рет қаралды 1,009

Data Council

Data Council

Күн бұрын

ABOUT THE TALK:
Over the last decade I have been lucky enough to contribute a few successful open source projects to the data ecosystem. In this talk
Julien Le Dem shares the story of his contribution to successful open source projects to the data ecosystem and what made their success possible. From the ideation process and early growth of the Apache Parquet columnar format and how this led to the creation of its in-memory alter-ego Apache Arrow. Julian will end with showing how this experience enabled the success of OpenLineage, an LFAI & Data project that brings observability to the data ecosystem.
ABOUT THE SPEAKER:
Julien Le Dem is the Chief Architect of Astronomer and Co-Founder of Datakin. He co-created Apache Parquet and is involved in several open source projects including OpenLineage, Marquez (LFAI&Data), Apache Arrow, Apache Iceberg and a few others. Previously, he was a senior principal at Wework; principal architect at Dremio; and tech lead for Twitter’s data processing tools and principal engineer working on content platforms at Yahoo, where he received his Hadoop initiation.
ABOUT DATA COUNCIL:
Data Council (www.datacounci...) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.
Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.
FOLLOW DATA COUNCIL:
Twitter: / datacouncilai
LinkedIn: / datacouncil-ai

Пікірлер
Apache Arrow DataFusion Architecture Part 1
30:53
Andrew Lamb
Рет қаралды 4,8 М.
Blue Food VS Red Food Emoji Mukbang
00:33
MOOMOO STUDIO [무무 스튜디오]
Рет қаралды 35 МЛН
Bike vs Super Bike Fast Challenge
00:30
Russo
Рет қаралды 22 МЛН
Шок. Никокадо Авокадо похудел на 110 кг
00:44
Пришёл к другу на ночёвку 😂
01:00
Cadrol&Fatich
Рет қаралды 3,9 МЛН
How Riot Games Uses Data to Maximize Engagement & Enjoyment
39:17
Presto and Apache Iceberg - Building out Modern Open Data Lakes
38:09
Presto Foundation
Рет қаралды 7 М.
What polars does for you - Ritchie Vink
27:45
EuroPython Conference
Рет қаралды 3,9 М.
A 101 in Time Series Analytics with Apache Arrow, Pandas and Parquet
31:42
Unified Stream/Batch Execution with Ibis
33:58
Data Council
Рет қаралды 715
Elasticsearch in an Hour
49:35
Next Day Video
Рет қаралды 121 М.
Blue Food VS Red Food Emoji Mukbang
00:33
MOOMOO STUDIO [무무 스튜디오]
Рет қаралды 35 МЛН