Tutorials - Gil, Jim, Naty, Phillip: Introduction to Ibis: blazing fast analytics with DuckDB...

  Рет қаралды 775

PyCon US

PyCon US

Ай бұрын

Full title: Tutorials: Introduction to Ibis: blazing fast analytics with DuckDB, Polars, Snowflake, and more, from the comfort of your Python repl.
Presented by:
Gil Forsyth
Jim Crist-Harif
Naty Clementi
Phillip Cloud
Tabular data is ubiquitous, and pandas has been the de facto tool in Python for analyzing it. However, as data size scales, analysis using pandas may become untenable. Luckily, modern analytical databases (like DuckDB) are able to analyze this same tabular data, but perform orders-of-magnitude faster than pandas, all while using less memory. Many of these systems only provide a SQL interface though; something far different from pandas’ dataframe interface, requiring a rewrite of your analysis code.
This is where Ibis comes in. Ibis provides a common dataframe interface to many popular databases and analytics tools (BigQuery, Snowflake, Spark, DuckDB, …). This lets users analyze data using the same consistent API, regardless of which backend they’re using, and without ever having to learn SQL (but you can use SQL if you want to!). No more pains rewriting pandas code to something else when you run into performance issues; write your code once using Ibis and run it on any supported backend.
In this tutorial we’ll cover:
The basic operations of Ibis (select, filter, group_by, order_by, join, and aggregate), and how these operations may be composed to form more complicated queries.
How Ibis may be used on a number of different local and remote backend engines to execute the same queries on different systems.
The tradeoffs of different database engines, and recommendations for how to choose the best tool for the job.
How Ibis integrates into the larger Python data ecosystem, including tools like Scikit-Learn, Matplotlib, PyArrow, pandas, Altair, and VegaFusion.
This is a hands-on tutorial, with numerous examples to get your hands dirty. Participants should ideally have some experience using Python and pandas, but no SQL experience is necessary.

Пікірлер
Why should you care about DuckDB? ft. Mihai Bojin
14:35
MotherDuck
Рет қаралды 8 М.
Playing hide and seek with my dog 🐶
00:25
Zach King
Рет қаралды 34 МЛН
What it feels like cleaning up after a toddler.
00:40
Daniel LaBelle
Рет қаралды 83 МЛН
Heartwarming Unity at School Event #shorts
00:19
Fabiosa Stories
Рет қаралды 23 МЛН
Double Stacked Pizza @Lionfield @ChefRush
00:33
albert_cancook
Рет қаралды 100 МЛН
An Introductory QGIS Workshop for Beginners
3:49:41
QGIS North America
Рет қаралды 460 М.
DuckDB and PostGIS: Your geospatial super duo
55:38
Matt Forrest
Рет қаралды 816
Full Stack Python Application FastAPI, HTMX, SQLite Part 1
32:24
CSPythonForScience
Рет қаралды 1 М.
🔴Build Twitter Fleets (stories) with React Native and AWS Amplify
3:43:10
CrowdStrike IT Outage Explained by a Windows Developer
13:40
Dave's Garage
Рет қаралды 2,1 МЛН
ChatGPT for Data Analytics: Full Course
3:35:30
Luke Barousse
Рет қаралды 231 М.
Wolfram Physics Project Launch
3:50:19
Wolfram
Рет қаралды 1,6 МЛН
How To Choose The Right Database?
6:58
ByteByteGo
Рет қаралды 299 М.
Houdini Algorithmic Live #060 - Quadrilateral Remesh
3:53:40
Junichiro Horikawa
Рет қаралды 393 М.
Playing hide and seek with my dog 🐶
00:25
Zach King
Рет қаралды 34 МЛН