Accelerated Data Science with Python Polars

Python Simplified

Today we will explore Polars - one of the fastest data science libraries in Python!! 🐻‍❄️🐻‍❄️🐻‍❄️
The best part is, as of earlier this month, it got even faster thanks to a brand new GPU engine release! 🤩
We will learn about Queries, Lazy Frames, and Engines, and use them in a real-life setting: analyzing and visualizing a free dataset with over 260 million rows (22GB in size!!! way bigger than what programs like Excel or Sheets can process).
So not only will we learn how to load, compress, and process that much data all at once, but we will also plot millions of data points on the same graph!! 😱
If you think this might be challenging for Polars - prepare to be surprised!!! That's exactly where it shines, especially when the new GPU engine is involved!
⭐ More about Polars GPU on GitHub: nvda.ws/gpu-po...
⭐ Official GPU Polars Colab Notebook: nvda.ws/gpu-po...
💻Tutorial GitHub Repository 💻
----------------------------------------------------------------
github.com/Mar...
🎥 Video Commands and Links 🎥
----------------------------------------------------------------
⭐ Install Polars GPU:
!pip install polars[gpu] --extra-index-url=https://pypi.nvidia.com
⭐ Mount Google Drive:
from google.colab import drive
drive.mount('/content/drive')
⭐ Download Compressed Parquet Dataset (4GB):
For Google Colab:
!wget storage.google... -O transactions.parquet
For PC:
!wget storage.google... -O transactions.parquet
📺 Related Videos 📺
----------------------------------------------------------------
⭐ Anaconda for beginners:
• Anaconda Beginners Gui...
⭐ Basic Guide to Pandas:
• Basic Guide to Pandas!...
⏰ TIMESTAMPS ⏰
-------------------------------------------------------
00:00 - Intro
-------------------------------------------------------
⭐ QUICKSTART
00:48 - Polars in Google Colab
01:01 - Lazy Frame
02:36 - Querying
03:29 - GPU Engine
-------------------------------------------------------
⭐ WORKFLOW
04:51 - Simulated Transactions Dataset
05:25 - Install Polars and GPU Engine locally
06:33 - Read CSV File with Polars
07:07 - Compress CSV to Parquet
07:54 - Read Parquet File with Polars
-------------------------------------------------------
⭐ QUERYING
08:38 - Select Statement
09:09 - Filter Statement
10:05 - Column Data Types
10:37 - Multiple Filters
11:15 - Group By Statement
12:32 - GPU Versus CPU
13:06 - Multiple Aggregations
-------------------------------------------------------
⭐ DATA VISUALIZATION
15:40 - Bar Chart
16:15 - Scatter Plot
16:58 - Chart Width
17:17 - Chart Z Axis with Colors
17:38 - Mark Styling
18:09 - Chart Title
18:29 - Tooltip Customization
19:10 - Solve Max Rows Error
-------------------------------------------------------
20:33 - Thanks for Watching
🤝 Connect with me 🤝
----------------------------------------------------------------
🔗 Github:
github.com/mar...
🔗 X:
x.com/MariyaSh...
🔗 LinkedIn:
/ mariyasha888
🔗 Blog:
www.pythonsimp...
🔗 Discord:
/ discord
💳 Credits 💳
----------------------------------------------------------------
⭐ Beautiful titles, transitions, sound FX:
mixkit.co
⭐ Thumbnail:
flaticon.com
freepik.com
#python #pythonprogramming #polars #pandas #datascience #querying #database #cuda #gpu #pythonprojects #pythonforbeginners #graphs #plotting #dataanalytics #dataanalysis #dsa #coding #learnpython #bigdata #beginners #tutorial #codingtutorial #technology #tech
