GPU optimization workshop (hosted by

  Рет қаралды 12,229

MLOps Learners

MLOps Learners

29 күн бұрын

00:30 Workshop overview
03:51 Crash course to GPU optimization (Mark Saroufim, Meta)
39:18 High performance LLM serving on NVIDIA GPUs (Sharan Chetlur, NVIDIA)
1:19:18 Block-based GPU Programming with Triton (Philippe Tillet, OpenAI)
1:59:00 Scaling data processing from CPU to distributed GPUs (William Malpica, Voltron Data)
Join the discussion on Discord: / discord
Shared note (during the event): docs.google.com/document/d/1T...
GitHub repo with schedule: github.com/mlops-discord/gpu-...
​Philippe Tillet, who’s leading the Triton team at OpenAI. Previously, he was at pretty much all major chip makers including NVIDIA, AMD, Intel, and Nervana.
​Sharan Chetlur, Principal engineer working on TensorRT-LLM at NVIDIA. He’s been working on CUDA since 2012, having optimized the performance of deep learning models from single GPU to full data center scale. Previously, he was Director of Engineer on Kernels team at Cerebras.
​William Malpica, co-founder of Voltron Data and creator of BlazingSQL. He helped scale our GPU-native query engine to handle 100TB queries!
Mark Saroufim, PyTorch core developer and cofounder of CUDA MODE. He also ran the really fun NeurIPS LLM Efficiency challenge last year. Previously, he was at Graphcore and Microsoft.

Пікірлер: 7
@KSK986
@KSK986 4 күн бұрын
Great workshop !!! Very much insightful. Thanks to the organizers and all the speakers.
@VipulVaibhaw
@VipulVaibhaw 27 күн бұрын
This was fantastic and very helpful!
@SomeshChatterjee
@SomeshChatterjee 12 күн бұрын
Thank you so much for this amazing content!!
@sankeerth1729
@sankeerth1729 19 күн бұрын
Thanks for organizing this, Chip!
@kevthedestroyer1044
@kevthedestroyer1044 24 күн бұрын
The discord link is not working, would love if someone can share a new one!
@mlopslearners
@mlopslearners 23 күн бұрын
discord.gg/6wRKjvMm
Must-have gadget for every toilet! 🤩 #gadget
00:27
GiGaZoom
Рет қаралды 2,9 МЛН
Sprinting with More and More Money
00:29
MrBeast
Рет қаралды 183 МЛН
WHO DO I LOVE MOST?
00:22
dednahype
Рет қаралды 60 МЛН
Writing Code That Runs FAST on a GPU
15:32
Low Level Learning
Рет қаралды 541 М.
4. How Kafka Works | Apache Kafka Fundamentals
26:41
Confluent
Рет қаралды 187 М.
Mind-bending new programming language for GPUs just dropped...
4:01
Marker: This Open-Source Tool will make your PDFs LLM Ready
14:11
Prompt Engineering
Рет қаралды 34 М.
A Path Towards Autonomous Machine Intelligence with Dr. Yann LeCun
1:03:05
AFOSR, Air Force Office of Scientific Research
Рет қаралды 18 М.
Developing SAP's AI Web App with OpenUSD
1:03:05
NVIDIA Omniverse
Рет қаралды 533
Must-have gadget for every toilet! 🤩 #gadget
00:27
GiGaZoom
Рет қаралды 2,9 МЛН