No video

How the Record-Breaking, Cloud Native AI Supercomputer Was Built - Peter Salanki, CoreWeave

  Рет қаралды 2,709

CNCF [Cloud Native Computing Foundation]

CNCF [Cloud Native Computing Foundation]

Күн бұрын

Don't miss out! Join us at our next Flagship Conference: KubeCon + CloudNativeCon Europe in Paris from March 19-22, 2024. Connect with our current graduated, incubating, and sandbox projects as the community gathers to further the education and advancement of cloud native computing. Learn more at kubecon.io
How the Record-Breaking, Cloud Native AI Supercomputer Was Built - Peter Salanki, CoreWeave
MLCommons released the latest MLPerfs in June, announcing a new record for AI performance by a Supercomputer running on Kubernetes. In this session, we'll cover what these benchmarks mean for the AI/ML industry and how CoreWeave and NVIDIA worked together to achieve this world-record breaking result. Software and hardware engineers will discuss:
- How leveraging Kubernetes and other CNCF technologies helped build massive GPU clusters for generative AI at breakneck speed
- How the team leveraged Argo Workflows to automate health checks, testing, and lifecycle management
- How Prometheus, Grafana, Mimir and Loki is used to track bare metal and network health & performance
- Learnings from running a record-breaking MLPerf submission on Kubernetes with Slurm on Kubernetes

Пікірлер
How We Power the Largest AI Deployments on the Planet: Running Vir... Brandon Jacobs & Lukas Gentele
25:04
CNCF [Cloud Native Computing Foundation]
Рет қаралды 1,2 М.
Coreweave's CSO on the Business of Building AI Datacenters | Odd Lots
54:34
Bloomberg Podcasts
Рет қаралды 3,6 М.
Just Give me my Money!
00:18
GL Show Russian
Рет қаралды 958 М.
Glow Stick Secret Pt.4 😱 #shorts
00:35
Mr DegrEE
Рет қаралды 18 МЛН
❌Разве такое возможно? #story
01:00
Кэри Найс
Рет қаралды 6 МЛН
Zero-Downtime Live Migration of Stateful VMs on Kubernetes - Felicitas Pojtinger, Loophole Labs
35:12
CNCF [Cloud Native Computing Foundation]
Рет қаралды 2,4 М.
Understanding the fundamentals of kubernetes networking
51:01
Devops With Syed
Рет қаралды 1,4 М.
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 152 М.
"I Hate Agile!" | Allen Holub On Why He Thinks Agile And Scrum Are Broken
8:33
CoreWeave CEO: The World Is Dependent on Nvidia
7:40
Bloomberg Technology
Рет қаралды 6 М.
Do NOT Learn Kubernetes Without Knowing These Concepts...
13:01
Travis Media
Рет қаралды 280 М.
Just Give me my Money!
00:18
GL Show Russian
Рет қаралды 958 М.