Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Ong

  Рет қаралды 8,105

The Linux Foundation

The Linux Foundation

Күн бұрын

Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Min Ong, Jina AI
With the rise of AI and machine learning applications, GPU resources have become a critical bottleneck in scaling infrastructure to efficiently serve AI workloads. Kubernetes, an open-source container orchestration platform, provides a solution to this problem through the NVIDIA device plugin which allows multiple containers to share access to GPU devices. In this talk, we will explore how Kubernetes can be used to efficiently scale AI workloads by sharing GPU resources across multiple containers. We will discuss the challenges of GPU resource management, explore various techniques for optimizing GPU usage and set resource limits to ensure fair and efficient allocation of GPU resources among containers. By the end of this talk, attendees will have a solid understanding of how Kubernetes can be used to share GPU resources across multiple containers, allowing them to make the most of their GPU investments and achieve faster, more accurate results in their AI applications.

Пікірлер
Unleashing the Power of AI in Kubernetes through K8sGPT | Alex Jones
30:01
Kubernetes Community Days UK
Рет қаралды 4,4 М.
Mastering GPU Management in Kubernetes Using the Operator Pattern- Shiva Krishna Merla & Kevin Klues
47:53
CNCF [Cloud Native Computing Foundation]
Рет қаралды 3,5 М.
Support each other🤝
00:31
ISSEI / いっせい
Рет қаралды 81 МЛН
Une nouvelle voiture pour Noël 🥹
00:28
Nicocapone
Рет қаралды 9 МЛН
Enabling Cost-Efficient LLM Serving with Ray Serve
30:28
Anyscale
Рет қаралды 6 М.
Trends in Deep Learning Hardware: Bill Dally (NVIDIA)
1:10:58
Paul G. Allen School
Рет қаралды 24 М.
GPUs in Kubernetes for AI Workloads
13:04
DevOps Toolkit
Рет қаралды 6 М.
Everything you Need to Know about using GPUs with Kubernetes - Rohit Agarwal, Google
31:33
CNCF [Cloud Native Computing Foundation]
Рет қаралды 8 М.
Kubernetes At Home: What Is Kubernetes? - Part 1
19:52
Jim's Garage
Рет қаралды 37 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,2 МЛН
Do NOT Learn Kubernetes Without Knowing These Concepts...
13:01
Travis Media
Рет қаралды 338 М.
KubeRay: A Ray cluster management solution on Kubernetes
25:00