Deploying machine learning models on Kubernetes

Views: 20,393

mildlyoverfitted

Comments: 52
@ludwigstumpp 1 year ago
Always a pleasure to watch someone as talented as you! Keep it up :)
@mildlyoverfitted 1 year ago
Wow, much appreciated:) Thanks:)
@abdjanshvamdjsj 1 year ago
Brooooo this was so good.
@mildlyoverfitted 1 year ago
Glad you liked it!
@vishalgoklani 1 year ago
Welcome back, we missed you!
@mildlyoverfitted 1 year ago
Hehe, thank you! Nice to hear that:)
@alivecoding4995 1 year ago
I agree!
@humanity-indian 7 months ago
Great example. Thanks for the information.
@mildlyoverfitted 7 months ago
My pleasure!
@fizipcfx 1 year ago
He is back 🎉
@kwang-jebaeg2460 1 year ago
OH !!!!! Glad to meet you again !!!!
@mildlyoverfitted 1 year ago
Glad you are here:))
@shivendrasingh9759 7 months ago
Really helpful as a foundation for MLOps.
@mildlyoverfitted 7 months ago
Glad to hear that!
@thinkman2137 1 year ago
Thank you for the detailed tutorial!
@thinkman2137 1 year ago
But TorchServe now has Kubernetes integration.
@mildlyoverfitted 1 year ago
I will definitely look into it:) Thank you for pointing it out!!
@ivanxiecornell 1 year ago
Would appreciate a video using VS Code that covers the Docker container files, the k8s files, and FastAPI.
@JoseMiguel_____ 1 year ago
You're great. Thanks for sharing this in such a nice way.
@mildlyoverfitted 1 year ago
My pleasure!
@aditya_01 11 months ago
Great video, thanks a lot! Really liked the explanation!
@mildlyoverfitted 11 months ago
Glad it was helpful!
@unaibox1350 1 year ago
Amazing video. At 5:25, how did you open the second bash session in the console? I searched for a long time and couldn't find anything. Thanks and regards!
@mildlyoverfitted 1 year ago
Thank you! You need to install a tool called tmux. One of its features is that you can have multiple panes on a single screen.
@unaibox1350 1 year ago
@mildlyoverfitted Thank you! Will dig into it now.
@davidyates4857 1 year ago
Great video, very informative.
@mildlyoverfitted 1 year ago
Glad you liked it!
@SmiteLax 1 month ago
Cheers mate!
@maksim3285 1 year ago
Thank you, it helped me a lot.
@mildlyoverfitted 1 year ago
Happy to hear that!
@bpac90 4 months ago
Excellent!! I'm curious why my search always shows garbage and videos like this never come up. This one was suggested by Gemini when I asked a question about ML model deployment.
@zhijunchen1248 1 year ago
Hi, I would like to use a GPU to accelerate this demo. Can you give me some tips? Thank you.
@mildlyoverfitted 1 year ago
So if you wanna use minikube, this seems to be the solution: minikube.sigs.k8s.io/docs/handbook/addons/nvidia/
@zhijunchen1248 1 year ago
@mildlyoverfitted Thank you, I used the "--device" flag of transformers-cli to enable the GPU. I found that the serving app takes up almost all of the GPU memory but no compute power. Anyway, thank you for your video!
@unaibox1350 1 year ago
I am having a problem at 18:00: the model load keeps getting killed. I tried "minikube config set memory 4096" but I still have the same problem. Any idea? I've been looking for a solution for 3 hours without success.
@mildlyoverfitted 1 year ago
Hm, I haven't had that problem myself. However, yeah, it might be related to a lack of memory.
@davidpratr 10 months ago
Really nice video. Do you see any benefit in running the deployment on a single node with an M1 chip? I'd say yes to some extent, because a single inference might not use all of the M1's CPU, but how about scaling the model in terms of RAM? One of those models might take 4-7GB of RAM, which adds up to as much as 21GB of RAM for only 3 pods. What's your opinion on that?
@mildlyoverfitted 10 months ago
Glad you liked the video! Honestly, I filmed the video on my M1 using minikube mostly because of convenience. But on real projects I have always worked with K8s clusters that had multiple nodes. So I cannot really advocate for the single node setup other than for learning purposes.
@davidpratr 10 months ago
@mildlyoverfitted Got it. So, very likely more requests could be handled at the same time, but with very limited scalability and probably some performance loss. By the way, what are those fancy combos in the terminal? Is it tmux?
@mildlyoverfitted 10 months ago
@davidpratr Interesting:) Yes, it is tmux:)
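The RAM estimate in @davidpratr's question can be checked with a short calculation. This is only a rough sketch: it assumes every pod loads its own full copy of the model and that nothing is shared between pods. The function name `total_ram_gb` is made up for illustration, and the 4-7GB range is the commenter's own estimate, not a measurement.

```python
def total_ram_gb(per_pod_gb: float, replicas: int) -> float:
    """RAM needed if each replica holds its own copy of the model (assumption)."""
    if per_pod_gb < 0 or replicas < 0:
        raise ValueError("per_pod_gb and replicas must be non-negative")
    return per_pod_gb * replicas


REPLICAS = 3  # number of pods mentioned in the comment

# 4-7 GB per model is the range quoted in the comment.
low = total_ram_gb(4, REPLICAS)
high = total_ram_gb(7, REPLICAS)

print(f"{REPLICAS} pods need roughly {low:g}-{high:g} GB of RAM")
# prints "3 pods need roughly 12-21 GB of RAM"
```

The upper bound matches the 21GB figure in the comment, which is more than many single-node setups can spare for model serving alone.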
@alivecoding4995 1 year ago
What terminal application is this, with the different panels?
@mildlyoverfitted 1 year ago
tmux
@lauraennature 1 year ago
New video 🤩
@johanngerberding5956 1 year ago
Very cool video!
@mildlyoverfitted 1 year ago
Thank you! Cheers!
@evab.7980 1 year ago
👏👏👏
@EvaKreplova 1 year ago
Great!
@nehetnehet8109 1 year ago
Great
@nehetnehet8109 1 year ago
Really good
@kwang-jebaeg2460 1 year ago
Looking forward to seeing your face a lot :))
@SunilSamson-w2l 5 months ago
The reason you got . , ? as the output for [MASK] is that you didn't end your input request with a full stop. BERT masked language models should be given input that way. "my name is [MASK]." should have been your request.
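The tip above can be sketched in Python. This is a minimal illustration, not the setup from the video: the helper `prepare_masked_prompt` is hypothetical, and the model name `bert-base-uncased` is an assumption, since the comments do not say which checkpoint was served. The `fill_mask` function relies on the Hugging Face `transformers` fill-mask pipeline and downloads the model on first use.

```python
MASK_TOKEN = "[MASK]"  # mask token used by BERT-style tokenizers


def prepare_masked_prompt(text: str, mask_token: str = MASK_TOKEN) -> str:
    """Strip whitespace and append a full stop if the prompt lacks end punctuation."""
    cleaned = text.strip()
    if mask_token not in cleaned:
        raise ValueError(f"prompt must contain {mask_token}")
    if cleaned[-1] not in ".!?":
        cleaned += "."
    return cleaned


def fill_mask(text: str, model_name: str = "bert-base-uncased"):
    """Run a fill-mask pipeline on the normalized prompt (model name is an assumption)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    unmasker = pipeline("fill-mask", model=model_name)
    return unmasker(prepare_masked_prompt(text))


print(prepare_masked_prompt("my name is [MASK]"))
# prints "my name is [MASK]."
```

Without the final full stop the mask is the last token of the sentence, so punctuation is a very likely prediction. With the full stop in place the model has to fill the slot with a word instead.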