Deploying machine learning models on Kubernetes

Рет қаралды 20,393

mildlyoverfitted

Күн бұрын

Пікірлер: 52

@ludwigstumpp Жыл бұрын

Always a pleasure to watch someone as talented as you! Keep it up :)

@mildlyoverfitted Жыл бұрын

Wow, much appreciated:) Thanks:)

@abdjanshvamdjsj Жыл бұрын

Brooooo this was so good.

@mildlyoverfitted Жыл бұрын

Glad you liked it!

@vishalgoklani Жыл бұрын

Welcome back, we missed you!

@mildlyoverfitted Жыл бұрын

Hehe, thank you! Nice to hear that:)

@alivecoding4995 Жыл бұрын

I agree!

@humanity-indian 7 ай бұрын

Great example. Thanks for the information

@mildlyoverfitted 7 ай бұрын

My pleasure!

@fizipcfx Жыл бұрын

he is back 🎉

@kwang-jebaeg2460 Жыл бұрын

OH !!!!! Glad to meet you again !!!!

@mildlyoverfitted Жыл бұрын

Glad you are here:))

@shivendrasingh9759 7 ай бұрын

Really helpful for foundation on ml ops

@mildlyoverfitted 7 ай бұрын

Glad to hear that!

@thinkman2137 Жыл бұрын

Thank you for detail tutorial!

@thinkman2137 Жыл бұрын

But torchserve now has kubernetes intergration

@mildlyoverfitted Жыл бұрын

I will definitely look into it:) Thank you for pointing it out!!

@ivanxiecornell Жыл бұрын

Would appreciate a video using VScode to include docker contain files, k8s file and Fast API

@JoseMiguel_____ Жыл бұрын

You're great. Thanks for sharing this in such a nice way.

@mildlyoverfitted Жыл бұрын

My pleasure!

@aditya_01 11 ай бұрын

great video thanks a lot really liked the explanation !!!.

@mildlyoverfitted 11 ай бұрын

Glad it was helpful!

@unaibox1350 Жыл бұрын

Amazing video. In min 5:25 how did you do to open the second bash in the console? I was searching for a long time and I can't find anything. Thanks and regards!

@mildlyoverfitted Жыл бұрын

Thank you! You need to install a tool called tmux. One of its features is that you can have multiple panes on a single screen.

@unaibox1350 Жыл бұрын

@@mildlyoverfitted Thank you! Will dig in it now

@davidyates4857 Жыл бұрын

Great video very informative.

@mildlyoverfitted Жыл бұрын

Glad you liked it!

@SmiteLax Ай бұрын

Cheers mate!

@maksim3285 Жыл бұрын

Thank you, it helped me a lot .

@mildlyoverfitted Жыл бұрын

Happy to hear that!

@bpac90 4 ай бұрын

excellent!! I'm curious why my search always shows garbage and videos like this never come up. This was suggested by Gemini when I asked a question about ML model deployment.

@zhijunchen1248 Жыл бұрын

Hi, I would like to use GPU to accelerate this demo, can you give me some tips? Thank you

@mildlyoverfitted Жыл бұрын

So if you wanna use minikube this seems to be the solution. minikube.sigs.k8s.io/docs/handbook/addons/nvidia/

@zhijunchen1248 Жыл бұрын

@@mildlyoverfitted thankyou, i use the "--device" flag of transformers-cli to enable GPU. And I found that serving app takes up almost gpu memory and no compute power. Whatever, thankyou for your video!

@unaibox1350 Жыл бұрын

I am having a problem in the min 18:00 the model load is being killed all the time. I tried to "minikube config set memory 4096" but still having the same problem. Any idea? I've been looking for a solution for 3 hours and there is no way

@mildlyoverfitted Жыл бұрын

Hm, I haven't had that problem myself. However, yeh, it might be related to the lack of memory.

@davidpratr 10 ай бұрын

really nice video. Would you see any benefit of using the deployment in a single node with M1 chip? I'd say somehow yes because an inference might not be taking all the CPU of the M1 chip, but how about scaling the model in terms of RAM? one of those models might take 4-7GB of RAM which makes up to 21GB of RAM only for 3 pods. What's you opinion on that?

@mildlyoverfitted 10 ай бұрын

Glad you liked the video! Honestly, I filmed the video on my M1 using minikube mostly because of convenience. But on real projects I have always worked with K8s clusters that had multiple nodes. So I cannot really advocate for the single node setup other than for learning purposes.

@davidpratr 10 ай бұрын

@@mildlyoverfittedgot it. So, very likely more petitions could be resolved at the same time but with a very limited scalability and probably with performance loss. By the way, what are those fancy combos with the terminal? is it tmux?

@mildlyoverfitted 10 ай бұрын

@@davidpratr interesting:) yes, it is tmux:)

@alivecoding4995 Жыл бұрын

What terminal application is this, with the different panels?

@mildlyoverfitted Жыл бұрын

tmux

@lauraennature Жыл бұрын

New video 🤩

@johanngerberding5956 Жыл бұрын

very cool video!

@mildlyoverfitted Жыл бұрын

Thank you! Cheers!

@evab.7980 Жыл бұрын

👏👏👏

@EvaKreplova Жыл бұрын

Great!

@nehetnehet8109 Жыл бұрын

Great

@nehetnehet8109 Жыл бұрын

Realy goood

@kwang-jebaeg2460 Жыл бұрын

Look forward to show your face alot :))

@SunilSamson-w2l 5 ай бұрын

the reason you got . , ? as the output for [MASK] because you didn't end your input request with a full stop. Bert Masking Models should be passed that way. "my name is [MASK]." should have been your request.