Deploy LLMs using Serverless vLLM on RunPod in 5 Minutes

  Рет қаралды 5,700

AI Anytime

AI Anytime

Күн бұрын

Пікірлер: 16
@Bluedrake42
@Bluedrake42 2 ай бұрын
Finally a tutorial that isn't awful. Thank you for existing.
@pagadalasumanth7969
@pagadalasumanth7969 Ай бұрын
Oh bro I feel you so much !
@udaykiran2053
@udaykiran2053 3 ай бұрын
as you are using the llama model , what is the need for OpenAI installed to check it in the colab Notebook , can you explain
@nishalk781
@nishalk781 2 ай бұрын
I think he's using openai model for its functions, like that module has stream which will make things easier for you if u need to receive text has chunks, instead of entire text.
@matthewchung74
@matthewchung74 3 ай бұрын
Serverless on runpod with a bigger model, like llama70b on multiple gpus would be awesome!
@AIAnytime
@AIAnytime 3 ай бұрын
Coming soon 🔜
@renwar_G
@renwar_G 22 күн бұрын
Great video G
@AIAnytime
@AIAnytime 21 күн бұрын
Appreciate it
@jamesalxl3636
@jamesalxl3636 2 ай бұрын
im trying to run a 70B uncensored model, will this be possible with this method?
@SohamBasu-b1x
@SohamBasu-b1x Ай бұрын
can we set automated pause and resume in runpod endpoints ? like I want it to run for 3 hours per day in the morning? Can I set that up?
@premierleaguehighlights9061
@premierleaguehighlights9061 3 ай бұрын
Can i use deepfacelab on runpod?
@frag_it
@frag_it 3 ай бұрын
Bro do one for azure Kubernetes with vllm
@AIAnytime
@AIAnytime 3 ай бұрын
Coming soon
@frag_it
@frag_it 3 ай бұрын
@@AIAnytime make sure you do a in depth guide would be awesome to learn and apply the llama 3.1 405 B on it. You can even make it a longer playlist ppl would go crazy over it
@shekharkumar1902
@shekharkumar1902 3 ай бұрын
Sounds like a web promotion. Please create video with agentic based use case example with free of cost llms in local computer
@GodFearingPookie
@GodFearingPookie 3 ай бұрын
Nothing is free. Money has to come in
Dify + Ollama: Setup and Run Open Source LLMs Locally on CPU 🔥
21:46
Run ALL Your AI Locally in Minutes (LLMs, RAG, and more)
20:19
Cole Medin
Рет қаралды 206 М.
MAGIC TIME ​⁠@Whoispelagheya
00:28
MasomkaMagic
Рет қаралды 35 МЛН
🕊️Valera🕊️
00:34
DO$HIK
Рет қаралды 18 МЛН
CAN YOU DO THIS ?
00:23
STORROR
Рет қаралды 46 МЛН
Will A Basketball Boat Hold My Weight?
00:30
MrBeast
Рет қаралды 136 МЛН
Best GPU Providers for AI: Save Big with RunPod, Krutrim & More
16:15
Run Llama 3.1 405B with Ollama on RunPod (Local and Open Web UI)
15:52
Deploy LLMs More Efficiently with vLLM and Neural Magic
33:21
Neural Magic
Рет қаралды 787
Deploy and Use any Open Source LLMs using RunPod
27:45
AI Anytime
Рет қаралды 15 М.
9 incredible AI apps that changed my life forever
16:29
Silicon Valley Girl
Рет қаралды 310 М.
Fine Tune Qwen2 VL Model using Llama Factory
28:57
AI Anytime
Рет қаралды 3,8 М.
MAGIC TIME ​⁠@Whoispelagheya
00:28
MasomkaMagic
Рет қаралды 35 МЛН