Ollama on Kubernetes: ChatGPT for free!

  Рет қаралды 4,892

Mathis Van Eetvelde

Mathis Van Eetvelde

Күн бұрын

Пікірлер: 16
@SteveHarris-mi9ou
@SteveHarris-mi9ou 2 күн бұрын
It's a cool setup to use to run a RAG setup locally as well - nice going.
@oscarandresdiazmorales7180
@oscarandresdiazmorales7180 11 күн бұрын
Excelente video! Me encantó cómo explicaste el proceso de implementar Ollama en Kubernetes. Gracias por compartir tu conocimiento!
@Techonsapevole
@Techonsapevole 12 күн бұрын
I use docker compose but i was curious about k8s
@zulhilmizainudin
@zulhilmizainudin 9 күн бұрын
Looking forward for the next video!
@mathisve
@mathisve 3 күн бұрын
You can find the video here: kzbin.info/www/bejne/j6DQoGV6o7FshKM
@zulhilmizainudin
@zulhilmizainudin 2 күн бұрын
@@mathisve thanks!
@MuhammadRehanAbbasi-j5w
@MuhammadRehanAbbasi-j5w 10 күн бұрын
Would really like the video on how to add a GPU to this, both locally and on the cloud.
@mathisve
@mathisve 9 күн бұрын
Stay tuned for that video! I'm working on it as we speak, should be out later this week!
@HosseinOjvar
@HosseinOjvar 11 күн бұрын
Helpful tutorial thank you
@samson-olusegun
@samson-olusegun 13 күн бұрын
Would using a k8s job to make the pull API call suffice?
@mathisve
@mathisve 12 күн бұрын
Yes and no! On paper, if you only had one pod this could work. But the API call needs to be made every time a new Ollama pod is scheduled (unless you're using a PVC mounted to the pod to store the model). As far as I'm aware it's not possible to start a Kubernetes job at the creation of a new pod without using an operator.
@Sentientforce
@Sentientforce 12 күн бұрын
Can you please advise how to run ollama in k3d cluster in wsl2- windows 11 and docker desktop environment. The issue I’m not able to solve is making gpu visible in a node.
@unclesam007
@unclesam007 6 күн бұрын
here i cant deploy a simple laravel app on k8s🤒
@mathisve
@mathisve 2 күн бұрын
Do you need help with deploying Laravel on Kubernetes?
Ollama with GPU on Kubernetes: 70 Tokens/sec !
20:19
Mathis Van Eetvelde
Рет қаралды 459
Multi-Agent AI EXPLAINED: How Magentic-One Works
16:39
Sam Witteveen
Рет қаралды 11 М.
When u fight over the armrest
00:41
Adam W
Рет қаралды 30 МЛН
А я думаю что за звук такой знакомый? 😂😂😂
00:15
Денис Кукояка
Рет қаралды 1,7 МЛН
Using Ollama and N8N for AI Automation
13:43
Matt Williams
Рет қаралды 35 М.
Need animations? Use this library.
12:24
Theo - t3․gg
Рет қаралды 63 М.
Using Clusters to Boost LLMs 🚀
13:00
Alex Ziskind
Рет қаралды 71 М.
Qwen Just Casually Started the Local AI Revolution
16:05
Cole Medin
Рет қаралды 75 М.
Azure Local with low cost hardware
9:40
Microsoft Azure
Рет қаралды 35 М.
Redis vs Memcached Performance Benchmark
8:44
Anton Putra
Рет қаралды 25 М.
EASIEST Way to Fine-Tune a LLM and Use It With Ollama
5:18
warpdotdev
Рет қаралды 131 М.
NVIDIA CEO Jensen Huang Leaves Everyone SPEECHLESS (Supercut)
18:49
Ticker Symbol: YOU
Рет қаралды 923 М.
When u fight over the armrest
00:41
Adam W
Рет қаралды 30 МЛН