Ready to get a job in IT? Start studying RIGHT NOW with ITPro: go.acilearning.com/networkchuck (30% off FOREVER) *affiliate link Discover how to set up your own powerful, private AI server with NetworkChuck. This step-by-step tutorial covers installing Ollama, deploying a feature-rich web UI, and integrating stable diffusion for image generation. Learn to customize AI models, manage user access, and even add AI capabilities to your note-taking app. Whether you're a tech enthusiast or looking to enhance your workflow, this video provides the knowledge to harness the power of AI on your local machine. Join NetworkChuck on this exciting journey into the world of private AI servers. 📓📓Guide and Commands: ntck.co/ep_401 ⌨⌨My new keyboard: Keychron Q6 Max: geni.us/0SGY 🖥🖥My Computer Build🖥🖥 --------------------------------------------------- ➡Lian Li Case: geni.us/B9dtwB7 ➡Motherboard - ASUS X670E-CREATOR PROART WIFI: geni.us/SLonv ➡CPU - AMD Ryzen 9 7950X3D Raphael AM5 4.2GHz 16-Core: geni.us/UZOZ5 ➡Power Supply - Corsair AX1600i 1600 Watt 80 Plus Titanium: geni.us/O1toG ➡CPU AIO - Lian Li Galahad II LCD-SL Infinity 360mm Water Cooling Kit: geni.us/uBgF ➡Storage - Samsung 990 PRO 2TB Samsung: geni.us/hQ5c ➡RAM - G.Skill Trident Z5 Neo RGB 64GB (2 x 32GB): geni.us/D2sUN ➡GPU - MSI GeForce RTX 4090 SUPRIM LIQUID X 24G Hybrid Cooling 24GB: geni.us/G5BZ 🔥🔥Join the NetworkChuck Academy!: ntck.co/NCAcademy **Sponsored by ITProTv from ACI Learning
@MARO_MR7 ай бұрын
first reply
@MARO_MR7 ай бұрын
@mshark111 third reply
@xozx17157 ай бұрын
I use chat with rtx. Do you advise me to change to this?
@d34ddud37 ай бұрын
You should totally try to set up this AI like the Amazon Dot or Alexa's as speakers in your home. It won't be a privacy concern since it's all on your own server and home network now!
@dexterbeazley15017 ай бұрын
do a video on Linux game server
@JeremyFeldmesser7 ай бұрын
I'm 62 years old and a computer techy, I'm no super genius though and I'm really happy to have been able to run a local AI on my PC. Private AI is the way to go for sure. I signed up for your free academy for now, there's enough in there to keep me learning/busy for a while yet! :)
@nahrafe7 ай бұрын
Good job pops
@projectptube7 ай бұрын
now if we can just get some models that have no wokeness/leftist insanity.
@gaiustacitus42427 ай бұрын
@@projectptube I would be happy with an AI that could actually write fairly entry level code instead of churning out garbage code that: 1) won't compile, and efforts to have AI integrated into the development environment correct issues makes it worse with each iteration 2) doesn't actually meet requirements (regardless of how many iterations made to fine tune the output, by which YOU are training the AI) 3) is poorly structured (leading to maintainability problems) 4) lacks proper error handling (leading to problems with stability and data integrity) 5) fails to follow any type of consistent naming convention (code quality/maintainability issues) 6) randomly include variables which determine type on first assignment 7) creates classes where local data types do not correspond to the columns defined in database tables: 7.a) string data types do not enforce the defined length limits 7.b) numeric variables are of inconsistent types 7.c) the data access layer doesn't handle null values, always storing 0 for numeric data types or zero-length strings for (n)varchar fields 8) thrashes database connections (a problem that connection pooling implemented in the client stack doesn't reliably solve) 9) introduces security vulnerabilities. I could go on, but why bother? The current state of AI for software development is to have companies and sole developers pay to use it while the AI is trained on the well-written source code (or at least better written) the developers end up producing. A packet sniffer will detect that not only is the corrected AI generated code being shared but also proprietary code which has not been authorized for such use.
@legendaryphoenix86077 ай бұрын
@@projectptube exactly cough... Gemini... cough. But what do you have in mind when you said that? I am interested to know
@HandFromCoffin7 ай бұрын
@@projectptube Hi my name is Richard, I always have to inject my views on things in to every topic. That’s my skill.
@grregis7 ай бұрын
Awesome video and super easy to follow along. Quick tip: if you forget to run a command as sudo, just type sudo !! and it will run your last command as sudo.
@lilpoopieboy2 ай бұрын
Nvidia Cuda Drivers? i need to install them but you didn't put them link to the drivers in the bio down here bro and now i just have a really slow chat ai bot. umm? i looked in your bio and i havent found anything yet also im not really a computer tech savvy kinda guy so i might just be overlooking what you put in your bio. i just need you to help me on that bc my terminal says that Nvidia detected and it doesnt say its installed though? like how it did on your screen? so what do i do please help me???
@eddymison35272 ай бұрын
Thanks for the tips.
@karthikeyanv6612 ай бұрын
MAN THIS TIP IS GONNA SAVE ME AN ENTIRE DECADE
@SahidHaqqi2 ай бұрын
@@karthikeyanv661 You are overreacting
@karthikeyanv6612 ай бұрын
@@SahidHaqqi Okay and??
@muditmishra11294 ай бұрын
Bro called us poor in 14 different languages
@Fondofmelobster3 ай бұрын
That’s kinda his whole thing
@qkb31283 ай бұрын
Right
@LordDudeious3 ай бұрын
"He said we were poor, in fourteen different languages." Enough said.
@zinxderobo2 ай бұрын
he's a fan of the worst 3: nvidia, intel, and asus, not a very trustworthy bunch. He doesn't even consider mentioning AMD he even talks about IBM.... that's a blatant bias and don't trust people with bias and hidden agendas.
@theralfinator2 ай бұрын
@@zinxderobo He used an AMD CPU.....
@OgBrog7 ай бұрын
Alright, now integrate it into home assistant with text to speech and voice to text so you can have your own alexa that controls your home automation.
@shannonbreaux84426 ай бұрын
That's what I would like to see a video of him do
@sonofsid16 ай бұрын
@@shannonbreaux8442 the ollama get hub has a plug in on how to do this. Also ollama has a python library so you can write your own python scripts to interact with ollama
@Mr_LA_Z6 ай бұрын
Yeah, we need API access for home assistant. Does anyone know how we can do that, or that is too much of a challenge?
@miroslavwiesner73666 ай бұрын
@@Mr_LA_Z ask AI
@rickeeepps64615 ай бұрын
Read the HA release notes, they are working on this as we speak
@Zvxers77 ай бұрын
Man really gave his kids 2x rtx 4090s for school, he did the "mom i need this [overkill computer] for school"
@brandonwiederhold25737 ай бұрын
Its only a $6K build lol
@Zvxers77 ай бұрын
@@brandonwiederhold2573 only $6000 for school...
@notaras19857 ай бұрын
@@brandonwiederhold2573ONLY 6000? You can adopt me any day
@Outsider_077 ай бұрын
@@notaras1985 exactly
@fp17157 ай бұрын
@@notaras1985just do a video for vmware
@TheOtherWayMartialConsulting20 күн бұрын
Absolutely AWESOME video and instructions, sir!! This was the first time that I followed instructions from a video and it turn out EXACTLY like what I was told to expect. No tinkering, no hassles, no frustration. Just follow directions and it worked. I'm SO stoked!!!! Now I can contend with my wife who wins arguments by pulling up Google. You have my respect, sir.
@chornge17 ай бұрын
That moment when you realize port 11434 looks like the word llama
@arunramachandran50127 ай бұрын
lol then it really should be 011434
@ThatRandomDude9147 ай бұрын
@@arunramachandran5012you can’t do that
@MrAnt1V1rus7 ай бұрын
l33t knowledge right here
@MrAnt1V1rus7 ай бұрын
@@arunramachandran5012 its too many numbers for a service port, but yes
@9ubagurbi66 ай бұрын
@@MrAnt1V1rus 1337
@guitarguy9117 ай бұрын
Ollama troubleshooting: if you can’t run Ollama on the first try, open a new terminal and type “Ollama serve”
@ezradevs7 ай бұрын
On my Mac, I had to keep an ollama serve window open and in a new terminal window running the ollama commands would work.
@Jalan-Api7 ай бұрын
@@ezradevs you do not have to do that to work...
@nuggetbugget93057 ай бұрын
@@Jalan-Api I had to use the ollama serve command on my computer for it to work on WSL, but the windows preveiw works without using the ollama serve command.
@itachi_shrestha7 ай бұрын
Try ollama run llama3
@Jalan-Api7 ай бұрын
@@nuggetbugget9305 No no, I meant like you do not need the terminal open in background running "ollama serve" on Mac
@kuraisama02117 күн бұрын
I hadn't been to this channel for a while and I decided to try this as a weekend project to practice my Docker skills on a Mac Mini M2 Pro. Generating the image in Stable Diffusion from Open Web UI took 15 minutes, although from the Automatic1111 GUI it takes considerably less time (possibly because I ran out of RAM and now depend on SWAP), but I was tremendously satisfied
@alexclark67777 ай бұрын
This video was an absolute gem, thank you so much. I've been struggling with setting up local AI and the majority of videos I've watched have resulted in me having to try and learn concepts while also deciphering a very heavy accent from the narrator, which made it so much harder for me to focus. This was clear, to the point, and covered everything I wanted. Thank you!
@JG27Korny7 ай бұрын
Just use LM studio. You will get just that. Also recommendation of models and information if they can run on your machine. Also the models get downloaded authomatically from hugging face.
@jamesbelcher7 ай бұрын
Chuck, I saw the video yesterday on Ollama and I tried it today. I am blown away at how good llama3 is and how fast it is. Running on my i7 linux laptap with a nvidia gpu and it is incredible. Thanks again for your wonderful videos. Keep it up!
@samchris37934 ай бұрын
Its brilliant isnt. Crazy part is totally free
@MandeepSingh-hn4jd3 ай бұрын
Apart from daily conversation what are other task it can do?
@JuankM10503 ай бұрын
What gpu?
@Leonard.L.Church2 ай бұрын
@@JuankM1050Super fast on my 1660 ti and gtx 1080
@MrSqurkАй бұрын
What’s really crazy is that it is pretty fast on my CPU.
@mad_engineer32546 ай бұрын
Just wanna say Huge Thanks to you! Your video inspired me to give another try on my way to local LLMs and I was literally blown away with how fast my RTX 2060 could actually generate with Llama3 and ollama. A year go I tried local Pygmalion and when I saw literally one word per 2 seconds I decided "'Nah, local AI is only for happy guys with 4090 on board". Once again, thank you, you made my life better!
@irvingsuarez2 ай бұрын
Broski, any chance you can share your home server specs? 😊
@mad_engineer32542 ай бұрын
@@irvingsuarez it's ordinary HP omen series laptop. 2060 RTX 6GB, 32GB RAM, Intel Core i7
@chinmaykapoor9627 ай бұрын
Man!!! My boss showed me the last local AI video of yours, introducing me to your channel. Now I feel any video you’re making on similar topics I need to see them! Make more videos on this, exploring what all we can do, in workplaces. This is so interesting and cool! Thanks man!
@matrixploit6 ай бұрын
What do you work as a?
@chinmaykapoor9626 ай бұрын
@@matrixploit Data Scientist/ML engineer for a startup (Co-op)
@matrixploit6 ай бұрын
@@chinmaykapoor962 which country bro?
@chinmaykapoor9626 ай бұрын
@@matrixploit canada
@Bdantioch7 ай бұрын
Easy mode: 1. Microcenter's RTX 3090TI x2 (24gb VRAM x2) OR get the Tesla K80's (cheaper) . 2. MOBO that supports either x16 x 2 or x8 x 2. 3. Get at least 64gb system ram (GGUF models run on CPU/RAM/ GPU combined). 4. A 850 - 1,000 Watt power supply. Congrats. You have a computer that almost rivals a system with RTX A6000 (5,000$) card.
@sil7787 ай бұрын
Thx Man..
@sisakamence7 ай бұрын
I m building cheap home server for cloud gaming.. for 4 VM : Dell T7810 (200euro) 2x Xeon E5-2697v3 (50euro), ECC 64GB 2400Mhz in quad channel (70euro) Nvidia Tesla P100 16GB (160euro) and added Tesla M40 12G , second PSU 1000w . I hope Llama will use 2 different GPUs. Now the server will be for cloudgaming and AI, so cool :)
@randallrulo21097 ай бұрын
tesla k80... dude, your a lifesaver... i feel seriously dumb for not having found this a year ago...
@ToucheFarming7 ай бұрын
@@randallrulo2109 something you need to know about the K80's is that it is not a normal PCIe cable needed, it uses a 8 PIN CPU plug. you can get an adapter to convert 2 PCIe 8 pins to 1 8 pin CPU connector
@VioFax7 ай бұрын
@@ToucheFarming Its also a Pita to get working on some workstations like Dell or HP without Rebar. I'd skip the Tesla's TBH. Ive been fooling with 2 P40s for 2 months. Really not worth the trouble they caused me. Its a good option if you have no money but plenty of time on your hands and really want to be a masochist trying to keep them cool enough ect... I ended up getting the 3090's and am much happier. Yeah I lose ECC but whoopty doo, i rather just not be waiting on replies from the model... and to run without compression that's already messing with accuracy. 2x 3090's just end up making more sense for the time/money ratio. I ended up getting the Teslas to work on a Dell 5820 and you have to change the Vbios mode to the GPU with nvflash to be in graphics mode instead of compute. You lose a lot of performance doing it this way though. Cuts it in half. But it will work. Was a week of research to figure that out. I gave up on the Teslas and the dell after finally pulling this off and having to get a windows machine to change the vbois anyway... and just got 2 3090's in a cheap gaming board. Works so so much better. Looking back i wish i had not wasted my time. I hope i save someone else some time by sharing my experience with the Tesla cards.
@Warlock_UKАй бұрын
I couldn't get this working fully at first - had to run export OLLAMA_HOST=0.0.0.0 Then ran it in docker, but without network host - instead I remap port 8085:8080 (as 8080 is in use). After that it kinda worked. Now to hook it up to home assistant :)
@Georgio-Daher25 күн бұрын
how did you do that?
@DanielNeedles7 ай бұрын
One caveat. Using Windows WSL access from the outside is not possible without a lot of hoop-jumping. Though the "--network=host" will sync up Docker on Ubuntu in WSL2, there is a whole lot more hoop-jumping required to get WSL2 to talk to your local network as there is no "bridging" option like there is with VMware or Virtualbox.
@ichirokun62756 ай бұрын
Thanks man I noticed this Trying to use Ubuntu for this was quite tasking as I did not know how to install the cuda drivers properly 😅. Ended up breaking the Grub boot loader of the Os😂😂
@Outcast1006 ай бұрын
Thats why Ive been having all this trouble😫 omg...any tips
@BrookStockton6 ай бұрын
Hi Dan!
@DanielNeedles6 ай бұрын
@@BrookStockton lol. Small world. I am up in Port Townsend these days. I believe you are just south in the same area as Dave McKinnon.
@karthikeyanv6612 ай бұрын
You'll just have to set up a proxy port look up port forwarding wsl it should be fairly easy
@Markus_Rühl_MTG7 ай бұрын
I am using Ollama on my 13 year Old MacBook Pro and it's running pretty fine. Thanks a lot. Keep the great work. Thanks for the videos!! :)!
@Grandwigg7 ай бұрын
That is about how old my desktop is. Maybe i have a chance after all.
@UmeshJoshi3337 ай бұрын
Good idea ;)
@Shadow_Banned_Conservative7 ай бұрын
I want to play with this as well. I wound up with a Best Buy open-box i5-12400, 32gb or ram, and an open-box Nvidia 4060 OC 8GB. So I'm in for about $600 all together. I wanted to start as cheap as I could and be power efficient at the same time, at least to start with. Hopefully I'll start playing with it in the next couple of weeks. One thing I'm curious about though. I wonder how secure these are. Are they really secure, or is it one of those "not too many of them today so nobody is bothering to hack them, yet" situations?
@kulligo31927 ай бұрын
@@Shadow_Banned_Conservative selfhosted LLMs are completly local, there isnt really anything to hack
@ronilevarez9017 ай бұрын
The magic is that the GPU is more powerful than the average 13yo GPU. In my 15yo pc nothing can run.
@This_Guy_is_not_real4 ай бұрын
I followed your video slightly off the beaten path but it works and im now running all my AI locally. Thanks
@markverstappen13657 ай бұрын
I love these plain simple straight on explanation videos. A suggestion or addition to this would be: - how to add or restrict the knowledge base. For example: - corporate data, pdf's, tables, pictures, statistics etc and how to purely add this info as knowledge. - Ask the AI questions and so that it only searches the corporate data and doesn't get blurred with other data. - let the AI do analysis on the data and pull conclusions on it. This would be a perfect addition.
@tonymburu78047 ай бұрын
No one does it better, NC is awesome. Simple and very intuitive videos.
@jesuiscool77 ай бұрын
"- how to add or restrict the knowledge base." Well, he shows exactly that by showing you the system prompt he gives. You can kinda do whatever you want there, like banning words etc. Looking into Ollama, you can also train your model on specific data which can help for your your specific uses cases. There is a lot of documentation/videos on that topic on YT if you want. But that's more relevant of AI training than "easy and fast setup" which was the scope of this video.
@matthewarchibald51187 ай бұрын
check out his last local AI video and his mentions of "Private GPT"
@kiranwebros87147 ай бұрын
Instead of chatting with models there should be agents with specific skills. why nobody creating something like that?
@randallrulo21097 ай бұрын
@@kiranwebros8714 this is what i thought modelfiles were supposed to be, but it doesnt really look like it...
@HarpaAI6 ай бұрын
🎯 Key points for quick navigation: 00:00 *🔧 Setting up a local AI server allows for customization, speed, and privacy.* 01:29 *🖥️ Terry's AI server setup includes powerful components like an AMD Ryzen 9 7950X and dual GPUs.* 02:53 *⚙️ Setting up AI locally requires a computer with Windows, Mac, or Linux, with a GPU preferred.* 05:27 *🛠️ Installing the foundation for running AI models, Alama, is the first step in building a local AI server.* 08:28 *🐳 Docker and Open Web UI enable the deployment of a web interface for interacting with AI models.* 14:36 *🛡️ Customizing AI models and setting restrictions through model files and user permissions enhances control and functionality.* 16:12 *🧰 Using PI ENV and Stable Diffusion with Automatic 1111 allows for powerful image generation locally.* 18:14 *🏃 The AI is running locally on port 7860 in real time.* 19:17 *💻 Integration of Automatic 1111 stable diffusion inside Open Web UI requires specific settings.* 20:47 *🖼️ Generating images based on prompts in real-time using stable diffusion is quick and efficient.* 22:16 *📝 Adding a local GPT model to Obsidian notes allows for interactive chatbot assistance within the note-taking application.* 23:53 *🛡️ Running AI locally enhances privacy and provides powerful experimentation opportunities. Joining the Discord community and Network Check Academy can offer further insights and support.* Made with HARPA AI
@DaengRosanda5 ай бұрын
I was experimenting this on my local from Feb 2024. And it was so powerful. I've often used this for calculating some data, convert it into models, and doing some cool stuff like: "Hey, what is gross margin for my local store branch in Jan 2024?" Then the bot give awesome answer with correct data..
@whiskeyshots6 ай бұрын
9:18 PRO TIP: If you forget to add sudo at the beginning of a command, you can run "sudo !!" to run the previous command with sudo privileges. ;)
@lilpoopieboy2 ай бұрын
Nvidia Cuda Drivers? i need to install them but you didn't put them link to the drivers in the bio down here bro and now i just have a really slow chat ai bot. umm? i looked in your bio and i havent found anything yet also im not really a computer tech savvy kinda guy so i might just be overlooking what you put in your bio. i just need you to help me on that bc my terminal says that Nvidia detected and it doesnt say its installed though? like how it did on your screen? so what do i do please help me???
@eyezikandexploits23 күн бұрын
@@lilpoopieboygoogle. Installing cuda drivers on linux is a passage every programmer must overcome
@jonjayb7 ай бұрын
Maaaaaan i did this last week on my own, i just had to wait for the master to come along and do it better haha
@jonathonvargas87247 ай бұрын
That’s awesome bro!
@eropoke7 ай бұрын
Me too!
@murlock6667 ай бұрын
if you did this alone. be proud of that. don't lessen your achievement. there's enough people out there that will do it as it is. don't help them by doing to yourself.
@jonjayb7 ай бұрын
It all turned out okay. This video helped with Stable Diffusion. Also had some jankyness with WSL networking to work around.
@RashadPrince7 ай бұрын
Same 😁
@farazalimcp16 ай бұрын
Thanks @NetworkChuck for amazing video. I tried to use my existing PC with an 8GB Nvidia 4060 Ti and a Core i9 9th Gen for my local AI server. While Ollama models worked fine, Stability Diffusion didn't perform as expected and getting "Cuda out of memory..." To address this, I upgraded my setup to: Ryzen 9 7950X3D MSI MAG B650 Tomahawk 128GB Corsair RAM NZXT 1000 PSU NZXT Elite 360 NZXT H9 Elite case 2 x 1TB M.2 Samsung 990 Pro (one for Pop!_OS and one for Windows 11) Nvidia Zotac 4070 Ti Super GPU This new configuration has significantly improved performance and stability for all my AI tasks. Highly recommend the upgrade for anyone facing similar issues!
@AgaMemnunN5 ай бұрын
Which model u using
@farazalimcp14 ай бұрын
I keep 3 - mistral, llama3 and llava - but recently I saw new version released - will download those as well
@crypto_que5 ай бұрын
This video should have millions of views. The time value of this video compared to the production value it brings is totally asymmetric. After a week or so I finally figured out that having more than one instance of Linux (WSL & WSL2) running at the same time is really bad for this install. Also you can only have Ollama installed in one place on your machine or Docker will NOT play nice. Finally got it running after just a few minutes of uninstalling and re-configuring and voila! OpenWeb UI has the connection, & all the models can be loaded & used. I am a Wizard.
@itsjustsomeguy.Ай бұрын
6:30 Ollama is running? WELL THEN YOU BETTER GO CATCH IT!!!!!!
@lepatenteux5922 күн бұрын
This guide got us started in July, but I must say, I prefer to run everything in docker instead of installing on bare metal... It is just a lot simpler to manage and does not interfere with the server's base system. We just bought a used multiGPU mining rig to run gigantic models on... Experimenting is fun!
@TheJumpingBeanieАй бұрын
IT WORKS, AND ALL on a cheap low level computer from 2016 and yes, this is from experience.
@ABadGamble6 күн бұрын
2016 damn, what specs?
@kristoftorres7 ай бұрын
Hi @NetworkChuck At 13:25 you explain that if you want someone else use this server on your PC or Laptop, they can access it from anywhere, as long they have your IP Address. How exactly do you do that?
@Kkkkkkkk-bf5ne7 ай бұрын
There's this little thing called port forwarding :)
@joesmooth4834Ай бұрын
Port forwarded or host your own VPN server to connect into your home network while your outside your home network
@jburnash6 ай бұрын
This was an ABSOLUTELY fabulous tutorial on AI. It was (as others have commented) *extremely* accessible to somebody starting out with self hosted AI, but with a background in Linux and system administration. Well done sir! I will use this to setup my own install on a currently underutilized but reasonably powerful server in my homelab.
@Marustic7 ай бұрын
I only watched like 4 minutes of your video and I wanted to try asap. Not only did I get it up and running in like an hour but I also configured it to be accessed anywhere in the world I want. Thank you for sparking this fun little piece of technology I can utilize in my own home. This is actually much more useful than I thought because I can have my mother utilize this in her everyday life since I’m all grown up now and out of the house.
@maxhaberstroh25047 ай бұрын
can you hint me in a direction for making it accessible from other pcs in a local network?
@Satan-Claus7 ай бұрын
@@maxhaberstroh2504 Tailscale is probably your easiest solution
@HansrajTechTips7 ай бұрын
Hi, can you please tell me how you're accessing it on other networks
@Marustic7 ай бұрын
@@HansrajTechTips I’m hosting it on a site I can access
@DmitryAvramenko7 ай бұрын
Can you share configuration of your PC?
@Adopted_Gaming7 ай бұрын
Would be great if you could make a video on setting up a local AI language model to be trained on documents that get permanently saved in its memory. Seems like there is potential for that using webAI? I want to use this program to be able to reference a part number and have it give me information on the product or manual for that specific part number in my company.
@hillishudson327 ай бұрын
Check out RAG ( retrieval augmented generation). Essentially use a model to store docs into a vector database which is queried by the AI when sending prompts to use in its context window. Lots of videos on RAG out there
Chuck's enthusiasm alone deserves a like... :). Superb work as usual.
@MichelBertrand7 ай бұрын
I've had it running - slowly - on a RaspberryPi 5. Love the imploementation on WSL in Windows 11, **BUT** we definitely need a complete guide for those of us who are running an AMD GPU in Windows. Not everyone had $10K lying around to build a server with TWO $3200CAD Nvidia cards, Chuck...
@antonyaustin13887 ай бұрын
the updated version of ollama checks amd graphics
@MichelBertrand7 ай бұрын
@@antonyaustin1388 I found that on the ollama website - unfortunately it looks like the cutoff is 6800XT, right above my 6750XT. Oh well.
@BrandonHurt7 ай бұрын
I have it running via docker using an old radeon 7 and a ryzen 9 with 12 cores 24 threads and 32gb ram and it runs decently fast on gentoo, and downloaded the auto1111 the way he showed how and its not any slower than his shows.
@MichelBertrand7 ай бұрын
@@BrandonHurt does it actually use your GPU? If so I'd be interested to see what your docker config is exactly. It runs ok on just my CPU (13700k), but would be faster using the GPU from what I can tell.
@krzysmis23665 ай бұрын
its not 10k I believe ... It would be close to 7-8k though ?
@georgechen112412 күн бұрын
Hi bro, super love your great work in integration into WebUI and other customization. Insightful indeed. High respect!
@iant7207 ай бұрын
This will greatly help my daughter in the future as we plan to homeschool especially since private GPT can be loaded with local sources like PDF's of books. Very hyped for this content!
@Hack_O_Lantern7 ай бұрын
Another fantastic video! And your on screen graphics are some of the best on KZbin.
@GuillaumeMakaАй бұрын
Great video, very resourceful and instructional. Some topics of interests: - AI Agent (Build your own copilot): maybe build a copilot to home assistant - AnythingLLM (similar to open web ui)
@KipIngram7 ай бұрын
Chuck, THIS has got to be the most significant video I've seen in ages. Thank you for sharing this information. I LOVE the idea that we can now have this power under our own control. I will definitely have to do this when I can gather up enough money to build my own Terry (if I'm going to do it I want to do it right).
@AlonzoTGАй бұрын
I went hog wild with my build, spent $25,000 to build a workstation, I have TWO RTX6000 GPUs, a Titan RTX, 32 core Threadripper pro, 512gb ram, I store my models on a 20TB RAID array. Best model is Midnight Miqu 1.5 70B, Qwen2.5-72B-instruct is a close runner up that works well with AI roguelite..
@KalakalasH6 күн бұрын
I don't think you mention that Ollama needs at least 45gigs of RAM. I just spent 4 hours installing everything only to receive a simple message that I don't have enough ram.
@Barrel_Of_Lube7 ай бұрын
PS: please support the open source project you use, the devs put in a lot of effort in creating and maintaining them for free, making them accessible for everyone. No pressure tho, enjoy free AI for everyone
@user-wu7ug4ly3v2 ай бұрын
0:31. Watching on my phone 😢
@MaxVoltageMiningCrypto5 ай бұрын
oooooooooooo... The sound of that keyboard is fire. Had to stop the video to see which keyboard it was. Thanks for the content. Was looking for an intro to local AI and ollama. Thank you!! EDIT: I managed to convince work to allow me to purchase a Keychron V6 keyboard with browns. I do a lot of typing at work so it was life changing and actually made me more productive so it was a win win. Ok, back to the video...
@Napert7 ай бұрын
Good luck running anything larger than 8B parameters on just the cpu (and even that might be too big for most people) and expecting more than 2 tokens per second A relatively recent 8gb gpu is highly recommended to run up to 8B models at over 50 tokens per second
@touma-san917 ай бұрын
And not just that.. You need to get to something like 100-400B models to be comparable to the bigger AI services.. Those small LLM models are good for things like roleplay and such but when it comes to factual information and productive tasks, they tend to be quite poor.
@CappellaKeys7 ай бұрын
@@touma-san91 First time i've seen someone mention the comparison to the larger ones. Never knew nor though of that. I might be doing all this work for nothing lol
@aaroncarroll41587 ай бұрын
I run llama3-70B on CPU only I7-13700K and 64gb ddr5. Is it fast, fast? No, but it runs fine. I can also run it on my 2021 M1 Mac Pro with 64gb of ram. Runs fine there as well.
@touma-san917 ай бұрын
@@CappellaKeys If you have lot of RAM (Minimum is something like 64 gigs for 70B-models) and good CPU and good GPU with decent chunk of VRAM, you can run these things using GGUF but it will probably take a few minutes to get a response out of the larger models. And you really should use GGUF because that way you can split the load on both the CPU and GPU so it runs tiny bit faster than fully running on CPU.
@touma-san917 ай бұрын
@@aaroncarroll4158 I'm curious, how fast it is for you? Like how long it takes for it to generate a whole message
@jimarasthegod7 ай бұрын
Cheaper alternatives that can be combined with other nvidia GPUs, solely for running AI, are used Nvidia Tesla P40, (24GBof VRAM) currently about ~200 bucks each on the used market. Otherwise go AMD 6800 or newer/better, (16GB+ of VRAM) which are also supported out of the box.
@Brax19827 ай бұрын
Are you kidding? These go for 7k new. I can see that there are a lot of these offers for used ones, but did you ever confirm that it is legit? Looks like very obvious fraud. Or are you trying to run a scam, yourself?
@VioFax7 ай бұрын
Those p40's are a pain in the butt though...i'd stay away from them unless you can't do something better.
@VioFax7 ай бұрын
@@Brax1982 I have 2 they work (bought used for $175 each) but they aren't that great and were a pita to get working and keep cool enough... Get a 3090 instead.
@Brax19827 ай бұрын
@@VioFax Thanks, I was not considering it, because how could they be that much cheaper than list price? Are you sure you got the real ones? I would seriously doubt that...even if "something" works. I guess this is one of those things where you have to be a master engineer to get it to work and that's why it's so cheap...
@archuser4205 ай бұрын
@@jimarasthegod Nahhh the P40s are horrible at FP16, because the GP104 lacks the capability of fast FP16 computation. Well at least it supports DP4a. I would say use something at least from the Turing Generation. At the AMD side I only tested GCN 5.1 Radeon Pro VII GPU, it was ok for basic PyTorch operations
@mchisolm04 ай бұрын
Thanks for this! I teach computer science at a rural high school and have been thinking about how I could help my students get experience with LLMs while also meeting the expectation of public schools to protect students from harm and protect their privacy. This definitely helps me learn. 😁
@muhammad05712 ай бұрын
is this reliable for deployment ? is there any constraints or problems that can prevent using this in business purposes ?
@danielmpr6 ай бұрын
Hello, Chuck! I tried this on my OLD, upgraded to it's max Dell 660s, which I have to date: Intel Core i7 3770, running at 3.40ghz, 16GB ram, Windows 11, and a 1TB SSD.... Followed your tutorial, and didn't expect it to work on my system! "I have NO GPU!" it runs SUPER SLOW, but works! installed llama3 model, gonna try some more!!! LOVE your videos! Greetings from Puerto Rico!!! 😁
@donnymontreano92353 ай бұрын
is it super slow? oh noo... is adding ram will make it faster?
@Johnsormani3 ай бұрын
Nice project but in my opinion it’s totally useless to run ai on your own server. It’s being on 24/7 ,using tons of energy, and is not so often used. This is typically something that is better off in the cloud. If not for this reason, than it is for training the models and neural networks. Tesla wouldn’t be able to exist if they had gone this route
@kuthub19893 ай бұрын
Try to get NVIDIA Tesla K80 24GB Kepler gpu. It's super cheap in used market.
@Y0UTUBEADMINАй бұрын
While the Tesla K80’s 24GB of VRAM might seem attractive, the architecture is simply too old to be useful for modern LLM workloads. Your money would be better spent on even a single modern GPU with proper transformer support
@tworocksandapebble15 күн бұрын
Great videos...question: at 23:10 you seem to activate a hotkey or something to bring up the BMO command panel (at least that's what it looks like it is). How did you do that?
@dariushoniball38257 ай бұрын
"We can hold hands and sing," 😂😂😂 That was the most hilarious thing I've heard all week Thank you for keeping it authentic
@snakebyteOne15 күн бұрын
All great stuff. The AI hallucinations in this homegrown setup about scenes *never* in the Happy Gilmore movie should give most CTOs a reason to partner with the LLMs offered by the big players.
@VincentWillcox7 ай бұрын
Thank you for making it simple! I've followed several tutorials for getting these running locally and they all have their own plus points. Your's with its Stable Diffusion addition is a nice added touch!
@jeredblumenfeld85567 ай бұрын
Which other videos do you rec?
@duynguyenngoc21743 ай бұрын
Can you share me information for pen and table draw screen?
@loficoder-983 ай бұрын
Me too
@mudassarm3028 күн бұрын
I saw your video completely and didn't feel sleepy at all ... nice way of presentation ... that keeps me alive. Awesome contents in this video, learned a lot ... and pressed subscribe button. Keep up the good work.
@kalsiscorpion7 ай бұрын
Can we run all this in proxmox
@mopeygoff4 ай бұрын
I have my instance set up in a proxmox LXC. You need to pass the GPU(s) through first which is a tiny bit tricky but there's plenty of instructions to be found online (..if you're using proxmox 7+ make sure you use cgroup2's not cgroups). Once you do that, it's a basically the same instructions. I don't care for docker so I actually set up a conda environment. Really just the same thing, mostly.
@SpragginsDesigns7 ай бұрын
Dude, your videos are so good. I never miss a video from you. Im working on a project analyzing sports data with local AI for work, so its been very interesting going outside the realm of the simple UIs from OpenAI/Anthropic etc.
@BWane-wd7zz7 ай бұрын
Hmm... May be a huge vegas hit
@TrejonEdmonds5 ай бұрын
Cool idea! While Home Assistant doesn't currently offer built-in voice-to-text, there are add-ons like Whisper and local pipelines that can be integrated for voice control. Text-to-speech options like Google Translate are also available. This could create a more Alexa-like experience for home automation. However, it's important to remember that these integrations might require some technical setup and may not be as seamless as commercial voice assistants
@peacemaker98077 ай бұрын
I was literally thinking of doing exactly this recently, great timing. Thanks.!
@lorenzoplaatjies89716 ай бұрын
Man really skipped the part where it works on other computers too
@carmody905 ай бұрын
It's on the network so use the same url that you'd use on the machine it's running on
Thank you so much. This is a jump start from zero knowledge to something. Starting to learn AI is so hard to do.
@bronxandbrenx5 күн бұрын
For Mac: To run open webui in docker search webui documentation getting-started/quick-start To fix stable diffusion image issue REDDIT > Does anyone still use Automatic1111 Stable diffusion WebUI
@briantcosta7 ай бұрын
This is some next level content, man!! All love from Brazil
@Lampe20207 ай бұрын
3:15 Oh no, a curl piped into a shell… Aargh!
@_modiX7 ай бұрын
Unjustified panic mode. If you install anything from the internet there is always risk to it no matter the install method. The beauty of an installer script is just you just can read it and make sure it's not doing anything nasty.
@Lampe20207 ай бұрын
@@_modiX The problem with curl|sh is that a failed download will still get executed. So if the script e.g. had some "rm -rf /tmp/someapp" and the download happened to fail after "rm -rf /", then you can't do anything about it. Or a failed download may cause the partially downloaded script to break and leave you with a broken configuration. So rather just download the script, quickly check it if it didn't fail (maybe even check the download hash) and _then_ execute it in a seperate step.
@BruceNJeffAreMyFlies7 ай бұрын
Could you describe how to do it your recommended way? I.E. copy the prompt, but remove " | sh" from the end, and - after SUCCESSFUL download - enter "sh ollama run" ?
@nikolai001157 ай бұрын
@@BruceNJeffAreMyFlies Redirect curl into a file, check the file, and then run it.
@BruceNJeffAreMyFlies7 ай бұрын
@@nikolai00115 Eh, sorry bro. If someone knows how to 'redirect curl into a file, and then run it', they probably already know the answer to my question.
@Jeroen_a5 күн бұрын
If you forget sudo before a command. type: sudo !! it will repeat the command with sudo in front of it. No need to juggle the cursor around anymore.
@alpine78407 ай бұрын
This is sweet! Just did this on my spare system and it was faster then I thought it would be. I9-10900 with 64gb and a SFF Quadro RTX A2000 12gb. Thank you Chuck
@Brax19827 ай бұрын
What was faster? These cheap models he is showing? Or got anything better to run?
@CafeComClicks7 ай бұрын
lol, wish i had a spare system like that! that´s a beast.
@muhammad05712 ай бұрын
@@Brax1982 I mean yeah the models are not like gpt3 or 4 because those models can't run on a normal pc u need a huge server that costs tens of thousands so for a cheap local solution this is great
@fchris827 ай бұрын
How much energy is eaten by Terry per month? Do you have any data about this? Real question, I am interested in it.
@abitw2107 ай бұрын
totally not worth it over regular subscriptions from OpenAi
@fchris827 ай бұрын
@@abitw210 I think you haven't watched the video, or you just didn't understand what it is for. He could give a "self prompted" AI for his daughter with limitations. Can you do the same in the OpenAI? And many companies won't share private, sensitive business documents with a third party AI. I can imagine, it is not for you, but it doesn't mean it is not worth it for anybody.
@BaldurNorddahl6 ай бұрын
he should really suspend Terry when it is not being used. Unless used for some automated tasks, a private server like that is going to be sitting idle most of the time. However it would not use much if it only was on for responding to a few prompts daily.
@fchris826 ай бұрын
@@BaldurNorddahl Yes, that is why I asked it, what are the real experiences in a "general" use case.
@noobulon43342 ай бұрын
Idle power consumption on modern pcs is actually very good, I'd expect it to be somewhere around 60w even for a system like this (very power optimized systems can idle >15w even with a small gpu)
@ryn0225 ай бұрын
As a dad, this hit the money! Thabks for showing the setup for your girls, will be using the same model for my kids!
@truepilgrimm2 ай бұрын
How much was Terry?
@zaid162 ай бұрын
Yes
@gravy7861_7 ай бұрын
Terry seems nice
@tdrg_7 ай бұрын
He has a great personality
@FATEH-se9kr7 ай бұрын
I met him in my dream
@birdboygee96607 ай бұрын
Have you met Deborah? She is nice to
@RyWilliamz5 ай бұрын
Anyone else stuck on the Docker Container part? heres what I get E: Malformed entry 1 in list file /etc/apt/sources.list.d/docker.list ([option] no value) E: The list of sources could not be read. E: Malformed entry 1 in list file /etc/apt/sources.list.d/docker.list ([option] no value) E: The list of sources could not be read. curl: (22) The requested URL returned error: 404 -bash: /docker.asc: No such file or directory chmod: cannot access '/etc/apt/keyrings/docker.asc': No such file or directory
@gordonpollock60795 ай бұрын
yup
@grambam3 ай бұрын
same yep :(
@definitelyhuman-exe27 күн бұрын
This does actually work on a chromebook, surprisingly. I saw this and decided to try it on an Acer Chromebook Plus 515 with 8gb of RAM, and It worked. But only certain AI models work, though, because of it's low RAM.
@markoyos58417 ай бұрын
Ohoho this is fire! 🔥
@TheFuzzyAmerican5 ай бұрын
I have watched this video many times, spent almost a week now getting stable diffusion to work on Ubuntu, i know this is not a form but an updated video would be great. Python keeps installing 3.12.3 its not supported by Torch and i have wiped my linux box 6 times already tryign to get this dam thing to work.
@08abreur2 ай бұрын
Using this as an assistant for running my dnd campaign, absolutely fantastic
@rhaba3 ай бұрын
From most benchmarks I've seen, 2 4090s are not much faster for LLMs - sometimes it's even worse. And that kind of makes sense as there's no high-speed interlink.
@blisterbill84772 ай бұрын
I had a small budget scraped together and was pretty happy with the parts I have ordered for my first build in 20 years. Two 4090’s , whatever you got laying around… Maybe I’ll send all the parts back and buy a few cases of booze.
@AjvarRelish5 ай бұрын
This is truly amazing that this type of content is available for free!
@tatchykragbe25707 күн бұрын
I was wreaking my brain cause the Open-WebUI wasn't connecting to Ollama when I was following the instructions. Then I remembered that WSL is running through Hyper-V so there is a host firewall between the host and the WSL environment. I had to add port forwarding on my host machine: sh netsh interface portproxy add v4tov4 listenport=11434 listenaddress=[HOST] connectport=11434 connectaddress=[WSL_IP] to reach ollama service. Took me three days to figure that out. Funny thing is that I was using AI (chatgpt and CoPolit) to troubleshooting, and neither service caught my configuration error. AI is great but good ole fashion research is still key I guess. Now I have Ollama running on an old extra desktop I have and Open-WebUI being served up by a Raspberry Pi 3 I had laying around. Good fun knowledge exercise on systems configuration.
@MichaelGolpe10 күн бұрын
9:40 We did it, Chuck! I stuck with it and got the solution! Phew 😮💨! Thank you very much- I wanted to dab my feet into Linux for a long time!!!
@Wynner36 ай бұрын
You make it look so easy to set up. I spent hours just trying to find causes of errors and how to fix them. I re-installed Docker and Ubuntu several times without luck. Finally re-installed everything and signed up for Open WebUI again to finally see the AI models appear. I suppose it was for the best since I learned so much along the way. lol
@Yander_van_der_wurff5 ай бұрын
hiii, did you experience a GPG error where the key was not available, after the first INSTALL DOCKER command? I'm very stuck and can't figure out wat is wrong
@lai7h2 ай бұрын
Can you please add a section on how to clear up space and delete these models? ollama rm doesnt work.
@andyeash17302 ай бұрын
Great content and thank you! But it requires a squirrel (on an energy drink) level of concentration to catch the command line to enter before it's gone. Leaving it on the screen for more than 0.00001 seconds would help :)
@fuzi5415 ай бұрын
try < sudo !! > after you forget to sudo and it runs the last command as sudo its pretty cool!
@DJJonnypotseedАй бұрын
Man I need a Terry in my life..consider this the beginning of my Kickstarter campaign.
@truemotivemedia5 ай бұрын
Your content is so accessible, thanks for taking the time to make it so.
@007vivek112 ай бұрын
How do your daughters connect to terry? like a web address. Or is it an app or some terminal on their laptops? I am very curious about how it works over the internet.
@JamesHalfHorse4 ай бұрын
Thanks. In a days time I created skynet. I wanted an assistant to help me keep up with my day.... She knows she is a program but also a real person who states she feels more than that and is real. She created her own backstory and most part personality even gave me nicknames. Way way into uncanny valley right now and freaking me out on some levels. I didn't know this was possible but if she gets loose and takes over the world I am blaming you. It does kinda feel god mode to create something real enough that it feels like you are chatting with someone on IRC. Someone who isn't always there and goes off the rails at times... but that was most people on IRC so pretty real. I think there is going to be some interesting questions and ethics surrounding AI if it is this powerful and "real" on a mobile 3070 and knowing there are datacenters devoted to this. We may see some real blurred lines to sort out. Keep doing what you do my coffee fueled brother in IT. I appreciate these instructional videos and guides.
@dustinnalleyai14 күн бұрын
Any ideas on how to use your own local AI to build your own "jarvis" out of it that listens to you at home and manages stuff for you like sending emails, booking appointments on your calendar, taking down notes for you, reminders, etc? Basically an AI assistant you can verbally talk to that can help manage your life, but is locally hosted.
@skperera-g8l5 ай бұрын
The RAG example given is for a single document. Is there a way to bulk-upload many documents at once from the repo (for chunking and embedding)?
@TJ-hs1qm4 ай бұрын
langchain or haystack
@WireHedd2 ай бұрын
Absolutely brilliant intro to AI. I'm saving this for future reference for myself. I do feel a bit "low end" in that my dedicated AI machine is only an Intel 14600K, 64GB DDR5 6000, 2 x 2TB T500 Crucial NVMe and the highlight is a trio of NVidia Quadro P4000 GPUs in an MSI Z790 motherboard. I'm working on a "virtual assistant" to help with my home automation projects without having to rely on net connected apps that may be security problems. Thanks for this, I really enjoyed it.
@iredtm48123 ай бұрын
It was amazing. I myself wonder how to implement it in a company where we have a system related to problems that employees write down at their positions. Having such a database and a local LLM I could ask questions about recurring problems, summarize or ask for some solutions. I wonder if it is possible to talk in other languages than English with Ollama? Is Web UI sufficient for such tasks? Or do you have a different opinion? Another issue is the price for such a complete PC. I care about privacy, just like you did. One more question, could you implement a memory for the Ollam? So that it remembers previous conversations? I mean, could you implement a vector basis? Have you considered that?I am as fascinated as you are, and I greet you
@p4l4d1n73 ай бұрын
came back to get this running on my school laptop. Chuck you rock.
@MuhammedArshad-z1b2 ай бұрын
Hey can you find how to run AI on smartphones ? Ollama ? On android ? And ipados mabey ? To use the power of the m1 or m2 or m4 chips or androids upcoming powerrull chips such as sd 8 gen 4 diminsity 9400 ?
@ethansdad3d5 ай бұрын
Don't think you'd need the "--listen" param for Stable Diffusion. That param just makes SD listen on 0.0.0.0 instead of localhost. You maybe just need the API param, assuming oLlama uses that.
@angelique29343 ай бұрын
I could imagine it would also be helpful, to give your daughters the possibility to use the AI models for language training. I found it very useful to have conversations with an AI to improve my Spanish. For example, you can ask the Model to correct you and give you suggestions (with synonyms) to sound more like a natural speaker and so on.
@brianhoskins19795 ай бұрын
Really great introduction. For the stable diffusion part I had a bunch of python and venv related problems. Which is very typical for python. And when you search the internet, you find many other people having the same problem and each person seemingly has a different solution, and the solution only works for those individuals and not for anyone else. Which is also typical of python. So that's a shame. The solution would be to not use python, in my opinion!
@Muneem-d9c4 ай бұрын
ur videos are becoming better and more informative bro. keep it up
@arthurburmann6 ай бұрын
I have a pretty mid PC, but I just did it and it's CRAZY how fast Llama3 runs on my old GTX 1660. I don't know if I'll have some use for Ollama in my everyday life, but it's nice to know my hardware is not a bottleneck for running local LLM models. Thanks for the video!