The Ultimate Getting Started with Local LLMs Guide

Views: 1,513

GosuCoder

1 day ago

Comments: 11
@WillWehi 17 days ago
I wish I had come across this 2 days ago. I spent nearly 10 hours reading Reddit to research this stuff to try on my still-waiting empty GPU system. You covered so much in 20 minutes! Subscribed, and would love to see more.
@MemphisVon 20 days ago
Thanks for this and your writingtools video. Happy I found your channel.
@DaveEtchells 14 days ago
PHENOMENALLY useful! Absolutely what I’ve needed to get started with local LLMs! One question: Doesn’t the embedding algorithm for RAG have to match the specific LLM you’re using, meaning that you’d need to use a different embedding for each model? Also, when does the embedding run? (Upon import, manually, or at runtime for each chat session?) How do you handle updated documents? Can you just have the embedding run periodically, such as every night? Thanks for this reference, you saved me dozens of hours figuring this out - or more likely made it possible for me to do it at all!
@GosuCoder 14 days ago
My understanding is that RAG doesn't require you to match the encoder with your language model, though they should be semantically compatible. You can mix and match (like using BERT embeddings with GPT models), as long as the encoder creates high-quality embeddings that capture the meaning of your documents effectively.
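To illustrate the reply above: in a typical RAG setup the embedding model is only used to decide which text chunks to retrieve, and the language model then receives those chunks as plain text in its prompt, so the two never share a vector space. A minimal sketch of that flow, assuming a toy bag-of-words embedder in place of a real encoder (toy_embed, retrieve, and build_prompt are hypothetical names, not from the video or any library):

```python
import math
from collections import Counter


def toy_embed(text: str) -> Counter:
    """Stand-in for a real embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], k: int = 1) -> list[str]:
    """Rank chunks by similarity to the query and return the top k as text."""
    q = toy_embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, toy_embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, chunks: list[str]) -> str:
    """The LLM only ever sees this string, never the embedding vectors."""
    context = "\n".join(retrieve(query, chunks))
    return f"Context:\n{context}\n\nQuestion: {query}"


chunks = [
    "LM Studio runs models locally on your GPU.",
    "Embeddings map text to vectors for similarity search.",
]
prompt = build_prompt("what do embeddings do", chunks)
print(prompt)
```

Because build_prompt returns ordinary text, the same prompt could be sent to any local model; swapping the LLM would not require changing toy_embed, and vice versa.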
@GosuCoder 14 days ago
Thank you for the kind words!
@DaveEtchells 14 days ago
@GosuCoder Thanks for that info; it’s a little confusing to me. My conception of it was that the embeddings needed to translate into vectors having the same dimensionality and “meaning”, for lack of a better term, as the LLM coordinate space they’d be used with. OTOH, maybe there are universal encodings for input vectors that the LLMs are then trained against. - I guess I need to spend some time chatting with ChatGPT or Claude to learn more on the subject. (It’d make sense that embeddings would be at least somewhat universal; it’d be a PITA if you had to re-encode your whole dataset every time you wanted to try a different model.) Thanks for taking time to answer!
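On the re-encoding worry: in a typical setup the stored vectors depend only on the embedding model and the document text, so trying a different LLM leaves the index untouched, while changing the embedding model means building a new index. Documents are usually embedded when they are imported, and only the query is embedded at chat time. A minimal sketch of refreshing an index so that only new or edited documents are re-embedded, assuming a hypothetical EmbeddingIndex class and a placeholder toy_embed function (neither comes from the video or a real library):

```python
import hashlib


class EmbeddingIndex:
    """Hypothetical index that caches one vector per document, keyed by content hash."""

    def __init__(self, embedder_name, embed_fn):
        self.embedder_name = embedder_name  # a different embedder means a new index
        self.embed_fn = embed_fn
        self.entries = {}  # doc_id -> (content_hash, vector)
        self.embed_calls = 0

    def sync(self, documents):
        """Embed new or changed documents, drop deleted ones, return re-embedded ids."""
        changed = []
        for doc_id, text in documents.items():
            digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
            cached = self.entries.get(doc_id)
            if cached is None or cached[0] != digest:
                self.entries[doc_id] = (digest, self.embed_fn(text))
                self.embed_calls += 1
                changed.append(doc_id)
        for doc_id in list(self.entries):
            if doc_id not in documents:
                del self.entries[doc_id]
        return changed


def toy_embed(text):
    """Placeholder for a real embedding model."""
    return [float(len(text)), float(text.count(" "))]


index = EmbeddingIndex("toy-embedder-v1", toy_embed)
docs = {"a.txt": "hello world", "b.txt": "local llms"}
first = index.sync(docs)            # first run embeds every document
docs["b.txt"] = "local llms and rag"
second = index.sync(docs)           # later run embeds only the edited one
print(first, second, index.embed_calls)
```

A scheduled job (nightly, for example) could call sync on the current document set; unchanged files would be skipped because their hashes match.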
@PerFeldvoss 15 days ago
Great, but I really wonder what you are getting at regarding "runtimes"... I never ran into that 😉 in this context, and I don't know how to get access to "Mission control" in LM Studio. Is it a hidden feature, or not part of LM Studio?
@GosuCoder 7 days ago
I probably should come back with another video to explain in depth what runtimes actually are.
@anatoliypodkladov2173 13 days ago
RAG please 🙏
@anatoliypodkladov2173 13 days ago
and DeepSeek
@GosuCoder 7 days ago
I'm working on a RAG video now. Hopefully I'll be done in the next few days!