Generate LLM Embeddings On Your Local Machine

  Рет қаралды 19,229

NeuralNine

NeuralNine

Күн бұрын

Пікірлер: 21
@moumniable
@moumniable 8 ай бұрын
i just love how diverse your videos are ! even when i don't particulary look for something your videos drives me to learn more. thanks ❤
@rons96
@rons96 8 ай бұрын
Not bad, but if i may say any tip, i would say to use a sentence-transformer from hugging face for embeddings and then use a llama like llm only to customize the answer, because models created just for embeddings seems to be more accurate for this task. Also, langchain module is easier and useful than using numpy and requests, with lot more features. I use this setup most for RAG and seems to work pretty well.
@henrischomacker6097
@henrischomacker6097 7 ай бұрын
Very interesting hint. Why would you suggest to use a sentence-transformer from hugging face for creating the embeddings instead? Which method does a sentence-transformer from hugging face use to create the embeddings and which one does ollama use?
@hackdonalds
@hackdonalds 7 ай бұрын
I tried llama2 and mistral embeddings through ollama embeddings api. The similarity search results were sht compared to Xenova/all-MiniLM-L6-v2 or gte-small
@rons96
@rons96 5 ай бұрын
@@hackdonalds yes, llama for embeddings is not good, with sentence-transformers i mean that one you mentioned, then use llama to elaborate the sentence. There's another model better for embeddings but it will require more resources and i don't remember the name now.
@rahulmakwana663
@rahulmakwana663 18 күн бұрын
@@rons96instructorembedding
@godwinntowdanso4111
@godwinntowdanso4111 2 ай бұрын
Spot on. Simplified presentation
@user-td4pf6rr2t
@user-td4pf6rr2t 2 ай бұрын
Cool guide. Very well explained. +1
@mohammadalibazyar5079
@mohammadalibazyar5079 6 ай бұрын
thanks, bro... really helpful ❤
@Darkev77
@Darkev77 7 ай бұрын
Powerful video! Guys, anyone knows how I can generate these embeddings if I were to deploy my app remotely?
@iamreallybadatphysicsbutda8198
@iamreallybadatphysicsbutda8198 8 ай бұрын
Great video! 😃
@EliSpizzichino
@EliSpizzichino 6 ай бұрын
that's very interesting! I imagine you can build your local knowledge base in this way... I need to make one for code-snippets that store knowledge bits find around.... Is `d` dimension fixed by the model? does it mean I have 4096 bytes to store my embedding?
@dangalimov7435
@dangalimov7435 8 ай бұрын
Brilliant!
@ddschaefer
@ddschaefer 8 ай бұрын
Great video! But where comes faiss into play?
@peterparker5161
@peterparker5161 4 ай бұрын
I tried this with LLAMA3 8b locally. It can work if the sentences are short enough. But when I started plugins in long paragraphs (youtube transcripts) it becomes basically useless. Transformers that are creating for embedding (BERT for example) seems to work better. They also have lower computational cost compared to LLAMA. I tried again with "nomic-embed-text-v1.f16.gguf" and it works much better.
@all-in-one-890
@all-in-one-890 8 ай бұрын
First comment ❤ and ur videos are fantastic
@JuanDiegoSalamanca-oy6xs
@JuanDiegoSalamanca-oy6xs 5 ай бұрын
if you do it in Colab what url do you use?
@JJTradess
@JJTradess 8 ай бұрын
🔥
@roberthenry7283
@roberthenry7283 Ай бұрын
where is the source code
@AGASTRONICS
@AGASTRONICS 8 ай бұрын
comments[-1] #FirstComment😅
I Analyzed My Finance With Local LLMs
17:51
Thu Vu data analytics
Рет қаралды 475 М.
Using Ollama To Build a FULLY LOCAL "ChatGPT Clone"
11:17
Matthew Berman
Рет қаралды 250 М.
WILL IT BURST?
00:31
Natan por Aí
Рет қаралды 44 МЛН
Bike vs Super Bike Fast Challenge
00:30
Russo
Рет қаралды 23 МЛН
Фейковый воришка 😂
00:51
КАРЕНА МАКАРЕНА
Рет қаралды 6 МЛН
Deploying Machine Learning Models - Full Guide
22:21
NeuralNine
Рет қаралды 6 М.
What are LLM Embeddings ?
6:44
New Machina
Рет қаралды 1,8 М.
RAG from the Ground Up with Python and Ollama
15:32
Decoder
Рет қаралды 30 М.
"okay, but I want Llama 3 for my specific use case" - Here's how
24:20
How To Connect Local LLMs to CrewAI [Ollama, Llama2, Mistral]
25:07
codewithbrandon
Рет қаралды 69 М.
DuckDB in Python - The Next Pandas Killer?
19:32
NeuralNine
Рет қаралды 24 М.
Unlimited AI Agents running locally with Ollama & AnythingLLM
15:21
Tim Carambat
Рет қаралды 127 М.
OpenAI Embeddings and Vector Databases Crash Course
18:41
Adrian Twarog
Рет қаралды 460 М.
WILL IT BURST?
00:31
Natan por Aí
Рет қаралды 44 МЛН