Why are vector databases so FAST?

20,062 views

Underfitted

Comments
@farrael004 6 months ago
Underfitted? More like Underrated.
@MrTulufan 4 months ago
The actual discussion about vector databases starts at 14:45. Before that, it is just a review of embeddings and the RAG framework.
@ilyesbejia6566 3 months ago
Very useful comment, thanks.
@JHBG1971 6 months ago
How do you only have 40k followers? Amazing content. Been looking for this for over a year. Thank you!
@ah89971 6 months ago
You are great. The people who follow you are the ones who care about understanding the root concepts, which is rare nowadays because everyone is copying and pasting without understanding.
@deroace 1 month ago
so true
@tee_iam78 5 months ago
He is absolutely right. Unless you take a course on vector databases, it is not easy to find material on 'how a vector database works at a low level'. Thank you for your content.
@PuerinTheHunter 6 months ago
Hey Santiago, keep going with your choice of shirts!
@underfitted 6 months ago
That's the plan!
@dorins3787 6 months ago
Thanks!
@underfitted 6 months ago
Thanks
@vinj98 6 months ago
This video, from its content to your performance, is fantastic.
@underfitted 6 months ago
Thanks!
@LiebsterFeind 6 months ago
Wonderful video. Any chance of a video comparing HNSW vs Faiss vs Annoy?
@toddroloff93 6 months ago
Thanks for the lesson. Always good to understand how things are getting done in the background. Great explanation!!
@emrahe468 6 months ago
@22:22 this really helps in understanding the efficiency of vector search algorithms, and the drawing reminds me of SVM borders/boundaries. By the way, great shirt! :)
@underfitted 6 months ago
Love the shirt 👕
@oseteg 6 months ago
Thanks a lot, Santiago! You are one of two authors I follow on YouTube and mainly on LinkedIn. The content is pure gold. My question is about the serverless thing. You provide the cloud and region but don't provide your AWS credentials. Does that mean it is free? As far as I understood, the cloud provider in this case is used to store the data. What if we don't delete the database at the end? Will we get billed for storing the db?
@justindressler5992 6 months ago
Thank you for explaining this. I had the intuition that this is how the indexing worked, via clustering, but you helped crystallise my thoughts on it. One thing I think might have been missed is that the trigonometric functions used, like cosine, take into account the direction of the vector towards the next cluster. So the cosine function uses the vectors like a compass. When grouping the vectors, you're quantizing, or approximating, all related vectors to the centroid. That obviously reduces accuracy, because you're not pointing to the exact point in the cluster but to the centre. How are the results selected? Is there an attempt to re-search the selected related records using the original vector, or is it simply random selection?
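On the question of how results are picked inside a cluster: inverted-file (IVF) style indexes commonly do not pick at random. The query is compared with the centroids first, and then the original vectors stored in the closest cells are scored exactly against the query. What follows is a minimal sketch of that two-step idea, not the implementation shown in the video or used by any specific database. The 2-D vectors, the hand-picked centroids, and the names `build_index`, `search`, and `nprobe` are all illustrative assumptions.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def build_index(vectors, centroids):
    """Assign each vector id to the cell of its most similar centroid."""
    cells = {c: [] for c in range(len(centroids))}
    for vid, vec in enumerate(vectors):
        best = max(range(len(centroids)), key=lambda c: cosine(vec, centroids[c]))
        cells[best].append(vid)
    return cells


def search(query, vectors, centroids, cells, k=2, nprobe=1):
    """Return the ids of the k best matches found in the nprobe closest cells."""
    # Coarse step: rank centroids by similarity to the query.
    ranked = sorted(
        range(len(centroids)),
        key=lambda c: cosine(query, centroids[c]),
        reverse=True,
    )
    # Fine step: score the ORIGINAL vectors in the probed cells exactly.
    candidates = [vid for c in ranked[:nprobe] for vid in cells[c]]
    candidates.sort(key=lambda vid: cosine(query, vectors[vid]), reverse=True)
    return candidates[:k]


# Toy data (assumption): two obvious clusters, one per axis.
vectors = [
    [1.0, 0.1], [0.9, 0.2], [0.8, 0.3],
    [0.1, 1.0], [0.2, 0.9], [0.3, 0.8],
]
# Hand-picked centroids (assumption); k-means would normally supply these.
centroids = [[0.9, 0.2], [0.2, 0.9]]

cells = build_index(vectors, centroids)
result = search([0.85, 0.25], vectors, centroids, cells, k=2, nprobe=1)
print(cells)
print(result)
```

The approximation comes from the coarse step only: a true nearest neighbour sitting in a cell that was not probed is missed, which is why raising `nprobe` trades speed for recall.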
@mahmoudelaskare4982 4 months ago
Great video as usual, love the energy 👏🏻❤
@DarkRaviForDeath 6 months ago
top tier content as always
@ernestuz 6 months ago
In many ways, when you calculate the embeddings, and you reduce a fragment of data to a single vector, you are calculating a kind of hash.
@carterthaxton 6 months ago
Yes, like a hash, it’s a compression and normalization of the data into a short common form. But better than a hash, because it’s comparable in multiple dimensions.
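A toy illustration of the point in this thread, under stated assumptions: the three-dimensional "embeddings" below are hand-written stand-ins (a real embedding model would produce them), and the variable names are illustrative. A cryptographic hash can only say whether two inputs are identical, while vectors can be compared by degree of similarity.

```python
import hashlib
import math


def cosine(a, b):
    """Cosine similarity: close to 1.0 means nearly the same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def sha256_hex(text):
    """Hex digest of the UTF-8 bytes of text."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


# Hypothetical hand-written vectors, NOT output from a real model.
toy_embeddings = {
    "cat": [0.9, 0.8, 0.1],
    "kitten": [0.85, 0.75, 0.2],
    "car": [0.1, 0.2, 0.95],
}

# Hashes: a one-letter change yields an unrelated digest, so the only
# meaningful comparison is equal / not equal.
hash_cat = sha256_hex("cat")
hash_cats = sha256_hex("cats")
print("hashes equal:", hash_cat == hash_cats)

# Vectors: similarity is graded, so "how close?" has an answer.
sim_cat_kitten = cosine(toy_embeddings["cat"], toy_embeddings["kitten"])
sim_cat_car = cosine(toy_embeddings["cat"], toy_embeddings["car"])
print("cat vs kitten:", round(sim_cat_kitten, 3))
print("cat vs car:", round(sim_cat_car, 3))
```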
@nachoeigu 6 months ago
Thank you very much for this amazing content!! It is so educational :)
@hasnainahmed7605 1 month ago
Ahan!! How lucky these 48k subscribers are... :) BTW, you look nice in this shirt!
@KumR 6 months ago
Nice... Can you do one on graph databases too?
@michaelduffy5309 6 months ago
Beautifully done.
@uwegenosdude 6 months ago
Thanks for the cool video; it helped me understand this topic better. If I do not want to put my data into a cloud, what other vector db could you recommend? ChromaDB?
@vedant_stone 6 months ago
There's Weaviate, which is also open source, I believe.
@vedant_stone 6 months ago
There's Convex as well.
@riemannderakhshan1037 6 months ago
I have nothing to say other than that you are fantastic!
@alextiger548 6 months ago
Fantastic stuff. Thank you so much.
@collinvelarde7473 4 months ago
This was awesome. Thanks, big guy.
@nope9310 6 months ago
"ok so I'm going to execute this" "BOOM it's just that fast!" Really?... Really?... You added a cut between those two sentences? I'm hoping this was unintentional. (Thankfully the next search didn't have a cut.) Great video otherwise. I'd love to see you dive into the actual indexing, though, so we can actually see how it works. This was quite high level.
@underfitted 6 months ago
Sorry, the cut was unintentional. My goal is to show how to build things, not how fast the tech is, because that won’t be relevant on your own hardware.
@kpm25 4 months ago
Thanks a lot, subscribed
@lokeshsharma4177 6 months ago
Awesome as always. I live in Florida as well. What are my chances of meeting you in person, AND how did you automate your responses to all the comments you get as ♥ !!!!!! Please write something as well 🙂
@underfitted 6 months ago
No automation. The YouTube Studio app on my phone gives me the option to ❤️ replies. 😃
@delvoneu 5 months ago
Love the shirt, where did you buy it?
@underfitted 5 months ago
Can’t remember. Probably Dillard’s
@johnmarshall4_ 5 months ago
Thank you for this
@domineia 6 months ago
Amazing content
@rally_furymoments5294 6 months ago
This guy is creating amazing content and has only 40k subscribers??
@underfitted 6 months ago
Step by step
@DataPains 6 months ago
Awesome!
@Vivek2062 3 months ago
I didn't know Adam Sandler is a vector DB nerd!
@deroace 1 month ago
Wonderful video. I'm trying to make an AI personality with vector databases. Let's hope I get an idea of how to build it using the information from the video 😅
@shahjahanmirza1616 6 months ago
I'm sad that you don't have any paid courses. I'd buy any of your AI courses.
@Drackomass 6 months ago
I like the shirt.
@sorin202 1 day ago
You've correctly pointed out that you don't understand how OpenAI works. You're also questioning whether it's definitely powered by a quantum chip.
@AtomicPixels 6 months ago
Vector DBs have ML indexing built in, ha
@jtmuzix 6 months ago
Linear algebra. Orientation vs magnitude.
@damonguzman 6 months ago
You didn’t explain the answer.
@raunaquepatra3966 6 months ago
You could compress this into a 10-minute video; it would be a lot better.
@underfitted 6 months ago
Yup. Still learning how to do that.
@dorins3787 6 months ago
Not for a 5-year-old to understand. The content is very good because it takes you from zero and gradually raises the technical level. It is one of the best I have found.
@ricardomahfoud 6 months ago
I disagree. To make the video 10 minutes, a lot of the information would have to be either cut or simplified. I like having longer videos that I can watch at 1.5/2x to get the best of both worlds. What makes this channel valuable to me is the fact that it is not just a 10-minute surface explanation, but an in-depth technical explanation.
@AgustinCaniglia1992 6 months ago
Hi
@johnini 6 months ago
The shirt is okay, and the content overall is good. However, the video could have been shorter, 15 minutes. It felt too redundant, with not-so-useful examples. There was no need to include an example of Voronoi diagrams for cities. Maybe I am not the target viewer of your content. For now I will follow :)
@underfitted 6 months ago
Be honest: The shirt is awesome!
@johnini 6 months ago
@underfitted I love your charisma! The shirt is awesome! Still following and looking forward to your new video!
@blindConjecture 6 months ago
This was WAY too much background context. You have to think about who your audience is here. If we're interested in knowing the inner workings of fast vector database lookups, it's because we already know the basics like "what is a vector" and "how do you load a csv file in Python". I gave up watching after 15 minutes because the video still hadn't even begun explaining anything about vector databases.
@underfitted 6 months ago
Thanks for the feedback!
@nope9310 6 months ago
There are other videos that explain that.
@cherniaktamir612 6 months ago
why are you angry?
@teaman7v 6 months ago
He's foreign, not angry. Common mistake.
@underfitted 6 months ago
I’m actually a very happy person.
@cosmicaug 6 months ago
@teaman7v, you wouldn't like him when he's angry.
@Hlfe0 6 months ago
I like his style, he is passionate about what he shares 😁♥️
@subusrable 5 months ago
is he? isn't it just his way of passionately explaining?
@timwake5830 6 months ago
Two minutes, no info. Done with you.
@mrsesh7364 6 months ago
@jamesbriggs has great videos