OpenAI Embeddings and Vector Databases Crash Course

  Рет қаралды 503,141

Adrian Twarog

Adrian Twarog

Күн бұрын

Пікірлер: 199
@photorealm
@photorealm 8 ай бұрын
That was the first video that actually gave me a understanding of how vector DB's kind of work. Thank you for sharing.
@goldenant9450
@goldenant9450 Ай бұрын
Key word being, "kind of" 😂😂
@nickfleming3719
@nickfleming3719 Жыл бұрын
That isn't a vector database. It's a relational database with vectors stored on a text column. In practice, you will have thousands of embeddings and performance will tank with this setup
@trevorbaier7072
@trevorbaier7072 Жыл бұрын
What's a more ideal solution for storing vectors?
@brookster7772
@brookster7772 Жыл бұрын
From my investigation, Redis is an excellent vector store to be used in both development and production especially when it’s a local Dockerized instance
@SussyBacca
@SussyBacca Жыл бұрын
Mongodb atlas is awesome for vectors. They have a new vector feature called knnbeta
@ParthSaneHD
@ParthSaneHD Жыл бұрын
Pinecone works too!
@amdenis
@amdenis 11 ай бұрын
You are correct, but you know that! It’s indexing is not fast enough for many serious AI projects, and its single threaded architecture does not scale. Under the hood there are many other non-vector legacy issues.
@SuperITPRO
@SuperITPRO Жыл бұрын
My ADHD normally overrides my concentration. Your tutorial pace, live coding, and narrative made me complete my 1st Open AI coded app - thank you!
@zzej
@zzej Жыл бұрын
Same
@nicholastroyandersen9505
@nicholastroyandersen9505 Жыл бұрын
Don't use ADHD as an excuse, it ain't no sickness, just personality. Take it and make it your best quality.
@ScherrerMadness
@ScherrerMadness Жыл бұрын
@@nicholastroyandersen9505it’s…. Not a personality, lmfao. It’s a very clear set of learning disabilities centered around working memory, executive function, and tuning out
@davidabellangarrido2056
@davidabellangarrido2056 Жыл бұрын
same and without knowing english
@pauls064
@pauls064 9 ай бұрын
@@nicholastroyandersen9505it’s literally a neurological condition that can be seen on scans and measured… ignorant comment
@aiadvantage
@aiadvantage Жыл бұрын
Super high quality video right here. Good job Adrian
@AdrianTwarog
@AdrianTwarog Жыл бұрын
Hey I've seen your stuff too, it's great, thanks for the nice words!
@adamduvick
@adamduvick Жыл бұрын
Let me see if I understand what’s going on here: 1) you have data you want to search semantically 2) you create a vector database capable of storing & querying data semantic search queries 3) you use OpenAI to process your data & convert it to vectors which can stored in your database 4) you store the data along with the OpenAI generated vectors 5) now you can search the data Is that all it is? I thought you were then going leverage this database to give chatgpt “long term memory” ( 0:20 ). What you’ve showed seems nice, but I don’t really see the point since most people/companies who have enough data that would need to be queried in this way would not be able to give it away to OpenAI to process.
@goldenant9450
@goldenant9450 Ай бұрын
what'd you mean "give it away to OpenAI" is everything shared with OpenAI accessible by the internal team or something? I'm pretty sure you can opt out of using your data to train their AI..at least that's the case with that chatbots.
@abijithpradeep7478
@abijithpradeep7478 Жыл бұрын
For those who already had an OpenAi account and you are facing an error while posting the HTTP request, its because your free credit has expired. You will have to add a payment method or createa new account to get free credits agin and then everything will work fine according to this tutorial.
@brookster7772
@brookster7772 Жыл бұрын
Bare metal, removing all higher level obstructions going right down to the core. I love it the best understanding of what embedding’s earlier that I have seen great job.
@tech.bharat18
@tech.bharat18 4 ай бұрын
This is by far the most easiest & concise explanation. Thanks for creating this video
@andrey20111988
@andrey20111988 6 ай бұрын
Also you can use in postman "Test", which can help you create a script to create a string with requested input and response data. Automate it! (If you need)
@codinginflow
@codinginflow Жыл бұрын
This was a great overview Adrian!
@MohamadBahri-h3k
@MohamadBahri-h3k Жыл бұрын
I have seen multiple tutorials, this is by far the best and most concise, great work man
@rasmuspiirtola4397
@rasmuspiirtola4397 Жыл бұрын
Rarely comment, but damn, you did a perfect job - I am at 8:01, haven't watched the video but had to pause and comment - until 8:01, everything was perfect; how you explain concepts and utilize tools ensures that we understand the concept in practice with ease! Great job, continue making videos; you should do consulting if you don't already do so. It's easy money with little hours with your skills and knowledge!
@nickkondoori7550
@nickkondoori7550 Жыл бұрын
Incredible teaching skills. First time ever, I loved someone who can teach "ME" the way I always wanted. Thousand thumbs up Adrian!!
@curtisblake261
@curtisblake261 11 ай бұрын
I like this video and I don't mind all the upselling. My only complaint is that if I pause the video for too long, it automatically sends me to another video in the series, which makes it hard to get back to where I was. You might assume it is user error, but it isn't. The automatic transferal and loss of context happens constantly with this KZbin video, and I've never had the problem with any other KZbin tutorial. I'm fine with the monetizing and upselling since it helps reward the content creator, I just wish it wouldn't keep making me lose my place in the tutorial.
@CodexCommunity
@CodexCommunity Жыл бұрын
This is the best video on openai embeddings I have ever seen, I am also a bit biased!
@LeoCB
@LeoCB Жыл бұрын
I just bought 2 Udemy courses, and after 5 hours, none of them talk so well about this. I appreciate it, and I will buy your book. Thanks for your content.
@AbhinavKumar-jt8kx
@AbhinavKumar-jt8kx 4 ай бұрын
This is awesome, perfect video for non-beginner developers to quickly grasp.
@anrk97
@anrk97 Жыл бұрын
Love your thumbnails. Keeps getting better with each video 👍
@AdrianTwarog
@AdrianTwarog Жыл бұрын
Thanks, I try to make them as clear to what they video represents as possible!
@Glow0110
@Glow0110 Жыл бұрын
Would be great to see a follow up video of practical applications using this.
@atursams5501
@atursams5501 Жыл бұрын
The practical application are varied: sentiment analysis term search Classification
@krisograbek
@krisograbek Жыл бұрын
Adrian, your channel is a gem! I love the way you explain complex topics and the pace of your videos! Greetings from Poland!
@pajisounds
@pajisounds Жыл бұрын
Nice video, it would have been nice with a demonstration at the end or intro, keep up the good work.
@AdrianTwarog
@AdrianTwarog Жыл бұрын
Oh good suggestion, I’ll do that next time!!
@satish1012
@satish1012 3 ай бұрын
Great So bascially if i have create LLM for my company who has multiple documents , content i need to do it 1. Pass all the documents and get Embeddings from OPEN AI 2. Store all the Embeddings in a DB 3. Create an app to to search vector DB But my question is how it can think and reason. The above approach has great for search capability but how it think like Summaration , comprehension etc
@RajShekarsdreamzzz
@RajShekarsdreamzzz Жыл бұрын
Very Good session Adrain... your way of teaching is keeping the people glued... Keep it up
@MaverickCoder-mz6hp
@MaverickCoder-mz6hp 5 ай бұрын
Nice high quality video with clear explanation of concepts. This video is engaging for learners. I would say one of the best videos out there on vector embeddings. Good Job Adrian
@karthikg752
@karthikg752 Жыл бұрын
The voice recording and explanation is really clear - surprising how tone and voice plays a major role in understanding. Was watching another video which was equally good but somehow the slang and recording made it a bit difficult to understand. Thanks
@Danimsalinas
@Danimsalinas 2 ай бұрын
Omg, thanks for this video, very straight forward and easy to understand. Thanks!
@LindsayHiebert
@LindsayHiebert Жыл бұрын
Excellent overview! Very concise, clear and relevant! Great job! Thank you Adrian! 😊
@dipayanroy964
@dipayanroy964 7 ай бұрын
I wish everyone could have presented like you, simply Super. Looking forward for more in similar way
@araujoao
@araujoao 2 ай бұрын
Thanks for Sharing. This was a great video that clearly illustrate vectorsdb, embeddings, and searching.
@FahadKiani1
@FahadKiani1 Жыл бұрын
Will you create a second part of this video where PDF's are uploaded and then analyzed?
@AmanBansil
@AmanBansil 9 ай бұрын
Absolutely LOVE this. you're so clear and concise.
@munishtyro
@munishtyro 5 ай бұрын
Simple, concise, and has everything in it. Thank You
@phil97n
@phil97n Жыл бұрын
Awesome thanks. Been studying calculus and linear algebra before I dive deep into AI. I will definitely be dealing with vector databases very soon and looking forward to it.
@daygo619ca
@daygo619ca Жыл бұрын
This tutorial was incredible - completely glued to it
@kfliden
@kfliden 7 ай бұрын
Wow, thanks I'm finally starting to get embeddings!
@defaultdefault812
@defaultdefault812 Жыл бұрын
Well done, Langchain already exists...
@ravindrasingh2411
@ravindrasingh2411 9 ай бұрын
Adrian, this is beautifully explained. Absolutely loved it :)
@meirgoldenberg5638
@meirgoldenberg5638 Жыл бұрын
How in the world did it get 0.74 score (which pretty high on the scale for 0 to 1!) for the similarity of "Hello Earth" and "OpenAI vectors and embeddings are easy"? Is there anything in common between the two?
@JeremyArtero
@JeremyArtero Жыл бұрын
This course is gold! Thanks! I have done similar steps on Astra db and it was smooth
@coinexponent1884
@coinexponent1884 Жыл бұрын
Learn vector embeddings using first principles. Always engaging, and very rewarding for the learner. Thank you!
@chrislannon
@chrislannon 9 ай бұрын
Nice work! Thanks so much for this awesome demo.
@gman2036
@gman2036 9 ай бұрын
Loved this tutorial Adrian, very straight forward and it worked the first time not like some others I've tried. Now for my question. I'm seeing this on February 2024. I did not know CHATGPT, BARD and those other AI apps until they hit the common pool that I must swim in. I take it that vectoring documents has been going on for awhile, outside of the math world. I knew of vectoring back from college in linear algebra. If this is the case, what I'm trying to do will not be new. I'm trying to vectorize my documents in order to practice doing this kind of work. So, are there IT companies out there doing this type of work already and can you name a few? How far have they gotten? Has someone already done the library of Congress for instance?
@oscargalvez7
@oscargalvez7 Жыл бұрын
Amazing tutorial! The way you explain is so easy and understandable!
@MDMUNIFHASAN-sr2jk
@MDMUNIFHASAN-sr2jk Ай бұрын
nice tutorial, i have a question for code completion which extension you use?
@mohamadchahin3040
@mohamadchahin3040 Жыл бұрын
Bro do your research first before saying only open ai has a great way to do embeddings theres a million libraries that exactly do that from sbert to fast text to glove etc... embeddings are an old concept
@ZaidKhanPathan
@ZaidKhanPathan Жыл бұрын
Wow! Easy, clear and to the point.
@ismailm123
@ismailm123 2 ай бұрын
Brilliant super simple and very easy to understand.
@cmdrls212
@cmdrls212 6 ай бұрын
This is great. I had to learn this in a crunch and I grok it now.
@edoson01
@edoson01 Жыл бұрын
Blows my mind you've spent 18m talking about the How and 30sec about the why and what.
@Art-kz6zf
@Art-kz6zf Жыл бұрын
How efficient is the vector search if you need to go through all of the records every time you search? Shouldn't there be some dedicated field type for embeddings other than blob?
@rkjellbe
@rkjellbe Жыл бұрын
Finally, found a video with the appropriate detail. For me! 😊 Thank you!
@chrismalingshu
@chrismalingshu Жыл бұрын
[Question] When input hello earth, "Hello World" scored 0.89, meanwhile "OpenAI Vectors and Embeedings are Easy!" scored 0.74. Which is quite close to the top rank text. But syntactically first and second returned text are very different. Somehow I expect the second text might scored 0.5 and below. Could you please share your thoughts on this Adrian? Thank you!
@daffertube
@daffertube Жыл бұрын
You would need to ask someone who built the transformers at openai.
@adavis912
@adavis912 10 ай бұрын
Great tutorial!!! I will be buying your book.
@GenZManhood
@GenZManhood Жыл бұрын
I get this message when I run the API. Do you need to pay OpenAI for it to work? Thanks! "error": {"message": "You exceeded your current quota, please check your plan and billing details.",
@יסמיןאיילין
@יסמיןאיילין Жыл бұрын
Me too, you found a solution?
@saik6730
@saik6730 Жыл бұрын
Best AI video ever . Made it easy to understand with 2 simple concepts . Thanks man!
@robertcormia7970
@robertcormia7970 11 ай бұрын
Well done, succinct, and excellent explainations of complex topics.
@atursams5501
@atursams5501 Жыл бұрын
Great work! How do you make these nice presentations with the fancy arrows?
@mohammadbarzegari8737
@mohammadbarzegari8737 Жыл бұрын
Perfect learning ❤🎉 master of learning ❤❤❤❤
@pablochacon7641
@pablochacon7641 Жыл бұрын
Very interesting video, but what are the prerequisites to understand & actually implement this ?
@satanrasool1802
@satanrasool1802 Жыл бұрын
Love it.. it was far simpler than I thought..
@grantomohundro3298
@grantomohundro3298 2 ай бұрын
Great tutorial man! thank you!
@MikevanDam-j9g
@MikevanDam-j9g 6 ай бұрын
This tutorial is well explained. Thanks for that. But could you explain how to do this on scale? Is it possible to have a no code tool that companies can use to store their data in a vector database? Also, retrieving this info later? It seems that there must be easier solutions for this right? (while also keeping it safe to use).
@sany2k8
@sany2k8 Жыл бұрын
Great content 👍👍👍, waiting for more OpenAI, AI related content
@matickovac
@matickovac 7 ай бұрын
Great work presenting this! Do you happen to know how similar or different this is from what Elasticsearch does when performing full-text search?
@aaronster
@aaronster 26 күн бұрын
I have followed along to 15 minutes so far. How come the scores are all fairly high, even when the search terms are not present in the database? My database included a quote from HHGG about the important of towels to hitch hikers. There were also two other rows of data containing no mention of towels. When I searched for the word "towel" the top match was 85% because it was the quote that contained the towel reference. Great! But I don't understand why the other scores were 75% and 73% though there was no mention of towels. If this was a traditional text search, those rows would not have been returned at all.
@karsonkalt7607
@karsonkalt7607 Жыл бұрын
Fantastic tutorial and explanation!!
@nadershalabi6241
@nadershalabi6241 6 ай бұрын
Thank you! Great walk through
@demetriusmds
@demetriusmds Жыл бұрын
Excellent. Thank you. Helped a lot.
@noubgaemer1044
@noubgaemer1044 10 ай бұрын
thanks for the tutorial can we use our own LLM like private GPT or Text-generation Web UI instead of OPENAI
@aillusions
@aillusions 2 ай бұрын
interesting that expected vector array was places into SELECT section rather than WHERE section
@SimonCicero-g8n
@SimonCicero-g8n Жыл бұрын
Perfect explaination!
@joostschuur
@joostschuur Жыл бұрын
How would I go about weighing the results by other meta data? Say I have a bunch of videos, and I'm searching the title/description, but want to give some amount of preference to newer videos too.
@KJB-Man
@KJB-Man Жыл бұрын
OK, it is a good video on using OpenAI to create embedding via an API. But lets say, next week. Open AI's building are destroyed by a meteor! Now, I still want to create embeddings on my dev server. Is there a software I can download and run locally that I can use until a meteor crashed on my house?
@sunnysk43
@sunnysk43 Жыл бұрын
Absolutely amazing! Thank you so much for your work!
@alexsalgado
@alexsalgado Жыл бұрын
Excellent content, what changes for audio search?
@Aayush-k3d
@Aayush-k3d 4 ай бұрын
Very well explained
@Ricocase
@Ricocase Жыл бұрын
Cool course. How does one connect it to a basic website?
@PeterAdiSaputro
@PeterAdiSaputro Жыл бұрын
In the past, I learned Support Vector Machines for doing classification. At that time, I struggled to learn the concept, although I finally was able to implement it into a program using codes made by another party. The introduction of this video suddenly revived the memory and helped me better understand the concept of SVMs that I learned years ago. Is Postman completely free and can be used without any restrictions or limitations ? Is Single Store also completely free without any restrictions or limitations ?
@Dydent10
@Dydent10 4 ай бұрын
Brilliant stuff!
@cliffordmwale5782
@cliffordmwale5782 Жыл бұрын
This is very useful. Could you also do embedings of CSV files? I have files amounting up to 5 million rows
@amdenis
@amdenis 11 ай бұрын
Redis is viable for certain scale-limited AI applications, but DL indexing can take 10’s to 100’s of times longer than a true multi-threaded vectorized db, because its architecture is limited to only a single thread execution, making it impossible to scale. However, I would hope that most serious AI developers know enough or would do a lot more research than following the advice of some random YT video. Sadly, AI education is still fairly limited as institutions are still trying to catch up- and even then, there is a lot for new grads to learn before they start to reach a real, productive stride.
@RiazSyed-n1x
@RiazSyed-n1x 11 ай бұрын
great explanation ! thanks !!
@e-Course.
@e-Course. Ай бұрын
Very interesting video , thank you
@MRGCProductions20996
@MRGCProductions20996 9 ай бұрын
isnt calculating the modulus of the subtraction of the vectors a more accurate way to find similarities?
@m67esteban
@m67esteban Ай бұрын
thank you very much! super useful!
@haissayf
@haissayf 8 ай бұрын
Need I mention that the hardest part by far is to get those pdfs nicely formatted for embedding. It's literally impossible, which makes the thumbnail fairly misleading.😒
@pranavkm4513
@pranavkm4513 Жыл бұрын
wow great video sir. Helped a lot. may i know what extension is being used in 16:40 ?
@AdrianTwarog
@AdrianTwarog Жыл бұрын
GitHub Copilot
@corejava5730
@corejava5730 Жыл бұрын
Very well explained, Thanks Adrian !! I have astaffing firm and I have a database of more than a million resumes. I m planning to create a resume search application for my recruiters. Do you think I should be using combination of Embeddings and Vector Database for above use case.
@BryanChance
@BryanChance 11 ай бұрын
Does the chuck size have an affect on the quality or accuracy of the search result? Let's say I split a document into words AND in 200 word chucks. The vector results are stored in a vector db.
@nrusimha11
@nrusimha11 Жыл бұрын
Crisp and to the point, thank you. Can I ask how you made the slides like the one at 0:52?
@fkxfkx
@fkxfkx Жыл бұрын
Bought the book. It ended on page 54, is there anything after 54 to 58? Last example was open ai fine tuning. It leaves the ft up on open ai site. How long will it be available there? Can it be brought down locally and be used in the future as local in combination with cloud model?
@AdrianTwarog
@AdrianTwarog Жыл бұрын
I’ll double check, and any updates will automatically be enabled on Gumroad!
@Ricocase
@Ricocase Жыл бұрын
​@@AdrianTwaroghow to automate text importation with sql? Must one enter each text blob manually?
@MannyBernabe
@MannyBernabe 9 ай бұрын
excellent. thx!
@ewhite_dipi
@ewhite_dipi Жыл бұрын
what are the prerequisites to understand the content in this video? And where can I learn them?
@xspydazx
@xspydazx 7 ай бұрын
yes but how do you save a vector store ? ie export it to json for upload or finetuning into the main lm ?
@Joshua.Medellin
@Joshua.Medellin Жыл бұрын
I'm a little confused.. If I created embeddings and which I'm assuming is essentially training the openai model on a specific topic for my company. Would it be able to answer questions only on the specific topic it was trained for?
@psyduck4763
@psyduck4763 Жыл бұрын
Hey man, what are those fonts you've used in this video?
@oraculox
@oraculox 11 ай бұрын
What is the quickest way to feed recognition or pattern braking data into the system?. Or just lower the AI endorphine levels hahaha.
@sivakumarkalaiselvan6831
@sivakumarkalaiselvan6831 Жыл бұрын
Hi Bro, What is the extension u used in the vs code for the code suggestions?
@omangramoswaane2211
@omangramoswaane2211 Жыл бұрын
Nice video. I love your work.
@mustafadanishmand
@mustafadanishmand 11 ай бұрын
follow up video
How to integrate OpenAI GPT3 with a Databases - Crash Course
16:49
Adrian Twarog
Рет қаралды 300 М.
How To Choose Mac N Cheese Date Night.. 🧀
00:58
Jojo Sim
Рет қаралды 105 МЛН
Thank you Santa
00:13
Nadir Show
Рет қаралды 46 МЛН
Кто круче, как думаешь?
00:44
МЯТНАЯ ФАНТА
Рет қаралды 6 МЛН
One day.. 🙌
00:33
Celine Dept
Рет қаралды 54 МЛН
Vector Databases simply explained! (Embeddings & Indexes)
4:23
AssemblyAI
Рет қаралды 357 М.
Chat with your PDF Using Ollama Llama3 - RAG
6:18
Sanjjushri Varshini
Рет қаралды 6 М.
$0 Embeddings (OpenAI vs. free & open source)
1:24:42
Rabbit Hole Syndrome
Рет қаралды 269 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 3,8 МЛН
OpenAI Assistants API - Course for Beginners
3:32:55
freeCodeCamp.org
Рет қаралды 549 М.
How to Improve LLMs with RAG (Overview + Python Code)
21:41
Shaw Talebi
Рет қаралды 78 М.
Run ALL Your AI Locally in Minutes (LLMs, RAG, and more)
20:19
Cole Medin
Рет қаралды 260 М.
What are AI Agents?
12:29
IBM Technology
Рет қаралды 721 М.
How To Choose Mac N Cheese Date Night.. 🧀
00:58
Jojo Sim
Рет қаралды 105 МЛН