5. OpenAI Embeddings API - Searching Financial Documents

  Рет қаралды 138,640

Part Time Larry

Part Time Larry

Күн бұрын

Пікірлер: 159
@parttimelarry
@parttimelarry 2 жыл бұрын
Like the video? Support my content by checking out Interactive Brokers using the link below: www.interactivebrokers.com/mkt/?src=ptlPY1&url=%2Fen%2Findex.php%3Ff%3D1338 I will be starting a spinoff channel on AI in music, art, and gaming in 2023. Subscribe at: youtube.com/@parttimeai Buy Me a Coffee: buymeacoffee.com/parttimelarry Notebook: colab.research.google.com/drive/1tttDqgnWL9yJtmlOFXJqA-BjQ1Pyfpax?usp=sharing This notebook is based on the embeddings examples in the OpenAI Cookbook: github.com/openai/openai-cookbook
@jonconnor6697
@jonconnor6697 Жыл бұрын
say vector one more time :)
@SwingingInTheHood
@SwingingInTheHood 2 жыл бұрын
Not being an AI guy, I've been trying to understand embeddings and search vectors for the past two weeks. This is the first video I have found that explains it so that even I now get it. Whew! Thank you Brother!
@hishamafzal1999
@hishamafzal1999 Жыл бұрын
Can you explain me bro?
@erikstillman7336
@erikstillman7336 Жыл бұрын
Thanks!
@alanfeng99
@alanfeng99 Жыл бұрын
Extremely well presented of how embeddings work!! I "kind of" knowing how it works, but after watching this video, I can say I "know" how it works! Thanks.
@robertcormia7970
@robertcormia7970 Жыл бұрын
This is outstanding, his clarity and enthusiasm is amazing, real passion for this topic!
@SumedhKadoo
@SumedhKadoo Жыл бұрын
Thanks for sharing, you are a good teacher. Can you please share how much did this cost in terms of API usage?
@PatrickAlphaC
@PatrickAlphaC Жыл бұрын
Dank. Love your content dude.
@parttimelarry
@parttimelarry Жыл бұрын
Likewise man, been awesome to see you kill it over the years!
@Guy-Scott
@Guy-Scott Жыл бұрын
I appreciate the work you have put into this series. I like how you start small, with a few examples and build from there. Great job.
@PhilMcCarty
@PhilMcCarty 2 жыл бұрын
This is a really great tutorial, thank you.
@parttimelarry
@parttimelarry 2 жыл бұрын
Thanks so much! Really appreciate it.
@deepster5150
@deepster5150 2 жыл бұрын
How do you buy a drink and get that logo you have?
@PhilMcCarty
@PhilMcCarty 2 жыл бұрын
@@deepster5150 On my screen to the right of "Share" there's a "Thanks" button. It's probably a KZbin red or whatever the paid thing is feature.
@soesevenonesix
@soesevenonesix Жыл бұрын
Another great Video Larry! Thanks for sharing!
@wmchoi1474
@wmchoi1474 4 ай бұрын
Thanks for this amazing tutorial. Great video dude
@Raja-jo5dm
@Raja-jo5dm Жыл бұрын
Nice one larry easy to understand : )
@blender_wiki
@blender_wiki 2 жыл бұрын
Your contents are surpassing the simple high economic value and make the world a better place. 🙏🏾🙏🏽🙏🏼🙏🏻🙏🙏🏿
@rengavasan6245
@rengavasan6245 Жыл бұрын
This is the best explanation Ive come across about embeddings anywhere on the internet. Great work Larry. Subscribed :)
@jay-d-x
@jay-d-x Жыл бұрын
Excellent and simple. Thanks Larry.
@davidw8668
@davidw8668 2 жыл бұрын
Your recent video quality again has come in higher than expected - despite high priors, sir😁 damn good teaching and funny. Thanks man!
@persas1683
@persas1683 Жыл бұрын
Your tutorial help me alot. Thank you 🍾
@iamaguest2
@iamaguest2 Жыл бұрын
very much appreciated for the simple introduction. Very good at explaination of something that is not easy to understand, great job
@mariusirgens5555
@mariusirgens5555 Жыл бұрын
The espresso-milk -> latte example blew my mind 😄
@sivalokesh3997
@sivalokesh3997 Жыл бұрын
I searched for subscribe button then
@Jberv
@Jberv Жыл бұрын
Man! I just watched your video like its 5 minute length, very detailed. It took me so long to find this kind of sample/explaination. Thanks
@parttimelarry
@parttimelarry Жыл бұрын
Thanks, this video wasn't very popular when I first put it out, but I think people appreciate it now :)
@karansajnani1135
@karansajnani1135 Жыл бұрын
This is by far, the best video explaining embeddings! Thank you
@kenchang3456
@kenchang3456 Жыл бұрын
Thanks, Larry for a great video and explanation. Being an old former developer I am so fascinated with these LLMs, and your explanation has helped me organize how I will do a POC. I really appreciate you sharing your knowledge and enthusiasm.
@nedyalkokarabadzhakov5405
@nedyalkokarabadzhakov5405 Жыл бұрын
quality content deserve subscription. Thank you.
@rhemagenesis
@rhemagenesis Жыл бұрын
You came with the HEAT 🔥 🔥 🔥
@private_0
@private_0 2 жыл бұрын
Amazing video! Loving the series so far. Thank you for doing it!
@ThomasODuffy
@ThomasODuffy Жыл бұрын
Great video and explanation - thank you!
@seandiamond7983
@seandiamond7983 2 жыл бұрын
Bro you're the fkn man. I would have never known about this. I just created a search field in my Django website and it will be getting a massive upgrade now lol thanks again bro.
@lqvision
@lqvision Жыл бұрын
very patient explaination!
@munoncode
@munoncode Жыл бұрын
thank you so much for this tutorial.
@aecs71
@aecs71 Жыл бұрын
awesome video, was struggling to understand embeddings for a long time,, this video is superb
@carlhlazo5907
@carlhlazo5907 Жыл бұрын
Awesome tutorial. Using familiar concepts made it easy to follow. I was finally able to understand much better what these concepts are and how they all relate.
@thomastudu2977
@thomastudu2977 Жыл бұрын
This is exactly what I needed man, Thanks !!!
@ratral
@ratral Жыл бұрын
Thanks for the video; it is pretty straightforward and well-developed. 👍
@shanthu7803
@shanthu7803 2 жыл бұрын
Happy New Year! Great content learning a lot from you. Thank you so much for detailed explanations step by step guidance. Wish you all the best!
@rishabhpanwar8971
@rishabhpanwar8971 2 жыл бұрын
Superb content. Thanks!
@laura43400
@laura43400 Жыл бұрын
One of the best explanations out there!
@bakari007
@bakari007 2 жыл бұрын
Great Video ! Super interesting how this transformations works words to vectors ! I'm wondering how the architecture of the classifications works ! Thank you for the great quality of the video and the super interesting topic !! Don't stop !! 🥰
@testadrome
@testadrome 2 жыл бұрын
Awesome video! Keep them coming
@jessexing3456
@jessexing3456 Жыл бұрын
amazing video! how smooth it is!
@deep2mixer
@deep2mixer 2 жыл бұрын
Awesome. Disrupter R' Us. This is some high quality material.
@TomanswerAi
@TomanswerAi 2 жыл бұрын
Great video nicely explained
@businessdissection130
@businessdissection130 2 жыл бұрын
After weeks of lurking around, god showed me this video! Never going to regret this. This is exactly what I wanted. And I would say this was even better than the documentation on openAI, since it always gave me an error!
@parttimelarry
@parttimelarry 2 жыл бұрын
Thanks for the comments, glad this was helpful!
@HazemAzim
@HazemAzim 2 жыл бұрын
Very nice and soft explanation 👌 .
@Naijarianman
@Naijarianman 2 жыл бұрын
These videos are awesome, thanks a lot
@manuthvann7560
@manuthvann7560 Жыл бұрын
thanks for sharing such informative content , I've learned a lot from you, and Kepep doing a great job
@hb_dan2928
@hb_dan2928 2 жыл бұрын
thank you, really good and concise explanation
@iamnoone7415
@iamnoone7415 2 жыл бұрын
I find your tutorials on openai to be very informative and easy to follow. The instruction explained the concepts well and used examples to help illustrate the topics. Even for someone who does not have a coding background, I find the tutorials very engaging. However, I have a question about how to overcome the rate limit error.
@michaelwang6831
@michaelwang6831 Жыл бұрын
hi, have you ever been able to fix the rate limit error? I am facing the same issue..
@jskweres2
@jskweres2 Жыл бұрын
@@michaelwang6831 I have the same issue. Advice on fixing it?
@frutenfratenfriends
@frutenfratenfriends 2 жыл бұрын
This is impressive brother thanks
@serhatkoroglu6708
@serhatkoroglu6708 Жыл бұрын
you telling very clear thank you!
@MikeBtraveling
@MikeBtraveling Жыл бұрын
"The inflation is too damn hight", was great.
@processusAI
@processusAI 2 жыл бұрын
very well done video. Fit exactly what I wanted to find out
@psychurch
@psychurch Жыл бұрын
Thanks man, super clear explanations! Subscribed and excited to see what you’ve got coming up for music and ai ❤
@GonzaloRMDT
@GonzaloRMDT Жыл бұрын
Great job!
@marshall3900
@marshall3900 11 ай бұрын
Greate video!
@charyog7
@charyog7 2 жыл бұрын
thank you very much, it was very clear !
@patriciodiaz2377
@patriciodiaz2377 Жыл бұрын
Thanks so much! 🎉
@breakoutgaffe4027
@breakoutgaffe4027 Жыл бұрын
When it comes to Python tutorials Larry you are the 🐐
@undrash
@undrash 2 жыл бұрын
Amazing content, thanks
@omkarajagunde4175
@omkarajagunde4175 Жыл бұрын
Wow awesome man
@FRANKWHITE1996
@FRANKWHITE1996 2 жыл бұрын
thanks for sharing
@lastemperor1347
@lastemperor1347 2 жыл бұрын
Amazing!
@avbendre
@avbendre Жыл бұрын
you are amazing!
@Nashadelicable
@Nashadelicable 2 жыл бұрын
8:36 "this in the Matrix is how a hot dog looks like" XD
@utuberay007
@utuberay007 Жыл бұрын
Awesome..
@BenjaminRay
@BenjaminRay 2 жыл бұрын
Very cool!
@Malte.Research
@Malte.Research 2 жыл бұрын
Awesome, thanks !
@sachin-b8c4m
@sachin-b8c4m Жыл бұрын
very helpful thank you
@camuelcoleridge479
@camuelcoleridge479 2 жыл бұрын
@8:36 Part time Oracle is exposing the matrix 😂
@neelcansharma
@neelcansharma Жыл бұрын
Incredible video, you're a natural teacher. Do you know how embeddings is calculating these numerical vectors? Is it comparing to other items in their dataset?
@charlieevert7666
@charlieevert7666 2 жыл бұрын
Bad ass, thanks!!
@xiangyu326
@xiangyu326 Жыл бұрын
This is super.
@guillaumeberrido
@guillaumeberrido 2 жыл бұрын
Excellent! Next step, would be to store our data in a Vector DB like pinecone instead of a csv...
@user-wr4yl7tx3w
@user-wr4yl7tx3w 2 жыл бұрын
Would be interesting to see how such word embedding is even computed in the first place. possibly with a toy example.
@amanmeghrajani1
@amanmeghrajani1 Жыл бұрын
great vid
@kovidkasi6117
@kovidkasi6117 Жыл бұрын
Where you are creating embeddedings for text data, is it just embeddedings of each word in a text combined together or are you taking sentence embeddings?
@VeeraForBharat
@VeeraForBharat Жыл бұрын
For the last part of video, you just sounded like Jerome Powell. Great video to understand Embeddings API and how its useful for Cognitive Search. It looks like the Search term and every other sentence in the knowledge base has to go thru the Cosine Similarity check. Doing this on a small document is computationally ok but how does ChatGPT and others do this on Petabytes of data?
@MrYomantanepali
@MrYomantanepali 2 жыл бұрын
earned my subscription
@HazemAzim
@HazemAzim 2 жыл бұрын
One question is the effectiveness of embeddings varies with the size of the text? Embedding a word or phrase or sentence versus embedding a sizeable paragraph? Would this affect the accuracy ?
@chenpaul99
@chenpaul99 Жыл бұрын
Excellent demo! When using embeddings for training, how do I feedback the graded results to improve the model?
@michaeldebellis4202
@michaeldebellis4202 Жыл бұрын
When you do the dot product, doesn’t one vector need to be a column vector rather than two row vectors as in your example? My linear algebra is rusty so perhaps I’m missing something but as I recall you can only multiply (dot product) an M x N matrix with an N x M matrix(?)
@michaelwang6831
@michaelwang6831 Жыл бұрын
Question - for the Microsoft earning calls example, how do you segment the whole speech into different paragraphs as a unit to run through the AI algorithm? What is the logic? or you simply segment them based on comma and period punctuation?
@lucas.souza2022
@lucas.souza2022 Жыл бұрын
I would like to know that too, did you come to any conclusions?
@alexchanchan9359
@alexchanchan9359 2 жыл бұрын
Thank you very much. Can I ask why in the notebook, the earnings_df has not used ".apply(eval).apply(np.array)"?
@TommyInGame
@TommyInGame Жыл бұрын
This is so huge. Currently creating a “digital employee” product to limit human capital required for SMEs and startups, and this is a game changer. Would you be open to be a potential beta tester ? I’d love your feedback👍 great vid as always
@Konstantin19877
@Konstantin19877 Жыл бұрын
I wonder if embedding needs to be saved in one column and single record. So one huge chunk of data, where i can do vector search? When vector search is done it will return a swntence exactly how it was stored in db or it will compile a new based in similarities?
@orbitinggeek4000
@orbitinggeek4000 2 жыл бұрын
Thanks for the amazing video - so am I correct in understanding that even after that embeddings are generated, we still need the OpenAI API for inferences ?
@PLAYWW
@PLAYWW Жыл бұрын
thanks man
@lucawurker4911
@lucawurker4911 Жыл бұрын
How did you prepare the csv file? (How did you code the "rules" for new rows, and how did you get rid of the names of the speakers?
@forbiddenera
@forbiddenera Жыл бұрын
This is pretty powerful maybe, I did the automated support case this weekend from knowing nothing of embeddings to full responses with gpt in just a few hours (and chatgpt didn't even help on this one)
@nattyzaddy6555
@nattyzaddy6555 Жыл бұрын
Does the cosine_similarity api use openai credits? If it does, is there a similar library that doesn't use credits?
@krissn8111
@krissn8111 2 жыл бұрын
Man this damn interesting
@SumitKumar-yq1og
@SumitKumar-yq1og Жыл бұрын
Sup man, I tried to generate the fox example you showed. But, its throwing RateLimitError, even though I created a new API token and the acount is new. Please Let me know.
@nathanburley
@nathanburley 2 жыл бұрын
Question: do you know of a service which would allow me to upload pdfs to a directory, automatically get embeddings for the contents of each file as they're added, then allow semantic search for the embeddings? Preferably via API? Also, I love this video. It was super helpful in helping me get my head around embeddings as a concept.
@strkn25
@strkn25 2 жыл бұрын
What would be your use case? I'm developing a product that's very similar to what you described.
@nathanburley
@nathanburley 2 жыл бұрын
@@strkn25 technical service AI partner agent. Complex scientific instrumentation. I don't see anything around which can do this right now. I have ideas on how to trim the search based on the specific machines in question.
@mudassiraqeel-x3g
@mudassiraqeel-x3g Жыл бұрын
would you prefer this method, over fine-tuning gpt models ?
@arpankumarlahiri4180
@arpankumarlahiri4180 Жыл бұрын
Any way to sort this type of data, If my understanding is correct for large corpus , calculating distance for every sentence with respect to search keyword will grow linearly as the corpus scale?
@talal-az
@talal-az 2 жыл бұрын
Great content and video. Extremely valuable. One question though as I’m in the area of research. Is there a way to get the total similarity index score for a certain word? For example, let’s say I’m interested in doing an analysis on all the fed speeches and its relationship with bond markets by searching the term “high inflation” in all speeches and getting an overall similarity index total score for every speech that I can later use for regression purposes
@azuredominus5739
@azuredominus5739 2 жыл бұрын
You would just calculate the embeddeding for the each document then calculate the embedding for your word. Then rank each document based on the cosine similarities
@fxsignal1830
@fxsignal1830 Жыл бұрын
can I ask what are exactly your regression purposes? thank you very much
@wangjueliang
@wangjueliang 2 жыл бұрын
Thank you for another amazing video. Is this the answer to improving the intranet search results (built on SharePoint Online)?
@yamani3882
@yamani3882 2 жыл бұрын
Is there a similar library that we can use to locally vectorize the strings without having to constantly make openai calls?
@parttimelarry
@parttimelarry 2 жыл бұрын
For sure. Check out HuggingFace, will get to local and open source libraries in more detail after the OpenAI series.
@yamani3882
@yamani3882 2 жыл бұрын
@@parttimelarry Awesome, thank you!
@Jaybearno
@Jaybearno Жыл бұрын
How do you incorporate document structure into embeddings? I.e when I query relative documents for “chapter 3”, it returns all chunks contained in that section, regardless of text content
@myessmith9577
@myessmith9577 2 жыл бұрын
Could you do a video showing how you could build an API to pull out all the numbers in the financial statements (balance sheet, income statement and statement of cash flows) into excel or google sheets? Also would it be possible for have a program do this for every 10k published on the SEC website when they are published ?
@akshadawalunj7018
@akshadawalunj7018 Жыл бұрын
I have a use case: If graph/csv file is used as input. Using embedding we should get their relevancy as output. Can we implement this in nodejs?
6. OpenAI ChatGPT and DALL-E2 - Python Frontend App Development
29:28
Part Time Larry
Рет қаралды 31 М.
Python RAG Tutorial (with Local LLMs): AI For Your PDFs
21:33
pixegami
Рет қаралды 379 М.
Жездуха 41-серия
36:26
Million Show
Рет қаралды 5 МЛН
ССЫЛКА НА ИГРУ В КОММЕНТАХ #shorts
0:36
Паша Осадчий
Рет қаралды 8 МЛН
Непосредственно Каха: сумка
0:53
К-Media
Рет қаралды 12 МЛН
The new growth hack that I am adding to all of my apps (get more users)
8:07
Reinforcement Learning in 3 Hours | Full Course using Python
3:01:58
Nicholas Renotte
Рет қаралды 455 М.
OpenAI Embeddings and Vector Databases Crash Course
18:41
Adrian Twarog
Рет қаралды 542 М.
8. OpenAI Financial Advisor Q&A Embeddings - Python Tutorial
28:53
Part Time Larry
Рет қаралды 30 М.
[Webinar] How to Build a Modern Agentic System
1:00:55
Arthur
Рет қаралды 15 М.
AI Is Making You An Illiterate Programmer
27:22
ThePrimeTime
Рет қаралды 295 М.
OpenAI Embeddings Explained in 5 Minutes
5:12
Cooper Codes
Рет қаралды 16 М.
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
4. OpenAI GPT-3 - Prompt Engineering For Financial NLP
27:54
Part Time Larry
Рет қаралды 36 М.