Пікірлер
@rChandan_Singh
@rChandan_Singh 21 сағат бұрын
There is no single method explained for non english corpus
@AIAgents-k1h
@AIAgents-k1h 2 күн бұрын
Good explanation notebook lm does the same functionality with based on rags
@ShainaD843
@ShainaD843 3 күн бұрын
This video is so intriguing!! I'm very new at trying to figure this out. Is there a way to link items based on a system id or name using several different SQL Server Tables? Connecting all items with 2 or 3 degrees of separation from that search number?
@AnaximandroAndrade
@AnaximandroAndrade 4 күн бұрын
Great explanation!
@cloudtech273
@cloudtech273 7 күн бұрын
RAG=Customized responses?
@Kookoobi.
@Kookoobi. 8 күн бұрын
What if say Charles was already there and they wanted to change the dates? @python-programming
@PranithaAnnajigari
@PranithaAnnajigari 8 күн бұрын
I couldn't find any such packages on internet or GPT ( as I wanted to understand more)
@LaLa-ww9su
@LaLa-ww9su 11 күн бұрын
I enjoyed the 9 modules. I learned a lot!
@pierrebotha2823
@pierrebotha2823 11 күн бұрын
I'm building a RAG studio voice assistant.
@arjunp5840
@arjunp5840 13 күн бұрын
Thank you for making these videos. One of the best videos series I found.
@YuktaAshok
@YuktaAshok 13 күн бұрын
Can you please share the github repo
@ASMAEELAANOUNI
@ASMAEELAANOUNI 14 күн бұрын
your steps are all clear, thank you for this simplicity
@ManuelUgarte-p9h
@ManuelUgarte-p9h 14 күн бұрын
Nice
@ManuelUgarte-p9h
@ManuelUgarte-p9h 14 күн бұрын
Nice.
@Daniel-fl4si
@Daniel-fl4si 14 күн бұрын
A dumb question, this model works only for english texts, right?
@Bartyron
@Bartyron 16 күн бұрын
I got an error with finding tesseract in the command prompt because it was not run in administrator mode.
@hamzasarwar2656
@hamzasarwar2656 17 күн бұрын
I guess we cannot append the json file as it contains single object only. The above method will fail as the file gets larger and then we won't be able to load the large file in python memory.
@HamzaMahmood1261
@HamzaMahmood1261 17 күн бұрын
thank you for being honest that this is not enough this is just a start point
@ishaanthtm
@ishaanthtm 19 күн бұрын
Kernel dies on windows 11 while running get_similar_images method
@mkazigwa3783
@mkazigwa3783 19 күн бұрын
@thedevmachine
@thedevmachine 19 күн бұрын
The documents are not not sent to the LLM. Only the results of your query.
@jannik93
@jannik93 16 күн бұрын
Can you explain how the database and its documents are searched for this query information?
@thedevmachine
@thedevmachine 16 күн бұрын
@@jannik93 from the document chucks there is embeddings created and stored in the vector database. When you ask a question this question is also converted as embedding with that the cevtor database is searched for matching results with distance value and returns the text from the documents from the database and passed to the LLM with question and answer and how the LLM should handle it.
@jannik93
@jannik93 16 күн бұрын
@@thedevmachine thank you! For understanding: You mean "document chunks"? Are the embeddings created by the retrieval model or the ai?
@thedevmachine
@thedevmachine 16 күн бұрын
@@jannik93 you read the document as text create chunks from it then from each chunk you create embeddings and store it on the vector database. To use to ask a question that question is also conveted to embedding so you do search and you get result or results depending on your question the result contains the relevant chunks as text and you pass the original question and the answers with a prompt to a LLM to generate a natural response.
@jannik93
@jannik93 16 күн бұрын
@@thedevmachine I think I got it now. Thank you very much!
@waelhussein4606
@waelhussein4606 21 күн бұрын
Great videos, thanks! It’s important to understand that IDF reduces the weight of common words that frequently appear in most documents within the corpus, as these words contribute little to document classification. Conversely, it highlights less common words, making them more important for distinguishing the documents in which they appear.
@waelhussein4606
@waelhussein4606 21 күн бұрын
Thanks for the videos. I love your energy and clarity. I would love to learn more about how you determined the keywords for each cluster.
@Tinytots199
@Tinytots199 25 күн бұрын
I can't remove now??
@paulkabiito8355
@paulkabiito8355 27 күн бұрын
Beautiful, more of these please
@TheMrWARLORD
@TheMrWARLORD 28 күн бұрын
How is the vector database generated ?
@beautifulmind684
@beautifulmind684 29 күн бұрын
mark
@ysyvon
@ysyvon Ай бұрын
Thank you for this video series! I was a Computer Science and Mathematics Major, but realised that the pure science wasn't really my passion, so I switched my major to History, but I still love computer science and especially Python, so I am still getting my minor in Comp Sci and Linguistics. The Digital Humanities is really right up my alley, having this as a resource is so helpful!
@lyhs0219
@lyhs0219 Ай бұрын
Actually RAG consists of 2 parts, you presented the query part, but the other part is about breaking down the document into chunks and also generate vector embeddings from them
@traveling-historian
@traveling-historian Ай бұрын
I mentioned those aspects. I didn't cover chunking because of the issue of timing in a short (60 seconds). Chunking is a pre-step in RAG systems and not an essential one. Some documents don't require chunking, such as Tweets for example. I did mention vector databases.
@talielnussen3867
@talielnussen3867 Ай бұрын
I legit thought this was shells on sand at the beach
@niflag
@niflag Ай бұрын
numpy has a different linear algebra library by default on conda vs pip. Also, downloading a wheel file is not a 'tedious task;' compiling a wheel is.
@SteveGiome
@SteveGiome Ай бұрын
the share.streamlit is shown "You do not have access to this app or it does not exist". is it pay-only now? or just temporary disable.
@niflag
@niflag Ай бұрын
So quiet
@icns01
@icns01 Ай бұрын
A superb explanation...thank you.
@Fun-jb2yn
@Fun-jb2yn Ай бұрын
hey can u pls make a OCR for extracting data from a blood report pls pls
@markomarjanovic8348
@markomarjanovic8348 Ай бұрын
You are a fantastic teacher!! Love the way you conceptualized this and the way you explain things!
@AoyamaIchome
@AoyamaIchome Ай бұрын
French revolution happened in 1789
@HesamFarsiDehnavi
@HesamFarsiDehnavi Ай бұрын
Amazing
@rikii7237
@rikii7237 Ай бұрын
fifty_points to gryffindor
@rikii7237
@rikii7237 Ай бұрын
Thank you SO MUCH for keeping these videos short and straight to the point.
@kishork2211
@kishork2211 Ай бұрын
A very clear & simple explanation about RAG. Now another one question is hen you want to use this in real business case , how do you consider business data feed into it .
@rubo111
@rubo111 Ай бұрын
Concise.
@EranM
@EranM Ай бұрын
then came gpt and the rise of LLMS pissing on those models...
@jcwade
@jcwade Ай бұрын
dont combine unneeded tech in a video about another one, hold control variables constant and simple.
@saranshtiwari8543
@saranshtiwari8543 Ай бұрын
Hey KZbin, Why am I seeing this after 2 years? Recommend videos like these as soon as they get uploaded!
@uswakhan3050
@uswakhan3050 Ай бұрын
how to apply ocr on different language text
@grgr1467
@grgr1467 Ай бұрын
hi ! where can i find the source file you used?
@geetharagiphani6670
@geetharagiphani6670 Ай бұрын
Could you please share the text file that that’s been used in this video?
@JennaHasm
@JennaHasm 2 ай бұрын
I LOLed so much at your channel's description.
@johnyoung8848
@johnyoung8848 2 ай бұрын
in the previous video trying to run the model The kernel appears to have died. It will restart automatically. -- running py3.10.2 - installed via conda latest spacy etc etc.. any ideas