If you are interested in leanring more about Advanced RAG Course, signup here: tally.so/r/3y9bb0
@qwertyntarantino19379 ай бұрын
that's definitely a hot topic
@JosephCardwell9 ай бұрын
by 51 seconds we have the most direct explanation of embedding on youtube.
@hl2368 ай бұрын
Thanks for this. There is a lot of obsession over LLMs but I RAG has huge room for innovation that will multiply the performance of ai applications.
@engineerprompt8 ай бұрын
I agree, I am personally really interested in RAG and see that as the main application that will assist people in their workflows before we see anything else
@maxlgemeinderat92029 ай бұрын
nice! Yes another video which uses this in langchain would be cool!
@hl2368 ай бұрын
Yes please!
@TeamDman9 ай бұрын
Thank you for the great walkthroughs and insights! RAGatouille interface looks great, can't wait to mess around with it
@engineerprompt9 ай бұрын
thanks, have fun :)
@yusufersayyem72429 ай бұрын
Go Ahead Sir..... ❤
@engineerprompt9 ай бұрын
thank you :)
@henkhbit57489 ай бұрын
Thanks, would like to see a combination of colbert and langchain optimal chunking method.
@nirmalthacker85669 ай бұрын
me too please
@mowlanicabilla50028 ай бұрын
Thanks for the clear and concise explanation.! What metrics can be used to evaluate the output of these models.?
@PoGGiE067 ай бұрын
Super interesting. I want to use dspy with ragatouille/colbert2 for embedding and retrieval. I’d like to use llama index with a different vectordb, e.g. chromadb, pinecone, or qdrant. I want to use ollama with llama 3 to then summarise my retrieved rag data, and combine with some basic analysis of my own dataset. How feasible is that now? I assume that i can use dspy to finetrain on my specific analysis cases if necessary.
@youngchrisyang6 ай бұрын
Great content, thanks! Also curious what tool did you use to come up with such beautiful graphs on the "blackboard"
@engineerprompt6 ай бұрын
I use excalidraw.com
@borisrusev94749 ай бұрын
So what's the disadvantage of using CoBERTv2? Or are you saying it's strictly better?
@engineerprompt9 ай бұрын
At the moment, the number of vectors store supports are limited, I think only FAISS supports that. You will need a GPU to run this. In THEORY, it should perform better than dense retrieval but probably need better evals.
@LoveWorldamineK9 ай бұрын
yes please make the next video with RAG and integrate it and also please can you create for us a video tutorial demonstrating how to build a chatbot that inputs in XLS or CSV format, prompts the user for input, and provides charts as output. using OPENAI API
@utkarshtripathi91188 ай бұрын
Hii have you figured out solutions for this ??
@LoveWorldamineK8 ай бұрын
@@utkarshtripathi9118 Still m working on it
@sohelshaikhh8 ай бұрын
Nicely explained! also, wanted to know about time comparision between embedding retrievers and colBERT
@engineerprompt8 ай бұрын
From my experience, colBERT is usually faster.
@VenkatesanVenkat-fd4hg9 ай бұрын
Can you discuss newly pdf handling with tables & docx files parser....
@THE-AI_INSIDER9 ай бұрын
Please make a video on Rag with a UI where input is a file pdf or csv + Colbert behind the scenes
@engineerprompt9 ай бұрын
will do!
@shubhamvijayvargiya41198 ай бұрын
Please make a video on how to handle dynamic tabular data in pdf to feed in llm and query on tables data, as tables structure gets messed up when creating vectors.
@JMai-ci9nl8 ай бұрын
Thanks for the video and sharing, I can't seem to pass the loader.load_data("Orca_paper.pdf") line in the colab notebook. The load_data call complains about 'str' has no 'name' attribute.
@JMai-ci9nl8 ай бұрын
fixed, you need documents = loader.load_data(pathlib.Path("Orca_paper.pdf")), the load_data expects a Path object, not str.
@JMai-ci9nl8 ай бұрын
BTW, the load_data() method by default parses the pdf page by page into multiple documents, in case you are wondering like I do.
@VenkatesanVenkat-fd4hg9 ай бұрын
Can you discuss on tables in Pdf files for RAG & other .docx files loader as pdf parser but some os there......
@jaysonp94269 ай бұрын
Wait for the second example you used GPT4 for embeddings instead of ada? Did I miss something?
@engineerprompt9 ай бұрын
Its the tokenizer not the LLM. Probably can replace that with tiktoken package to get tokens.
@sanoussabarry42188 ай бұрын
Gread job !!
@dheerajsai2366 ай бұрын
Whenever I am doing Rag.search ,I am getting the name of the document in contents rather than answers for the query . how do I solve it ? Please kindly help
@AdamTwardoch8 ай бұрын
@engineerprompt Is there a reason why you design your videos so that they must be viewed on a large screen? The font used on the diagram slides is obviously completely unreadable on a phone.
@shameekm21469 ай бұрын
Thank you so much for this... :). I deal with large number of documents. I find dense retrieval is very bad at it. Let me check this approach and comment back.
@engineerprompt9 ай бұрын
Please do share your experience. Would love to see what you find.
@ShreyasVaishnav7 ай бұрын
How can we use this with Chroma ?
@Abdoana8 ай бұрын
So We can try this with local gpt?
@nicholasdudfield86109 ай бұрын
Nice!
@utkarshtripathi91188 ай бұрын
Please bring next video fast
@mohsenghafari76529 ай бұрын
hi. please help me. how to create custom model from many pdfs in Persian language? tank you.
@aghast6668 ай бұрын
As I dive into the world of storytelling and creative expression, VideoGPT emerged as my trusted ally, subtly enhancing the quality of my videos without stealing the spotlight.