If you want to learn RAG Beyond Basics, checkout this course: prompt-s-site.thinkific.com/courses/rag
@jfbaro24 ай бұрын
Does it cover how to minimize (or even eliminate) hallucinations, and that the result would ALWAYS consider the content added into the RAG "database"?
@rubencabrera85194 ай бұрын
This is the best AI channel out there, PERIOD. Thanks for sharing your knowledge
@aerotheory6 ай бұрын
Keep going with this approach, it is something I have been struggling with.
@waju32346 ай бұрын
Me too. For my case, the answer is normally hidden behind the data, context and the images.
@ilaydelrey31226 ай бұрын
a nice open source and self hosted version would be great
@AI-Teamone6 ай бұрын
Such an insightful information, Eagerly waiting for more multimodel approches.
@tasfiulhedayet6 ай бұрын
We need more videos on this topic
@Techn0man1ac6 ай бұрын
What about make same, but using LLAMA3 or less local LLM?
@b.lem.24993 ай бұрын
Thanks, is there a video of the same project, but with langchain instead of llama index?
@ScottzPlaylists5 ай бұрын
Need to do it all in open source. No API Keys.
@BarryMarkGee5 ай бұрын
Out of interest what is the application called that you used to illustrate the flows? (2:53 in the video) thanks.
@engineerprompt5 ай бұрын
I am using mermaid code for this.
@BarryMarkGee5 ай бұрын
@@engineerprompt thanks. Great video btw 👍🏻
@legendchdou95786 ай бұрын
Very nice video but if you can do it with open source embedding model it would be very cool. thank you for the video
@AyishaAshraf-s2f3 ай бұрын
Use case is to extract the relevant text information along with images available in the file using generative ai, When any prompt is given then relevant text information and image should display as response.
@ai-touch96 ай бұрын
I appreciate your effort. Pl create one to fine tune the model for efficient retrieval if possible, with lang chain.
@ArdeniusYT6 ай бұрын
Hi your videos are very helpful thank you
@engineerprompt6 ай бұрын
Glad you like them!
@vinayakaholla6 ай бұрын
Can you pls dive deeper into why qdrant was used and other vector dbs limitations to store both text and image embeddings, thx
@engineerprompt6 ай бұрын
will see if I can create a video on it.
@RolandoLopezNieto6 ай бұрын
Lots of good info, thanks
@avinashnair50644 ай бұрын
can you make it using comeplete open source models?
@Makkar-b3vАй бұрын
Great stuff.
@RedCloudServices5 ай бұрын
do you think all of this is now replaced with Gemini ?
@BACA016 ай бұрын
Thanks your videos are very helpful. I have several Gigs of pdf ebooks that i would like to process with RAG. What do you think what approach would be the best, this or a graphrag. In my case i'm looking only for local models as the costs would be very high. What if to convert all pdf pages into images first and then process them with local model like phi 3 vision and then process it with Graphrag, would it work out?
@mohsenghafari76526 ай бұрын
it's great job! Thanks
@engineerprompt6 ай бұрын
thanks :)
@codelucky6 ай бұрын
Is it better than GraphRAG? How does the output quality compare to it?
@engineerprompt6 ай бұрын
You could potentially create a graphRAG on top of it.
@JNET_Reloaded6 ай бұрын
wheres the code used?
@ignaciopincheira236 ай бұрын
It is essential to conduct a thorough preprocessing of the documents before entering them into the RAG. This involves extracting the text, tables, and images, and processing the latter through a vision module. Additionally, it is crucial to maintain content coherence by ensuring that references to tables and images are correctly preserved in the text. Only after this processing should the documents be entered into a LLM.
@engineerprompt6 ай бұрын
agree!
@jtjames796 ай бұрын
That's a lot of work. Can an AI do this?
@engineerprompt6 ай бұрын
@@jtjames79 Yup :)
@erdi749Ай бұрын
I think that is the major point of ColPali. Regardless of its content, each PDF page is an taken as an image. Thus, there is no need for OCR, Layout etc. For sure, it has some limitations(i.e for complex queries, multiple pages will be retrieved with high scores, this may quickly overwhelm the context window of downstream generation task ) but based on my experience, ColPali based RAG(pick a vison LM say Qwen-vl) works great.
@amanharis18456 ай бұрын
Can we do this method using Langchain ?
@engineerprompt6 ай бұрын
Yes, will be creating a video on it.
@cristiantironi2962 ай бұрын
What if the user query contain text + image?
@engineerprompt2 ай бұрын
You can you a VLM to generate description of the images and send that as part of the text query
@cristiantironi2962 ай бұрын
@@engineerprompt yeah as i was expected, but what if i pass an image that VLM doesn't understand, for example personal image not available online, i should first fine tune the VLM on my images then do what u said right?
@garfield5846 ай бұрын
Thanks
@RickySupriyadi6 ай бұрын
I except image generation will be have another kind of breed... image gen based on image understanding based on facts
@redbaron35556 ай бұрын
This approach is not good enough to add value. The pictures and text needs to be referenced and linked in both vector stores to create better similarities.