Multimodal RAG with Qwen-2 and ColPali: Ask Questions from Images 🔥

  Рет қаралды 3,096

AI Anytime

19 күн бұрын

In this tutorial, I demonstrate how to use Qwen-2-VL-7B Instruct and ColPali for building a multimodal RAG engine. You'll learn how to process a PDF containing images and ask questions about those images. I also walk you through the indexing process using ColPali, making document retrieval easy and efficient. All the coding is done in Colab for ease of use. 😊
Don't forget to like, comment, and subscribe for more tutorials! 🔥📚
GitHub: github.com/AIAnytime/MultiModal-RAG-using-Qwen-2-VL-and-Colpali
Colpali GitHub: github.com/illuin-tech/colpali?tab=readme-ov-file
Byalidi GitHub: github.com/AnswerDotAI/byaldi
Qwen2 VL: huggingface.co/Qwen/Qwen2-VL-7B-Instruct
Join this channel to get access to perks:
kzbin.info/door/-zVytOQB62OwMhKRi0TDvgjoin
To further support the channel, you can contribute via the following methods:
Bitcoin Address: 32zhmo5T9jvu8gJDGW3LTuKBM1KPMHoCsW
UPI: sonu1000raw@ybl
#qwen2vl #multimodal #rag #ai

Пікірлер: 11
@gerhardheinzerling9880
@gerhardheinzerling9880 15 күн бұрын
Thank you so much for the video. Just great! We have got PDFs with vector graphics in it. So we can just simple get the images from the PDF. Any idea?
@samketola919
@samketola919 18 күн бұрын
How can we extract images along with their figure captions from a PDF?
@mahajanvinod97
@mahajanvinod97 18 күн бұрын
I’m encountering an issue where, when I ask a question, the system immediately searches the document for a solution. How can I prevent this? I want the LLM to first fully understand the problem before searching for an answer in the document. Could you please help me with this?
@SnehaRoy-xf3zv
@SnehaRoy-xf3zv 18 күн бұрын
Interesting project
@IsmailIfakir
@IsmailIfakir 17 күн бұрын
is there any multimodal llm can fine-tuning for sentiment analysis
@mayukhbanerjee1147
@mayukhbanerjee1147 19 күн бұрын
Wher from can I read about the architecture of RAGs ?
@RedCloudServices
@RedCloudServices 15 күн бұрын
Can you make a video creating a chatbot with this method?
@Jogipraveen
@Jogipraveen 17 күн бұрын
I am getting image with some other text , how can we get exact image only
@proudestberozgaar
@proudestberozgaar 18 күн бұрын
Cant we send multiple images in a single prompt to qwen?
@Innovative_2001
@Innovative_2001 18 күн бұрын
You try, let others also know
@proudestberozgaar
@proudestberozgaar 18 күн бұрын
@@Innovative_2001 we can