Say Goodbye to RAG! Build your own Streamlit App powered by Gemini 1.5 Pro's Multimodality 🚀

  2,897 views

Avi Patel

Days ago

Comments
@aricoleman5802 3 months ago
Great video. Keep up the good work.
@AviPatel68 3 months ago
Thank you! Appreciate your comment and encouragement.
@saeedzf 3 months ago
Great video, please make more, thanks.
@AviPatel68 3 months ago
Thanks, will do!
@yvessoke8558 2 months ago
Thanks.
@paolosorgente7097 2 months ago
Great videos. Google has never given me confidence for things like creating agents that evaluate documents, something Claude does very well. Months ago you made a video on this use case. Now, with Gemini, could it be better than Claude? What do you say?
@AviPatel68 1 month ago
I do believe Google DeepMind (Gemini) has the edge. Over the last several months, they have started to leverage some of their own other great work, such as the Mixture of Experts (MoE) architecture. Personally, I think that as long as Google has Demis Hassabis, they will continue to improve and do this the right way. My opinion.
@ruidinis75 3 months ago
That's an awesome job. I do have a question: when I load 3 documents and ask something, it tends to talk about just 1 document, the first or the last. Do you know why? They are different documents.
@AviPatel68 3 months ago
Assuming you are using the non-VertexAI code, it's basically taking your 3 docs and stitching them together:

    if uploaded_files:
        text = ""
        for pdf in uploaded_files:
            pdf_reader = PdfReader(pdf)
            for page in pdf_reader.pages:
                text += page.extract_text()

As good as Gemini 1.5 is, given how these docs are stitched, it may provide a response from only 2 of the 3 reports. Try prompting like: "What is said about the economy in each of the reports? Break it down by each report." Providing the instruction to break it down by each report works for me. Also perhaps try "break it down by each of the 3 reports". Hope this helps.
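One possible variation on the stitching described above, offered only as a sketch and not as the code from the video: label each document before joining, so the prompt "break it down by each report" has visible boundaries to refer to. The helper name `stitch_with_labels` and the marker format are hypothetical, and whether this improves Gemini's answers is an untested assumption. The texts are passed in as plain strings, standing in for what `page.extract_text()` returns.

```python
from typing import Iterable, Tuple


def stitch_with_labels(named_texts: Iterable[Tuple[str, str]]) -> str:
    """Join extracted document texts, marking where each document starts and ends.

    Hypothetical helper: the marker format is an arbitrary choice for illustration.
    """
    sections = []
    for index, (name, text) in enumerate(named_texts, start=1):
        sections.append(
            f"=== START OF DOCUMENT {index}: {name} ===\n"
            f"{text.strip()}\n"
            f"=== END OF DOCUMENT {index}: {name} ==="
        )
    # A blank line between sections keeps the boundaries easy to spot.
    return "\n\n".join(sections)


# Placeholder strings stand in for text extracted from each uploaded PDF.
combined = stitch_with_labels([
    ("report_a.pdf", "The economy grew."),
    ("report_b.pdf", "Inflation slowed."),
])
print(combined)
```

In the Streamlit app, the per-file text would presumably be built by looping over `pdf_reader.pages` for each upload and pairing it with the upload's file name before calling the helper.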
@insoo34 3 months ago
Great video, thanks. Can I ask a question about an error? Are path2 and path3 the same directory? I set path2 and path3 to my home directory. When I upload a PDF in path2 there is no error, but when I upload an image in path3 I get an error saying there is no image loaded. FileNotFoundError : [Errno 2] Why is this?
@basese83 3 months ago
me 2
@ruidinis75 3 months ago
You just need to create a directory, have the app copy the image into it, and then pass that directory. It works just fine.
@AviPatel68 3 months ago
This was a workaround I did to overcome Streamlit's file uploader. fpath is just the file name extracted when you upload the file or image. fpath2 is the directory where the images are located, which you provide. fpath3 just puts fpath2 and fpath together. If the image I upload is bigcat.jpg, it's assigned to fpath. If the directory where this image is located is /users/johndoe/images, then fpath2 is equal to this directory. fpath3 then puts these 2 things together, so fpath3 would be /users/johndoe/images/bigcat.jpg. I know it's confusing. If someone (and I know someone will) comes up with a more direct way to do it, please let us know. Hope this helps.
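A sketch of the "copy the image into a directory" idea mentioned in this thread, under stated assumptions: instead of requiring the upload to already exist in a known folder, the uploaded bytes are written to a directory the app controls, and the resulting path plays the role of fpath3. The function name `save_upload` is hypothetical and this is not the code from the video.

```python
import tempfile
from pathlib import Path


def save_upload(file_name: str, data: bytes, target_dir: str) -> Path:
    """Write uploaded bytes into target_dir and return the full path.

    Hypothetical helper. Only the final component of file_name is used,
    so a name such as "a/b/cat.png" cannot escape target_dir.
    """
    directory = Path(target_dir)
    directory.mkdir(parents=True, exist_ok=True)
    destination = directory / Path(file_name).name  # directory + file name
    destination.write_bytes(data)
    return destination


# Demonstration with fake bytes and a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    saved = save_upload("bigcat.jpg", b"fake image bytes", tmp)
    print(saved.name, saved.exists())
```

In a Streamlit app, the object returned by `st.file_uploader` exposes `.name` and `.getvalue()`, so a call might look like `save_upload(uploaded.name, uploaded.getvalue(), "uploads")`, where the `uploads` folder name is an assumption.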
@ruidinis75 3 months ago
Can we have the typewriter effect? I had to create a function for that because the response is not a generator or stream type.
@AviPatel68 3 months ago
Set stream=True when calling for the response. Something like: response = model.generate_content([prompt], generation_config=generation_config, stream=True). Hope this helps.
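A minimal sketch of how a streamed response could feed a typewriter effect, assuming each streamed chunk exposes a `.text` attribute (as the Gemini SDK chunks do) and that the Streamlit version in use provides `st.write_stream`. The helper name `iter_chunk_text` is hypothetical. Fake chunks are used here so the example runs without an API key.

```python
from types import SimpleNamespace
from typing import Any, Iterable, Iterator


def iter_chunk_text(chunks: Iterable[Any]) -> Iterator[str]:
    """Yield the text of each streamed chunk, skipping chunks without usable text."""
    for chunk in chunks:
        try:
            text = chunk.text
        except (AttributeError, ValueError):
            # Some chunks may carry no text part; skip them instead of failing.
            continue
        if text:
            yield text


# Fake chunks standing in for a response created with stream=True.
fake_response = [
    SimpleNamespace(text="Hel"),
    SimpleNamespace(text=""),
    SimpleNamespace(text="lo"),
]
print("".join(iter_chunk_text(fake_response)))

# In the Streamlit app (not run here), the generator could be passed along as:
#   st.write_stream(iter_chunk_text(response))
```

Yielding plain strings keeps the helper independent of the SDK, so the same generator works whether the response comes from the VertexAI or the non-VertexAI code path.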
@Emily-k7r 2 days ago
Isn't this a kind of RAG?