Building a Document-based Question Answering System with LangChain, Pinecone, and LLMs like GPT-4.

Рет қаралды 61,338

Күн бұрын

Пікірлер: 172

@FutureSmartAI Жыл бұрын

Code: blog.futuresmart.ai/building-a-document-based-question-answering-system-with-langchain-pinecone-and-llms-like-gpt-4-and-chatgpt 📌 Hey everyone! Enjoying these NLP tutorials? Check out my other project, AI Demos, for quick 1-2 min AI tool demos! 🤖🚀 🔗 KZbin: www.youtube.com/@aidemos.futuresmart (Read more) We aim to educate and inform you about AI's incredible possibilities. Don't miss our AI Demos KZbin channel and website for amazing demos! 🌐 AI Demos Website: www.aidemos.com/ Subscribe to AI Demos and explore the future of AI with us!

@LaveshNK Жыл бұрын

Just an update, unstructured 0.6.1 does not support local-inference. For the import statements you have to add *!pip install unstructured[local-inference]* as well to load your documents. Thanks for the content!

@Yogic-ignition Жыл бұрын

for everyone getting error on: embeddings = OpenAIEmbeddings(model_name="ada") text= "Hello world" query_result = embeddings.embed_query(text) len(query_result) can go ahead and use: response = openai.Embedding.create( input="Hello world", model="text-embedding-ada-002" ) embeddings = response['data'][0]['embedding'] len(embeddings)

@vedantpandya9655 8 ай бұрын

its still not working bro

@extrememike Жыл бұрын

Good explanations. The flow diagrams really helps with he big picture. Thanks!

@FutureSmartAI Жыл бұрын

Glad it was helpful!

@entertainmentbuzz4934 5 ай бұрын

Very useful information in the video. Thanks.!

@dogtens1060 Жыл бұрын

nice tutorial, thank you Pradip!

@FutureSmartAI Жыл бұрын

My pleasure!

@kevon217 Жыл бұрын

This was a very illuminating demo. Appreciate it!

@port7421 Жыл бұрын

It was a very helpful presentation. Thanks and greetings from Poland.

@FutureSmartAI Жыл бұрын

You are welcome!

@SaiKiranAdusumilli Жыл бұрын

Fastest research on complete langchain and you provided the best notes ❤🎉

@FutureSmartAI Жыл бұрын

Glad you think so!

@FindMultiBagger Жыл бұрын

Crisp and up to the point !!! Great work need more tutorials on LLM

@FutureSmartAI Жыл бұрын

Thank You

@rotormeeeeeeee Жыл бұрын

This is first-class information. Thank you. I just subscribed!

@rishikapandit6780 Жыл бұрын

Sir in the code it is giving an error saying that pinecone does not have an attribute named ‘init’ how do we resolve this ?

@levius_24 9 ай бұрын

Hey Pradip, great video! Do you know if it's possible to automatically create a pinecone db index from code? So that you don't have to create them manually

@rayen1722 Жыл бұрын

Very useful video! Thank you :)

@FutureSmartAI Жыл бұрын

Glad it was helpful!

@zahramovahedinia1896 Жыл бұрын

This was awesome!!

@100p Жыл бұрын

Terrific work! Thank you :)

@ujjwalsrivastava6248 6 ай бұрын

Sir is it required to have open ai key or call openai library as i want q&a using the provided document only?

@ShadowD2C 7 ай бұрын

wouldve been helpful to have the docs included with the colab files to run some tests straight away

@thomasguillemard4873 Жыл бұрын

Clear explanations and great code! Thanks!

@Lakshita-z5i 9 ай бұрын

the embedding query part is not working I have tried other solution but even they give error 14:52

@FutureSmartAI 9 ай бұрын

Library has changed significantly. I am creating new video

@mohsinaliriad5278 Жыл бұрын

Looking for something similar and found the best one. Thanks @Pradip

@FutureSmartAI Жыл бұрын

Glad you liked it! Check other videos also.

@dianaliu7543 11 ай бұрын

This is great. How to deploy this question-answering on AWS?

@FutureSmartAI 11 ай бұрын

Check I have two video on How to deploy streamlit on aws ec2 And deploying ChatGPT + FastAPI on ec2

@snehitvaddi 9 ай бұрын

Hello! I’m working on creating an idiom dataset to fine-tune LLaMa2 for suggesting idioms based on different scenarios. I have a PDF full of idioms and I’m wondering if there’s a way to extract all the idioms using GPT or any other Large Language Model. Is there a cost-effective or free method to generate this dataset? Also, could you advise on how the data should be structured for fine-tuning the LLM? Should it be similar to a QnA format or something else?

@hitendrasingh01 Жыл бұрын

best content thankyou.

@FutureSmartAI Жыл бұрын

Glad you enjoyed it

@mohanvishe2889 8 ай бұрын

Easy To understand tutroial👍

@rhiteshkumarsingh4401 Жыл бұрын

do you recommend deleting the index in pinecone to avoid getting billed for it?

@FutureSmartAI Жыл бұрын

Yes, pinecone is very costly.

@rhiteshkumarsingh4401 Жыл бұрын

@@FutureSmartAI can we use chroma db instead? does it provide the same functionality as pinecone?

@FutureSmartAI Жыл бұрын

@@rhiteshkumarsingh4401 Yes we can use chroma and others also. I use pinecone because we can use it as API and it is cloud based

@LaveshNK Жыл бұрын

@@FutureSmartAI How different would it be if you used chromadb? could you make a video on that

@FutureSmartAI Жыл бұрын

@@LaveshNK It will be similar just import and vector db object changes all things remain same. Even lanhchain use chromadb as default

@shinycaroline3722 9 ай бұрын

Nice tutorial and well explained 👍 But suppose i want to create the embeddings once for all and then access it through its index when required then how could that be done? I came across this function Pincone.from_existing_index(), but when i tried it didn't work out. Not sure whether the issue is at the langchain end. Because creating the embeddings for each run is not the correct approach right?

@khari_baat Жыл бұрын

Thank you Dear.

@FutureSmartAI Жыл бұрын

You're most welcome

@islamicinterestofficial Жыл бұрын

It is something like private GPT? Is our document and questions are not going to openAI servers? Please answer this?

@FutureSmartAI Жыл бұрын

No its not private. we send context and question to open ai to get answer.

@islamicinterestofficial Жыл бұрын

@@FutureSmartAI So, what's the benefit of using openai embeddings and directly using their API in python. Will it not be the same way?

@islamicinterestofficial Жыл бұрын

@@FutureSmartAI And can you use make some tutorial to use pertained model for question answering with document offline. I don't want to use any third party API and want to build private solution

@FutureSmartAI Жыл бұрын

@@islamicinterestofficialNot sure what you are asking. Both open ai embeding and llm models are api as service thay you can use as rest api or python library. If you dont want to send your data you can use open source embeding and open source llm

@shahabuddin-pc8jr 9 ай бұрын

great work i love this ❤❤

@FutureSmartAI 9 ай бұрын

Glad you like it!

@akarshghale2932 Жыл бұрын

Hello! Can you please share whether it is possible to mention the namespace in langchain when using Pinecone loader?

@polly28-9 7 ай бұрын

Thanks for the video! Well Done! I want to know how to make the chatbot to return a list of results. Not only one result, but a list of relevant answer to the input question. I do not know what to change: search index, search parameters, metric_type or what? Can you help me? Thanks!

@FutureSmartAI 6 ай бұрын

Hi can you share scenario that requires multiple answer for single question?

@polly28-9 6 ай бұрын

@@FutureSmartAI Hi, we have a database of various customer inquiries and many of these inquiries are about the same issues. We have one topic in many inquiries from different customers. I want the chatbot, when I ask it about something, to return to me all the inquiries from the various customers on this topic. In other words, chatbot must return list of all inquiries about the same topic. How can I do this? Change search params ?Now I use this to search: vector_store = get_vector_store() retriever = vector_store.as_retriever( # search_type="mmr", search_type="similarity", search_kwargs={'k': 6, 'lambda_mult': 0.25} ) but I am not sure. Can you help me how to do that? Thanks!

@Ds12781 Жыл бұрын

Thank you foe sharing. You are using similarity search to retrieve relevant chunks, but will this provide all relevant documents? Some relevant documents may be missed and this can lead to inaccurate answers. There may be a use case where accuracy is needed without losing any information.

@FutureSmartAI Жыл бұрын

you can bring more chunks, there are different modes available for generating response. Eg. we can pass alll chunks to gpt and generetae answer , other way pass each chunk at a time and then generate and refine answer. Here is good documentation: python.langchain.com/docs/modules/chains/document/ I also talked abou this in my recent video: kzbin.info/www/bejne/a3-qaaCbm6qmebc

@nabinbhusalofficial 6 ай бұрын

Will it work for low resource language like Nepali? What should be taken care of in that case?

@FutureSmartAI 6 ай бұрын

You should check it else you can find any open source llm that finetune with more Nepali data. Here is list of languages it support but i coulnt find Nepali there help.openai.com/en/articles/8357869-how-to-change-your-language-setting-in-chatgpt

@borisguzmancaceres9105 Жыл бұрын

Love your videos, I have to do something like that on my job, can you help me? I have to do a chatbot trained by multiple documents, using templates, streamlit, openai etc.

@FutureSmartAI Жыл бұрын

Yes you can easily do with multiple docs. did you check new assistent api ? it has made things simpler now. kzbin.info/www/bejne/r6CToqxtrd6XaaMsi=x9WoNOifwwj2Yz6k

@saadkhattak7258 Жыл бұрын

Hi Pradip, Hope you are doing well :) I installed all the dependencies and was running the following cell, directory = '/content/data' def load_docs(directory): loader = DirectoryLoader(directory) documents = loader.load() return documents documents = load_docs(directory) len(documents) I got this error: ImportError: cannot import name 'is_directory' from 'PIL._util' (/usr/local/lib/python3.9/dist-packages/PIL/_util.py) Any idea how to resolve this? I have faced this error before as well.

@FutureSmartAI Жыл бұрын

Yes, I had faced this. If you restart runtime, it will resolve. github.com/obss/sahi/discussions/781

@saadkhattak7258 Жыл бұрын

@@FutureSmartAI Okay thanks for sharing :) Let me try UPDATE: It worked :)

@sauradeepdebnath437 Жыл бұрын

@@FutureSmartAI thanks it worked. had to change the version of PIL to 6.2.2 and then Restart the Run time

@satheeshthangaraj5614 Жыл бұрын

Hi Pradip, thanks for sharing, if we want to deploy this code in AWS as web app what changes we should done in this code.

@FutureSmartAI Жыл бұрын

You can integrate this code into the streamlit app and deploy it on ec2. check this: kzbin.info/www/bejne/jWjOdaqpjKudrKc and this: kzbin.info/www/bejne/b2GXlIpvoa9qgrM

@satheeshthangaraj5614 Жыл бұрын

Thank You

@satheeshthangaraj5614 Жыл бұрын

Can we use Django framework to build this ML app?

@FutureSmartAI Жыл бұрын

@@satheeshthangaraj5614 yes

@SeBa-mg3ms Жыл бұрын

Hi, Great tutorial ! Is it possibble for chatGPT to answear with a image that is in the PDF? Like if you ask him about some document where there will be a description of a tiger and a tiger image, can it answear with summary about the tiger and a image?

@FutureSmartAI Жыл бұрын

No. at this moment it only understand text

@FindMultiBagger Жыл бұрын

Subscribed !!! ♥️

@BREWRBlitzHotline-el1cj Жыл бұрын

Can we add scraping your own web pages to it easily?

@FutureSmartAI Жыл бұрын

Yes

@ambrosionguema9200 Жыл бұрын

Hi Pradip, excelent. but I can't extract source metadata. I dont understand.

@basavaakash8846 Жыл бұрын

how to calculat accuracy of the model

@sauradeepdebnath437 Жыл бұрын

Great video ! One question though---for the vector store --why did we use "ada" model, instead of say GPT 3.5 ---which we are using downstream as LLM anyway ?

@FutureSmartAI Жыл бұрын

we need to use embeding model and not completion model

@larawehbee Жыл бұрын

Interesting!! Thanks for sharing!! Can we use the Langchain with haystack ? and Elastic search instead of Pinecone ?

@FutureSmartAI Жыл бұрын

Yes Langchain supoort elastic search also. Hay stack you need to check

@ljfi3324 Жыл бұрын

Hello, greetings from México! 🇲🇽 I have a question, when the credits of GPT run out what other AI model do you recommend using? Another amazing vide

@FutureSmartAI Жыл бұрын

There are other open source options are available, but they are not that good. You should check and see whether it works for your use case. There are a few alternatives Alpaca Vicuna GPT4ALL Flan-UL2

@sahil0094 Жыл бұрын

Is this fine tuning of LLM or what? I understand we are using embeddings to create vectors of documents and using that as context. So context for llm here would be what? All documents vector or just 4096 tokens in case of gpt 3

@FutureSmartAI Жыл бұрын

No its finetuning. Here we are not asking model to remeber rather we are providing model knowledge realtime. It has many advantges: You can keep adding more data to knwoledge base and you dont need to worry fientuning again GPT 3.5 and 4 , are not availabe for finetuning yet

@axysharma2010 Жыл бұрын

How to show the document path alongwith get_answer(query) call without using print similar_docs?

@larawehbee Жыл бұрын

Thanks for the informative video. I would really appreciate your response for my questions . In the text splitter, it splits the document to chunks so let's say i have a pdf of 7 pages, so the 7 pages will be saved in differet splitted chunks in the vector db ?and another question, the answer will be from a specific chunk right, and not the whole document?

@FutureSmartAI Жыл бұрын

Yes those 7 pages will be split in different chunk based on chunk size and we will be maching user query against those chunks.

@larawehbee Жыл бұрын

@@FutureSmartAI Amazing thanks. Im using Sentence Transformers embeddings to avoid using OpenAI API, because i need on premises solution, but the results come so weak. Do you recommend a better transformer ? Do you think llamaEmbeddings might be a better fit and close to openai embeddings?

@Rider-jn6zh 9 ай бұрын

Hello brother, Can you please upload videos on how to evaluate llm model and which evaluation metrics can be used for specific usecase. As I am getting this question in every interview and not able to answer itt

@shahabuddin-pc8jr 9 ай бұрын

but how to connect these things with any mobile frame work like flutter from there we upload document and also pass query like conversation

@FutureSmartAI 9 ай бұрын

You will need to create api that your flutter app will use

@axysharma2010 Жыл бұрын

I am getting error AttributeError: 'tuple' object has no attribute 'page_content' when i run get_answer(query)

@HiteshGulati Жыл бұрын

Hi thanks for the detail video. I was able to follow your video and create a QnA chat bot. One place I am stuck is how can I reuse the embedings created earlier, is there a way to fetch already saved embedings from pinecone db into docsearch variable. Any suggestion would be helpful :)

@FutureSmartAI Жыл бұрын

Yes we can re use embedding. we have to use same index name. Did you check my other pinecone videos? kzbin.info/www/bejne/aoLEoJeMmbqHnJI

@HiteshGulati Жыл бұрын

@@FutureSmartAI Thanks I'll check this video.

@MohitKumar-gp6nr Жыл бұрын

I have some JSON files which I want to use for chatbot data source. How to store the JSON information in Croma DB using embedding and then retrieve it based on the user query. I googled a lot but did not find any answers.

@rinkugangishetty5713 10 ай бұрын

I have content of nearly 100 pages. Each page have nearly 4000 characters. What chunk size I can choose and what retrieval method I can you for optimised answers?

@FutureSmartAI 10 ай бұрын

It depends on what embeding you are using for Eg many sentence transformers embeding will only support up 512 tokens length and will ignore all other text after that. retrieval method: You should experiment but start with default

@rinkugangishetty5713 10 ай бұрын

@@FutureSmartAI I'm using embeddings type "Ada". I'm using MultiQueryReteiever.

@rohith646 Жыл бұрын

hey pradip can we download it as a model and build a UI for asking questions so that it looks like a chatbot

@FutureSmartAI Жыл бұрын

you can use this kzbin.info/www/bejne/pHKumauHaM2Wg6M

@manikantasurapathi92 Жыл бұрын

Hey, @pradip I'm using windows laptop and running my code in VS Code. I'm not able to use apt-get install poppler-utils. Can you please help

@FutureSmartAI Жыл бұрын

Hi we dont use ap-get install for windows. See if this helps towardsdatascience.com/poppler-on-windows-179af0e50150

@omkarmalpure3463 Жыл бұрын

which type of documents does the langchain support ? like excel , or pdf s etc ?

@FutureSmartAI Жыл бұрын

many types it supports python.langchain.com/docs/modules/data_connection/document_loaders/

@patrickhilpold7032 Жыл бұрын

Has someone turned this into a streamlit app and would share the github? Would really appreciate that!

@SageLewis Жыл бұрын

I agree. I would LOVE to see how to turn this into a Streamlit app.

@Gautamkumar-tk1xt Жыл бұрын

Why this error is popping on my window machine 'apt-get' is not recognized as an internal or external command, operable program or batch file.

@FutureSmartAI Жыл бұрын

Hi we dont use ap-get install for windows. See if this helps towardsdatascience.com/poppler-on-windows-179af0e50150

@nikk6489 Жыл бұрын

Nice Explanation and video. Few questions what will be the overhead cost of using OpenAI API Key and even Pinecone? Can you give some idea or thought on how we can create the QA System without using the OpenAI API key etc. Many thanks in advance. Cheers!!

@FutureSmartAI Жыл бұрын

Then we have to use open source alternatives. I am exploring some open source llms

@saswatmishra1256 Жыл бұрын

Can you make a video on how to make the chatbot using open source embedding like instruct and any open source llm. Btw great video ❤

@FutureSmartAI Жыл бұрын

I have video with open source embeding and open source vector db but not open source LLM. Will do.

@sunil_modi1 Жыл бұрын

Very Useful video for document question answering but as i use session state for storing chat conversation it fails giving correct answer. would you please provide some lecture or article to refer.

@FutureSmartAI Жыл бұрын

Did you check my recent video? Langchain and streamlit chat? there I have shown how to refine query to get correct answer.

@pradeept328 Жыл бұрын

Thanks for uploading. But if I ask any question that is not related to the indexed documents, then also it is generating the answer from its own world knowledge. How to prevent it?

@FutureSmartAI Жыл бұрын

You can restrict it. You can add "Don't use general knowledge from outside the context."

@VenkatesanVenkat-fd4hg Жыл бұрын

Thanks for video. How to get the page number also..

@FutureSmartAI Жыл бұрын

I haven't tried yet with langchain. Did you try processing multi page pdf?

@aanchalgupta2577 Жыл бұрын

Hi pradip, first of all nice video.Can you please let me know I have a data but 50-60 percent is labeled and rest unlabeled so is it possible to use this mechanism on that as I want to get the labels of unlabeled using similarity search.

@FutureSmartAI Жыл бұрын

you mean you want to take unlabeled example and use semantic search to find label of it? If yes then you can first index all examples for which you have labels. while inserting in pinecone you can add meta data to each vector. When you find matching examples from pincecone you will also get that metadata check my other pinecone videos. So if you take test example t and find top 5 matching examples from pinecone then get metadata of those 5 examples and take majority vote of those 5 labels and assign that label to test example t.

@aanchalgupta2577 Жыл бұрын

@@FutureSmartAI Thanks a lot.

@aanchalgupta2577 Жыл бұрын

I am able to see only one video ,this one based on pinecone.Can you please share the link of pinecone playlist

@FutureSmartAI Жыл бұрын

@@aanchalgupta2577 kzbin.info/www/bejne/mIiyp4KVmN1_ftk kzbin.info/www/bejne/aoLEoJeMmbqHnJI

@Venkatesh-vm4ll Жыл бұрын

What if we created like api, every time new file will be uploaded, we are storing the embedding in same index, this is right or any different approach, for every file upload whether we need to create new index?

@FutureSmartAI Жыл бұрын

you can insert into same index

@harinisri2962 Жыл бұрын

I tried conversationalbufferwindowmemory, My model is generating answers for out-of-context questions. How can I restrict that?

@FutureSmartAI Жыл бұрын

You can add custom prompt

@davidtowers7851 Жыл бұрын

First rate notes.

@shrutinathavani Жыл бұрын

does this answer based on sementics search

@FutureSmartAI Жыл бұрын

Yes

@shrutinathavani Жыл бұрын

@@FutureSmartAI ImportError: cannot import name 'is_directory' from 'PIL._util' this error even though i have upgraded packages of the libraries used

@kamalthej9794 Жыл бұрын

Hi i am unable to resolve this error. Can you please help me in this. !pip install pinecone ERROR: Could not find a version that satisfies the requirement pinecone (from versions: none) ERROR: No matching distribution found for pinecone

@FutureSmartAI Жыл бұрын

pip install pinecone-client

@learnforjannah7763 Жыл бұрын

directory = '/content/data' def load_docs(directory): loader = DirectoryLoader(directory) documents = loader.load() return documents documents = load_docs(directory) len(documents) this code are not work. plz solve this problem.

@FutureSmartAI Жыл бұрын

what is error?

@rekha388 Жыл бұрын

@@FutureSmartAI WARNING:langchain.embeddings.openai:Retrying langchain.embeddings.openai.embed_with_retry.._embed_with_retry in 4.0 seconds as it raised RateLimitError: You exceeded your current quota, please check your plan and billing details.. how to fix this

@nezubn Жыл бұрын

I have tabular data and the columns have Questions, Answers and User Queries. Now if a new user comes and asks a new question, I would like to match it with the existing set of questions --> answers. As a query can be asked in many different ways the context recognition will be really important. Can I pass it as csv as you passed a pdf in this current setup?

@FutureSmartAI Жыл бұрын

You can treat each row of csv as one chunk and you can insert it in pinecone. When users ask any questions you can retrive similar rows.

@SaveThatMoney411 Жыл бұрын

Trying to figure how to use this to write my academic review papers and research articles.

@FutureSmartAI Жыл бұрын

While writing if you need factual info from specific articles or sources you can use this.

@vishalsugandh 10 ай бұрын

Is it possible to use this for millions of documents?

@FutureSmartAI 10 ай бұрын

Yes

@sahil0094 Жыл бұрын

How to measure accuracy of output of LLMs?

@basavaakash8846 Жыл бұрын

same doubt

@gunngunn6763 Жыл бұрын

Hi...how can one give access to other people just like ChatGpt without running code everytime?

@FutureSmartAI Жыл бұрын

You can integrate this is streamlit and deploy it. check my other videos

@mattiabolognesi1787 Жыл бұрын

This is amazing man! Great content I would love to know how to expand the knowledge of this program also to the general one of chatGPT if it does not find the relevant information in the embedding, any suggestions?

@FutureSmartAI Жыл бұрын

You can focus on having better embeding and if you dont find answer in via semantic serach you can ask GPT to answer it from its own knowedge

@mdsohailahmed7936 Жыл бұрын

Sir how to use pinecone with json data

@FutureSmartAI Жыл бұрын

you should be able to calculate embeding for it.

@shrutinathavani Жыл бұрын

hello sir i am getting a validation error... could you please check ValidationError: 1 validation error for OpenAIEmbeddings model_name extra fields not permitted (type=value_error.extra)

@shrutinathavani Жыл бұрын

embeddings = OpenAIEmbeddings(model_name="ada") query_result = embeddings.embed_query("Hello world") len(query_result) for this cell!!

@xXswagXxbro Жыл бұрын

@@shrutinathavani removed the argument from OpenAIEmbeddings function embeddings = OpenAIEmbeddings()

@jessicajames8724 Жыл бұрын

Did you resolve this ? Even im facing the same error

@jessicajames8724 Жыл бұрын

@FutureSmartAI

@SaMuEaL0707 9 ай бұрын

@@shrutinathavani embeddings = OpenAIEmbeddings() query_result = embeddings.embed_query("Hello world") len(query_result) do this

@PedramAbrari Жыл бұрын

when I run the question_answering function and I use chain_type of map_reduce, I get the following error. ValueError: OpenAIChat currently only supports single prompt. if I use chain_type of stuff, I get the following error: InvalidRequestError: The model: `gpt-4` does not exist What am I doing wrong here?

@FutureSmartAI Жыл бұрын

if I use chain_type of stuff, I get the following error: InvalidRequestError: The model: `gpt-4` does not exist This means you dont have access to gpt4 model yet. chain_type of map_reduce ? It will make multiple request. you can read hear more about stuff vs map reduce. docs.langchain.com/docs/components/chains/index_related_chains

@rahuldinesh2840 9 ай бұрын

I think database like MySQL is better than vector DB.

@Anonymus_123 Жыл бұрын

How would I restrict the answers apart from the training data? Like if I type a query "What is Rasa Chatbot?" since I don't have it in my training data, I expect it to give an answer like "I don't know". But I am unable to do so. I am getting the answer which I should not. It would be glad for anyone who can solve my problem. Also I am getting an error like tesseract is not installed error whenever I am trying to load the documents

@FutureSmartAI Жыл бұрын

It restricts answers to only documents and say I don't know if its not present in document

@Anonymus_123 Жыл бұрын

@@FutureSmartAI Would you please give me code for that on how to restrict apart from what it is been trained.

@harinisri2962 Жыл бұрын

Hi, I too have the same question. In case if you have tried this, Can you pls confirm me if it restricts the irrelavent queries which are not present in the document?

@FutureSmartAI Жыл бұрын

@@harinisri2962 Prompt will have that logic github.com/hwchase17/langchain/blob/master/langchain/chains/question_answering/stuff_prompt.py prompt_template = """Use the following pieces of context to answer the question at the end. If you don't know the answer, just say that you don't know, don't try to make up an answer. {context} Question: {question} Helpful Answer:"""

@aislayer2866 Жыл бұрын

if you get this error when loading the data: "ImportError: cannot import name 'is_directory'" use pillow==9.5

@FutureSmartAI Жыл бұрын

I have impoved audio. check my recent videos

@moviespalace17 Жыл бұрын

embeddings = OpenAIEmbeddings(model_name="ada") query_result = embeddings.embed_query("Hello world") len(query_result) I'm facing the below error from the above block of code: ValidationError Traceback (most recent call last) in () ----> 1 embeddings = OpenAIEmbeddings(model_name="ada") 2 3 query_result = embeddings.embed_query("Hello world") 4 len(query_result) /usr/local/lib/python3.10/dist-packages/pydantic/main.cpython-310-x86_64-linux-gnu.so in pydantic.main.BaseModel.__init__() ValidationError: 2 validation errors for OpenAIEmbeddings model_name extra fields not permitted (type=value_error.extra) __root__ Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass `openai_api_key` as a named parameter. (type=value_error)

@FutureSmartAI Жыл бұрын

Did you add open ai key in as env variable? you can pass key as parameter also embeddings = OpenAIEmbeddings(openai_api_key="my-api-key")