Open Source Generative AI in Question-Answering (NLP) using Python

Рет қаралды 34,634

Күн бұрын

Generative question-answering focuses on the generation of multi-sentence answers to open-ended questions. It usually works by searching massive document stores for relevant information and then using it to generate answers synthetically. This tutorial demonstrates how to build a question-answering system using generative AI.
🌲 Pinecone example:
github.com/pinecone-io/exampl...
🤖 70% Discount on the NLP With Transformers in Python course:
bit.ly/nlp-transformers
🎉 Subscribe for Article and Video Updates!
/ subscribe
/ membership
👾 Discord:
/ discord
00:00 What is generative AI and Q&A?
01:02 Generative question-answering architecture
04:36 Getting code and prerequisites
05:06 Data preprocessing
07:41 Embedding and indexing text
13:50 BART text generation model
14:52 Querying with generative question-answering
17:45 Asking questions and getting results
21:29 Final notes

Пікірлер: 71

@adambenari3944 Жыл бұрын

Hey James, been following you for 8 months now. Really enjoy your videos man! Thanks for everything you do.

@jamesbriggs Жыл бұрын

that's awesome. Thanks for sticking around! 🙏

@shamaldesilva9533 Жыл бұрын

This is amazing , you are a hidden gem in the NLP education space 🤩🤩🤩😍😍😍

@temiwale88 Жыл бұрын

Shamal - seriously man! I agree! James is a blessing to us!

@jamesbriggs Жыл бұрын

Thanks both 🙏

@ygshkmr123 Жыл бұрын

You are amazing men. Please continue your advance NLP tutorials it's help a lot to learn.

@jamesbriggs Жыл бұрын

I will :)

@anujchourange1792 Жыл бұрын

Just what I needed! AMAZING work man🙌🏻 lots of love ❣️

@jamesbriggs Жыл бұрын

🙌

@ParijatSharma97 Жыл бұрын

Really Appreciate the hard work you do.

@kanakraj3198 Жыл бұрын

I like your videos, and I would love to see videos on training/finetuning the Generative Text Models.

@SudipBishwakarma Жыл бұрын

I've been going through lots of videos on KZbin regarding creating QA chat bots but gotta admit your videos are really concise and helpful.

@fgfanta Жыл бұрын

Crystal clear, thank you! At the end, it seems to me that the result is not very different from what one would get from an extractive model, except then the generator puts together an answer in English. Makes me wonder if it would be possible to take a generator instead (like BART or GPT 3.5) and fine-tune it to directly answer questions based on the corpus of text (a part of Wikipedia here), without going through the extraction phase first. Perhaps it would take to rent a datacenter for a couple weeks?

@ChocolateMilkCultLeader Жыл бұрын

If you can, I would love using one of the open source LLMs like Meta's OPT or Bloom to create chatbots (like ChatGPT). I think you're one of the best in the space, and would love to see how you do it

@jamesbriggs Жыл бұрын

planning on doing some videos with LLMs, both open source and not - they'll be coming soon :)

@Tiger-Tippu 7 ай бұрын

HI James,@4:07 is pinecone resposible to covert matched embedding back to text ?

@ADHDOCD Жыл бұрын

Woah! somebody that actually zooms into the text to make it visible. Your videos get 6 out of 5 stars for the attention to detail!

@kay_o Жыл бұрын

Underated channel, thanks for videos

@jamesbriggs Жыл бұрын

Thanks man

@mariuskamga3238 Жыл бұрын

Amazing tutorial ! in my use case I would want to do create a question answering system such that basically for each document I can query something and if the answer exist in the document( a kind of a text extraction model) the model give me it back I think it doesn't really fit what you show in the tutorial do you have any recommendations in terms of model or suggestions ?..

@ylazerson Жыл бұрын

You are thee best!

@henkhbit5748 Жыл бұрын

James, enjoyed very much this video. You are the only one which explains current advances in NLP topics in depth. I suppose u can replace wiki data with your own documents. But is it enough to have only an id and some context string or do I need to supply more metadata? btw; It would be awesome if this example can be extended similar to chatgpt. for example Q1 -> A1 but A1 is not correct and try with answer A2(for example by google Q1 and the results feed into generator) etc...I did search in the past for chatbot using RL(reinforced learning) but did not found any and see that chatgpt is using RL also....

@jamesbriggs Жыл бұрын

thanks a ton :) Yes you can replace with your own docs, and no all you need is an ID and context string, it can be useful to include some info on where the document comes from in the metadata but it isn't necessary Yes for sure, I'd love to see chatgpt references it's sources, and for more niche subjects (like recent NLP papers) it would help chatgpt answer questions accurately

@yuchentuan7011 Жыл бұрын

Thanks for the tutorial. Do you have code for fine-tuning this BART QA model?

@tbmakhriza6335 Жыл бұрын

This video details what's in the pinecone documentation, thanks for that. Chatgpt has the ability to do some statistical analysis as well as SWOT. Does this analytical capability exist in any particular pipeline, and can BERT do it? Thanks for your reply

@aayushsmarten Жыл бұрын

This tutorial was a GEM 💎Loved it. Actually for me, I have used Haystack to replicate the same. There I used the same LFQA technique with GenerativeQAPiepeline, I could observe that the answers that the model gave were more "complete". One more thing I would like to ask is, is it possible to give the structural data (tables) and also the unstructured data together as the context and let the model give answer from both of the datasets? Again, Here we would like to see the "generative response" from the tables as well, instead of the "extractive". Like, If I am feeding in the data of USA economy (past 5 years - CSV) + All news articles (for past month - Text), then asking question like: "How is the USA economy in past 6 months?" then instead of "extracting" the answer like "+6% GDP" it should generate answers from the data like: "It is perorming really good which is compared to last year at 4%..." etc. Can we do that? How? Please guide. Thank you.

@temiwale88 Жыл бұрын

Bro. This is incredible! I need this man. Questions: - I can probably use elasticsearch to get my contexts vs. Pinecone right? - If the generative model doesn't have a good answer, how do we retrieve the confidence score so we can trigger some other logic? - we can easily fine-tune the model or replace it with a fine-tuned GPT model right? Do you have resources for fine-tuning? - do you have better ideas of building a chatbot for a specific domain / corpus of Q&A tasks? Thanks and I'm sorry for all the questions!

@jamesbriggs Жыл бұрын

hey Elijah, glad it helps, for your questions: 1. Yes but you might need to use a sparse embedding model if using typical elasticsearch, so you will miss out on the semantic search part - alternatively you can use their kNN / ANN service, from what I've heard it's decent but doesn't scale to very big datasets (if you're in the *few millions*, it shouldn't be a problem) 2. As far as I know there isn't a "confidence score" output by the generative model, unless it's possible to extract the confidence for the token predictions (I imagine this is possible, but I haven't tried) -- as an alternative you could try calculating the semantic similarity using the retriever or some other QA retriever model and using that as a confidence score 3. Yes you can, I actually did something like this with GPT-3 here kzbin.info/www/bejne/maDEkoaurthoqdE - I don't have anything on fine-tuning GPT-like models 4. I think the video linked above is the best I've ever seen for Q&A, other than ChatGPT. If you wanted to adapt it as a chatbot you could initialize the input with something like "you are a chatbot" and preceed each user input with "user: " and each answer with "chatbot: " and with each step in the conversation you input the previous steps I hope that helps!

@temiwale88 Жыл бұрын

@@jamesbriggs helps a lot! Thanks James! Do you have something on pinecone+elasticsearch architecture? I have an app that we've decided to use elasticsearch but need semantic search capability and will scale to at least billions.

@jamesbriggs Жыл бұрын

There's an upcoming article covering indexing with elasticsearch to pinecone - I'll share that when ready If you're looking to merge sparse + dense (semantic) search, there is an hybrid search feature in private preview at Pinecone - or you can use elasticsearch as the sparse index, pinecone as the dense index, query both and then combine result scores for each (shared) record to give the final records. I think hybrid search with elasticsearch+pinecone is a good use-case so I could look into doing something on it

@temiwale88 Жыл бұрын

@@jamesbriggs thanks again James! I'll look forward to that article. I think we can do dense vectors now in elasticsearch with knn search. You can bring in a sentence-transformer model from huggingface per their docs. Thanks again man! Keep bringing out bangers!

@FatimaHABIB-jm4ji Жыл бұрын

Thanks a lot, such a great video. I am working on French dataset and I want to know if there exist a French model trained on a French corpus and tested on a QA task

@Truizify Жыл бұрын

James, thanks for your awesome videos. In the future, it would be great if you used something other than pinecone (maybe something open-source, like milvus) for the vector database, just so we can see other options!

@jamesbriggs Жыл бұрын

Thanks for watching! Sometime soon I'll likely cover Elasticsearch and Faiss some more. I work with Pinecone so that will remain the main tool, but may do some Milvus/Weaviate at some point :)

@Truizify Жыл бұрын

@@jamesbriggs awesome, thanks for the quick reply! Will look forward to it. Would you recommend any of those options to someone looking to maintain their own vector database?

@jamesbriggs Жыл бұрын

I've only used Elasticsearch and Faiss, naturally neither of those are fully packaged as vector databases (elastic search is but the ANN search is not ideal). For the others, I haven't used them extensively so I couldn't say, Milvus and Weaviate are on a pretty similar level as far as I know. I think Milvus might be a little behind Weaviate in development of features like hybrid search - but I'm not sure

@sarahkamraoui4744 Жыл бұрын

hello @James Briggs can this be done using gpt-2 instead of bart ?

@akrambasha9297 Жыл бұрын

I have a Rule/Acts which are separated into different columns as Rule Number,Rule Discription,Subrules etc. How to train them in NLP

@loading757 10 ай бұрын

sir please reply, The error message you're encountering is related to resource quotas and limitations within the Pinecone service. Pinecone is a platform that allows you to build and deploy vector similarity search systems, and the error you're seeing indicates that you're trying to create an index that exceeds the allocated resource quota for your project. this is the error, since i cant afford pinecone pro, is there any ways to fix this?

@KayYesYouTuber Жыл бұрын

Very nice video. Can I use Elasticsearch dense vector instead of pinecone?

@jamesbriggs Жыл бұрын

yeah but I wouldn't recommend for larger datasets, it's slower, has less features, is less accurate, and more expensive - but if you're working with smaller datasets

@dre9957 Жыл бұрын

Waoh exactly what i neede for my Thesis project please How do I get the final model ready for production with flask API I hope my question is not too dumb😅

@dre9957 Жыл бұрын

@jamesbriggs

@lutfiikbalmajid Жыл бұрын

What notebook provider do you used?

@nitishkumarharsoor6079 Жыл бұрын

Any idea how i can fine tune my dataset to extract email signatures?

@pratik6447 Жыл бұрын

The model hallucinates if answer is not in context. Why can't it say, the answer is not in context?

@ahmedgames9335 9 ай бұрын

what if i want the model return None if no resutl or spesific word i can do that ??

@venkatesanr9455 Жыл бұрын

Thanks for the valuable video. The Bart model chosen whether it is under text generation of huggingface or other?

@jamesbriggs Жыл бұрын

Bart is one example we can use for text generation, in huggingface there are a few similar models like T5 and GPT-2 that could also be used. If you have the resources BLOOM could return better results too. Outside of Huggingface you can use OpenAI's GPT-3, I did another video on that here: kzbin.info/www/bejne/maDEkoaurthoqdE

@venkatesanr9455 Жыл бұрын

@@jamesbriggs Thanks for the replies. Actually, I hav tried text summarization using T5, Bart and pegasus. I hav learnt abstractive qa system from your video today. I hav only one doubt whether the response of generative answer will be good from the predefined context of retriever.

@jamesbriggs Жыл бұрын

If using large language models (LLMs) like GPT-3 or BLOOM the generated answers are very good, almost perfect most of the time. Using T5, Bart, and Pegasus you will still get good results but they will fail more frequently. Nonetheless, adding the context (as we do here) improves generated answers significantly.

@venkatesanr9455 Жыл бұрын

@@jamesbriggs Thanks for your kind replies.

@xspydazx 7 ай бұрын

I am sewrching for video on answer generation using hf model . Hopefully this is the one ..., I would like to train the model with a simular "sqaud styled dataset" as well as just a corpus of documents .. then when presented with a question to get an answer generated .. from the corpus (either exact as a specific span ? Ie quote) or a text generated answer (guess) perhaps based on probablity or simularity ? .. can the model be trained for two purposes ie , first to provide text generation , then fine tuned as a qa model ? Or vice versa ? So that my languge model can perform various functions in a single model instead of making many models for different purposes .. (ps: im a .net dev not a python) .. henxe still confusing (pytorch methods only ) + hugging face ? . Also is the GPT model a general all purpose model ?

@neel_aksh Жыл бұрын

Can we use gpt 2 for please reply fast?

@ylazerson Жыл бұрын

Would it make sense to use ChatGPT for the text generation model?

@jamesbriggs Жыл бұрын

Definitely would, it would probably result in a similar result to the GPT-3 video, but I assume better (considering the improvements in chatgpt)

@nikhilgjog Жыл бұрын

dont we need to truncate the text as the sentence transformer cannot look at very long text?

@jamesbriggs Жыл бұрын

the wiki dataset (as far as I know) contains mostly short sentences - but for those that are longer, as we used sentence-transformers that will automatically truncate anything over the max length of the model

@tushitdave9795 Жыл бұрын

Hey James, Thanks for sharing this video. Any information how to solve this error: ""MaxRetryError: HTTPSConnectionPool(host='controller.your_environment.pinecone.io', port=443): Max retries exceeded with url: /databases (Caused by NewConnectionError(': Failed to establish a new connection: [Errno -2] Name or service not known'))""

@jamesbriggs Жыл бұрын

Most likely it is the environment in your Pinecone.init call that is wrong, default for new projects is now “us-east1-gcp” but you can check for your in the Pinecone console next to your api key

@sharvaripatil970 10 ай бұрын

How can my chatbot respond based on previously asked questions?

@xspydazx 7 ай бұрын

Good question ! Should you pass the history in a text file as user(q) ,AI(answer) , context (prev message) .. and retrain a new model ? Ie fine-tune it with the new history ??