Retrieval-Augmented Generation chatbot, part 2 - LangChain, Hugging Face, OpenSearch, AWS

6,332 views

Julien Simon

9 months ago

We'll walk you through the creation of a Retrieval-Augmented Generation (RAG) chatbot using open-source tools (LangChain, Hugging Face) and AWS services (Amazon SageMaker, Amazon OpenSearch Serverless).
Part 1: Retrieval-Augmented Ge... - LangChain, Hugging Face, FAISS, Amazon SageMaker, and Amazon Textract.
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.substack.com. ⭐️⭐️⭐️
We start by deploying Mistral 7B, a cutting-edge open-source LLM, onto a SageMaker endpoint. Following this, we work with the Reuters dataset, a Hugging Face dataset comprising 20,000 news articles. We break down these articles into smaller sections and apply bge-small, a compact open-source embedding model, to them.
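As a rough illustration of the chunking step, here is a minimal sketch of a fixed-size splitter with overlap. It is not the code from the notebook: the function name `split_text` and the `chunk_size` and `overlap` values are assumptions chosen for the example, and a LangChain text splitter would normally play this role.

```python
def split_text(text: str, chunk_size: int = 256, overlap: int = 32) -> list[str]:
    """Split text into chunks of at most chunk_size characters.

    Consecutive chunks share `overlap` characters so that a sentence cut at a
    chunk boundary still appears whole in at least one chunk.
    Hypothetical helper: names and default sizes are illustrative only.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


article = "Oil prices rose on Monday after producers agreed to cut output. " * 10
sections = split_text(article, chunk_size=128, overlap=16)
```

Each section would then be passed to the embedding model. Smaller chunks give more precise matches but carry less context into the prompt, so the sizes are worth tuning per dataset.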
Next, we proceed to index these sections into an Amazon OpenSearch Serverless vector index, which we then query through LangChain.
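To make the indexing and querying step more concrete, the sketch below builds the two JSON bodies a k-NN vector index typically needs: one to create the index and one to search it. The field names `vector_field` and `text`, the HNSW/Faiss method settings, and `k=3` are assumptions for illustration; in the video these details are handled by LangChain. The dimension of 384 matches the output size of the bge-small English model.

```python
import json

EMBEDDING_DIM = 384  # embedding size produced by bge-small (English)


def build_index_body(vector_field: str = "vector_field", dim: int = EMBEDDING_DIM) -> dict:
    """Return an index definition with one k-NN vector field and one text field.

    Assumed layout: field names and method settings are illustrative.
    """
    return {
        "settings": {"index": {"knn": True}},
        "mappings": {
            "properties": {
                vector_field: {
                    "type": "knn_vector",
                    "dimension": dim,
                    "method": {"name": "hnsw", "engine": "faiss", "space_type": "l2"},
                },
                "text": {"type": "text"},
            }
        },
    }


def build_knn_query(query_vector: list[float], k: int = 3,
                    vector_field: str = "vector_field") -> dict:
    """Return a search body asking for the k nearest chunks to query_vector."""
    if len(query_vector) == 0:
        raise ValueError("query_vector must not be empty")
    return {
        "size": k,
        "query": {"knn": {vector_field: {"vector": list(query_vector), "k": k}}},
    }


index_body = build_index_body()
query_body = build_knn_query([0.0] * EMBEDDING_DIM, k=3)
index_json = json.dumps(index_body, indent=2)
```

The text of the chunks returned by the query is what gets pasted into the LLM prompt as context.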
Beyond the RAG demonstration, we also cover some vital yet often overlooked steps related to authentication and security for OpenSearch Serverless.
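One of those security steps is granting a principal access to the collection through a data access policy. The sketch below assembles such a policy document in plain Python. The collection name `rag-demo`, the role ARN, and the helper name are hypothetical, and the permission list is deliberately broad for a demo; the resulting JSON string is the kind of value you would pass as the `policy` argument when creating an access policy of type `data`.

```python
import json


def build_data_access_policy(collection_name: str, principal_arns: list[str]) -> list[dict]:
    """Return a data access policy granting index and collection permissions.

    Hypothetical helper: tighten the permission lists for anything beyond a demo.
    """
    if not principal_arns:
        raise ValueError("at least one principal ARN is required")
    return [
        {
            "Description": f"Demo access to the {collection_name} collection",
            "Rules": [
                {
                    "ResourceType": "collection",
                    "Resource": [f"collection/{collection_name}"],
                    "Permission": [
                        "aoss:CreateCollectionItems",
                        "aoss:DescribeCollectionItems",
                        "aoss:UpdateCollectionItems",
                        "aoss:DeleteCollectionItems",
                    ],
                },
                {
                    "ResourceType": "index",
                    "Resource": [f"index/{collection_name}/*"],
                    "Permission": [
                        "aoss:CreateIndex",
                        "aoss:DescribeIndex",
                        "aoss:UpdateIndex",
                        "aoss:DeleteIndex",
                        "aoss:ReadDocument",
                        "aoss:WriteDocument",
                    ],
                },
            ],
            "Principal": list(principal_arns),
        }
    ]


# Placeholder account ID and role name, for illustration only.
policy = build_data_access_policy(
    "rag-demo", ["arn:aws:iam::123456789012:role/example-notebook-role"]
)
policy_json = json.dumps(policy)
```

Note that requests to an OpenSearch Serverless endpoint are signed with SigV4 using the service name `aoss`, and that the collection also needs encryption and network policies before it can be used.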
- Notebook: gitlab.com/juliensimon/huggin...
- LangChain: www.langchain.com/
- Amazon OpenSearch Serverless: docs.aws.amazon.com/opensearc...
- Embedding leaderboard: huggingface.co/spaces/mteb/le...
- Embedding model: huggingface.co/BAAI/bge-small...
- LLM: huggingface.co/mistralai/Mist...

Comments: 18
@pfunnell 9 months ago
this is great, my son and I have both been working on something similar, each for different use cases, this is going to help both of us, salut!
@juliensimonfr 9 months ago
Glad I could help!
@mtin79 9 months ago
Merci beaucoup! Very helpful 👍🏻
@juliensimonfr 9 months ago
You're welcome!
@Martyniqo 1 month ago
Thanks a lot!
@juliensimonfr 1 month ago
You're welcome!
@TheMrGoodkind 8 months ago
This is really great! Thank you! If I want to add this RAG-augmented chatbot to my personal website, how would I do that?
@WagnerHeleno 7 months ago
Hi Julien, your video is excellent. I have a question: with this solution (using the OpenSearch service), is it possible to deploy through the Lambda service too?
@juliensimonfr 6 months ago
Hi, serverless inference on AWS is interesting, but no GPUs...
@XShollaj 9 months ago
Thank you Julien! Will there be a tutorial deploying this in a front end chat interface ?
@juliensimonfr 9 months ago
no, I couldn't write UI code to save my life ;) Gradio has a chatbot interface, this would probably be a good place to start www.gradio.app/docs/chatbot
@XShollaj 9 months ago
@juliensimonfr Thank you! Highest standards for tutorials as always!
@Ben-gp5ty 3 months ago
Julien, if a document in S3 is deleted, I want to trigger a Lambda to delete the chunks and embeddings in OpenSearch belonging to this document. How do I do so?
@juliensimonfr 3 months ago
Each chunk should have metadata on the source document, which you could use to query and delete.
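A minimal sketch of that idea, under stated assumptions: each chunk was indexed with its source URI in a `metadata.source` field (queried here through a `.keyword` sub-field), and the Lambda first searches for matching chunk IDs, then sends a bulk delete. The helper names and the index name `reuters` are hypothetical; only the request bodies are built here, and sending them to OpenSearch is left out.

```python
import json
from urllib.parse import unquote_plus


def s3_uris_from_event(event: dict) -> list[str]:
    """Extract s3:// URIs from an S3 event notification (keys are URL-encoded)."""
    uris = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        key = unquote_plus(record["s3"]["object"]["key"])
        uris.append(f"s3://{bucket}/{key}")
    return uris


def build_source_query(source_uri: str, field: str = "metadata.source.keyword",
                       size: int = 1000) -> dict:
    """Search body matching every chunk whose metadata points at source_uri.

    The field name is an assumption about how the chunks were indexed.
    """
    return {"size": size, "_source": False, "query": {"term": {field: source_uri}}}


def build_bulk_delete_body(index_name: str, doc_ids: list[str]) -> str:
    """Newline-delimited bulk body that deletes the given document IDs."""
    lines = [json.dumps({"delete": {"_index": index_name, "_id": doc_id}})
             for doc_id in doc_ids]
    return "\n".join(lines) + "\n" if lines else ""


# Example event shaped like an S3 ObjectRemoved notification (trimmed).
event = {"Records": [{"s3": {"bucket": {"name": "my-docs"},
                             "object": {"key": "reports/q1+2024.pdf"}}}]}
uris = s3_uris_from_event(event)
search_body = build_source_query(uris[0])
bulk_body = build_bulk_delete_body("reuters", ["id-1", "id-2"])
```

In a real handler, the IDs passed to `build_bulk_delete_body` would come from the hits returned by the search, and large documents may need the search to be paginated.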
@sergioquintero4624 8 months ago
Hi. Can you explain a little more about the cost of this PoC ? Thanks
@juliensimonfr 6 months ago
Check the pricing for the AWS services involved :)
@caiyu538 9 months ago
thumb up first and then watch.
@user-qy8wf8rx4q 2 months ago
Thanks Julien. I don't like this service, I'm struggling with it myself.