RAG Explained

  Рет қаралды 72,262

IBM Technology

IBM Technology

Күн бұрын

Get the interactive demo → ibm.biz/BdmPEb
Learn about the technology → ibm.biz/BdmPEp
Oftentimes, GAI and RAG discussions are interconnected. Learn more about about RAG is and how it works alongside your databases, LLMs and vector databases for better results with Luv Aggarwal and Shawn Brennan.
AI news moves fast. Sign up for a monthly newsletter for AI updates from IBM → ibm.biz/BdmP2c

Пікірлер: 60
@amritbro
@amritbro 3 ай бұрын
Very simple and clear explanation.. cheers to IBM
@paulaichniowski968
@paulaichniowski968 3 ай бұрын
Shawn & Luv!!!! Awesome job!!!!
@akshitgoel1183
@akshitgoel1183 Ай бұрын
Hi I have few Questions , Please find time to answer 0. Are we filtering the Data in the Vector DB ? (If Yes? then ) 1. How are we filtering the relevant data from our vector DB to augment to our prompt for the LLM ? 1. 1 who is doing this process , another LLM, our own code , or some-different tool? 1.2 Are we feeding the complete data as whole to the LLM? 1.2 if we are filtering the vector data using Rule Based Mechanism then what will be the use case of LLM , how is the power of LLM being drawn if we are the one who is making the decision what to feed as a relevant data to the LLM?
@satish1012
@satish1012 11 күн бұрын
Hi This is what my understanding Storing Data as Embeddings: Correctness: Storing data (documents, images, etc.) as embeddings in a Vector DB is a valid approach. Embeddings represent the data in a high-dimensional vector space, capturing its semantic meaning. Consideration: Ensure that the embedding model you use is appropriate for your data type. For images, you might use a different model (e.g., CLIP) compared to text embeddings. Searching with Embeddings: Correctness: Converting the search query into embeddings and then comparing these embeddings with those stored in your Vector DB is correct. This allows for semantic search, which is more effective than keyword-based search. Consideration: Ensure that the conversion process and similarity calculations (e.g., cosine similarity) are implemented correctly. The returned plain text should be accurately relevant to the search query. Summarization by LLM: Correctness: Sending the retrieved plain text content to an LLM for summarization is appropriate. LLMs are designed to generate summaries and provide concise explanations based on the input text. Consideration: Ensure that the LLM is correctly configured for summarization tasks. Provide clear instructions or prompts to achieve the desired summarization quality. Returning Summarized Text to User: Correctness: Receiving the summarized text from the LLM and returning it to the user is the final step in the process. This is standard practice for providing user-friendly summaries. Consideration: Validate that the summarized content meets user expectations and provides accurate, meaningful information. 1. We Store all out relevant data in Vector DB ..like documents , images etc as part of Embeddings 2. When User Searches, it will not hit LLB directly, it will convert our search into Embeddings and return the resutlt with plain text 3. Then we send the same to text for summarization to LLM 4. Then LLM returns the summarized text back to user
@LorenzoMarkovian
@LorenzoMarkovian Ай бұрын
Clear and simple, thank you guys and thank you IBM
@emteiks
@emteiks 3 ай бұрын
There is good point of halucinations of AI and the video unfortunately does not address it. The data governance is not addressing this issue we still can have a scenario where input is valid but output generated by AI is a garbage.
@vintastic_
@vintastic_ 3 ай бұрын
What's the solution?
@frackinfamous6126
@frackinfamous6126 3 ай бұрын
@@vintastic_you have to make sure the relevant data is going to the model. Good info into the data base is only half the battle. Semantic chunking. Size of chunks..types of search. Type of vector database used. For example PG Vector is a Postgres plugin and is not near as good at retrieval (usually) as something like pinecone
@frackinfamous6126
@frackinfamous6126 3 ай бұрын
Then the prompt used can also tremendously affect the model. You have to put it in the right context and use industry specific terms when prompting. Even a genius needs context or a bit of time to think. No matter who good the model, you have to know a bit about the specific industry to obtain great results. It’s like explains a noise to you mechanic or telling them you have a miss-fire on cylinder 1.
@miraculixxs
@miraculixxs 2 ай бұрын
​@@vintastic_ human expert review for every response
@aaryaxz
@aaryaxz 3 ай бұрын
Hey! If I ask a RAG-based language model, "Tell me the features of the iPhone 17," what will it tell me? Will it say it doesn't know or will it hallucinate? I understand that once the iPhone 17 is released, the database will be updated to provide the correct information. But what happens if I ask about it before its release?
@EduardoSantAnna
@EduardoSantAnna 2 ай бұрын
I can see two scenarios here: if it is indeed RAG-based, then you have provided info about the yet-to-be-released iPhone 17. So the LLM will respond based on that. If you don't have it in your additional documents/vector DBs, then I'd recommend you to have always added something along the lines of "only answer with facts you have access to" to your system prompt and to set Temperature to a low number. (Temperature is a parameters in LLMs that defines how "creative" the model can be) Great question and it highlights the importance of having experts in GenAI guiding enterprises on how to implement this in a way that suits their use cases.
@fanzhang8823
@fanzhang8823 Ай бұрын
1. Vector DB may or may not stored relevant information related to the question. 2. LLM may already have information or more accurate information to the question. So RAG may not always be helpful for GenAi applications
@vvishnuk
@vvishnuk 3 ай бұрын
Neat and detail explanation.
@Thomas___018s
@Thomas___018s Ай бұрын
Poetic journey: the essence of refund details and expected actions
@mzimmerman1988
@mzimmerman1988 3 ай бұрын
nice work.
@akhil7110
@akhil7110 3 ай бұрын
Does not address how do you validate the Q1 results returned are accurate. You should have built in a process parallel to querying the LLM of actually querying the results and training the LLM to address any discrepancies, if that is possible or correct them.
@THEaiGAI
@THEaiGAI 3 ай бұрын
Awesome video
@stephaniakossman2923
@stephaniakossman2923 15 күн бұрын
Very clear explanation, thanks! How do you manage to avoid using "blackbox" models?
@research2you-su9om
@research2you-su9om 3 ай бұрын
Thanks for the straight forward description of RAG.
@bayesian7404
@bayesian7404 3 ай бұрын
Good job. I still need to learn more about data accuracy in a LLM.
@Laura__sa8q
@Laura__sa8q Ай бұрын
The demanding world of refund specifics and anticipated actions
@CumaliTurkmenoglu-zn7hp
@CumaliTurkmenoglu-zn7hp Ай бұрын
Great explanation. Thank you
@datoalavista581
@datoalavista581 3 ай бұрын
Thank for sharing !!
@nachoeigu
@nachoeigu 3 ай бұрын
Cool explanation
@hathanh4650
@hathanh4650 2 ай бұрын
Thank you for your sharing! Very helpful and easy to follow. Just one question, is there anyway we can test or reinforcement train the model to make sure the outputs are appropriate?
@AgentBangla
@AgentBangla 3 ай бұрын
Excellent video, love it
@ammularajeshsagar7953
@ammularajeshsagar7953 3 ай бұрын
Informative
@user-ht9st4up8q
@user-ht9st4up8q 3 ай бұрын
Interesting , thanks both
@George-j3George_372u
@George-j3George_372u Ай бұрын
Ironically, because transfers between banks and cards always go so smoothly, don't they? But seriously, it's all good.
@Patricia___361n
@Patricia___361n Ай бұрын
Behind the scenes: Binance CEO shares insights into future developments in an exclusive interview
@DiegoSarasua-jn2wh
@DiegoSarasua-jn2wh 3 ай бұрын
Thanks guys, very clear!
@siddid7620
@siddid7620 Ай бұрын
7:10 sounds like support for open source.
@mictow
@mictow 14 күн бұрын
Did you need to learn to write backwards for this videos?? or is there a product that help you with this nice board?
@nguyentran7068
@nguyentran7068 4 күн бұрын
You record the video then mirror the image. They simple write on the clear board
@5uryaprakashPi
@5uryaprakashPi 2 ай бұрын
So with a rag approach, can I say that we can update the original vector db with our own processed data?
@yinlan2672
@yinlan2672 Ай бұрын
How to do this video? screen as the board.
@hassanabida
@hassanabida 3 ай бұрын
Gotcha.
@gatsby66
@gatsby66 3 ай бұрын
Nobody's ever been fired for buying RAGs from IBM.
@miraculixxs
@miraculixxs 2 ай бұрын
... yet
@ColmFearon-dc8dr
@ColmFearon-dc8dr 2 ай бұрын
Do these guys write on the whiteboard backwards or how does that work?
@JShaker
@JShaker Ай бұрын
Yes they learned to write backwards for this video because it's cheaper to do that than to run the algorithm to flip a video horizontally
@mackkaputo8989
@mackkaputo8989 Ай бұрын
@@JShaker 😂😂😂
@evgenii.panaite
@evgenii.panaite 3 ай бұрын
ok, gotcha 👌
@dheerajkumar-uk6ec
@dheerajkumar-uk6ec Ай бұрын
exactly love
@topcommenter
@topcommenter 2 ай бұрын
🤔I think Luv saw the connection the entire time
@bangyor5949
@bangyor5949 2 ай бұрын
Do LLMs store our sensitive data when using RAG?
@satish1012
@satish1012 11 күн бұрын
No , Vector DB stores all the sensitive and confidential data Once obtained send that data to LLM to get it summarized because Vector DB would have given only the connecting data for the string
@SidStrong-l9t
@SidStrong-l9t 2 ай бұрын
Terrible Analogy! When a journalist is ants to do research, he goes to a library and asks the librarian?? As opposed to doing a google search ? This scenario is from last century before Luv was born ?
@BlueBearOne
@BlueBearOne 3 ай бұрын
And then a wide spread global epidemic crisis is brought to light wherein our gold standard "books" (peer reviewed journals) are rife with bad and corrupt data due to mismatched incentivization and misalignment of directives; and we then realize...how much good data through science do we really have? Shame we polluted the books we are supposed to be able to trust now that we have this magnificent technology here. 😭
@godlymajins
@godlymajins 3 ай бұрын
Dude, keep on topic. This isn’t the place for your grievances
@markmotarker
@markmotarker 2 ай бұрын
lol. must be annoying to talk to "Luv". "Hi, Luv", "Exactly, Luv"
@sk3ffingtonai
@sk3ffingtonai 3 ай бұрын
👏👏
@vrohan07
@vrohan07 3 ай бұрын
kabhi haans bhi liya karo.. (Smile a bit bros)
@maliciousinferno
@maliciousinferno 3 ай бұрын
How in the world is this dude writing inverted for us too read straight lol
@inriinriinriinriinri
@inriinriinriinriinri 2 ай бұрын
they mirror the video) you can notice that most of the people writing on a glass board are left handed(in reality 90% of planet’s population is right handed), that’s also because they mirrored the video
@ObscuredByCIouds
@ObscuredByCIouds 2 ай бұрын
It's a skill only left-handed people have
@Lemonsstored
@Lemonsstored 3 ай бұрын
Boring
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 130 М.
What are AI Agents?
12:29
IBM Technology
Рет қаралды 186 М.
My Cheetos🍕PIZZA #cooking #shorts
00:43
BANKII
Рет қаралды 27 МЛН
Zombie Boy Saved My Life 💚
00:29
Alan Chikin Chow
Рет қаралды 9 МЛН
Get 10 Mega Boxes OR 60 Starr Drops!!
01:39
Brawl Stars
Рет қаралды 17 МЛН
123 GO! Houseによる偽の舌ドッキリ 😂👅
00:20
123 GO! HOUSE Japanese
Рет қаралды 5 МЛН
What is a Vector Database?
8:12
IBM Technology
Рет қаралды 72 М.
OpenAI Embeddings and Vector Databases Crash Course
18:41
Adrian Twarog
Рет қаралды 447 М.
What is LangChain?
8:08
IBM Technology
Рет қаралды 194 М.
The Boundary of Computation
12:59
Mutual Information
Рет қаралды 1 МЛН
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 939 М.
4 Methods of Prompt Engineering
12:42
IBM Technology
Рет қаралды 130 М.
How to Improve LLMs with RAG (Overview + Python Code)
21:41
Shaw Talebi
Рет қаралды 49 М.
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 90 М.
Vector Databases simply explained! (Embeddings & Indexes)
4:23
AssemblyAI
Рет қаралды 314 М.
How AI 'Understands' Images (CLIP) - Computerphile
18:05
Computerphile
Рет қаралды 195 М.
My Cheetos🍕PIZZA #cooking #shorts
00:43
BANKII
Рет қаралды 27 МЛН