What is Retrieval-Augmented Generation (RAG)?

  Рет қаралды 455,554

IBM Technology

IBM Technology

Күн бұрын

Try RAG with watsonx → ibm.biz/BdMsRT
Learn more about RAG→ ibm.biz/BdMsRt
Large language models usually give great answers, but because they're limited to the training data used to create the model. Over time they can become incomplete--or worse, generate answers that are just plain wrong. One way of improving the LLM results is called "retrieval-augmented generation" or RAG. In this video, IBM Senior Research Scientist Marina Danilevsky explains the LLM/RAG framework and how this combination delivers two big advantages, namely: the model gets the most up-to-date and trustworthy facts, and you can see where the model got its info, lending more credibility to what it generates.
Get started for free on IBM Cloud → ibm.biz/sign-up-now
Subscribe to see more videos like this in the future → ibm.biz/subscribe-now

Пікірлер: 360
@xzskywalkersun515
@xzskywalkersun515 5 ай бұрын
This lecturer should be given credit for such an amazing explanation.
@cosmicscattering5499
@cosmicscattering5499 3 ай бұрын
I was thinking the same, she explained this so clearly.
@tariqmking
@tariqmking Ай бұрын
Yes this was excellently explained, kudos to her.
@brianmi40
@brianmi40 Ай бұрын
Or at least credit for being able to write backwards!
@victoriamilhoan512
@victoriamilhoan512 2 күн бұрын
The connection between a human answering a question in real life vs how LLMs (with or without RAG) do it was so helpful!
@vt1454
@vt1454 6 ай бұрын
IBM should start a learning platform. Their videos are so good.
@XEQUTE
@XEQUTE 5 ай бұрын
i think they already do
@srinivasreddyt9555
@srinivasreddyt9555 Ай бұрын
Yes, they have it already. KZbin.
@siddheshpgaikwad
@siddheshpgaikwad 22 күн бұрын
Its mirrored video, she wrote naturally and video was mirrored later
@Hossam_Ahmed_
@Hossam_Ahmed_ 21 күн бұрын
They have skill build but not videos at least most of the content
@CaptPicard81
@CaptPicard81 18 күн бұрын
They do, I recently attended a week long AI workshop based on an IBM curriculum
@ghtgillen
@ghtgillen 7 ай бұрын
Your ability to write backwards on the glass is amazing! ;-)
@jsonbourne8122
@jsonbourne8122 6 ай бұрын
They flip the video
@Paul-rs4gd
@Paul-rs4gd 3 ай бұрын
@@jsonbourne8122 So obvious, but I did not think of it. My idea was way more complicated!
@natoreus
@natoreus 6 сағат бұрын
I'm sure it was already said, but this video is the most thorough, simple way I've seen RAG explained on YT hands down. Well done.
@jordonkash
@jordonkash 2 ай бұрын
4:15 Marina combines the colors of the word prompt to emphasis her point. Nice touch
@maruthuk
@maruthuk 7 ай бұрын
Loved the simple example to describe how RAG can be used to augment the responses of LLM models.
@ericadar
@ericadar 5 ай бұрын
Marina is a talented teacher. This was brief, clear and enjoyable.
@TheAllnun21
@TheAllnun21 5 ай бұрын
Wow, this is the best beginner's introduction I've seen on RAG!
@m.kaschi2741
@m.kaschi2741 5 ай бұрын
Wow, I opened youtube coming from the ibm blog just to leave a comment. Clearly explained, very good example, and well presented as well!! :) Thank you
@geopopos
@geopopos 2 ай бұрын
I love seeing a large company like IBM invest in educating the public with free content! You all rock!
@vikramn2190
@vikramn2190 7 ай бұрын
I believe the video is slightly inaccurate. As one of the commenters mentioned, the LLM is frozen and the act of interfacing with external sources and vector datastores is not carried out by the LLM. The following is the actual flow: Step 1: User makes a prompt Step 2: Prompt is converted to a vector embedding Step 3: Nearby documents in vector space are selected Step 4: Prompt is sent along with selected documents as context Step 5: LLM responds with given context Please correct me if I'm wrong.
@DJ-lo8qj
@DJ-lo8qj 21 күн бұрын
I’m not sure. Looking at OpenAI documentation on RAG, they have a similar flow as demonstrated in this video. I think the retrieval of external data is considered to be part of the LLM (at least per OpenAI)
@PlaytimeEntertainment
@PlaytimeEntertainment 20 күн бұрын
I do not think retrieval is part of LLM. LLM is the best model at the end of convergence after training. It can't be modified rather after LLM response you can always use that info for next flow of retrieval
@Lucildor
@Lucildor 3 ай бұрын
Please keep all these videos coming! They are so easy to understand and straightforward. Muchas gracias!
@GregSolon
@GregSolon 3 ай бұрын
One of the easiest to understand RAG explanations I've seen - thanks.
@aam50
@aam50 5 ай бұрын
That's a really great explanation of RAG in terms most people will understand. I was also sufficiently fascinated by how the writing on glass was done to go hunt down the answer from other comments!
@ntoscano01
@ntoscano01 3 ай бұрын
Very well explained!!! Thank you for your explanation of this. I’m so tired of 45 minute KZbin videos with a college educated professional trying to explain ML topics. If you can’t explain a topic in your own language in 10 minutes or less than you have failed to either understand it yourself or communicate effectively.
@444Yielding
@444Yielding 20 күн бұрын
This video is highly underviewed for as informative as it is!
@kingvanessa946
@kingvanessa946 3 ай бұрын
For me, this is the most easy-to-understand video to explain RAG!
@projectfocrin
@projectfocrin 5 ай бұрын
Great explanation. Even the pros in the field I have never seen explain like this.
@Shailendrashail
@Shailendrashail 8 ай бұрын
Good Explanation of RAG. Thanks for sharing.
@jyhherng
@jyhherng 6 ай бұрын
this let's me understand why the embeddings used to generate the vectorstore is a different set from the embeddings of the LLM... Thanks, Marina!
@paulaenchina
@paulaenchina 3 ай бұрын
This is the best explanation I have seen so far for RAG! Amazing content!
@ReflectionOcean
@ReflectionOcean 5 ай бұрын
1. Understanding the challenges with LLMs - 0:36 2. Introducing Retrieval-Augmented Generation (RAG) to solve LLM issues - 0:18 3. Using RAG to provide accurate, up-to-date information - 1:26 4. Demonstrating how RAG uses a content store to improve responses - 3:02 5. Explaining the three-part prompt in the RAG framework - 4:13 6. Addressing how RAG keeps LLMs current without retraining - 4:38 7. Highlighting the use of primary sources to prevent data hallucination - 5:02 8. Discussing the importance of improving both the retriever and the generative model - 6:01
@past_life_project
@past_life_project 3 ай бұрын
I have watched many IBM videos and this is the undoubtedly the best ! I will be searching for your videos now Marina!
@rujmah
@rujmah 2 ай бұрын
Brilliant explanation and illustration. Thanks for your hard work putting this presentation together.
@TheMsksk
@TheMsksk 8 ай бұрын
Great video as always. Thanks for sharing.
@hamidapremani6151
@hamidapremani6151 2 ай бұрын
The explanation was spot on! IBM is the go to platform to learn about new technology with their high quality content explained and illustrated with so much simplicity.
@redwinsh258
@redwinsh258 6 ай бұрын
The interesting part is not retrieval from the internet, but retrieval from long term memory, and with a stated objective that builds on such long term memory, and continually gives it "maintenance" so it's efficient and effective to answer. LLMs are awesome because even though there are many challenges ahead, they sort of give us a hint of what's possible, without them it would be hard to have the motivation to follow the road
@evaiintelligence
@evaiintelligence 22 күн бұрын
Marina has done a great job explaining LLM and RAGs in simple terms.
@vnaykmar7
@vnaykmar7 5 ай бұрын
Such an amazing explanation. Thank you ma'am!
@francischacko3627
@francischacko3627 17 күн бұрын
perfect explanation understood every bit , no lags kept it very interesting ,amazing job
@gaemrpaterso-ri2jd
@gaemrpaterso-ri2jd 8 ай бұрын
Great video, you guys should do one on promising tech industries
@rvssrkrishna2
@rvssrkrishna2 2 ай бұрын
Very precise and exact information on RAG in a nutshell. Thank you for saving my time.
@user-cd6hp5kc1n
@user-cd6hp5kc1n 7 ай бұрын
The ability to write backwards, much less cursive writing backwards, is very impressive!
@IBMTechnology
@IBMTechnology 7 ай бұрын
See ibm.biz/write-backwards
@jsonbourne8122
@jsonbourne8122 6 ай бұрын
Left hand too!
@NishanSaliya
@NishanSaliya 5 ай бұрын
@@IBMTechnology Thanks .... I was reading comments to check for an answer for that question!
@afshinkarimi2382
@afshinkarimi2382 8 ай бұрын
Great video. Thanks for sharing
@rafa1rafa
@rafa1rafa 5 ай бұрын
Great explanation! The video was very didactic, congratulations!
@kunalsoni7681
@kunalsoni7681 6 ай бұрын
Thanks for letting us know about this feature of LLM :)
@Anubis2828
@Anubis2828 2 ай бұрын
Great, simple, quick explanation
@HimalayJoriwal
@HimalayJoriwal 2 ай бұрын
Best explanation so far from all the content on internet.
@bdouglas
@bdouglas Ай бұрын
That was excellent, simple, and elegant! Thank you!
@mstarlingc
@mstarlingc 5 ай бұрын
Pretty simple explanation, thank you
@Aryankingz
@Aryankingz 7 ай бұрын
That's what Knowledge graphs are for, to keep LLMs grounded with a reliable source and up-to-date.
@sawyerburnett8319
@sawyerburnett8319 4 ай бұрын
Wow, having a lightbulb moment finally after hearing this mentioned so often. Makes more sense now!
@PaulGrew-wl7mh
@PaulGrew-wl7mh Ай бұрын
An amazing explanation that made RAG understandable in about 4:23 minutes!
@rockochamp
@rockochamp 5 ай бұрын
very well executed presentation. i had to think twice about how you can write in reverse but then i RAGed my system 2 :)
@toenytv7946
@toenytv7946 2 ай бұрын
Great down the rabbit hole video. Very deep and understandable. IBM academy worthy in my opinion.
@javi_park
@javi_park 3 ай бұрын
hold up - the fact that the board is flipped is the most underrated modern education marvel nobody's talking about
@RiaKeenan
@RiaKeenan 3 ай бұрын
I know, right?!
@euseikodak
@euseikodak 3 ай бұрын
Probably they filmed it in front of a glass board and flipped the video on edition later on
@politicallyincorrect1705
@politicallyincorrect1705 3 ай бұрын
Filmed in front of a non-reflective mirror.
@TheTomtz
@TheTomtz Ай бұрын
Just simply write on a glass board ,record it from the other side and laterally flip the image! Simple aa that.. and pls dont distract people from the contents being lectured by thinkin about the process behind the rec🤣
@thewallstreetjournal5675
@thewallstreetjournal5675 Ай бұрын
Is the board fliped or has she been flipped?
@xdevs23
@xdevs23 Ай бұрын
The entire video I've been wondering how they made the transparent whiteboard
@user-im6ub3sf6m
@user-im6ub3sf6m 3 ай бұрын
Great explanation with an example. Thank you
@thomasbrowne6649
@thomasbrowne6649 Ай бұрын
This is excellent and I hope IBM does well in this space. We need a reliable, non-hype vendor.
@preciousrose2715
@preciousrose2715 24 күн бұрын
This was such an amazing explanation!
@mohamadhijazi3895
@mohamadhijazi3895 Ай бұрын
The video is short and consice yet the delivery is very elegant. She might be the best instructor that have teached me. Any idea how the video was created?
@user-hk5dk9rb6p
@user-hk5dk9rb6p 4 ай бұрын
Fantastic video and explanation. Thank you!
@laurentpastorelli1354
@laurentpastorelli1354 4 ай бұрын
Super good and clear, well done!
@oieieio741
@oieieio741 5 ай бұрын
Very Helpful! Great explanation. thx IBM
@eddisonlewis8099
@eddisonlewis8099 3 ай бұрын
AWESOME EXPLANATION OF THE CONCEPT RAG
@Kekko400D
@Kekko400D 3 ай бұрын
Fantastic explanation, proud to be an IBMer
@ashwinkumar675
@ashwinkumar675 17 күн бұрын
This is so well explained! Thank you 👍🏻✅
@alexiojunior7867
@alexiojunior7867 28 күн бұрын
wow this was an amazing Explanation ,very easy to understand
@JasonVonHolmes
@JasonVonHolmes Ай бұрын
This was explained fantastically.
@zuzukouzina-original
@zuzukouzina-original 3 ай бұрын
Very clear explanation, much respect 🫡
@khalidelgazzar
@khalidelgazzar 5 ай бұрын
Great explanation. Thank you!😊
@lauther_27
@lauther_27 5 ай бұрын
Amazing video, thanks IBM ❤
@johnmccullough7084
@johnmccullough7084 6 ай бұрын
Appreciate the succinct explanation. 👍
@AntenorTeixeira
@AntenorTeixeira 4 ай бұрын
That's the best video about RAG that I've watched
@MraM23
@MraM23 3 ай бұрын
Great lessons! Nice of you to step out 🙃 and make such engaging and educative content This is a very useful in helping us in critical thinking. Thank you for sharing this video. 👍 Current ai models may impose neurotypical norms and expectations based on current data trained on . 🤔 Curious to see more on how IBM approach the challenges and limitations of Ai
@user-bo1kv5zy3w
@user-bo1kv5zy3w 7 ай бұрын
Awesome explanation. Love you.
@ericmcnally5128
@ericmcnally5128 2 ай бұрын
This is a fantastic lesson video.
@prasannakulkarni5664
@prasannakulkarni5664 Ай бұрын
the color coding on your whiteboard is really apt here !
@aniket_mishr
@aniket_mishr Ай бұрын
The explanation was very good 💯.
@kallamamran
@kallamamran 4 ай бұрын
We also need the models to cross check their own answers with the sources of information before printing out the answer to the user. There is no self control today. Models just say things. "I don't know" is actually a perfectly fine answer sometimes!
@AC-xd7sw
@AC-xd7sw 4 ай бұрын
Insightful, please more video like this
@deltawhiplash1614
@deltawhiplash1614 6 күн бұрын
This is a really good video thank you for sharing this knowledge
@star2k279
@star2k279 4 ай бұрын
Thank you for such a great explanation.
@Junglytics
@Junglytics 3 ай бұрын
Great video, excellent explanation!
@terencelewis4985
@terencelewis4985 3 ай бұрын
Excellent explanation!
@AdarshKumar-kx2cn
@AdarshKumar-kx2cn 2 ай бұрын
Beautifully explained....thanks
@sprintwithcarlos
@sprintwithcarlos 5 ай бұрын
Great explanation!
@katsunoi
@katsunoi 5 ай бұрын
nice video - great explanation!
@sumedhaj9017
@sumedhaj9017 2 ай бұрын
Amazing explanation! Thank you:)
@421sap
@421sap 5 ай бұрын
Thank you, Marina Danilevsky ....
@janhorak8799
@janhorak8799 2 ай бұрын
Did all the speakers have to learn how to write in a mirrored way or is this effect reached by some digital trick?
@VlogBySKSK
@VlogBySKSK 26 күн бұрын
There is a digital mirroring technique which is used to show the content this way...
@mao-tse-tung
@mao-tse-tung 17 күн бұрын
She was right handed before the mirror effect
@user-xf4vm2gf6g
@user-xf4vm2gf6g 3 ай бұрын
Excellent ! thank you for sharing this knowledge !
@stanislavzayarsky
@stanislavzayarsky 3 ай бұрын
Finally, we got a clear explanation!
@sudhakarveeraraghavan5832
@sudhakarveeraraghavan5832 Ай бұрын
Very well explained and it is easily understandable to non AI person as well. Thanks.
@shashankshekharsingh9336
@shashankshekharsingh9336 10 күн бұрын
very good and clear explanation
@berkeokur99
@berkeokur99 6 ай бұрын
Love the neon markers, also the content of course
@mohammadsubhan1318
@mohammadsubhan1318 3 ай бұрын
Nicely explained 👍
@siddharth4251
@siddharth4251 Ай бұрын
Amazing explanation, finally i understand it.
@MarshallMelnychuk
@MarshallMelnychuk 7 ай бұрын
Thank you Marina, very helpful and informative video. One question I have is; how do you make these videos like this? Being able to on a screen facing the camera, this is great. What's your secret?
@PeterCooperUK
@PeterCooperUK 7 ай бұрын
Sometimes these are done on transparent "whiteboards" and the video is then flipped horizontally.
@ChristopherSmithGPlus
@ChristopherSmithGPlus 7 ай бұрын
kzbin.info/www/bejne/m4eygXeHarCMqdEsi=LADnROL0SF33Hg54
@IBMTechnology
@IBMTechnology 7 ай бұрын
See ibm.biz/write-backwards
@lowkeyproducktvt2101
@lowkeyproducktvt2101 6 ай бұрын
@@IBMTechnology okay now i get it !!!
@rayuduaddagarla3857
@rayuduaddagarla3857 6 ай бұрын
IBM should hire left hand writers so it will right handed after flip 😊
@carvalhoribeiro
@carvalhoribeiro Ай бұрын
Amazing work. Thanks for sharing this.
@johanhumblet6090
@johanhumblet6090 4 ай бұрын
very well explained!
@shinemuphy
@shinemuphy 5 ай бұрын
Excellent explanation. thx
@rahulberry4806
@rahulberry4806 14 күн бұрын
thanks for the great explanation
@mrhassell
@mrhassell 22 күн бұрын
RAG combines the generative power of LLMs with the precision of specialized data search mechanisms, resulting in nuanced and contextually relevant responses.
@mayankbumb7272
@mayankbumb7272 15 сағат бұрын
Great explanation
@kevinmulligan2006
@kevinmulligan2006 Ай бұрын
tokens as a [word] is what I'm working on right now (solo, self learning LLM techniques), this video helped me realize how the model doesn't know what it's outputting obviously, but AI-AI is different, so building tokens that have dimensional vectors that process in a separate model, can be used for explainable AI.
@kevinmulligan2006
@kevinmulligan2006 Ай бұрын
meaning a separate model processes the response itself, meta, it's for building evolution learning. AI-AI machine learning, you need a way to configure in between the iterations.
@randomforest_dev
@randomforest_dev 22 күн бұрын
Very good explanation!
@sk-6032
@sk-6032 2 күн бұрын
Very well explained 🙏🏼👍
@gbluemink
@gbluemink 3 ай бұрын
So the question I have here is when I have an answer from my LLM but not the Rag data, what is the response to the user? "I don't know" or the LLM response that may be out of date or without a reliable source? Looks like a question for an LLM :)
@vic3sax972
@vic3sax972 2 ай бұрын
Nice explanation
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 63 М.
What is LangChain?
8:08
IBM Technology
Рет қаралды 105 М.
ISSEI funny story😂😂😂Strange World | Pink with inoCat
00:36
ISSEI / いっせい
Рет қаралды 13 МЛН
маленький брат прыгает в бассейн
00:15
GL Show Russian
Рет қаралды 2,6 МЛН
What is a Vector Database?
8:12
IBM Technology
Рет қаралды 36 М.
Stanford CS25: V3 I Retrieval Augmented Language Models
1:19:27
Stanford Online
Рет қаралды 125 М.
RAG for LLMs explained in 3 minutes
3:15
Manny Bernabe
Рет қаралды 10 М.
OpenAI Embeddings and Vector Databases Crash Course
18:41
Adrian Twarog
Рет қаралды 362 М.
What is Prompt Tuning?
8:33
IBM Technology
Рет қаралды 156 М.
Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use
15:21
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 217 М.
The most important AI trends in 2024
9:35
IBM Technology
Рет қаралды 179 М.
What are Generative AI models?
8:47
IBM Technology
Рет қаралды 886 М.
Prompt Engineering Tutorial - Master ChatGPT and LLM Responses
41:36
freeCodeCamp.org
Рет қаралды 1,2 МЛН