What is Retrieval Augmented Generation (RAG) - Augmenting LLMs with a memory

  Рет қаралды 26,911

What's AI by Louis-François Bouchard

What's AI by Louis-François Bouchard

4 ай бұрын

► Jump on our free RAG course from the Gen AI 360 Foundational Model Certification (Built in collaboration with Activeloop, Towards AI, and the Intel Disruptor Initiative): learn.activeloop.ai/courses/rag
►Twitter: / whats_ai
►My Newsletter (My AI updates and news clearly explained): louisbouchard.substack.com/
►Support me on Patreon: / whatsai
►Join Our AI Discord: / discord
How to start in AI/ML - A Complete Guide:
►www.louisbouchard.ai/learnai/
Become a member of the KZbin community, support my work and get a cool Discord role :
/ @whatsai
#ai #llm #rag

Пікірлер: 28
@letseat3553
@letseat3553 2 ай бұрын
RAG is just 'full text indexing' on the local data with the ranked results fed into the context window and sent to the LLM along with the question. Every time I see it described as something of a database guy for the last 30 years all I see are new words describing long solved problems.
@rajeshbasnet4454
@rajeshbasnet4454 Ай бұрын
You mean like how elastic search does indexing ?
@ahmedzouaoui8177
@ahmedzouaoui8177 20 күн бұрын
Well new cars have wheels which is a technology that has thousands of years of existence. It does not mean that new cars are 'obsolete' but using an old tech to improve a new one is a great way of doing engineering !
@Parsley1965
@Parsley1965 4 ай бұрын
Truly excellent video!
@helainz7198
@helainz7198 12 күн бұрын
Et cetera bien sur mon poto
@user-oh4jz9zu5v
@user-oh4jz9zu5v 4 ай бұрын
Now I understood, What is RAG - Retrieval Augmented Generation ,Very Informative Video, Liked your Video 👍
@Kama45
@Kama45 12 күн бұрын
Subbed
@sabriboubaker
@sabriboubaker 4 ай бұрын
Great video, straight to the point. Thanks again
@WhatsAI
@WhatsAI 4 ай бұрын
Thank you Sabri! :)
@Plink2120
@Plink2120 4 ай бұрын
Vraiment clair et précis merci
@finn_the_dog
@finn_the_dog 4 ай бұрын
Great video. Would you make a video the different types of RAGs? Or how to prepare data for a RAG, for example when your document has tables, math formulas, references to images, I haven't seen much content about how to handle diverse data inside a document with RAGs. Cheers
@WhatsAI
@WhatsAI 4 ай бұрын
Great idea, thank you! Will definitely look into multi modal RAG! :)
@prattipatimanojsai
@prattipatimanojsai 4 ай бұрын
Very Informative and useful!! Thanks
@bhanujinaidu
@bhanujinaidu Ай бұрын
Thanks , very clear excellent explanation
@WhatsAI
@WhatsAI Ай бұрын
Thank you! :)
@chairwood
@chairwood 4 ай бұрын
thx. i enjoyed this video
@WhatsAI
@WhatsAI 4 ай бұрын
Glad to hear so my friend! 😊
@MK-ce7im
@MK-ce7im 2 ай бұрын
I think this is the best video I have seen on this topic. Wanted to ask if we can use RAG offline maybe with Mistral model ?
@WhatsAI
@WhatsAI 2 ай бұрын
Of course you can host everything locally if you have the capacity! :)
@martinkrueger937
@martinkrueger937 3 ай бұрын
by any chance do you know which RAG system/framework is giving out the best performance?
@WhatsAI
@WhatsAI 3 ай бұрын
From our work we like to use llamaindex for many parts and adapt on our own code for more personalized settings!
@rhans6598
@rhans6598 3 ай бұрын
Thanks but what's the point of sound effects?
@Mr_Arun_Raj
@Mr_Arun_Raj 4 ай бұрын
After integrating with RAG. latency increased....
@WhatsAI
@WhatsAI 4 ай бұрын
That is for sure! There is some downsides but the latency if very little.
@paulwillisorg
@paulwillisorg Ай бұрын
The accent of the speaker is pretty heavy.
@WhatsAI
@WhatsAI Ай бұрын
Hope it’s still easy to understand!
@kunjs
@kunjs 3 ай бұрын
google launched gemini advanced 1.5, a RAG killer 💀
@WhatsAI
@WhatsAI 3 ай бұрын
A database can be much larger than this context window and much more efficient I believe. It’s unsure how good the models are vs gpt4 yet. Plus, sending millions of tokens for every prompt will be extremely expensive for each request, haha! It’s good for some use cases like sending a full repo once and asking questions but not for working with customers and handling many requests I believe.
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 81 М.
What is LangChain?
8:08
IBM Technology
Рет қаралды 132 М.
[柴犬ASMR]曼玉Manyu&小白Bai 毛发护理Spa asmr
01:00
是曼玉不是鳗鱼
Рет қаралды 50 МЛН
100❤️
00:19
Nonomen ノノメン
Рет қаралды 38 МЛН
КАК СПРЯТАТЬ КОНФЕТЫ
00:59
123 GO! Shorts Russian
Рет қаралды 3 МЛН
Miracle Doctor Saves Blind Girl ❤️
00:59
Alan Chikin Chow
Рет қаралды 51 МЛН
What are Mixture of Experts (GPT4, Mixtral…)?
12:07
What's AI by Louis-François Bouchard
Рет қаралды 1,2 М.
Vector Databases simply explained! (Embeddings & Indexes)
4:23
AssemblyAI
Рет қаралды 277 М.
RAG for LLMs explained in 3 minutes
3:15
Manny Bernabe
Рет қаралды 13 М.
AI Leader Reveals The Future of AI AGENTS (LangChain CEO)
16:22
Matthew Berman
Рет қаралды 87 М.
The Risks of AI-Generated Code: What Every Developer Should Know
10:10
What's AI by Louis-François Bouchard
Рет қаралды 504
How ChatGPT Works Technically | ChatGPT Architecture
7:54
ByteByteGo
Рет қаралды 703 М.
Transformer Neural Networks, ChatGPT's foundation, Clearly Explained!!!
36:15
StatQuest with Josh Starmer
Рет қаралды 575 М.
Chunking Strategies in RAG: Optimising Data for Advanced AI Responses
14:02
LangChain Explained in 13 Minutes | QuickStart Tutorial for Beginners
12:44
Выложил СВОЙ АЙФОН НА АВИТО #shorts
0:42
Дмитрий Левандовский
Рет қаралды 1,5 МЛН
POCO F6 PRO - ЛУЧШИЙ POCO НА ДАННЫЙ МОМЕНТ!
18:51
Топ-3 суперкрутых ПК из CompShop
1:00
CompShop Shorts
Рет қаралды 373 М.