RAG vs. Fine Tuning

  Рет қаралды 18,629

IBM Technology

IBM Technology

Күн бұрын

Get the guide to GAI, learn more → ibm.biz/BdKTbF
Learn more about the technology → ibm.biz/BdKTbX
Join Cedric Clyburn as he explores the differences and use cases of Retrieval Augmented Generation (RAG) and fine-tuning in enhancing large language models. This video covers the strengths, weaknesses, and common applications of both techniques, and provides insights on how to choose between them using machine learning and natural language processing principles
AI news moves fast. Sign up for a monthly newsletter for AI updates from IBM → www.ibm.com/ac...
#AI #LargeLanguageModels #FineTuning #RAG #ReinforcementLearning #MachineLearning #NaturalLanguageProcessing

Пікірлер: 38
@Kk-ed1gr
@Kk-ed1gr 19 күн бұрын
Thank you for the clarification, I had this question in mind last week, and I am glad that you have provided the answers I need.
@IBMTechnology
@IBMTechnology 18 күн бұрын
Glad it was helpful!
@yusufersayyem7242
@yusufersayyem7242 19 күн бұрын
35 minutes after downloading the clip, I received a notification, perhaps due to the weak internet in my country.... Finally, I would like to thank you sir for this wonderful explanation
@IBMTechnology
@IBMTechnology 18 күн бұрын
You're welcome!
@FauziFayyad
@FauziFayyad 19 күн бұрын
I have just watched the 1 years ago, then it updated today. Amazingg 🎉
@educationrepublic9273
@educationrepublic9273 8 күн бұрын
Love IBM's short and sharp explainers! Thank you for an excellent video once again :)
@Criszusep
@Criszusep 19 күн бұрын
Euro 2024 World Championship. Nice... of course the LLM could't give a response 😂
@umakrishnamarineni3520
@umakrishnamarineni3520 18 күн бұрын
The RAG isn't updated with new tournament 😂😅
@florentromanet5439
@florentromanet5439 19 күн бұрын
I wanted to scream "WHY NOT BOTH⁉️) until 7:35 😂
@MukeshKala
@MukeshKala 18 күн бұрын
Great explanation ❤
@bharathYerukola-gt7vt
@bharathYerukola-gt7vt 19 күн бұрын
Make a vedio on termonolgioes are often used on ai like benchmark and art of the state and etcc ❤
@infotainmentunlimitedbyrohit
@infotainmentunlimitedbyrohit 16 күн бұрын
Thank you 🙏💛
@mark-lq4rk
@mark-lq4rk 19 күн бұрын
Thank you for the fascinating presentation. Assume certain conditions are similar, how would the cost of rag and fine-tuning differ?
@IBMTechnology
@IBMTechnology 18 күн бұрын
RAG is generally more cost-efficient than fine-tuning because it limits resource costs by leveraging existing data and eliminating the need for extensive training stages.
@GG-uz8us
@GG-uz8us 4 күн бұрын
I would like to see a real app that is in production with RAG and fine-tuning.
@Siapanpeteellis
@Siapanpeteellis 19 күн бұрын
What happens to a model when it is fine-tuned? do you use a database for RAG?
@cloudnativecedric
@cloudnativecedric 19 күн бұрын
Good question! So with fine-tuning, using an approach like PEFT (Parameter-Efficient Fine-Tuning) which only updates a subset of the full model's parameters, we have new model weights and biases, which could then shared, deployed on a server, etc. for model inferencing with AI-enabled applications. For RAG, yes indeed, the most common method is with a vector database and turning your data into embeddings to search for similarity when using the LLM. But, there's other ways of setting up RAG pipelines too :)
@jasonrhtx
@jasonrhtx 17 күн бұрын
@@cloudnativecedricWhen would it make sense to first use PEFT, then apply RAG? Do both PEFT and RAG assign/label semantic relationships to the texts of user-added corpora and store these in a graph database?
@JamilaJibril-e8h
@JamilaJibril-e8h 19 күн бұрын
Uhhh okay i see you .....😂😂😂
@memehub2002
@memehub2002 19 күн бұрын
cool
@einjim
@einjim 18 күн бұрын
So, You are all told to wear your watch on your right hand right?!
@bharathYerukola-gt7vt
@bharathYerukola-gt7vt 19 күн бұрын
Nice vedio and also make a vedio on neural networks in deep like how neiral network is interlinked with deep learning and machine learning and what is actaully neuarl network and architecuture and why architectute is inporatnt fir neural networks and what is neural network actalkuy like a technique or mathematical expression or anything else so make a vedio on all these
@hi5wifi-s567
@hi5wifi-s567 18 күн бұрын
Using “Fine Tuning” , then machine ( accounting software) can be a bookkeeper to prepare financial records for …?
@cloudnativecedric
@cloudnativecedric 16 күн бұрын
Just some ideas from the top of my head for fine-tuning with financial records are preparing financial statements, tax preparation (fine tuning on region-specific tax rules and historical data), expense tracking & categorization, etc.
@ggggdyeye
@ggggdyeye 12 күн бұрын
sir can you tell me how to make the vectorstore and store it in a specific file to use it every time.
@SandraGarcia-t1k
@SandraGarcia-t1k 2 күн бұрын
White Deborah Wilson Susan Garcia Cynthia
@cho7official55
@cho7official55 17 күн бұрын
I thought the retriever was on the far right, and llm in the middle of both, was I wrong, partially, is that schematic representation doesn't fathom all of the architecture, I'd like to go deeper on that matter.
@cloudnativecedric
@cloudnativecedric 16 күн бұрын
There are a lot of variances with the RAG approach that can lead to different architectures, but there's a full video on the IBM Technology channel that dives into RAG as well!
@SandraGarcia-t1k
@SandraGarcia-t1k 5 күн бұрын
Garcia Kimberly Lopez Karen Hall Mark
@harryli7557
@harryli7557 3 күн бұрын
Large Manguage Model! 2:08
@andiglazkov4915
@andiglazkov4915 19 күн бұрын
Thanks ☺️
@IBMTechnology
@IBMTechnology 18 күн бұрын
You're welcome!
@ElaraArale
@ElaraArale 19 күн бұрын
Thank you~!
@IBMTechnology
@IBMTechnology 18 күн бұрын
You're welcome!
@atanasmatev9600
@atanasmatev9600 16 күн бұрын
Large Language model is "LMM"?
@cloudnativecedric
@cloudnativecedric 12 күн бұрын
Whoops! Good catch, sometimes I mess up when speaking and writing at the same time, it should be “LLM”.
@rfflduck
@rfflduck 19 күн бұрын
Great video!
@IBMTechnology
@IBMTechnology 18 күн бұрын
Thanks for the visit
What are AI Agents?
12:29
IBM Technology
Рет қаралды 486 М.
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 146 М.
Inside Out 2: BABY JOY VS SHIN SONIC 3
00:19
AnythingAlexia
Рет қаралды 9 МЛН
А ВЫ ЛЮБИТЕ ШКОЛУ?? #shorts
00:20
Паша Осадчий
Рет қаралды 9 МЛН
An Unknown Ending💪
00:49
ISSEI / いっせい
Рет қаралды 55 МЛН
The joker favorite#joker  #shorts
00:15
Untitled Joker
Рет қаралды 30 МЛН
What are Word Embeddings?
8:38
IBM Technology
Рет қаралды 10 М.
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 257 М.
RAG Explained in 7 Minutes: The Future of AI?
7:52
AIwithAustin․com
Рет қаралды 3,5 М.
Run ALL Your AI Locally in Minutes (LLMs, RAG, and more)
20:19
Cole Medin
Рет қаралды 95 М.
Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use
15:21
RAG Explained
8:03
IBM Technology
Рет қаралды 90 М.
10 Principles for Secure by Design: Baking Security into Your Systems
17:28
LoRA & QLoRA Fine-tuning Explained In-Depth
14:39
Entry Point AI
Рет қаралды 42 М.
The Best RAG Technique Yet? Anthropic’s Contextual Retrieval Explained!
16:14
What is Artificial Superintelligence (ASI)?
10:12
IBM Technology
Рет қаралды 10 М.
Inside Out 2: BABY JOY VS SHIN SONIC 3
00:19
AnythingAlexia
Рет қаралды 9 МЛН