RAGAS - Evaluate your LangChain RAG Pipelines

  Рет қаралды 8,459

Coding Crash Courses

Coding Crash Courses

Күн бұрын

Пікірлер: 32
@mohammed333suliman
@mohammed333suliman 3 ай бұрын
Great video , thank you
@codingcrashcourses8533
@codingcrashcourses8533 3 ай бұрын
thank you for your comment :)
@kumarrajaakula9064
@kumarrajaakula9064 17 күн бұрын
I want to know one thing that even the ground truth is generated by an LLM, how can we determine whether it is correct for a particular query?
@codingcrashcourses8533
@codingcrashcourses8533 17 күн бұрын
You probably want to create your own dataset for that. I also dont want the llm to define a ground truth
@GenerativeAI-Guru
@GenerativeAI-Guru 6 ай бұрын
I was waiting for this thank you so much, is it possible to add how to evaluate accuracy using F1 scoring or other methods
@codingcrashcourses8533
@codingcrashcourses8533 6 ай бұрын
Not out of the box, F1 scores can be easily caculated with pandas (to_pandas) like this: F1 = 2*precision*recall/(precision+recall)
@GenerativeAI-Guru
@GenerativeAI-Guru 6 ай бұрын
@@codingcrashcourses8533 thanks
@maxlgemeinderat9202
@maxlgemeinderat9202 6 ай бұрын
you could also calculate the RAGAS score which is the mean across all metrics
@maxlgemeinderat9202
@maxlgemeinderat9202 6 ай бұрын
Nice one! Also a big fan of RAGAS, however there are still many bugs that come with RAGAS, especially when trying to evaluate with local llms
@codingcrashcourses8533
@codingcrashcourses8533 6 ай бұрын
yes, it´s still far away from perfect, but good that frameworks like these are developed
@nguyenquynghia9755
@nguyenquynghia9755 3 ай бұрын
I switched to using RecursiveCharacterTextSplitter, but my context relevance is still low. Do you know why?
@seallyolme
@seallyolme 6 ай бұрын
This is awesome! Great and clear video :)
@robertputneydrake
@robertputneydrake 6 ай бұрын
Nice, Meister! Machste irgendwann das Thema Code RAG ggf. mit Knowledge-Graphen?
@codingcrashcourses8533
@codingcrashcourses8533 6 ай бұрын
Currently no plans on working with knowledge graphs, since I don´t have experience with these. But maybe in the future :)
@doggydoggy578
@doggydoggy578 Ай бұрын
Does this need an open ai key ? i have zero in my account
@codingcrashcourses8533
@codingcrashcourses8533 Ай бұрын
@@doggydoggy578 yes
@doggydoggy578
@doggydoggy578 Ай бұрын
@@codingcrashcourses8533 did you mention this in this video ? i think it's super important
@doggydoggy578
@doggydoggy578 Ай бұрын
@@codingcrashcourses8533 very nice, not at all an important detail to conveniently leave out
@mosheragomaa5544
@mosheragomaa5544 4 ай бұрын
So simple, helpful and clear! Very interesting. Thanks for the video
@alexandershevchenko4167
@alexandershevchenko4167 6 ай бұрын
Thank you for the video! Yeah, It will be really intereseting to know how to perform RAGAS in CI/CD pipline. Can you record video for this one please? Will be really helpful
@codingcrashcourses8533
@codingcrashcourses8533 6 ай бұрын
Maybe in a few weeks
@M10n8
@M10n8 6 ай бұрын
Excellent timing ;-) Thanks for video
@fire17102
@fire17102 6 ай бұрын
It's there an ai pipeline to auto optimize the rag quality? Seems like the obvious next step... Great video 🙏👍
@codingcrashcourses8533
@codingcrashcourses8533 6 ай бұрын
You probably would have to build something like that on your own, since there are so many ways how a pipeline could look like. You could also work on your prompt and so on.
@fire17102
@fire17102 6 ай бұрын
@@codingcrashcourses8533 I'd always want to manually make changes I think are best, but I'd still like to see a full matrix of hyperperameters to remove alot of the guess work. Chunk size for example. More over I'd like to benchmark everything and add scoring functions. For example a score for fact checking - see Lucidate's last video. And also IndyDevDan last video battle royal of models, I suggested to combine it with something like you do with rag params and what I suggest for full pipeline benchmark with ai suggested optimization
@Challseus
@Challseus 6 ай бұрын
Another banger! :)
@MMO-g2w
@MMO-g2w 6 ай бұрын
Bro is on fire this month!
@codingcrashcourses8533
@codingcrashcourses8533 6 ай бұрын
You guys give me so many requests on topics 😀
@MMO-g2w
@MMO-g2w 6 ай бұрын
@@codingcrashcourses8533 i was, i am and i will support you till the end. Ur videos helped my sooooooooooo much.
@andreypetrunin5702
@andreypetrunin5702 6 ай бұрын
Огромное спасибо за видео!!
LangChain vs. LlamaIndex - What Framework to use for RAG?
16:51
Coding Crash Courses
Рет қаралды 16 М.
RAPTOR - Advanced RAG with LangChain
15:43
Coding Crash Courses
Рет қаралды 10 М.
Amazing Parenting Hacks! 👶✨ #ParentingTips #LifeHacks
00:18
Snack Chat
Рет қаралды 22 МЛН
How to evaluate an LLM-powered RAG application automatically.
50:42
AI Agent Evaluation with RAGAS
19:42
James Briggs
Рет қаралды 13 М.
Supercharge Your RAG with Contextualized Late Interactions
17:45
Prompt Engineering
Рет қаралды 12 М.
The Best RAG Technique Yet? Anthropic’s Contextual Retrieval Explained!
16:14
LangChain Advanced RAG - Two-Stage Retrieval with Cross Encoder (BERT)
14:21
Coding Crash Courses
Рет қаралды 10 М.
LangChain - Advanced RAG Techniques for better Retrieval Performance
24:57
Coding Crash Courses
Рет қаралды 31 М.
Is Tree-based RAG Struggling? Not with Knowledge Graphs!
9:06
Local GraphRAG with LLaMa 3.1 - LangChain, Ollama & Neo4j
15:01
Coding Crash Courses
Рет қаралды 19 М.