RAGAS - Evaluate your LangChain RAG Pipelines

Views: 9,680

Coding Crash Courses

Days ago

Comments: 36

@melika1725 1 month ago

I am working on RAG systems for my master's thesis. Thank you for this video. Really, thank you!

@codingcrashcourses8533 1 month ago

You're welcome. On my channel I have more RAG videos, and I also offer an Advanced RAG course on Udemy :)

@seallyolme 7 months ago

This is awesome! Great and clear video :)

@mosheragomaa2 5 months ago

So simple, helpful and clear! Very interesting. Thanks for the video

@M10n8 8 months ago

Excellent timing ;-) Thanks for the video

@MMO-g2w 8 months ago

Bro is on fire this month!

@codingcrashcourses8533 8 months ago

You guys give me so many requests on topics 😀

@MMO-g2w 8 months ago

@codingcrashcourses8533 I was, I am, and I will support you till the end. Your videos helped me sooooooooooo much.

@maxlgemeinderat9202 8 months ago

Nice one! I'm also a big fan of RAGAS, but it still comes with many bugs, especially when trying to evaluate with local LLMs.

@codingcrashcourses8533 8 months ago

Yes, it's still far from perfect, but it's good that frameworks like this are being developed.

@andreypetrunin5702 8 months ago

Thank you so much for the video!!

@Challseus 8 months ago

Another banger! :)

@mohammed333suliman 5 months ago

Great video, thank you

@codingcrashcourses8533 5 months ago

Thank you for your comment :)

@robertputneydrake 8 months ago

Nice, master! Will you cover the topic of code RAG at some point, possibly with knowledge graphs?

@codingcrashcourses8533 8 months ago

Currently I have no plans to work with knowledge graphs, since I don't have experience with them. But maybe in the future :)

@alexandershevchenko4167 8 months ago

Thank you for the video! Yeah, it would be really interesting to know how to run RAGAS in a CI/CD pipeline. Could you record a video on this, please? It would be really helpful.

@codingcrashcourses8533 8 months ago

Maybe in a few weeks

@nguyenquynghia9755 5 months ago

I switched to using RecursiveCharacterTextSplitter, but my context relevance is still low. Do you know why?

@kumarrajaakula9064 2 months ago

I want to know one thing: if even the ground truth is generated by an LLM, how can we determine whether it is correct for a particular query?

@codingcrashcourses8533 2 months ago

You probably want to create your own dataset for that. I also don't want the LLM to define a ground truth.

@GenerativeAI-Guru 8 months ago

I was waiting for this, thank you so much. Is it possible to add how to evaluate accuracy using F1 scoring or other methods?

@codingcrashcourses8533 8 months ago

Not out of the box, but F1 scores can easily be calculated with pandas (to_pandas) like this: F1 = 2*precision*recall/(precision+recall)
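
A minimal sketch of the F1 calculation from the reply above, assuming the evaluation result has already been converted to a pandas DataFrame. The column names `context_precision` and `context_recall`, the helper `add_f1`, and the sample values are illustrative assumptions, not taken from the video. Rows where precision and recall are both 0 are assigned an F1 of 0 here to avoid a division by zero.

```python
import pandas as pd


def add_f1(df, precision_col="context_precision", recall_col="context_recall"):
    """Return a copy of df with an 'f1' column (harmonic mean of precision and recall)."""
    p = df[precision_col]
    r = df[recall_col]
    denom = p + r
    out = df.copy()
    # F1 = 2*precision*recall/(precision+recall); defined as 0 when both inputs are 0.
    out["f1"] = (2 * p * r / denom).where(denom != 0, 0.0)
    return out


# Made-up example scores, one row per evaluated question (assumed column names).
scores = pd.DataFrame(
    {
        "context_precision": [1.0, 0.5, 0.0],
        "context_recall": [0.5, 0.5, 0.0],
    }
)
scores_with_f1 = add_f1(scores)
print(scores_with_f1)
```

The per-question F1 values can then be averaged with `scores_with_f1["f1"].mean()` if a single number is wanted.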

@GenerativeAI-Guru 8 months ago

@codingcrashcourses8533 thanks

@maxlgemeinderat9202 8 months ago

You could also calculate the RAGAS score, which is the mean across all metrics.
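
A rough sketch of combining several metric columns into one overall score per question. The comment does not say whether "mean" is arithmetic or harmonic, so both are shown. The helper `add_overall_scores`, the column names, and the sample values are assumptions for illustration only, and this is not necessarily how RAGAS itself computes its score.

```python
import pandas as pd


def add_overall_scores(df, metric_cols):
    """Return a copy of df with arithmetic and harmonic means over metric_cols."""
    out = df.copy()
    metrics = out[metric_cols]
    # Arithmetic mean across the metric columns of each row.
    out["mean_score"] = metrics.mean(axis=1)
    # Harmonic mean: n / sum(1/x). A single zero metric makes the result 0.
    has_zero = (metrics == 0).any(axis=1)
    reciprocal_sum = (1.0 / metrics.where(metrics != 0)).sum(axis=1)
    harmonic = len(metric_cols) / reciprocal_sum
    out["harmonic_score"] = harmonic.where(~has_zero, 0.0)
    return out


# Made-up example scores (assumed metric names).
results = pd.DataFrame(
    {
        "faithfulness": [0.8, 1.0],
        "answer_relevancy": [0.4, 0.0],
    }
)
combined = add_overall_scores(results, ["faithfulness", "answer_relevancy"])
print(combined)
```

The harmonic mean punishes a single weak metric much harder than the arithmetic mean, which matters if one low score should drag the overall result down.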

@fire17102 8 months ago

Is there an AI pipeline to auto-optimize the RAG quality? Seems like the obvious next step... Great video 🙏👍

@codingcrashcourses8533 8 months ago

You would probably have to build something like that on your own, since there are so many ways a pipeline could look. You could also work on your prompt and so on.

@fire17102 8 months ago

@codingcrashcourses8533 I'd always want to manually make the changes I think are best, but I'd still like to see a full matrix of hyperparameters to remove a lot of the guesswork, chunk size for example. Moreover, I'd like to benchmark everything and add scoring functions, for example a score for fact checking (see Lucidate's last video). Also see IndyDevDan's last video, a battle royale of models. I suggested combining it with something like what you do with RAG params and what I suggest: a full pipeline benchmark with AI-suggested optimization.
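
One way the "matrix of hyperparameters" idea could be sketched is a plain grid search: try every combination of settings and rank them by an evaluation score. Everything below is hypothetical. `grid_search`, `dummy_evaluate`, and the parameters `chunk_size` and `top_k` are made-up names, and the scoring function is a stand-in. In a real setup `evaluate` would rebuild the RAG pipeline with the given parameters and return a metric from an evaluation run.

```python
from itertools import product


def grid_search(param_grid, evaluate):
    """Score every parameter combination and return results sorted best-first."""
    names = list(param_grid)
    results = []
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        results.append({**params, "score": evaluate(params)})
    return sorted(results, key=lambda row: row["score"], reverse=True)


def dummy_evaluate(params):
    # Placeholder scoring function (assumption): peaks at chunk_size=512, top_k=4.
    # Replace with a real evaluation of the pipeline built from these params.
    chunk_penalty = abs(params["chunk_size"] - 512) / 1024
    top_k_penalty = 0.05 * abs(params["top_k"] - 4)
    return 1.0 - chunk_penalty - top_k_penalty


grid = {"chunk_size": [256, 512, 1024], "top_k": [2, 4]}
ranking = grid_search(grid, dummy_evaluate)
best = ranking[0]
for row in ranking:
    print(row)
```

A full grid grows multiplicatively with each added parameter, and every cell costs one evaluation run, so in practice the grid would likely be kept small or sampled.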