LLM Evaluation With MLFLOW And Dagshub For Generative AI Application

  Рет қаралды 16,573

Krish Naik

Krish Naik

Күн бұрын

With the emerging of ChatGPT, LLMs have shown its power of text generation in various fields, such as question answering, translating and text summarization. Evaluating LLMs’ performance is slightly different from traditional ML models, as very often there is no single ground truth to compare against. MLflow provides an API mlflow.evaluate() to help evaluate your LLMs.
mlflow.org/doc...
Code:github.com/kri...
---------------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
►Llamindex Playlist: • Announcing LlamaIndex ...
►Google Gemini Playlist: • Google Is On Another L...
►Langchain Playlist: • Amazing Langchain Seri...
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad:amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD

Пікірлер: 26
@krishnaik06
@krishnaik06 4 ай бұрын
Subscribe if you want to become a Data Scientist :)
@jaisingh1292
@jaisingh1292 4 ай бұрын
Hi Krish can you please take a look at dvc and mlflow and can they be combined and used for gen ai , the demand for LLMops is increasing. For cloud its fine but industries want on prem solution as well it would be great if you can make any video on the same
@DarkShadow-bq5yb
@DarkShadow-bq5yb 4 ай бұрын
can you create video for Data engineering using LLMs
@HimanshuGupta-ps3ib
@HimanshuGupta-ps3ib 3 ай бұрын
Great video Krish! Watching your content from USA.
@anubhavsarkar5745
@anubhavsarkar5745 15 күн бұрын
Very Helpful Video Krish. Can you please show us the same LLM Model Evaluation using MLFlow, but using HuggingFace LLMs instead of OpenAI ? I am facing a lot of issues in adding payment method to the OpenAI platform.
@skyplanet9858
@skyplanet9858 Ай бұрын
Dear Kirish, thank you very much for the great video. Dagshub is an Israeli company that's complicit in genocide. I wish you could explore another tool.
@adityamaurya9521
@adityamaurya9521 3 күн бұрын
Can we do this with Mistral API keys ?
@2dapoint424
@2dapoint424 4 ай бұрын
Krish, @10:15 in VS code what is that cat like icon on top right next to the run button?
@adityarajshukla9422
@adityarajshukla9422 2 ай бұрын
copilot
@shriharinair1999
@shriharinair1999 4 ай бұрын
can you from now on focus more on non paid apis?
@prayagbrahmbhatt6375
@prayagbrahmbhatt6375 4 ай бұрын
Need a video explanation on promptfoo, an opensource LLM evaluation library.
@SantoshYadav-f4r4s
@SantoshYadav-f4r4s 3 ай бұрын
Hi Krish ! This is excellent video, will you be able to make a video on Azure AI Prompt flow, please?
@sourabhsomdeve1373
@sourabhsomdeve1373 22 күн бұрын
Nice
@pratiksitapara8962
@pratiksitapara8962 4 ай бұрын
Great video on Evals! Thanks! But It would be great to see or if you could make a video on Evaluating RAG on Traditional metrics? What insights we can get by using traditional evals or automatic evals? What is the current SOTA methods for RAG Evals!?
@YorkYongYeo
@YorkYongYeo 4 ай бұрын
thanks for sharing this! just what i needed for reference. Liked and looking forward to the one for RAG evaluation
@GiovanneAfonso
@GiovanneAfonso 3 ай бұрын
Incredible content, thank you +1sub
@pavanpraneeth4659
@pavanpraneeth4659 4 ай бұрын
Awesome please in future show how to integrate this with aws bedrock please
@arpitqw1
@arpitqw1 3 ай бұрын
what is the use of daghub ?, same once can see in local tracking server.
@surbhirohilla5139
@surbhirohilla5139 3 ай бұрын
You deserve more appreciation for this
@ashishdayal172
@ashishdayal172 4 ай бұрын
sounds like tensorflow hub
@ridj41
@ridj41 4 ай бұрын
Krish honestly the content has become a bit uninteresting tbh from the past few videos. Would love if you try out some new kind of projects, some advanced ones to use in our resumes using GenAi
@krishnaik06
@krishnaik06 4 ай бұрын
brother this is really important. I usually make all the videos based on the job market :)
@RishiRajxtrim
@RishiRajxtrim 4 ай бұрын
Good evening
How to evaluate an LLM-powered RAG application automatically.
50:42
How To Get Married:   #short
00:22
Jin and Hattie
Рет қаралды 23 МЛН
Миллионер | 1 - серия
34:31
Million Show
Рет қаралды 2,1 МЛН
小丑妹妹插队被妈妈教训!#小丑#路飞#家庭#搞笑
00:12
家庭搞笑日记
Рет қаралды 38 МЛН
AI vs ML vs DL vs Generative Ai
16:00
Krish Naik
Рет қаралды 45 М.
MLFlow Tutorial | ML Ops Tutorial
50:31
codebasics
Рет қаралды 19 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Generative AI In AWS-AWS Bedrock Crash Course #awsbedrock #genai
37:16
Evaluation for Large Language Models and Generative AI - A Deep Dive
1:16:49
Rajistics - data science, AI, and machine learning
Рет қаралды 9 М.
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 274 М.
Why Agent Frameworks Will Fail (and what to use instead)
19:21
Dave Ebbelaar
Рет қаралды 66 М.
How To Get Married:   #short
00:22
Jin and Hattie
Рет қаралды 23 МЛН