Data Versioning in Generative AI: A Pathway to Cost-effective ML

  Рет қаралды 49

MLOps World: Machine Learning in Production

MLOps World: Machine Learning in Production

4 ай бұрын

Speaker: Dmitry Petrov, CEO, DVC
For 5 years we have been building DVC and we know how data versioning helps teams. The evolving Generative AI workflows are different and require an evolution of versioning workflows to accomplish Generative AI goals. This new era thrives on vast amounts of unstructured data, which include everything from images, videos, and audio, to MRI scans, document scans, and plain text dialogues. This data, often scaling into billions of objects, together with the resource-consuming task of scoring models on expensive GPU hardware or using model APIs like ChatGPT, brings forth unique challenges in the field of data management and versioning.
In this talk, we will delve into data versioning in the context of generative AI. Our focus will be on strategies that assist businesses in minimizing their processing time and the volume of API calls to external models like ChatGPT, resulting in substantial cost savings. Furthermore, we will discuss effective methodologies for sharing datasets amongst ML researchers to promote seamless collaboration.
Lastly, we will examine the pivotal transformations generative AI has introduced to data versioning in the past year including annotations and embeddings versioning. Together, these insights will provide attendees with an in-depth understanding of the rapidly evolving data management landscape in the era of generative AI. ""
1. How data management is different in a Generative AI environment compared to traditional ML
2. How to save cost on compute and API calls using data versioning
3. Dataset sharing in the team as a way to improve collaboration
4. How to efficiently version annotation, embeddings, and auto-labels together with data

Пікірлер
Building ML and GenAI Systems with Metaflow
47:12
MLOps World: Machine Learning in Production
Рет қаралды 146
Finding training inefficiencies with CentML DeepView
40:07
MLOps World: Machine Learning in Production
Рет қаралды 44
HAH Chaos in the Bathroom 🚽✨ Smart Tools for the Throne 😜
00:49
123 GO! Kevin
Рет қаралды 16 МЛН
哈莉奎因怎么变骷髅了#小丑 #shorts
00:19
好人小丑
Рет қаралды 54 МЛН
From Idea to Production: AI Infra for Scaling LLM Apps
38:26
MLOps World: Machine Learning in Production
Рет қаралды 252
Function Calling for LLMs: RAG without a Vector Database
39:02
MLOps World: Machine Learning in Production
Рет қаралды 259
The Who, What, and Why of Data Lake Table Formats
47:33
MLOps World: Machine Learning in Production
Рет қаралды 80
Why AI apps don't work in prod: AI Reliability Survey
32:25
MLOps World: Machine Learning in Production
Рет қаралды 79
Making Enterprise GenAI Safe and Effective - Tools and Approaches
35:21
MLOps World: Machine Learning in Production
Рет қаралды 83
How To Prepare AI For Uses In Science
23:49
Forbes
Рет қаралды 26 М.
Radxa X4: An N100 Pi
20:48
ExplainingComputers
Рет қаралды 55 М.
The BEST component for your RAG system
44:56
MLOps World: Machine Learning in Production
Рет қаралды 408
Private, Local AI
36:56
MLOps World: Machine Learning in Production
Рет қаралды 220