Evaluating LLM-based Applications

How to Build LLMs on Your Company’s Data While on a Budget

[1hr Talk] Intro to Large Language Models

Let's all try it too‼︎#magic#tenge

How to open a can? 🤪 lifehack

Зачем командирам БМ-13 "Катюша" выдавали презервативы? #shorts

О, сосисочки! (Или корейская уличная еда?)

Evaluating LLM-based Applications

Рет қаралды 18,359

Databricks

9 ай бұрын

Evaluating LLM-based applications can feel like more of an art than a science. In this workshop, we'll give a hands-on introduction to evaluating language models. You'll come away with knowledge and tools you can use to evaluate your own applications, and answers to questions like:
- Where do I get evaluation data from, anyway?
- Is it possible to evaluate generative models in an automated way?
- What metrics can I use?
- What's the role of human evaluation?
Talk by: Josh Tobin
Here’s more to explore:
LLM Compact Guide: dbricks.co/43WuQyb Big Book of MLOps: dbricks.co/3r0Pqiz
Connect with us: Website: databricks.com
Twitter: / databricks
LinkedIn: / databricks
Instagram: / databricksinc
Facebook: / databricksinc

Пікірлер: 9

@AnandShah-ds 6 ай бұрын

Evaluations aside, I really enjoyed the presentation. I was hooked. Great story-telling skills Josh. Thanks for sharing your experience. We count on volunteers like you to spread knowledge.

@ndamulelosbg8887

@ndamulelosbg8887 2 ай бұрын

This is an exellent coverage of the challenging task of llm evaluatuon

@ndamulelosbg8887

@ndamulelosbg8887 2 ай бұрын

"Your opininon on LLMs does not matter" - I found this to be a great quote

@vaishnavipatil3319

@vaishnavipatil3319 9 ай бұрын

Thank you for clearing this concepts. Would like to see more videos from you on evaluation frameworks, methods.

@asfandiyar5829

@asfandiyar5829 8 ай бұрын

Just what I was after. Thanks

@manishsharma2211

@manishsharma2211 8 ай бұрын

Good work

@SpartanPanda 7 ай бұрын

Great storyline

@bharath_v 5 ай бұрын

Good One!

@threevia.travel

@threevia.travel 3 ай бұрын

Very generic, expected something more tangible! Sounds common sense which might work or might not work

How to Build LLMs on Your Company’s Data While on a Budget

40:37

How to Build LLMs on Your Company’s Data While on a Budget

Databricks

Рет қаралды 25 М.

[1hr Talk] Intro to Large Language Models

59:48

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

Рет қаралды 1,8 МЛН

Let's all try it too‼︎#magic#tenge

00:26

Let's all try it too‼︎#magic#tenge

Nonomen ノノメン

Рет қаралды 44 МЛН

How to open a can? 🤪 lifehack

00:25

How to open a can? 🤪 lifehack

Mr.Clabik - Friends

Рет қаралды 13 МЛН

Зачем командирам БМ-13 "Катюша" выдавали презервативы? #shorts

00:59

Зачем командирам БМ-13 "Катюша" выдавали презервативы? #shorts

Поле брани

Рет қаралды 3,8 МЛН

О, сосисочки! (Или корейская уличная еда?)

00:32

О, сосисочки! (Или корейская уличная еда?)

Кушать Хочу

Рет қаралды 3,3 МЛН

Evaluating LLM-based Applications // Josh Tobin // LLMs in Prod Conference Part 2

49:50

Evaluating LLM-based Applications // Josh Tobin // LLMs in Prod Conference Part 2

MLOps.community

Рет қаралды 4,2 М.

Fine-tuning Large Language Models (LLMs) | w/ Example Code

28:18

Fine-tuning Large Language Models (LLMs) | w/ Example Code

Shaw Talebi

Рет қаралды 221 М.

What Language Model To Choose For Your Project? 🤔 LLM Evaluation

13:07

What Language Model To Choose For Your Project? 🤔 LLM Evaluation

Analytics Camp

Рет қаралды 408

Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral

30:25

Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral

MLOps.community

Рет қаралды 8 М.

Evaluation for Large Language Models and Generative AI - A Deep Dive

1:16:49

Evaluation for Large Language Models and Generative AI - A Deep Dive

Rajistics - data science, AI, and machine learning

Рет қаралды 7 М.

Deep Dive into LLM Evaluation with Weights & Biases

59:11

Deep Dive into LLM Evaluation with Weights & Biases

DeepLearningAI

Рет қаралды 16 М.

"okay, but I want Llama 3 for my specific use case" - Here's how

24:20

"okay, but I want Llama 3 for my specific use case" - Here's how

David Ondrej

Рет қаралды 77 М.

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

15:21

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Entry Point AI

Рет қаралды 39 М.

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

30:29

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

Chris Hay

Рет қаралды 5 М.

How to evaluate LLM Applications - Webinar by deepset.ai

58:59

How to evaluate LLM Applications - Webinar by deepset.ai

deepset

Рет қаралды 883

Готовый миниПК от Intel (но от китайцев)

36:25

Готовый миниПК от Intel (но от китайцев)

Ремонтяш

Рет қаралды 64 М.

Не работает после профилактики / Геймпад Sony DUALSHOCK 4 | РЕМОНТ

3:30

Не работает после профилактики / Геймпад Sony DUALSHOCK 4 | РЕМОНТ

Remonter

Рет қаралды 29 М.

😍Lucky Day!! i Found Galaxy Z Fold 2, iPhone 11 & More! - Restoration Broken iPhone X

17:42

😍Lucky Day!! i Found Galaxy Z Fold 2, iPhone 11 & More! - Restoration Broken iPhone X

JaiPhone

Рет қаралды 3,1 МЛН

Пленка или защитное стекло: что лучше?

0:52

Пленка или защитное стекло: что лучше?

Слава 100пудово!

Рет қаралды 1,5 МЛН

Impossible sigma 🤣 - para SAMSUNG A3,A5,A6,A7,J2,J5,J7,S5,S6,S7,S9,A10,A20,A30,A50,A70 /// FREEFIR

1:00

Impossible sigma 🤣 - para SAMSUNG A3,A5,A6,A7,J2,J5,J7,S5,S6,S7,S9,A10,A20,A30,A50,A70 /// FREEFIR

RIHAN ARMY YT

Рет қаралды 6 МЛН

Как открыть дверь в Jaecoo J8? Удобно?🤔😊

0:27

Как открыть дверь в Jaecoo J8? Удобно?🤔😊

Суворкин Сергей

Рет қаралды 1,2 МЛН

Apple Event - May 7

38:22

Apple Event - May 7

Apple

Рет қаралды 6 МЛН