Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

  Рет қаралды 813

Weights & Biases

Weights & Biases

Күн бұрын

Пікірлер: 2
@I-0-0-I
@I-0-0-I 26 күн бұрын
I really enjoyed this episode. Like you said, so much practical info. One crazy thing: Joseph mentioned that he would like non-industry folks to learn about the arena. I just tried to post comments on Reddit with links to Chatbot and WebDev arena, and they get immediately [removed by Reddit]. If you have any contacts there, it might be worth reaching out. Looking around Reddit, it appears that this is happening to everyone. I am super curious as to why Reddit is banning links to the arena sites.
@haraldwolte3745
@haraldwolte3745 16 күн бұрын
How can people run their own internal benchmarks, as recommended in this conversation? Seems like it would require a lot of complicated custom systems
How to Build Effective AI Agents (without the hype)
24:27
Dave Ebbelaar
Рет қаралды 100 М.
Building AI agents using Weights & Biases
13:11
Weights & Biases
Рет қаралды 960
«Жат бауыр» телехикаясы І 30 - бөлім | Соңғы бөлім
52:59
Qazaqstan TV / Қазақстан Ұлттық Арнасы
Рет қаралды 340 М.
Air Sigma Girl #sigma
0:32
Jin and Hattie
Рет қаралды 45 МЛН
Thank you mommy 😊💝 #shorts
0:24
5-Minute Crafts HOUSE
Рет қаралды 33 МЛН
LLMOps in action: Streamlining the path from prototype to production
40:49
This AI Robot Is Doing the Impossible - Unitree x ElizaWakesUp
9:30
AI Revolution
Рет қаралды 163 М.
Unlocking the potential of MLOps and LLMOps
1:06:33
Weights & Biases
Рет қаралды 527
Fine tuning Azure OpenAI Service Models with Weights & Biases
21:52
Weights & Biases
Рет қаралды 296
Pixel 7 и 7 Pro с Face ID - лучше iPhone 14 Pro!
21:12
Rozetked
Рет қаралды 457 М.
Такого Корпуса для ПК нет ни у кого в России
1:00
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 847 М.
КОРОЧЕ ГОВОРЯ, НЕДЕЛЯ БЕЗ ТЕЛЕФОНА
3:54
Самые простые строительные леса
0:54
Канал ИДЕЙ
Рет қаралды 1 МЛН