Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84

No video

Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84

Рет қаралды 6,096

Stanford MLSys Seminars

Stanford MLSys Seminars

Күн бұрын

Пікірлер: 8

@voncolborn9437

@voncolborn9437 8 ай бұрын

Great presentation. It is interesting to see the practical side of running a bunch of LLMs. Ops makes it happen. Coming from the old, really old, school of computing with massive multi-user, time-share systems, it is interesting to see how no matter how much computing changes, aspects of it remain the same. Through-put, latency, caching and scheduling is still central. All that seems to have changed is the problem domain. We do, in deed, live in intereswting times.

@conan_der_barbar

@conan_der_barbar 9 ай бұрын

great talk! still waiting for the open source release 👀

@Gerald-iz7mv 5 ай бұрын

hi, do you have any links to benchmarks you can run to measure latency, throughput for different model and frameworks etc?

@suleimanshehu5839

@suleimanshehu5839 8 ай бұрын

Please create a video on fine tuning MoE LLM using LoRa adapters such as Mixtural 8x7B MoE LLM within your framework

@fastcardlastname3353

@fastcardlastname3353 9 ай бұрын

This shall change the landscape of multiple agents if it's promised.

@mohamedfouad1309

@mohamedfouad1309 9 ай бұрын

Github link😅

@nithinrao7191 10 ай бұрын

Second

@absbi0000 10 ай бұрын

First

Foundation Models on Consumer Devices - Tianqi Chen | Stanford MLSys #85

47:35

Foundation Models on Consumer Devices - Tianqi Chen | Stanford MLSys #85

Stanford MLSys Seminars

Рет қаралды 3,6 М.

Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86

56:32

Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86

Stanford MLSys Seminars

Рет қаралды 3,9 М.

This Kind Couple Gave Me a New Home! 🏡💖 #heartwarming #storytime #creative

00:35

This Kind Couple Gave Me a New Home! 🏡💖 #heartwarming #storytime #creative

Friendeez

Рет қаралды 21 МЛН

هذه الحلوى قد تقتلني 😱🍬

00:22

هذه الحلوى قد تقتلني 😱🍬

Cool Tool SHORTS Arabic

Рет қаралды 91 МЛН

Чёрная ДЫРА 🕳️ | WICSUR #shorts

00:49

Чёрная ДЫРА 🕳️ | WICSUR #shorts

Бискас

Рет қаралды 7 МЛН

📱🪢 Mom's Wild Lesson: Phone Tied to Thread! See What Happens Next! 😱 #reaction #cats #funny #prank

00:18

📱🪢 Mom's Wild Lesson: Phone Tied to Thread! See What Happens Next! 😱 #reaction #cats #funny #prank

PuffPaw

Рет қаралды 3,5 МЛН

This AI Agent JUST CRUSHED Cursor and Devin!!! 💥 AI Coding with Deployments 💥

12:35

This AI Agent JUST CRUSHED Cursor and Devin!!! 💥 AI Coding with Deployments 💥

1littlecoder

Рет қаралды 234

LoRAX: Serve 1000s of Fine-Tuned LLMs on a Single GPU - Travis Addair, Predibase, Inc.

31:43

LoRAX: Serve 1000s of Fine-Tuned LLMs on a Single GPU - Travis Addair, Predibase, Inc.

The Linux Foundation

Рет қаралды 248

Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83

55:59

Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83

Stanford MLSys Seminars

Рет қаралды 8 М.

Workshop on Useful and Reliable AI Agents

3:32:19

Workshop on Useful and Reliable AI Agents

Princeton Language & Intelligence

Рет қаралды 2 М.

Training and deploying open-source large language models

39:53

Training and deploying open-source large language models

Niels Rogge

Рет қаралды 16 М.

The Next 100x - Gavin Uberti | Stanford MLSys #92

59:21

The Next 100x - Gavin Uberti | Stanford MLSys #92

Stanford MLSys Seminars

Рет қаралды 5 М.

Democratizing Foundation Models via k-bit Quantization - Tim Dettmers | Stanford MLSys #82

58:25

Democratizing Foundation Models via k-bit Quantization - Tim Dettmers | Stanford MLSys #82

Stanford MLSys Seminars

Рет қаралды 3,7 М.

Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89

57:05

Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89

Stanford MLSys Seminars

Рет қаралды 4,6 М.

LoRA Land: How We Trained 25 Fine-Tuned Mistral-7b Models that Outperform GPT-4

57:06

LoRA Land: How We Trained 25 Fine-Tuned Mistral-7b Models that Outperform GPT-4

Predibase

Рет қаралды 6 М.

Enabling Cost-Efficient LLM Serving with Ray Serve

30:28

Enabling Cost-Efficient LLM Serving with Ray Serve

Anyscale

Рет қаралды 5 М.

This Kind Couple Gave Me a New Home! 🏡💖 #heartwarming #storytime #creative

00:35

This Kind Couple Gave Me a New Home! 🏡💖 #heartwarming #storytime #creative

Friendeez

Рет қаралды 21 МЛН