No video

Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84

  Рет қаралды 6,096

Stanford MLSys Seminars

Stanford MLSys Seminars

Күн бұрын

Пікірлер: 8
@voncolborn9437
@voncolborn9437 8 ай бұрын
Great presentation. It is interesting to see the practical side of running a bunch of LLMs. Ops makes it happen. Coming from the old, really old, school of computing with massive multi-user, time-share systems, it is interesting to see how no matter how much computing changes, aspects of it remain the same. Through-put, latency, caching and scheduling is still central. All that seems to have changed is the problem domain. We do, in deed, live in intereswting times.
@conan_der_barbar
@conan_der_barbar 9 ай бұрын
great talk! still waiting for the open source release 👀
@Gerald-iz7mv
@Gerald-iz7mv 5 ай бұрын
hi, do you have any links to benchmarks you can run to measure latency, throughput for different model and frameworks etc?
@suleimanshehu5839
@suleimanshehu5839 8 ай бұрын
Please create a video on fine tuning MoE LLM using LoRa adapters such as Mixtural 8x7B MoE LLM within your framework
@fastcardlastname3353
@fastcardlastname3353 9 ай бұрын
This shall change the landscape of multiple agents if it's promised.
@mohamedfouad1309
@mohamedfouad1309 9 ай бұрын
Github link😅
@nithinrao7191
@nithinrao7191 10 ай бұрын
Second
@absbi0000
@absbi0000 10 ай бұрын
First
Foundation Models on Consumer Devices - Tianqi Chen | Stanford MLSys #85
47:35
Stanford MLSys Seminars
Рет қаралды 3,6 М.
Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86
56:32
هذه الحلوى قد تقتلني 😱🍬
00:22
Cool Tool SHORTS Arabic
Рет қаралды 91 МЛН
Чёрная ДЫРА 🕳️ | WICSUR #shorts
00:49
Бискас
Рет қаралды 7 МЛН
Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83
55:59
Stanford MLSys Seminars
Рет қаралды 8 М.
Workshop on Useful and Reliable AI Agents
3:32:19
Princeton Language & Intelligence
Рет қаралды 2 М.
Training and deploying open-source large language models
39:53
Niels Rogge
Рет қаралды 16 М.
The Next 100x - Gavin Uberti | Stanford MLSys #92
59:21
Stanford MLSys Seminars
Рет қаралды 5 М.
Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89
57:05
Stanford MLSys Seminars
Рет қаралды 4,6 М.
Enabling Cost-Efficient LLM Serving with Ray Serve
30:28
Anyscale
Рет қаралды 5 М.