vLLM and Neural Magic Office Hours - June 5, 2024

  Рет қаралды 449

Neural Magic

4 ай бұрын

As one of the top contributors to the vLLM project, Neural Magic teams up with the vLLM team from UC Berkeley every 2 weeks to host open office hours. Check out our session from our June 5, 2024 session, where we answered some great questions from participants.
We kicked off our June 5th session with a quick recap on vLLM and how Neural Magic can support enterprises today to successfully integrate vLLM as a part of their AI strategy. You'll hear answers to audience questions about post-training quantization, maximizing GPU usage for 70B LLMs, differences between vLLM and Hugging Face TGI, cache management, tensor parallelism, and more. You can see the session slides here: docs.google.com/presentation/d/1B50uCXzAarawDDizElNzi2o55fkgJZSm/edit#slide=id.p1
Do you have questions about vLLM that you'd like addressed directly by the experts? Join our next vLLM office hours and post your questions here: neuralmagic.com/community-office-hours/

Пікірлер
REAL 3D brush can draw grass Life Hack #shorts #lifehacks
00:42
MrMaximus
Рет қаралды 10 МЛН
pumpkins #shorts
00:39
Mr DegrEE
Рет қаралды 111 МЛН
Help Me Celebrate! 😍🙏
00:35
Alan Chikin Chow
Рет қаралды 85 МЛН
Давайте поцарапаем iPhone 16 Pro Max!
0:57
Wylsacom
Рет қаралды 4,2 МЛН
Кому новенький айфон
0:19
Новостной Гусь
Рет қаралды 3,9 МЛН
Keyboard Cleaning Hack
0:36
IAM
Рет қаралды 8 МЛН
Кто-то еще помнит про эту консоль?
0:51
ПРОСТО ЛЕШКА
Рет қаралды 2,7 МЛН