Wasm Is Becoming the Runtime for LLMs - Michael Yuan, Second State

  Рет қаралды 2,911

CNCF [Cloud Native Computing Foundation]

CNCF [Cloud Native Computing Foundation]

8 ай бұрын

Wasm Is Becoming the Runtime for LLMs - Michael Yuan, Second State
Today’s LLM apps, including inference apps and agents, are mostly written in Python. But this is about to change. Python is too slow, too bloated, and too complicated to install and manage. That’s why popular LLM frameworks, such as llama2.c, whisper.cpp, llama.rs, all thrive to have zero Python dependency. All those post-Python LLM applications and frameworks are written in compiled languages (C/C++/Rust) and can be compiled into Wasm. With WASI NN, you can now create complex LLM apps in Rust and run them in Wasm sandboxes. Rust and Wasm could be high-performance and developer-friendly alternatives to Python today. The combination to develop and run LLM apps is more efficient, safe, high performance with small footrprint. In this talk, Michael will demonstrate how to run llama2 series of models in Wasm, how to develop LLM agents in Rust and run them in Wasm. In-production use cases, like LLM-based code review and book-based learning assistants, will be discussed and demoed.

Пікірлер: 4
@Dorisoft
@Dorisoft 5 ай бұрын
This is gold
@jonton6981
@jonton6981 8 ай бұрын
Agree. I think data connector and transformation plugins are a big opportunity too.
@autohmae
@autohmae 7 ай бұрын
3:22 Java set out to do it and Javascript was only named Javascript because of a deal between Sun and Netscape, Javascript was just gonna be it's little brother that would be used as the glue-code in the browser to run Java... it was never supposed to be this way. Even in 1995 Javascript was more used than Java, until 1997 when Java passed it. Both surpassed C++ before 2000 and 3 years or so both passed C... C which was the portable language before all of this.
@autohmae
@autohmae 7 ай бұрын
3:07 I checked, it's funny he''s "Python bashing" in this talk because it used Python and Python curl library to install wasm runtime and plugin. The runtime is 60MB and the plugins are less than 1 MB or a few MB.
Lightning Talk: WebAssembly from the Inside Out - Edoardo Vacchi, Tetrate
13:11
CNCF [Cloud Native Computing Foundation]
Рет қаралды 253
State of WebAssembly outside the browser by ABDEL SGHIOUAR
37:51
Я нашел кто меня пранкует!
00:51
Аришнев
Рет қаралды 4,8 МЛН
HOW DID HE WIN? 😱
00:33
Topper Guild
Рет қаралды 45 МЛН
路飞被小孩吓到了#海贼王#路飞
00:41
路飞与唐舞桐
Рет қаралды 72 МЛН
Fast and Efficient Log Processing with Wasm and eBPF - Michael Yuan, Second State
34:18
CNCF [Cloud Native Computing Foundation]
Рет қаралды 391
I spent six months rewriting everything in Rust
15:11
chris biscardi
Рет қаралды 415 М.
Wasm GC: What Exactly Is It (and Why I Should Care) - Ivan Mikushin, VMware
30:44
WASM is Awesome! Explained with Examples | Ft Docker
13:43
ByteMonk
Рет қаралды 4,7 М.
Красиво, но телефон жаль
0:32
Бесполезные Новости
Рет қаралды 987 М.
НЕ ПОКУПАЙ СМАРТФОН, ПОКА НЕ УЗНАЕШЬ ЭТО! Не ошибись с выбором…
15:23
Отдых для геймера? 😮‍💨 Hiper Engine B50
1:00