No video

Efficient and Cross-Platform AI Inference Apps Using Rust and Wasm - Michael Yuan, WasmEdge

  Рет қаралды 814

The Linux Foundation

The Linux Foundation

8 ай бұрын

Efficient and Cross-Platform AI Inference Apps Using Rust and Wasm - Michael Yuan, WasmEdge
Today’s AI inference apps are primarily written in Python or C and then wrapped in a container or VM for cloud deployment. Those apps are heavyweight (esp with Python), not portable across CPU/GPU platforms, difficult for devs (esp with C), and very slow with Python-based data processing. Wasm has emerged as a strong alternative runtime for AI inference workloads. Developers write inference functions in Rust / JS / Python, and then run them in Wasm sandboxes. Wasm functions are tiny, fast, safe, and very easy to develop. They run without modification on almost any device/OS, and can automatically take advantage of the device’s CPU or GPU or other hardware accelerators. They are securely isolated for cloud-native deployment and can be managed by container tools. In this talk, we will start with the architecture of Wasm-based AI services. Then we will deep dive into how to create Pytorch and TF inference functions, as well as newer LLM frameworks such as GGML, in Rust and running these in Wasm. We will demonstrate complete examples using Google Mediapipe models and llama2 LLM models.

Пікірлер
Inside GPT - Large Language Models Demystified - Alan Smith - NDC Oslo 2024
1:00:22
Just Give me my Money!
00:18
GL Show Russian
Рет қаралды 585 М.
Can This Bubble Save My Life? 😱
00:55
Topper Guild
Рет қаралды 87 МЛН
Кадр сыртындағы қызықтар | Келінжан
00:16
ISSEI & yellow girl 💛
00:33
ISSEI / いっせい
Рет қаралды 25 МЛН
Why Isn't Functional Programming the Norm? - Richard Feldman
46:09
You need to learn AI in 2024! (And here is your roadmap)
45:21
David Bombal
Рет қаралды 691 М.
A Very Simple Transformer Encoder for Time Series Forecasting in PyTorch
15:34
Let's Learn Transformers Together
Рет қаралды 6 М.
Linus Torvalds: Speaks on Hype and the Future of AI
9:02
SavvyNik
Рет қаралды 178 М.
Crossing The Barrier Between Kotlin and Rust (and back)! | Tarik Eshaq
37:41
The Only Unbreakable Law
53:25
Molly Rocket
Рет қаралды 325 М.
What are AI Agents?
12:29
IBM Technology
Рет қаралды 225 М.
Rust Data Modelling Without Classes
11:25
No Boilerplate
Рет қаралды 170 М.
Creator of git, Linus Torvalds Presents the Fundamentals of git
1:10:15
Developers Alliance
Рет қаралды 83 М.
Cracking Enigma in 2021 - Computerphile
21:20
Computerphile
Рет қаралды 2,5 МЛН
Just Give me my Money!
00:18
GL Show Russian
Рет қаралды 585 М.