Accelerating Transformers with Hugging Face Optimum and Infinity

352 views

MLOps World: Machine Learning in Production

11 months ago

Speakers:
Lewis Tunstall, Machine Learning Engineer
Lewis Tunstall is a machine learning engineer at Hugging Face, where he focuses on developing tools for the NLP community and teaching people to use them effectively. He’s built machine learning applications for startups and enterprises in the domains of NLP, topological data analysis, and time series. Lewis has a PhD in theoretical physics and has held research positions in Australia, the US, and Switzerland.
Philipp Schmid, Technical Lead, Hugging Face
Philipp Schmid is a Machine Learning Engineer and Tech Lead at Hugging Face, where he leads the collaboration with the Amazon SageMaker team. He is passionate about democratizing and productionizing cutting-edge NLP models and improving the ease of use for Deep Learning.
Abstract:
Since their introduction in 2017, Transformers have become the de facto standard for tackling a wide range of NLP tasks in both academia and industry. However, in many situations accuracy is not enough: your state-of-the-art model is not very useful if it’s too slow or too large to meet the business requirements of your application.
In this talk, Lewis Tunstall and Philipp Schmid, Machine Learning Engineers at Hugging Face, will give an overview of Hugging Face’s efforts to accelerate the predictions of Transformer models. They'll discuss a new open-source library called Optimum, which enables developers to train and run Transformers on targeted hardware. They'll also introduce Infinity, a containerised solution that delivers millisecond-scale latencies in production environments.
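To judge whether a model is "too slow" for an application, it helps to measure prediction latency directly. The following is a minimal sketch using only the Python standard library. It is not part of Optimum or Infinity: `measure_latency` and `dummy_predict` are hypothetical names introduced here for illustration, and `dummy_predict` merely stands in for a real model call.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency(
    predict: Callable[[], object], warmup: int = 10, runs: int = 100
) -> Dict[str, float]:
    """Time repeated calls to `predict` and summarise latency in milliseconds."""
    if runs < 1:
        raise ValueError("runs must be at least 1")

    # Warm-up calls are not timed, so one-off costs (lazy initialisation,
    # cold caches) do not skew the measurements.
    for _ in range(warmup):
        predict()

    latencies_ms: List[float] = []
    for _ in range(runs):
        start = time.perf_counter()
        predict()
        latencies_ms.append((time.perf_counter() - start) * 1000.0)

    latencies_ms.sort()
    # Simple index-based estimate of the 95th percentile on the sorted list.
    p95_index = min(len(latencies_ms) - 1, int(0.95 * len(latencies_ms)))
    return {
        "mean_ms": statistics.fmean(latencies_ms),
        "p50_ms": statistics.median(latencies_ms),
        "p95_ms": latencies_ms[p95_index],
    }


def dummy_predict() -> int:
    # Hypothetical stand-in for a model's forward pass.
    return sum(i * i for i in range(10_000))


if __name__ == "__main__":
    stats = measure_latency(dummy_predict)
    for name, value in stats.items():
        print(f"{name}: {value:.3f}")
```

Reporting a tail percentile alongside the mean is a deliberate choice here: production latency requirements are often stated in terms of the slowest requests rather than the average one.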

Comments: 1
@Gerald-iz7mv, 2 months ago
Hi, how do I export to ONNX using CUDA?