Accelerating Transformers with Optimum Neuron, AWS Trainium and AWS Inferentia2

Transformer training shootout, part 2: AWS Trainium vs. NVIDIA V100

Accelerate Transformer training with AWS Trainium

📦 + 🥎 или игра для тех, у кого нет игр #partygames #games #игры #веселыеигры #funnygames #challenge

Самое доброе видео, которое я хотел сделать, а Костя всё испортил 😠

Шерзаттың отбасына қандай қысым жасалып жатыр? / Марқұмның әкесі және әпкесімен сұхбат

ROSÉ & Bruno Mars - APT. (Official Music Video)

Accelerating Transformers with Optimum Neuron, AWS Trainium and AWS Inferentia2

Рет қаралды 1,924

Julien Simon

Julien Simon

Күн бұрын

In this video, I show you how to accelerate Transformer training and inference with the Hugging Face Optimum Neuron library, a hardware acceleration library dedicated to AWS Trainium and AWS Inferentia 2, two custom AI chips designed by AWS.
First, changing a single line of code, I show you how to train a Vision Transformer model on the food101 datasets (75K training images). On a trn1.32xlarge instance, the model trains in under a minute per epoch.
Then, I show you how to export a DistilBERT model from the hub to Inferentia2. Running a benchmark on a inf2.xlarge instance, we get over 2000 predictions per second and P99 1-millisecond latency!
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️
Amazon EC2 Trn1: aws.amazon.com...
Amazon EC2 Inf2: aws.amazon.com...
Hugging Face Neuron AMI: aws.amazon.com...
Optimum Neuron documentation: huggingface.co...
Optimum Neuron Github: github.com/hug...
Code: gitlab.com/jul...

Пікірлер: 4

@caiyu538 Жыл бұрын

Great lectures. Great teacher.

@smartin2minutes

@smartin2minutes Жыл бұрын

Hi @Julien Simon, thanks a lot for the video. I have checked your code. Looks like something I can use. One question, that I could not find anywhere, is there a way to use 'pipelines` with a Optimum Neuron model? I have a token classification task and pipeline just makes things easier to maintain. Will be very helpful if you have any examples. Thanks.

@FalahgsGate Жыл бұрын

thanks for best video ❤

@juliensimonfr Жыл бұрын

Most welcome

Transformer training shootout, part 2: AWS Trainium vs. NVIDIA V100

10:10

Transformer training shootout, part 2: AWS Trainium vs. NVIDIA V100

Julien Simon

Рет қаралды 3,1 М.

Accelerate Transformer training with AWS Trainium

19:30

Accelerate Transformer training with AWS Trainium

Julien Simon

Рет қаралды 1,6 М.

📦 + 🥎 или игра для тех, у кого нет игр #partygames #games #игры #веселыеигры #funnygames #challenge

00:51

📦 + 🥎 или игра для тех, у кого нет игр #partygames #games #игры #веселыеигры #funnygames #challenge

Двое играют | Наташа и Вова

Рет қаралды 2,3 МЛН

Самое доброе видео, которое я хотел сделать, а Костя всё испортил 😠

00:49

Самое доброе видео, которое я хотел сделать, а Костя всё испортил 😠

Miracle

Рет қаралды 3,2 МЛН

Шерзаттың отбасына қандай қысым жасалып жатыр? / Марқұмның әкесі және әпкесімен сұхбат

47:12

Шерзаттың отбасына қандай қысым жасалып жатыр? / Марқұмның әкесі және әпкесімен сұхбат

Jas Аlash

Рет қаралды 381 М.

ROSÉ & Bruno Mars - APT. (Official Music Video)

02:54

ROSÉ & Bruno Mars - APT. (Official Music Video)

ROSÉ

Рет қаралды 86 МЛН

Accelerating Transformers with Hugging Face Optimum and Infinity

1:28:19

Accelerating Transformers with Hugging Face Optimum and Infinity

MLOps World: Machine Learning in Production

Рет қаралды 421

Build high-performance foundation models with AWS Trainium & Inferentia | AWS AI Infrastructure Day

21:12

Build high-performance foundation models with AWS Trainium & Inferentia | AWS AI Infrastructure Day

AWS Events

Рет қаралды 197

30 Programming Truths I know at 30 that I Wish I Knew at 20

17:41

30 Programming Truths I know at 30 that I Wish I Knew at 20

Liam Walsh

Рет қаралды 2,1 М.

Accelerate Transformers on Amazon SageMaker with AWS Trainium and AWS Inferentia

19:59

Accelerate Transformers on Amazon SageMaker with AWS Trainium and AWS Inferentia

Julien Simon

Рет қаралды 1,3 М.

AWS re:Invent 2022 - [NEW LAUNCH!] Introducing AWS Inferentia2-based EC2 Inf2 instances (CMP334)

40:34

AWS re:Invent 2022 - [NEW LAUNCH!] Introducing AWS Inferentia2-based EC2 Inf2 instances (CMP334)

AWS Events

Рет қаралды 2,8 М.

Accelerate Transformer inference with AWS Inferentia

20:25

Accelerate Transformer inference with AWS Inferentia

Julien Simon

Рет қаралды 2,4 М.

Accelerate PyTorch Transformers with Intel Sapphire Rapids, part 1

20:25

Accelerate PyTorch Transformers with Intel Sapphire Rapids, part 1

Julien Simon

Рет қаралды 1 М.

AWS On Air ft. Silicon Innovation: Trainium and Inferentia | AWS Events

27:00

AWS On Air ft. Silicon Innovation: Trainium and Inferentia | AWS Events

AWS Events

Рет қаралды 1,7 М.

Accelerate Transformer inference on CPU with Optimum and ONNX

16:32

Accelerate Transformer inference on CPU with Optimum and ONNX

Julien Simon

Рет қаралды 4,7 М.

NLP Demystified 15: Transformers From Scratch + Pre-training and Transfer Learning With BERT/GPT

1:52:27

NLP Demystified 15: Transformers From Scratch + Pre-training and Transfer Learning With BERT/GPT

Future Mojo

Рет қаралды 71 М.

📦 + 🥎 или игра для тех, у кого нет игр #partygames #games #игры #веселыеигры #funnygames #challenge

00:51

📦 + 🥎 или игра для тех, у кого нет игр #partygames #games #игры #веселыеигры #funnygames #challenge

Двое играют | Наташа и Вова

Рет қаралды 2,3 МЛН