Accelerate Transformer training with AWS Trainium

1,670 views

Julien Simon


1 day ago

In this video, I show you how to accelerate Transformer training with AWS Trainium, a new custom chip designed by AWS.
First, I walk you through the setup of an Amazon EC2 trn1.32xlarge instance, equipped with 16 Trainium chips. Then, I run a natural language processing job: I adapt existing Transformer training code for Trainium to accelerate the training of a BERT model that classifies the Yelp review dataset. Finally, I run the job on 1, 8, and 32 Neuron cores.
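As a rough illustration of the last step, here is a minimal sketch of how launch commands for 1, 8, and 32 Neuron cores could be built. It assumes the job is started with `torchrun` and its `--nproc_per_node` flag, one worker process per core; the script name `train.py` and the helper `launch_command` are hypothetical, and the commands used in the video may differ. The figure of 2 cores per chip follows from the 16 chips and 32 cores mentioned above.

```python
import shlex

CHIPS_PER_INSTANCE = 16  # trn1.32xlarge, as stated in the description
CORES_PER_CHIP = 2       # 32 Neuron cores spread over 16 chips
MAX_CORES = CHIPS_PER_INSTANCE * CORES_PER_CHIP


def launch_command(num_cores, script="train.py"):
    """Build a launch command for a given number of Neuron cores.

    `script` is a hypothetical file name, not the one used in the video.
    """
    if not 1 <= num_cores <= MAX_CORES:
        raise ValueError(f"num_cores must be between 1 and {MAX_CORES}")
    if num_cores == 1:
        # A single core needs no distributed launcher.
        return ["python3", script]
    # Assumption: one worker process per Neuron core, started by torchrun.
    return ["torchrun", f"--nproc_per_node={num_cores}", script]


for n in (1, 8, 32):
    print(shlex.join(launch_command(n)))
```

Keeping the core count as the only parameter makes it easy to rerun the same script at each scale and compare training times.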
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️
⭐️⭐️⭐️ Want to buy me a coffee? I can always use more :) www.buymeacoff... ⭐️⭐️⭐️
AWS Trainium: aws.amazon.com...
AWS Neuron SDK documentation: awsdocs-neuron...
AWS Neuron SDK samples: github.com/aws...
Hugging Face tutorial: huggingface.co...
Setup steps and code: gitlab.com/jul...
Interested in hardware acceleration? Check out my other videos:
Habana Gaudi: • Accelerate Transformer...
Graphcore: • Accelerate Transformer...
Trainium on SageMaker: • Accelerate Transformer...

Comments: 1
@junkmail75034, 2 years ago
Hey Julien, thanks for this helpful video. At 16:23, you saved the checkpoint as a .pt file. Following your example, this is how I load the checkpoint back:

    model = AutoModelForSequenceClassification.from_pretrained("bert-base-cased", num_labels=5)
    ckpt = torch.load("checkpoints/checkpoint.pt")
    model_weights = ckpt['state_dict']
    model.load_state_dict(model_weights)
    model.to('xla')

Hope this is right.
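The save and load pattern in this comment can be checked on CPU with a small round trip. The sketch below is not the code from the video: it uses a tiny `nn.Linear` layer as a hypothetical stand-in for the BERT classifier, assumes the checkpoint is a dictionary with a `state_dict` key as in the comment, and leaves out the final move to the `xla` device so that it runs without Trainium hardware. Passing `map_location="cpu"` to `torch.load` keeps the loaded tensors on the CPU regardless of where they were saved.

```python
import os
import tempfile

import torch
from torch import nn


def build_model() -> nn.Module:
    # Hypothetical stand-in for the BERT classifier: 8 features, 5 labels.
    return nn.Linear(8, 5)


def save_checkpoint(model: nn.Module, path: str) -> None:
    # Store CPU copies of the weights under a 'state_dict' key,
    # the layout the comment reads back.
    cpu_state = {k: v.detach().cpu() for k, v in model.state_dict().items()}
    torch.save({"state_dict": cpu_state}, path)


def load_checkpoint(model: nn.Module, path: str) -> nn.Module:
    # map_location="cpu" loads the tensors onto the CPU first.
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["state_dict"])
    return model


def roundtrip() -> bool:
    """Save a model, load it into a fresh one, and compare the weights."""
    trained = build_model()
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "checkpoint.pt")
        save_checkpoint(trained, path)
        restored = load_checkpoint(build_model(), path)
    pairs = zip(trained.state_dict().values(), restored.state_dict().values())
    return all(torch.equal(a, b) for a, b in pairs)


print("weights restored:", roundtrip())
```

On a Trainium instance, the restored model would then be moved to the XLA device as the comment does, which requires `torch_xla` to be installed.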