Deep Dive: Compiling deep learning models, from XLA to PyTorch 2

873 views

Julien Simon

days ago

Compilation is an excellent technique to accelerate the training and inference of deep learning models, especially if it can be completely automated!
In this video, we discuss deep learning compilation, from the early days of TensorFlow to PyTorch 2. Along the way, you'll learn about key technologies such as XLA, PyTorch/XLA, OpenXLA, TorchScript, HLO, TorchDynamo, TorchInductor, and more. You'll see where they fit and how they help accelerate models on a wide range of devices, including custom chips like Google TPU and AWS Inferentia 2. Of course, we'll also share some simple examples, including how to easily accelerate Hugging Face models with PyTorch 2 and torch.compile().
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.substack.com. ⭐️⭐️⭐️
02:10 TensorFlow 1.x and graph mode
05:52 TensorFlow XLA
09:35 PyTorch TorchScript
14:25 PyTorch/XLA and lazy tensors
17:28 PyTorch/XLA example with Google TPU
21:40 A quick look at HLO
24:05 OpenXLA
25:50 PyTorch/XLA example with AWS Inferentia 2
29:10 PyTorch 2: torch.compile()
34:37 Hugging Face models with PyTorch 2
36:10 BERT on CPU with TorchInductor and IPEX backends

Comments: 4
@cybermanaudiobooks3231 5 months ago
These chronological overviews are superb. Another great video. Thanks Julien!
@juliensimonfr 5 months ago
Glad you like them!
@imranullah3097 5 months ago
Can we use compile for TPU?
@juliensimonfr 4 months ago
Sure. cloud.google.com/tpu/docs/run-calculation-pytorch