Compilation is an excellent technique to accelerate the training and inference of deep learning models, especially if it can be completely automated!
In this video, we discuss deep learning compilation, from the early days of TensorFlow to PyTorch 2. Along the way, you'll learn about key technologies such as XLA, PyTorch/XLA, OpenXLA, TorchScript, HLO, TorchDynamo, TorchInductor, and more. You'll see where they fit and how they help accelerate models on a wide range of devices, including custom chips like Google TPU and AWS Inferentia 2. Of course, we'll also share some simple examples, including how to easily accelerate Hugging Face models with PyTorch 2 and torch.compile().
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at /julsimon or on Substack at julsimon.substack.com. ⭐️⭐️⭐️
02:10 TensorFlow 1.x and graph mode
05:52 TensorFlow XLA
09:35 PyTorch TorchScript
14:25 PyTorch/XLA and lazy tensors
17:28 PyTorch/XLA example with Google TPU
21:40 A quick look at HLO
24:05 OpenXLA
25:50 PyTorch/XLA example with AWS Inferentia 2
29:10 PyTorch 2: torch.compile()
34:37 Hugging Face models with PyTorch 2
36:10 BERT on CPU with TorchInductor and IPEX backends