Lightning Talk: Accelerating Inference on CPU with Torch.Compile - Jiong Gong, Intel

881 views

PyTorch

7 months ago

For the torch.compile CPU backend, we have optimized the static-shape float32 path and achieved good speedups on popular models. Starting with PyTorch 2.0, we have further enhanced this feature by addressing several issues and optimizing the bfloat16 precision path. The dynamic shape path is also supported, so users can get good performance on dynamic-shape models such as GPT-J and Llama. Users can also use the low-precision bfloat16 data type to further improve performance and lower the memory footprint on 4th generation Intel Xeon Scalable Processors (Sapphire Rapids), which provide the Advanced Matrix Extensions (AMX) instruction set extension. In this talk, we will introduce the key optimization technologies used in the CPU inference path of torch.compile, such as GEMM fusions, vectorization of the low-precision bfloat16 path, and constant folding with the freezing path. We will also discuss how we solved the issues that arose when supporting the dynamic shape path. Currently, the dynamic shape and bfloat16 paths work as well as the static shape path. The geometric mean speedup of the bfloat16 path ranges from 1.4x to 2.3x compared to eager mode on Sapphire Rapids.
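As a rough illustration of how a geometric mean speedup like the one quoted above is typically computed, the sketch below takes per-model latencies for eager mode and for the compiled path, forms the per-model ratio, and averages the ratios geometrically. The function name `geomean_speedup`, the model names, and all latency numbers are hypothetical and invented for this example; they are not measurements from the talk.

```python
from statistics import geometric_mean


def geomean_speedup(eager_ms, compiled_ms):
    """Geometric mean of per-model speedups (eager latency / compiled latency)."""
    if eager_ms.keys() != compiled_ms.keys():
        raise ValueError("both runs must cover the same set of models")
    # A speedup above 1.0 means the compiled path is faster than eager mode.
    speedups = [eager_ms[name] / compiled_ms[name] for name in eager_ms]
    return geometric_mean(speedups)


# Hypothetical latencies in milliseconds, for illustration only.
eager_ms = {"model_a": 100.0, "model_b": 80.0, "model_c": 90.0}
compiled_ms = {"model_a": 50.0, "model_b": 40.0, "model_c": 22.5}

result = geomean_speedup(eager_ms, compiled_ms)
print(f"geometric mean speedup: {result:.2f}x")
```

The geometric mean is the usual choice for summarizing ratios across a model suite because a single model with a very large speedup skews it less than it would skew an arithmetic mean.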
