Lightning Talk: Lessons from Using Pytorch 2.0 Compile in IBM's Watsonx.AI Inference - Antoni Martin

204 views

PyTorch

7 months ago

Lightning Talk: Lessons from Using Pytorch 2.0 Compile in IBM's Watsonx.AI Inference - Antoni Viros i Martin, IBM Research
In this talk we cover lessons learned about PyTorch 2.0 compile after adopting it as the main inference acceleration solution in IBM’s Watsonx.AI stack, on both NVIDIA GPUs and custom IBM accelerators. Specifically, we present the results of our latency and throughput experiments across a range of LLMs: encoder-only, encoder-decoder, and decoder-only transformer models. We compare performance with other approaches in the field and describe our collaboration with the core PyTorch team to fix some of the bugs we encountered when using features such as dynamic shapes and CUDA graph trees. We also explain how we use the torch.compile() API to compile and run models on IBM’s AIU accelerator, and why we made that choice. Finally, we cover how parallel approaches such as Tensor Parallel, used for bigger models, interact with compile in inference workloads.
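The abstract does not say how the latency and throughput experiments were run. As a rough illustration only, the sketch below shows one common way to time an inference step. The names `benchmark` and `fake_step` are hypothetical and not from the talk; `fake_step` is a stand-in for a real model call, such as one made through a module wrapped with torch.compile().

```python
import statistics
import time
from typing import Callable, Dict, List


def benchmark(step: Callable[[], int], warmup: int = 3, iters: int = 20) -> Dict[str, float]:
    """Time `step`, which runs one inference call and returns the tokens it produced."""
    # Warm-up calls are discarded: with a compiled model, the first calls
    # typically include one-off compilation time.
    for _ in range(warmup):
        step()

    latencies: List[float] = []
    tokens = 0
    for _ in range(iters):
        start = time.perf_counter()
        tokens += step()
        latencies.append(time.perf_counter() - start)

    total = sum(latencies)
    return {
        "mean_latency_s": statistics.mean(latencies),
        "median_latency_s": statistics.median(latencies),
        "tokens_per_s": tokens / total if total > 0 else float("inf"),
    }


def fake_step() -> int:
    # Hypothetical stand-in for a model call; pretends to emit 16 tokens.
    # Note: on a GPU, a real step would need to synchronize the device before
    # returning, otherwise the timer mostly measures kernel launch.
    time.sleep(0.001)
    return 16


if __name__ == "__main__":
    print(benchmark(fake_step, warmup=1, iters=5))
```

Reporting the median alongside the mean helps when a few iterations are slowed by recompilation, which can happen when input shapes change.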
