Deep Dive: Quantizing Large Language Models, part 1

Deep Dive: Quantizing Large Language Models, part 2

Yann Dubois: Scalable Evaluation of Large Language Models

Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen

Kluster Duo #настольныеигры #boardgames #игры #games #настолки #настольные_игры

哈哈大家为了进去也是想尽办法！#火影忍者 #佐助 #家庭

Synyptas 4 | Арамызда бір сатқын бар ! | 4 Bolim

Deep Dive: Quantizing Large Language Models, part 1

Рет қаралды 17,505

Julien Simon

Julien Simon

Күн бұрын

Пікірлер: 20

@matthewrice7590

@matthewrice7590 7 ай бұрын

Thanks for this, Julien

@juliensimonfr 7 ай бұрын

you're welcome :)

@joaogalego7229

@joaogalego7229 7 ай бұрын

Watching this at 1.25x speed. High-quality content as usual. Keep it up, Julien 💪

@itayatelis2898

@itayatelis2898 5 ай бұрын

Love your content! thank you!

@juliensimonfr 5 ай бұрын

Glad you enjoy it!

@road2nohand 7 ай бұрын

Glorious Content :D

@juliensimonfr 7 ай бұрын

Glad you like it!

@Joe-nh9fy 6 ай бұрын

Great explanation! I have one question... Is it common practice to regularize the LLM cost function like with L2 to reduce the weight "outliers" while training?

@juliensimonfr 6 ай бұрын

I don't think there is a strong consensus. It looks like regularization during fine-tuning can help with generalization. There are new ideas too, like noisy embeddings wandb.ai/byyoung3/ml-news/reports/A-New-Method-For-LLM-Regularization--Vmlldzo1ODIyMzIw

@bibiworm 4 ай бұрын

I have been wanting to understand quantization for a very long time. Thank you! Would you mind sharing the slides please? Thank you.

@juliensimonfr 2 ай бұрын

Hi, you can find the slides on Slideshare at fr.slideshare.net/slideshow/julien-simon-deep-dive-quantizing-llms/270921785

@jacehua7334 7 ай бұрын

🔥 🔥 🔥

@juliensimonfr 7 ай бұрын

:)

@AI-Projects24 2 ай бұрын

Is there any chance to get the slides? Its very well organized and presented. Thank you so much for your work✨🔥🔥

@juliensimonfr 2 ай бұрын

Hi, you can find the slides on Slideshare at Slides: fr.slideshare.net/slideshow/julien-simon-deep-dive-quantizing-llms/270921785

@monishostwal8255

@monishostwal8255 6 ай бұрын

what is meant by calibration dataset? is it eqivalent to evaluation set?

@juliensimonfr 6 ай бұрын

Pretty much, yes. It's used to figure out the "best" hyperparameter values.

@monishostwal8255

@monishostwal8255 6 ай бұрын

okay got it thanks

@caiyu538 7 ай бұрын

👍

@juliensimonfr 7 ай бұрын

😃

Deep Dive: Quantizing Large Language Models, part 2

27:13

Deep Dive: Quantizing Large Language Models, part 2

Julien Simon

Рет қаралды 1,4 М.

Yann Dubois: Scalable Evaluation of Large Language Models

1:37:47

Yann Dubois: Scalable Evaluation of Large Language Models

Mayur Naik

Рет қаралды 3,8 М.

Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen

00:21

Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen

TheSoul Music Family

Рет қаралды 32 МЛН

Kluster Duo #настольныеигры #boardgames #игры #games #настолки #настольные_игры

00:47

Kluster Duo #настольныеигры #boardgames #игры #games #настолки #настольные_игры

Двое играют | Наташа и Вова

Рет қаралды 12 МЛН

哈哈大家为了进去也是想尽办法！#火影忍者 #佐助 #家庭

00:33

哈哈大家为了进去也是想尽办法！#火影忍者 #佐助 #家庭

火影忍者一家

Рет қаралды 130 МЛН

Synyptas 4 | Арамызда бір сатқын бар ! | 4 Bolim

17:24

Synyptas 4 | Арамызда бір сатқын бар ! | 4 Bolim

kak budto

Рет қаралды 1,4 МЛН

LangChain Explained in 13 Minutes | QuickStart Tutorial for Beginners

12:44

LangChain Explained in 13 Minutes | QuickStart Tutorial for Beginners

Rabbitmetrics

Рет қаралды 787 М.

Deep dive - Better Attention layers for Transformer models

40:54

Deep dive - Better Attention layers for Transformer models

Julien Simon

Рет қаралды 10 М.

The moment we stopped understanding AI [AlexNet]

17:38

The moment we stopped understanding AI [AlexNet]

Welch Labs

Рет қаралды 1,2 МЛН

LLM inference optimization: Architecture, KV cache and Flash attention

44:06

LLM inference optimization: Architecture, KV cache and Flash attention

YanAITalk

Рет қаралды 1 М.

The Most Important Algorithm in Machine Learning

40:08

The Most Important Algorithm in Machine Learning

Artem Kirsanov

Рет қаралды 480 М.

Were RNNs All We Needed? (Paper Explained)

27:48

Were RNNs All We Needed? (Paper Explained)

Yannic Kilcher

Рет қаралды 45 М.

Comparing Quantizations of the Same Model - Ollama Course

10:29

Comparing Quantizations of the Same Model - Ollama Course

Matt Williams

Рет қаралды 6 М.

Deep Dive: Optimizing LLM inference

36:12

Deep Dive: Optimizing LLM inference

Julien Simon

Рет қаралды 23 М.

Generative Model That Won 2024 Nobel Prize

33:04

Generative Model That Won 2024 Nobel Prize

Artem Kirsanov

Рет қаралды 126 М.

[1hr Talk] Intro to Large Language Models

59:48

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

Рет қаралды 2,2 МЛН

Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen

00:21

Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen

TheSoul Music Family

Рет қаралды 32 МЛН