Deep Dive: Quantizing Large Language Models, part 1

  Рет қаралды 17,505

Julien Simon

Julien Simon

Күн бұрын

Пікірлер: 20
@matthewrice7590
@matthewrice7590 7 ай бұрын
Thanks for this, Julien
@juliensimonfr
@juliensimonfr 7 ай бұрын
you're welcome :)
@joaogalego7229
@joaogalego7229 7 ай бұрын
Watching this at 1.25x speed. High-quality content as usual. Keep it up, Julien 💪
@itayatelis2898
@itayatelis2898 5 ай бұрын
Love your content! thank you!
@juliensimonfr
@juliensimonfr 5 ай бұрын
Glad you enjoy it!
@road2nohand
@road2nohand 7 ай бұрын
Glorious Content :D
@juliensimonfr
@juliensimonfr 7 ай бұрын
Glad you like it!
@Joe-nh9fy
@Joe-nh9fy 6 ай бұрын
Great explanation! I have one question... Is it common practice to regularize the LLM cost function like with L2 to reduce the weight "outliers" while training?
@juliensimonfr
@juliensimonfr 6 ай бұрын
I don't think there is a strong consensus. It looks like regularization during fine-tuning can help with generalization. There are new ideas too, like noisy embeddings wandb.ai/byyoung3/ml-news/reports/A-New-Method-For-LLM-Regularization--Vmlldzo1ODIyMzIw
@bibiworm
@bibiworm 4 ай бұрын
I have been wanting to understand quantization for a very long time. Thank you! Would you mind sharing the slides please? Thank you.
@juliensimonfr
@juliensimonfr 2 ай бұрын
Hi, you can find the slides on Slideshare at fr.slideshare.net/slideshow/julien-simon-deep-dive-quantizing-llms/270921785
@jacehua7334
@jacehua7334 7 ай бұрын
🔥 🔥 🔥
@juliensimonfr
@juliensimonfr 7 ай бұрын
:)
@AI-Projects24
@AI-Projects24 2 ай бұрын
Is there any chance to get the slides? Its very well organized and presented. Thank you so much for your work✨🔥🔥
@juliensimonfr
@juliensimonfr 2 ай бұрын
Hi, you can find the slides on Slideshare at Slides: fr.slideshare.net/slideshow/julien-simon-deep-dive-quantizing-llms/270921785
@monishostwal8255
@monishostwal8255 6 ай бұрын
what is meant by calibration dataset? is it eqivalent to evaluation set?
@juliensimonfr
@juliensimonfr 6 ай бұрын
Pretty much, yes. It's used to figure out the "best" hyperparameter values.
@monishostwal8255
@monishostwal8255 6 ай бұрын
okay got it thanks
@caiyu538
@caiyu538 7 ай бұрын
👍
@juliensimonfr
@juliensimonfr 7 ай бұрын
😃
Deep Dive: Quantizing Large Language Models, part 2
27:13
Julien Simon
Рет қаралды 1,4 М.
Yann Dubois: Scalable Evaluation of Large Language Models
1:37:47
Mayur Naik
Рет қаралды 3,8 М.
Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen
00:21
TheSoul Music Family
Рет қаралды 32 МЛН
Kluster Duo #настольныеигры #boardgames #игры #games #настолки #настольные_игры
00:47
哈哈大家为了进去也是想尽办法!#火影忍者 #佐助 #家庭
00:33
火影忍者一家
Рет қаралды 130 МЛН
Synyptas 4 | Арамызда бір сатқын бар ! | 4 Bolim
17:24
LangChain Explained in 13 Minutes | QuickStart Tutorial for Beginners
12:44
Deep dive - Better Attention layers for Transformer models
40:54
Julien Simon
Рет қаралды 10 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 1,2 МЛН
The Most Important Algorithm in Machine Learning
40:08
Artem Kirsanov
Рет қаралды 480 М.
Were RNNs All We Needed? (Paper Explained)
27:48
Yannic Kilcher
Рет қаралды 45 М.
Comparing Quantizations of the Same Model - Ollama Course
10:29
Matt Williams
Рет қаралды 6 М.
Deep Dive: Optimizing LLM inference
36:12
Julien Simon
Рет қаралды 23 М.
Generative Model That Won 2024 Nobel Prize
33:04
Artem Kirsanov
Рет қаралды 126 М.
[1hr Talk] Intro to Large Language Models
59:48
Andrej Karpathy
Рет қаралды 2,2 МЛН
Cool Parenting Gadget Against Mosquitos! 🦟👶 #gen
00:21
TheSoul Music Family
Рет қаралды 32 МЛН