The SECRET Behind ChatGPT's Training That Nobody Talks About | FSDP Explained

  1,501 views

Developers Hutt


days ago

Comments: 10
@def1antamvs 1 month ago
Beautiful presentation, I was fully engaged the whole time. Thanks for this. You made a very complex process seem so simple!
@DevelopersHutt 1 month ago
I'm glad you liked it!
@deependu__ 7 days ago
Thanks for the video and visualization.
@ekram_007 1 month ago
Awesome... Thanks for the video.
@yehanwasura 1 month ago
This was in my recommendations, so I just randomly clicked it, and my god, I love the video. Although I already have the knowledge, I still enjoyed watching it because it's very nicely presented. Love it ❤.
@carsworld3433 14 days ago
Hey, I'm a student and have some knowledge of deep learning, but for a long time I've been stuck: I don't know what to do after learning the basics. Can you please help?
@DevelopersHutt 7 days ago
Start with building a project, it'll help you solidify your learning.
@johnlim9416 1 month ago
Err Sharded, not Shaded. FSDP == Fully Sharded Data Parallel
@DevelopersHutt 1 month ago
Oops 😔
@Karmicinnovations 1 month ago
Similar to torrents