Thermodynamic Gradient Descent

  Рет қаралды 2,905

hu-po

hu-po

Ай бұрын

Like 👍. Comment 💬. Subscribe 🟥.
🏘 Discord: / discord
github.com/hu-po/docs
Thermodynamic Natural Gradient Descent
arxiv.org/pdf/2405.13817

Пікірлер: 13
@JatinKashyap-Innovision
@JatinKashyap-Innovision 24 күн бұрын
Loves your paper streams. Keep 'em coming. I watches them to start my day.
@michaeltraynor5893
@michaeltraynor5893 26 күн бұрын
This guy's energy stresses me out but like, in a way I find comforting? Very strange. Also love the shade thrown at extropic (even though I have a soft spot for Gill as a Canadian). Thanks for introducing me to Normal I didn't know about them.
@tairad65
@tairad65 Ай бұрын
My new fav channel
@wolpumba4099
@wolpumba4099 Ай бұрын
Summary starts at 1:29:40
@wolpumba4099
@wolpumba4099 Ай бұрын
Summary of "Thermodynamic Gradient Descent" * *Challenge:* Second-order optimization methods like Natural Gradient Descent (NGD) offer faster convergence for AI training but are computationally expensive on digital computers. * *Innovation:* Thermodynamic Natural Gradient Descent (TNGD) uses a hybrid system, combining a GPU with a specialized analog computer called a Stochastic Processing Unit (SPU). * *SPU Magic:* The SPU leverages the physics of heat dissipation, implementing an Ornstein-Uhlenbeck process to efficiently approximate NGD at a computational cost similar to first-order methods (SGD, Adam). * *Benefits:* * Faster convergence than SGD/Adam, particularly for large models [not shown yet?] and complex tasks. * Smooth interpolation between first and second-order optimization by controlling the SPU's evolution time. * Inherent momentum-like effect due to system delays further improves performance. * *Proof-of-Concept:* TNGD demonstrates its superiority over Adam on MNIST classification and shows promising results on language model fine-tuning (distilled BERT), outperforming both pure NGD and Adam. * *Looking Ahead:* TNGD represents an early step in thermodynamic computing for AI. Scaling up the technology, refining the implementation, and exploring its wider applicability are key next steps. i used gemini 1.5 pro to summarize the transcript and paper
@badrraitabcas
@badrraitabcas Ай бұрын
@@wolpumba4099 the disclaimer is dope
@BlueBirdgg
@BlueBirdgg 28 күн бұрын
Ty for your videos.
@PulsatingShadow
@PulsatingShadow Ай бұрын
Thanks
@rickybloss8537
@rickybloss8537 Ай бұрын
I believe the third order is jolt
@johnny5941
@johnny5941 23 күн бұрын
I searched Wikipedia:4th snap(jounce),5th crackle,6th pop. I am assuming op knows 3rd is jerk
@j.rumbleseed
@j.rumbleseed 29 күн бұрын
Yep yep, and the big winner will be the one that replaces the FPGA's that are a work around, and introduces the code in the thermodynamic actuation of the system. Soon it seems.
@TreeLuvBurdpu
@TreeLuvBurdpu 29 күн бұрын
It's important to understand that the government funding of the chip fabs always means that they will be responding to political incentives, political "likes", rather than actual market incentives from people with actual skin in the game, and it will be slower to respond to changes.
@MDNQ-ud1ty
@MDNQ-ud1ty 27 күн бұрын
Why the heck is everyone pronouncing ss - ß as sh? I have seen this in almost every CS video now ;/
Intro to Gradient Descent || Optimizing High-Dimensional Equations
11:04
Dr. Trefor Bazett
Рет қаралды 61 М.
Susan McConnell (Stanford): Designing effective scientific presentations
42:09
Science Communication Lab
Рет қаралды 525 М.
Пробую самое сладкое вещество во Вселенной
00:41
МАМА И STANDOFF 2 😳 !FAKE GUN! #shorts
00:34
INNA SERG
Рет қаралды 2,7 МЛН
Дибала против вратаря Легенды
00:33
Mr. Oleynik
Рет қаралды 2,6 МЛН
Gradient Descent in 3 minutes
3:06
Visually Explained
Рет қаралды 163 М.
WHAT IS CFD:  Introduction to Computational Fluid Dynamics
13:07
DMS | Marine Consultant
Рет қаралды 204 М.
Thermodynamics Chemistry | Thermodynamic Process
6:21
PLAY Chemistry
Рет қаралды 202 М.
How Well Can DeepMind's AI Learn Physics? ⚛
7:18
Two Minute Papers
Рет қаралды 1,6 МЛН
Google's RAG Experiment - NotebookLM
13:39
Sam Witteveen
Рет қаралды 14 М.
Variational Autoencoders
15:05
Arxiv Insights
Рет қаралды 481 М.
Road Less Scheduled
1:41:44
hu-po
Рет қаралды 1,4 М.
Обзор Sonos Ace - лучше б не выпускали...
16:33
Asus  VivoBook Винда за 8 часов!
1:00
Sergey Delaisy
Рет қаралды 1 МЛН
How To Unlock Your iphone With Your Voice
0:34
요루퐁 yorupong
Рет қаралды 26 МЛН
APPLE совершила РЕВОЛЮЦИЮ!
0:39
ÉЖИ АКСЁНОВ
Рет қаралды 3,6 МЛН
iPhone 12 socket cleaning #fixit
0:30
Tamar DB (mt)
Рет қаралды 49 МЛН