No video

18 January 2024: Hans Kersting (Yahoo! Research)

  Рет қаралды 125

UCL Statistical Science seminars

UCL Statistical Science seminars

Күн бұрын

Title: The beneficial role of stochastic noise in SGD
Abstract: The data sets used to train modern machine-learning models are often huge, e.g. millions of images. This makes it too expensive to compute the true gradient over all data sets. In each gradient descent (GD) step, a stochastic gradient is thus computed over a subset ("mini-batch”) of data. The resulting stochastic gradient descent (SGD) algorithm, and its variants, is the main workhorse of modern machine learning. Until recently, most machine-learning researchers would have preferred to use GD, if they could, and considered SGD only as a fast approximation to GD. But new research suggests that the stochasticity in SGD is part of the reason why SGD works so well. In this talk, we investigate multiple theories on the advantages of the noise in SGD, including better generalization in flatter minima (‘implicit bias’) and faster escapes from difficult parts of the landscapes (such as saddle points and local minima). We highlight how correlating noise can help optimization and zoom in on the question which noise structure would be optimal for SGD.

Пікірлер
19 January 2023: Ilina Yozova
51:22
UCL Statistical Science seminars
Рет қаралды 149
11 January 2024: Sam Power (University of Bristol)
55:27
UCL Statistical Science seminars
Рет қаралды 262
Get 10 Mega Boxes OR 60 Starr Drops!!
01:39
Brawl Stars
Рет қаралды 18 МЛН
Before VS during the CONCERT 🔥 "Aliby" | Andra Gogan
00:13
Andra Gogan
Рет қаралды 9 МЛН
КТО ЛЮБИТ ГРИБЫ?? #shorts
00:24
Паша Осадчий
Рет қаралды 2,4 МЛН
25 April 2024: Brendan Murphy (University College Dublin)
54:19
UCL Statistical Science seminars
Рет қаралды 60
Mr. Jakiw Pidstrigach | Infinite-Dimensional Diffusion Models
47:49
INI Satellite Events
Рет қаралды 105
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 288 М.
The Greenwich Meridian is in the wrong place
25:07
Stand-up Maths
Рет қаралды 828 М.
The Clever Way to Count Tanks - Numberphile
16:45
Numberphile
Рет қаралды 1 МЛН
Denis Noble explains his revolutionary theory of genetics | Genes are not the blueprint for life
14:33
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 954 М.