Gérard Ben Arous - 1/4 Random Matrices and Dynamics of Optimization in Very High Dimensions

  Рет қаралды 5,037

Institut des Hautes Etudes Scientifiques (IHES)

Institut des Hautes Etudes Scientifiques (IHES)

Күн бұрын

Machine learning and Data science algorithms involve in their last stage the need for optimization of complex random functions in very high dimensions. Simple algorithms like Stochastic Gradient Descent (with small batches) are used very effectively. I will concentrate on trying to understand why these simple tools can still work in these complex and very over-parametrized regimes. I will first introduce the whole framework for non-experts, from the structure of the typical tasks to the natural structures of neural nets used in standard contexts. l will then cover briefly the classical and usual context of SGD in finite dimensions. I will then survey recent work with Reza Gheissari (Northwestern), Aukosh Jagannath (Waterloo) giving a general view for the existence of projected “effective dynamics” for “summary statistics” in much smaller dimensions, which still rule the performance of very high dimensional systems, as well . These effective dynamics (as their so-called “critical regime”) define a dynamical system in finite dimensions which may be quite complex, and rules the performance of the learning algorithm. The next step will be to understand how the system finds these “summary statistics”. This is done in the next work with the same authors and with Jiaoyang Huang (Wharton, U-Penn). This is based on a dynamical spectral transition of Random Matrix Theory: along the trajectory of the optimization path, the Gram matix or the Hessian matrix develop outliers which carry these effective dynamics. I will naturally first come back to the Random Matrix Tools needed here (the behavior of the edge of the spectrum and the BBP transition) in a much broader context. And then illustrate the use of this point of view on a few central examples of ML: multilayer neural nets for classification (of Gaussian mixtures), and the XOR examples, for instance.
Gérard Ben Arous (CIMS, New York University)
===
Find this and many more scientific videos on www.carmin.tv/ - a French video platform for mathematics and their interactions with other sciences offering extra functionalities tailored to meet the needs of the research community.
===

Пікірлер
Gérard Ben Arous - 2/4 Random Matrices and Dynamics of Optimization in Very High Dimensions
2:05:07
Institut des Hautes Etudes Scientifiques (IHES)
Рет қаралды 1,7 М.
The Key Equation Behind Probability
26:24
Artem Kirsanov
Рет қаралды 161 М.
СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️
01:01
DO$HIK
Рет қаралды 3,3 МЛН
Gérard Ben Arous - 3/4 Random Matrices and Dynamics of Optimization in Very High Dimensions
1:51:36
Institut des Hautes Etudes Scientifiques (IHES)
Рет қаралды 1 М.
The SECRET To Why Jim Simons' Hedge Fund Remains The BEST...
2:40
Terence Tao Teaches Mathematical Thinking
1:21:24
Raterfy
Рет қаралды 43 М.
Dynamics and Critical Scaling - Gérard Ben Arous
1:09:11
Institute for Advanced Study
Рет қаралды 1,8 М.
2024's Biggest Breakthroughs in Math
15:13
Quanta Magazine
Рет қаралды 612 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,2 МЛН
Alain Connes - Prime, Knots and the Adele Class Space
1:13:39
Institut des Hautes Etudes Scientifiques (IHES)
Рет қаралды 6 М.
СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️
01:01
DO$HIK
Рет қаралды 3,3 МЛН