Five Miracles of Mirror Descent, Lecture 2/9

  Рет қаралды 4,086

Sebastien Bubeck

Sebastien Bubeck

Күн бұрын

Пікірлер: 8
@honey-py9pj
@honey-py9pj Жыл бұрын
In 9:49 it is stated that opt is a vector that has everywhere 0 expect from one coordinate , let's say the i-th. Why exactly?To find this we take the gradient of cumulative losses for a fixed probability distribution p , right? And then?
@avi111986
@avi111986 4 жыл бұрын
I'm not sure about Approach 1 suggested around 4:00 . How does he perform GD on the function < l_t, p >, when l_t is not known in advance? If l_t is known to the player at the beginning of time t why not just choose some expert with 0 loss? I'm probably missing something in the problem description. Can someone please help me out here?
@SebastienBubeck
@SebastienBubeck 4 жыл бұрын
The suggestion is to get p_{t+1} from p_t by a step of gradient descent on the function < l_t, p >. In particular, this operation can be performed at the beginning of round t+1 (when you need p_{t+1}), and thus at a time when l_t is known to the player.
@scose
@scose 4 жыл бұрын
Is there a written reference for this Riemannian interpretation of mirror descent? It seems different from the interpretation in your work "Convex Optimization: Algorithms and Complexity", which doesn't mention manifolds.
@SebastienBubeck
@SebastienBubeck 4 жыл бұрын
Unfortunately I have not written it yet, but I have plans to do it at some point in the future... For the moment you can take a look at this paper arxiv.org/abs/2004.01025 , although they have a different interpretation than mine.
@scose
@scose 4 жыл бұрын
@@SebastienBubeck Thank you!
@christianholtz5182
@christianholtz5182 Жыл бұрын
Hi, me again. I start a company called 5k education. Book learning, watch lectures on tv with 3 friends, testing. 2 degrees for 5k (since most fail, see executive function).
@cpthddk
@cpthddk 4 жыл бұрын
Cameraman kind of sucks on this one... I feel like they didn't get that the content was important, not seb's hot bod
Five Miracles of Mirror Descent, Lecture 3/9
1:00:19
Sebastien Bubeck
Рет қаралды 2,4 М.
TCS+ talk: Sébastien Bubeck
1:12:11
TCS+
Рет қаралды 2,3 М.
She made herself an ear of corn from his marmalade candies🌽🌽🌽
00:38
Valja & Maxim Family
Рет қаралды 18 МЛН
Chain Game Strong ⛓️
00:21
Anwar Jibawi
Рет қаралды 41 МЛН
Five Miracles of Mirror Descent, Lecture 9/9
57:04
Sebastien Bubeck
Рет қаралды 1,2 М.
Five Miracles of Mirror Descent, Lecture 6/9
48:05
Sebastien Bubeck
Рет қаралды 1,1 М.
Physics of AI
1:00:04
Sebastien Bubeck
Рет қаралды 33 М.
Sparks of AGI: early experiments with GPT-4
48:32
Sebastien Bubeck
Рет қаралды 1,7 МЛН
Five Miracles of Mirror Descent, Lecture 7/9
56:23
Sebastien Bubeck
Рет қаралды 793
Five Miracles of Mirror Descent, Lecture 8/9
49:25
Sebastien Bubeck
Рет қаралды 805
A Universal Law of Robustness
29:38
Sebastien Bubeck
Рет қаралды 11 М.
Textbooks Are All You Need
37:16
Sebastien Bubeck
Рет қаралды 18 М.
Unveiling Transformers with LEGO
46:42
Sebastien Bubeck
Рет қаралды 7 М.
She made herself an ear of corn from his marmalade candies🌽🌽🌽
00:38
Valja & Maxim Family
Рет қаралды 18 МЛН