Lecture 2, 2024, Stochastic finite and infinite horizon DP, approximation in value and policy space

Lecture 3, 2024, LQ Problems, Approximation in Value Space, VI, and PI, Newton's Method, Examples

Lecture 4, 2024, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

😺🍫 خدعة الشوكولاتة المذهلة لقطتي! شاهد كيف تعلمني قطتي القيام بها! 😂🎉

Une nouvelle voiture pour Noël 🥹

Мен атып көрмегенмін ! | Qalam | 5 серия

Lecture 2, 2024, Stochastic finite and infinite horizon DP, approximation in value and policy space

Рет қаралды 1,679

Dimitri Bertsekas

Dimitri Bertsekas

Күн бұрын

Пікірлер: 2

@amitbhaya3571 4 ай бұрын

On the slide explaining rollout for the TSP (starting around 22:30), the cost from node AC to node ACD should be 1 not 3 (in order to be compatible with the matrix of intercity travel costs). Similarly the cost from ADB to ADBC should be 1 and not 20. If these corrections are made, rollout applied from A will lead to the path A-C-D-B-A (cost 26) and will not recover the optimal path A-B-D-C-A (shown in red on the slide) with cost 13. This shows that rollout could lead to suboptimal outcomes. However, if the A to C cost is changed to 3 [in the matrix of intercity travel costs], then everything works as advertised. Side remark: I also tried modifying the matrix to reflect the (black) numbers on the slide (C to D cost 3, B to C cost 20), but then there is a lot more to change!

@CoyNDPL 6 ай бұрын

I would like to ask for some clarification: On slide 15 / 34 at 48:10, L is introduced with a subscript "N-1", but all of the variables L depends on {a, b, q, r} are time invariant in this example. Is it useful, in general, to show that L has some time dependence? Thanks in advance!

Lecture 3, 2024, LQ Problems, Approximation in Value Space, VI, and PI, Newton's Method, Examples

1:42:27

Lecture 3, 2024, LQ Problems, Approximation in Value Space, VI, and PI, Newton's Method, Examples

Dimitri Bertsekas

Рет қаралды 850

Lecture 4, 2024, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

1:46:26

Lecture 4, 2024, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

Dimitri Bertsekas

Рет қаралды 571

00:47

Natan por Aí

Рет қаралды 30 МЛН

😺🍫 خدعة الشوكولاتة المذهلة لقطتي! شاهد كيف تعلمني قطتي القيام بها! 😂🎉

00:30

😺🍫 خدعة الشوكولاتة المذهلة لقطتي! شاهد كيف تعلمني قطتي القيام بها! 😂🎉

PuffPaw Arabic

Рет қаралды 17 МЛН

Une nouvelle voiture pour Noël 🥹

00:28

Une nouvelle voiture pour Noël 🥹

Nicocapone

Рет қаралды 9 МЛН

Мен атып көрмегенмін ! | Qalam | 5 серия

25:41

Мен атып көрмегенмін ! | Qalam | 5 серия

kak budto

Рет қаралды 1,2 МЛН

Lecture 8, 2024, Rollout for stochastic DP. Value space approx for infinite state and control spaces

1:32:39

Lecture 8, 2024, Rollout for stochastic DP. Value space approx for infinite state and control spaces

Dimitri Bertsekas

Рет қаралды 393

Lecture 5, 2024, Deterministic Rollout, cost improvement, sequential improvement, multiagent rollout

1:30:28

Lecture 5, 2024, Deterministic Rollout, cost improvement, sequential improvement, multiagent rollout

Dimitri Bertsekas

Рет қаралды 552

Eleanor Crane: Quantum Computation with Fermions, Bosons, and Qubits

59:01

Eleanor Crane: Quantum Computation with Fermions, Bosons, and Qubits

QUINFOG CSIC

Рет қаралды 16

ML Tutorial: Gaussian Processes (Richard Turner)

1:53:32

ML Tutorial: Gaussian Processes (Richard Turner)

Marc Deisenroth

Рет қаралды 138 М.

Dimitri Bertsekas, Convex Optimization: A Journey of 60 Years, Lecture at MIT

24:30

Dimitri Bertsekas, Convex Optimization: A Journey of 60 Years, Lecture at MIT

Dimitri Bertsekas

Рет қаралды 2,8 М.

2024 MIT Integration Bee - Finals

1:09:25

2024 MIT Integration Bee - Finals

MIT Integration Bee

Рет қаралды 756 М.

Lecture 11, 2023: Review of off-line training, approximate VI and PI, aggregation, course overview

1:44:36

Lecture 11, 2023: Review of off-line training, approximate VI and PI, aggregation, course overview

Dimitri Bertsekas

Рет қаралды 709

Lecture 9, 2024, Bayesian optimization and adaptive control with a POMDP approach. Wordle case study

1:10:09

Lecture 9, 2024, Bayesian optimization and adaptive control with a POMDP approach. Wordle case study

Dimitri Bertsekas

Рет қаралды 2,6 М.

Lecture 6, 2024, Multistep Approximation in Value Space, Constrained Rollout, Multiagent Rollout

1:27:03

Lecture 6, 2024, Multistep Approximation in Value Space, Constrained Rollout, Multiagent Rollout

Dimitri Bertsekas

Рет қаралды 445

Lecture 2: Experimental Facts of Life

1:20:12

Lecture 2: Experimental Facts of Life

MIT OpenCourseWare

Рет қаралды 1,7 МЛН

00:47

Natan por Aí

Рет қаралды 30 МЛН