Markov Decision Processes Four - Georgia Tech - Machine Learning

  Рет қаралды 53,634

Udacity

Udacity

Күн бұрын

Пікірлер: 15
@EmilyXieX
@EmilyXieX 4 жыл бұрын
This is super clear. Thanks so much for making this video.
@audic2350
@audic2350 2 жыл бұрын
The greatest video I could watch to understand MDP.
@vishalkumarpandey5546
@vishalkumarpandey5546 Жыл бұрын
Such an insightful discussion based explanation. Great 👍
@QQ-xx7mo
@QQ-xx7mo 6 жыл бұрын
Awesome videos, Thank you
@cigxhang486
@cigxhang486 11 ай бұрын
so the policy tells you the next action to take in order for you to reach the reward eventually?
@renskirchner6309
@renskirchner6309 4 жыл бұрын
You're a genius
@renskirchner6309
@renskirchner6309 4 жыл бұрын
When it comes to explanation imean
@enditend2
@enditend2 9 жыл бұрын
no part 5?
@braineedly7543
@braineedly7543 2 жыл бұрын
Is decision of policy based on model?
@lahaale5840
@lahaale5840 7 жыл бұрын
is the reward by given? or where is the reward come from? is it equivalent to label data in supervise learning?
@oldcowbb
@oldcowbb 3 жыл бұрын
i think it is more like the cost function associated with whether the prediction matches with the label, it is some numerical function to indicate what you want the algorithm to optimize, like matching labels in classification or getting closer to the goal in navigation
@braineedly7543
@braineedly7543 2 жыл бұрын
@@oldcowbb so we should store every reward of each state?
@oldcowbb
@oldcowbb 2 жыл бұрын
@@braineedly7543 well you can't solve an MDP without the reward so yes
@joselabaki8290
@joselabaki8290 2 жыл бұрын
The Instructor is excellent, unfortunately, the explanation is slowed down, sometimes "blurred" because of the non-stop interjections. I believe a single voice is more than enough.
More About Rewards - Georgia Tech - Machine Learning
5:06
Udacity
Рет қаралды 13 М.
How Strong Is Tape?
00:24
Stokes Twins
Рет қаралды 96 МЛН
Сестра обхитрила!
00:17
Victoria Portfolio
Рет қаралды 958 М.
When you have a very capricious child 😂😘👍
00:16
Like Asiya
Рет қаралды 18 МЛН
Что-что Мурсдей говорит? 💭 #симбочка #симба #мурсдей
00:19
RL Course by David Silver - Lecture 2: Markov Decision Process
1:42:05
Google DeepMind
Рет қаралды 650 М.
Markov Chains Clearly Explained! Part - 1
9:24
Normalized Nerd
Рет қаралды 1,3 МЛН
Policies - Georgia Tech - Machine Learning
5:05
Udacity
Рет қаралды 25 М.
Markov Decision Processes
43:18
Bert Huang
Рет қаралды 78 М.
How Strong Is Tape?
00:24
Stokes Twins
Рет қаралды 96 МЛН