That is the best video I watched so far to understand this topic
@hobby_coding4 жыл бұрын
very good lecture maybe the best introduction to this topic i've ever seen on youtube
@Pexers.3 жыл бұрын
Thank you, I spent hours in this algorithm, finally understood it !
@coeusmaze94135 жыл бұрын
The video provides intuitive but deep understanding in MDP
@syedrumman39202 жыл бұрын
This is such a clear explanation!! Ty for this!! I wish I had taken your class while I was in VT!
@srujayop2 жыл бұрын
Is the reward R(s) actually R(s')? And should that also be multiplied with the transition probability? max(over a) sum P(s', r|,s, a) [r + gamma*V(s')] ? I am trying to relate the equation presented in the video to standard notation 4 par notation.
@Ahmed.r.a7 ай бұрын
thank you for this brilliant explanation. I wished there was a Question with solution to practice on.
@jub8891 Жыл бұрын
thank you so much, you explain the subject very well and have helped me to understand..
@ryanflynn3865 жыл бұрын
This is a great explanation video, thanks so much. Your voice is easy to listen to too haha.
@berty385 жыл бұрын
Ryan Flynn Thanks! I’m glad it’s helpful. My smooth voice is a huge disadvantage when I teach morning classes and my students all fall asleep.
@tarik86224 жыл бұрын
Very interesting topic. And i think that you will make a fortune if you use your voice in publicity field. Best regards.
@jff7113 жыл бұрын
Thank you very much, very well explained.
@xruan65824 жыл бұрын
can anyone explain (32:00) the switch between two modes (i.e. represented by green and red arrow). To me the green one seems like deterministic rule, the red one seems like stochastic rule. Can they exist simultaneously?
@behmandtirgar4 жыл бұрын
I have a question at time 8:30 : if we take an action to go to the left, why Pr(c | b, left) isn't 0.00? (we go to another side)
@seanxu6741 Жыл бұрын
Fantastic video! Thanks a lot!
@treegnome23714 жыл бұрын
at 17:35, why isn't it gamma = (0,1), instead of (0,1]...if gamma = 1, the influence of the actions farther down the road stays the same as all other actions, rather than shrinking the influence...right?
@JustinMasayda2 жыл бұрын
This was fantastic, thank you!
@sanskarshrivastava51933 жыл бұрын
Best video for MDP on youtube
@Throwingness3 жыл бұрын
Around 34:00 when there are equations on the screen you should have had a pointer or something to point at what you are talking about. It's not clear.
@richardm59164 жыл бұрын
Realy great explaintion on Machine learning
@consolesblow5 жыл бұрын
Thanks a lot! I found this very helpful.
@joshuasegal41615 жыл бұрын
What software are you using to make this?? It looks like you have like an infinite page which gives a really clean look
@berty385 жыл бұрын
Nothing too fancy. This was done with Apple Keynote, and I'm faking that scrolling effect with "Magic Move" animations. I'm always looking for better tools to build useful visuals for lectures.
@quantlfc2 жыл бұрын
Absolutely amazing lecture!!!
@JebbigerJohn Жыл бұрын
This is so good!!!
@ismailasmcalskan25524 жыл бұрын
Really good video about this topic. Thank you
@_brenda49753 жыл бұрын
much better than my lecturer
@linfrancis52045 жыл бұрын
Great video. Thank you. Could you please make a similar video while we consider a two-dimensional Markov chain with more states?
@peterkimemiah96693 жыл бұрын
Very good easy to understand.
@sander1426-24 жыл бұрын
Thanks for the explanation!
@rezadarooei2485 жыл бұрын
Thanks for your nice tutorial is it possible upload the slides?
@zenchiassassin2834 жыл бұрын
What textbook ? thank you very much
@EdupugantiAadityaaeb Жыл бұрын
What is the name of textbook
@jaideep_yes5 жыл бұрын
Thank you.
@y-30844 жыл бұрын
excellent
@dminn4 жыл бұрын
God bless
@ahmet94465 жыл бұрын
The best I find is [4, 1]. I couldn't achieve [4.2, 1.2]. Does anyone achieve [4.2, 1.2]?