i feel like i can graduate after watching this video
@leo.y.comprendo3 жыл бұрын
Thanks for the video! I have a question, what is the loss function used for the policy network?
@daniamartinez48172 жыл бұрын
Thank you so much!
@brunomelicio22483 жыл бұрын
Very good explanation. Thank you very much! Keep up the good work.
@simmingi110 ай бұрын
for what? for you? what a selfish person 😅😅
@amrahmed20092 жыл бұрын
Very well explained. Thank you
@maxschumacher122 жыл бұрын
Excellent explanation!
@ParagMantri3 жыл бұрын
This is very well explained.
@overgeared3 жыл бұрын
excellent, thanks
@Firestorm-tq7fy2 жыл бұрын
the video was not bad but sry, this has nothing todo with continuous action spaces. you simply described actor-critir RL and not continous action spaces...
@wesnaw100 Жыл бұрын
It's a bit confusing because he doesn't go into detail on what the actor network is outputting, but it is indeed outputting continuous actions.
@Firestorm-tq7fy Жыл бұрын
@@wesnaw100 sry, but no. Continues action spaces are formatted as 2 outputs generating a distribution (variance and mean)
@kkyars Жыл бұрын
@@Firestorm-tq7fy yes, and that is a continuous distrbution