Deep Deterministic Policy Gradients

Views: 19,810

CIS 522 - Deep Learning

A day ago

Comments: 14

@sharvani6133 2 years ago

Thank you for the video!

@BananaLassi 1 year ago

I feel like I can graduate after watching this video.

@leo.y.comprendo 3 years ago

Thanks for the video! I have a question: what loss function is used for the policy network?

@daniamartinez4817 2 years ago

Thank you so much!

@brunomelicio2248 3 years ago

Very good explanation. Thank you very much! Keep up the good work.

@simmingi1 10 months ago

for what? for you? what a selfish person 😅😅

@amrahmed2009 2 years ago

Very well explained. Thank you

@maxschumacher12 2 years ago

Excellent explanation!

@ParagMantri 3 years ago

This is very well explained.

@overgeared 3 years ago

excellent, thanks

@Firestorm-tq7fy 2 years ago

The video was not bad, but sorry, this has nothing to do with continuous action spaces. You simply described actor-critic RL, not continuous action spaces...

@wesnaw100 1 year ago

It's a bit confusing because he doesn't go into detail on what the actor network is outputting, but it is indeed outputting continuous actions.

@Firestorm-tq7fy 1 year ago

@wesnaw100 Sorry, but no. Continuous action spaces are represented as two outputs that parameterize a distribution (mean and variance).

@kkyars 1 year ago

@Firestorm-tq7fy Yes, and that is a continuous distribution.
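
For readers following the exchange above, here is a minimal sketch of the two actor parameterizations being discussed. It is not taken from the video: the layer sizes, the action bound, the random untrained weights, and the function names (`deterministic_actor`, `gaussian_actor`) are all hypothetical and chosen only for illustration. The first function maps a state straight to one real-valued action vector, as a DDPG-style deterministic actor does; the second outputs a mean and a standard deviation and samples from the resulting Gaussian. Both produce continuous actions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes and action bound, chosen only for this illustration.
STATE_DIM, ACTION_DIM, HIDDEN = 3, 2, 16
ACTION_LIMIT = 2.0

def init_layer(n_in, n_out):
    """Small random weights and zero biases (untrained, for shape checks only)."""
    return rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out)

W1, b1 = init_layer(STATE_DIM, HIDDEN)
W_mu, b_mu = init_layer(HIDDEN, ACTION_DIM)
W_logstd, b_logstd = init_layer(HIDDEN, ACTION_DIM)

def hidden(state):
    """Shared hidden layer used by both actor heads."""
    return np.tanh(state @ W1 + b1)

def deterministic_actor(state):
    """Deterministic actor: one real-valued action per state, squashed into
    [-ACTION_LIMIT, ACTION_LIMIT] by tanh."""
    return ACTION_LIMIT * np.tanh(hidden(state) @ W_mu + b_mu)

def gaussian_actor(state):
    """Stochastic actor: outputs a mean and a standard deviation, then samples
    an action from the corresponding Gaussian."""
    h = hidden(state)
    mean = h @ W_mu + b_mu
    std = np.exp(h @ W_logstd + b_logstd)  # exp keeps the std strictly positive
    action = mean + std * rng.standard_normal(ACTION_DIM)
    return action, mean, std

state = np.array([0.1, -0.4, 0.7])
a_det = deterministic_actor(state)
a_sto, mean, std = gaussian_actor(state)
print("deterministic action:", a_det)
print("sampled action:", a_sto, "mean:", mean, "std:", std)
```

Calling `deterministic_actor` twice on the same state returns the same action, which is why deterministic-policy methods typically add exploration noise separately during training; `gaussian_actor` returns a different sample on each call.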