KZbin algo, please set the relevance score of this video to 10/10. This video is too good to be ignored.
@CodeEmporium 1 year ago
Thank you! Now if only the KZbin gods would listen.
@slitihela1860 9 months ago
Can you please prepare a video on Double Q-Learning Networks and Dueling Double Q-Learning Networks?
@vanilan3585 1 year ago
You just made a video on what I am about to study 😃
@borneoland-hk2il 2 months ago
So there are only two families of methods in RL, value-based and policy-gradient-based, and actor-critic methods fall under the policy-gradient category. Is that correct? What is the source for this information? And would you consider covering some actor-critic methods in future RL videos?
@jsp991204 10 months ago
Thanks a lot!! 😀
@alirezasalehabadi1422 5 months ago
Thank you.
@rinibhasin17 7 months ago
Confused :(
@bhaveshachhada7242 10 months ago
I was confused. You made me more confused. This doesn't explain the intuition.
@RelaxHERE-zk8ts 2 months ago
lol, what was confusing here? He simply explained policy generation and the value-function-based way of generating a policy, then covered the two kinds of value function a policy can be derived from, V(s) and Q(s,a). The simple intuition is being able to identify the state with the maximum reward. You should watch the Markov decision process video first; then it will make sense.
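For anyone still stuck on the V(s) vs Q(s,a) point, here is a minimal sketch of how a greedy policy can be read off each kind of value function. Everything in it is an assumption made up for illustration and is not from the video: the 3-state, 2-action toy problem, all of the numbers, the deterministic model (`next_state`, `reward`), and the function names `greedy_from_q` and `greedy_from_v`.

```python
# Hypothetical toy problem: 3 states, 2 actions. All numbers are invented.
q_table = [
    [1.0, 2.0],  # Q(s=0, a=0), Q(s=0, a=1)
    [0.5, 0.1],  # state 1
    [0.0, 3.0],  # state 2
]

def greedy_from_q(q):
    """For each state, pick the action with the highest Q(s, a)."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q]

# With V(s) alone we also need a model of the environment.
# Assumed deterministic model: next_state[s][a] and reward[s][a].
next_state = [[1, 2], [0, 2], [2, 0]]
reward = [[0.0, 1.0], [0.0, 0.5], [0.0, 0.0]]
v = [1.0, 0.5, 3.0]  # V(s) for states 0, 1, 2
gamma = 0.9          # discount factor

def greedy_from_v(v, next_state, reward, gamma):
    """One-step lookahead: pick the action maximizing r + gamma * V(s')."""
    policy = []
    for s in range(len(v)):
        scores = [reward[s][a] + gamma * v[next_state[s][a]]
                  for a in range(len(next_state[s]))]
        policy.append(max(range(len(scores)), key=lambda a: scores[a]))
    return policy

policy_q = greedy_from_q(q_table)
policy_v = greedy_from_v(v, next_state, reward, gamma)
print(policy_q, policy_v)
```

The difference to notice: with Q(s,a) the best action is just an argmax over the table row, while with V(s) you have to look one step ahead through a model to see which action leads toward the high-value state.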