Reinforcement Learning Course: Intro to Advanced Actor Critic Methods

Рет қаралды 79,143

freeCodeCamp.org

Күн бұрын

Пікірлер: 60

@MachineLearningwithPhil 3 жыл бұрын

Hey I know that guy! Any questions, please leave them down below!

@TheRealKitWalker 3 жыл бұрын

It was very helpful. Thanks for sharing your knowledge and pointers. 👏👏👌👌👍👍

@AungBaw 3 жыл бұрын

Thank you Phil 👏👏👏👏👏

@daydreamed 3 жыл бұрын

Tell him we are grateful for such a great video

@ashwinkotgire2303 2 жыл бұрын

Please make a video on ACKTR(actor critic using knonecker factorizations)

@Joy_jester Жыл бұрын

Love the videos, but is it possible to get the videos for pytorch specifically? Thanks

@rachadelmoutaouaffiq8752 3 жыл бұрын

Guys we should literally donate to this channel once Hired , is more useful than most universities

@dhanniekristanto 3 жыл бұрын

agree

@Frost_Byte_Tech 3 жыл бұрын

Very true

@Am0re98 3 жыл бұрын

Agree!!

@kiran10110 3 жыл бұрын

This is why the computer science and software engg field is so successful and growing so quickly. We keep everything open source and freely available to anyone willing to learn. That’s so rare these days. There are so many other fields that lock up their knowledge in university courses and paywalls.

@BlurryBit 3 жыл бұрын

Man this channel is a goldmine 😂 Nothing new though as this is not the first course I saw here. This course is going to be very helpful for me. Thank you for the work you guys are putting into teaching people like me. P.s. double thanks for the nextjs course as well. It was very helpful.

@quincylarsonmusic 3 жыл бұрын

Thanks!

@muhammadaarizmarzuq295 2 жыл бұрын

ok sub bot

@quincylarsonmusic 2 жыл бұрын

@@muhammadaarizmarzuq295 I am not a bot.

@AungBaw 3 жыл бұрын

Personally like the style of few slides, no BS, no nothing Sir, straight to the coding. Strong work.

@vpundir3024 3 жыл бұрын

As a first viewer and a young coder I love code camp well my age is 12 and in picture he is my dad so don’t be confused

@geekyprogrammer4831 3 жыл бұрын

Nobody cares if you are 12.

@BattleOfficial_ 3 жыл бұрын

@@geekyprogrammer4831 bro chill, sort your own problems out before hating on others.

@geekyprogrammer4831 3 жыл бұрын

@@BattleOfficial_ No hate but revealing age doesn't pertain to the content.

@BattleOfficial_ 3 жыл бұрын

@@geekyprogrammer4831 alright but he's young and is excited by the video, just move on the kids 12, no need to reply.

@trentonspears5304 3 жыл бұрын

Oohhh sweet! Machine Learning with Phil is awesome!

@ketchupparty9997 3 жыл бұрын

This was exactly what I wanted to learn. Thank you

@kbhaskar36 3 жыл бұрын

This channel is awesome. Its content and support is beyond any words.. Thank you so much for all the quality content Team.

@cescabhi 3 жыл бұрын

Wow, I just turned in my project with an actor critic algorithm THIS WEEK. -__- *cries

@prajyotkumar9644 3 жыл бұрын

good job....kinda gives me hope too. Ik its stupid....

@attilasarkany6123 9 ай бұрын

Hi everyone. Could you recommend any paper or longer discussion about the limitations of actor critic models for continuous space

@Falconoo7383 2 жыл бұрын

Thank you for the awesome video. Can you please characterize all the DRL models? If possible.

@smitasingh9764 2 жыл бұрын

Thank you so much for putting an effort to do the whole implementation which is relatively bit easier to grasp than the paper. I am very new to RL and I have a rather weird question(cause no one actually addressed but ignore if I am being stupid), so when for the first time you call the learn function after doing 20 steps, wouldn't the new_probs be equal to the old_probs, because essentially the neural network didn't learn anything so would both these values be random until like several iteration? And if actually they would be random, how is the agent learning?

@kkyars Жыл бұрын

it is essentially stricly exploring at the begginging, and learning only comes into effect once the environemnt dynamics of the rewards begin to affect the random values by increasing or decreasing rthe probabilites of actions

@tarifcemay3823 2 жыл бұрын

I thought prob_ratio must equal to one if we replay the same action as the actor is updated after replay . am I right?

@marohs5606 3 жыл бұрын

Woow 😍😍😍 thank you 👏👏👏👏👏

@vpundir3024 3 жыл бұрын

👍 great

@SnehaChendke-l2z Жыл бұрын

Are these methods suggested for NLP tasks such as Text classification?

@lakshmichaitanya1316 3 жыл бұрын

We need a new vue.js course!

@mohamednasrel-dinazouzmoha8210 3 жыл бұрын

لماذا لا توجد ترجمه مصاحبة لهذا الفيديو

@saifelshasly973 3 жыл бұрын

Hello my brother, I am an Arab

@2minuteschool929 2 жыл бұрын

Thanks

@playerxgaming2377 3 жыл бұрын

Super

@ayarzuki 3 жыл бұрын

Please Turn On the Auto English subtitle. I am not English native

@codinghacker822 3 жыл бұрын

What is coding

@technicalbro8409 3 жыл бұрын

*This channel is pure bitcoin*

@vanvothe4817 3 жыл бұрын

Simle man with vanila vim

@AbdulMunim-kx6np 3 жыл бұрын

What does actor critic method means?

@MachineLearningwithPhil 3 жыл бұрын

it means we're using neural networks to do two separate things: decide what to do (the actor) and decide whether or not that action was valuable (the critic). The two networks help each other learn how to select the most profitable actions over time.

@AbdulMunim-kx6np 3 жыл бұрын

@@MachineLearningwithPhil that is interesting