Reinforcement Learning Course: Intro to Advanced Actor Critic Methods

Views: 79,143

freeCodeCamp.org

Comments: 60
@MachineLearningwithPhil 3 years ago
Hey I know that guy! Any questions, please leave them down below!
@TheRealKitWalker 3 years ago
It was very helpful. Thanks for sharing your knowledge and pointers. 👏👏👌👌👍👍
@AungBaw 3 years ago
Thank you Phil 👏👏👏👏👏
@daydreamed 3 years ago
Tell him we are grateful for such a great video
@ashwinkotgire2303 2 years ago
Please make a video on ACKTR (Actor Critic using Kronecker-Factored Trust Region)
@Joy_jester 1 year ago
Love the videos, but is it possible to get the videos for PyTorch specifically? Thanks
@rachadelmoutaouaffiq8752 3 years ago
Guys, we should literally donate to this channel once hired; it is more useful than most universities
@dhanniekristanto 3 years ago
agree
@Frost_Byte_Tech 3 years ago
Very true
@Am0re98 3 years ago
Agree!!
@kiran10110 3 years ago
This is why the computer science and software engineering field is so successful and growing so quickly. We keep everything open source and freely available to anyone willing to learn. That’s so rare these days. There are so many other fields that lock up their knowledge in university courses and paywalls.
@BlurryBit 3 years ago
Man, this channel is a goldmine 😂 Nothing new though, as this is not the first course I have seen here. This course is going to be very helpful for me. Thank you for the work you guys are putting into teaching people like me. P.S. Double thanks for the Next.js course as well. It was very helpful.
@quincylarsonmusic 3 years ago
Thanks!
@muhammadaarizmarzuq295 2 years ago
ok sub bot
@quincylarsonmusic 2 years ago
@muhammadaarizmarzuq295 I am not a bot.
@AungBaw 3 years ago
Personally like the style of few slides, no BS, no nothing Sir, straight to the coding. Strong work.
@vpundir3024 3 years ago
As a first viewer and a young coder, I love code camp. My age is 12, and the person in the picture is my dad, so don’t be confused
@geekyprogrammer4831 3 years ago
Nobody cares if you are 12.
@BattleOfficial_ 3 years ago
@geekyprogrammer4831 bro chill, sort your own problems out before hating on others.
@geekyprogrammer4831 3 years ago
@BattleOfficial_ No hate but revealing age doesn't pertain to the content.
@BattleOfficial_ 3 years ago
@geekyprogrammer4831 alright, but he's young and is excited by the video. Just move on; the kid's 12, no need to reply.
@trentonspears5304 3 years ago
Oohhh sweet! Machine Learning with Phil is awesome!
@ketchupparty9997 3 years ago
This was exactly what I wanted to learn. Thank you
@kbhaskar36 3 years ago
This channel is awesome. Its content and support are beyond words. Thank you so much for all the quality content, team.
@cescabhi 3 years ago
Wow, I just turned in my project with an actor critic algorithm THIS WEEK. -__- *cries
@prajyotkumar9644 3 years ago
good job....kinda gives me hope too. Ik its stupid....
@attilasarkany6123 9 months ago
Hi everyone. Could you recommend any paper or longer discussion about the limitations of actor-critic models for continuous spaces?
@Falconoo7383 2 years ago
Thank you for the awesome video. Can you please characterize all the DRL models? If possible.
@smitasingh9764 2 years ago
Thank you so much for putting in the effort to do the whole implementation, which is a bit easier to grasp than the paper. I am very new to RL and I have a rather weird question (no one has actually addressed it, so ignore it if I am being stupid). When you call the learn function for the first time after 20 steps, wouldn't new_probs be equal to old_probs, because the neural network hasn't learned anything yet? Would both values stay random for several iterations? And if they are random, how is the agent learning?
@kkyars 1 year ago
It is essentially strictly exploring at the beginning, and learning only comes into effect once the reward dynamics of the environment begin to affect the random values by increasing or decreasing the probabilities of actions
@tarifcemay3823 2 years ago
I thought prob_ratio must be equal to one if we replay the same action, as the actor is updated after the replay. Am I right?
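Both questions above concern the PPO probability ratio. The following is a minimal sketch, not the code from the video: it assumes a two-action softmax policy, a single stored sample with a positive advantage, and a finite-difference gradient in place of automatic differentiation. The names `clipped_surrogate` and `grad_logits` are hypothetical. It illustrates that the ratio is 1 on the very first gradient step, yet the gradient of the clipped objective is nonzero, so the policy still moves and the ratio departs from 1 on the next pass over the same stored data.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def clipped_surrogate(logits, old_prob, action, advantage, eps=0.2):
    """Return (ratio, PPO clipped objective) for one stored sample."""
    ratio = softmax(logits)[action] / old_prob
    clipped = min(max(ratio, 1.0 - eps), 1.0 + eps)
    return ratio, min(ratio * advantage, clipped * advantage)


def grad_logits(logits, old_prob, action, advantage, h=1e-6):
    """Central finite-difference gradient of the objective w.r.t. the logits."""
    grads = []
    for i in range(len(logits)):
        up = list(logits)
        up[i] += h
        down = list(logits)
        down[i] -= h
        f_up = clipped_surrogate(up, old_prob, action, advantage)[1]
        f_down = clipped_surrogate(down, old_prob, action, advantage)[1]
        grads.append((f_up - f_down) / (2 * h))
    return grads


# Untrained policy: both actions equally likely.
logits = [0.0, 0.0]
action, advantage = 0, 1.0  # assumed sample: action 0 turned out well
old_prob = softmax(logits)[action]  # probability stored at collection time

# First gradient step: new and old probabilities are identical.
ratio_before, _ = clipped_surrogate(logits, old_prob, action, advantage)
grads = grad_logits(logits, old_prob, action, advantage)

# One step of gradient ascent on the objective changes the policy.
learning_rate = 0.5
logits = [w + learning_rate * g for w, g in zip(logits, grads)]
ratio_after, _ = clipped_surrogate(logits, old_prob, action, advantage)

print("ratio on first step:", ratio_before)
print("gradient on first step:", grads)
print("ratio on second step:", ratio_after)
```

The second pass only sees a different ratio if the update takes more than one gradient step per stored batch, which is the usual PPO setup; with a single step per batch the ratio would indeed always be 1 and the clipping would never activate.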
@marohs5606 3 years ago
Woow 😍😍😍 thank you 👏👏👏👏👏
@vpundir3024 3 years ago
👍 great
@SnehaChendke-l2z 1 year ago
Are these methods suggested for NLP tasks such as text classification?
@lakshmichaitanya1316 3 years ago
We need a new vue.js course!
@mohamednasrel-dinazouzmoha8210 3 years ago
Why are there no subtitles accompanying this video?
@saifelshasly973 3 years ago
Hello my brother, I am an Arab
@2minuteschool929 2 years ago
Thanks
@playerxgaming2377 3 years ago
Super
@ayarzuki 3 years ago
Please turn on the automatic English subtitles. I am not a native English speaker
@codinghacker822 3 years ago
What is coding?
@technicalbro8409 3 years ago
*This channel is pure bitcoin*
@vanvothe4817 3 years ago
Simple man with vanilla Vim
@AbdulMunim-kx6np 3 years ago
What does the actor-critic method mean?
@MachineLearningwithPhil 3 years ago
It means we're using neural networks to do two separate things: decide what to do (the actor) and decide whether or not that action was valuable (the critic). The two networks help each other learn how to select the most profitable actions over time.
@AbdulMunim-kx6np 3 years ago
@MachineLearningwithPhil that is interesting
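As a rough illustration of the actor/critic split described in this thread, the sketch below swaps the two neural networks for the simplest possible stand-ins: a list of action preferences as the actor and a single number as the critic. The task (one state, two actions, fixed rewards of 0 and 1) and the function name `train` are assumptions made for the example and do not come from the course.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into action probabilities."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train(steps=2000, actor_lr=0.1, critic_lr=0.1, seed=0):
    """One-step actor-critic on a hypothetical two-action, one-state task."""
    rng = random.Random(seed)
    rewards = [0.0, 1.0]  # assumed rewards: action 1 always pays more
    prefs = [0.0, 0.0]    # actor: decides what to do
    value = 0.0           # critic: estimates how good the state is

    for _ in range(steps):
        probs = softmax(prefs)
        action = rng.choices([0, 1], weights=probs)[0]
        reward = rewards[action]

        # Each episode ends after one action, so the target is just the reward.
        # The critic's surprise (TD error) says whether the action beat expectations.
        td_error = reward - value

        # Critic update: move the value estimate toward what was observed.
        value += critic_lr * td_error

        # Actor update: raise the chosen action's preference when the critic
        # was pleasantly surprised, lower it when it was disappointed.
        for a in range(len(prefs)):
            indicator = 1.0 if a == action else 0.0
            prefs[a] += actor_lr * td_error * (indicator - probs[a])

    return softmax(prefs), value


probs, value = train()
print("action probabilities:", probs)
print("critic value estimate:", value)
```

The critic never picks actions and the actor never predicts rewards; the only thing they share is the TD error, which is what lets each one improve the other.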
@mrrishiraj88 3 years ago
👍🙏
@tharunkumar5512 1 year ago
I have an actor critic algorithm. I want you to implement the python code. I will pay for that.
@StephenRayner 3 years ago
Semiconductor physics!
@muhammadaarizmarzuq295 2 years ago
wish i found this earlier UwU
@ananyobratapal5521 3 years ago
First like!
@saiiyengar1946 3 years ago
I thought it was @wojespn for a second
@kkyars 1 year ago
This is just a compilation of pre-existing videos; this should have been clarified
@captaincomputer5891 3 years ago
First here.
@playerxgaming2377 3 years ago
600 like
@Kairos_91 3 years ago
호엥
@antianti4331 2 years ago
Useless stuff, no theoretical explanation provided at all. No sense to learn to code if you don't understand things on paper.
@iamr0b0tx 10 months ago
Thanks
@fandre1234 3 years ago
Thanks!