Reinforcement Learning: Machine Learning Meets Control Theory

  Рет қаралды 246,962

Steve Brunton

Steve Brunton

Күн бұрын

Reinforcement learning is a powerful technique at the intersection of machine learning and control theory, and it is inspired by how biological systems learn to interact with their environment. In this video, we provide a high level overview of reinforcement learning, along with leading algorithms and impressive applications.
Citable link for this video: doi.org/10.52843/cassyni.x2t0sp
@eigensteve on Twitter
eigensteve.com
databookuw.com
%%% CHAPTERS %%%
0:00 Introduction
3:34 Reinforcement Learning Overview
7:30 Mathematics of Reinforcement Learning
12:32 Markov Decision Process
13:33 Credit Assignment Problem
15:38 Optimization Techniques for RL
18:54 Examples of Reinforcement Learning
21:50 Q-Learning
23:53 Hindsight Replay

Пікірлер: 247
@ronniec8805
@ronniec8805 3 жыл бұрын
Steve is a phenomenal lecturer, isn't he?
@cnbrksnr
@cnbrksnr 3 жыл бұрын
never seen a better one
@uforskammet
@uforskammet 3 жыл бұрын
very much so
@flybekvc
@flybekvc 3 жыл бұрын
He is!
@mail2cmin
@mail2cmin 3 жыл бұрын
Yessss
@reocam8918
@reocam8918 3 жыл бұрын
no, he is the most phenomenal one!! Respect
@whasuklee
@whasuklee 3 жыл бұрын
*"WELCOME BACK"*
@steven-bt7ud
@steven-bt7ud 3 жыл бұрын
I wish i knew this channel at the start of quarantine
@hysenndregjoni853
@hysenndregjoni853 3 жыл бұрын
I found about the channel just as quarantine had started. It was quite the treat.
@christiankraghjespersen994
@christiankraghjespersen994 3 жыл бұрын
I still have no idea as to who could possibly dislike these videos
@phaZZi6461
@phaZZi6461 3 жыл бұрын
u
@devashishbose1521
@devashishbose1521 3 жыл бұрын
@@phaZZi6461 I wanted to add a comment but 69 looks so good \
@sankalp1391
@sankalp1391 3 жыл бұрын
Would love for a full series on how can we use RL to control real world dynamical systems!
@ethanspastlivestreams
@ethanspastlivestreams 3 жыл бұрын
Viewing reinforcement learning as time delayed supervised learning is a really good way of looking at it.
@JousefM
@JousefM 3 жыл бұрын
Indeed!
@baseljamal8907
@baseljamal8907 3 жыл бұрын
I just cannot express how grateful I am to prof Steve Brunton for posting these videos. Waking up at 6am to watch him explain is the most satisfying thing ever. Thank you! We all are grateful.
@sistemasecontroles
@sistemasecontroles 3 жыл бұрын
Great channel! Please record more videos on the edge of reinforcement learning and control theory. Congrats on your work.
@thiagocesarlousadamarsola3990
@thiagocesarlousadamarsola3990 3 жыл бұрын
This sweet spot between control theory and machine learning definitely interests me, especially applied to astrodynamical systems. Please, continue making these videos, Professor Brunton!
@AW_tuber
@AW_tuber 2 жыл бұрын
The lecture was very well constructed. Well done! As an electrical engineering student trying to specialize in ML I find that you really hit the mark when it comes to putting these though and convoluted topics together with examples.
@LuisGonzalez-gz1qg
@LuisGonzalez-gz1qg 3 жыл бұрын
Never clicked a video that fast 😆. Great content prof as always love it!
@s3pi0n
@s3pi0n 2 жыл бұрын
This is THE BEST explanation on reinforcement learning over all the articles, books, or youtube videos, that I've seen so far. Period.
@ForeignChaos
@ForeignChaos 3 жыл бұрын
Is there something you dont know dude? You seem to be an expert on everything. You are such an inspiration.
@wizardOfRobots
@wizardOfRobots 3 жыл бұрын
Wow! I would love to see Prof take on RL topics!
@subhikshaniyer613
@subhikshaniyer613 Жыл бұрын
Every time he said "good", i felt appreciated for not giving up on a lecture whose subject is far, far away from mine and im pushing myself to try and learn the concept. thank you, steve.. much love.!
@nahidmahmud8234
@nahidmahmud8234 3 жыл бұрын
I am doing research on the Model-based RL for safety-critical systems; I really enjoy doing it. These are so cool. Thanks for making videos on this topic!
@kouider76
@kouider76 3 жыл бұрын
Simply great subject and excellent presentation thank you prof for all your efforts
@fzigunov
@fzigunov 3 жыл бұрын
Looks like I'm not the only one working on a video early in the morning! Really cool stuff, love the doggie!!
@ghostofhacker2818
@ghostofhacker2818 3 жыл бұрын
I just found out your channel and the contents you cover is a treasure to me!
@souravjha2146
@souravjha2146 2 жыл бұрын
I am binge watching this chanel from past 3 hours
@Spiegeldondi
@Spiegeldondi 3 жыл бұрын
I love how you emphasize the intersection between machine learning and control (theory). That's exactely what sparks my interest about reinforcement learning!
@Eigensteve
@Eigensteve 3 жыл бұрын
Glad you like it! I always found this connection fascinating and a very natural way to merge the two fields.
@SRIMANTASANTRA
@SRIMANTASANTRA 3 жыл бұрын
Hi Professor Steve, Lovely presentation.
@alistja4337
@alistja4337 3 жыл бұрын
Explained in an understandable way and RL nicely connected to control theory!
@diegoguisasola3858
@diegoguisasola3858 3 жыл бұрын
I really love your content, please keep spoiling us! These were the fastest 26 minutes! I learnt a lot and I'm looking forward to the python lab implementations of these concepts! Thank you very much for your work.
@sachinr3823
@sachinr3823 3 жыл бұрын
Waiting this topic from long time, your lectures are so clear. Thanks lot.
@Headbangnuker
@Headbangnuker 3 жыл бұрын
Just wanted to comment about how much I love these videos. Last year while applying for PhDs I was searching for passions. In a discussion with my friend (a computer scientist), I accidentally outlined genetic programming without knowing it. My friend told me so and I went researching. Found these videos and became enthralled. Now I have a PhD studentship in soft robotics and plan to use SINDy to help with modelling and control and honestly think that giving machines brains may be my future work too. Thanks Brunton, my passion was helped by your own.
@Eigensteve
@Eigensteve 3 жыл бұрын
That is amazing to hear! Helping people develop their passions is exactly why I do this!
@JousefM
@JousefM 3 жыл бұрын
Theeeere we go Steve! Waited for this :)
@riccardodelpozzo8683
@riccardodelpozzo8683 Жыл бұрын
phenomenal video, thank you
@msauditech
@msauditech 4 ай бұрын
That's an awesome video indeed. A great introduction to RL!
@terryliu3635
@terryliu3635 23 күн бұрын
Awesome lecture! Thanks Steve. I really enjoyed watching this!
@givemeArupee
@givemeArupee 3 ай бұрын
Steven lectures are great help to the society ❤
@cuongnguyentranhuu4616
@cuongnguyentranhuu4616 2 жыл бұрын
you have created such high quality content that i just really enjoy watching it instead of playing games :)))
@Voke
@Voke 3 жыл бұрын
Great video! If everyone was as great on KZbin as your delivery we would have a lot more passion in the area. Keep up the good work, train on!
@RasitEvduzen
@RasitEvduzen 3 жыл бұрын
Professor you're awesome. My thesis topic is, deep reinforcement learning based robotic arm torque control. I love control theory and machine learning. Thx for your support.
@arrahul316
@arrahul316 Жыл бұрын
Amazing Clarity
@sounghwanhwang5422
@sounghwanhwang5422 2 жыл бұрын
the fantastic lecture that I've ever seen...
@focusonlife3242
@focusonlife3242 2 жыл бұрын
Dude, you are the best lecturer. DONE
@carlphilip4393
@carlphilip4393 2 жыл бұрын
Dear Steve, Im very, very grateful that I get to watch such extraordinary instructive videos for free!!! Thinking that elsewhere in the world people are killing others atm (as in Kabul), it gives me a lot of hope seeing how people like you just make the world a little better and allmost brings tears into my eyes. You have such great talent in teaching, thank you!
@timanb2491
@timanb2491 2 жыл бұрын
it's brilliant ! . Keep working with this topic please
@mertbozkir
@mertbozkir 2 жыл бұрын
Perfect video, I will watch all other at one time 😍
@leopardus4712
@leopardus4712 3 жыл бұрын
Keep up the good work, love your videos
@chrisogonas
@chrisogonas 2 жыл бұрын
Very well illustrated! Thanks
@merv893
@merv893 Жыл бұрын
How very mean, I was looking forward to see the trial 7 right away. Great explaining. Thanks
@Kong9901
@Kong9901 3 жыл бұрын
That's so interesting and well explained. Thank you !
@Eigensteve
@Eigensteve 3 жыл бұрын
Glad you liked it!
@marco_gallone
@marco_gallone 3 жыл бұрын
I’ve been following your content for a at least 4 years now! It’s the reason I am a robotics control engineer now, you pulled me through 4th year control systems with your conveniently-timed boot camp. Please keep up the great content! PS are you accepting PhD students?
@aniruddhadatta925
@aniruddhadatta925 3 жыл бұрын
Amazing feeling to watch a video After completing a project on the same topic
@aiahmed608
@aiahmed608 2 жыл бұрын
Thank you, professor!
@pierredubois8715
@pierredubois8715 3 жыл бұрын
Thank you so much for this lecture. I really enjoy your videos, this is helpful as a PhD student. I also bought your book "Data-driven science and engineering" which have nice explanations for the tools I use. Keep on this awesome work! Greetings from France!
@TheRestalyn
@TheRestalyn Жыл бұрын
love your lectures
@subramaniannk3364
@subramaniannk3364 3 жыл бұрын
Yay! Hero has decided to teach Reinforcement Learning
@TheProblembaer2
@TheProblembaer2 5 ай бұрын
This is really really great teaching.
@tai94bn
@tai94bn Жыл бұрын
It's really interesting to watch this video, although I have also studied and read it a few times, its boredom is hard to describe. thank you teacher
@theclassoftorchia3856
@theclassoftorchia3856 Жыл бұрын
Hi, Steve. I've been working on Fluid Mechanics 25 years or so. Always using experimental and some analytical tools to approach the subject. I had a lot colleagues migrating to CFD back in the 2000s because these methods seem to find valid results with "little" effort in comparison to expensive, frustrating and time-consuming experiments. So I always disregarded CFD as nice tool that could predict a lot of stuff that you will never know if it is correct or not. However, I have to say, that from some time, reinforced (see what I did there?) by new material that I am studying and your papers on ML for Fluid Mechanics I am looking at the subject with new eyes. Thank you very much for your material and the dedication you put in every video.
@teegnas
@teegnas 3 жыл бұрын
As a CS grad student who took RL in the last semester ... this is truly the best refresher I have seen until now. Thanks a lot for uploading.
@Eigensteve
@Eigensteve 3 жыл бұрын
Great to hear!
@pellythirteen5654
@pellythirteen5654 2 жыл бұрын
Your series are excellent . They have a good pace and use powerful graphics to explain difficult concepts. I've watched many of your videos on my TV which doesn't allow me to give a thumbs up. See here it is. I am not a Python programmer , but I am sure that those watching who DO use Python must have itchy fingers.
@melvinlara6151
@melvinlara6151 3 жыл бұрын
I was waiting for this!!!
@francesco884
@francesco884 2 жыл бұрын
Thank you, professor Steve Brunton. I am pleased to inform you that I am considering to do, after my master degree in computer engineering, a PhD related to the Data Driven Control Theory subject and the merit in part is also your.
@Lucas_Lima606
@Lucas_Lima606 Жыл бұрын
thank you very much for your lesson, it is really useful to me!
@pardonchawatama941
@pardonchawatama941 2 жыл бұрын
Great lesson.. Thank you
@matejsuty5024
@matejsuty5024 2 жыл бұрын
Great video, thanks.
@fahimehjabbarinia401
@fahimehjabbarinia401 Жыл бұрын
the best one i have ever seen
@radhen171992
@radhen171992 3 жыл бұрын
I really like your videos. Keep up the good work! :)
@HD-qq3bn
@HD-qq3bn 3 жыл бұрын
I really like your explanation
@spencerhong4687
@spencerhong4687 2 жыл бұрын
Mr.Brunton saves me from my final review. His lectures made crystal clear those seemingly unfathomable terms. Just watched him videos for days and I already like him!
@spencerhong4687
@spencerhong4687 2 жыл бұрын
those bipedals are too cute they deserve another cmt
@HD-qq3bn
@HD-qq3bn 3 жыл бұрын
we also look forward to your explanation for GAN in the future
@pliniocastro1546
@pliniocastro1546 3 жыл бұрын
Kudos on the awesome lecture
@loopuleasa
@loopuleasa 3 жыл бұрын
top quality this is what they said about education on the internet that "the best teacher can teach everyone" this is that video for this topic
@thelazygardener9493
@thelazygardener9493 3 жыл бұрын
I've been seriously considering starting a degree in A.I./Machine learning but with videos of this quality available for free, it is hard to justify the cost. Subscribed and liked!
@thelazygardener9493
@thelazygardener9493 3 жыл бұрын
Just incase you read this and have time to reply... Do you have any suggestions for an education path to your level of understanding? There are degrees for data science, computer science, artificial intelligence, software engineering, etc. They all seem so inter-related. I want to know them all but I'm struggling to pick a starting point. My current level of related education is highschool level advanced maths and a year of teaching myself MQL4/5 and R code mostly from free resources online. Just so you know my starting point (or state haha).
@dr.-ing.shehzadhasan3387
@dr.-ing.shehzadhasan3387 3 жыл бұрын
You have a nice way of explaining the topics.
@Eigensteve
@Eigensteve 3 жыл бұрын
Thanks!
@pradiptahafid
@pradiptahafid Жыл бұрын
What I like is I don't pay for this knowledge. I was planning to take a data science certificate, but you know what. Let me spend 6 months learning by myself I have spent a solid 1 month only on your videos starting from SVD. it has been amazing. I love when a small thing builds up into a bigger thing. Soon I will make a sample project based on what I have learned from your video.
@5621346
@5621346 3 жыл бұрын
nice lecture sir! thanks a lot!
@technokratz5299
@technokratz5299 3 жыл бұрын
Excellent video ❤️
@Turcian
@Turcian 3 жыл бұрын
I think it's also important to mention the distinction between discrete and continuous action spaces.
@minglee5164
@minglee5164 2 жыл бұрын
RL can be interpreted from this perspective, amazing
@tabindahayat3492
@tabindahayat3492 2 жыл бұрын
I love u, Steve! I have been currently working on Machine Teaching and Project Bonsai. I really needed to know this.
@car0lm1k3
@car0lm1k3 3 жыл бұрын
i have been trying to teach my guys that machine learning and control theory (fuzzy autotuning) is the same principle. This video will be used!
@HassanKhan-cs8ho
@HassanKhan-cs8ho 3 жыл бұрын
Legendary!!!!
@carolineseffen7458
@carolineseffen7458 Ай бұрын
Thank you ❤
@SiriGadipudi
@SiriGadipudi Жыл бұрын
All of your lecture series are very good and very helpful. A series on convex optimization problems would be good. Any thoughts about it?
@hudhuduot
@hudhuduot 3 жыл бұрын
Steve is one of the gifted teachers. I wish you can guide postgraduate to make a good publication in control and learning by highlighting the hot topics and promising research aspects.
@Eigensteve
@Eigensteve 3 жыл бұрын
Thanks so much!
@be_milad
@be_milad Жыл бұрын
I wish my teachers had seen your videos before trying to teach us these subjects :)
@MichaelEvans-yq7xj
@MichaelEvans-yq7xj 3 жыл бұрын
Hi Steve. I am an amateur mathematician (hoping to go pro) who is really into category theory. Have you or your team ever looked at this? Usually, when you see two subjects talking about the same thing, it's a good bet that category theory is working in the background. And I just looked at category theorist Tai-Danae Bradley and her explanation of SVD in terms of category theory. Thanks! AWESOME CHANNEL!
@ramanikrishnamurthy7086
@ramanikrishnamurthy7086 3 жыл бұрын
I thought I was witnessing a breakthrough concept trying to link deterministic control theory with machine learning. But, when you mentioned the words probability and policy, I was disappointed. Looking forward to more conceptual lectures. Could also highlight real world applications. Thanks.
@Anon_life
@Anon_life 3 жыл бұрын
Thank you!
@hazemahmed8333
@hazemahmed8333 3 жыл бұрын
amazing work as usual !! ... could you please consider doing a lecture about whole body control for robotics?
@JoeM370
@JoeM370 5 ай бұрын
The essence of this content is profoundly influential. A book with akin messages was transformative. "Game Theory and the Pursuit of Algorithmic Fairness" by Jack Frostwell
@elsabassi2533
@elsabassi2533 Жыл бұрын
hey Steve, love your videos! Wondering if the videos in this playlist are in the correct order?
@youtubeenjoyer199
@youtubeenjoyer199 Жыл бұрын
Love the videos
@usmleck7000
@usmleck7000 3 жыл бұрын
Amazing 🤩
@praharaj2007
@praharaj2007 3 жыл бұрын
Thanks Steve.
@Eigensteve
@Eigensteve 3 жыл бұрын
You are very welcome!
@tytuer
@tytuer 2 жыл бұрын
After watching these videos I have actually understood the concept of reinforcement learning. I might be wrong but to me it seems it generalizes the feedback loops into more abstract concepts of agent action policy environment etc. In a feedback loop we have control policy which is a PID controller that controls the behaviour of a plant it is attached to. The model of the plant is environment here and the action is taken by the output of the PID controller. The reward in feedback loop is to converge to desired output value at the steady state by ignoring its transition time values, so it is in a sense a semi supervised learning. The states in the feedback loop is derivative components of the system. In noisy systems, sometimes it is crucial to remove derivative component to avoid impulsive behaviour which corresponds to state feedback from environment to agent in RL. By thinking like this, RL is more meaningful to me as an engineer, that RL is a generalized feedback system where we try to get a desired output given some input to the system. Thank you for these video series!!
@user-eh5zk5bb9k
@user-eh5zk5bb9k 2 жыл бұрын
Nice!
@gama3181
@gama3181 3 жыл бұрын
Wow! Thank you so much. Maybe the next lecture can be about UMAP please :D?
@lazyoneswapples2962
@lazyoneswapples2962 Жыл бұрын
A very well done lecture. Bravo! I'd like to make a suggestion, if I may, to modify the Policy function as pi(s,a) = Pr(A = a, S = s); A is the place holder for an action, and a is the actions of taking; S is the place holder for the state and s is the given state.
@TomOtero1984
@TomOtero1984 3 жыл бұрын
Glad this content is on KZbin -- the past year kind of derailed me going to grad school. Question, could the reward structure of a chess game be broken into incremental steps? As in, the main reward is to win, but couldn't a game be discretized into incremental rewards defined by the value of a target and the probability that a sequence of moves would capture a high value target? Or, is that just Q-Learning in different words?
@carriefu458
@carriefu458 2 жыл бұрын
Prof Brunton: You are one bad-ass teacher!!!🤓
@combinedmathswithpraveen144
@combinedmathswithpraveen144 3 жыл бұрын
It would be an honor to be supervised for a PhD by him.
@jazonsamillano
@jazonsamillano Жыл бұрын
This guy is super smart!
@qpellidomombre
@qpellidomombre Жыл бұрын
Awesome
@ajj7794
@ajj7794 2 жыл бұрын
just a question, i am viewing the Control boot camp playlist, so it goes from control dynamic systems with non minimum phase to control theory and convid 19 to reinforcement learning. is this the correct way of viewing the videos? i feel there was for to talk about on the previous topics maybe i am wrong. Thanks to you/your team for all your amazing videos. i finished my controls classes 3 years ago when i found ur videos, i have been going throught all your playlists and am loving it! Is there a specific job that requires/teaches these skills? the closest job to this i found were graduate automation engineer .
Deep Reinforcement Learning: Neural Networks for Learning Control Laws
21:15
Reinforcement Learning Series: Overview of Methods
21:37
Steve Brunton
Рет қаралды 84 М.
когда одна дома // EVA mash
00:51
EVA mash
Рет қаралды 4,4 МЛН
NO NO NO YES! (Fight SANTA CLAUS) #shorts
00:41
PANDA BOI
Рет қаралды 56 МЛН
Which one will take more 😉
00:27
Polar
Рет қаралды 40 МЛН
КАК ГЛОТАЮТ ШПАГУ?😳
00:33
Masomka
Рет қаралды 2,1 МЛН
Everything You Need to Know About Control Theory
16:08
MATLAB
Рет қаралды 494 М.
Something Strange Happens When You Follow Einstein's Math
37:03
Veritasium
Рет қаралды 4,6 МЛН
Deep Reinforcement Learning for Fluid Dynamics and Control
17:35
Steve Brunton
Рет қаралды 41 М.
Robust Principal Component Analysis (RPCA)
22:11
Steve Brunton
Рет қаралды 67 М.
Интел подвинься, ARM уже в ПК!
14:06
PRO Hi-Tech
Рет қаралды 153 М.
Cách sửa này được không các bạn?
1:00
Cơ Khí Toàn Nghĩa
Рет қаралды 841 М.
Я Создал Новый Айфон!
0:59
FLV
Рет қаралды 1,3 МЛН
iPhone 15 в реальной жизни
20:03
HUDAKOV
Рет қаралды 678 М.