Interesting example on 14:00. Watched your 2 series. Ty very much!
@monkeysareaproblem17437 ай бұрын
looks like the colab code is outdated step # setup the environment, and record a video every 50 episodes gives: AttributeError: module 'gym.wrappers' has no attribute 'Monitor' also a lot of deprecation warnings
@BlueBirdgg8 ай бұрын
Thank you for the classes! Incredible so far!
@ichaa3tech10 ай бұрын
This is seriously very underrated, the best
@bello313710 ай бұрын
You extract just the key things needed in sutton's book. I follow his book while following your videos and bunch others as well 😁 thank you
@SaschaRobitzki11 ай бұрын
Why is the action q* going from class3 to the pub 9.4?
@fatemehnorouzi172211 ай бұрын
I have watched several lectures about RL and you have been the best one
@bello313710 ай бұрын
same here.
@ichaa3tech10 ай бұрын
saaame
@mohammadnadeem839 Жыл бұрын
i am the 2k th person added in your subscriber list
@lovol2 Жыл бұрын
Thanks. Was good to hear there is lack of consistency in the notation used in papers as that was confusing!
@wajidiqbal5633 Жыл бұрын
thank you prof, for this elegant explanation....
@compsciorbust9562 Жыл бұрын
Incomplete list of timestamps: 0:00 - Introduction 3:01 - Installing Conda 13:25 - Using Torch (tensors) 23:00 - Using Torch (data parsing) 32:15 - Using backprop 52:00 (ish) - Using convolution
@ARREYAR Жыл бұрын
The best lectures on RL
@hayaquraan-p3o Жыл бұрын
it was perfect thank you so much
@xiaocenliu Жыл бұрын
thank you so much for not assuming students knowing everything 😭. You explain it so clear!
@jrohit1110 Жыл бұрын
Chapter 12 Sutton and Barto. This is what I was looking for. Thanks for beautiful the explanation!
@AymenSekhri-gw8wh Жыл бұрын
Thank you so much ! rich content for free.
@AymenSekhri-gw8wh Жыл бұрын
Good Explanation, thank you so much
@yuktikaura Жыл бұрын
Possible for you to share the latex template for the presentation?
@ichaa3tech9 ай бұрын
riight
@chanpreetsingh007 Жыл бұрын
thanx
@yuktikaura Жыл бұрын
How do we conclude that ->success: {stats/episode} would always be monotonically increasing? And how does it indicate convergence with value as 0.54?
@cwkx Жыл бұрын
Hi I can't remember what I said here, it was over 2 years ago - we go into depth on the convergence properties for this in our practicals, which are on my github: github.com/cwkx/materials/raw/main/reinforcement-learning/rl-answers4.pdf and github.com/cwkx/materials/blob/main/reinforcement-learning/rl-answers5.pdf etc
@yuktikaura Жыл бұрын
Thanks for a well explained topic.
@yuktikaura Жыл бұрын
Thank you for doing these videos... I found them really very helpful
@hom01 Жыл бұрын
Amazing lecture, thanks for uploading
@maryamfarajzadeh2262 Жыл бұрын
I really recommend this video! Just perfect! I wish POMDP was also well explained in the same way!
@GeneralKenobi69420 Жыл бұрын
sassy 💅
@danielji51842 жыл бұрын
good lecture
@agustinrovira29552 жыл бұрын
Great job !
@benmiss17672 жыл бұрын
Very very useful and informative thank you very much!
@rladndud17222 жыл бұрын
35:02 implicit networks
@alebadi2 жыл бұрын
You started not knowing what you are talking about.
@phenixzhang42242 жыл бұрын
As a newcomer who has already learned pytoch, this class has allowed me to consolidate the actual combat practice of torch and understand the magic of vision. In addition, I would like to ask Willcocksr: How should we learn more about using pytoch to build our own network?
@aliheidary3552 жыл бұрын
could you speak louder in your next video
@BlissfulBasilisk2 жыл бұрын
On Langevin Dynamics in Machine Learning - Michael I. Jordan (Video Link): kzbin.info/www/bejne/h4XRm6Saraxsb8U&ab_channel=InstituteforAdvancedStudy
@bing67402 жыл бұрын
The example of car accident got me
@malathreayad2 жыл бұрын
Excellent explanation, well done
@AhamedKabeer-wn1jb2 жыл бұрын
Thank you..
@엄의섭2 жыл бұрын
By far one of the most thorough and helpful explanations I've encountered! THANKS!
@shahardagan15842 жыл бұрын
Hi great lectures! i would like to know if you can recommend me more courses and resources to advance in the field
@cwkx2 жыл бұрын
I'd just recommend just getting into a habit of reading the latest papers from the top venues such as ICLR, NeurIPS and CVPR when they've been reviewed, e.g. find a sorted ranked list of the best papers and read the abstracts and Ctrl+F any terms interesting to you - e.g. tanelp.github.io/neurips2021/ and papers.labml.ai/papers/iclr_2022?sort_by=conference_score&dsc=0
@Manishkumar-ww4gm3 жыл бұрын
Very nice explanation sir.Thank you
@amirmahdikhosrvitabrizi75163 жыл бұрын
It was amazing, this lecture has made my life much easier. thank you.
@Jannls3 жыл бұрын
Your video helped me a lot! Very informative and easy to understand. Thank you!
@joaoborges20143 жыл бұрын
Amazing lecture, thank you so much!
@saharrahimimalakshan54853 жыл бұрын
It was an amazing video, you explained the issues in the best way. Thank you
@phenixzhang42243 жыл бұрын
Recently I followed the teacher’s second class on and off, involving some basic theoretical knowledge related to backpropagation. The main difficulty is still in the formula described in English, but I can feel the improvement of listening. I also manually offline deduced the back-propagation algorithm and implemented it in python, keep going!
@AECTechJourneys3 жыл бұрын
This is gem. Great content.
@johnnassour3 жыл бұрын
Would you please check the calculation for 0.34? It is 0.3125 in my calculation. Thank you.
@cwkx3 жыл бұрын
0.34375 = 0.25*0.0625 (left) + 0.25*0.0625 (up) + 0.25*0.25 (down) + 0.25*1 (right) - you can see this clearly if you go to the colab notebook in the comment, then in the Policy evaluation section, where it first says "# evaluate this policy ... V = policy_evaluation(env,policy,draw=False)" change draw=True and you'll see all the intermediate steps and you can see 0.34375.
@johnnassour3 жыл бұрын
@@cwkx so you use the updated valves in the same episode. I thought they will be used in the next episode.
@harrivayrynen Жыл бұрын
@@johnnassour Yes there must be mistake in video's calculations. If you look algorithm, the array is updated when all states are gone through. But still very good video series, thanks for that.
@phenixzhang42243 жыл бұрын
Deep learning is indeed developing faster and faster, which requires us to understand it in essence, including biological perspectives, historical perspectives, etc. I hope I can build a more systematic understanding of deep learning through this course. By the way, Teacher Willcocks's English expression is very fluent, and it doesn't sound particularly strenuous.
@Dian87barry3 жыл бұрын
Hello Dear Sir Interesting Video, Please did it possible to have the code for this video
@cwkx3 жыл бұрын
Hi Mamadou, all code where available are in the video descriptions (Colab links).
@Dian87barry3 жыл бұрын
@@cwkx thank you
@alialtan81823 жыл бұрын
Hi there,big fan! I learned that you also had expertise in security, is there any chance you will consider teaching them in youtube ?
@cwkx3 жыл бұрын
Hi Ali, many thanks for the kind comment - unfortunately I don't think I can get permission for this due to the sensitive nature of some of the security materials/discussions/exploits/real-world stories not covered in the slides.