Reinforcement Learning from scratch

  Рет қаралды 28,648

Graphics in 5 Minutes

Graphics in 5 Minutes

Күн бұрын

How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.
0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost

Пікірлер: 36
@darthvader4899
@darthvader4899 Ай бұрын
this is video is super underrated. In fact the whole channel is underrated.
@themathguy3149
@themathguy3149 6 ай бұрын
Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!
@metaljacket8102
@metaljacket8102 22 күн бұрын
This is really awsome! It's the best video that explains DRL in such an easy to understand way!
@tushargupta1999
@tushargupta1999 Ай бұрын
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
@a.aspden
@a.aspden 7 ай бұрын
Your videos are great. Looking forward to more!
@ashketchum1244
@ashketchum1244 8 ай бұрын
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
@marcinstrzesak346
@marcinstrzesak346 7 ай бұрын
Great video, very helpful, easy to understand.
@gmjammin4367
@gmjammin4367 8 ай бұрын
Amazing video as always :)!
@moldo800
@moldo800 3 ай бұрын
Excellent. Congratulations ❤
@mado.madeleine
@mado.madeleine 8 ай бұрын
Super helpful! Thank you 🙏🏽
@jameslibby5215
@jameslibby5215 7 ай бұрын
Very very underrated channel
@benc7910
@benc7910 3 ай бұрын
Underrated, two Rs
@jameslibby5215
@jameslibby5215 3 ай бұрын
@@benc7910 thank ya sir
@nikbivation
@nikbivation 8 ай бұрын
thank you for this!
@cloudysh
@cloudysh 22 күн бұрын
This was so surprisingly great :3
@themax2go
@themax2go Ай бұрын
agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
@ireoluwaTH
@ireoluwaTH 8 ай бұрын
Thank you!!!
@mohajeramir
@mohajeramir 16 күн бұрын
Excellent
@CptDoge-rn3ou
@CptDoge-rn3ou 6 ай бұрын
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
@kniv0gaffel
@kniv0gaffel 6 ай бұрын
Brilliant
@solveigberling1662
@solveigberling1662 Ай бұрын
That was dope
@BlueBirdgg
@BlueBirdgg 8 ай бұрын
Can you playlist each one of your topics plz? I wanted to post on Twitter(X) your video topics but could only post a single video at a time. Great content by the way. Ty very much. Your perspective on some topics helped me a lot to get a more intuitive understanding.
@g5min
@g5min 8 ай бұрын
Good idea! Here's one on generative AI: kzbin.info/aero/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo Here's one on reinforcement learning kzbin.info/aero/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL Here's one on LLMs + text-to-image kzbin.info/aero/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu
@BlueBirdgg
@BlueBirdgg 8 ай бұрын
@@g5min Ty!
@edvinbeqari7551
@edvinbeqari7551 3 ай бұрын
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
@bombur9007
@bombur9007 23 күн бұрын
how many layers should such network have
@nischalyou
@nischalyou 7 ай бұрын
whats the name of this video game ?
@mineq4967
@mineq4967 Ай бұрын
but by what number do you change the weights like you never told us
@axe863
@axe863 5 ай бұрын
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
@macratak
@macratak 8 ай бұрын
ah yes, reinforcement learning. a fundamental computer graphics technology
@g5min
@g5min 8 ай бұрын
I think that character/game-AI is pretty central to graphics
@pw7225
@pw7225 8 ай бұрын
Why so negative?
@revimfadli4666
@revimfadli4666 8 ай бұрын
​@@g5minespecially AI image generation or processing nowadays
@FRANKONATOR123
@FRANKONATOR123 8 ай бұрын
Can you share the source code for this project
@g5min
@g5min 8 ай бұрын
You can follow the link to the Karpathy site at the end of the video, repeated here: karpathy.github.io/2016/05/31/rl/
@herikaniugu
@herikaniugu 6 ай бұрын
Imagine using reinforcement learning in quantitative finance 😊
Reinforcement Learning:  AlphaGo
8:14
Graphics in 5 Minutes
Рет қаралды 8 М.
An introduction to Reinforcement Learning
16:27
Arxiv Insights
Рет қаралды 636 М.
ВИРУСНЫЕ ВИДЕО / Виноградинка 😅
00:34
Светлый Voiceover
Рет қаралды 7 МЛН
BRAWLER MUTATIONS WILL BREAK THE GAME! - Brawl Talk
09:34
Brawl Stars
Рет қаралды 25 МЛН
Final muy inesperado 😨
01:00
Juan De Dios Pantoja
Рет қаралды 49 МЛН
NO NO NO YES! (Fight SANTA CLAUS) #shorts
00:41
PANDA BOI
Рет қаралды 54 МЛН
The Most Important Algorithm in Machine Learning
40:08
Artem Kirsanov
Рет қаралды 147 М.
A. I. Learns to Play Starcraft 2 (Reinforcement Learning)
17:42
Reinforcement Learning, by the Book
18:19
Mutual Information
Рет қаралды 66 М.
Neural Networks Explained from Scratch using Python
17:38
Bot Academy
Рет қаралды 307 М.
MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)
1:07:30
How ChatGPT is Trained
13:43
Ari Seff
Рет қаралды 513 М.
ВИРУСНЫЕ ВИДЕО / Виноградинка 😅
00:34
Светлый Voiceover
Рет қаралды 7 МЛН