Reinforcement Learning from scratch

  Рет қаралды 78,523

Graphics in 5 Minutes

Graphics in 5 Minutes

Күн бұрын

Пікірлер: 53
@darthvader4899
@darthvader4899 8 ай бұрын
this is video is super underrated. In fact the whole channel is underrated.
@william_8844
@william_8844 4 ай бұрын
Maybe i should follow the channel then 😅. This was my first vid, and the explanation was really well simplified
@themathguy3149
@themathguy3149 Жыл бұрын
Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!
@tushargupta1999
@tushargupta1999 9 ай бұрын
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
@ashketchum1244
@ashketchum1244 Жыл бұрын
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
@limeducky0209
@limeducky0209 11 күн бұрын
This was so much easier to understand than the other RL videos that came up when I searched this topic
@jameslibby5215
@jameslibby5215 Жыл бұрын
Very very underrated channel
@benc7910
@benc7910 11 ай бұрын
Underrated, two Rs
@jameslibby5215
@jameslibby5215 11 ай бұрын
@@benc7910 thank ya sir
@Arivan_Abdulla
@Arivan_Abdulla 4 ай бұрын
Too beautiful you can watch this kind of videos all the day without get bored
@metaljacket8102
@metaljacket8102 8 ай бұрын
This is really awsome! It's the best video that explains DRL in such an easy to understand way!
@Bet-s4g
@Bet-s4g 3 ай бұрын
This is super underrated video
@mind6861
@mind6861 6 ай бұрын
Can we have the code for this
@poopcoder468
@poopcoder468 Ай бұрын
Lol😅😅😅😅😅😅
@CptDoge-rn3ou
@CptDoge-rn3ou Жыл бұрын
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
@themax2go
@themax2go 9 ай бұрын
agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
@cloudysh
@cloudysh 8 ай бұрын
This was so surprisingly great :3
@a.aspden
@a.aspden Жыл бұрын
Your videos are great. Looking forward to more!
@Sumpydumpert
@Sumpydumpert 6 ай бұрын
I agree once you see how it all works it seems like 1s and zeros give me some feed back on r/grand unified theory or cosmo knowledge
@marcinstrzesak346
@marcinstrzesak346 Жыл бұрын
Great video, very helpful, easy to understand.
@moldo800
@moldo800 11 ай бұрын
Excellent. Congratulations ❤
@swannschilling474
@swannschilling474 5 ай бұрын
Thanks a lot for this one! 😊
@luiseduardocraizer7416
@luiseduardocraizer7416 6 ай бұрын
Excellent content!
@mohajeramir
@mohajeramir 8 ай бұрын
Excellent
@gmjammin4367
@gmjammin4367 Жыл бұрын
Amazing video as always :)!
@jdlopes06
@jdlopes06 5 ай бұрын
Thank you!
@jaideepraulji1395
@jaideepraulji1395 4 ай бұрын
Superb
@mado.madeleine
@mado.madeleine Жыл бұрын
Super helpful! Thank you 🙏🏽
@anthonyortiz7924
@anthonyortiz7924 3 ай бұрын
What a great series! I have a question for the experts... was it necessary to map velocity as an input? I'm guessing it's not absolutely necessary and was done to make the training faster? My guess is based on the assumption that the timing of the ball x/y changes to the inputs have an effect, but I may be wrong.
@BlueBirdgg
@BlueBirdgg Жыл бұрын
Can you playlist each one of your topics plz? I wanted to post on Twitter(X) your video topics but could only post a single video at a time. Great content by the way. Ty very much. Your perspective on some topics helped me a lot to get a more intuitive understanding.
@g5min
@g5min Жыл бұрын
Good idea! Here's one on generative AI: kzbin.info/aero/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo Here's one on reinforcement learning kzbin.info/aero/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL Here's one on LLMs + text-to-image kzbin.info/aero/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu
@BlueBirdgg
@BlueBirdgg Жыл бұрын
@@g5min Ty!
@n4mmenam
@n4mmenam Жыл бұрын
Brilliant
@nikbivation
@nikbivation Жыл бұрын
thank you for this!
@ireoluwaTH
@ireoluwaTH Жыл бұрын
Thank you!!!
@NR_5tudio
@NR_5tudio Ай бұрын
i just have a quastion, what is that thing ? 6:20 its like a worm ? like. i didnt take it in my math class.... im 16 years btw i mean the one u added
@william_8844
@william_8844 4 ай бұрын
I get how the model can see moves and output up or down action. But I don't get how model tracks the score for rewards etc Can someone explain how the reward is fed into model
@maxim_ml
@maxim_ml 7 ай бұрын
that was good
@edvinbeqari7551
@edvinbeqari7551 11 ай бұрын
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
@bombur9007
@bombur9007 8 ай бұрын
how many layers should such network have
@axe863
@axe863 Жыл бұрын
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
@mineq4967
@mineq4967 8 ай бұрын
but by what number do you change the weights like you never told us
@derp__king6144
@derp__king6144 8 күн бұрын
Facing the same problem
@nischalyou
@nischalyou Жыл бұрын
whats the name of this video game ?
@gaydemaupassant6263
@gaydemaupassant6263 6 ай бұрын
Pls o want the code plsss
@FRANKONATOR123
@FRANKONATOR123 Жыл бұрын
Can you share the source code for this project
@g5min
@g5min Жыл бұрын
You can follow the link to the Karpathy site at the end of the video, repeated here: karpathy.github.io/2016/05/31/rl/
@herikaniugu
@herikaniugu Жыл бұрын
Imagine using reinforcement learning in quantitative finance 😊
@macratak
@macratak Жыл бұрын
ah yes, reinforcement learning. a fundamental computer graphics technology
@g5min
@g5min Жыл бұрын
I think that character/game-AI is pretty central to graphics
@pw7225
@pw7225 Жыл бұрын
Why so negative?
@revimfadli4666
@revimfadli4666 Жыл бұрын
​@@g5minespecially AI image generation or processing nowadays
Reinforcement Learning:  AlphaGo
8:14
Graphics in 5 Minutes
Рет қаралды 19 М.
An introduction to Reinforcement Learning
16:27
Arxiv Insights
Рет қаралды 664 М.
Quando A Diferença De Altura É Muito Grande 😲😂
00:12
Mari Maria
Рет қаралды 34 МЛН
Thank you Santa
00:13
Nadir Show
Рет қаралды 59 МЛН
Don’t Choose The Wrong Box 😱
00:41
Topper Guild
Рет қаралды 51 МЛН
黑天使被操控了#short #angel #clown
00:40
Super Beauty team
Рет қаралды 53 МЛН
The Man Who Solved the $1 Million Math Problem...Then Disappeared
10:45
MIT 6.S191: Reinforcement Learning
1:00:19
Alexander Amini
Рет қаралды 61 М.
Large Language Models from scratch
8:25
Graphics in 5 Minutes
Рет қаралды 352 М.
AI Learns Insane Monopoly Strategies
11:30
b2studios
Рет қаралды 10 МЛН
Meet Willow, our state-of-the-art quantum chip
6:39
Google Quantum AI
Рет қаралды 709 М.
The Most Important Algorithm in Machine Learning
40:08
Artem Kirsanov
Рет қаралды 536 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 1,4 МЛН
The Man Who Solved the World’s Most Famous Math Problem
11:14
Newsthink
Рет қаралды 1,1 МЛН
Reinforcement Learning Series: Overview of Methods
21:37
Steve Brunton
Рет қаралды 104 М.
Quando A Diferença De Altura É Muito Grande 😲😂
00:12
Mari Maria
Рет қаралды 34 МЛН