Рет қаралды 186
Go watch Dork Matter Girl! / @dorkmattergirl
Here are my Reinforcement Learning examples:
github.com/TedTinker/rl_example
github.com/TedTinker/rl_rnn_example
github.com/TedTinker/rl_ac_example
github.com/TedTinker/rl_sac_example
patreon.com/thinkstr
00:00 Meet the two of us!
01:32 Introduction
03:10 Reinforcement Learning
04:24 Cart Pole
06:15 Early Architecture
09:24 Bellman's Equation
13:50 Q-Maxing in Cartpole Part 1
17:14 This is a stupid game
19:40 Q-Maxing in Cartpole Part 2
21:52 Pseudocode
24:30 Hyper Parameters
25:35 Results
26:50 Recurrent Networks
31:50 Actor Critic (Pendulum)
34:34 Soft Actor and Entropy
40:44 Conclusion