Meeting Dork Matter Girl about Reinforcement Learning!

  Рет қаралды 186

Thinkstr

Thinkstr

Күн бұрын

Go watch Dork Matter Girl! / @dorkmattergirl
Here are my Reinforcement Learning examples:
github.com/TedTinker/rl_example
github.com/TedTinker/rl_rnn_example
github.com/TedTinker/rl_ac_example
github.com/TedTinker/rl_sac_example
patreon.com/thinkstr
00:00 Meet the two of us!
01:32 Introduction
03:10 Reinforcement Learning
04:24 Cart Pole
06:15 Early Architecture
09:24 Bellman's Equation
13:50 Q-Maxing in Cartpole Part 1
17:14 This is a stupid game
19:40 Q-Maxing in Cartpole Part 2
21:52 Pseudocode
24:30 Hyper Parameters
25:35 Results
26:50 Recurrent Networks
31:50 Actor Critic (Pendulum)
34:34 Soft Actor and Entropy
40:44 Conclusion

Пікірлер
Fool-proof RNN explanation | What are RNNs, how do they work?
16:05
How AI Discovered a Faster Matrix Multiplication Algorithm
13:00
Quanta Magazine
Рет қаралды 1,4 МЛН
Just try to use a cool gadget 😍
00:33
123 GO! SHORTS
Рет қаралды 85 МЛН
터키아이스크림🇹🇷🍦Turkish ice cream #funny #shorts
00:26
Byungari 병아리언니
Рет қаралды 26 МЛН
버블티로 체감되는 요즘 물가
00:16
진영민yeongmin
Рет қаралды 76 МЛН
Why Does Scrum Make Programmers HATE Coding?
16:14
Thriving Technologist
Рет қаралды 496 М.
Let's try Reinforcement Learning
8:28
Thinkstr
Рет қаралды 259
Deep Q Learning is Simple with PyTorch | Full Tutorial 2020
38:55
Machine Learning with Phil
Рет қаралды 99 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Agile & Scrum Don't Work | Allen Holub In The Engineering Room Ep. 9
1:12:35
Continuous Delivery
Рет қаралды 109 М.
The Attention Mechanism in Large Language Models
21:02
Serrano.Academy
Рет қаралды 82 М.
More on Friston's Free Energy Principle
13:43
Thinkstr
Рет қаралды 3,6 М.
793: Bayesian Methods and Applications - with Alexandre Andorra
1:31:57
Super Data Science: ML & AI Podcast with Jon Krohn
Рет қаралды 1 М.
Just try to use a cool gadget 😍
00:33
123 GO! SHORTS
Рет қаралды 85 МЛН