Meeting Dork Matter Girl about Reinforcement Learning!

Fool-proof RNN explanation | What are RNNs, how do they work?

How AI Discovered a Faster Matrix Multiplication Algorithm

Just try to use a cool gadget 😍

터키아이스크림🇹🇷🍦Turkish ice cream #funny #shorts

Giảm áp lực cho thắt lưng với mẹo này. #vungocson #drson #shorts

버블티로 체감되는 요즘 물가

Meeting Dork Matter Girl about Reinforcement Learning!

Рет қаралды 186

Thinkstr

Күн бұрын

Go watch Dork Matter Girl! / @dorkmattergirl
Here are my Reinforcement Learning examples:
github.com/TedTinker/rl_example
github.com/TedTinker/rl_rnn_example
github.com/TedTinker/rl_ac_example
github.com/TedTinker/rl_sac_example
patreon.com/thinkstr
00:00 Meet the two of us!
01:32 Introduction
03:10 Reinforcement Learning
04:24 Cart Pole
06:15 Early Architecture
09:24 Bellman's Equation
13:50 Q-Maxing in Cartpole Part 1
17:14 This is a stupid game
19:40 Q-Maxing in Cartpole Part 2
21:52 Pseudocode
24:30 Hyper Parameters
25:35 Results
26:50 Recurrent Networks
31:50 Actor Critic (Pendulum)
34:34 Soft Actor and Entropy
40:44 Conclusion

Пікірлер

Fool-proof RNN explanation | What are RNNs, how do they work?

16:05

Fool-proof RNN explanation | What are RNNs, how do they work?

Mısra Turp

Рет қаралды 16 М.

How AI Discovered a Faster Matrix Multiplication Algorithm

13:00

How AI Discovered a Faster Matrix Multiplication Algorithm

Quanta Magazine

Рет қаралды 1,4 МЛН

Just try to use a cool gadget 😍

00:33

Just try to use a cool gadget 😍

123 GO! SHORTS

Рет қаралды 85 МЛН

터키아이스크림🇹🇷🍦Turkish ice cream #funny #shorts

00:26

터키아이스크림🇹🇷🍦Turkish ice cream #funny #shorts

Byungari 병아리언니

Рет қаралды 26 МЛН

Giảm áp lực cho thắt lưng với mẹo này. #vungocson #drson #shorts

00:24

Giảm áp lực cho thắt lưng với mẹo này. #vungocson #drson #shorts

Vu Ngoc Son

Рет қаралды 30 МЛН

버블티로 체감되는 요즘 물가

00:16

버블티로 체감되는 요즘 물가

진영민yeongmin

Рет қаралды 76 МЛН

Why Does Scrum Make Programmers HATE Coding?

16:14

Why Does Scrum Make Programmers HATE Coding?

Thriving Technologist

Рет қаралды 496 М.

Watch the best analysis moments of CNN's Presidential Debate

34:37

Watch the best analysis moments of CNN's Presidential Debate

CNN

Рет қаралды 1 МЛН

Let's try Reinforcement Learning

8:28

Let's try Reinforcement Learning

Thinkstr

Рет қаралды 259

Deep Q Learning is Simple with PyTorch | Full Tutorial 2020

38:55

Deep Q Learning is Simple with PyTorch | Full Tutorial 2020

Machine Learning with Phil

Рет қаралды 99 М.

Generative AI in a Nutshell - how to survive and thrive in the age of AI

17:57

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Henrik Kniberg

Рет қаралды 1,6 МЛН

Agile & Scrum Don't Work | Allen Holub In The Engineering Room Ep. 9

1:12:35

Agile & Scrum Don't Work | Allen Holub In The Engineering Room Ep. 9

Continuous Delivery

Рет қаралды 109 М.

The Attention Mechanism in Large Language Models

21:02

The Attention Mechanism in Large Language Models

Serrano.Academy

Рет қаралды 82 М.

More on Friston's Free Energy Principle

13:43

More on Friston's Free Energy Principle

Thinkstr

Рет қаралды 3,6 М.

793: Bayesian Methods and Applications - with Alexandre Andorra

1:31:57

793: Bayesian Methods and Applications - with Alexandre Andorra

Super Data Science: ML & AI Podcast with Jon Krohn

Рет қаралды 1 М.

Large Language Models and The End of Programming - CS50 Tech Talk with Dr. Matt Welsh

1:06:56

Large Language Models and The End of Programming - CS50 Tech Talk with Dr. Matt Welsh

CS50

Рет қаралды 790 М.

Just try to use a cool gadget 😍

00:33

Just try to use a cool gadget 😍

123 GO! SHORTS

Рет қаралды 85 МЛН