RTD3 (Recurrent twin-delayed deep deterministic policy gradient)

  Рет қаралды 437

Thinkstr

Thinkstr

Күн бұрын

It's been a while since I've released a video! I'm pretty busy lately, and my family's having a weird time, so I don't know when the next one will be out.
arxiv.org/pdf/2110.12628.pdf
github.com/zhihanyang2022/off-policy-continuous-control
patreon.com/thinkstr

Пікірлер: 4
@ritvikmath
@ritvikmath 2 жыл бұрын
This reminds me of the KZbin channel Primer but taken to the next level. Super cool!
@Thinkstr
@Thinkstr 2 жыл бұрын
Great to hear from you, Ritvik! Thanks for recommending Primer, it looks REALLY cool
@revimfadli4666
@revimfadli4666 Жыл бұрын
Why has this channel not blow up yet(despite better upload schedule than most other AI edutainment KZbinrs)?
@Thinkstr
@Thinkstr Жыл бұрын
Hey, thanks for watching! I think it might be because my subject-matter sort of changes to whatever I'm interested in at the moment, haha.
Reinforcement Learning - "DDPG" explained
6:53
Aylwin Wei
Рет қаралды 28 М.
DDPG and TD3 (RLVS 2021 version)
16:53
Olivier Sigaud
Рет қаралды 6 М.
A pack of chips with a surprise 🤣😍❤️ #demariki
00:14
Demariki
Рет қаралды 54 МЛН
The child was abused by the clown#Short #Officer Rabbit #angel
00:55
兔子警官
Рет қаралды 14 МЛН
Получилось у Вики?😂 #хабибка
00:14
ХАБИБ
Рет қаралды 6 МЛН
small vs big hoop #tiktok
00:12
Анастасия Тарасова
Рет қаралды 21 МЛН
Deep Deterministic Policy Gradients
8:36
CIS 522 - Deep Learning
Рет қаралды 17 М.
Welcome to Smallville!
23:28
Thinkstr
Рет қаралды 180
More on Friston's Free Energy Principle
13:43
Thinkstr
Рет қаралды 3,6 М.
How to remember (instead of catastrophically forget)
4:54
AIs learn to WALK
20:21
Pezzza's Work
Рет қаралды 53 М.
Future Proof Your Tech Career In the Age of AI
10:21
Travis Media
Рет қаралды 15 М.
Meeting Dork Matter Girl about Reinforcement Learning!
48:50
Are you a fish? (Your Inner Fish)
12:53
Thinkstr
Рет қаралды 1,8 М.
A pack of chips with a surprise 🤣😍❤️ #demariki
00:14
Demariki
Рет қаралды 54 МЛН