RTD3 (Recurrent twin-delayed deep deterministic policy gradient)

Views: 437

Thinkstr

Days ago

It's been a while since I've released a video! I've been pretty busy lately, and my family's having a weird time, so I don't know when the next one will be out.
arxiv.org/pdf/2110.12628.pdf
github.com/zhihanyang2022/off-policy-continuous-control
patreon.com/thinkstr

Comments: 4

@ritvikmath 2 years ago

This reminds me of the YouTube channel Primer but taken to the next level. Super cool!

@Thinkstr 2 years ago

Great to hear from you, Ritvik! Thanks for recommending Primer, it looks REALLY cool

@revimfadli4666 1 year ago

Why has this channel not blown up yet (despite a better upload schedule than most other AI edutainment YouTubers)?

@Thinkstr 1 year ago

Hey, thanks for watching! I think it might be because my subject-matter sort of changes to whatever I'm interested in at the moment, haha.
