Everything You Need to Know About Deep Deterministic Policy Gradients (DDPG) | Tensorflow 2 Tutorial

  Рет қаралды 40,844

Machine Learning with Phil

Machine Learning with Phil

Күн бұрын

Deep Deterministic Policy Gradients (DDPG) is an actor critic algorithm designed for use in environments with continuous action spaces. This makes it great for fields like robotics, that rely on applying continuous voltages to electric motors. You'll get a crash course with a quick lecture, followed by a live coding tutorial.
Despite being an actor critic method, DDPG makes use of a number of innovations from deep Q learning. We're going to make use of a replay memory for training our agent, as well as target actor and target critic networks for learning stability. One key difference is that DDPG uses a soft update rule for the target network parameters, rather than a direct hard copy of the online networks.
In this tutorial we're going to use Tensorflow 2 to implement a deep deterministic policy gradient agent in the pendulum environment from the Open AI gym.
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
www.neuralnet....
Or, pickup my Udemy courses here:
Deep Q Learning:
www.udemy.com/...
Actor Critic Methods:
www.udemy.com/...
Curiosity Driven Deep Reinforcement Learning
www.udemy.com/...
Natural Language Processing from First Principles:
www.udemy.com/...
Reinforcement Learning Fundamentals
www.manning.co...
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: bit.ly/3fXHy8W
Grokking Deep Learning: bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: bit.ly/2VNAXql
Come hang out on Discord here:
/ discord
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Code for this video is here:
github.com/phi...
Website: www.neuralnet.ai
Github: github.com/phi...
Twitter: / mlwithphil

Пікірлер: 96
How to Implement Deep Learning Papers | DDPG Tutorial
1:54:02
Machine Learning with Phil
Рет қаралды 38 М.
Everything You Need To Master Actor Critic Methods | Tensorflow 2 Tutorial
40:47
Machine Learning with Phil
Рет қаралды 49 М.
SHAPALAQ 6 серия / 3 часть #aminkavitaminka #aminak #aminokka #расулшоу
00:59
Аминка Витаминка
Рет қаралды 1,6 МЛН
Policy Gradient Theorem Explained - Reinforcement Learning
59:36
Elliot Waite
Рет қаралды 61 М.
How I’d learn ML in 2024 (if I could start over)
7:05
Boris Meinardus
Рет қаралды 1,1 МЛН
Deep Deterministic Policy Gradients
8:36
CIS 522 - Deep Learning
Рет қаралды 18 М.
L5 DDPG and SAC (Foundations of Deep RL Series)
12:12
Pieter Abbeel
Рет қаралды 20 М.
Reinforcement Learning - "DDPG" explained
6:53
Aylwin Wei
Рет қаралды 30 М.
Proximal Policy Optimization (PPO) is Easy With PyTorch | Full PPO Tutorial
1:02:47
Machine Learning with Phil
Рет қаралды 64 М.
Policy Gradient Methods | Reinforcement Learning Part 6
29:05
Mutual Information
Рет қаралды 30 М.
How To Self Study AI FAST
12:54
Tina Huang
Рет қаралды 549 М.
Reinforcement Learning from scratch
8:25
Graphics in 5 Minutes
Рет қаралды 61 М.