Everything You Need to Know About Deep Deterministic Policy Gradients (DDPG)

Everything You Need to Know About Deep Deterministic Policy Gradients (DDPG) | Tensorflow 2 Tutorial

Рет қаралды 40,844

Күн бұрын

Deep Deterministic Policy Gradients (DDPG) is an actor critic algorithm designed for use in environments with continuous action spaces. This makes it great for fields like robotics, that rely on applying continuous voltages to electric motors. You'll get a crash course with a quick lecture, followed by a live coding tutorial.
Despite being an actor critic method, DDPG makes use of a number of innovations from deep Q learning. We're going to make use of a replay memory for training our agent, as well as target actor and target critic networks for learning stability. One key difference is that DDPG uses a soft update rule for the target network parameters, rather than a direct hard copy of the online networks.
In this tutorial we're going to use Tensorflow 2 to implement a deep deterministic policy gradient agent in the pendulum environment from the Open AI gym.
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
www.neuralnet....
Or, pickup my Udemy courses here:
Deep Q Learning:
www.udemy.com/...
Actor Critic Methods:
www.udemy.com/...
Curiosity Driven Deep Reinforcement Learning
www.udemy.com/...
Natural Language Processing from First Principles:
www.udemy.com/...
Reinforcement Learning Fundamentals
www.manning.co...
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: bit.ly/3fXHy8W
Grokking Deep Learning: bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: bit.ly/2VNAXql
Come hang out on Discord here:
/ discord
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Code for this video is here:
github.com/phi...
Website: www.neuralnet.ai
Github: github.com/phi...
Twitter: / mlwithphil