Everything You Need to Know About Deep Deterministic Policy Gradients (DDPG) | Tensorflow 2 Tutorial
Machine Learning with Phil
40.8K subscribers
38,270 views

 Published On Nov 4, 2020

Deep Deterministic Policy Gradients (DDPG) is an actor-critic algorithm designed for environments with continuous action spaces. This makes it a good fit for fields like robotics, which rely on applying continuous voltages to electric motors. You'll get a crash course with a quick lecture, followed by a live coding tutorial.

Despite being an actor-critic method, DDPG borrows a number of innovations from deep Q-learning. We're going to use a replay memory for training our agent, as well as target actor and target critic networks for learning stability. One key difference is that DDPG uses a soft update rule for the target network parameters, nudging them toward the online networks a little at each step, rather than making a direct hard copy of the online networks.
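To make those two ideas concrete, here is a minimal sketch in plain Python. It is not the code from the video: the names `ReplayMemory` and `soft_update` and the default `tau=0.005` are assumptions for illustration, and the weights are plain lists of floats rather than TensorFlow variables.

```python
import random
from collections import deque


class ReplayMemory:
    """Fixed-size store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # A deque with maxlen drops the oldest transition once it is full.
        self.buffer = deque(maxlen=capacity)

    def store(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement, so a batch mixes transitions
        # from different points in time.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def soft_update(target_weights, online_weights, tau=0.005):
    """Return tau * online + (1 - tau) * target for each weight.

    tau=0.005 is an assumed default; tau=1.0 reduces to a hard copy.
    """
    return [tau * w + (1.0 - tau) * t
            for w, t in zip(online_weights, target_weights)]


# Toy usage: a buffer of capacity 3 only keeps the 3 most recent transitions.
memory = ReplayMemory(capacity=3)
for step in range(5):
    memory.store(step, 0.0, 1.0, step + 1, False)
batch = memory.sample(2)

# With tau=0.1 the target moves 10% of the way toward the online weights.
target = soft_update([0.0, 0.0], [1.0, 2.0], tau=0.1)
```

In a TensorFlow 2 agent the same blend would be applied to each weight array of the target actor and target critic after every learning step.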

In this tutorial we're going to use TensorFlow 2 to implement a deep deterministic policy gradient agent in the Pendulum environment from the OpenAI Gym.

Learn how to turn deep reinforcement learning papers into code:

Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.


Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to [email protected]

https://www.neuralnet.ai/courses

Or pick up my Udemy courses here:

Deep Q Learning:
https://www.udemy.com/course/deep-q-l...

Actor Critic Methods:
https://www.udemy.com/course/actor-cr...

Curiosity Driven Deep Reinforcement Learning:
https://www.udemy.com/course/curiosit...

Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-...

Reinforcement Learning Fundamentals:
https://www.manning.com/livevideo/rei...

Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql

Come hang out on Discord here:
  / discord  

Need personalized tutoring? Help on a programming project? Shoot me an email! [email protected]

Code for this video is here:
https://github.com/philtabor/Youtube-...

Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: @mlwithphil
