Twin-Delayed-DDPG

Pytorch Implementation of Twin Delayed Deep Deterministic Policy Gradients for Continuous Control

Stars

11

View Code on GitHub Visit Website View on X

Ecosystems: PyTorch

Twin Delayed DDGP

Pytorch Implementation of Twin Delayed Deep Deterministic Policy Gradients Algorithm for Continuous Control as described by the paper Addressing Function Approximation Error in Actor-Critic Methods by Scott Fujimoto, Herke van Hoof, David Meger.

Results

BipedalWalker-V3

Environment Link: https://gym.openai.com/envs/BipedalWalker-v2/

Mean Reward: 295.263390447903 sampled over 20 evaluation episodes.

Experiment Conducted on Free-P5000 instance provided by Paperspace Gradient.

LunarLanderContinuous-V2

Environment Link: https://gym.openai.com/envs/LunarLanderContinuous-v2/

Mean Reward: 272.55341062406666 sampled over 20 evaluation episodes.

Experiment Conducted on Free-P5000 instance provided by Paperspace Gradient.

Reference

@misc{1802.09477,
    Author = {Scott Fujimoto and Herke van Hoof and David Meger},
    Title = {Addressing Function Approximation Error in Actor-Critic Methods},
    Year = {2018},
    Eprint = {arXiv:1802.09477},
}

Related Projects

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

27 Sep 2018 1,653

gym-microrts-paper-sb3

RL agent to play μRTS with Stable-Baselines3 and PyTorch

PyTorch-ML

Implement DNN or ML models and advanced policies with PyTorch.(Include experiment)

Run-Skeleton-Run

Reason8.ai PyTorch solution for NIPS RL 2017 challenge

DDPG

Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control

DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

31 May 2018 1,053

humpback-whale-identification

Kaggle Humpback whale identification: 2xGPU Data augmentation + FP16 mixed precision training

gdrl

Grokking Deep Reinforcement Learning

15 Mar 2018 807

DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

20 Apr 2017 3,166

endorphin

Like dopamine, but for different algorithms

ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

12 Jul 2019 3,672

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) a...

17 Oct 2017 1,092

genrl

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementat...

26 Mar 2020 403

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

09 Jun 2018 3,876