PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
MIT License
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scala...
Pytorch Implementation of Twin Delayed Deep Deterministic Policy Gradients for Continuous Control
Reinforcement learning algorithms in RLlib
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementat...
Grokking Deep Reinforcement Learning
RL agent to play μRTS with Stable-Baselines3 and PyTorch
Like dopamine, but for different algorithms
Modularized Implementation of Deep RL Algorithms in PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch