MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT...
APACHE-2.0 License
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
机器学习原理
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially...
The next generation deep reinforcement learning tookit
Modular Single-file Reinfocement Learning Algorithms Library
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcem...
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementat...
This is the official implementation of Multi-Agent PPO (MAPPO).
Automatically exported from code.google.com/p/aima-python
My python journey
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Softlearning is a reinforcement learning framework for training maximum entropy policies in conti...
Implementations of Reinforcement Learning and Planning algorithms
RLHF implementation details of OAI's 2019 codebase
MATE: the Multi-Agent Tracking Environment.