abcdRL (简单四步实现一个强化学习算法)

English | 简体中文

abcdRL 是一个模块化单文件强化学习代码库，提供“有但不严格”的模块化设计，和清晰的单文件算法实现。

阅读代码时，在单文件代码中，快速了解算法的完整实现细节；改进算法时，得益于轻量的模块化设计，只需专注于少量的模块。

abcdRL 主要参考了 vwxyzjn/cleanrl 的单文件设计哲学和 PaddlePaddle/PARL 的模块设计。

使用文档 ➡️ docs.abcdrl.xyz

路线图🗺️ #57

🚀 快速开始

在 Gitpod🌐 中打开项目，并立即开始编码。

使用 Docker📦：

# 0. 安装 Docker & Nvidia Drive & NVIDIA Container Toolkit
# 1. 运行 DQN 算法
docker run --rm --gpus all sdpkjc/abcdrl python abcdrl/dqn_torch.py

详细安装说明 👀

🐼 特点

👨‍👩‍👧‍👦 统一的代码结构
📄 单文件实现
🐷 低代码复用
📐 最小化代码差异
📈 集成 Tensorboard & Wandb
🛤 符合 PEP8 & PEP526 规范

🗽 设计哲学

要“拷贝📋”，~~不要“继承🧬”~~
要“单文件📜”，~~不要“多文件📚”~~
要“功能复用🛠”，~~不要“算法复用🖨”~~
要“一致的逻辑🤖”，~~不要“一致的接口🔌”~~

✅ 已实现算法

Weights & Biases 性能报告 ➡️ report.abcdrl.xyz

Deep Q Network (DQN) dqn_torch.py, dqn_tf.py, dqn_atari_torch.py, dqn_atari_tf.py
Deep Deterministic Policy Gradient (DDPG) ddpg_torch.py
Twin Delayed Deep Deterministic Policy Gradient (TD3) td3_torch.py
Soft Actor-Critic (SAC) sac_torch.py
Proximal Policy Optimization (PPO) ppo_torch.py

Double Deep Q Network (DDQN) ddqn_torch.py, ddqn_tf.py
Prioritized Deep Q Network (PDQN) pdqn_torch.py, pdqn_tf.py

引用 abcdRL

@misc{zhao_abcdrl_2022,
    author = {Yanxiao, Zhao},
    month = {12},
    title = {{abcdRL: Modular Single-file Reinforcement Learning Algorithms Library}},
    url = {https://github.com/sdpkjc/abcdrl},
    year = {2022}
}

Package Rankings

Top 21.98% on Pypi.org

Badges

Extracted from project README

Related Projects

rlkit

Collection of reinforcement learning algorithms

25 Jan 2018 2,378

Reinforcement-Learning-practice-zh

强化学习-中文笔记&资源-以python实例为主-由浅入深

10 Dec 2019 83

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compa...

21 Apr 2016 2,868

rllib

Reinforcement Learning Library.

28 Jun 2022 29

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-f...

07 Jun 2019 5,379

deep_rl

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

11 May 2021 22

Research

novel deep learning research works with PaddlePaddle

13 Feb 2020 1,707

pfrl

PFRL: a PyTorch-based deep reinforcement learning library

24 Jun 2020 1,182

chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

30 Jan 2017 1,155

cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

12 Feb 2023 105

clip-jax

Train vision models using JAX and 🤗 transformers

05 Aug 2022 75

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

14 Jan 2022 626

genrl

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementat...

26 Mar 2020 403

DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

31 May 2018 1,053

deep-marl-toolkit

MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MAD...

08 Aug 2022 70