High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
OTHER License
A modular RL library to fine-tune language models to human preferences
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Clean PyTorch implementations of imitation and reward learning algorithms
Reinforcement Learning in PyTorch
PFRL: a PyTorch-based deep reinforcement learning library
Diffusers training with mmengine
Differentiable Factor Graph Optimization for Learning Smoothers @ IROS 2021
OpenChat: Advancing Open-source Language Models with Imperfect Data
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DP...
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementat...
Collection of reinforcement learning algorithms
SBX: Stable Baselines Jax (SB3 + Jax)
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compa...
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.