cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

OTHER License

Stars: 105

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues: about 5 hours (past year), 2 months (all time)

Related Projects

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in conti...

03 Dec 2018 1,200

GLM

GLM (General Language Model)

18 Mar 2021 3,170

DouZero

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI

02 Jun 2021 3,972

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-f...

07 Jun 2019 5,379

Bench2Drive

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model...

23 Apr 2024 1,240

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...

06 Jul 2023 1,448

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in PyTorch

26 Jan 2024 4,519

abcdrl

Modular Single-file Reinforcement Learning Algorithms Library

12 Nov 2022 37

ProteinDT

05 Feb 2023 41

minichatgpt

minichatgpt - To Train ChatGPT In 5 Minutes

23 Feb 2023 155

open-instruct

09 Jun 2023 1,214

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

14 Jan 2022 626

clip-jax

Train vision models using JAX and 🤗 transformers

05 Aug 2022 75

chain-of-hindsight

Chain-of-Hindsight, A Scalable RLHF Method

20 Feb 2023 211

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vi...

19 Mar 2023 36,628