Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
MIT License
PyGame-based quadcopter simulator & Reinforcement Learning Project
This is an AI agent for Street Fighter II Champion Edition.
[NeurIPS 2023] Generating Mario Levels with GPT2. Code for the paper "MarioGPT: Open-Ended Text2L...
The next generation deep reinforcement learning toolkit
100% Local AGI with LocalAI
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementat...
PyTorch implementations of deep reinforcement learning algorithms and environments
Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods f...
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | DouDizhu AI
Reinforcement Learning in PyTorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Simple Example A3C Reinforcement Learning Algorithm in Tensorflow
Softlearning is a reinforcement learning framework for training maximum entropy policies in conti...
Deep Reinforcement Learning Pong Agent, King Pong, he's the best
Agent techniques to augment your LLM and push it beyond its limits