A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
MIT License
MuZero
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scala...
Massively Parallel Deep Reinforcement Learning. 🔥
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Se...
A rust implementation of AlphaZero algorithm