An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
MIT License
RL agent to play μRTS with Stable-Baselines3 and PyTorch
Implement DNN or ML models and advanced policies with PyTorch.(Include experiment)
A rust implementation of AlphaZero algorithm
Evolution Strategies in PyTorch (Tic-tac-toe)
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Goba...
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....