A selection of 3D control scenarios created in a highly efficient simulator, benchmarked with the A2C algorithm
MIT License
A modular implementation of PPO, and soon hopefully other algorithms.
DUSt3R: Geometric 3D Vision Made Easy
Zero-Mean Convolutions for Level-Invariant Singing Voice Detection
RLHF implementation details of OAI's 2019 codebase
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
Challenging Memory-based Deep Reinforcement Learning Agents
We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickba...
Reproduce the results of "Neuroevolution of Self-Interpretable Agents" paper
Creating Artificial Life with Reinforcement Learning
DeepCoord: Self-Learning Network and Service Coordination Using Deep Reinforcement Learning
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
A large-scale benchmark and learning environment.
LatPlan : A domain-independent, image-based classical planner
Hybrid Discriminative-Generative Training via Contrastive Learning