Xuehai Pan

Ph.D. student at Peking University. Interested in Reinforcement Learning & Multi-Agent Systems & Distributed Computing. Working on LLMs and AI Alignment.

Projects

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python - Released: 13 Aug 2016 - 77,536

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python - Released: 25 Jan 2021 - 4,662

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python - Released: 15 May 2023 - 1,137

Dev-Setup

Automation scripts for setting up a basic development environment.

Shell - Released: 03 Nov 2019 - 78

LaTeX-Templates

A collection of LaTeX templates in English/Chinese, with VS Code settings for LaTeX Workshop.

TeX - Released: 21 Nov 2019 - 60

mate

MATE: the Multi-Agent Tracking Environment.

Python - Released: 21 Aug 2021 - 32

Soft-Actor-Critic

PyTorch Implementation of Soft Actor-Critic Algorithm

Python - Released: 16 Mar 2020 - 10