Tongxuan Liu

Ecosystems: Llama, Cuda

Projects

ScaleLLM

A high-performance inference system for large language models, designed for production environments.

C++ - Released: 24 Jul 2023 - 289