serendipity-zk

Ecosystems: Llama, Cuda

Projects

Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda - Released: 19 Aug 2024 - 537