A high-throughput and memory-efficient inference and serving engine for LLMs
Python - Released: 09 Feb 2023 - 28,039
CS194 Project: Attention + Transfer Learning + Domain Adaption
Jupyter Notebook - Released: 11 Apr 2018 - 9
Container Express is a tool to accelerate Docker push and pull.
Rust - Released: 09 Oct 2023 - 2