candle-vllm

Efficient platform for inference and serving of local LLMs, including an OpenAI-compatible API server.

MIT License

Stars
204