A fast CPU-based API for OpenChat 3.6 using CTranslate2, hosted on Hugging Face Spaces.
An offline CPU-first memory-scarce chat application to perform RAG on your corpus of data. Powere...
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use...
A low-memory high-performance CPU-based API for Meta's No Language Left Behind (NLLB) using CTran...
A simple axum API for compiling TeX/LaTeX with Tectonic, hosted on Hugging Face Spaces.
Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, L...
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate...
A high-performance axum API for serving Hugging Face's Tokenizers