FauxPilot - an open-source alternative to GitHub Copilot server
MIT License
In this repository, I will share some useful notes and references about deploying deep learning-b...
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed conf...
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer ...
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving; [CVP...
Locally hosted AI code completion server. Like Github Copilot but 100% free and 100% private.
Minimalistic large language model 3D-parallelism training
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Tool...
GPT-powered chat for documentation, chat with your documents
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating poin...
âš¡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques fo...