A high-performance inference system for large language models, designed for production environments.
Apache-2.0 License
A tool that can automatically convert 🤗 Hugging Face Spaces, ModelScope Studio (魔搭创空间), and Gradio ChatBot into free API...
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, with an implementation that includes incremental pre-training...
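A curated collection of open-source Chinese large language models, focused on smaller models that can be deployed privately and trained at lower cost. It covers base models, domain-specific fine-tuning and applications, datasets, and tutorials.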
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leav...
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable fo...
Python bindings for llama.cpp
OpenAI-style API for open large language models, using LLMs just like ChatGPT! Support for LLaMA, L...
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
AirLLM: 70B inference with a single 4GB GPU
🚀🚀🚀 A collection of awesome public projects about Large Language Models, Vision Foundation Mod...
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
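Llama Chinese Community: an online Llama3 demo and fine-tuned models are now available, the latest Llama3 learning resources are collected in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.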
This repository contains a web application designed to execute relatively compact, locally-operat...
Inference code for Llama models
🏗️ Fine-tune, build, and deploy open-source LLMs easily!