💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
Rust multiprovider generative AI client (Ollama, OpenAi, Anthropic, Groq, Gemini, Cohere, ...)
Search for anything using the Google, DuckDuckGo, phind.com. Also containes AI models, can transc...
🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline ca...
Instruction/chat prompts creation library for text generation LLMs. It supports local and Hugging...
AirLLM 70B inference with single 4GB GPU
GPT-4 level function calling models for real-world tool using use cases
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
The TypeScript library for building AI applications.
开源社区第一个能下载、能运行的中文 LLaMA2 模型!
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable fo...
Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundati...
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
A high-performance inference system for large language models, designed for production environments.
An OpenAI-like LLaMA inference API