A command-line tool for quickly managing and experimenting with multiple versions of llama inference implementations.
OTHER License
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leav...
Run any Large Language Model behind a unified API
Inference Llama 2 in one file of pure Zig
Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llam...
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, Gemini, Claude, Llama 3, DALL-E, La...
This repository contains a web application designed to execute relatively compact, locally-operat...
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
llama.go is like llama.cpp in pure Golang!
Inference code for CodeLlama models
LLaMA-2 in native Go
Practical Llama 3 inference in Java
Inference code for Llama models
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable fo...