Speed Benchmarking 7B LLM on different gcloud VMs
GPL-3.0 License
Deploying Qwen2 (or any other GGUF models) into AWS Lambda
LLM as a Chatbot Service
Explore large language models in 512MB of RAM
Run any Large Language Model behind a unified API
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
LLaMA: Open and Efficient Foundation Language Models
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
LLM plugin for running models using llama.cpp
⏱ Benchmarks of machine learning inference for Go
Access 14k+ open source AI models across 30+ tasks with the Bytez inference API ✨
Plugin for LLM adding support for the GPT4All collection of models
An open platform for training, serving, and evaluating large language models. Release repo for Vi...