fastmlx

FastMLX is a high performance production ready API to host MLX models.

OTHER License

Downloads

542

Stars

153

View Code on GitHub View on X

Ecosystems: Python

Package Rankings

Top 34.4% on Pypi.org

Badges

Extracted from project README

Related Projects

llmware

Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.

29 Sep 2023 3,057

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

29 Jan 2024 6,019

llm-api

Run any Large Language Model behind a unified API

02 Apr 2023 159

functionary

Chat language model that can use tools and interpret the results

11 Jul 2023 1,372

ray-llm

RayLLM - LLMs on Ray

31 May 2023 1,180

pyllama

LLaMA: Open and Efficient Foundation Language Models

28 Feb 2023 2,805

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

03 Aug 2023 11,522

chat-with-mlx

An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

16 Feb 2024 1,464

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes yo...

08 Jan 2024 2,286

llama-cpp-python

Python bindings for llama.cpp

23 Mar 2023 6,264

llm-gpt4all

Plugin for LLM adding support for the GPT4All collection of models

09 Jul 2023 177

mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on yo...

16 Jul 2024 58

languagemodels

Explore large language models in 512MB of RAM

07 May 2023 1,154