Deploying Qwen2 (or any other GGUF model) on AWS Lambda
Speed Benchmarking 7B LLM on different gcloud VMs
AirLLM 70B inference on a single 4GB GPU
Run any Large Language Model behind a unified API
Running Llama 2 and other open-source LLMs locally on CPU for document Q&A
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable fo...
Finetune llama2-70b and codellama on MacBook Air without quantization
Ampere optimized llama.cpp
A Chinese-language tutorial on deploying large language models locally
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
♾️ toolkit for air-gapped LLMs on consumer-grade hardware
An open-source, cloud-native serving framework for large multi-modal models (LMMs).
Practical Llama 3 inference in Java
DeveloperGPT is an LLM-powered command line tool that enables natural language to terminal command...
Bootstrap a server from llama-cpp in a few lines of python
A lightweight library that leverages Large Language Models (LLMs) to enable natural language interactio...