vllm_now

Ganler's vLLM utilities.

Stars
2

To save Yuxiang's time from typing 8 times of docker run vllm... as well as launching the load balancer simultaneously:

pip install "vllm_now @ git+https://github.com/ganler/vllm_now@main" --upgrade
vllm_now --model "ise-uiuc/Magicoder-S-DS-6.7B"

# [Launch] vllm servers for your favorite models
docker-compose up -d
# [Check] the status
docker-compose logs -t -f
# [Stop] the servers
docker-compose stop

[!Note]

To install docker-compose: https://docs.docker.com/compose/install/linux/#install-using-the-repository

Quick command for Ubuntu: sudo apt-get install docker-compose-plugin

[!Tip]

vllm_now arguments are synchronized with vllm.entrypoints.openai.api_server arguments.

In addition, you can set --vllm-version to specify the version of vllm to install.

Supported and compatible environment variables:

  • HF_HOME
  • CUDA_VISIBLE_DEVICES