CLI helper tool to look up commands based on a description
A basic CLI lookup tool. Describe a bash command and it outputs sample line(s) by querying LLMs. It can make use of OpenAI (GPT-3.5) or Llama.cpp models.
Try a few commands:
$ ? how much disk space
df -h
$ ? show top processes by CPU usage
top -o %CPU
There is a history, so the next question can be a follow-up. Example:
$ ? find .pickle files in this directory
find . -type f -name "*.pickle"
$ ? delete them
find . -type f -name "*.pickle" -delete
Another example: I didn't like the first output, so I asked for nc instead.
$ ? check if port 443 on example.com is open
echo | telnet example.com 443
$ ? using nc
nc -zv example.com 443
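The follow-ups above work because the tool replays earlier exchanges to the model along with the new question. A minimal sketch of that idea, in plain Python; the `ask` helper, `build_prompt`, and the Q/A prompt format are illustrative assumptions, not the tool's actual code:

```python
# Sketch of conversation history for follow-up questions.
# The function names and prompt format are illustrative, not the
# tool's actual implementation.

history = []  # accumulated (question, answer) pairs from this session

def build_prompt(question):
    """Prepend earlier exchanges so the model can resolve follow-ups
    like "delete them"."""
    lines = [f"Q: {q}\nA: {a}" for q, a in history]
    lines.append(f"Q: {question}\nA:")
    return "\n".join(lines)

def ask(question, model):
    """Send the question plus history to the model, record the answer."""
    answer = model(build_prompt(question))
    history.append((question, answer))
    return answer
```

With this in place, a second call to `ask("delete them", model)` hands the model the earlier find command as context.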
Set up dependencies
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
Make a copy of the .env.sample file and call it .env.
Get an API key from your OpenAI account and place it in the .env file:
OPENAI_API_KEY="......................................."
There is a small cost associated with OpenAI calls, so it's a good idea to set monthly limits on usage.
The application is best used as an alias called ?. Add it to ~/.bashrc like so:
# add alias
echo alias ?="\"$(pwd)/.venv/bin/python3 $(realpath openai.clihelper.py)\"" >> ~/.bashrc
# reload bash
exec bash
Now start using ?
Llama.cpp is a fast way to run local LLMs on your own computer. It is especially fast with GPUs, which will be my focus here. It is free to use.
First, ensure that the CUDA Toolkit is installed. After installing CUDA, add it to your PATH and reload bash:
echo 'export PATH="/usr/local/cuda/bin:$PATH"' >> ~/.bashrc
exec bash
# test that it worked:
nvcc --version
Next, install make, cmake, and the Python dependencies, then build the llama-cpp-python package with GPU support.
sudo apt install make cmake
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python --force-reinstall --no-cache-dir
Because Llama is open, there are many Llama models you can choose from. Llama.cpp requires models to be in the GGML format. Here are some I tested with:
TheBloke/Llama-2-7B-Chat-GGML - llama-2-7b-chat.ggmlv3.q4_0.bin
TheBloke/StableBeluga-7B-GGML - stablebeluga-7b.ggmlv3.q4_0.bin
Download one of these, then set the path to the model in the .env file. Example:
LLAMA_MODEL_PATH="./models/7B/stablebeluga-7b.ggmlv3.q4_0.bin"
The application is best used as an alias called ?. Add it to ~/.bashrc like so:
# add alias
echo alias ?="\"$(pwd)/.venv/bin/python3 $(realpath llamacpp.clihelper.py)\"" >> ~/.bashrc
# reload bash
exec bash
Now start using ?
This was made using langchain, a library that helps make calls to large language models (LLMs) and process their output.
In this case I used a 'few-shot' prompt, which shows the LLM a few example questions and the kind of answers it should generate.
I chose the gpt-3.5-turbo model, which is currently the cheapest on OpenAI.
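Stripped of langchain's plumbing, few-shot prompting amounts to embedding example question/command pairs in the prompt ahead of the real question. A hand-rolled sketch of the idea, reusing examples from earlier in this README (the template wording and `few_shot_prompt` helper are my own, not the tool's actual prompt):

```python
# Hand-rolled few-shot prompt, illustrating what langchain's few-shot
# templates do under the hood. The template text is an assumption.
examples = [
    {"question": "how much disk space", "command": "df -h"},
    {"question": "show top processes by CPU usage", "command": "top -o %CPU"},
]

def few_shot_prompt(question):
    """Build a prompt that shows the model a few worked examples
    before asking the real question."""
    shots = "\n".join(
        f"Description: {ex['question']}\nCommand: {ex['command']}"
        for ex in examples
    )
    return (
        "Translate each description into a bash command.\n\n"
        f"{shots}\nDescription: {question}\nCommand:"
    )
```

The trailing "Command:" nudges the model to complete the pattern with a bare command line rather than a prose explanation.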