# vnc-lm

vnc-lm is a Discord bot that lets you talk with and configure language models in your server. It uses ollama to manage and run different models.
*(Demo GIFs: web scraping example, model pulling example)*
### Features

Change models using the `/model` command and adjust parameters like `num_ctx`, `system_prompt`, and `temperature`. Notifications are sent when models load into RAM. The bot responds wherever `/model` was last used. Models can be removed with the `remove` parameter. Download models directly through Discord by messaging a model tag URL:

```
https://ollama.com/library/phi3.5:3.8b-mini-instruct-q2_K
```

Model downloading and removal is turned off by default and can be enabled by configuring the `.env` file.
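A hypothetical invocation using the model tag above (the `model` option label and the exact way options are entered are assumptions; use whatever Discord's slash-command picker shows in your server):

```
/model model: phi3.5:3.8b-mini-instruct-q2_K num_ctx: 4096 temperature: 0.7
```

Messaging the tag URL itself queues a download instead, provided model management has been enabled through the `ADMIN` variable described in the setup section below.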
Message generation is streamed, and messages longer than 1500 characters are split into pages. Message attachments like text-based files, web links, and screenshots can be added into the context window.
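The paging can be pictured as slicing a long reply into fixed-size chunks; a minimal sketch of the idea in TypeScript (illustrative only, not the project's actual `page-manager.ts`, and assuming the default `CHARACTER_LIMIT` of 1500):

```typescript
// Illustrative sketch: split a long response into pages of at most `limit` characters.
function paginate(text: string, limit = 1500): string[] {
  const pages: string[] = [];
  for (let i = 0; i < text.length; i += limit) {
    pages.push(text.slice(i, i + limit));
  }
  return pages;
}
```

A real implementation would likely also avoid splitting mid-word or mid-code-block when building the page embeds.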
Switch between conversations by clicking `Rejoin Conversation` in the Discord context menu. Conversations can be continued from any point and with different models. All messages are cached and organized into conversations. `entrypoint.sh` helps the cache file persist across Docker containers.
Messaging `stop` will end message generation early. Messaging `reset` returns models to their default configuration.
### Installation

Clone the repository:

```
git clone https://github.com/jake83741/vnc-lm.git
cd vnc-lm
```
Rename `.env.example` to `.env` in the project root directory and configure the following fields:

- `TOKEN=`: Your Discord bot token. Use the Discord Developer Portal to create this. Check the necessary permissions for your Discord bot.
- `OLLAMAURL=`: The URL of your Ollama server. See the API documentation. Docker requires `http://host.docker.internal:11434`.
- `NUM_CTX=`: Value controlling the context window size. Defaults to 2048.
- `TEMPERATURE=`: Value controlling the randomness of responses. Defaults to 0.4.
- `KEEP_ALIVE=`: Value controlling how long a model stays in memory. Defaults to 45m.
- `CHARACTER_LIMIT=`: Value controlling the character limit for page embeds. Defaults to 1500.
- `API_RESPONSE_UPDATE_FREQUENCY=`: Value controlling how many API responses are chunked before the message is updated. A low number will cause the Discord API to throttle. Defaults to 10.
- `ADMIN=`: Discord user ID. This enables downloading and removing models.
- `REQUIRE_MENTION=`: Whether the bot must be mentioned to respond. Defaults to false.

To run with Docker:

```
docker compose up --build
```
To run without Docker:

```
npm install
npm run build
npm start
```
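For reference, a filled-in `.env` might look like the sketch below. All values are placeholders: the token and user ID are fake, and the remaining values simply repeat the documented defaults.

```
# .env (example values only)
TOKEN=your-discord-bot-token
OLLAMAURL=http://host.docker.internal:11434
NUM_CTX=2048
TEMPERATURE=0.4
KEEP_ALIVE=45m
CHARACTER_LIMIT=1500
API_RESPONSE_UPDATE_FREQUENCY=10
ADMIN=123456789012345678
REQUIRE_MENTION=false
```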
### Usage

- `/model`: Load, configure, or remove a language model. Optional parameters for `num_ctx`, `system_prompt`, `temperature`, and `remove`.
- `Rejoin Conversation`: Rejoin an old conversation at a specific point. Messages up to the selected point in the conversation will also be included.
- `/help`: Instructions for how to use the bot.

### Tree Diagram

```
.
├── LICENSE
├── README.md
├── docker-compose.yaml
├── dockerfile
├── entrypoint.sh
├── .env.example
├── imgs
├── package.json
├── src
│   ├── api-connections
│   │   ├── api-requests.ts
│   │   ├── library-refresh.ts
│   │   ├── model-loader.ts
│   │   └── model-pull.ts
│   ├── bot.ts
│   ├── commands
│   │   ├── command-registry.ts
│   │   ├── help-command.ts
│   │   ├── model-command.ts
│   │   ├── optional-params
│   │   │   └── remove.ts
│   │   └── rejoin-conversation.ts
│   ├── functions
│   │   ├── ocr-function.ts
│   │   └── scraper-function.ts
│   ├── managers
│   │   ├── cache-manager.ts
│   │   ├── message-manager.ts
│   │   └── page-manager.ts
│   ├── message-generation
│   │   ├── chunk-generation.ts
│   │   ├── message-create.ts
│   │   └── message-preprocessing.ts
│   └── utils.ts
└── tsconfig.json
```

### Notes

- If Ollama is not reachable from the bot (for example from inside Docker), set the `OLLAMA_HOST` environment variable to 0.0.0.0 so the server listens on all interfaces. See the Ollama server documentation.
- Adding large amounts of text to the context window may require a higher `num_ctx` value to work properly.

### License

This project is licensed under the MIT License.