TiChat

Simple, High Quality, RAG application using TiDB vector store

APACHE-2.0 License


TiChat is a simple, high-quality, ChatGPT-style, extensible RAG application that uses AI models and vector indices to query your documents and return better-informed responses. You can upload documents that are then used to answer related queries, and your chats are stored automatically for later use.

A sample online implementation of this project is available here.

Prerequisites

This app has a single dependency that needs to be installed separately:

  • Ollama, for quick and easy model download and serving, with automatic device placement

This app uses TiDB as its vector store. For a quick setup, head over to TiDB Cloud and sign up for a free account.

After generating the password, get the connection string for your database (default: 'test'). Also download the CA certificate to your system.
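As a rough illustration of what "connection string with ssl_ca" means, the small helper below appends an `ssl_ca` parameter to a TiDB-style connection string. The helper name and the example credentials are hypothetical, not part of TiChat:

```python
from urllib.parse import urlencode

def with_ssl_ca(connection_string: str, ca_path: str) -> str:
    """Append the ssl_ca parameter (the full path to the downloaded
    CA certificate) to a TiDB connection string."""
    # Use '&' if the string already carries query parameters, '?' otherwise.
    sep = "&" if "?" in connection_string else "?"
    return connection_string + sep + urlencode({"ssl_ca": ca_path})
```

For example, `with_ssl_ca("mysql+pymysql://user:pw@host:4000/test", "/etc/certs/ca.pem")` yields the original string with `?ssl_ca=...` (URL-encoded) appended.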

Getting Started

  1. Go to the Releases page, and download the latest TiChatInstaller.exe.
  2. Run it and follow the steps to complete the installation.
  3. Go to the installation directory (default: "C:\Program Files (x86)\TiChat"), and make the following changes to settings.json:
    • Set connectionString to the connection string for your TiDB Cloud cluster, with ssl_ca in that string set to the full path of the downloaded CA certificate
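For illustration, the relevant entry in settings.json might look like the following; the user, password, host, and certificate path are placeholders, and the exact key name should match what ships in your settings.json:

```json
{
  "connectionString": "mysql+pymysql://<user>:<password>@<host>:4000/test?ssl_ca=C:\\Program Files (x86)\\TiChat\\<ca-cert>.pem"
}
```

Note that backslashes in the Windows path must be escaped (`\\`) inside JSON.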

Done! You can now run the application.

Usage

After completing the steps above, start the app from the desktop or Start menu.

The app lets the user simply chat with the bot if the 'Use Index Querying' checkbox is left unchecked, or use the index built from the uploaded documents for better-informed responses.

Upload documents using the top right button.

This is what it should look like:

Application Explained

Components:

  1. Ollama (Local LLM Runtime):
    • Hosts the local LLM (Mistral-7B-Instruct-v0.3).
    • Runs inference locally to generate responses from prompts.
  2. FastEmbedEmbedding (ONNX Model):
    • Runs locally on the CPU.
    • Generates vector embeddings from text data (e.g., uploaded documents).
    • Model: mixedbread-ai/mxbai-embed-large-v1.
  3. TiDB (Vector Store):
    • A distributed, scalable database that stores the embeddings.
    • Provides vector search capabilities.
  4. Llama-Index (Vector Index):
    • Interface layer between the application and TiDB.
    • Manages the vector indexes and performs efficient retrieval of relevant documents.
  5. RAG Chatbot Application:
    • The main user interface where users interact with the chatbot.
    • Orchestrates the flow of data between the components.

The frontend is built with React and Bootstrap 5.

Data Flow:

There are two main flows:

  1. If the user does not use the index, the query goes directly to the LLM, like a normal chatbot.
  2. If the user checks 'Use Index Querying', the following occurs:
    • User Input: The user enters a query via the chatbot interface.
    • Embedding Generation: The query, condensed together with the previous messages into a single standalone question for a better response, is passed to the FastEmbedEmbedding ONNX model to generate its vector embedding.
    • Vector Search: The generated embedding is sent to Llama-Index, which queries TiDB for relevant document embeddings; TiDB returns the most relevant document vectors.
    • Contextual Response Generation: The retrieved documents (in their original text form) are provided as context to the LLM in Ollama, which generates a response based on the query and the retrieved documents.
    • Response Delivery: The generated response is displayed to the user through the chatbot interface.
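The two flows above can be sketched in a few lines of self-contained Python. Every component here is a toy stand-in, not the real API: `embed` replaces FastEmbedEmbedding, `VectorStore` replaces TiDB/Llama-Index, and `llm` replaces the Ollama-hosted model. Only the orchestration logic mirrors the description:

```python
import math

def embed(text: str) -> list[float]:
    # Toy embedding: normalized letter-frequency vector
    # (stand-in for the FastEmbedEmbedding ONNX model).
    alphabet = "abcdefghijklmnopqrstuvwxyz"
    counts = [text.lower().count(c) for c in alphabet]
    norm = math.sqrt(sum(x * x for x in counts)) or 1.0
    return [x / norm for x in counts]

def cosine(a: list[float], b: list[float]) -> float:
    return sum(x * y for x, y in zip(a, b))

class VectorStore:
    """Stand-in for TiDB: stores (embedding, text) rows and
    returns the top-k texts by cosine similarity."""
    def __init__(self):
        self.rows = []

    def add(self, text: str) -> None:
        self.rows.append((embed(text), text))

    def search(self, query_vec: list[float], k: int = 1) -> list[str]:
        ranked = sorted(self.rows, key=lambda r: cosine(query_vec, r[0]),
                        reverse=True)
        return [text for _, text in ranked[:k]]

def llm(prompt: str) -> str:
    # Stand-in for the Mistral model served by Ollama.
    return f"ANSWER({prompt})"

def chat(query: str, store: VectorStore, use_index: bool) -> str:
    if not use_index:
        # Flow 1: plain chatbot, the query goes straight to the LLM.
        return llm(query)
    # Flow 2: embed the query, retrieve context from the vector store,
    # then answer with the retrieved documents as context.
    context = store.search(embed(query), k=1)
    return llm(f"context={context} question={query}")
```

For example, after `store.add("tidb is a distributed sql database")`, calling `chat("what is tidb", store, use_index=True)` retrieves that document and passes it to the LLM as context, while `use_index=False` sends the question to the LLM unchanged.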
Related Projects