Document-based-Question-and-Answers

A document question answering system that uses Llama and LangChain to deliver accurate, context-aware answers. It supports .txt documents, intelligent text splitting, and contextual querying through an easy-to-use Streamlit interface.

Document Question Answering System

Introduction

This project implements a document question answering system using modern language models and natural language processing techniques. By leveraging the power of Llama and LangChain, the system lets users upload documents, ask questions about their content, and receive accurate, context-aware answers.

The system is designed to be efficient, scalable, and easily customizable, making it suitable for a wide range of applications, from personal knowledge management to enterprise-level document analysis.

Features

  • Document Processing: Support for various document formats (currently .txt, with plans to expand)
  • Intelligent Text Splitting: Breaks down documents into manageable chunks while preserving context
  • Advanced Embedding Generation: Utilizes Llama for creating high-quality text embeddings
  • Efficient Vector Storage: Implements Chroma for fast similarity search and retrieval
  • Contextual Question Answering: Employs a language model to generate accurate answers based on document context
  • User-Friendly Interface: Built with Streamlit for easy interaction and visualization
  • Customizable Components: Flexible architecture allowing for easy swapping of models and fine-tuning of parameters

System Architecture

  1. Document Ingestion: Documents are loaded using LangChain's TextLoader.
  2. Text Splitting: The RecursiveCharacterTextSplitter breaks documents into smaller, overlapping chunks.
  3. Embedding Generation: Llama generates embeddings for each text chunk.
  4. Vector Storage: Chroma stores and indexes the embeddings for efficient retrieval.
  5. Query Processing: User questions are embedded and compared against stored document embeddings.
  6. Context Retrieval: The most relevant document chunks are retrieved using similarity search.
  7. Answer Generation: A language model generates answers based on the retrieved context and user question.
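
In code, the seven steps above boil down to a short script. Here is a minimal sketch, assuming the classic langchain import paths (newer releases relocate these to langchain_community) and a placeholder model filename; the actual app.py may differ in detail:

    from langchain.document_loaders import TextLoader
    from langchain.text_splitter import RecursiveCharacterTextSplitter
    from langchain.embeddings import LlamaCppEmbeddings
    from langchain.vectorstores import Chroma
    from langchain.llms import LlamaCpp
    from langchain.prompts import PromptTemplate

    # Steps 1-2: load the document and split it into chunks
    docs = TextLoader("example.txt").load()
    splitter = RecursiveCharacterTextSplitter(chunk_size=256, chunk_overlap=0)
    chunks = splitter.split_documents(docs)

    # Steps 3-4: embed each chunk with Llama and index the vectors in Chroma
    embeddings = LlamaCppEmbeddings(model_path="models/llama-7b.Q4_K_M.gguf")
    store = Chroma.from_documents(chunks, embeddings)

    # Steps 5-6: embed the question and retrieve the most similar chunk(s)
    question = "What is the document about?"
    context_docs = store.similarity_search(question, k=1)

    # Step 7: generate an answer from the retrieved context
    llm = LlamaCpp(model_path="models/llama-7b.Q4_K_M.gguf")
    prompt = PromptTemplate(
        input_variables=["context", "question"],
        template="Answer using only this context:\n{context}\n\nQuestion: {question}\nAnswer:",
    )
    context = "\n\n".join(doc.page_content for doc in context_docs)
    print(llm(prompt.format(context=context, question=question)))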

Prerequisites

  • Python 3.7+
  • Llama model (this example uses a quantized 7B version)
  • CUDA-capable GPU (recommended for faster processing)

Installation

  1. Clone the repository:

    git clone https://github.com/yourusername/document-qa-system.git
    cd document-qa-system
    
  2. Create a virtual environment (optional but recommended):

    python -m venv venv
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
    
  3. Install the required packages:

    pip install -r requirements.txt
    
  4. Download the Llama model:

    • Visit Hugging Face to download the appropriate Llama model
    • Place the model file in the models directory
    • Update the MODEL_PATH in the code to point to your model file

Usage

  1. Start the Streamlit app:

    streamlit run app.py
    
  2. Open your web browser and navigate to the provided local URL (typically http://localhost:8501)

  3. Upload a document using the file uploader in the sidebar

  4. Enter your question in the text input field

  5. Click the "Ask" button to generate an answer

  6. View the answer and relevant context in the main area of the app
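
The interaction above reduces to a handful of Streamlit widgets. A minimal sketch of the flow, where answer_question is a hypothetical helper standing in for the retrieval pipeline in app.py:

    import streamlit as st

    # Sidebar uploader for the document, main-area input for the question
    uploaded = st.sidebar.file_uploader("Upload a document", type=["txt"])
    question = st.text_input("Enter your question")

    if st.button("Ask") and uploaded and question:
        text = uploaded.read().decode("utf-8")
        # answer_question is a hypothetical helper wrapping the pipeline
        answer, context = answer_question(text, question)
        st.write(answer)
        st.expander("Relevant context").write(context)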

Configuration

Key configuration options can be found at the top of the app.py file:

  • MODEL_PATH: Path to the Llama model file
  • CHUNK_SIZE: Size of text chunks for splitting (default: 256)
  • CHUNK_OVERLAP: Overlap between chunks (default: 0)
  • TOP_K: Number of most relevant chunks to consider (default: 1)
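
In code, these defaults might look roughly like the following block at the top of app.py (the model filename is a placeholder):

    MODEL_PATH = "models/llama-7b.Q4_K_M.gguf"  # path to the downloaded Llama model
    CHUNK_SIZE = 256     # characters per chunk fed to the text splitter
    CHUNK_OVERLAP = 0    # characters shared between consecutive chunks
    TOP_K = 1            # number of chunks retrieved per question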

Customization

  • Embedding Model: Replace LlamaCppEmbeddings with other LangChain-compatible embedding models
  • Vector Store: Swap Chroma with other vector stores like FAISS or Pinecone
  • Language Model: Experiment with different LLMs supported by LangChain
  • Prompt Engineering: Modify the template in the PromptTemplate to alter the system's response style
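
As an example of the vector-store and prompt swaps above, here is a hedged sketch that replaces Chroma with FAISS (requires the faiss-cpu package) and customizes the prompt, again assuming classic langchain import paths:

    from langchain.vectorstores import FAISS
    from langchain.prompts import PromptTemplate

    # Drop-in replacement for Chroma; `chunks` and `embeddings` as in the pipeline sketch
    store = FAISS.from_documents(chunks, embeddings)

    # A custom prompt template changes the response style without touching retrieval
    qa_prompt = PromptTemplate(
        input_variables=["context", "question"],
        template=(
            "You are a concise assistant. Using only the context below, "
            "answer in one short paragraph.\n\n"
            "Context:\n{context}\n\nQuestion: {question}\nAnswer:"
        ),
    )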

Troubleshooting

  • Out of Memory Errors: Reduce CHUNK_SIZE or use a smaller language model
  • Slow Performance: Ensure you're using a GPU, or consider using a smaller/quantized model
  • Inaccurate Answers: Experiment with different CHUNK_SIZE and CHUNK_OVERLAP values, or try a more advanced language model
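
For the slow-performance case above, llama-cpp-python can offload model layers to a CUDA GPU if it was built with GPU support; a sketch (the layer count depends on your VRAM):

    from langchain.llms import LlamaCpp

    llm = LlamaCpp(
        model_path="models/llama-7b.Q4_K_M.gguf",  # placeholder path
        n_gpu_layers=32,  # layers offloaded to the GPU; lower this if you run out of VRAM
        n_ctx=2048,       # context window size
    )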

Contributing

We welcome contributions to improve the Document QA System! Here's how you can contribute:

  1. Fork the repository
  2. Create a new branch (git checkout -b feature/AmazingFeature)
  3. Make your changes
  4. Commit your changes (git commit -m 'Add some AmazingFeature')
  5. Push to the branch (git push origin feature/AmazingFeature)
  6. Open a Pull Request