Research Paper Q&A Tool

A powerful tool for document search and analysis using advanced language models. Upload PDFs, convert them to vectors, and query your documents with ease.

https://github.com/user-attachments/assets/1161b9f2-7f42-4cc5-b15e-da7f1f6401c3

Features

PDF Upload and Vectorization: Upload PDFs and convert them into vectors using Pinecone.
Advanced Querying: Leverage the Ollama model for intelligent document querying.
User-Friendly Interface: Built with Streamlit for a seamless user experience.

Quick Start

Prerequisites

Python 3.10
Docker
Pinecone API Key
Ollama Model (pre-configured in the application)

Setup

1. Create a Pinecone Account and API Key

Sign up for a Pinecone account at Pinecone.
Create an index and generate your API key.
Save your API key and index name, as you'll need them to run the application.

2. Configure the Models in the Code

Open backend/core/embedding_service.py.
Find the section where the models are defined:

   # Example configuration
   LLM_MODEL_NAME = "your_llm_model_name"
   EMBEDDING_MODEL_NAME = "your_embedding_model_name"

Replace "your_llm_model_name" and "your_embedding_model_name" with the actual names of the models you downloaded.

4. Build and Run

4.1. With Docker

Ensure Docker is installed on your system.
From the project root, run:

docker-compose up --build

Access the app at http://localhost:8501.

4.2. Without Docker

Install dependencies:

pip install -r backend/requirements.txt

Start the FastAPI server:

python backend/scripts/main.py

In a new terminal, start the Streamlit frontend:

streamlit run frontend/streamlit_ui.py

Open http://localhost:8501 to use the app.

5. Enter Pinecone API Key and Index to Use

After starting the Streamlit frontend, enter the Pinecone API key and index:

6. Upload your document pdf

Push the buttom to convert your pdf to vector and store to the vectorDB
Now, you may start to ask the llm question about your pdf

Related Projects

LlamaFlowJs

LlamaFlow is a framework that has inbuilt agentic workflows,reiterative reflection and llm review...

13 Jul 2024 1

AI-Agent-Document-Analyzer

This project is an AI-powered document analysis bot designed to process and extract information f...

23 Aug 2024 1

pdf-ai-chat-assistant

A versatile PDF AI Chat Assistant that allows users to interact with PDF documents through an AI-...

12 Aug 2024 1

llm-cve

A Retrieval-Augmented Generation (RAG) model built using LLaMA 3.1 and LangChain.

12 Aug 2024 0

llama_parse

Parse files for optimal RAG

31 Jan 2024 2,775

local-LLM-with-RAG

Running local Language Language Models (LLM) to perform Retrieval-Augmented Generation (RAG)

05 Nov 2023 157

LLaMA-2-hf-Chatbot

Chatbot from pretrained LLaMA-2 LLM model, fine-tuned with medical research papers using RAG (Ret...

06 Jun 2024 2

ArogyaMitra

An accessible, reliable, and efficient platform for medical information and support using LLMs

22 Jul 2024 1

pdf-q-a-llamaindex-llama2

Chat with your PDF files using LlamaIndex, Astra DB (Apache Cassandra), and Gradient's open-sourc...

31 Oct 2023 36

Document-based-Question-and-Answers

Developed a document question answering system that utilizes Llama and LangChain for contextual a...

06 Sep 2024 0

PrivateDocBot

📚 Local PDF-Integrated Chat Bot: Secure Conversations and Document Assistance with LLM-Powered Pr...

13 Aug 2023 71

pinecone-loader

Builds Docker Image for Loading Pinecone Database with Vectors from CVE data.

14 Aug 2024 0

InstaDoc-Intelligent-QnA-Powered-by-RAG

Upload documents 📄 and get instant, accurate answers to your questions with InstaDoc: Intelligent...

23 Jul 2024 0

ragbase

Completely local RAG (with open LLM) and UI to chat with your PDF documents. Uses LangChain, Stre...

11 Jul 2024 37

langchain-ask-pdf-local

An AI-app that allows you to upload a PDF and ask questions about it. It uses StableVicuna 13B an...

07 May 2023 86

verbose-adventure