Improved-RAG-Architecture
Using LangChain as the framework
Improvements over the RAG architecture from my previous project include:
- Rather than running the model locally, I use the together.ai API so I don't wreck my laptop in the process (they also provide better models, so the prompt outcomes are far better). I can also tune the LLM parameters, or pick a different model, for my specific use case (see the together.ai sketch after this list).
- Fixing the chunking problem by using a Semantic Chunker instead of manual fixed-size chunking (chunking sketch below).
- Rewriting the query with an LLM before embedding it, so retrieval works on a cleaner, more precise query (query-rewriting sketch below).
- Combining semantic search (context), backed by the FAISS vector DB (Facebook AI Similarity Search), with lexical search (keywords) (hybrid retrieval sketch below).
- Applying a reranking + autocut step after retrieval for better output (reranking sketch below).
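
A minimal sketch of the first point: calling a model hosted on together.ai through LangChain instead of running it locally. It assumes the `langchain-together` integration package (`pip install langchain-together`) and a `TOGETHER_API_KEY` environment variable; the model name is only an example.

```python
from langchain_together import ChatTogether

# Example model name -- pick whichever together.ai model fits your use case.
llm = ChatTogether(
    model="meta-llama/Llama-3.3-70B-Instruct-Turbo",
    temperature=0.2,   # tunable: lower values give more deterministic answers
    max_tokens=512,    # tunable: cap on the length of the generated answer
)

print(llm.invoke("Summarize what RAG is in one sentence.").content)
```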
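For the chunking point, a sketch using LangChain's experimental `SemanticChunker`, which splits where the embedding similarity between sentences drops instead of at a fixed character count. The embedding model name and the file name are assumptions.

```python
from langchain_experimental.text_splitter import SemanticChunker
from langchain_together import TogetherEmbeddings

embeddings = TogetherEmbeddings(model="togethercomputer/m2-bert-80M-8k-retrieval")
chunker = SemanticChunker(embeddings, breakpoint_threshold_type="percentile")

with open("my_document.txt") as f:
    chunks = chunker.create_documents([f.read()])  # list of semantically coherent chunks
```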
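For query rewriting, a sketch that asks the LLM (the `llm` object from the together.ai sketch) to clean up the user's question before it gets embedded; the prompt wording is an assumption.

```python
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.output_parsers import StrOutputParser

rewrite_prompt = ChatPromptTemplate.from_template(
    "Rewrite the following question so it is self-contained and uses precise keywords "
    "for document retrieval. Return only the rewritten question.\n\nQuestion: {question}"
)
rewriter = rewrite_prompt | llm | StrOutputParser()

# The rewritten query, not the raw one, is what gets embedded and sent to the retriever.
rewritten = rewriter.invoke({"question": "how do i make it chunk better??"})
```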
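For hybrid search, a sketch that merges FAISS semantic retrieval with BM25 keyword retrieval through LangChain's `EnsembleRetriever`. It reuses `chunks` and `embeddings` from the chunking sketch, needs `pip install faiss-cpu rank_bm25`, and the 0.6/0.4 weights are just a starting point to tune.

```python
from langchain_community.vectorstores import FAISS
from langchain_community.retrievers import BM25Retriever
from langchain.retrievers import EnsembleRetriever

# Semantic (context) search over FAISS vectors.
vector_store = FAISS.from_documents(chunks, embeddings)
semantic_retriever = vector_store.as_retriever(search_kwargs={"k": 10})

# Lexical (keyword) search with BM25 over the same chunks.
lexical_retriever = BM25Retriever.from_documents(chunks)
lexical_retriever.k = 10

hybrid_retriever = EnsembleRetriever(
    retrievers=[semantic_retriever, lexical_retriever],
    weights=[0.6, 0.4],  # how much each search style contributes to the merged ranking
)
docs = hybrid_retriever.invoke(rewritten)
```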
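For reranking + autocut, a sketch that rescores the retrieved chunks with a cross-encoder and then trims the ranked list at the largest score gap (one common way to implement autocut). The cross-encoder model and the gap heuristic are assumptions, not the only way to do this.

```python
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

def rerank_with_autocut(query, docs):
    # Rescore every (query, chunk) pair and sort from most to least relevant.
    scores = reranker.predict([(query, d.page_content) for d in docs])
    ranked = sorted(zip(docs, scores), key=lambda pair: pair[1], reverse=True)
    if len(ranked) <= 1:
        return [d for d, _ in ranked]
    # Autocut: drop everything after the biggest drop between consecutive scores.
    gaps = [ranked[i][1] - ranked[i + 1][1] for i in range(len(ranked) - 1)]
    cut = gaps.index(max(gaps)) + 1
    return [d for d, _ in ranked[:cut]]

context_docs = rerank_with_autocut(rewritten, docs)  # what actually goes into the prompt
```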
RAGAS, a benchmarking framework for RAG architectures, could also be implemented in this project.
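
A sketch of what that evaluation could look like. The exact API shifts between ragas versions (and it needs a judge LLM/embeddings configured, e.g. via an API key), and the sample question/answer row is made up purely for illustration.

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import faithfulness, answer_relevancy, context_precision

eval_data = Dataset.from_dict({
    "question": ["What does the improved pipeline use for lexical search?"],
    "answer": ["It combines FAISS semantic search with BM25 keyword search."],
    "contexts": [[d.page_content for d in context_docs]],
    "ground_truth": ["BM25 keyword search alongside FAISS."],
})

result = evaluate(eval_data, metrics=[faithfulness, answer_relevancy, context_precision])
print(result)  # per-metric scores for the pipeline
```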
There are other RAG methods I'm really interested in, such as GraphRAG.
P.S.
- My semantic chunker runs very slowly; I advise you to just use a normal text splitter with a fixed size and overlap for faster performance (sketch after this list).
- If you don't want to host the DB locally, you can use a service like Pinecone or MongoDB Atlas and create a cluster there (Pinecone sketch below).
- I advise you to try other models for the inference and rewriter steps and find the best one for your use case.
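
The faster fallback from the first P.S. point is a plain fixed-size splitter with overlap; the chunk_size/chunk_overlap values below are only examples.

```python
from langchain_text_splitters import RecursiveCharacterTextSplitter

splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
with open("my_document.txt") as f:
    chunks = splitter.create_documents([f.read()])  # drop-in replacement for the semantic chunks
```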
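And a sketch of swapping local FAISS for a hosted Pinecone index, assuming `pip install langchain-pinecone`, a `PINECONE_API_KEY` environment variable, and an index already created in the Pinecone console (the index name is just an example).

```python
from langchain_pinecone import PineconeVectorStore

vector_store = PineconeVectorStore.from_documents(
    chunks, embeddings, index_name="improved-rag"
)
retriever = vector_store.as_retriever(search_kwargs={"k": 10})
```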