'Inverto' is an innovative inverted-index search engine that uses effective compression technologies to speed up query processing.
Inverto is an advanced inverted-index search engine designed to optimize query processing speed through the application of effective compression technologies. This project encompasses the entire lifecycle of a search engine, from data ingestion to query execution, ensuring high performance and efficient data management.
The initial step involves ingesting raw data and preprocessing it to extract relevant information. This data is then tokenized, filtered, and normalized to ensure consistency.
An inverted index is constructed from the preprocessed data. This involves mapping each term to its corresponding document occurrences, creating a highly efficient data structure for quick lookups.
Effective compression techniques are applied to the inverted index to reduce storage requirements and improve query processing speed. Various algorithms are evaluated to determine the most suitable one for the dataset and query patterns.
The search engine processes user queries by efficiently retrieving relevant documents from the compressed inverted index. Advanced algorithms are employed to rank and return the most relevant results.
The front-end interface, designed using Qt, allows users to interact with the search engine seamlessly. It supports various query types and provides an intuitive way to view and navigate the search results.
Clone the Repository:
git clone https://github.com/themihirmathur/Inverto.git
cd Inverto
Install Dependencies:
pip install -r requirements.txt
Setup MongoDB:
Run the Application:
python main.py
Indexing Data:
Executing Queries:
We welcome contributions to improve Inverto. Please follow these steps to contribute:
For any inquiries or feedback, please contact Mihir Mathur at [email protected].
Thank you for using Inverto! We hope it provides a robust and efficient solution for your search engine needs.