The Inverted Index Builder creates an inverted index from a collection of documents. An inverted index maps each term to the documents that contain it, which makes it useful for information retrieval tasks such as finding the documents that contain specific words or phrases. The tool does the following:
- Parses a collection of documents.
- Tokenizes the content of each document.
- Builds an inverted index based on the tokens extracted from the documents.
- Supports preprocessing steps like stop word removal and stemming.
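The steps listed above can be sketched in a few lines of JavaScript. This is a minimal illustration under stated assumptions, not this project's actual implementation: the names `STOP_WORDS`, `tokenize`, `stem`, and `buildInvertedIndex` are hypothetical, the stop word list is deliberately tiny, and the stemmer is a naive suffix stripper standing in for a real algorithm such as Porter's.

```javascript
// Hypothetical stop word list; a real project would likely use a larger one.
const STOP_WORDS = new Set(["a", "an", "and", "in", "is", "of", "on", "the", "to"]);

// Lowercase the text and split it into alphanumeric tokens.
function tokenize(text) {
  return text.toLowerCase().match(/[a-z0-9]+/g) || [];
}

// Naive suffix stripping (an assumption for illustration, not a real stemmer).
function stem(token) {
  for (const suffix of ["ing", "ed", "es", "s"]) {
    if (token.length > suffix.length + 2 && token.endsWith(suffix)) {
      return token.slice(0, -suffix.length);
    }
  }
  return token;
}

// Map each stemmed term to the sorted list of document IDs that contain it.
function buildInvertedIndex(docs) {
  const index = new Map();
  for (const [docId, text] of Object.entries(docs)) {
    for (const token of tokenize(text)) {
      if (STOP_WORDS.has(token)) continue; // stop word removal
      const term = stem(token);
      if (!index.has(term)) index.set(term, new Set());
      index.get(term).add(docId);
    }
  }
  const result = {};
  for (const [term, ids] of index) result[term] = [...ids].sort();
  return result;
}

const docs = {
  doc1: "The cat is sleeping on the mat",
  doc2: "Cats and dogs",
};
console.log(buildInvertedIndex(docs));
```

Here "cat" and "Cats" collapse into the single term `cat`, so that term lists both documents, while stop words such as "the" and "and" never enter the index.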
To use the Inverted Index Builder, follow these steps:
- Clone this repository to your local machine:
git clone https://github.com/nasiruddinabubakar/inverted-index.git
- Install the dependencies:
cd inverted-index
npm install
- Start the application:
npm run start:dev
Once the application is running, you can use it to build an inverted index from a collection of documents. The documents should be stored in a format the application can read (e.g., text files).
Example usage:
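// Run this inside an async function or an ES module, since the calls are awaited.
const invertedIndex = new InvertedIndex();
// Build the index from the document collection.
await invertedIndex.BuildIndex();
// Retrieve the built index and print it.
const index = await invertedIndex.returnIndex();
console.log(index);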
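Once an index exists, finding the documents that contain specific words comes down to intersecting posting lists. The sketch below assumes the index is a plain object that maps each term to an array of document IDs. That shape is an assumption, since the actual return value of `returnIndex()` is not documented here, and `sampleIndex` and `searchAll` are hypothetical names used only for illustration.

```javascript
// Hypothetical index shape: term -> array of document IDs.
// The real return value of returnIndex() may differ.
const sampleIndex = {
  cat: ["doc1", "doc2"],
  dog: ["doc2"],
  mat: ["doc1"],
};

// Return the IDs of documents that contain every query term (boolean AND).
function searchAll(index, terms) {
  if (terms.length === 0) return [];
  // Look up each term's posting list; an unknown term yields an empty list.
  const postings = terms.map((term) => {
    const key = term.toLowerCase();
    return Object.prototype.hasOwnProperty.call(index, key) ? index[key] : [];
  });
  // Start from the shortest list to keep the intersection cheap.
  postings.sort((a, b) => a.length - b.length);
  return postings
    .slice(1)
    .reduce((acc, list) => acc.filter((id) => list.includes(id)), [...postings[0]]);
}

console.log(searchAll(sampleIndex, ["cat", "dog"])); // [ 'doc2' ]
console.log(searchAll(sampleIndex, ["cat", "bird"])); // []
```

If the index was built with stop word removal and stemming, query terms would need the same preprocessing before lookup, otherwise a query for "cats" would miss the stored term.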
Contributions are welcome! If you'd like to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Make your changes and commit them.
- Push your changes to your fork.
- Submit a pull request to the main repository.
This project is licensed under the MIT License - see the LICENSE file for details.
The Inverted Index Builder was created by Nasiruddin Abubakar.
For questions or inquiries, please contact [[email protected]].