NLP Project to handle text summarization. Takes a huge amount of data, and it can summarize and produce the meaning in variable length as need by the user.
Model Used: Pegasus Samsum Pretrained Model available from hugging face fine tuned on different dataset. Tokenizer: Respective Tokenizer availabe from transformers library.
Clone the repository
https://github.com/stalker0418/TextSummarizer.git
python -m venv "Your environment name"
source "Your environment name"/scripts/activate
pip install -r requirements.txt
# Finally run the following command
python app.py
Now,
open up you local host and port
Default here: localhost:8080
If you want to train your model with your custom data, you can just change the data paths from config.yaml either by providing the source url or by downloading the zip file and providing that path in config.yaml for data extraction
The same way, if you want to change your model, or tokenizer, just approach config.yaml and update the values to your desired model names, and the workflow will be automatically be updated accordingly.