Follow these steps to use the text cleaning tool for transcripts provided by the ITL:
Begin by downloading the repository to your computer.
Rename your transcript file to file.txt and replace the existing file.txt in the repository with your file.
To run the script, you need a Python environment; you can run it from a terminal or command prompt. The general steps for running a Python script are listed here, with additional help available at this blog post:
- Go to the location of the script;
- Run the script.
cd ~/text-cleaner
python text-cleaner.py
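
If you prefer not to rename and copy the file by hand, the steps above can be sketched as a small helper. This is only an illustration, not part of the repository: the function names `stage_transcript` and `run_cleaner` are hypothetical, and it assumes the repository lives at ~/text-cleaner and that text-cleaner.py reads file.txt from its own folder.

```python
import shutil
import subprocess
import sys
from pathlib import Path


def stage_transcript(transcript: Path, repo_dir: Path) -> Path:
    """Copy the transcript into the repository as file.txt (hypothetical helper).

    Any existing file.txt in the repository is overwritten.
    """
    target = repo_dir / "file.txt"
    shutil.copyfile(transcript, target)
    return target


def run_cleaner(repo_dir: Path) -> int:
    """Run text-cleaner.py from inside the repository and return its exit code."""
    # cwd=repo_dir plays the role of `cd ~/text-cleaner` before running the script.
    result = subprocess.run([sys.executable, "text-cleaner.py"], cwd=repo_dir)
    return result.returncode


if __name__ == "__main__" and len(sys.argv) == 2:
    # Assumption: the repository was downloaded to ~/text-cleaner.
    repo = Path.home() / "text-cleaner"
    stage_transcript(Path(sys.argv[1]), repo)
    sys.exit(run_cleaner(repo))
```

Saved under a name of your choice, it would be called with the path to your transcript as its only argument. Your original transcript is left untouched, because the helper copies it rather than moving it.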