affix_list.py: script for adding more affixes (prefixes and suffixes)
prefix.txt / suffix.txt: prefix and suffix dictionaries
infl2ori.txt: dictionary mapping inflected words to their base forms
text_utils.py: utility functions for preprocessing
preprocess.py: class that performs the preprocessing
word_embed.py: main script
Usage:
pip install -r requirement.txt
python3 word_embed.py
Note: affixes are matched from longest to shortest, so a longer affix takes priority over a shorter one that it contains.
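
The longest-first matching rule can be illustrated with a minimal sketch. The function name strip_affix and the inline affix lists are hypothetical and exist only for this example; the actual logic in word_embed.py may differ.

```python
def strip_affix(word, prefixes, suffixes):
    """Split word into (prefix, stem, suffix), trying longer affixes first.

    Hypothetical helper for illustration; not taken from word_embed.py.
    """
    # Sort by length, longest first, so "ness" is tried before "s".
    prefix = next(
        (p for p in sorted(prefixes, key=len, reverse=True)
         if word.startswith(p) and len(word) > len(p)),
        "",
    )
    rest = word[len(prefix):]
    suffix = next(
        (s for s in sorted(suffixes, key=len, reverse=True)
         if rest.endswith(s) and len(rest) > len(s)),
        "",
    )
    stem = rest[:len(rest) - len(suffix)]
    return prefix, stem, suffix


if __name__ == "__main__":
    # Example affix lists; the real ones would come from prefix.txt / suffix.txt.
    print(strip_affix("unhappiness", ["u", "un"], ["s", "ness"]))
```

Matching shortest-first would strip only "s" from "unhappiness" and leave "happines" as the stem, which is why the longer affix is tried first.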