try to guess NOGA/NACE code from the company's activity description by using n-gram analysis
there is some pre-trained data in the ngrams dir that the web api can use
cd ngrams/
bunzip2 1.gram.bz2
bunzip2 2.gram.bz2
bunzip2 3.gram.bz2
cd ../api/
python api.py
or go to http://localhost:1990/