Assignment for interview process at demandmatrix.com
-Extracted 520 company categories and their descriptions from g2crowd.com
-Cleaned the text of each company's Wikipedia page and website using NLTK
-Vectorised the text and used cosine similarity to find the category of any company from its name
-Install Python 3.6
-Install prerequisites: pip install bs4 wikipedia nltk urllib3
-Run the script
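
The category-matching step described above can be sketched roughly as follows. This is a minimal illustration only, not the script's actual code: it assumes plain term-count vectors and a simple regex tokenizer in place of the NLTK cleaning, and the function names, category descriptions and company text are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def best_category(company_text, categories):
    """Return the category whose description is closest to the company text."""
    company_vec = Counter(tokenize(company_text))
    scores = {
        name: cosine_similarity(company_vec, Counter(tokenize(description)))
        for name, description in categories.items()
    }
    return max(scores, key=scores.get)


# Hypothetical data standing in for the scraped category descriptions
# and for the cleaned Wikipedia/website text of one company.
categories = {
    "CRM": "software to manage customer relationships sales leads and contacts",
    "Accounting": "software to manage invoices payroll taxes and financial reports",
}
company_text = "A platform that helps sales teams track leads and customer contacts"

print(best_category(company_text, categories))
```

With real data, removing stop words and weighting terms (for example with TF-IDF) would likely give more reliable matches than raw counts, since common words such as "and" or "software" otherwise contribute to every score.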