This repository is a collection of algorithms for multi-class classification to short texts using Python. This package is still under development. Modules are backward compatible unless otherwise specified. Feel free to give suggestions.
To install it, in a console, use pip
.
>>> pip install -U shorttext
The Word2Vec model is implemented with gensim
, and various neural networks with keras
, with a Theano
backend. Certain natural language processing tasks are implemented with spacy
and gensim
. This also imports numpy
, scipy
, and pandas
.
Upcoming changes here.
- 02/23/2016:
shorttext
0.2.1 released. - 12/21/2016:
shorttext
0.2.0 released. - 11/25/2016:
shorttext
0.1.2 released. - 11/21/2016:
shorttext
0.1.1 released.
- Documentation : https://pythonhosted.org/shorttext/
- Github: https://github.com/stephenhky/PyShortTextCategorization
- PyPI: https://pypi.python.org/pypi/shorttext
- "Python Package for Short Text Mining", WordPress
- An earlier version of this repository is a demonstration of the following blog post: Short Text Categorization using Deep Neural Networks and Word-Embedding Models