DSTC is a method that aims to cluster short text messages using CLIP and deep auto-encoder.
If you use it please cite it correctly
pip install -r requirements.txt
run clip-server
python -m clip_server
than
python DSTC.py --maxiter 1500 --pretrain_epochs 200 --ae_weights results/snippets/ae_weights.h5 --save_dir results/snippets/ --dataset search_snippets
python DSTC.py --maxiter 1500 --pretrain_epochs 200 --ae_weights results/biomedical/ae_weights.h5 --save_dir results/biomedical/ --dataset biomedical
python DSTC.py --maxiter 1500 --pretrain_epochs 200 --ae_weights results/stackoverflow/ae_weights.h5 --save_dir results/stackoverflow/
This code is based on repo from here.