Semantic Textual Similarity Toolkits

This is the code by ECNU team submitted to SemEval STS Task.

Installation

# download the repo
git clone https://github.com/rgtjf/Semantic-Texual-Similarity-Toolkits.git
# download the dataset and stanford CoreNLP tools
sh download.sh
# run the demo
python demo.py

Results

you can configure sts_model.py to see the performance of different features on STSBenchmark dataset.

STSBenchmark

Methods	Dev	Test
RF	0.8333	0.7993
GB	0.8356	0.8022
EN-seven	0.8466	0.8100
----------------------	--------	--------
aligner	0.6991	0.6379
idf_aligner	0.7969	0.7622
BOWFeature-True	0.7584	0.6472
BOWFeature-False	0.7788	0.6874
nGramOverlapFeature	0.7817	0.7453
BOWFeature	0.7639	0.6847
AlignmentFeature	0.8163	0.7748
WordEmbeddingFeature	0.8011	0.7128

Reference

STSBenchmark board

Contacts

Any questions, please feel free to contact us: rgtjf1 AT 163 DOT com

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
data/stsbenchmark		data/stsbenchmark
stst		stst
submitted_runs		submitted_runs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
demo.py		demo.py
download.sh		download.sh
requirements.txt		requirements.txt
sts_model.py		sts_model.py
sts_model1.py		sts_model1.py
sts_tools.py		sts_tools.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic Textual Similarity Toolkits

Installation

Results

STSBenchmark

Reference

Contacts

About

Releases

Packages

Languages

License

menwa10/Semantic-Texual-Similarity-Toolkits

Folders and files

Latest commit

History

Repository files navigation

Semantic Textual Similarity Toolkits

Installation

Results

STSBenchmark

Reference

Contacts

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages