GitHub - alpers/FlexNeuART: Flexible classic and NeurAl Retrieval Toolkit

FlexNeuART (flex-noo-art)

Flexible classic and NeurAl Retrieval Toolkit, or shortly FlexNeuART (intended pronunciation flex-noo-art) is a substantially reworked knn4qa package. The overview can be found in our EMNLP OSS workshop paper: Flexible retrieval with NMSLIB and FlexNeuART, 2020. Leonid Boytsov, Eric Nyberg.

In Aug-Dec 2020, we used this framework to generate best traditional and/or neural runs in the MSMARCO Document ranking task. In fact, our best traditional (non-neural) run slightly outperformed a couple of neural submissions. The code for the best-performing neural model will be published within 2-3 months. This model is described in our ECIR 2021 paper: Boytsov, Leonid, and Zico Kolter. "Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits." ECIR 2021.

FlexNeuART is under active development. More detailed description and documentaion is to appear. Currently we have:

The framework supports data in generic JSONL format. We provide conversion (and in some cases download) scripts for the following collections:

For neural network training FlexNeuART incorporates a re-worked variant of CEDR (MacAvaney et al' 2019).

Name		Name	Last commit message	Last commit date
Latest commit History 1,479 Commits
data		data
lemur-code-r2792-RankLib-trunk		lemur-code-r2792-RankLib-trunk
lib		lib
scripts		scripts
src		src
testdata		testdata
trec_eval-9.0.7		trec_eval-9.0.7
.gitignore		.gitignore
INSTALL.md		INSTALL.md
LICENSE		LICENSE
LICENSE.RankLib		LICENSE.RankLib
README.md		README.md
build.sh		build.sh
install_packages.sh		install_packages.sh
knn4qa.md		knn4qa.md
pom.xml		pom.xml
rebuild_ranklib.sh		rebuild_ranklib.sh
requirements.txt		requirements.txt
trec_eval		trec_eval