- Clone https://github.com/nitishgupta/MTMSN to run evaluations
- Download the pre-trained model from https://github.com/huminghao16/MTMSN

To generate predictions on the DROP test set and the DROP contrast-sets test set:
- Use the following script: https://github.com/nitishgupta/MTMSN/blob/master/evaluate_full.sh
- Point DEV_DATA_JSON at either the DROP test set or the contrast-sets test data.
- Store the prediction files (PREDICTIONS_JSON) in a convenient location. We'll call them drop_full_test_predictions.json and drop_contrast_test_predictions.json.

Run:

    python consistency.py \
        --full_data_preds drop_full_test_predictions.json \
        --minimal_pairs_preds drop_contrast_test_predictions.json
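If a prediction file is missing or was only partially written, it is cheaper to find out before the consistency script starts. The sketch below is one way to check both files and assemble the command shown above. The helper names `load_predictions` and `build_consistency_command` are hypothetical and not part of either repository. It assumes only that the prediction files are valid JSON, which the `.json` extension suggests but this README does not state.

```python
import json
import sys
from pathlib import Path


def load_predictions(path):
    """Load a predictions file, failing early if it is missing or not valid JSON."""
    p = Path(path)
    if not p.is_file():
        raise FileNotFoundError(f"prediction file not found: {p}")
    with p.open(encoding="utf-8") as f:
        return json.load(f)


def build_consistency_command(full_preds, contrast_preds, script="consistency.py"):
    """Return the argv list for the consistency.py invocation.

    Only the two flags documented in this README are used; the default
    script path is an assumption about the current working directory.
    """
    # Parse both files first so a bad file is reported before anything runs.
    load_predictions(full_preds)
    load_predictions(contrast_preds)
    return [
        sys.executable, script,
        "--full_data_preds", str(full_preds),
        "--minimal_pairs_preds", str(contrast_preds),
    ]
```

The returned list can be passed to `subprocess.run(cmd, check=True)` from the directory that contains `consistency.py`.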