This repository is the official implementation of the NeurIPS 2020 paper: Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks
To create the environment, make sure python3 and virtualenv are installed, then run these commands inside the source code directory:
virtualenv -p python3 venv
source venv/bin/activate
pip3 install torch torchvision tqdm easydict scikit-learn
Use this Google Drive link to download all of the logits in pickle format.
Put the corresponding logits in the data folder.
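To check a downloaded file before training, a minimal sketch like the following may help. It assumes only that each file is a standard Python pickle. The helper name `load_logits` is hypothetical and not part of this repository, and the structure of the stored object is not documented here, so inspect it rather than relying on a particular layout.

```python
import pickle
from pathlib import Path


def load_logits(path):
    """Load one downloaded logits file and return whatever object it contains.

    Hypothetical helper: the layout of the stored object is not specified
    here, so the caller should inspect the result (type, keys, shapes).
    """
    # Only unpickle files you trust: pickle can execute arbitrary code on load.
    with Path(path).open("rb") as f:
        return pickle.load(f)
```

For example, `print(type(load_logits(path)))`, with `path` pointing at one of the files placed in the data folder, shows what kind of object was stored.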
The configs follow a path format built from the dataset, model, and method,
where dataset, model, and method should be replaced with the names of the desired dataset, model, and method
(e.g., "exp_dir/CIFAR100/ResNet110/OI/config.json").
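The path layout can be sketched in a few lines of Python. The pattern used here is inferred from the example path above. The helpers `config_path` and `load_config` are hypothetical names for illustration and are not functions from this repository.

```python
import json
from pathlib import Path


def config_path(dataset, model, method, root="exp_dir"):
    """Build the path to config.json for one dataset/model/method combination."""
    # Layout assumed from the example: <root>/<dataset>/<model>/<method>/config.json
    return Path(root) / dataset / model / method / "config.json"


def load_config(dataset, model, method, root="exp_dir"):
    """Read the JSON config as a plain dict."""
    with config_path(dataset, model, method, root).open() as f:
        return json.load(f)


print(config_path("CIFAR100", "ResNet110", "OI").as_posix())
# prints: exp_dir/CIFAR100/ResNet110/OI/config.json
```

The directory part of this path (without `config.json`) is what the `--exp_dir` argument expects.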
To train the calibrator network:
python --exp_dir exp_dir/{dataset}/{model}/{method}
To evaluate the trained networks:
python --exp_dir exp_dir/{dataset}/{model}/{method}
The results are saved in JSON format in the directory that contains the config. For example, "ensemble/post_metrics_test_ensemble_best_ece.json"
corresponds to the ECE values reported in Table 1 of the paper, and "cross_val_test_post_metrics_best_ece.json"
corresponds to the results without the ensemble (obtained by averaging the metrics over the different folds). Note that the results may differ slightly from the numbers reported in the paper due to randomness in training.
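The fold-averaging step can be illustrated with a short sketch. It assumes each fold's metrics are available as a flat dict of numbers. The metric keys and values below are hypothetical, and the repository's own result files may be structured differently.

```python
from statistics import mean


def average_over_folds(fold_metrics):
    """Average each metric across folds; every fold must report the same keys."""
    keys = fold_metrics[0].keys()
    return {k: mean(m[k] for m in fold_metrics) for k in keys}


# Hypothetical per-fold numbers, for illustration only.
folds = [
    {"ece": 0.010, "acc": 0.70},
    {"ece": 0.014, "acc": 0.72},
]
print(average_over_folds(folds))
```

Averaging per-fold metrics in this way differs from the ensemble result, where the folds are combined before the metrics are computed.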
There was a bug in our evaluation code. Our results slightly improved for the ECE metric after fixing the issue. Thanks to @futakw. The updated table is shown below:
If you make use of this code in your own work, please cite our paper:
title={Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks},
author={Rahimi, Amir and Shaban, Amirreza and Cheng, Ching-An and Hartley, Richard and Boots, Byron},
booktitle={Advances in Neural Information Processing Systems},