This repo contains two datasets: CIFAR10 and CIFAR100 with human annotated complementary labels for complementary label learning tasks. We collect this complementary version of CIFAR100 in order to verify two important aspects of complementary label learning:
- The distribution of complementary label is independent to the feature.
- Collecting complementary labels can be cheaper than collecting ordinary labels when the number of classes is large.
Here are the links to the main page of each datasets, containing the analysis of collected labels and the benchmark results by libcll, a pytorch-lightning based machine learning library designed for complementary learning task:
# Download both CLCIFAR10 and CLCIFAR20
bash download.sh