ResNet introduces skip connections to develop a more accurate visual recognition backbone.
We provide training and evaluation code for ResNet, along with pretrained models and configuration files, for the following tasks: image classification on ImageNet-1k and object detection on MS-COCO.
To train ResNet50 on ImageNet 1k with the advanced recipe, using a single node with 8 A100 GPUs, run the following command:
export CFG_FILE="projects/resnet/classification/resnet50_in1k.yaml"
corenet-train --common.config-file $CFG_FILE --common.results-loc classification_results
We assume that the training and validation data is located in the /mnt/imagenet/training and /mnt/imagenet/validation folders, respectively.
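Before launching a multi-GPU job, it can help to confirm that both folders exist. The following is a minimal sketch, assuming the default paths above; the helper name check_dataset_dirs is hypothetical and is not part of CoreNet.

```shell
# Hypothetical helper (not part of CoreNet): check that every directory
# passed as an argument exists, and report the ones that are missing.
check_dataset_dirs() {
  missing=0
  for dir in "$@"; do
    if [ ! -d "$dir" ]; then
      echo "missing directory: $dir" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Default locations assumed in this README; change them to match your setup.
if check_dataset_dirs /mnt/imagenet/training /mnt/imagenet/validation; then
  echo "dataset folders found"
else
  echo "dataset folders not found; update the paths before training"
fi
```

If your data lives elsewhere, pass your own paths to the helper and adjust the dataset locations used by the configuration accordingly.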
To evaluate the pretrained ResNet50 model on the ImageNet validation set, set MODEL_WEIGHTS to the pretrained weights linked in the results table below and run the following command:
export CFG_FILE="projects/resnet/classification/resnet50_in1k.yaml"
export DATASET_PATH="/mnt/vision_datasets/imagenet/validation/" # change to the ImageNet validation path
CUDA_VISIBLE_DEVICES=0 corenet-eval --common.config-file $CFG_FILE --model.classification.pretrained $MODEL_WEIGHTS --common.override-kwargs dataset.root_val=$DATASET_PATH
This should give
top1=80.37 || top5=95.056
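To compare your own run against the reported number, one option is to parse the metrics line and allow a small tolerance. This is only a sketch: the helper names extract_top1 and within_tolerance and the 0.1 tolerance are assumptions for illustration, not part of CoreNet, and the input is assumed to have the same shape as the line above.

```shell
# Hypothetical helper (not part of CoreNet): pull the top-1 value out of a
# metrics line shaped like "top1=80.37 || top5=95.056".
extract_top1() {
  printf '%s\n' "$1" | sed -n 's/.*top1=\([0-9.]*\).*/\1/p'
}

# Succeeds when |measured - expected| <= tolerance; awk does the float math.
within_tolerance() {
  awk -v m="$1" -v e="$2" -v t="$3" \
    'BEGIN { d = m - e; if (d < 0) d = -d; exit !(d <= t) }'
}

line="top1=80.37 || top5=95.056"
top1=$(extract_top1 "$line")

# The tolerance of 0.1 is an arbitrary choice for this sketch.
if within_tolerance "$top1" 80.37 0.1; then
  echo "top-1 matches the reported value: $top1"
else
  echo "top-1 differs from the reported value: $top1"
fi
```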
To train ResNet50 on MS-COCO using a single node with 8 A100 GPUs, run the following command:
export CFG_FILE="projects/resnet/detection/ssd_resnet50_coco.yaml"
corenet-train --common.config-file $CFG_FILE --common.results-loc detection_results
To evaluate the pretrained detection model on the COCO validation set, set MODEL_WEIGHTS to the pretrained weights linked in the results table below and run the following command:
export CFG_FILE="projects/resnet/detection/ssd_resnet50_coco.yaml"
CUDA_VISIBLE_DEVICES=0 corenet-eval-det --common.config-file $CFG_FILE --common.results-loc detection_results --model.detection.pretrained $MODEL_WEIGHTS --evaluation.detection.resize-input-images --evaluation.detection.mode validation_set
This should give
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.300
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.482
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.309
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.073
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.315
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.531
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.271
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.402
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.426
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.141
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.475
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.680
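The first line of this summary (AP at IoU=0.50:0.95, all areas, maxDets=100) is the value reported as 30.0 in the detection results table, expressed there as a percentage. A minimal sketch of that conversion, assuming the summary has been saved as text in the format above; the helper name primary_map_percent is hypothetical and is not part of CoreNet.

```shell
# Hypothetical helper (not part of CoreNet): read a COCO summary from stdin and
# print the primary AP (IoU=0.50:0.95, all areas, maxDets=100) as a percentage.
primary_map_percent() {
  grep -E 'Average Precision.*IoU=0\.50:0\.95.*area= *all.*maxDets=100' |
    awk '{ printf "%.1f\n", $NF * 100 }'
}

# Three lines copied from the summary above, used here as sample input.
summary='Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.300
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.482
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.426'

printf '%s\n' "$summary" | primary_map_percent   # prints 30.0
```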
| Model | Parameters | Top-1 | Pretrained weights | Config file | Logs |
| --- | --- | --- | --- | --- | --- |
| ResNet-34 | 21.8 M | 74.85 | Link | Link | Link |
| ResNet-50 | 25.6 M | 78.44 | Link | Link | Link |
| ResNet-101 | 44.5 M | 79.81 | Link | Link | Link |
| ResNet-34 (advanced recipe) | 21.8 M | 76.91 | Link | Link | Link |
| ResNet-50 (advanced recipe) | 25.6 M | 80.36 | Link | Link | Link |
| ResNet-101 (advanced recipe) | 44.5 M | 81.68 | Link | Link | Link |
| Model | Parameters | mAP | Pretrained weights | Config file | Logs |
| --- | --- | --- | --- | --- | --- |
| SSD ResNet-50 | 28.5 M | 30.0 | Link | Link | Link |
If you find our work useful, please cite the following papers:
title={Deep Residual Learning for Image Recognition},
author={Kaiming He and X. Zhang and Shaoqing Ren and Jian Sun},
journal={2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
author = {Mehta, Sachin and Abdolhosseini, Farzad and Rastegari, Mohammad},
title = {CVNets: High Performance Library for Computer Vision},
year = {2022},
booktitle = {Proceedings of the 30th ACM International Conference on Multimedia},
series = {MM '22}