APTOS 2019 Blindness Detection

Team name : DeepLearner
Member : Robert, Lin, Zino

This Rep. is "Selected Topics in Visual Recognition using Deep Learning" final project in 2020.

Hardware

Ubuntu 18.04 LTS
Intel(R) Core(TM) i7-7820X CPU @ 3.60GHz
31 RAM
NVIDIA GTX 1080 12G *1

ENV

CUDA 11.0
efficientnet-pytorch 0.7.0
torch 1.7.0+cu110
torchaudio 0.7.0
torchvision 0.8.1+cu110

Install env

# Pytorch 1.7.0  CUDA 11.2
pip3 install torch==1.7.0+cu110 torchvision==0.8.1+cu110 torchaudio===0.7.0 -f https://download.pytorch.org/whl/torch_stable.html

#  install requirements
pip3 install -r requirements.txt

Reproducing Submission

To reproduct my submission without retrainig, do the following steps:

Dataset Preparation
Training
Inference

Dataset Preparation

Download Kaggle Image

Data Link：

Download image file to your disk. Extract aptos2019-blindness-detection.zip and diabetic-retinopathy-detection.zip to DeepLearner and Pretrained directory.

Prepare Images

After downloading images and spliting json file, the data directory is structured as:

# 2015 Diabetic Retinopathy Detection images
 Pretrained/
  | +- sample/
  | +- train/
  | -- trainLabels.csv
  | -- sampleSubmission.csv

# APTOS 2019 Blindness Detection images
DeepLearner/
 | +-train_images/
 | +-test_images/
 | --train.csv
 | --test.csv
 | --sample_submission.csv

Training

Train models

This project used Pre-train model. But according the mmdetection doc, it used pre-train backbone restnet-50 on ImageNet.

Train a model with image size 512, and train another model with image size 256. Use the model 256 to make pseudo label on 2019 test dataset. Finally, fine-tune model 512 on 2019 training + 2019 testing with pseudo label.

Pretrain on 2015 dataset

python3 train_gem.py ~/Pretrained/train/  ~/Pretrained/trainLabels.csv cuda:1 --img_size 512 --pretrain

Fine-tune on 2019 dataset

python3 train_gem.py ~/DeepLearner/train_images/  ~/DeepLearner/train.csv cuda:1 --img_size 512 --model_path pretrain_weight_512/pretrain_model30.pth

Third stage Fine-tune on 2019 train + test with pseudo label

python3 train_gem.py ~/DeepLearner/train_images/  ~/DeepLearner/train.csv cuda:1 --img_size 512 --model_path pretrain_weight_512/pretrain_model30.pth --additional_csv_path ./soft_pseudo_label.csv --additional_image_path ~/DeepLearner/test_images/

Make pseudo label

python3 train_gem.py ~/Pretrained/train/  ~/Pretrained/trainLabels.csv cuda:1 --img_size 256 --pretrain
python3 train_gem.py ~/DeepLearner/train_images/  ~/DeepLearner/train.csv cuda:1 --img_size 256 --model_path pretrain_weight_256/pretrain_model30.pth
python make_pseudo_label.py

The expected training times are:

Model	GPUs	Image size	Training Epochs	Training Time
SE_restnet50	1x NVIDIA GTX 1080	256, 256	30	2 hours
SE_restnet101	1x NVIDIA GTX 2080	256, 256	150	190 mins

Inference

Inference and Make submission.csv

$ python3 test.py

Result

densenet121_bs64_90

ensemble dense+seres+seres512

seresnet50 512 pseudo label (BEST SCORE)

pseudo workflow

Train two SE-Resnet50 model which input size are 512 and 256 by 2015 and 2019 training data.
Used input size 256 of SE-Resnet50 to inference testing data and make pseudo-label.
Train input size 512 of SE-Resnet50 by 2019 training data and 2019 pseudo-label testing data.
Used input size 512 of SE-Resnet50 to make submission.csv.

Reference:

Gold Medal Solutions : Top 12 Ranking solutions.
Rank 12 4uiiurz1
pretrained-models : SE-Resnet 50 pretrained-models.
Self-training with Noisy Student improves ImageNet classification : CVPR 2020 pseudo-label.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
result		result
README.md		README.md
hard_pseudo_label.csv		hard_pseudo_label.csv
hard_pseudo_label_train.csv		hard_pseudo_label_train.csv
make_pseudo_label.py		make_pseudo_label.py
models.py		models.py
raw_code.py		raw_code.py
requirements.txt		requirements.txt
resize2015.py		resize2015.py
run.sh		run.sh
soft_pseudo_label.csv		soft_pseudo_label.csv
soft_pseudo_label_train.csv		soft_pseudo_label_train.csv
test.py		test.py
train_gem.py		train_gem.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

APTOS 2019 Blindness Detection

Hardware

ENV

Install env

Reproducing Submission

Dataset Preparation

Download Kaggle Image

Prepare Images

Training

Train models

Pretrain on 2015 dataset

Fine-tune on 2019 dataset

Third stage Fine-tune on 2019 train + test with pseudo label

Make pseudo label

Inference

Inference and Make submission.csv

Result

densenet121_bs64_90

ensemble dense+seres+seres512

seresnet50 512 pseudo label (BEST SCORE)

pseudo workflow

Reference:

About

Releases

Packages

Contributors 2

Languages

robert780612/DeepLearner

Folders and files

Latest commit

History

Repository files navigation

APTOS 2019 Blindness Detection

Hardware

ENV

Install env

Reproducing Submission

Dataset Preparation

Download Kaggle Image

Prepare Images

Training

Train models

Pretrain on 2015 dataset

Fine-tune on 2019 dataset

Third stage Fine-tune on 2019 train + test with pseudo label

Make pseudo label

Inference

Inference and Make submission.csv

Result

densenet121_bs64_90

ensemble dense+seres+seres512

seresnet50 512 pseudo label (BEST SCORE)

pseudo workflow

Reference:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages