AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE

Code for training a Vision Transformer (ViT) on the CIFAR-10 dataset.

Run code

Run the following command to train ViT on the CIFAR-10 dataset. To tune hyper-parameters, see options.py and utils.py.

python train.py
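
When tuning hyper-parameters, it can help to know how the patch size changes what the model sees. ViT splits each image into non-overlapping square patches and treats every flattened patch as one token. The sketch below works out the token count and patch dimension for 32x32 RGB CIFAR-10 images. The helper name patch_grid and the patch sizes tried are illustrative assumptions, not code or defaults from this repository, and whether options.py exposes a patch size setting is also an assumption.

```python
def patch_grid(image_size: int, patch_size: int, channels: int = 3):
    """Return (num_patches, patch_dim) for a square image cut into square patches.

    Hypothetical helper for illustration only; it is not part of this repository.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side * patches_per_side   # sequence length, excluding any class token
    patch_dim = patch_size * patch_size * channels      # length of one flattened patch
    return num_patches, patch_dim


if __name__ == "__main__":
    # CIFAR-10 images are 32x32 with 3 channels; the patch sizes here are assumed examples.
    for p in (4, 8, 16):
        n, d = patch_grid(32, p)
        print(f"patch_size={p}: {n} patches, each of dimension {d}")
```

Smaller patches give longer sequences, which raises the cost of self-attention; with the 16x16 patches named in the title, a CIFAR-10 image yields only 4 tokens.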
