AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE Pdf: https://arxiv.org/pdf/2010.11929.pdf Code for training ViT with cifar10 dataset. Run code Run following command to train ViT with cifar10 dataset. If you want to do hyper-parameter tuning, please refer options.py and utils.py python train.py