a PyTorch implementation of Vision-Transformer "AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE" I will test the model on CIFAR 10, and the result will be added soon :)