Introduction to Vision Transformers with PyTorch

Table of Contents:
- ViT in PyTorch
- Convolution vs Attention
- Script and logs for ViT training on CIFAR10

Refs:
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
- Training data-efficient image transformers & distillation through attention
- DeiT code