Phase 04: Computer Vision
AI From Scratch/Lesson 14/~45 minutes

Vision Transformers (ViT)

Cut the image into patches, treat each patch as a word, run a standard transformer. Don't look back.

BuildPythonNo prerequisites
Loading lesson page...