Phase 19: Capstone Projects
AI From Scratch/Lesson 35/~90 minutes

GPT Model Assembly

Twelve blocks stacked, a token embedding, a learned position embedding, a final LayerNorm, and a tied language model head. That is the entire 124 million parameter GPT model. This lesson assembles those pieces into a working class, counts...

BuildPython
Loading lesson page...