Phase 07: Transformers Deep Dive
AI From Scratch/Lesson 07/~75 minutes

GPT — Causal Language Modeling

BERT sees both sides. GPT sees only the past. The triangle mask is the most consequential single line of code in modern AI.

BuildPythonNo prerequisites
Loading lesson page...