AI From Scratch/Lesson 47/~90 minutes
Checkpoint Save and Resume
Training interruptions kill runs; checkpoints let them resume. Save the model, optimizer, scheduler, loss history, step counter, and RNG state, and write them atomically, so that a kill at any moment leaves a valid file on disk.
Build/Python
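As a rough illustration of the atomic-save idea in the summary, here is a minimal sketch using only the Python standard library. The names `save_checkpoint`, `load_checkpoint`, and `demo` are hypothetical, and the plain dictionaries stand in for real model, optimizer, and scheduler state (in a framework such as PyTorch these would typically come from the objects' `state_dict()` methods). The sketch writes to a temporary file in the same directory, flushes it to disk, and only then renames it over the target, so the target path holds either the old checkpoint or the new one, never a partial write.

```python
import os
import pickle
import random
import tempfile


def save_checkpoint(state, path):
    """Write `state` to `path` atomically: temp file, fsync, then rename."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file must live in the same directory (same filesystem)
    # as the target so that the final rename is atomic.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as f:
            pickle.dump(state, f)
            f.flush()
            os.fsync(f.fileno())  # push the bytes to disk before renaming
        os.replace(tmp_path, path)  # atomic swap of old file for new
    except BaseException:
        # On any failure, drop the partial temp file; the old checkpoint
        # at `path` (if any) is left untouched.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def load_checkpoint(path):
    """Read back a checkpoint written by save_checkpoint."""
    with open(path, "rb") as f:
        return pickle.load(f)


def demo():
    """Save a toy training state, then resume and compare RNG output."""
    random.seed(0)
    state = {
        "model": {"w": [0.1, 0.2]},              # placeholder weights
        "optimizer": {"lr": 0.01},               # placeholder optimizer state
        "scheduler": {"last_step": 100},         # placeholder scheduler state
        "loss_history": [2.3, 1.9, 1.5],
        "step": 100,
        "rng": random.getstate(),
    }
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "ckpt.pkl")
        save_checkpoint(state, path)
        expected_next = random.random()          # what an uninterrupted run draws
        restored = load_checkpoint(path)
        random.setstate(restored["rng"])         # resume from the saved RNG state
        resumed_next = random.random()
        leftovers = [n for n in os.listdir(d) if n.endswith(".tmp")]
    return restored["step"], expected_next == resumed_next, leftovers
```

Restoring the RNG state along with the step counter is what lets a resumed run draw the same random numbers an uninterrupted run would have drawn. Note that `pickle` should only be used to load files you wrote yourself.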