Phase 10: LLMs from Scratch
AI From Scratch/Lesson 06/~90 minutes

Instruction Tuning (SFT)

A base model predicts the next token. That's it. It doesn't follow instructions, answer questions, or refuse harmful requests. Supervised fine-tuning (SFT) is the bridge between a token predictor and a useful assistant. Every model you've ever talked to -- Claude,...

Build/Python (with numpy)/No prerequisites