Loading lesson page...
AI From Scratch/Lesson 06/~90 minutes
Instruction Tuning (SFT)
A base model predicts the next token. That's it. It doesn't follow instructions, answer questions, or refuse harmful requests. SFT is the bridge between a token predictor and a useful assistant. Every model you've ever talked to -- Claude,...
BuildPython (with numpy)No prerequisites