AI From Scratch / Lesson 27 / ~90 minutes
Capstone Lesson 27: Eval Harness with Fixture Tasks
A coding agent is only as good as the suite of tasks you measure it against. This lesson builds an evaluation harness that takes a folder of fixture tasks, runs each through a candidate agent, scores pass or fail through a deterministic ve...
Build / Python (stdlib) / No prerequisites
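To make the shape of such a harness concrete, here is a minimal sketch using only the standard library. It is not the lesson's implementation. The fixture format (one JSON file per task with `id`, `prompt`, and `expected` keys), the exact-match check, and the names `load_tasks`, `check`, `run_suite`, and `echo_agent` are all assumptions chosen for illustration.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable

# Hypothetical agent interface: takes a task prompt, returns its answer as text.
Agent = Callable[[str], str]


def load_tasks(folder: Path) -> list[dict]:
    """Load every *.json fixture in the folder, sorted by filename for a stable order."""
    return [
        json.loads(path.read_text(encoding="utf-8"))
        for path in sorted(folder.glob("*.json"))
    ]


def check(task: dict, output: str) -> bool:
    """Deterministic pass/fail check: exact match after trimming whitespace (an assumption)."""
    return output.strip() == task["expected"].strip()


def run_suite(folder: Path, agent: Agent) -> dict[str, bool]:
    """Run each fixture task through the agent and record pass (True) or fail (False)."""
    results: dict[str, bool] = {}
    for task in load_tasks(folder):
        try:
            output = agent(task["prompt"])
        except Exception:
            # An agent that crashes on a task fails that task; the run continues.
            results[task["id"]] = False
            continue
        results[task["id"]] = check(task, output)
    return results


def echo_agent(prompt: str) -> str:
    """Toy stand-in for a candidate agent: returns the prompt upper-cased."""
    return prompt.upper()


def demo() -> dict[str, bool]:
    """Write two hypothetical fixtures to a temporary folder and score the toy agent."""
    fixtures = [
        {"id": "upper-1", "prompt": "hello", "expected": "HELLO"},
        {"id": "upper-2", "prompt": "abc", "expected": "xyz"},
    ]
    with tempfile.TemporaryDirectory() as tmp:
        folder = Path(tmp)
        for fixture in fixtures:
            path = folder / f"{fixture['id']}.json"
            path.write_text(json.dumps(fixture), encoding="utf-8")
        return run_suite(folder, echo_agent)


if __name__ == "__main__":
    for task_id, passed in demo().items():
        print(f"{task_id}: {'PASS' if passed else 'FAIL'}")
```

Keeping the check a pure function of the task and the agent's output means the same agent output always yields the same score, so any change in results between runs can be attributed to the agent and not to the harness.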