Loading lesson page...
AI From Scratch/Lesson 21/~60 minutes
METR Time Horizons and External Capability Evaluation
METR (ex-ARC Evals) is an independent 501(c)(3) since December 2023. Their Time Horizon 1.1 benchmark (January 2026) fits a logistic curve to task-success probability vs log(expert human completion time); the intersection at 50% probabilit...
LearnNo prerequisites