Phase 15: Autonomous Systems
AI From Scratch/Lesson 21/~60 minutes

METR Time Horizons and External Capability Evaluation

METR (ex-ARC Evals) is an independent 501(c)(3) since December 2023. Their Time Horizon 1.1 benchmark (January 2026) fits a logistic curve to task-success probability vs log(expert human completion time); the intersection at 50% probabilit...

LearnNo prerequisites
Loading lesson page...