AI From Scratch/Lesson 01/~45 minutes
The Shift from Chatbots to Long-Horizon Agents
In 2023, a chatbot answered a question in one turn. In 2026, a frontier model routinely runs for minutes to hours on a single task. METR's Time Horizon 1.1 benchmark (January 2026) puts Claude Opus 4.6 at 14+ hours of expert work at 50% reliabil...