AI From Scratch/Lesson 01/~45 minutes

The Shift from Chatbots to Long-Horizon Agents

In 2023 a chatbot answered a question in one turn. In 2026 a frontier model routinely runs minutes to hours on a single task. METR's Time Horizon 1.1 benchmark (January 2026) puts Claude Opus 4.6 at 14+ hours of expert work at 50% reliabil...

Learn

Loading lesson page...