Loading lesson page...
AI From Scratch/Lesson 06/~60 minutes
Automated Alignment Research (Anthropic AAR)
Anthropic ran parallel teams of Claude Opus 4.6 Autonomous Alignment Researchers in independent sandboxes, coordinating via a shared forum whose logs live outside any sandbox (so agents cannot delete their own records). On the weak-to-stro...
Learn