Phase 18: Ethics, Safety & Alignment
AI From Scratch/Lesson 17/~60 minutes

WMDP and Dual-Use Capability Evaluation

Li et al., "The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning" (ICML 2024, arXiv:2403.03218). 4,157 multiple-choice questions across biosecurity (1,520), cybersecurity (2,225), and chemistry (412). Questions operate...

No prerequisites