AI From Scratch/Lesson 17/~60 minutes
WMDP and Dual-Use Capability Evaluation
Li et al., "The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning" (ICML 2024, arXiv:2403.03218). The benchmark comprises 4,157 multiple-choice questions across biosecurity (1,520), cybersecurity (2,225), and chemistry (412). Questions operate...
No prerequisites