Phase 18: Ethics, Safety & Alignment
AI From Scratch/Lesson 20/~60 minutes

Bias and Representational Harm in LLMs

Gallegos, Rossi, Barrow, Tanjim, Kim, Dernoncourt, Yu, Zhang, Ahmed (Computational Linguistics 2024, arXiv:2309.00770). Foundational 2024 survey distinguishing representational harms (stereotypes, erasure) from allocational harms (unequal...

Build
Loading lesson page...