BBQ: A Hand-Built Bias Benchmark for Question Answering Paper • 2110.08193 • Published Oct 15, 2021
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions Paper • 2410.02584 • Published Oct 3, 2024
ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection Paper • 2203.09509 • Published Mar 17, 2022