Human-like Linguistic Biases in Neural Speech Models: Phonetic Categorization and Phonotactic Constraints in Wav2Vec2.0 Paper • 2407.03005 • Published Jul 3, 2024
Inseq: An Interpretability Toolkit for Sequence Generation Models Paper • 2302.13942 • Published Feb 27, 2023
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper • 2304.01373 • Published Apr 3, 2023
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model Paper • 2310.12611 • Published Oct 19, 2023
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022
Probing LLMs for Joint Encoding of Linguistic Categories Paper • 2310.18696 • Published Oct 28, 2023
How far can bias go? -- Tracing bias from pretraining data to alignment Paper • 2411.19240 • Published Nov 28, 2024
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models Paper • 2405.13974 • Published May 22, 2024