BigScienceBiasEval (BigScience WG for evaluation of bias, fairness, and social impact)

jaketae

authored 4 papers 4 months ago

What Language Model to Train if You Have One Million GPU Hours?

Paper • 2210.15424 • Published Oct 27, 2022 • 2

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 35

Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies

Paper • 2305.12586 • Published May 21, 2023

TESS 2: A Large-Scale Generalist Diffusion Language Model

Paper • 2502.13917 • Published Feb 19, 2025 • 6

jordiclive

authored 2 papers 8 months ago

Lessons from the Trenches on Reproducible Evaluation of Language Models

Paper • 2405.14782 • Published May 23, 2024 • 1

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 43

Shayne

authored a paper 8 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29, 2025 • 72

oskarvanderwal

authored 4 papers 9 months ago

Inseq: An Interpretability Toolkit for Sequence Generation Models

Paper • 2302.13942 • Published Feb 27, 2023 • 1

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model

Paper • 2310.12611 • Published Oct 19, 2023

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 35

Shayne

authored a paper 12 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

Shayne

authored 5 papers over 1 year ago

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Paper • 2406.16746 • Published Jun 24, 2024

Entity-Based Knowledge Conflicts in Question Answering

Paper • 2109.05052 • Published Sep 10, 2021

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Paper • 2301.13688 • Published Jan 31, 2023 • 9

MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering

Paper • 2007.15207 • Published Jul 30, 2020

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 123

manandey

authored a paper almost 2 years ago

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Paper • 2303.03915 • Published Mar 7, 2023 • 7

jordiclive

authored a paper almost 2 years ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 42

Shayne

authored a paper almost 2 years ago

On the Societal Impact of Open Foundation Models

Paper • 2403.07918 • Published Feb 27, 2024 • 17

Team members 11
