Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper โข 2210.01970 โข Published Sep 30, 2022 โข 13
Datasets: A Community Library for Natural Language Processing Paper โข 2109.02846 โข Published Sep 7, 2021 โข 14
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset Paper โข 2303.03915 โข Published Mar 7, 2023 โข 7