The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset Paper โข 2303.03915 โข Published Mar 7, 2023 โข 7
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper โข 2211.05100 โข Published Nov 9, 2022 โข 32