An Open Dataset and Model for Language Identification Paper ⢠2305.13820 ⢠Published May 23, 2023
The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT) Paper ⢠2210.11309 ⢠Published Oct 20, 2022
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper ⢠2503.10267 ⢠Published Mar 13