TransWebLLM
A collection of training corpora and models for "Multilingual Language Model Pretraining using Machine-translated Data".
britllm/TransWebLLM • 1B • Updated Apr 22 • 35
britllm/TransWebLLM-web • 1B • Updated Apr 22 • 12
britllm/TransWebLLM-cool • 1B • Updated Apr 22 • 17
Paper: Multilingual Language Model Pretraining using Machine-translated Data • 2502.13252 • Published Feb 18
CuatroLLM
A collection of training corpora and models for "Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language".
Paper: Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language • 2410.23956 • Published Oct 31, 2024 • 1
britllm/CuatroLLM • 1B • Updated Oct 28, 2024 • 9
britllm/TransWeb-Edu-German • Updated Nov 7, 2024 • 36M • 427 • 1
britllm/TransWeb-Edu-English • Updated Nov 7, 2024 • 36M • 348