Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Zihao-Li



AI & ML interests: Multilingual NLP
Recent Activity
New activity (7 days ago): mistralai/Magistral-Small-2509: [Suggestion] Create Model Collection on Homepage
Updated a dataset (16 days ago): Helsinki-NLP/fineweb-edu-translated
Updated a dataset (21 days ago): MultiSynt/nemotron-cc-translated-by-opus