Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Zihao-Li
AI & ML interests
Multilingual NLP
Recent Activity
updated
a dataset
3 days ago
Zihao-Li/Selfish
updated
a dataset
3 days ago
Helsinki-NLP/fineweb-edu-translated
updated
a dataset
3 days ago
Helsinki-NLP/fineweb-edu-translated