Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Zihao-Li

AI & ML interests
Multilingual NLP
Recent Activity
New activity, 5 days ago: Helsinki-NLP/Test: Upload LongCat-Flash technical report.pdf
Liked a model, 9 days ago: tencent/Hunyuan-MT-7B
Authored a paper, 13 days ago: EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models