Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Zihao-Li
AI & ML interests
Multilingual NLP
Recent Activity
liked
a model
6 days ago
mistralai/Magistral-Small-2506
new activity
12 days ago
MaLA-LM/PolyWrite:Update dataset card: MaLA Corpus, improved description, and updated citations
Organizations
Collections
1
models
38

Zihao-Li/V7-Bi-Code-Stag
Text Generation
•
Updated
•
46

Zihao-Li/V7-Bi-Code-Alt
Text Generation
•
Updated
•
15

Zihao-Li/V7-Bi-Code-Sel
Text Generation
•
Updated
•
14

Zihao-Li/V7-Mono-Alt
Text Generation
•
Updated
•
18

Zihao-Li/V7-Bi-Sel
Text Generation
•
Updated
•
15

Zihao-Li/V7-Bi-Stag
Text Generation
•
Updated
•
15

Zihao-Li/V7-Mono-Code-Alt
Text Generation
•
Updated
•
15

Zihao-Li/V7-Mono-Code-Sel
Text Generation
•
Updated
•
33

Zihao-Li/V7-Mono-Code-Stag
Text Generation
•
Updated
•
37

Zihao-Li/V7-Bi-Alt
Text Generation
•
Updated
•
15