Xiaolin Zhang
xiaolinz
AI & ML interests
None yet
Recent Activity
updated
a collection
14 days ago
DiLoCo
updated
a collection
17 days ago
DiLoCo
updated
a collection
17 days ago
DiLoCo
Organizations
Collections
2
-
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch
Paper • 2501.18512 • Published • 29 -
DiLoCo: Distributed Low-Communication Training of Language Models
Paper • 2311.08105 • Published • 15 -
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo
Paper • 2503.09799 • Published • 13 -
Muon is Scalable for LLM Training
Paper • 2502.16982 • Published
models
None public yet
datasets
None public yet