R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training Paper • 2505.00358 • Published May 1, 2025 • 25
cshin23/multidim-rm_reg_avg_balanced_default_template Text Classification • Updated Jul 30, 2024 • 11
Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction Paper • 2407.03651 • Published Jul 4, 2024 • 18