Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 16 days ago • 51
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 6 items • Updated Dec 13, 2024 • 10
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 11
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning Paper • 2410.10801 • Published Oct 14, 2024
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20, 2024 • 42
LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives Paper • 2407.01490 • Published Jul 1, 2024 • 1
The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm Paper • 2406.18682 • Published Jun 26, 2024