DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 6 days ago • 100
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper • 2503.10582 • Published 11 days ago • 20
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper • 2503.02951 • Published 20 days ago • 26