MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning Paper • 2505.24871 • Published May 30 • 21
HardTests: Synthesizing High-Quality Test Cases for LLM Coding Paper • 2505.24098 • Published May 30 • 44
xLAM-2 Collection A family of Large Action Models for multi-turn conversation and tool-use • 10 items • Updated May 5 • 18