The Invisible Leash: Why RLVR May Not Escape Its Origin • Paper • 2507.14843 • Published 3 days ago • 63
VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge • Paper • 2504.10342 • Published Apr 14 • 11