RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents Paper • 2507.03112 • Published 7 days ago • 28
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 17 days ago • 32 • 8