ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning Paper • 2506.09513 • Published 26 days ago • 96
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30 • 132
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30 • 258
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Paper • 2506.09040 • Published 27 days ago • 36
tiantiaf/whisper-large-v3-msp-podcast-emotion-dim Audio Classification • 2B • Updated 26 days ago • 727 • 1
tiantiaf/whisper-large-v3-msp-podcast-emotion Audio Classification • 2B • Updated 26 days ago • 306 • 2
Vox-Profile Collection This collection includes the implementation of models described in Vox-Profile benchmark. (https://arxiv.org/pdf/2505.14648) • 14 items • Updated 26 days ago • 2