GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning Paper • 2504.00891 • Published 15 days ago • 12 • 3
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 13 days ago • 52 • 4
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Paper • 2503.21614 • Published 20 days ago • 39 • 4
Effectively Controlling Reasoning Models through Thinking Intervention Paper • 2503.24370 • Published 16 days ago • 18 • 4
OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning Paper • 2503.16081 • Published 27 days ago • 26 • 3
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published 18 days ago • 45 • 3
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper • 2503.24290 • Published 16 days ago • 61 • 3
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback Paper • 2503.21332 • Published 20 days ago • 20 • 3
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback Paper • 2503.21332 • Published 20 days ago • 20 • 3