Post
989
New work from Alibaba_Qwen🔥
Qwen2.5-Math-PRM 7B & 72B 🔢 Process Reward Models for enhanced process supervision in the mathematical reasoning of LLMs.
Paper:
The Lessons of Developing Process Reward Models in Mathematical Reasoning (2501.07301)
Model:
Qwen/Qwen2.5-Math-PRM-7B
Qwen/Qwen2.5-Math-PRM-72B
Qwen2.5-Math-PRM 7B & 72B 🔢 Process Reward Models for enhanced process supervision in the mathematical reasoning of LLMs.
Paper:
The Lessons of Developing Process Reward Models in Mathematical Reasoning (2501.07301)
Model:
Qwen/Qwen2.5-Math-PRM-7B
Qwen/Qwen2.5-Math-PRM-72B