Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Paper
•
2504.13837
•
Published
•
101
Starting from 2024-11-15
Enhance math problem solving by scaling test-time compute