Baichuan-M2: Scaling Medical Capability with Large Verifier System • Paper • 2509.02208 • Published 6 days ago • 35
Baichuan-M2 • Collection • Beyond the Model: Scaling Medical Capability with a Large Verifier System • 4 items • Updated 5 days ago • 2
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers • Paper • 2505.19439 • Published May 26 • 31