Baichuan-M1: Pushing the Medical Capability of Large Language Models Paper • 2502.12671 • Published Feb 18 • 1
Baichuan-M2: Scaling Medical Capability with Large Verifier System Paper • 2509.02208 • Published 6 days ago • 34
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers Paper • 2505.19439 • Published May 26 • 31