Baichuan-M2: Scaling Medical Capability with Large Verifier System Paper • 2509.02208 • Published Sep 2 • 39
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches Paper • 2508.08088 • Published Aug 11 • 29
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers Paper • 2505.19439 • Published May 26 • 30