ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning • Paper • 2503.19470 • Published Mar 25
Baichuan-M2: Scaling Medical Capability with Large Verifier System • Paper • 2509.02208 • Published 6 days ago