Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning (Paper 2506.09033, published Jun 10)