Sherlock: Self-Correcting Reasoning in Vision-Language Models • Paper • 2505.22651 • Published 17 days ago • 50
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing • Paper • 2505.21600 • Published 18 days ago • 70
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models • Paper • 2505.22617 • Published 17 days ago • 120
OmniSVG: A Unified Scalable Vector Graphics Generation Model • Paper • 2504.06263 • Published Apr 8 • 169