-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 464 -
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Paper • 2509.25541 • Published • 138 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 260 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 136
shen
sean29
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
todo
updated
a collection
21 days ago
todo
updated
a collection
22 days ago
todo
Organizations
None yet