Diversity-Incentivized Exploration for Versatile Reasoning Paper • 2509.26209 • Published 11 days ago • 15
Native Hybrid Attention for Efficient Sequence Modeling Paper • 2510.07019 • Published 3 days ago • 16
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration Paper • 2509.14760 • Published 23 days ago • 52