Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published 26 days ago • 29
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 26 days ago • 39
SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published 15 days ago • 18
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 10 days ago • 230
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 57