Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity Paper • 2510.02315 • Published 2 days ago • 5 • 2
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published 4 days ago • 76 • 5
Transformers Discover Molecular Structure Without Graph Priors Paper • 2510.02259 • Published 2 days ago • 4 • 2
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models Paper • 2510.01623 • Published 3 days ago • 5 • 2
Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression Paper • 2510.01581 • Published 3 days ago • 2
ModernVBERT: Towards Smaller Visual Document Retrievers Paper • 2510.01149 • Published 3 days ago • 26 • 2
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation Paper • 2509.26376 • Published 4 days ago • 7 • 2
AReUReDi: Annealed Rectified Updates for Refining Discrete Flows with Multi-Objective Guidance Paper • 2510.00352 • Published 4 days ago • 2
Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation Paper • 2510.02306 • Published 2 days ago • 1 • 2
Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness Paper • 2510.01670 • Published 3 days ago • 5 • 3
SKYLENAGE Technical Report: Mathematical Reasoning and Contest-Innovation Benchmarks for Multi-Level Math Evaluation Paper • 2510.01241 • Published 11 days ago • 3 • 2
Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs Paper • 2509.22582 • Published 8 days ago • 8 • 2
Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks Paper • 2510.02286 • Published 2 days ago • 19 • 2
RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning Paper • 2510.02240 • Published 2 days ago • 15 • 2
Spectral Scaling Laws in Language Models: How Effectively Do Feed-Forward Networks Use Their Latent Space? Paper • 2510.00537 • Published 4 days ago • 1 • 2
VideoNSA: Native Sparse Attention Scales Video Understanding Paper • 2510.02295 • Published 2 days ago • 7 • 2