Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation Paper • 2606.02684 • Published 4 days ago • 11
MapAgent: An Industrial-Grade Agentic Framework for City-scale Lane-level Map Generation Paper • 2606.04513 • Published 2 days ago • 14
AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation Paper • 2606.03972 • Published 3 days ago • 11
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories Paper • 2606.03979 • Published 3 days ago • 22
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation Paper • 2606.03159 • Published 3 days ago • 20
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 8 days ago • 136
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism Paper • 2606.00408 • Published 7 days ago • 60
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 12 days ago • 33
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration? Paper • 2606.01247 • Published 5 days ago • 28
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization Paper • 2606.02564 • Published 4 days ago • 29
StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration Paper • 2605.25659 • Published 11 days ago • 15
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 4 days ago • 16
When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs Paper • 2605.24202 • Published 14 days ago • 17
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published 9 days ago • 62
RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes Paper • 2606.00828 • Published 6 days ago • 10
Joint Agent Memory and Exploration Learning via Novelty Signals Paper • 2606.01528 • Published 4 days ago • 14
Speculative Pipeline Decoding: Higher-Accuracy and Zero-Bubble Speculation via Pipeline Parallelism Paper • 2605.30852 • Published 7 days ago • 10