AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published 2 days ago • 21
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 2 days ago • 115
RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models Paper • 2601.03699 • Published 3 days ago • 5
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 4 days ago • 83
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation Paper • 2601.02204 • Published 5 days ago • 56
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 4 days ago • 93
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 5 days ago • 28
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 6 days ago • 34
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Paper • 2601.02256 • Published 5 days ago • 30
InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams Paper • 2601.02281 • Published 5 days ago • 28
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 9 days ago • 107
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 8 days ago • 48
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published 12 days ago • 17
Evaluating Parameter Efficient Methods for RLVR Paper • 2512.23165 • Published 13 days ago • 24
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published 12 days ago • 44
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models Paper • 2512.15560 • Published 24 days ago • 24