Unified Thinker: A General Reasoning Modular Core for Image Generation Paper • 2601.03127 • Published 3 days ago • 7
Parallel Latent Reasoning for Sequential Recommendation Paper • 2601.03153 • Published 3 days ago • 2
OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs Paper • 2601.01592 • Published 5 days ago • 11
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 3 days ago • 26
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 5 days ago • 32
MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning Paper • 2512.23412 • Published 11 days ago • 36
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision Paper • 2601.03193 • Published 3 days ago • 38
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 3 days ago • 86
Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 2 days ago • 12
RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models Paper • 2601.03699 • Published 2 days ago • 5
SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning Paper • 2512.24330 • Published 10 days ago • 33
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 9 days ago • 104
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 10 days ago • 47
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space Paper • 2512.24617 • Published 9 days ago • 54
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation Paper • 2512.22905 • Published 12 days ago • 18