Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 118
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 119
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published Mar 14 • 134
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 20 days ago • 124
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling Paper • 2503.21732 • Published 23 days ago • 8
Reconstructing Humans with a Biomechanically Accurate Skeleton Paper • 2503.21751 • Published 23 days ago • 9
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published 18 days ago • 82
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published 18 days ago • 61
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models Paper • 2504.03641 • Published 15 days ago • 13
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement Paper • 2504.03561 • Published 15 days ago • 17
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning Paper • 2504.02949 • Published 16 days ago • 19