Collaborative Multi-Modal Coding for High-Quality 3D Generation Paper • 2508.15228 • Published 19 days ago • 4
EgoTwin: Dreaming Body and View in First Person Paper • 2508.13013 • Published 21 days ago • 18
EgoTwin: Dreaming Body and View in First Person Paper • 2508.13013 • Published 21 days ago • 18
EgoTwin: Dreaming Body and View in First Person Paper • 2508.13013 • Published 21 days ago • 18 • 2
Has GPT-5 Achieved Spatial Intelligence? An Empirical Study Paper • 2508.13142 • Published 21 days ago • 32
4DNeX: Feed-Forward 4D Generative Modeling Made Easy Paper • 2508.13154 • Published 21 days ago • 58
Cut2Next: Generating Next Shot via In-Context Tuning Paper • 2508.08244 • Published 28 days ago • 13
DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Paper • 2508.00599 • Published Aug 1 • 6
Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity Paper • 2508.05609 • Published Aug 7 • 29
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5 • 50
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper • 2507.15028 • Published Jul 20 • 20