StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling Paper • 2507.05240 • Published 3 days ago • 36
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published 1 day ago • 47
GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning Paper • 2506.16141 • Published 21 days ago • 27
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control Paper • 2412.03568 • Published Dec 4, 2024
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 20 days ago • 21
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 20 days ago • 21 • 5
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 20 days ago • 21 • 5
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 20 days ago • 21
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 20 days ago • 21 • 5
AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation Paper • 2506.03126 • Published Jun 3 • 22
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning Paper • 2505.17022 • Published May 22 • 27