X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Paper • 2507.22058 • Published 4 days ago • 33
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels Paper • 2507.21809 • Published 4 days ago • 94
ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment Paper • 2507.19058 • Published 8 days ago • 12
TokensGen: Harnessing Condensed Tokens for Long Video Generation Paper • 2507.15728 • Published 12 days ago • 6
Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention Paper • 2507.17745 • Published 10 days ago • 30
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining Paper • 2507.14119 • Published 15 days ago • 53
"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models Paper • 2507.13428 • Published 16 days ago • 15
Taming generative video models for zero-shot optical flow extraction Paper • 2507.09082 • Published 22 days ago • 11
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second Paper • 2507.10065 • Published 19 days ago • 23
π^3: Scalable Permutation-Equivariant Visual Geometry Learning Paper • 2507.13347 • Published 16 days ago • 62
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper • 2507.05964 • Published 25 days ago • 113
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Paper • 2507.02813 • Published 30 days ago • 59
FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation Paper • 2506.18899 • Published Jun 23 • 5
FaSTA^*: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing Paper • 2506.20911 • Published Jun 26 • 40
Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales Paper • 2506.19713 • Published Jun 24 • 13