ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published 4 days ago • 53
SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution Paper • 2506.19838 • Published 2 days ago • 10
Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales Paper • 2506.19713 • Published 2 days ago • 11
Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models Paper • 2506.18900 • Published 3 days ago • 3
3D Arena: An Open Platform for Generative 3D Evaluation Paper • 2506.18787 • Published 3 days ago • 11
Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset Paper • 2506.18851 • Published 3 days ago • 26
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details Paper • 2506.16504 • Published 7 days ago • 19
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 6 days ago • 19
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published 6 days ago • 39
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published 3 days ago • 65
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published 3 days ago • 80