ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published 4 days ago • 53
SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution Paper • 2506.19838 • Published 2 days ago • 10
Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales Paper • 2506.19713 • Published 2 days ago • 11
Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models Paper • 2506.18900 • Published 3 days ago • 3
3D Arena: An Open Platform for Generative 3D Evaluation Paper • 2506.18787 • Published 3 days ago • 11
Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset Paper • 2506.18851 • Published 3 days ago • 26
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details Paper • 2506.16504 • Published 7 days ago • 19
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published 6 days ago • 19
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published 6 days ago • 39
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published 3 days ago • 65
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published 3 days ago • 80
view article Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • 8 days ago • 64
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression Paper • 2506.09482 • Published 16 days ago • 46
Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling Paper • 2505.14521 • Published May 20 • 8
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published 10 days ago • 240
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models Paper • 2506.07177 • Published 18 days ago • 22
PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework Paper • 2506.10741 • Published 14 days ago • 27
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting Paper • 2506.09952 • Published 15 days ago • 7
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 16 days ago • 90