Image Generation - a zyf515730395 Collection

zyf515730395 's Collections

Video Understanding

MLLM

LLM

Image Generation

Video Generation

Image Generation

updated about 8 hours ago

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published 4 days ago • 38
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Paper • 2506.07986 • Published 4 days ago • 17
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Paper • 2506.06276 • Published 7 days ago • 19
Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published 8 days ago • 25
Image Editing As Programs with Diffusion Models

Paper • 2506.04158 • Published 9 days ago • 22
D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published 15 days ago • 34
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Paper • 2505.23758 • Published 15 days ago • 23
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Paper • 2505.18445 • Published 20 days ago • 63
DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 75
Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 88
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20 • 51
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published Apr 10 • 48
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published Apr 2 • 37
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 71
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Paper • 2503.07677 • Published Mar 10 • 86
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published Mar 10 • 36
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Paper • 2503.16418 • Published Mar 20 • 35
Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8 • 78
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Paper • 2504.20690 • Published Apr 29 • 19
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Paper • 2504.17789 • Published Apr 24 • 23
Seedream 3.0 Technical Report

Paper • 2504.11346 • Published Apr 15 • 64
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published Apr 11 • 47
PixelFlow: Pixel-Space Generative Models with Flow

Paper • 2504.07963 • Published Apr 10 • 19
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Paper • 2504.11455 • Published Apr 15 • 14
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation

Paper • 2504.09454 • Published Apr 13 • 12
OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 61