Concat-ID: Towards Universal Identity-Preserving Video Synthesis Paper • 2503.14151 • Published 7 days ago • 10
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting Paper • 2503.08677 • Published 13 days ago • 27
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Paper • 2503.10618 • Published 11 days ago • 17
Distilling Diversity and Control in Diffusion Models Paper • 2503.10637 • Published 11 days ago • 14
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published 10 days ago • 117
FlowTok: Flowing Seamlessly Across Text and Image Tokens Paper • 2503.10772 • Published 11 days ago • 16
Edit Transfer: Learning Image Editing via Vision In-Context Relations Paper • 2503.13327 • Published 7 days ago • 24
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published 8 days ago • 41
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation Paper • 2503.06053 • Published 17 days ago • 84
CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era Paper • 2503.12329 • Published 9 days ago • 23
GHOST 2.0: generative high-fidelity one shot transfer of heads Paper • 2502.18417 • Published 27 days ago • 64
Kanana: Compute-efficient Bilingual Language Models Paper • 2502.18934 • Published 27 days ago • 64
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Paper • 2502.20388 • Published 25 days ago • 15
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Paper • 2502.20126 • Published 25 days ago • 20
How far can we go with ImageNet for Text-to-Image generation? Paper • 2502.21318 • Published 24 days ago • 25
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation Paper • 2503.01739 • Published 21 days ago • 8
Magic 1-For-1: Generating One Minute Video Clips within One Minute Paper • 2502.07701 • Published Feb 11 • 34