Step1X-Edit: A Practical Framework for General Image Editing Paper β’ 2504.17761 β’ Published 3 days ago β’ 68
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Paper β’ 2407.17438 β’ Published Jul 24, 2024 β’ 26
Personalized Text-to-Image Generation with Auto-Regressive Models Paper β’ 2504.13162 β’ Published 10 days ago β’ 17
DreamO: A Unified Framework for Image Customization Paper β’ 2504.16915 β’ Published 4 days ago β’ 16
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Paper β’ 2504.14509 β’ Published 7 days ago β’ 43
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework Paper β’ 2504.12395 β’ Published 11 days ago β’ 17
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper β’ 2504.12626 β’ Published 10 days ago β’ 48
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL Paper β’ 2504.11455 β’ Published 12 days ago β’ 12
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper β’ 2504.08388 β’ Published 16 days ago β’ 39
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper β’ 2504.08685 β’ Published 16 days ago β’ 122
FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation Paper β’ 2504.07405 β’ Published 17 days ago β’ 12
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper β’ 2504.05599 β’ Published 19 days ago β’ 81
One-Minute Video Generation with Test-Time Training Paper β’ 2504.05298 β’ Published 20 days ago β’ 99
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper β’ 2504.02160 β’ Published 24 days ago β’ 35
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving Paper β’ 2404.16771 β’ Published Apr 25, 2024 β’ 20
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Paper β’ 2403.13535 β’ Published Mar 20, 2024 β’ 24
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper β’ 2404.19427 β’ Published Apr 30, 2024 β’ 75
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability Paper β’ 2503.06505 β’ Published Mar 9 β’ 1