Running on Zero 108 108 Diffusion Self Distillation 🦀 Generate detailed images from an input image and text prompt
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published 11 days ago • 26
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 24 days ago • 51