How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions Paper • 2506.16679 • Published Jun 20 • 1
UniFusion: Vision-Language Model as Unified Encoder in Image Generation Paper • 2510.12789 • Published 11 days ago • 16
UniFusion: Vision-Language Model as Unified Encoder in Image Generation Paper • 2510.12789 • Published 11 days ago • 16