image edit
CoLLM: A Large Language Model for Composed Image Retrieval Paper • 2503.19910 • Published about 19 hours ago • 6
t2v
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published Dec 24, 2024 • 19