Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 56
StyleBooth: Image Style Editing with Multimodal Instruction Paper • 2404.12154 • Published Apr 18, 2024
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Paper • 2408.05517 • Published Aug 10, 2024 • 2
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Paper • 2501.02487 • Published Jan 5
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 56
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 56
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Paper • 2501.02487 • Published Jan 5
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Paper • 2501.02487 • Published Jan 5
StyleBooth: Image Style Editing with Multimodal Instruction Paper • 2404.12154 • Published Apr 18, 2024
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos Paper • 2303.08345 • Published Mar 15, 2023
StyleBooth: Image Style Editing with Multimodal Instruction Paper • 2404.12154 • Published Apr 18, 2024
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval Paper • 2211.12764 • Published Nov 23, 2022
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer Paper • 2410.00086 • Published Sep 30, 2024 • 12
StyleBooth: Image Style Editing with Multimodal Instruction Paper • 2404.12154 • Published Apr 18, 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer Paper • 2410.00086 • Published Sep 30, 2024 • 12
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer Paper • 2410.00086 • Published Sep 30, 2024 • 12