Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing Paper • 2507.05259 • Published 26 days ago • 5
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Paper • 2504.15280 • Published Apr 21 • 25
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Paper • 2402.15504 • Published Feb 23, 2024 • 23
Magic-Me: Identity-Specific Video Customized Diffusion Paper • 2402.09368 • Published Feb 14, 2024 • 31
Meta-Personalizing Vision-Language Models to Find Named Instances in Video Paper • 2306.10169 • Published Jun 16, 2023 • 6