Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published May 5 • 75
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 43
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 43
stable-diffusion-v1-5/stable-diffusion-inpainting Text-to-Image • Updated Sep 6, 2024 • 2.84M • 59
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18 • 40
diffusers/stable-diffusion-xl-1.0-inpainting-0.1 Text-to-Image • Updated Sep 3, 2023 • 825k • 337
stabilityai/stable-diffusion-xl-refiner-1.0 Image-to-Image • Updated Sep 25, 2023 • 499k • 1.91k
stabilityai/stable-diffusion-xl-base-1.0 Text-to-Image • Updated Oct 30, 2023 • 2.61M • • 6.65k