DIP: Unsupervised Dense In-Context Post-training of Visual Representations Paper • 2506.18463 • Published 3 days ago • 17
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Paper • 2503.10618 • Published Mar 13 • 18
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing Paper • 2306.08707 • Published Jun 14, 2023 • 6
DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut Paper • 2406.02842 • Published Jun 5, 2024