GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains Paper • 2505.18700 • Published May 24 • 4
Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models Paper • 2507.17853 • Published 21 days ago • 1
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning Paper • 2508.07607 • Published 3 days ago • 1
Training-Free Watermarking for Autoregressive Image Generation Paper • 2505.14673 • Published May 20 • 12