Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models Paper • 2506.19103 • Published 19 days ago • 42
Listener-Rewarded Thinking in VLMs for Image Preferences Paper • 2506.22832 • Published 14 days ago • 24
Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback Paper • 2507.02321 • Published 9 days ago • 38
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper • 2507.05964 • Published 4 days ago • 88
MOVE: A Mixture-of-Vision-Encoders Approach for Domain-Focused Vision-Language Processing Paper • 2502.15381 • Published Feb 21
ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models Paper • 2505.22569 • Published May 28 • 56
FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention Paper • 2505.21144 • Published May 27 • 2
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization Paper • 2505.20975 • Published May 27 • 36
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5 • 127
Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards Paper • 2503.19948 • Published Mar 25
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper • 2503.16660 • Published Mar 20 • 73
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding Paper • 2502.03183 • Published Feb 5 • 4
GHOST 2.0: generative high-fidelity one shot transfer of heads Paper • 2502.18417 • Published Feb 25 • 67
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 175
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models Paper • 2311.05928 • Published Nov 10, 2023 • 1