HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit • Zero-Shot Image Classification • Updated Mar 7, 2024 • 7.27k • 43
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding • Paper • 2501.05452 • Published 9 days ago • 14
The GAN is dead; long live the GAN! A Modern GAN Baseline • Paper • 2501.05441 • Published 9 days ago • 77
timm/vit_base_patch16_clip_224.openai • Image Feature Extraction • Updated Oct 23, 2024 • 254k • 6
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control • Paper • 2501.01427 • Published 16 days ago • 49
xinyu1205/recognize-anything-plus-model • Zero-Shot Image Classification • Updated Oct 25, 2023 • 37