
HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text • 0.3B params • 137k downloads • 286 likes
Collection of models and demos for the even smoller SmolVLM release
Generate descriptions from images and text prompts
Find answers by describing images
Find image descriptions from visual inputs