unsloth/Llama-3.2-90B-Vision-Instruct-bnb-4bit Image-to-Text β’ 47B β’ Updated Nov 22, 2024 β’ 6.06k β’ 19
microsoft/Phi-3-vision-128k-instruct Text Generation β’ 4B β’ Updated Aug 20, 2024 β’ 39.1k β’ 964
meta-llama/Meta-Llama-3-70B-Instruct Text Generation β’ 71B β’ Updated Jun 18 β’ 54.6k β’ β’ 1.49k
Running 554 554 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Text Generation β’ 16B β’ Updated Jul 3, 2024 β’ 777k β’ β’ 461