license: apache-2.0 base_model: - mistralai/Devstral-Small-2505 pipeline_tag: image-text-to-text
The vision encoder is taken from Mistral Small, works out-of-the-box with llama.cpp
llama-server -hf ngxson/Devstral-Small-Vision-2505-GGUF
Output: