Image-Text-to-Text
Transformers
Safetensors
GGUF
English
qwen2_5_vl
remyx
qwen2.5-vl
spatial-reasoning
multimodal
vlm
vqasynth
thinking
reasoning
test-time-compute
robotics
embodied-ai
quantitative-spatial-reasoning
distance-estimation
visual-question-answering
conversational
Eval Results
text-generation-inference
File size: 133 Bytes
faa2a8c |
1 2 3 4 |
version https://git-lfs.github.com/spec/v1
oid sha256:c5289baa4e2b00730ff9967ed6f8c4a848ea4ec633729a91bfb7a5381af80beb
size 59002040
|