Image-Text-to-Text
Transformers
English
qwen2_vl
conversational