image support
According to the blog post on GitHub, Qwen3 supports text, image, video, and audio as input. According to the model card, it supports only text as input. Does it support image input? How do I start the model with an image adapter?
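One way to check locally, as a minimal sketch: look at the checkpoint's `config.json` for vision-related fields. The key names below are typical of Hugging Face VLM configs, but the exact set used here is an assumption, and the sample configs are illustrative, not the real Qwen3 files.

```python
def is_multimodal(config: dict) -> bool:
    """Heuristic: a config that declares a vision tower or multimodal
    projector is multimodal; a plain causal-LM config is text-only.
    Key names are common HF conventions, assumed here for illustration."""
    vision_keys = {"vision_config", "mm_projector", "image_token_index"}
    return any(k in config for k in vision_keys)

# A text-only causal-LM config (illustrative, not the actual Qwen3 config.json):
text_only = {"architectures": ["Qwen3ForCausalLM"], "hidden_size": 4096}
# A multimodal config would carry a vision section (also illustrative):
vlm = {"architectures": ["SomeVLForConditionalGeneration"], "vision_config": {}}

print(is_multimodal(text_only))  # False
print(is_multimodal(vlm))        # True
```

If the released config shows only a causal-LM architecture and no vision section, that would match the model card's text-only claim.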
I came to ask the same question. Looks like the released model is text generation only, while the online version supports multimodality.
Is this model—the open-weights one—trained for handling those inputs? If so, could we use an adapter or additional encoder with it?
Yes, wondering if they'll share the multimodal projector files later?
See as an example:
So there's no way to upload images to this model locally? What kind of nonsense is this? Open sourced? Even QwQ doesn't support images?
I guess Qwen Chat has a proprietary image encoder? It would be great if they shared that part.
The Qwen3 models in Qwen Chat solve my use case (which involves images) very well.
Hope they share this soon.
Better to use Devstral from Unsloth; it has vision, but it's aimed at coding. First vision + coding model; set temp to 0.05.
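For context on why such a low temperature matters: sampling divides the logits by the temperature before softmax, so temp 0.05 makes the distribution nearly one-hot (close to greedy decoding). A minimal sketch of that scaling, with made-up logits:

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax: logits / T, then normalize.
    Low T sharpens the distribution toward the argmax token."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Illustrative logits for three candidate tokens:
probs = softmax([2.0, 1.0, 0.5], temperature=0.05)
print(probs[0])  # nearly 1.0: the top token dominates at temp 0.05
```

That's why low temperatures are often recommended for coding models, where deterministic output is usually preferable.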