Waiting for the Qwen3-VL
What!?
I just wish Qwen 3 would be natively multimodal in this day and age, just like competitors like Gemma 3 and Llama 4 are. There's no need for separate models anymore; just pretrain the models on multiple modalities and make one model.
I wonder if the model is actually pretrained for multiple modalities, but the adapters/encoders are not released...
Unless the Qwen team can magically encode images using text tokens, that's just not possible.
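For what it's worth, here's a minimal sketch of why a released text-only checkpoint can't just consume images: a vision-language model needs a separate vision encoder plus a projection adapter that maps image patch features into the LLM's embedding space, so the "image tokens" are really projected embeddings, not vocabulary tokens. This is not Qwen's actual code; all module names and dimensions below are hypothetical.

```python
import torch
import torch.nn as nn

class VisionAdapter(nn.Module):
    """Hypothetical adapter projecting vision-encoder features into the LLM hidden size."""
    def __init__(self, vision_dim: int = 1024, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Linear(vision_dim, llm_dim)

    def forward(self, patch_features: torch.Tensor) -> torch.Tensor:
        # patch_features: (batch, num_patches, vision_dim) from a ViT-style encoder
        return self.proj(patch_features)  # (batch, num_patches, llm_dim)

# Hypothetical usage: projected image embeddings are concatenated with the
# text token embeddings before being fed to the transformer layers.
batch, num_patches, vision_dim, llm_dim = 1, 256, 1024, 4096
patch_features = torch.randn(batch, num_patches, vision_dim)   # stand-in for vision encoder output
text_embeddings = torch.randn(batch, 32, llm_dim)              # stand-in for the LLM embedding table output

adapter = VisionAdapter(vision_dim, llm_dim)
image_embeddings = adapter(patch_features)
inputs_embeds = torch.cat([image_embeddings, text_embeddings], dim=1)
print(inputs_embeds.shape)  # torch.Size([1, 288, 4096])
```

So without the encoder and adapter weights being released, the base LLM has no way to accept image inputs, even if it was exposed to multimodal data during pretraining.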
I think the same. What is the purpose of a non-multimodal model anyway? Coding and chatbots, like for customer service, etc., are the only specialized domains where I would think of a use case for a text-only LLM... But other than that, why waste time and money?