Load main text encoder and mmproj
I've downloaded the model from here, and everything else from the Qwen page. It works fine for the first generation, but crashes while trying to load the model on the second generation. I assume I have to pair a smaller text encoder with the 9GB model to avoid going over my 12GB capacity, right?
It might help noobs like me to mention that on the first page. I was following a video that only swapped the model, and skipped right to the table on the right, thinking the table on the left was just technical jargon, since there wasn't any text above it explaining what it is.
Also, an image with an embedded workflow is always a great help, if you care to include one. I can just drag it into ComfyUI and see what's needed for these specific models.
If you want the GGUF… yes. I mean:
- The GGUF files for the UNET are in this repo
- The text encoder (mmproj): I placed it here so you already have it with the correct name. (You can download the GGUF and the mmproj from the Unsloth repo too, but you'd need to rename it. The FP8 version is in the Comfy-Org repo.)
- The VAE is here just to make the download easier, like in other repos where people keep asking for the VAE. (You can download it from the Comfy-Org repo too.)
In short: you can just use the GGUF for the UNET, the FP8 version for the main text encoder (if you don't want to use GGUF for the text encoder), and the VAE.
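If it helps, here's a minimal sketch (assuming the default ComfyUI folder layout; the UNET and VAE filenames are hypothetical examples, so adjust them to whatever quants you actually downloaded) that checks the three files are where the loaders expect them:

```python
from pathlib import Path

# Assumed ComfyUI root and example filenames -- adjust to your install/quants.
ROOT = Path("ComfyUI/models")
expected = {
    "unet":         ROOT / "unet" / "Qwen-Image-Edit-Q4_K_M.gguf",  # hypothetical quant
    "text_encoder": ROOT / "text_encoders" / "qwen_2.5_vl_7b_fp8_scaled.safetensors",
    "vae":          ROOT / "vae" / "qwen_image_vae.safetensors",    # hypothetical name
}

for kind, path in expected.items():
    status = "OK" if path.is_file() else "MISSING"
    print(f"{kind:13s} {status:7s} {path}")
```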
Ah, I see.
I've downloaded the mmproj file and put it in the same folder as the original text encoder. Now I'm wondering how to use it.
It didn't show up in the CLIP list or the UNET loader. Do I need a specific node for it?
Yes. In the model card in this repo I mention where to place both text encoder files, and also which custom node you need to get the loader nodes that can load the GGUF files.
Ah, so that's what the first column means.
Another silly question: Do I need to install a custom node?
I've gone through my CLIP- and GGUF-related nodes, and the only one that could load the model was CLIPLoader (GGUF). But I get this error from the TextEncodeQwenImageEdit node: mat1 and mat2 shapes cannot be multiplied (77x768 and 3072x768)
Couldn't find anything in the custom nodes or a web search either. I even downloaded a couple of other workflows, to no avail.
Can you share a screenshot of the nodes you are using?
BTW: the custom node you should install is ComfyUI-GGUF. If you already have it, just make sure both ComfyUI (for QwenImageEdit support) and ComfyUI-GGUF (for mmproj support) are updated.
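For example, here's a minimal sketch of scripting that install/update (assuming ComfyUI sits in the current directory and the usual city96 GitHub repo for the custom node; cloning by hand or using ComfyUI-Manager works just as well):

```python
import subprocess
from pathlib import Path

# Assumes ComfyUI lives in the current directory -- adjust as needed.
custom_nodes = Path("ComfyUI/custom_nodes")
node_dir = custom_nodes / "ComfyUI-GGUF"

if node_dir.exists():
    # Update an existing install of the custom node.
    subprocess.run(["git", "-C", str(node_dir), "pull"], check=True)
else:
    # Fresh install of the custom node.
    subprocess.run(["git", "clone",
                    "https://github.com/city96/ComfyUI-GGUF",
                    str(node_dir)], check=True)

# Also update ComfyUI itself (for QwenImageEdit support).
subprocess.run(["git", "-C", "ComfyUI", "pull"], check=True)
```

ComfyUI-GGUF also has a small Python dependency (the gguf package); check its README for the exact pip command.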
These are the loaders you should have in the workflow. You can use the official ComfyUI workflow (you can find it in the ComfyUI templates tab); just replace the loader nodes. The mmproj needs to be placed in the same folder as your main text encoder... the node will detect and load the mmproj automatically (just select the main text encoder in the node, not the mmproj).
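Just to illustrate the same-folder convention (this is only a rough sketch of the idea, not the actual ComfyUI-GGUF detection code, and the quant filename is hypothetical):

```python
from pathlib import Path

def find_mmproj(main_encoder: Path) -> Path | None:
    """Look for an mmproj GGUF sitting next to the main text encoder.

    Rough illustration only: ComfyUI-GGUF's real matching logic may differ.
    """
    for sibling in main_encoder.parent.glob("*.gguf"):
        if "mmproj" in sibling.name.lower():
            return sibling
    return None

# Hypothetical quant filename -- the point is only that both files share a folder.
main = Path("ComfyUI/models/text_encoders/Qwen2.5-VL-7B-Instruct-Q4_K_M.gguf")
print(find_mmproj(main))  # -> .../Qwen2.5-VL-7B-Instruct-mmproj-BF16.gguf if present
```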
Sure, I'm using the workflow from the Qwen tutorial page, with the model loaders changed:
and here's the json file: https://filebin.net/qwwj07z760o0foic
Okay, I see what the problem is...
- For Qwen Image Edit, the text encoder part needs 2 files:
  - The main text encoder (Qwen2.5-VL-7B... whatever quant you want)
  - The mmproj: you have it, but with the wrong name... the name should be Qwen2.5-VL-7B-Instruct-mmproj-BF16 (like the one available to download in this same repo).
- The file you select in the CLIP loader should be the main text encoder, not the mmproj; the node is going to auto-detect and load the mmproj part.
Like I explain in the model card:
| Type | Name | Location | Download |
|---|---|---|---|
| Main Model | Qwen-Image | ComfyUI/models/unet | GGUF (this repo) |
| Main Text Encoder | Qwen2.5-VL-7B | ComfyUI/models/text_encoders | Safetensors / GGUF |
| Text_Encoder (mmproj) | Qwen2.5-VL-7B-Instruct-mmproj-BF16 | ComfyUI/models/text_encoders (same folder as your main text encoder) | GGUF (this repo) |
| VAE | Qwen-Image VAE | ComfyUI/models/vae | Safetensors (this repo) |
We include the VAE (to make it easy to find) and the mmproj (so you don't have to rename it) in this same repository.
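If your download did come with another name, the rename is trivial to script (a sketch only; the "downloaded" filename below is a placeholder, use whatever name your file actually has):

```python
from pathlib import Path

folder = Path("ComfyUI/models/text_encoders")
downloaded = folder / "mmproj-BF16.gguf"  # placeholder: whatever name it came with
expected = folder / "Qwen2.5-VL-7B-Instruct-mmproj-BF16.gguf"

# Rename only if the target name isn't already taken.
if downloaded.exists() and not expected.exists():
    downloaded.rename(expected)
```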
I thought the name might be the issue.
Even though I downloaded the file from this repo, it came with a different name. I renamed it, but I still get the same error.
When I select the original encoder in the GGUF loader, I get this error:
Mixing scaled FP8 with GGUF is not supported! Use regular CLIP loader or switch model(s)
(M:\ai\Qwen\models\text_encoders\qwen_2.5_vl_7b_fp8_scaled.safetensors)
The other error is that you didn't download the GGUF version of the main text encoder.
That's why I explained at the beginning: if you're going to use GGUF, you use the GGUF loader... if you're going to use safetensors, you shouldn't use the GGUF loader (the node has the same name, just without "GGUF").
If you use GGUF, you're still missing the main text encoder download (you have the mmproj, but not the main text encoder).
You should read the model card, where I include this information and try to simplify everything. GGUF is not the same as safetensors. GGUF is a format that lets you run models on computers with low resources, or when you don't want a model to consume so much of your PC's resources.
Check this video (not mine), where he explains it; you can listen and watch to understand it better:
https://youtu.be/hYlTteRXX4o?si=ULIMYQZQFMNTUl7s&t=176
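As a rough rule of thumb (my own back-of-the-envelope numbers, not from the video): size ≈ parameters × bits-per-weight ÷ 8. For a 7B text encoder that works out roughly like this:

```python
# Rough size estimates for a 7B-parameter text encoder at different quants.
# Bits-per-weight values are approximate rules of thumb, not exact figures.
PARAMS = 7e9
approx_bpw = {"Q4_K_M": 4.8, "Q8_0": 8.5, "FP8": 8.0, "BF16": 16.0}

for quant, bpw in approx_bpw.items():
    gb = PARAMS * bpw / 8 / 1024**3
    print(f"{quant:7s} ~{gb:.1f} GB")
```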
Ah, I see.
The inclusion of the safetensors in the table threw me for a loop. Now I get it.
Should I match the quant of the text encoder to the one in my UNET loader?
Yes, like I explain in the model card in this repo... there you have the hyperlinks to download the GGUF text encoder, but if you want to use safetensors, you also have the hyperlink for that (if you use safetensors, you don't need the mmproj GGUF):
| Type | Name | Location | Download |
|---|---|---|---|
| Main Model | Qwen-Image | ComfyUI/models/unet | GGUF (this repo) |
| Main Text Encoder | Qwen2.5-VL-7B | ComfyUI/models/text_encoders | Safetensors / GGUF |
| Text_Encoder (mmproj) | Qwen2.5-VL-7B-Instruct-mmproj-BF16 | ComfyUI/models/text_encoders (same folder as your main text encoder) | GGUF (this repo) |
| VAE | Qwen-Image VAE | ComfyUI/models/vae | Safetensors (this repo) |
Last question (hopefully).
Are all of these models loaded into VRAM together?
With the 9GB UNET model, my 12GB of VRAM is maxed out. Would switching to a quantized text encoder be any help? Or should I keep running the safetensors version on the CPU?
From what I know, yes, it uses both VRAM and RAM... but I always try to choose a combination that fits in my GPU.
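One way to sanity-check a combination before loading it (just a sketch; the filenames are hypothetical picks, and on-disk size is only a lower bound, since inference needs extra VRAM on top for activations):

```python
from pathlib import Path

# Hypothetical picks -- point these at the files you actually downloaded.
files = [
    Path("ComfyUI/models/unet/Qwen-Image-Edit-Q4_K_M.gguf"),
    Path("ComfyUI/models/text_encoders/Qwen2.5-VL-7B-Instruct-Q4_K_M.gguf"),
    Path("ComfyUI/models/text_encoders/Qwen2.5-VL-7B-Instruct-mmproj-BF16.gguf"),
    Path("ComfyUI/models/vae/qwen_image_vae.safetensors"),
]

# Sum the sizes of whatever files exist and compare against the VRAM budget.
total_gb = sum(f.stat().st_size for f in files if f.exists()) / 1024**3
print(f"Total on-disk size: {total_gb:.1f} GB (vs 12 GB VRAM budget)")
```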