Q2_K or similar

#7 by K17 - opened

Would it be possible to upload Q2_K quants, or describe the way those quants were created so I can quantize them myself without bothering you further?

I'm struggling to load both the HI and LO noise models into 24 GB of VRAM at once. With Wan2.2, I found that the combination of a Q2_K high-noise model and a Q6 low-noise model produces really good-looking images, but even the smallest quants in this repo cause an OOM when loaded together.
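For reference, a rough back-of-the-envelope estimate of why that combination can fit. The parameter count (~14B per noise model) and the bits-per-weight figures for the quant types are assumptions for illustration, not measured values:

```python
# Rough VRAM estimate for loading two quantized models at once.
# Parameter count and bits-per-weight values are assumptions, not measured.
PARAMS = 14e9  # assumed parameter count per noise model

def model_gib(params: float, bits_per_weight: float) -> float:
    """Approximate on-disk/in-VRAM model size in GiB at a given bits-per-weight."""
    return params * bits_per_weight / 8 / 1024**3

hi = model_gib(PARAMS, 2.6)   # roughly Q2_K
lo = model_gib(PARAMS, 6.6)   # roughly Q6_K
total = hi + lo

print(f"high-noise ~{hi:.1f} GiB, low-noise ~{lo:.1f} GiB, total ~{total:.1f} GiB")
assert total < 24  # leaves headroom for activations on a 24 GB card
```

Under these assumptions the pair lands around 15 GiB, which is why the Q2_K + Q6 combination fits where two larger quants do not.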

I've also tried following the link from the README, but there seems to be no quantization script there, and its readme suggests using llama-quantize, which consistently crashes with some variation of:

`tensor 'patch_embedding.weight' has invalid number of dimensions: 5 > 4`
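That error is a limitation of the GGUF format itself, which stores tensors of at most 4 dimensions, while `patch_embedding.weight` is a 5D Conv3d weight. A minimal illustration of the problem (the shape values are made up, and the reshape shown is just one way to flatten to a storable rank — it is not the repo's actual conversion code):

```python
import numpy as np

# A Conv3d patch-embedding weight: (out_ch, in_ch, t, h, w) -> 5 dims.
# Shape values here are illustrative, not taken from the real model.
w = np.zeros((1536, 16, 1, 2, 2), dtype=np.float32)
assert w.ndim == 5  # GGUF can store at most 4 dimensions -> quantizer rejects it

# One possible workaround: flatten trailing dims so the tensor fits in <= 4.
w4 = w.reshape(w.shape[0], -1)  # (1536, 64) -> storable
assert w4.ndim <= 4
# The original 5D shape must be recorded elsewhere to restore it at load time.
```

This is why the conversion tooling has to handle such tensors specially before `llama-quantize` ever sees them.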

Sorry, I’m not working on this at the moment.

You can find the full instructions for using the quantization tools here: https://github.com/city96/ComfyUI-GGUF/tree/main/tools#readme

More details on llama-quantize:
https://github.com/city96/ComfyUI-GGUF/tree/main/tools#quantizing-using-custom-llamacpp

There you'll need to:

  • clone the llama.cpp repository
  • check out the correct branch
  • apply lcpp.patch
  • build the llama-quantize binary
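Those steps can be sketched as shell commands. The branch name is a placeholder and the build invocation is an assumption — check the tools readme for the exact branch the patch applies against and the recommended build flags:

```shell
# Sketch only: exact branch/tag and build flags come from the tools readme.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
git checkout <branch-from-readme>          # placeholder: see tools readme
git apply ../ComfyUI-GGUF/tools/lcpp.patch # path assumes repos are siblings
cmake -B build
cmake --build build --target llama-quantize
```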
