Context size: 32K + 8K for output (40K total)
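As a quick illustration of that budget, the split can be expressed as a small helper. This is a sketch only; the exact counts of 32×1024 and 8×1024 tokens are assumptions, since the card states just "32K + 8K (40K total)":

```python
# Hypothetical token-budget helper. Exact counts are assumptions:
# the card states only "32K + 8K for output (40K total)".
INPUT_BUDGET = 32 * 1024   # context available for the prompt
OUTPUT_BUDGET = 8 * 1024   # context reserved for generation

def max_new_tokens(prompt_tokens: int) -> int:
    """Tokens still available for output, given the prompt length."""
    total = INPUT_BUDGET + OUTPUT_BUDGET   # 40K-token window overall
    return max(0, min(OUTPUT_BUDGET, total - prompt_tokens))

print(max_new_tokens(1000))   # prompt well under budget: full 8192 remain
```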
Use the Jinja template or the ChatML template.
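For the ChatML option, the prompt layout can be sketched as follows. This is a minimal illustration of standard ChatML wrapping; in practice the bundled Jinja chat template applies this formatting for you:

```python
# Minimal sketch of ChatML prompt formatting (illustrative only;
# a runtime's Jinja chat template normally does this wrapping).
def chatml_prompt(system: str, user: str) -> str:
    """Wrap a system + user turn in ChatML tags, leaving the
    assistant turn open for the model to complete."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

print(chatml_prompt("You are a helpful assistant.", "Hello!"))
```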
IMPORTANT NOTES:

- Due to the unique nature of this model (MoE architecture, size, and number of activated experts), GGUF quants can be run on the CPU, on the GPU, or with partial GPU offload, right up to full precision.
- This model is difficult to imatrix: you need a much larger, multi-language / multi-content imatrix dataset to generate an imatrix for it.
Please refer to the original model card for details, benchmarks, usage, settings, system roles, etc.:

[ https://huggingface.co/Qwen/Qwen3-30B-A3B ]