Update README.md
README.md
````diff
@@ -16,18 +16,20 @@ They were created using the [convert.py script](https://github.com/chrisgoringe/
 They can be loaded in ComfyUI using the [ComfyUI GGUF Nodes](https://github.com/city96/ComfyUI-GGUF). Just put the gguf files in your
 models/unet directory.
 
+## Bigger numbers in the name = smaller model!
+
 ## Naming convention (mx for 'mixed')
 
 [original_model_name]_mxNN_N.gguf
 
-where NN_N is the approximate reduction in VRAM usage compared to the full 16 bit version.
-
+where NN_N is the approximate *reduction* in VRAM usage compared to the full 16 bit version.
+```
 - 9_0 might just fit on a 16GB card
 - 10_6 is a good balance for 16GB cards
 - 12_0 is roughly the size of an 8 bit model
 - 14_1 should work for 12 GB cards
 - 15_2 is fully quantised to Q4_1
-
+```
 ## How is this optimised?
 
 The process for optimisation is as follows:
````
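Read literally, the convention means an `_mx10_6` file should need roughly 10.6 GB less VRAM than the 16 bit original. A minimal sketch of that arithmetic follows; the filename, the `estimate_vram_gb` helper, and the ~23.8 GB figure for a full 16 bit Flux checkpoint are illustrative assumptions, not part of this repo:

```python
import re

def estimate_vram_gb(filename: str, full_16bit_gb: float) -> float:
    """Estimate VRAM for a [name]_mxNN_N.gguf file, treating NN_N as the
    approximate reduction in GB relative to the full 16 bit model."""
    m = re.search(r"_mx(\d+)_(\d+)\.gguf$", filename)
    if m is None:
        raise ValueError(f"{filename!r} does not match the mxNN_N convention")
    reduction_gb = float(f"{m.group(1)}.{m.group(2)}")  # e.g. '10', '6' -> 10.6
    return full_16bit_gb - reduction_gb

# Hypothetical example: a ~23.8 GB 16 bit model quantised as _mx10_6
# should land at roughly 23.8 - 10.6 = 13.2 GB.
print(f"{estimate_vram_gb('flux1-dev_mx10_6.gguf', 23.8):.1f} GB")
```

This also matches the guide list above: subtracting 12_0 from a ~23.8 GB model gives roughly the footprint of an 8 bit version.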