calcuis/illustrious

calcuis committed on Jan 14

Commit f584dbd · verified · 1 Parent(s): 55d1558

Update README.md

Files changed (1)

README.md +1 -1

@@ -50,7 +50,7 @@ widget:
 ### **review**
 - use tag/word(s) as input for more accurate results for those legacy models; not very convenient (compare to the recent models) at the very beginning
 - credits should be given to those contributors from civitai platform
-- **fast-illustrious gguf** was quantized from **fp8** scaled safetensors while **illustrious gguf** was quantized from the original **bf16** (this is just an attempt to test: is it true? the trimmed model with 50% tensors lesser really load faster? please test it yourself; btw, some models might have their unique structure/feature affecting the loader/performance, not applicable for all)
+- **fast-illustrious gguf** was quantized from **fp8** scaled safetensors while **illustrious gguf** was quantized from the original **bf16** (this is just an attempt to test: is it true? the trimmed model with 50% tensors lesser really load faster? please test it yourself; btw, some models might have their unique structure/feature affecting the loader performance, never one size fits all)
 - fp8 scaled file works fine in this model; including vae and clips
 - good to run on old machines, i.e., 9xx series or before (legacy mode [--disable-cuda-malloc --lowvram] supported); compatible with the new gguf-node
 - **disclaimer**: some models (original files) are provided by someone else and we might not easily spot out the creator/contributor(s) behind; if it is your work, do let us know; we will address it back properly and probably; thanks for everything