calcuis commited on
Commit
f584dbd
·
verified ·
1 Parent(s): 55d1558

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -50,7 +50,7 @@ widget:
50
  ### **review**
51
  - use tag/word(s) as input for more accurate results for those legacy models; not very convenient (compare to the recent models) at the very beginning
52
  - credits should be given to those contributors from civitai platform
53
- - **fast-illustrious gguf** was quantized from **fp8** scaled safetensors while **illustrious gguf** was quantized from the original **bf16** (this is just an attempt to test: is it true? the trimmed model with 50% tensors lesser really load faster? please test it yourself; btw, some models might have their unique structure/feature affecting the loader/performance, not applicable for all)
54
  - fp8 scaled file works fine in this model; including vae and clips
55
  - good to run on old machines, i.e., 9xx series or before (legacy mode [--disable-cuda-malloc --lowvram] supported); compatible with the new gguf-node
56
  - **disclaimer**: some models (original files) are provided by someone else and we might not easily spot out the creator/contributor(s) behind; if it is your work, do let us know; we will address it back properly and probably; thanks for everything
 
50
  ### **review**
51
  - use tag/word(s) as input for more accurate results for those legacy models; not very convenient (compare to the recent models) at the very beginning
52
  - credits should be given to those contributors from civitai platform
53
+ - **fast-illustrious gguf** was quantized from **fp8** scaled safetensors while **illustrious gguf** was quantized from the original **bf16** (this is just an attempt to test: is it true? the trimmed model with 50% tensors lesser really load faster? please test it yourself; btw, some models might have their unique structure/feature affecting the loader performance, never one size fits all)
54
  - fp8 scaled file works fine in this model; including vae and clips
55
  - good to run on old machines, i.e., 9xx series or before (legacy mode [--disable-cuda-malloc --lowvram] supported); compatible with the new gguf-node
56
  - **disclaimer**: some models (original files) are provided by someone else and we might not easily spot out the creator/contributor(s) behind; if it is your work, do let us know; we will address it back properly and probably; thanks for everything