Update README.md #1
opened by KeyboardMasher

README.md CHANGED
@@ -103,7 +103,7 @@ As of llama.cpp build [b4282](https://github.com/ggerganov/llama.cpp/releases/ta
 Additionally, if you want to get slightly better quality for , you can use IQ4_NL thanks to [this PR](https://github.com/ggerganov/llama.cpp/pull/10541) which will also repack the weights for ARM, though only the 4_4 for now. The loading time may be slower but it will result in an overall speed increase.
 
 <details>
-<summary>Click to view Q4_0_X_X information (deprecated</summary>
+<summary>Click to view Q4_0_X_X information (deprecated)</summary>
 
 I'm keeping this section to show the potential theoretical uplift in performance from using the Q4_0 with online repacking.
 
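For context on the IQ4_NL recommendation quoted in the hunk above, here is a minimal, illustrative sketch of trying such a quant with llama.cpp's stock `llama-bench` and `llama-cli` binaries on an ARM machine. The model filename and thread count are placeholders, not part of this repository; per the README text and the linked PR, the repacking into an ARM-friendly layout happens while the model loads, which is why loading may be slower but subsequent inference faster.

```bash
# Placeholder filename: substitute the IQ4_NL GGUF you actually downloaded.
MODEL=model-IQ4_NL.gguf

# Benchmark prompt processing (-p) and token generation (-n) speed.
# On a supported ARM CPU the weights are repacked during model load.
./llama-bench -m "$MODEL" -p 512 -n 128 -t 8

# Or run the model directly.
./llama-cli -m "$MODEL" -p "Hello" -n 64 -t 8
```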