Finalize model quant list, fixing a grammar mistake and updating the F16 & F32 links
Browse files
README.md
CHANGED
@@ -31,8 +31,8 @@ If you are looking for a program to run this model with, then I would recommend
|
|
31 |
Clicking on a link will download the corresponding quant instantly.
|
32 |
| Link | Quant | Size | Notes
|
33 |
|:-----|:-----|--------:|:------|
|
34 |
-
| [GGML](https://huggingface.co/
|
35 |
-
| [GGML](https://huggingface.co/
|
36 |
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-q8_0.bin) | Q8_0 | 818 MB | Sweet spot; superficial quality loss at nearly double the speed. |
|
37 |
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-q6_k.bin) | Q6_K | 637 MB | |
|
38 |
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-q5_k.bin) | Q5_K | 537 MB | |
|
|
|
31 |
Clicking on a link will download the corresponding quant instantly.
|
32 |
| Link | Quant | Size | Notes
|
33 |
|:-----|:-----|--------:|:------|
|
34 |
+
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-f32.bin) | F32 | 3.03 GB | Likely overkill. |
|
35 |
+
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-f16.bin) | F16 | 1.52 GB | Performs better than Q8_0 for noisy audio and music. |
|
36 |
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-q8_0.bin) | Q8_0 | 818 MB | Sweet spot; superficial quality loss at nearly double the speed. |
|
37 |
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-q6_k.bin) | Q6_K | 637 MB | |
|
38 |
| [GGML](https://huggingface.co/Pomni/distil-large-v2-ggml-allquants/resolve/main/ggml-distil-large-v2-q5_k.bin) | Q5_K | 537 MB | |
|