Committed by bullerwins
Commit: c6e5a43
Parent: df24c4e
Update README.md
README.md
CHANGED
@@ -18,12 +18,11 @@ license: llama3.1
 
 ---
 
-GGUF quantized version using llama.cpp
+GGUF quantized version using [llama.cpp 5e2727f](https://github.com/ggerganov/llama.cpp/commit/5e2727fe0321c38d1664d26173c654fa1801dc5f)
 
-
-I will requant once merged
+Update 24/07 - requantized with [fixed tokenizer ](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct/discussions/28/files)
 
-Update
+Update 28/07 - requantized with the [RoPE fix](https://github.com/ggerganov/llama.cpp/pull/8676), it should now be fully supported. You need to run [llama.cpp 5e2727f](https://github.com/ggerganov/llama.cpp/commit/5e2727fe0321c38d1664d26173c654fa1801dc5f) or higher
 
 ## Model Information
 