Update README.md
Browse files
README.md
CHANGED
|
@@ -35,7 +35,7 @@ model-index:
|
|
| 35 |
results: []
|
| 36 |
---
|
| 37 |
|
| 38 |
-
# Hermes 4 — Llama-3.1
|
| 39 |
|
| 40 |

|
| 41 |
|
|
@@ -192,9 +192,9 @@ For production serving on multi-GPU nodes, consider tensor parallel inference en
|
|
| 192 |
|
| 193 |
# Quantized / Smaller Variants
|
| 194 |
|
| 195 |
-
Hermes 4 is available as BF16 original weights as well as FP8 variants and GGUF variants by LM Studio.
|
| 196 |
|
| 197 |
-
|
| 198 |
|
| 199 |
GGUF: Coming soon courtesy of LM Studio team!
|
| 200 |
|
|
|
|
| 35 |
results: []
|
| 36 |
---
|
| 37 |
|
| 38 |
+
# Hermes 4 — Llama-3.1 70B - FP8
|
| 39 |
|
| 40 |

|
| 41 |
|
|
|
|
| 192 |
|
| 193 |
# Quantized / Smaller Variants
|
| 194 |
|
| 195 |
+
Hermes 4 is available as BF16 original weights as well as BF16 as well as FP8 variants and GGUF variants by LM Studio.
|
| 196 |
|
| 197 |
+
BF16: https://huggingface.co/NousResearch/Hermes-4-70B
|
| 198 |
|
| 199 |
GGUF: Coming soon courtesy of LM Studio team!
|
| 200 |
|