NousResearch/Hermes-4-70B-FP8

teknium committed on Aug 26

Commit 6ae71c2 · verified · 1 Parent(s): e49bcd9

Update README.md

Files changed (1)

README.md +3 -3

@@ -35,7 +35,7 @@ model-index:
   results: []
 ---
-# Hermes 4 — Llama-3.1 70
+# Hermes 4 — Llama-3.1 70B - FP8
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/roT9o5bMYBtQziRMlaSDf.jpeg)
@@ -192,9 +192,9 @@ For production serving on multi-GPU nodes, consider tensor parallel inference en
 # Quantized / Smaller Variants
-Hermes 4 is available as BF16 original weights as well as FP8 variants and GGUF variants by LM Studio.
-FP8: https://huggingface.co/NousResearch/Hermes-4-70B-FP8
+Hermes 4 is available as BF16 original weights as well as BF16 as well as FP8 variants and GGUF variants by LM Studio.
+BF16: https://huggingface.co/NousResearch/Hermes-4-70B
 GGUF: Coming soon courtesy of LM Studio team!