teknium commited on
Commit
6ae71c2
·
verified ·
1 Parent(s): e49bcd9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -35,7 +35,7 @@ model-index:
35
  results: []
36
  ---
37
 
38
- # Hermes 4 — Llama-3.1 70
39
 
40
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/roT9o5bMYBtQziRMlaSDf.jpeg)
41
 
@@ -192,9 +192,9 @@ For production serving on multi-GPU nodes, consider tensor parallel inference en
192
 
193
  # Quantized / Smaller Variants
194
 
195
- Hermes 4 is available as BF16 original weights as well as FP8 variants and GGUF variants by LM Studio.
196
 
197
- FP8: https://huggingface.co/NousResearch/Hermes-4-70B-FP8
198
 
199
  GGUF: Coming soon courtesy of LM Studio team!
200
 
 
35
  results: []
36
  ---
37
 
38
+ # Hermes 4 — Llama-3.1 70B - FP8
39
 
40
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/roT9o5bMYBtQziRMlaSDf.jpeg)
41
 
 
192
 
193
  # Quantized / Smaller Variants
194
 
195
+ Hermes 4 is available as BF16 original weights as well as BF16 as well as FP8 variants and GGUF variants by LM Studio.
196
 
197
+ BF16: https://huggingface.co/NousResearch/Hermes-4-70B
198
 
199
  GGUF: Coming soon courtesy of LM Studio team!
200