amd
/

Llama-3.3-70B-Instruct-FP8-KV

Model card Files Files and versions Community

bowenbaoamd commited on Dec 13, 2024

Commit

588fc7e

·

verified ·

1 Parent(s): b1d7e1f

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ base_model: meta-llama/Llama-3.3-70B-Instruct
 # Llama-3.3-70B-Instruct-FP8-KV
 - ## Introduction
-  This model was created by applying [Quark](https://quark.docs.amd.com/latest/index.html) with calibration samples from Pile dataset.
 - ## Quantization Stragegy
   - ***Quantized Layers***: All linear layers excluding "lm_head"
   - ***Weight***: FP8 symmetric per-tensor

 # Llama-3.3-70B-Instruct-FP8-KV
 - ## Introduction
+  This model was built with Llama by applying [Quark](https://quark.docs.amd.com/latest/index.html) with calibration samples from Pile dataset.
 - ## Quantization Stragegy
   - ***Quantized Layers***: All linear layers excluding "lm_head"
   - ***Weight***: FP8 symmetric per-tensor