Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ base_model:
 - **ROCm**: 7.0
 - **Operating System(s):** Linux
 - **Inference Engine:** [vLLM](https://docs.vllm.ai/en/latest/)
-- **Model Optimizer:** [AMD-Quark](https://quark.docs.amd.com/latest/index.html)
+- **Model Optimizer:** [AMD-Quark](https://quark.docs.amd.com/latest/index.html) (V0.9)
 - **Weight quantization:** OCP MXFP4, Static
 - **Activation quantization:** OCP MXFP4, Dynamic
 - **KV cache quantization:** OCP FP8, Static