Update README.md
Browse files
README.md
CHANGED
@@ -36,12 +36,12 @@ This repository provides **quantized runtime packages** of
|
|
36 |
|
37 |
- **main** — placeholder / landing page
|
38 |
- **W4A16** — 4‑bit weights / 16‑bit activations builds (AWQ W4A16) and related assets
|
39 |
-
- **
|
40 |
|
41 |
**Quick links:**
|
42 |
- 🔗 **[`main`](https://huggingface.co/TheHouseOfTheDude/GLM-Steam-106B-A12B-v1_Compressed-Tensors/tree/main)**
|
43 |
- 🔗 **[`W4A16`](https://huggingface.co/TheHouseOfTheDude/GLM-Steam-106B-A12B-v1_Compressed-Tensors/tree/W4A16)**
|
44 |
-
- 🔗 **[`
|
45 |
|
46 |
---
|
47 |
|
|
|
36 |
|
37 |
- **main** — placeholder / landing page
|
38 |
- **W4A16** — 4‑bit weights / 16‑bit activations builds (AWQ W4A16) and related assets
|
39 |
+
- **W8A16** — 8‑bit weights / 16‑bit activations builds
|
40 |
|
41 |
**Quick links:**
|
42 |
- 🔗 **[`main`](https://huggingface.co/TheHouseOfTheDude/GLM-Steam-106B-A12B-v1_Compressed-Tensors/tree/main)**
|
43 |
- 🔗 **[`W4A16`](https://huggingface.co/TheHouseOfTheDude/GLM-Steam-106B-A12B-v1_Compressed-Tensors/tree/W4A16)**
|
44 |
+
- 🔗 **[`W8A16`](https://huggingface.co/TheHouseOfTheDude/GLM-Steam-106B-A12B-v1_Compressed-Tensors/tree/INT8-W8A16)**
|
45 |
|
46 |
---
|
47 |
|