Update README.md
README.md
CHANGED
@@ -8,11 +8,11 @@ tags:
 - quantization fix
 - vLLM
 base_model:
--
+- zai-org/GLM-4.5-Air
 base_model_relation: quantized
 ---
 # GLM-4.5-Air-AWQ-FP16Mix
-Base model: [
+Base model: [zai-org/GLM-4.5-Air](https://huggingface.co/zai-org/GLM-4.5-Air)
 
 ### 【vLLM Single Node with 8 GPUs Startup Command】
 <i>Note: You must use `--enable-expert-parallel` to start this model, otherwise the expert tensor TP will not divide evenly. This is required even for 2 GPUs.</i>
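
The README note about `--enable-expert-parallel` can be illustrated with a minimal sketch of a launch command. This is not the command from the README itself, which lies outside this hunk. The model path is a hypothetical placeholder, and the tensor-parallel size of 8 is an assumption taken from the "8 GPUs" heading; only the `--enable-expert-parallel` flag comes from the note.

```shell
# Hypothetical placeholder: replace with the real repository id or a local path.
MODEL="path/to/GLM-4.5-Air-AWQ-FP16Mix"

# Assumed from the "Single Node with 8 GPUs" heading; the note says the
# expert-parallel flag is needed even with 2 GPUs.
TP_SIZE=8

# Assemble the launch command as a string so it can be reviewed before running.
CMD="vllm serve $MODEL --tensor-parallel-size $TP_SIZE --enable-expert-parallel"

# Print the command; run it manually once the model path has been filled in.
echo "$CMD"
```

Printing the command instead of executing it keeps the sketch safe to run on a machine without GPUs or vLLM installed.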