bullerwins commited on
Commit
4306f75
·
verified ·
1 Parent(s): 10ec42d

Update README.md

Browse files

small repo update in the vllm command

Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -26,7 +26,7 @@ otherwise the expert tensors couldn’t be evenly sharded across GPU devices.</i
26
  ```
27
  CONTEXT_LENGTH=32768
28
  vllm serve \
29
- tclf90/GLM-4.6-AWQ \
30
  --served-model-name My_Model \
31
  --enable-auto-tool-choice \
32
  --tool-call-parser glm45 \
 
26
  ```
27
  CONTEXT_LENGTH=32768
28
  vllm serve \
29
+ QuantTrio/GLM-4.6-AWQ \
30
  --served-model-name My_Model \
31
  --enable-auto-tool-choice \
32
  --tool-call-parser glm45 \