yuchenxie commited on
Commit
ea59133
·
verified ·
1 Parent(s): 8428b10

Update README.md

Browse files

Corrected doc to use

```python
--rope-scaling '{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}' --max-model-len 131072
```

Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -290,7 +290,7 @@ YaRN is currently supported by several inference frameworks, e.g., `transformers
290
 
291
  For `vllm`, you can use
292
  ```shell
293
- vllm serve ... --rope-scaling '{"type":"rope_type","factor":4.0,"original_max_position_embeddings":32768}' --max-model-len 131072
294
  ```
295
 
296
  For `sglang`, you can use
 
290
 
291
  For `vllm`, you can use
292
  ```shell
293
+ vllm serve ... --rope-scaling '{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}' --max-model-len 131072
294
  ```
295
 
296
  For `sglang`, you can use