gghfez commited on
Commit
0b36f63
·
verified ·
1 Parent(s): 5df58cf

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - CohereLabs/c4ai-command-a-03-2025
4
+ ---
5
+
6
+ # gghfez/command-a-03-2025-AWQ
7
+
8
+ Tested with vllm==0.10.1
9
+
10
+
11
+ ## Usage:
12
+
13
+ ```
14
+ vllm serve gghfez/command-a-03-2025-AWQ --port 8080 --host 0.0.0.0 --dtype bfloat16 --max-model-len 32768 -tp 4 --gpu-memory-utilization 0.9
15
+ ```
16
+