Update README.md
README.md
CHANGED
@@ -35,11 +35,11 @@ A high-performance mixture-of-experts language model optimized for efficiency, c

 | Metric | aquif-3-moe (17B a2.8B) | Phi-4 (14B) | Qwen3 (14B) | Gemma 3 (27B) | GPT-4.1 nano (Propr.) | Mistral Small 3.2 (24B) |
 |--------|-------------------------|-------------|-------------|---------------|----------------------|-------------------------|
-| MMLU (General Knowledge) |
-| LiveCodeBench (Coding) | 28.6 | 25.2 |
-| MATH-500 (Math) | **91.4** | 80.8 |
-| GPQA Diamond (Science) | **56.7** |
-| **Average** | **65.0** | 61.7 |
+| MMLU (General Knowledge) | _83.2_ | **84.8** | 82.0 | 78.6 | 80.1 | 80.5 |
+| LiveCodeBench (Coding) | 28.6 | 25.2 | _29.0_ | 26.9 | **32.6** | 27.5 |
+| MATH-500 (Math) | **91.4** | 80.8 | _89.8_ | 88.3 | 84.8 | 88.3 |
+| GPQA Diamond (Science) | **56.7** | _56.1_ | 54.8 | 42.8 | 50.3 | 50.5 |
+| **Average** | **65.0** | 61.7 | _63.9_ | 59.2 | 62.0 | 61.7 |

 ## Key Strengths

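The **Average** row in the added lines looks like the unweighted mean of the four benchmark scores in each column, rounded half-up to one decimal place. The README does not state how it was computed, so treat that as an assumption. A minimal Python sketch to check it (the names `MODELS`, `SCORES`, `REPORTED`, and `column_averages` are illustrative, not from the repository):

```python
from decimal import Decimal, ROUND_HALF_UP

# Scores copied from the added (+) rows of the table, in column order.
MODELS = ["aquif-3-moe", "Phi-4", "Qwen3", "Gemma 3", "GPT-4.1 nano", "Mistral Small 3.2"]
SCORES = {
    "MMLU": ["83.2", "84.8", "82.0", "78.6", "80.1", "80.5"],
    "LiveCodeBench": ["28.6", "25.2", "29.0", "26.9", "32.6", "27.5"],
    "MATH-500": ["91.4", "80.8", "89.8", "88.3", "84.8", "88.3"],
    "GPQA Diamond": ["56.7", "56.1", "54.8", "42.8", "50.3", "50.5"],
}
# Values from the Average row, used only for comparison.
REPORTED = ["65.0", "61.7", "63.9", "59.2", "62.0", "61.7"]


def column_averages(scores):
    """Return the per-model mean across benchmarks, rounded half-up to 0.1.

    Assumption: an unweighted mean. Decimal is used so that values such as
    59.15 round to 59.2 rather than being affected by binary float error.
    """
    n = len(scores)
    columns = zip(*scores.values())  # one tuple of scores per model
    return [
        (sum(Decimal(v) for v in col) / n).quantize(
            Decimal("0.1"), rounding=ROUND_HALF_UP
        )
        for col in columns
    ]


averages = column_averages(SCORES)
for model, avg, reported in zip(MODELS, averages, REPORTED):
    status = "ok" if avg == Decimal(reported) else "mismatch"
    print(f"{model}: computed {avg}, reported {reported} ({status})")
```

Under these assumptions every computed value agrees with the reported row, which suggests the four benchmarks are weighted equally.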