Update README.md
README.md
CHANGED
@@ -56,7 +56,7 @@ A high-performance 3.2B parameter language model based on Meta's Llama 3.2 archi
 | Metric | aquif-3-mini (3.2B) | Llama 3.2 (3.2B) | Qwen3 (4B) | Gemma 3n E4B (8.4B) | SmolLM3 (3.1B) | Phi-4 mini (3.8B) | Granite 3.3 (2.5B) |
 |--------|---------------------|-------------------|------------|---------------------|-----------------|-------------------|-------------------|
 | MMLU (General Knowledge) | **67.5** | 63.4 | 67.0 | 64.9 | 59.5 | *67.3* | 55.9 |
-| GPQA Diamond (Science) | **
+| GPQA Diamond (Science) | **36.1** | 29.4 | *40.7* | 29.6 | 35.7 | 36.9 | 25.3 |
 | AIME 2025 (Competition Math) | 9.6 | 0.3 | **17.1** | *11.6* | 9.3 | 10.0 | 2.5 |
 | LiveCodeBench (Coding) | *15.4* | 8.3 | **23.3** | 14.6 | 15.2 | 12.6 | 9.4 |
 | Global MMLU (Multilingual) | *58.0* | 46.8 | **65.1** | 53.1 | 53.5 | 49.3 | 49.7 |