aquiffoo commited on
Commit
740ed78
·
verified ·
1 Parent(s): b6a16c0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -56,7 +56,7 @@ A high-performance 3.2B parameter language model based on Meta's Llama 3.2 archi
56
  | Metric | aquif-3-mini (3.2B) | Llama 3.2 (3.2B) | Qwen3 (4B) | Gemma 3n E4B (8.4B) | SmolLM3 (3.1B) | Phi-4 mini (3.8B) | Granite 3.3 (2.5B) |
57
  |--------|---------------------|-------------------|------------|---------------------|-----------------|-------------------|-------------------|
58
  | MMLU (General Knowledge) | **67.5** | 63.4 | 67.0 | 64.9 | 59.5 | *67.3* | 55.9 |
59
- | GPQA Diamond (Science) | **42.8** | 29.4 | *40.7* | 29.6 | 35.7 | 36.9 | 25.3 |
60
  | AIME 2025 (Competition Math) | 9.6 | 0.3 | **17.1** | *11.6* | 9.3 | 10.0 | 2.5 |
61
  | LiveCodeBench (Coding) | *15.4* | 8.3 | **23.3** | 14.6 | 15.2 | 12.6 | 9.4 |
62
  | Global MMLU (Multilingual) | *58.0* | 46.8 | **65.1** | 53.1 | 53.5 | 49.3 | 49.7 |
 
56
  | Metric | aquif-3-mini (3.2B) | Llama 3.2 (3.2B) | Qwen3 (4B) | Gemma 3n E4B (8.4B) | SmolLM3 (3.1B) | Phi-4 mini (3.8B) | Granite 3.3 (2.5B) |
57
  |--------|---------------------|-------------------|------------|---------------------|-----------------|-------------------|-------------------|
58
  | MMLU (General Knowledge) | **67.5** | 63.4 | 67.0 | 64.9 | 59.5 | *67.3* | 55.9 |
59
+ | GPQA Diamond (Science) | **36.1** | 29.4 | *40.7* | 29.6 | 35.7 | 36.9 | 25.3 |
60
  | AIME 2025 (Competition Math) | 9.6 | 0.3 | **17.1** | *11.6* | 9.3 | 10.0 | 2.5 |
61
  | LiveCodeBench (Coding) | *15.4* | 8.3 | **23.3** | 14.6 | 15.2 | 12.6 | 9.4 |
62
  | Global MMLU (Multilingual) | *58.0* | 46.8 | **65.1** | 53.1 | 53.5 | 49.3 | 49.7 |