Update README.md
Browse files
README.md
CHANGED
@@ -36,10 +36,10 @@ This model was trained with SFT.
|
|
36 |
|
37 |
## Evaluation Results
|
38 |
|
39 |
-
| Model | MATH-500 | AIME24 | GPQA-diamond |
|
40 |
| --- | --- | --- |--- |
|
41 |
-
| Qwen2.5-Math-7B-instruct| 82.6 | 3.3| 34.8 |
|
42 |
-
| Flyingbugs-OpenR1-Qwen-7B | 88.8 | 43.3 | 37.8 |
|
43 |
|
44 |
|
45 |
### Framework versions
|
|
|
36 |
|
37 |
## Evaluation Results
|
38 |
|
39 |
+
| Model | MATH-500 | AIME24 | GPQA-diamond | MMLU (Generation) |BBH (Generation) | IFEVAL (Generation) |MMLU (Logits classification) |
|
40 |
| --- | --- | --- |--- |
|
41 |
+
| Qwen2.5-Math-7B-instruct| 82.6 | 3.3| 34.8 | 23.2 | 18.9 | | 0.4173|
|
42 |
+
| Flyingbugs-OpenR1-Qwen-7B | 88.8 | 43.3 | 37.8 | 23.1 | 18.9 | |0.3091|
|
43 |
|
44 |
|
45 |
### Framework versions
|