Update README.md
Browse files
README.md
CHANGED
@@ -31,7 +31,7 @@ Revolutionary template-augmented reasoning paradigm enpowers a 32B model to outp
|
|
31 |
|
32 |
|
33 |
## Evaluation
|
34 |
-
We present the evaluation results of our ReasonFlux-F1-32B on challenging reasoning tasks including AIME2024,AIM2025,MATH500 and GPQA-Diamond. To make a fair comparison, we report the results of the LLMs on our evaluation scripts in [ReasonFlux-F1](https://github.com/Gen-Verse/ReasonFlux).
|
35 |
|
36 |
| Model | AIME2024@pass1 | AIME2025@pass1 | MATH500@pass1 | GPQA@pass1 |
|
37 |
| --------------------------------------- | :--------------: | :--------------: | :-------------: | :----------: |
|
|
|
31 |
|
32 |
|
33 |
## Evaluation
|
34 |
+
We present the evaluation results of our ReasonFlux-F1-32B on challenging reasoning tasks including AIME2024,AIM2025,MATH500 and GPQA-Diamond. To make a fair comparison, we report the results of the LLMs on our evaluation scripts in [ReasonFlux-F1](https://github.com/Gen-Verse/ReasonFlux/tree/main/reasonflux-f1).
|
35 |
|
36 |
| Model | AIME2024@pass1 | AIME2025@pass1 | MATH500@pass1 | GPQA@pass1 |
|
37 |
| --------------------------------------- | :--------------: | :--------------: | :-------------: | :----------: |
|