Update README.md
Browse files
README.md
CHANGED
@@ -56,7 +56,7 @@ print(answer)
|
|
56 |
# #### 294
|
57 |
```
|
58 |
|
59 |
-
##
|
60 |
|
61 |
### Training Data
|
62 |
The model was trained on the **GSM8K dataset**. The original answers were replaced with detailed, step-by-step solutions generated by the `Gemini-2.5-Flash` teacher model. This process of knowledge transfer is known as **Reasoning Distillation**.
|
@@ -82,7 +82,7 @@ The model was trained using a standard **Supervised Fine-Tuning (SFT)** process
|
|
82 |
|
83 |
---
|
84 |
|
85 |
-
##
|
86 |
The model was evaluated on the **GSM8K benchmark (5-shot)** and achieved the highest performance among all tested training pipelines.
|
87 |
|
88 |
| Model | Flexible-extract Exact Match (±stderr) | Strict-match Exact Match (±stderr) |
|
|
|
56 |
# #### 294
|
57 |
```
|
58 |
|
59 |
+
## Training Details
|
60 |
|
61 |
### Training Data
|
62 |
The model was trained on the **GSM8K dataset**. The original answers were replaced with detailed, step-by-step solutions generated by the `Gemini-2.5-Flash` teacher model. This process of knowledge transfer is known as **Reasoning Distillation**.
|
|
|
82 |
|
83 |
---
|
84 |
|
85 |
+
## Evaluation
|
86 |
The model was evaluated on the **GSM8K benchmark (5-shot)** and achieved the highest performance among all tested training pipelines.
|
87 |
|
88 |
| Model | Flexible-extract Exact Match (±stderr) | Strict-match Exact Match (±stderr) |
|