kkh27 commited on
Commit
2f01f3c
·
verified ·
1 Parent(s): 71546a3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -56,7 +56,7 @@ print(answer)
56
  # #### 294
57
  ```
58
 
59
- ## 📚 Training Details
60
 
61
  ### Training Data
62
  The model was trained on the **GSM8K dataset**. The original answers were replaced with detailed, step-by-step solutions generated by the `Gemini-2.5-Flash` teacher model. This process of knowledge transfer is known as **Reasoning Distillation**.
@@ -82,7 +82,7 @@ The model was trained using a standard **Supervised Fine-Tuning (SFT)** process
82
 
83
  ---
84
 
85
- ## 📊 Evaluation
86
  The model was evaluated on the **GSM8K benchmark (5-shot)** and achieved the highest performance among all tested training pipelines.
87
 
88
  | Model | Flexible-extract Exact Match (±stderr) | Strict-match Exact Match (±stderr) |
 
56
  # #### 294
57
  ```
58
 
59
+ ## Training Details
60
 
61
  ### Training Data
62
  The model was trained on the **GSM8K dataset**. The original answers were replaced with detailed, step-by-step solutions generated by the `Gemini-2.5-Flash` teacher model. This process of knowledge transfer is known as **Reasoning Distillation**.
 
82
 
83
  ---
84
 
85
+ ## Evaluation
86
  The model was evaluated on the **GSM8K benchmark (5-shot)** and achieved the highest performance among all tested training pipelines.
87
 
88
  | Model | Flexible-extract Exact Match (±stderr) | Strict-match Exact Match (±stderr) |