kkh27
/

Qwen2.5-0.5B-Instruct-GSM8K-SFT-full

Text Generation

Model card Files Files and versions

kkh27 commited on 22 days ago

Commit

2f01f3c

·

verified ·

1 Parent(s): 71546a3

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -56,7 +56,7 @@ print(answer)
 # #### 294
 ```
-## 📚 Training Details
 ### Training Data
 The model was trained on the **GSM8K dataset**. The original answers were replaced with detailed, step-by-step solutions generated by the `Gemini-2.5-Flash` teacher model. This process of knowledge transfer is known as **Reasoning Distillation**.
@@ -82,7 +82,7 @@ The model was trained using a standard **Supervised Fine-Tuning (SFT)** process
 ---
-## 📊 Evaluation
 The model was evaluated on the **GSM8K benchmark (5-shot)** and achieved the highest performance among all tested training pipelines.
 | Model | Flexible-extract Exact Match (±stderr) | Strict-match Exact Match (±stderr) |

 # #### 294
 ```
+## Training Details
 ### Training Data
 The model was trained on the **GSM8K dataset**. The original answers were replaced with detailed, step-by-step solutions generated by the `Gemini-2.5-Flash` teacher model. This process of knowledge transfer is known as **Reasoning Distillation**.
 ---
+## Evaluation
 The model was evaluated on the **GSM8K benchmark (5-shot)** and achieved the highest performance among all tested training pipelines.
 | Model | Flexible-extract Exact Match (±stderr) | Strict-match Exact Match (±stderr) |