erayalp
/

qwen2.5-0.5b-instruct-SFT-v1-tr-math-easy

Text Generation

curriculum-learning

supervised-fine-tuning

text-generation-inference

Model card Files Files and versions Community

erayalp commited on Apr 21

Commit

c6d795e

·

verified ·

1 Parent(s): c1362ab

docs: add tags

Files changed (1) hide show

README.md +9 -3

README.md CHANGED Viewed

@@ -10,6 +10,11 @@ base_model:
 - Qwen/Qwen2.5-0.5B-Instruct
 pipeline_tag: text-generation
 library_name: transformers
 ---
 ## Objective
@@ -25,9 +30,10 @@ The goal of this project is to enhance the reasoning ability of the compact Qwen
 - Performance may be sensitive to prompt phrasing.
 ### Roadmap
-1. Phase 2: SFT with moderately difficult math problems
-2. Phase 3: SFT with full-scale GSM8K-TR complexity
-3. Phase 4: GRPO-based training to optimize multi-step reasoning and reduce hallucinations
 ## How to Use

 - Qwen/Qwen2.5-0.5B-Instruct
 pipeline_tag: text-generation
 library_name: transformers
+tags:
+- curriculum-learning
+- math
+- supervised-fine-tuning
+- turkish
 ---
 ## Objective
 - Performance may be sensitive to prompt phrasing.
 ### Roadmap
+1. **Phase 1: SFT with basic arithmatic and math problems**
+2. Phase 2: SFT with moderately difficult math problems
+3. Phase 3: SFT with full-scale GSM8K-TR complexity
+4. Phase 4: GRPO-based training to optimize multi-step reasoning and reduce hallucinations
 ## How to Use