erayalp commited on
Commit
c6d795e
·
verified ·
1 Parent(s): c1362ab

docs: add tags

Browse files
Files changed (1) hide show
  1. README.md +9 -3
README.md CHANGED
@@ -10,6 +10,11 @@ base_model:
10
  - Qwen/Qwen2.5-0.5B-Instruct
11
  pipeline_tag: text-generation
12
  library_name: transformers
 
 
 
 
 
13
  ---
14
 
15
  ## Objective
@@ -25,9 +30,10 @@ The goal of this project is to enhance the reasoning ability of the compact Qwen
25
  - Performance may be sensitive to prompt phrasing.
26
 
27
  ### Roadmap
28
- 1. Phase 2: SFT with moderately difficult math problems
29
- 2. Phase 3: SFT with full-scale GSM8K-TR complexity
30
- 3. Phase 4: GRPO-based training to optimize multi-step reasoning and reduce hallucinations
 
31
 
32
  ## How to Use
33
 
 
10
  - Qwen/Qwen2.5-0.5B-Instruct
11
  pipeline_tag: text-generation
12
  library_name: transformers
13
+ tags:
14
+ - curriculum-learning
15
+ - math
16
+ - supervised-fine-tuning
17
+ - turkish
18
  ---
19
 
20
  ## Objective
 
30
  - Performance may be sensitive to prompt phrasing.
31
 
32
  ### Roadmap
33
+ 1. **Phase 1: SFT with basic arithmatic and math problems**
34
+ 2. Phase 2: SFT with moderately difficult math problems
35
+ 3. Phase 3: SFT with full-scale GSM8K-TR complexity
36
+ 4. Phase 4: GRPO-based training to optimize multi-step reasoning and reduce hallucinations
37
 
38
  ## How to Use
39