docs: add tags
README.md
CHANGED
@@ -10,6 +10,11 @@ base_model:
 - Qwen/Qwen2.5-0.5B-Instruct
 pipeline_tag: text-generation
 library_name: transformers
+tags:
+- curriculum-learning
+- math
+- supervised-fine-tuning
+- turkish
 ---
 
 ## Objective
@@ -25,9 +30,10 @@ The goal of this project is to enhance the reasoning ability of the compact Qwen
 - Performance may be sensitive to prompt phrasing.
 
 ### Roadmap
-1. Phase
-2. Phase
-3. Phase
+1. **Phase 1: SFT with basic arithmetic and math problems**
+2. Phase 2: SFT with moderately difficult math problems
+3. Phase 3: SFT with full-scale GSM8K-TR complexity
+4. Phase 4: GRPO-based training to optimize multi-step reasoning and reduce hallucinations
 
 ## How to Use
 