Abaryan
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -17,13 +17,13 @@ tags: []
|
|
17 |
|
18 |
This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
|
19 |
|
20 |
-
- **Developed by:** [
|
21 |
- **Funded by [optional]:** [More Information Needed]
|
22 |
- **Shared by [optional]:** [More Information Needed]
|
23 |
-
- **Model type:** [
|
24 |
- **Language(s) (NLP):** [More Information Needed]
|
25 |
- **License:** [More Information Needed]
|
26 |
-
- **Finetuned from model [
|
27 |
|
28 |
### Model Sources [optional]
|
29 |
|
@@ -79,7 +79,7 @@ Use the code below to get started with the model.
|
|
79 |
|
80 |
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
81 |
|
82 |
-
[
|
83 |
|
84 |
### Training Procedure
|
85 |
|
@@ -98,7 +98,7 @@ Use the code below to get started with the model.
|
|
98 |
|
99 |
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
100 |
|
101 |
-
[
|
102 |
|
103 |
## Evaluation
|
104 |
|
|
|
17 |
|
18 |
This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
|
19 |
|
20 |
+
- **Developed by:** [Abaryan]
|
21 |
- **Funded by [optional]:** [More Information Needed]
|
22 |
- **Shared by [optional]:** [More Information Needed]
|
23 |
+
- **Model type:** [GRPO + CoT]
|
24 |
- **Language(s) (NLP):** [More Information Needed]
|
25 |
- **License:** [More Information Needed]
|
26 |
+
- **Finetuned from model [Qwen_2.5_1.5b]:** [More Information Needed]
|
27 |
|
28 |
### Model Sources [optional]
|
29 |
|
|
|
79 |
|
80 |
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
81 |
|
82 |
+
[GSM8K]
|
83 |
|
84 |
### Training Procedure
|
85 |
|
|
|
98 |
|
99 |
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
100 |
|
101 |
+
[bf16, no quantisation, no LoRA,Batch_size=5, num of generation = 5, 3000_steps]
|
102 |
|
103 |
## Evaluation
|
104 |
|