Improve language tag
Hi! As the model is multilingual, this PR adds languages other than English to the language tag so the model is easier to find. Note that 29 languages are announced in the README, but only 13 are explicitly listed, so I was only able to add these 13.
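For reference, the change amounts to appending a `language:` list to the YAML front matter of `README.md`. Below is a minimal sketch of that edit using only the Python standard library. The helper name `add_language_tags` and the sample README text are hypothetical and only for illustration; this is not how the PR itself was produced.

```python
def add_language_tags(readme: str, codes: list[str]) -> str:
    """Return README text with a `language:` list appended to its YAML front matter.

    Hypothetical helper: assumes the front matter is delimited by `---` lines
    and does not already contain a `language:` key.
    """
    lines = readme.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("README has no YAML front matter")
    # Locate the closing `---` of the front matter block.
    try:
        end = next(i for i in range(1, len(lines)) if lines[i].strip() == "---")
    except StopIteration:
        raise ValueError("front matter is not closed") from None
    if any(line.startswith("language:") for line in lines[1:end]):
        raise ValueError("front matter already has a language key")
    # Insert the new key and its list items just before the closing delimiter.
    block = ["language:"] + [f"- {code}" for code in codes]
    return "\n".join(lines[:end] + block + lines[end:]) + "\n"


# Illustrative input, not the full README from this PR.
readme = """---
license: apache-2.0
library_name: transformers
---

# Model Summary
"""
updated = add_language_tags(readme, ["zho", "eng", "fra"])
print(updated)
```

The sketch deliberately treats the front matter as plain lines rather than parsing YAML, which keeps the existing keys and their order untouched.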
README.md
CHANGED
@@ -1,24 +1,38 @@
----
-pipeline_tag: text-generation
-inference: true
-license: apache-2.0
-datasets:
-- simplescaling/s1K-1.1
-base_model:
-- Qwen/Qwen2.5-0.5B-Instruct
-library_name: transformers
----
-
-# Model Summary
-
-> s1.1-0.5B is a sucessor of [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.
-
-- **Logs:** https://wandb.ai/2stacks-sms/s1/runs/ishervdt?nw=nwuser2stacks
-- **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
-- **Paper:** https://arxiv.org/abs/2501.19393
-
-Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
-
-# Use
-
-The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).
+---
+pipeline_tag: text-generation
+inference: true
+license: apache-2.0
+datasets:
+- simplescaling/s1K-1.1
+base_model:
+- Qwen/Qwen2.5-0.5B-Instruct
+library_name: transformers
+language:
+- zho
+- eng
+- fra
+- spa
+- por
+- deu
+- ita
+- rus
+- jpn
+- kor
+- vie
+- tha
+- ara
+---
+
+# Model Summary
+
+> s1.1-0.5B is a sucessor of [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.
+
+- **Logs:** https://wandb.ai/2stacks-sms/s1/runs/ishervdt?nw=nwuser2stacks
+- **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
+- **Paper:** https://arxiv.org/abs/2501.19393
+
+Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
+
+# Use
+
+The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).
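To check the result, the 13 added codes can be read back from the front matter and matched against their language names. The sketch below is one possible way to do that with the standard library. The function `read_language_tags` and the `ISO_639_3_NAMES` table are hypothetical names introduced here, and the table covers only the three-letter codes that appear in this diff.

```python
# Names for the 13 three-letter codes added in this PR (illustrative lookup table).
ISO_639_3_NAMES = {
    "zho": "Chinese", "eng": "English", "fra": "French", "spa": "Spanish",
    "por": "Portuguese", "deu": "German", "ita": "Italian", "rus": "Russian",
    "jpn": "Japanese", "kor": "Korean", "vie": "Vietnamese", "tha": "Thai",
    "ara": "Arabic",
}


def read_language_tags(readme: str) -> list[str]:
    """Return the items listed under `language:` in the README front matter."""
    lines = readme.splitlines()
    if not lines or lines[0].strip() != "---":
        return []
    codes = []
    in_language = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # end of the front matter
            break
        if stripped == "language:":
            in_language = True
        elif in_language and stripped.startswith("- "):
            codes.append(stripped[2:].strip())
        else:
            in_language = False  # any other key ends the language list
    return codes


# Front matter as it appears on the added side of the diff above.
readme = """---
pipeline_tag: text-generation
inference: true
license: apache-2.0
datasets:
- simplescaling/s1K-1.1
base_model:
- Qwen/Qwen2.5-0.5B-Instruct
library_name: transformers
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
---
"""
codes = read_language_tags(readme)
unknown = [code for code in codes if code not in ISO_639_3_NAMES]
print(len(codes), "language tags;", "unknown:", unknown)
```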