2stacks commited on
Commit
e19143a
·
verified ·
1 Parent(s): d0f34a3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -32
README.md CHANGED
@@ -1,32 +1,35 @@
1
- ---
2
- pipeline_tag: text-generation
3
- inference: true
4
- license: apache-2.0
5
- datasets:
6
- - simplescaling/s1K
7
- ---
8
-
9
- # Model Summary
10
-
11
- > s1-0.5B is a reasoning model finetuned from Qwen2.5-0.5B-Instruct on just 1,000 examples. This model was created simply to test the process used to train the original S1 cited below using consumer grade GPUs.
12
-
13
- - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
14
- - **Paper:** https://arxiv.org/abs/2501.19393
15
-
16
- # Use
17
-
18
- The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).
19
-
20
- # Citation
21
-
22
- ```bibtex
23
- @misc{muennighoff2025s1simpletesttimescaling,
24
- title={s1: Simple test-time scaling},
25
- author={Niklas Muennighoff and Zitong Yang and Weijia Shi and Xiang Lisa Li and Li Fei-Fei and Hannaneh Hajishirzi and Luke Zettlemoyer and Percy Liang and Emmanuel Candès and Tatsunori Hashimoto},
26
- year={2025},
27
- eprint={2501.19393},
28
- archivePrefix={arXiv},
29
- primaryClass={cs.CL},
30
- url={https://arxiv.org/abs/2501.19393},
31
- }
32
- ```
 
 
 
 
1
+ ---
2
+ pipeline_tag: text-generation
3
+ inference: true
4
+ license: apache-2.0
5
+ datasets:
6
+ - simplescaling/s1K
7
+ base_model:
8
+ - Qwen/Qwen2.5-0.5B-Instruct
9
+ library_name: transformers
10
+ ---
11
+
12
+ # Model Summary
13
+
14
+ > s1-0.5B is a reasoning model finetuned from Qwen2.5-0.5B-Instruct on just 1,000 examples. This model was created simply to test the process used to train the original S1 cited below using consumer grade GPUs.
15
+
16
+ - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
17
+ - **Paper:** https://arxiv.org/abs/2501.19393
18
+
19
+ # Use
20
+
21
+ The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).
22
+
23
+ # Citation
24
+
25
+ ```bibtex
26
+ @misc{muennighoff2025s1simpletesttimescaling,
27
+ title={s1: Simple test-time scaling},
28
+ author={Niklas Muennighoff and Zitong Yang and Weijia Shi and Xiang Lisa Li and Li Fei-Fei and Hannaneh Hajishirzi and Luke Zettlemoyer and Percy Liang and Emmanuel Candès and Tatsunori Hashimoto},
29
+ year={2025},
30
+ eprint={2501.19393},
31
+ archivePrefix={arXiv},
32
+ primaryClass={cs.CL},
33
+ url={https://arxiv.org/abs/2501.19393},
34
+ }
35
+ ```