quyanh and nielsr (HF staff) committed
Commit 2623ffc · verified · 1 Parent(s): ccf2dae

Add library_name to metadata (#2)

- Add library_name to metadata (3bfe7ad46a01451e48d7631448e9930e8dfeec6a)


Co-authored-by: Niels Rogge <[email protected]>

Files changed (1)
  1. README.md +10 -8
README.md CHANGED
@@ -1,15 +1,17 @@
 ---
+base_model:
+- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
+datasets:
+- knoveleng/open-rs
+- knoveleng/open-s1
+- knoveleng/open-deepscaler
+license: mit
 pipeline_tag: text-generation
 inference: true
-license: mit
-datasets:
-- knoveleng/open-rs
-- knoveleng/open-s1
-- knoveleng/open-deepscaler
-base_model:
-- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
+library_name: transformers
 ---
 
+```markdown
 # Model Summary
 
 This repository hosts model for the **Open RS** project, accompanying the paper *Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t*. The project explores enhancing reasoning capabilities in small large language models (LLMs) using reinforcement learning (RL) under resource-constrained conditions.
@@ -54,4 +56,4 @@ If this project aids your work, please cite it as:
 primaryClass={cs.LG},
 url={https://arxiv.org/abs/2503.16219},
 }
-```
+```
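
Most of the metadata change above is a reordering of existing keys, so the one functional addition is easy to miss. The following is a minimal sketch for checking that, assuming the front matter uses only scalar values and flat `- item` lists as it does in this diff. `parse_front_matter` is a hypothetical helper written for this illustration, not part of any Hugging Face library, and it is not a general YAML parser.

```python
# Old and new README front matter, copied from the diff above.
OLD_FRONT_MATTER = """---
pipeline_tag: text-generation
inference: true
license: mit
datasets:
- knoveleng/open-rs
- knoveleng/open-s1
- knoveleng/open-deepscaler
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
---
"""

NEW_FRONT_MATTER = """---
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
datasets:
- knoveleng/open-rs
- knoveleng/open-s1
- knoveleng/open-deepscaler
license: mit
pipeline_tag: text-generation
inference: true
library_name: transformers
---
"""


def parse_front_matter(text):
    """Hypothetical helper: parse a simple front matter block into a dict.

    Handles only `key: value` scalars (kept as strings) and `key:` followed
    by flat `- item` lists. This is an assumption that fits the README in
    this commit; it does not cover general YAML.
    """
    lines = text.strip().splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("no front matter found")
    meta, current_key = {}, None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter
            return meta
        if not stripped:
            continue
        if stripped.startswith("- "):
            if current_key is None:
                raise ValueError("list item without a list key")
            meta[current_key].append(stripped[2:].strip())
            continue
        key, _, value = stripped.partition(":")
        key, value = key.strip(), value.strip()
        if value:
            meta[key] = value
            current_key = None
        else:
            meta[key] = []
            current_key = key
    raise ValueError("front matter is not closed")


old_meta = parse_front_matter(OLD_FRONT_MATTER)
new_meta = parse_front_matter(NEW_FRONT_MATTER)

# Keys present only in the new metadata, and whether old values survived.
added = sorted(set(new_meta) - set(old_meta))
unchanged = all(old_meta[k] == new_meta[k] for k in old_meta)
print(added, unchanged)  # ['library_name'] True
```

Under these assumptions, the comparison shows that every previously existing key keeps its value and `library_name` is the only new key.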