benfielding committed on
Commit
317b7eb
·
verified ·
1 Parent(s): e40f103

Re-order text

Files changed (1)
  1. README.md +5 -6
README.md CHANGED
@@ -15,6 +15,11 @@ library_name: transformers
  # Qwen2.5-0.5B-Instruct

  ## Introduction
+ This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm), to finetune locally using peer-to-peer reinforcement learning post-training.
+
+ Once finetuned, the model can be used as normal in any workflow, for details on how to do this please refer to the [original model documentation](https://qwen.readthedocs.io/en/latest/).
+
+ For more details on the original model, please refer to the original repository [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).

  This repo contains an **unmodified version** of the instruction-tuned 0.5B Qwen2.5 model, which has the following features:
  - Type: Causal Language Models
@@ -26,12 +31,6 @@ This repo contains an **unmodified version** of the instruction-tuned 0.5B Qwen2
  - Number of Attention Heads (GQA): 14 for Q and 2 for KV
  - Context Length: Full 32,768 tokens and generation 8192 tokens

- This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm), to finetune locally using peer-to-peer reinforcement learning post-training.
-
- Once finetuned, the model can be used as normal in any workflow, for details on how to do this please refer to the [original model documentation](https://qwen.readthedocs.io/en/latest/).
-
- For more details on the original model, please refer to the original repository [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
-
  ## Requirements

  This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm) system, for details on model requirements when using outside of a swarm, refer to the original Qwen repo [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
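
The README's feature list gives the head counts as "14 for Q and 2 for KV". As a rough illustration of what that ratio means under grouped-query attention, the sketch below maps each query head to the KV head it would share. The head counts come from the README; the function name `kv_head_for` and the contiguous grouping (heads 0-6 share KV head 0, heads 7-13 share KV head 1) are assumptions made for illustration, not something this commit or the README confirms.

```python
# Illustrative sketch only: head counts are taken from the README's feature list.
NUM_Q_HEADS = 14
NUM_KV_HEADS = 2


def kv_head_for(q_head: int, num_q: int = NUM_Q_HEADS, num_kv: int = NUM_KV_HEADS) -> int:
    """Return the KV head index that query head `q_head` would share.

    Assumes contiguous grouping of query heads (a hypothetical but common
    convention); the real model's layout is not stated in the README.
    """
    if num_q % num_kv != 0:
        raise ValueError("number of query heads must be a multiple of KV heads")
    if not 0 <= q_head < num_q:
        raise IndexError(f"query head {q_head} is out of range")
    group_size = num_q // num_kv  # 14 // 2 = 7 query heads per KV head
    return q_head // group_size


# Mapping for all 14 query heads.
mapping = [kv_head_for(q) for q in range(NUM_Q_HEADS)]
print(mapping)  # [0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1]
```

With these counts, each KV head serves seven query heads, so the key/value cache holds 2 heads per layer instead of 14.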