Update README.md
Browse files
README.md
CHANGED
@@ -11,12 +11,26 @@ language:
|
|
11 |
- en
|
12 |
---
|
13 |
|
14 |
-
#
|
15 |
|
16 |
-
-
|
17 |
-
- **License:** apache-2.0
|
18 |
-
- **Finetuned from model :** unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit
|
19 |
|
20 |
-
This qwen2
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
21 |
|
22 |
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
|
11 |
- en
|
12 |
---
|
13 |
|
14 |
+
# AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K-GGUF
|
15 |
|
16 |
+
Conversions of [AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K](https://huggingface.co/AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K) into `q8_0`, `q4_k_m` and `q5_k_m` GGUF formats. See original model card for additional details.
|
|
|
|
|
17 |
|
18 |
+
This model is a GRPO fine-tuned version of [unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit](https://huggingface.co/unsloth/Qwen2.5-3B-Instruct-bnb-4bit) on a subset of 2,000 examples from [openai/gsm8k](https://huggingface.co/datasets/openai/gsm8k) using [Unsloth](https://github.com/unslothai/unsloth).
|
19 |
+
|
20 |
+
# Usage
|
21 |
+
|
22 |
+
For best performance, use the below system prompt:
|
23 |
+
|
24 |
+
```python
|
25 |
+
SYSTEM_PROMPT = """
|
26 |
+
Respond in the following format:
|
27 |
+
<reasoning>
|
28 |
+
...
|
29 |
+
</reasoning>
|
30 |
+
<answer>
|
31 |
+
...
|
32 |
+
</answer>
|
33 |
+
"""
|
34 |
+
```
|
35 |
|
36 |
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|