---
base_model: unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit
tags:
- text-generation-inference
- transformers
- unsloth
- qwen2
- gguf
license: apache-2.0
language:
- en
---
# AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K-GGUF
Conversions of [AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K](https://huggingface.co/AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K) into `q8_0`, `q4_k_m` and `q5_k_m` GGUF formats. See original model card for additional details.
This model is a GRPO fine-tuned version of [unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit](https://huggingface.co/unsloth/Qwen2.5-3B-Instruct-bnb-4bit) on a subset of 2,000 examples from [openai/gsm8k](https://huggingface.co/datasets/openai/gsm8k) using [Unsloth](https://github.com/unslothai/unsloth).
# Usage
For best performance, use the below system prompt:
```python
SYSTEM_PROMPT = """
Respond in the following format:
...
...
"""
```
[
](https://github.com/unslothai/unsloth)