--- base_model: unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit tags: - text-generation-inference - transformers - unsloth - qwen2 - gguf license: apache-2.0 language: - en --- # AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K-GGUF Conversions of [AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K](https://huggingface.co/AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K) into `q8_0`, `q4_k_m` and `q5_k_m` GGUF formats. See original model card for additional details. This model is a GRPO fine-tuned version of [unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit](https://huggingface.co/unsloth/Qwen2.5-3B-Instruct-bnb-4bit) on a subset of 2,000 examples from [openai/gsm8k](https://huggingface.co/datasets/openai/gsm8k) using [Unsloth](https://github.com/unslothai/unsloth). # Usage For best performance, use the below system prompt: ```python SYSTEM_PROMPT = """ Respond in the following format: ... ... """ ``` [](https://github.com/unslothai/unsloth)