unsloth
/

Qwen3-4B-GGUF

Text Generation

Model card Files Files and versions

shimmyshimmer commited on May 2

Commit

61dcb14

·

verified ·

1 Parent(s): 7dced46

Update README.md

Files changed (1) hide show

README.md +23 -0

README.md CHANGED Viewed

@@ -49,6 +49,29 @@ tags:
 | **Qwen2.5 (7B)**      | [▶️ Start on Colab](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen2.5_(7B)-Alpaca.ipynb)               | 2x faster | 60% less |
 | **Phi-4 (14B)** | [▶️ Start on Colab](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4-Conversational.ipynb)               | 2x faster | 50% less |
 # Qwen3-4B
 ## Qwen3 Highlights

 | **Qwen2.5 (7B)**      | [▶️ Start on Colab](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen2.5_(7B)-Alpaca.ipynb)               | 2x faster | 60% less |
 | **Phi-4 (14B)** | [▶️ Start on Colab](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4-Conversational.ipynb)               | 2x faster | 50% less |
+# To Switch Between Thinking and Non-Thinking
+If you are using llama.cpp, Ollama, Open WebUI etc., you can add `/think` and `/no_think` to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations.
+Here is an example of multi-turn conversation:
+```
+> Who are you /no_think
+<think>
+</think>
+I am Qwen, a large-scale language model developed by Alibaba Cloud. [...]
+> How many 'r's are in 'strawberries'? /think
+<think>
+Okay, let's see. The user is asking how many times the letter 'r' appears in the word "strawberries". [...]
+</think>
+The word strawberries contains 3 instances of the letter r. [...]
+```
 # Qwen3-4B
 ## Qwen3 Highlights