unsloth
/

INTELLECT-2-GGUF

Model card Files Files and versions

danielhanchen commited on May 12

Commit

0d04eb0

·

verified ·

1 Parent(s): 3c69eb8

Update README.md

Files changed (1) hide show

README.md +15 -0

README.md CHANGED Viewed

@@ -8,6 +8,21 @@ datasets:
 - PrimeIntellect/Intellect-2-RL-Dataset
 ---
 # INTELLECT-2
 INTELLECT-2 is a 32 billion parameter language model trained through a reinforcement learning run leveraging globally distributed, permissionless GPU resources contributed by the community.

 - PrimeIntellect/Intellect-2-RL-Dataset
 ---
+**Please read [Running QwQ effectively](https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-effectively) on sampling issues for QwQ based models.**
+Or TLDR, use the below settings:
+```bash
+./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99 \
+    --temp 0.6 \
+    --repeat-penalty 1.1 \
+    --dry-multiplier 0.5 \
+    --min-p 0.00 \
+    --top-k 40 \
+    --top-p 0.95 \
+    --samplers "top_k;top_p;min_p;temperature;dry;typ_p;xtc"
+```
 # INTELLECT-2
 INTELLECT-2 is a 32 billion parameter language model trained through a reinforcement learning run leveraging globally distributed, permissionless GPU resources contributed by the community.