DavidAU
/

Qwen3-8B-Q8_0-64k-128k-256k-context-GGUF

Text Generation

Model card Files Files and versions Community

DavidAU commited on May 1

Commit

6a3e64b

·

verified ·

1 Parent(s): efb3b37

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -30,10 +30,14 @@ Note that 128k and 256k versions tends to elongate output too, and add in more d
 Longer, more detailed prompts may "contain" the model's output length somewhat.
-Also with the 128k/256k you may need to stop the model's generation.
 IE: You ask for a scene of 1000-2000 words, and it may produce multiple scenes (in sequence!) of 1000-2000 words EACH.
 For the 256k context version, keep prompts as clear as possible otherwise the model will have issues. Also increase rep pen to 1.1
 and run temps 1.1 to 2.2. I would suggest using this specific model for creative use only or limited general usage.

 Longer, more detailed prompts may "contain" the model's output length somewhat.
+Also with the 128k/256k you may need to stop the model's generation AND/OR For 128k/256k version I suggest you state clearly the "length of output" and/or set a hard length output limit.
 IE: You ask for a scene of 1000-2000 words, and it may produce multiple scenes (in sequence!) of 1000-2000 words EACH.
+OR
+You ask for 2000 words, and you get 3k (output) in 64K, 5K in 128k and 12k in 256K versions.
 For the 256k context version, keep prompts as clear as possible otherwise the model will have issues. Also increase rep pen to 1.1
 and run temps 1.1 to 2.2. I would suggest using this specific model for creative use only or limited general usage.