DavidAU
/

Reka-Flash-3-21B-Reasoning-Uncensored-MAX-NEO-Imatrix-GGUF

Text Generation

DeepSeek-R1-Distill

function calling

problem solving

fiction writing

plot generation

sub-plot generation

story generation

Model card Files Files and versions Community

DavidAU commited on 13 days ago

Commit

5b3eac4

·

verified ·

1 Parent(s): e6eec2c

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -45,6 +45,7 @@ Quick testing shows optimized quants can:
   - Come up with a better answer/stronger reasoning.
   - Use less tokens to "reason" ... up to 50% less.
   - Faster and smaller quant size (VS "MAX" with output tensor and embed at BF16).
 Cost of the Augment:
   - Quants are are slightly larger.

   - Come up with a better answer/stronger reasoning.
   - Use less tokens to "reason" ... up to 50% less.
   - Faster and smaller quant size (VS "MAX" with output tensor and embed at BF16).
+  - Solution quality is also higher.
 Cost of the Augment:
   - Quants are are slightly larger.