DavidAU
/

Reka-Flash-3-21B-Reasoning-Uncensored-MAX-NEO-Imatrix-GGUF

Text Generation

DeepSeek-R1-Distill

function calling

problem solving

fiction writing

plot generation

sub-plot generation

story generation

Model card Files Files and versions Community

DavidAU commited on Mar 20

Commit

bbba23a

·

verified ·

1 Parent(s): 3b94a48

Update README.md

Files changed (1) hide show

README.md +8 -4

README.md CHANGED Viewed

@@ -24,9 +24,7 @@ pipeline_tag: text-generation
 ---
 <h2>Reka-Flash-3-21B-Reasoning-MAX-NEO-Imatrix-GGUF</h2>
-UPDATE:
-Re-optimizing quants, found a better mixture. Uploading NOW...
 Mixture is strong enough that lower quants can now solve/reasoning and come up with a solution whereas NON-optimized
 may not be able to solve or take a lot longer (a lot more tokens!).
@@ -39,11 +37,17 @@ Quick testing shows optimized quants can:
 <B>Quants - "EDGE of REASON":</B>
 IQ1_M - Works, but limited reasoning (reasoning operates, but has a tough time (if at all) coming up with the right answer for some problems).
 ...
-IQ2_S - Moderate reasoning ; impressive performance for both reasoning AND this quant.
 ...

 ---
 <h2>Reka-Flash-3-21B-Reasoning-MAX-NEO-Imatrix-GGUF</h2>
+UPDATE: Re-optimizing quants, found a better mixture. Uploading NOW...
 Mixture is strong enough that lower quants can now solve/reasoning and come up with a solution whereas NON-optimized
 may not be able to solve or take a lot longer (a lot more tokens!).
 <B>Quants - "EDGE of REASON":</B>
+Generally higher quants will solve problems faster with less tokens, and be able to solve tougher problems.
+Likewise "solutions" will be of higher detail too.
+...
 IQ1_M - Works, but limited reasoning (reasoning operates, but has a tough time (if at all) coming up with the right answer for some problems).
 ...
+IQ2_S - Moderate reasoning ; impressive performance for both reasoning AND this quant level.
 ...