DavidAU commited on
Commit
bbba23a
·
verified ·
1 Parent(s): 3b94a48

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -4
README.md CHANGED
@@ -24,9 +24,7 @@ pipeline_tag: text-generation
24
  ---
25
  <h2>Reka-Flash-3-21B-Reasoning-MAX-NEO-Imatrix-GGUF</h2>
26
 
27
- UPDATE:
28
-
29
- Re-optimizing quants, found a better mixture. Uploading NOW...
30
 
31
  Mixture is strong enough that lower quants can now solve/reasoning and come up with a solution whereas NON-optimized
32
  may not be able to solve or take a lot longer (a lot more tokens!).
@@ -39,11 +37,17 @@ Quick testing shows optimized quants can:
39
 
40
  <B>Quants - "EDGE of REASON":</B>
41
 
 
 
 
 
 
 
42
  IQ1_M - Works, but limited reasoning (reasoning operates, but has a tough time (if at all) coming up with the right answer for some problems).
43
 
44
  ...
45
 
46
- IQ2_S - Moderate reasoning ; impressive performance for both reasoning AND this quant.
47
 
48
  ...
49
 
 
24
  ---
25
  <h2>Reka-Flash-3-21B-Reasoning-MAX-NEO-Imatrix-GGUF</h2>
26
 
27
+ UPDATE: Re-optimizing quants, found a better mixture. Uploading NOW...
 
 
28
 
29
  Mixture is strong enough that lower quants can now solve/reasoning and come up with a solution whereas NON-optimized
30
  may not be able to solve or take a lot longer (a lot more tokens!).
 
37
 
38
  <B>Quants - "EDGE of REASON":</B>
39
 
40
+ Generally higher quants will solve problems faster with less tokens, and be able to solve tougher problems.
41
+
42
+ Likewise "solutions" will be of higher detail too.
43
+
44
+ ...
45
+
46
  IQ1_M - Works, but limited reasoning (reasoning operates, but has a tough time (if at all) coming up with the right answer for some problems).
47
 
48
  ...
49
 
50
+ IQ2_S - Moderate reasoning ; impressive performance for both reasoning AND this quant level.
51
 
52
  ...
53