Tags: Text Generation, GGUF, English, NEO Imatrix, MAX Quants, uncensored, reasoning, thinking, r1, cot, reka-flash, deepseek, Qwen2.5, Hermes, DeepHermes, DeepSeek-R1-Distill, 128k context, instruct, all use cases, finetune, chatml, gpt4, synthetic data, distillation, function calling, roleplaying, chat, creative, general usage, problem solving, brainstorming, solve riddles, fiction writing, plot generation, sub-plot generation, story generation, scene continue, storytelling, fiction story, story, writing, fiction, swearing, horror, imatrix, conversational
Update README.md
README.md CHANGED
@@ -24,9 +24,7 @@ pipeline_tag: text-generation
 ---
 <h2>Reka-Flash-3-21B-Reasoning-MAX-NEO-Imatrix-GGUF</h2>
 
-UPDATE:
-
-Re-optimizing quants, found a better mixture. Uploading NOW...
+UPDATE: Re-optimizing quants, found a better mixture. Uploading NOW...
 
 Mixture is strong enough that lower quants can now solve/reasoning and come up with a solution whereas NON-optimized
 may not be able to solve or take a lot longer (a lot more tokens!).
@@ -39,11 +37,17 @@ Quick testing shows optimized quants can:
 
 <B>Quants - "EDGE of REASON":</B>
 
+Generally higher quants will solve problems faster with less tokens, and be able to solve tougher problems.
+
+Likewise "solutions" will be of higher detail too.
+
+...
+
 IQ1_M - Works, but limited reasoning (reasoning operates, but has a tough time (if at all) coming up with the right answer for some problems).
 
 ...
 
-IQ2_S - Moderate reasoning ; impressive performance for both reasoning AND this quant.
+IQ2_S - Moderate reasoning ; impressive performance for both reasoning AND this quant level.
 
 ...
 