Tags: Text Generation, GGUF, English, mixture of experts, Mixture of Experts, 8x3B, Llama 3.2 MOE, 128k context, creative, creative writing, fiction writing, plot generation, sub-plot generation, story generation, scene continue, storytelling, fiction story, science fiction, romance, all genres, story, writing, vivid prosing, vivid writing, fiction, roleplaying, bfloat16, swearing, rp, horror, mergekit, conversational
Update README.md
README.md
CHANGED
@@ -100,10 +100,12 @@ When using "API", you set the "num_experts_used" in the JSON payload (this maybe
 This "team" has a Captain (first listed model), and then all the team members contribute to the to "token"
 choice billions of times per second. Note the Captain also contributes too.
 
-Think of 2, 3 or 4 master chefs in the kitchen all competing to make the best dish for you.
+Think of 2, 3 or 4 (or more) master chefs in the kitchen all competing to make the best dish for you.
 
 This results in higher quality generation.
 
+This also results in many cases in higher quality instruction following too.
+
 That means the power of every model is available during instruction and output generation.
 
 This brings unparalleled power to all forms of generation and all use cases.
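For readers who want a concrete picture of the "team contributes to the token choice" idea, here is a minimal sketch of top-k expert mixing, written in the style of a generic sparse mixture-of-experts layer. It is an illustration under assumptions, not this model's actual implementation: the function names `softmax` and `mix_experts`, the router scores, and the toy expert outputs are all hypothetical, and only the `num_experts_used` name comes from the README text.

```python
import math


def softmax(xs):
    """Turn raw scores into weights that sum to 1."""
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def mix_experts(router_scores, expert_outputs, num_experts_used):
    """Blend the outputs of the highest-scoring experts for one token.

    router_scores: one score per expert (hypothetical router output).
    expert_outputs: one output vector per expert, all the same length.
    num_experts_used: how many experts take part in the blend.
    """
    # Rank experts by router score, best first, and keep the top k.
    ranked = sorted(range(len(router_scores)),
                    key=lambda i: router_scores[i], reverse=True)
    chosen = ranked[:num_experts_used]

    # Renormalise the chosen experts' scores into mixing weights.
    weights = softmax([router_scores[i] for i in chosen])

    # Weighted sum of the chosen experts' output vectors.
    size = len(expert_outputs[0])
    mixed = [0.0] * size
    for w, i in zip(weights, chosen):
        for t in range(size):
            mixed[t] += w * expert_outputs[i][t]
    return chosen, mixed


# Toy example: four experts, two-dimensional outputs (made-up numbers).
scores = [2.0, 0.5, 1.0, -1.0]
outputs = [[1.0, 0.0], [9.0, 9.0], [0.0, 1.0], [9.0, 9.0]]
chosen, mixed = mix_experts(scores, outputs, num_experts_used=2)
```

In this toy run, experts 0 and 2 are selected and expert 0 gets the larger weight because its router score is higher. Raising `num_experts_used` simply lets more experts into the weighted sum, which is the "more chefs in the kitchen" effect the README describes.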