Removing royal we.
Browse filesI really need to stop using "We" in model cards and instead use I.
README.md
CHANGED
@@ -20,7 +20,7 @@ tags:
|
|
20 |
CavesOfQwen3 is a merge between the base model and the instruct model of Qwen3-30B-A3B (i.e. Qwen3-30B-A3B) and it's base model Qwen3-30B-A3B-base.
|
21 |
|
22 |
The idea for this merge is to remove the overbaked feeling that is in the instruct while retaining the instruct within the model.
|
23 |
-
|
24 |
|
25 |
\*(Ignoring the fact that it's spewing something random at the end. I suspect that's on my part in the configuration of the model on vllm or SillyTavern.)
|
26 |
|
@@ -34,7 +34,7 @@ This was me when I found out it didn't have `o_proj`.
|
|
34 |
|
35 |
Thankfully merging can be done on CPU. But not inference!
|
36 |
|
37 |
-
|
38 |
|
39 |
## Merge Details
|
40 |
### Merge Method
|
|
|
20 |
CavesOfQwen3 is a merge between the base model and the instruct model of Qwen3-30B-A3B (i.e. Qwen3-30B-A3B) and it's base model Qwen3-30B-A3B-base.
|
21 |
|
22 |
The idea for this merge is to remove the overbaked feeling that is in the instruct while retaining the instruct within the model.
|
23 |
+
I've tested the model and it seems to performs reasonably well\*
|
24 |
|
25 |
\*(Ignoring the fact that it's spewing something random at the end. I suspect that's on my part in the configuration of the model on vllm or SillyTavern.)
|
26 |
|
|
|
34 |
|
35 |
Thankfully merging can be done on CPU. But not inference!
|
36 |
|
37 |
+
I used `TIES`. Not because I'm lazy but because it's what I had lying around that isn't `SCE` or something else.
|
38 |
|
39 |
## Merge Details
|
40 |
### Merge Method
|