R136a1 committed
Commit c75c491 · verified · 1 Parent(s): cdf5ace

Update README.md

Files changed (1)
  1. README.md +11 -19
README.md CHANGED
@@ -6,24 +6,16 @@ tags:
 - safetensors
 - mixtral
 ---
-Test model.
-
-Under testing...
-
-Recipe:
-```yaml
-base_model: /content/InfinityRP
-gate_mode: random
-dtype: bfloat16 # output dtype (float32, float16, or bfloat16)
-## (optional)
-experts_per_token: 2
-experts:
-  - source_model: /content/Aurav2
-    positive_prompts: []
-  - source_model: /content/Spice
-    positive_prompts: []
-  - source_model: /content/InfinityRP
-    positive_prompts: []
-  - source_model: /content/DaCo
-    positive_prompts: []
-```
 
+I prefer this one over v1 since it's a bit more creative and _smart_, and it understands the story better. It uses some different models from v1 but performs very close to it (I guess because I used the same base model?).
+
+Testing done.
+
+It performs really well in complex scenarios and follows the character card quite well. The character card and previous messages can strongly affect the style of the next reply.
+
+The main idea: instead of _merging_ models to create a new model, I try to put these best models into a Mixtral MoE so they can work together. The result is good; every model keeps its uniqueness and strengths.
+
+Downside? It only supports 8k (8192) context length...
+
+Alpaca prompting format.
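The Alpaca prompting format mentioned in the new README follows the common instruction/response template. As a sketch (the helper name and exact preamble wording are illustrative, not taken from this model card):

```python
def alpaca_prompt(instruction: str, response: str = "") -> str:
    """Build a prompt in the standard Alpaca template (illustrative helper)."""
    return (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        f"### Response:\n{response}"
    )

print(alpaca_prompt("Greet the user in character."))
```

Leaving `response` empty ends the prompt at the `### Response:` header, which is where the model is expected to continue generating.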
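The recipe removed in this commit is a mergekit MoE config (`gate_mode: random`, four experts, an `/content/InfinityRP` base). The exact command the author used is not shown; assuming mergekit is installed and the recipe is saved locally, a typical invocation would look roughly like this:

```shell
# Hypothetical invocation: assumes the YAML recipe is saved as moe-recipe.yaml
# and that the /content/* model paths in it exist locally.
pip install mergekit
mergekit-moe moe-recipe.yaml ./merged-moe
```

With `gate_mode: random` the router weights are initialized randomly rather than derived from the (empty) `positive_prompts` lists.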