d-rang-d
/

MS3-RP-Broth-24B

@@ -1,32 +1,173 @@
 ---
-base_model:
-- ReadyArt/Forgotten-Safeword-24B-V2.2
-- mergekit-community/MS3-RP-half1
-- mergekit-community/MS3-RP-RP-half2
 library_name: transformers
 tags:
 - mergekit
 - merge
 ---
-# merge
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
-### Merge Method
-This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [ReadyArt/Forgotten-Safeword-24B-V2.2](https://huggingface.co/ReadyArt/Forgotten-Safeword-24B-V2.2) as a base.
-### Models Merged
-The following models were included in the merge:
-* [mergekit-community/MS3-RP-half1](https://huggingface.co/mergekit-community/MS3-RP-half1)
-* [mergekit-community/MS3-RP-RP-half2](https://huggingface.co/mergekit-community/MS3-RP-RP-half2)
-### Configuration
-The following YAML configuration was used to produce this model:
 ```yaml
 base_model: ReadyArt/Forgotten-Safeword-24B-V2.2
@@ -35,4 +176,4 @@ dtype: bfloat16
 models:
   - model: mergekit-community/MS3-RP-half1
   - model: mergekit-community/MS3-RP-RP-half2
-```

 ---
+language:
+- en
+license: apache-2.0
 library_name: transformers
 tags:
 - mergekit
 - merge
+base_model:
+- unsloth/Mistral-Small-24B-Base-2501
+- unsloth/Mistral-Small-24B-Instruct-2501
+- trashpanda-org/MS-24B-Instruct-Mullein-v0
+- trashpanda-org/Llama3-24B-Mullein-v1
+- ArliAI/Mistral-Small-24B-ArliAI-RPMax-v1.4
+- TheDrummer/Cydonia-24B-v2
+- estrogen/MS2501-24b-Ink-apollo-ep2
+- huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
+- ToastyPigeon/ms3-roselily-rp-v2
+- PocketDoc/Dans-DangerousWinds-V1.1.1-24b
+- ReadyArt/Forgotten-Safeword-24B-V2.2
 ---
+***
+### Overview
+One of the merging steps for [Tantum](https://huggingface.co/Nohobby/MS3-Tantum-24B-v0.1). Might be better than the end result
+**Settings:**
+Samplers: [Weird preset](https://files.catbox.moe/ccwmca.json) | [Forgotten-Safeword preset](https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Extra-Dry)
+Prompt format: Mistral-V7-Tekken (?)
+I use [this](https://files.catbox.moe/daluze.json) lorebook for all chats instead of a system prompt for mistal models.
+### Quants
+[Static](https://huggingface.co/mradermacher/MS-RP-whole-GGUF) | [Imatrix](https://huggingface.co/mradermacher/MS-RP-whole-i1-GGUF)
+***
 ## Merge Details
+### Merging steps
+## MS3-test-Merge-1
+```yaml
+models:
+  - model: unsloth/Mistral-Small-24B-Base-2501
+  - model: unsloth/Mistral-Small-24B-Instruct-2501+ToastyPigeon/new-ms-rp-test-ws
+    parameters:
+        select_topk:
+          - value: [0.05, 0.03, 0.02, 0.02, 0.01]
+  - model: unsloth/Mistral-Small-24B-Instruct-2501+estrogen/MS2501-24b-Ink-ep2-adpt
+    parameters:
+        select_topk: 0.1
+  - model: trashpanda-org/MS-24B-Instruct-Mullein-v0
+    parameters:
+        select_topk: 0.4
+base_model: unsloth/Mistral-Small-24B-Base-2501
+merge_method: sce
+parameters:
+  int8_mask: true
+  rescale: true
+  normalize: true
+dtype: bfloat16
+tokenizer_source: base
+```
+```yaml
+dtype: bfloat16
+tokenizer_source: base
+merge_method: della_linear
+parameters:
+  density: 0.55
+base_model: Step1
+models:
+  - model: unsloth/Mistral-Small-24B-Instruct-2501
+    parameters:
+      weight:
+        - filter: v_proj
+          value: [0, 0, 1, 1, 1, 1, 1, 1, 1, 0, 0]
+        - filter: o_proj
+          value: [1, 0, 1, 0, 0, 0, 0, 0, 1, 1, 1]
+        - filter: up_proj
+          value: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
+        - filter: gate_proj
+          value: [0, 0, 1, 1, 1, 1, 1, 1, 1, 0, 0]
+        - filter: down_proj
+          value: [1, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0]
+        - value: 0
+  - model: Step1
+    parameters:
+      weight:
+        - filter: v_proj
+          value: [1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1]
+        - filter: o_proj
+          value: [0, 1, 0, 1, 1, 1, 1, 1, 0, 0, 0]
+        - filter: up_proj
+          value: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
+        - filter: gate_proj
+          value: [1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1]
+        - filter: down_proj
+          value: [0, 1, 0, 1, 1, 1, 1, 1, 1, 1, 1]
+        - value: 1
+```
+Some early MS3 merge. Not really worth using on its own. Just added it for fun.
+## RP-half1
+```yaml
+models:
+  - model: ArliAI/Mistral-Small-24B-ArliAI-RPMax-v1.4
+    parameters:
+      weight: 0.2
+      density: 0.7
+  - model: trashpanda-org/Llama3-24B-Mullein-v1
+    parameters:
+      weight: 0.2
+      density: 0.7
+  - model: TheDrummer/Cydonia-24B-v2
+    parameters:
+      weight: 0.2
+      density: 0.7
+merge_method: della_linear
+base_model: Nohobby/MS3-test-Merge-1
+parameters:
+  epsilon: 0.2
+  lambda: 1.1
+dtype: bfloat16
+tokenizer:
+ source: base
+```
+## RP-half2
+```yaml
+base_model: Nohobby/MS3-test-Merge-1
+parameters:
+  epsilon: 0.05
+  lambda: 0.9
+  int8_mask: true
+  rescale: true
+  normalize: false
+dtype: bfloat16
+tokenizer:
+ source: base
+merge_method: della
+models:
+  - model: estrogen/MS2501-24b-Ink-apollo-ep2
+    parameters:
+      weight: [0.1, -0.01, 0.1, -0.02, 0.1]
+      density: [0.6, 0.4, 0.5, 0.4, 0.6]
+  - model: huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
+    parameters:
+      weight: [0.02, -0.01, 0.02, -0.02, 0.01]
+      density: [0.45, 0.55, 0.45, 0.55, 0.45]
+  - model: ToastyPigeon/ms3-roselily-rp-v2
+    parameters:
+      weight: [0.01, -0.02, 0.02, -0.025, 0.01]
+      density: [0.45, 0.65, 0.45, 0.65, 0.45]
+  - model: PocketDoc/Dans-DangerousWinds-V1.1.1-24b
+    parameters:
+      weight: [0.1, -0.01, 0.1, -0.02, 0.1]
+      density: [0.6, 0.4, 0.5, 0.4, 0.6]
+```
+## RP-broth
 ```yaml
 base_model: ReadyArt/Forgotten-Safeword-24B-V2.2
 models:
   - model: mergekit-community/MS3-RP-half1
   - model: mergekit-community/MS3-RP-RP-half2
+```