ZeroXClem committed on
Commit 4830f82 · verified · 1 Parent(s): be87e86

Update README.md

Files changed (1)
  1. README.md +103 -5
README.md CHANGED
@@ -4,17 +4,66 @@ tags:
4
  - merge
5
  - mergekit
6
  - lazymergekit
7
  - kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
8
  - djuna/L3.1-Purosani-2-8B
9
  ---
10
 
11
  # ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B
12
 
13
- ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
14
- * [kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B](https://huggingface.co/kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B)
15
- * [djuna/L3.1-Purosani-2-8B](https://huggingface.co/djuna/L3.1-Purosani-2-8B)
16
 
17
- ## 🧩 Configuration
18
 
19
  ```yaml
20
  slices:
@@ -34,4 +83,53 @@ parameters:
34
  - value: 0.5
35
  dtype: bfloat16
36
 
37
- ```
4
  - merge
5
  - mergekit
6
  - lazymergekit
7
+ - Hermes3
8
+ - SuperNovaLite
9
+ - Purosani
10
+ - Llama3.1
11
  - kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
12
  - djuna/L3.1-Purosani-2-8B
13
+ - instruction-following
14
+ - long-form-generation
15
+ - roleplay
16
+ - storytelling
17
+ base_model:
18
+ - djuna/L3.1-Purosani-2-8B
19
  ---
20
 
21
  # ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B
22
 
23
+ **ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B** is a merged model that combines two Llama 3.1 8B merges into a single **adaptive**, general-purpose model. It is aimed at scientific research, complex instruction following, and immersive roleplay.
24
+
25
+ ## 🌟 Family Tree
26
+
27
+ This model is a merge of the following:
28
+
29
+ - [**kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B**](https://huggingface.co/kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B)
30
+ - [**djuna/L3.1-Purosani-2-8B**](https://huggingface.co/djuna/L3.1-Purosani-2-8B)
31
+
32
+ These parent models are themselves **merges** of several other models, which makes ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B a **hybrid** that draws on a broad set of ancestors and can handle diverse tasks.
33
+
34
+ ## 🧬 Detailed Model Lineage
35
+
36
+ ### **A: kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B**
37
+
38
+ Merged using the **TIES merge method**, this model utilizes **unsloth/Meta-Llama-3.1-8B** as its base, combining:
39
+
40
+ - **arcee-ai/Llama-3.1-SuperNova-Lite**: An 8B-parameter model distilled from **Llama-3.1-405B-Instruct**, designed to maintain high performance while minimizing resource consumption. Its instruction data was generated with **EvolKit**, which supports instruction-following precision and domain-specific adaptability.
41
+ - **NousResearch/Hermes-3-Llama-3.1-8B**: Known for its robustness, this model enhances long-range contextual understanding, making it ideal for complex, multi-layered tasks.
42
+
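The TIES method mentioned above can be illustrated with a minimal sketch on plain Python lists. This is not mergekit's implementation: the function name `ties_merge` and the parameters `density` and `lam` are hypothetical, and real merges operate on full model tensors. The sketch follows the usual description of TIES: compute task vectors against the base, trim small entries, elect a sign per parameter, and average only the entries that agree with it.

```python
def ties_merge(base, finetuned, density=0.5, lam=1.0):
    """Simplified TIES merge of several fine-tuned weight vectors into `base`.

    `density` (fraction of entries kept per task vector) and `lam` (scaling of
    the merged delta) are illustrative parameter names, not mergekit's API.
    """
    n = len(base)
    # 1. Task vectors: what each fine-tune changed relative to the base.
    task_vectors = [[w - b for w, b in zip(model, base)] for model in finetuned]

    # 2. Trim: keep only the largest-magnitude entries of each task vector.
    keep = max(1, round(density * n))
    trimmed = []
    for tv in task_vectors:
        top = set(sorted(range(n), key=lambda i: abs(tv[i]), reverse=True)[:keep])
        trimmed.append([v if i in top else 0.0 for i, v in enumerate(tv)])

    merged = []
    for i in range(n):
        column = [tv[i] for tv in trimmed]
        # 3. Elect sign: the direction with the larger total magnitude wins.
        sign = 1.0 if sum(column) >= 0 else -1.0
        # 4. Disjoint merge: average only the entries that agree with that sign.
        agreeing = [v for v in column if v * sign > 0]
        delta = sum(agreeing) / len(agreeing) if agreeing else 0.0
        merged.append(base[i] + lam * delta)
    return merged


base = [0.0, 0.0, 0.0, 0.0]
model_a = [1.0, -2.0, 0.1, 0.0]
model_b = [3.0, 1.0, -0.2, 0.0]
print(ties_merge(base, [model_a, model_b], density=0.5))  # [2.0, -2.0, 0.0, 0.0]
```

In the second position the two models disagree on direction, so only the larger change (`-2.0`) survives instead of being averaged toward zero, which is the interference that TIES is meant to reduce.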
43
+ ### **B: djuna/L3.1-Purosani-2-8B**
44
+
45
+ This merge incorporates:
46
+
47
+ - **hf-100/Llama-3-Spellbound-Instruct-8B-0.3**
48
+ - **arcee-ai/Llama-3.1-SuperNova-Lite**
49
+ - **grimjim/Llama-3-Instruct-abliteration-LoRA-8B**
50
+ - **THUDM/LongWriter-llama3.1-8B**, capable of generating over **10,000 words** in one pass, which makes it well suited to long-form content generation.
51
+
52
+ Further contributors include **ResplendentAI/Smarts_Llama3** and **djuna/L3.1-Suze-Vume-2-calc**, making this model highly adaptable to a broad range of applications.
53
+
54
+ ## 🛠️ Merge Details
55
+
56
+ The model was merged using the **della merge method** with **kromeurus/L3.1-Aglow-Vulca-v0.1-8B** as the base. This method, combined with the following models, ensures both **precision** and **adaptability**:
57
+
58
+ - **djuna/L3.1-Noraian**
59
+ - **Casual-Autopsy/L3-Super-Nova-RP-8B**
60
+ - **TheDrummer/Llama-3SOME-8B-v2**
61
+ - **djuna/L3.1-ForStHS**
62
+ - **Blackroot/Llama-3-8B-Abomination-LORA**
63
+
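As a rough illustration of the idea behind the della method named above, the sketch below shows only its drop-and-rescale step: entries of a delta (fine-tuned minus base) are dropped at random, small-magnitude entries are dropped more often, and survivors are rescaled so the expected value is preserved. The function name, the linear probability schedule, and the `density` and `epsilon` parameters here are assumptions for illustration, not mergekit's implementation, and the later sign-election and fusion steps are omitted.

```python
import random


def magnitude_drop_and_rescale(delta, density=0.5, epsilon=0.2, rng=None):
    """Randomly drop entries of `delta`, favouring large magnitudes, then rescale.

    Hypothetical helper: keep probabilities are spread linearly over
    [density - epsilon / 2, density + epsilon / 2] by magnitude rank.
    """
    if rng is None:
        rng = random.Random()
    n = len(delta)
    # Rank entries by magnitude: rank 0 is the smallest, rank n - 1 the largest.
    order = sorted(range(n), key=lambda i: abs(delta[i]))
    out = [0.0] * n
    for rank, i in enumerate(order):
        frac = rank / (n - 1) if n > 1 else 0.5
        keep_p = density - epsilon / 2 + epsilon * frac
        keep_p = min(1.0, max(1e-6, keep_p))  # clamp to a valid probability
        if rng.random() < keep_p:
            # Rescale survivors so each entry keeps its expected value.
            out[i] = delta[i] / keep_p
    return out


rng = random.Random(0)
print(magnitude_drop_and_rescale([1.0, -2.0, 4.0], density=0.5, epsilon=0.2, rng=rng))
```

With `density=1.0` and `epsilon=0.0` nothing is dropped and the delta is returned unchanged; lower densities make each merged delta sparser while keeping its average contribution the same.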
64
+ ## 🔧 Technical Configuration
65
 
66
+ The merge was produced with the following mergekit configuration:
67
 
68
  ```yaml
69
  slices:
 
83
  - value: 0.5
84
  dtype: bfloat16
85
 
86
+ ```
87
+
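The diff elides most of the configuration, so the merge method itself is not visible here. Assuming the `value: 0.5` entry is an interpolation weight for a spherical linear interpolation (slerp) between two models, which is an assumption and not confirmed by the visible lines, the operation on a single pair of weight vectors can be sketched as follows. The function name `slerp` and its `eps` parameter are illustrative.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two weight vectors.

    t = 0 returns v0, t = 1 returns v1; t = 0.5 is the midpoint along the arc.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 < eps or norm1 < eps:
        # A zero vector has no direction: fall back to linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    # Angle between the two directions.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    theta = math.acos(max(-1.0, min(1.0, dot)))
    if abs(math.sin(theta)) < eps:
        # Nearly parallel vectors: linear interpolation is numerically safer.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]


print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Unlike a plain average, which would give `[0.5, 0.5]` for the two unit vectors above, the slerp midpoint keeps unit length (both components are about 0.707).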
88
+ ## 🎯 Extended Support for Roleplay & Immersive Storytelling
89
+
90
+ **ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B** has been optimized for **extended roleplay support**, making it an exceptional choice for **interactive storytelling** and **deep character development**. With its ability to understand long-form context and generate cohesive responses over extensive interactions, this model excels in:
91
+
92
+ - **Character-driven interactions**: Develop rich, nuanced personalities that respond in believable and engaging ways.
93
+ - **World-building & Lore creation**: Create vast, interconnected universes with intricate lore, all generated in real time.
94
+ - **Dynamic NPC dialogues**: Use the model to generate complex, reactive conversations for game NPCs, offering a fluid, immersive experience for players.
95
+
96
+ ## 🚀 Key Features & Capabilities
97
+
98
+ ### **Advanced Roleplay and Long-Form Content Generation**
99
+
100
+ With **THUDM/LongWriter-llama3.1-8B** in its lineage, this model is well suited to generating **long-form narratives** while maintaining coherence and creativity.
101
+
102
+ ### **Instruction Following & Task Adaptability**
103
+
104
+ Combining the capabilities of **Hermes** and **SuperNovaLite**, this model can efficiently follow detailed instructions, making it ideal for:
105
+
106
+ - **Task automation**
107
+ - **Virtual assistants**
108
+ - **Research generation**
109
+
110
+ ### **Efficiency Without Compromise**
111
+
112
+ Distilled models like **SuperNovaLite** ensure that this model delivers high performance without the extensive resource requirements of larger models.
113
+
114
+ ## 🎯 Use Case & Applications
115
+
116
+ - **Roleplay & Interactive Storytelling**: The perfect companion for storytellers, RPG enthusiasts, and game developers. Whether crafting dynamic NPC interactions or generating deep, immersive worlds, this model can handle it all.
117
+ - **Instruction-based AI**: With enhanced instruction-following abilities, this model is ideal for developing intelligent assistants or chatbots that require high accuracy and quick adaptability.
118
+ - **Long-Form Writing**: From novels to research papers, this model can generate lengthy, well-structured content, drawing on the long-output capability of **THUDM/LongWriter-llama3.1-8B** in its lineage.
119
+
120
+ ## 📜 License
121
+
122
+ This model is open-sourced under the **Apache-2.0 License**, allowing others to use and modify it freely, as long as they give proper attribution.
123
+
124
+ ## 💡 Tags
125
+
126
+ - `merge`
127
+ - `mergekit`
128
+ - `Hermes3`
129
+ - `SuperNovaLite`
130
+ - `Purosani`
131
+ - `Llama3.1`
132
+ - `instruction-following`
133
+ - `long-form-generation`
134
+ - `roleplay`
135
+ - `storytelling`