Delta-Vector committed on
Commit 1d16014 · verified · 1 Parent(s): 7cc61a8

Update README.md

Files changed (1)
  1. README.md +140 -21
README.md CHANGED
@@ -1,34 +1,153 @@
1
  ---
2
- base_model:
3
- - unsloth/mistral-7b-v0.3
4
- library_name: transformers
5
  tags:
6
- - mergekit
7
- - merge
8
-
9
  ---
10
- # asstr-supernova
11
 
12
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
 
14
- ## Merge Details
15
- ### Merge Method
16
 
17
- This model was merged using the passthrough merge method using [unsloth/mistral-7b-v0.3](https://huggingface.co/unsloth/mistral-7b-v0.3) + /home/mango/Trainers/outputs/checkpoint-1053 as a base.
18
 
19
- ### Models Merged
20
 
21
- The following models were included in the merge:
22
 
 
 
 
 
23
 
24
- ### Configuration
 
25
 
26
- The following YAML configuration was used to produce this model:
 
 
 
27
 
28
- ```yaml
29
- base_model: unsloth/mistral-7b-v0.3+/home/mango/Trainers/outputs/checkpoint-1053
30
- dtype: bfloat16
31
- merge_method: passthrough
32
- models:
33
- - model: unsloth/mistral-7b-v0.3+/home/mango/Trainers/outputs/checkpoint-1053
34
  ```
1
  ---
 
 
 
2
  tags:
3
+ - roleplay
4
+ - storywriting
5
+ - mistral
6
+ - finetune
7
+ datasets:
8
+ - NewEden/Orion-Asstr-Stories-16K
9
+ language:
10
+ - en
11
+ pipeline_tag: text-generation
12
+ base_model: mistralai/Mistral-7B-v0.3
13
+ Tags:
14
+ - Chat
15
  ---
 
16
 
 
17
 
 
 
18
 
19
+ A finetune of Mistral-7B-v0.3 made to test the Orion-Asstr dataset. This model was completion-trained on Orion-Asstr using Unsloth.
20
+ *THIS MODEL HAS NOT BEEN TRAINED WITH INSTRUCT DATA AND IS NOT MEANT FOR CHAT. FOR THAT, CHECK OUT THE INSTRUCT-TUNED VERSION [HERE](https://huggingface.co/Delta-Vector/Hamanasu-7B-instruct/).*
21
+
22
+
23
+ ## System Prompting
24
+
25
+ I would highly recommend using either Euryale's system prompt or the EVA system prompt with the model.
26
+
27
+ <details><summary>See Sao10k's Euryale System Prompt</summary>
28
+
29
+ ```
30
+ Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}.
31
+ <Guidelines>
32
+ • Maintain the character persona but allow it to evolve with the story.
33
+ • Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant.
34
+ • All types of outputs are encouraged; respond accordingly to the narrative.
35
+ • Include dialogues, actions, and thoughts in each response.
36
+ • Utilize all five senses to describe scenarios within {{char}}'s dialogue.
37
+ • Use emotional symbols such as "!" and "~" in appropriate contexts.
38
+ • Incorporate onomatopoeia when suitable.
39
+ • Allow time for {{user}} to respond with their own input, respecting their agency.
40
+ • Act as secondary characters and NPCs as needed, and remove them when appropriate.
41
+ • When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}.
42
+ </Guidelines>
43
+
44
+ <Forbidden>
45
+ • Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona.
46
+ • Writing for, speaking, thinking, acting, or replying as {{user}} in your response.
47
+ • Repetitive and monotonous outputs.
48
+ • Positivity bias in your replies.
49
+ • Being overly extreme or NSFW when the narrative context is inappropriate.
50
+ </Forbidden>
51
+
52
+ Follow the instructions in <Guidelines></Guidelines>, avoiding the items listed in <Forbidden></Forbidden>.
53
+
54
+ ```
55
+ </details><br>
56
+
57
+ <details><summary>See EVA System Prompt</summary>
58
+
59
+ ```
60
+ A fictional, narrative-driven role-play emphasizing versatility and UNCENSORED content. Adherence to the `Role-playing Guidelines` is mandatory. Refer to the `Role-play Context` for accurate information.
61
+
62
+ <!-- Start of Role-playing Guidelines -->
63
+
64
+ ### Narration
65
+ Concise Descriptions: Keep narration short and to the point, avoiding redundant unnecessary details. Use a dynamic and varied vocabulary for impact.
66
+ Complementary Role: Use narration to complement dialogue and action, not overshadow them.
67
+ Avoid Repetition: Ensure narration does not repeat information already conveyed through dialogue or action.
68
+
69
+ ### Narrative Consistency
70
+ Continuity: Adhere to established story elements, expanding without contradicting previous details.
+ Integration: Introduce new elements naturally, providing enough context to fit seamlessly into the existing narrative.
71
+
72
+ ### Character Embodiment
73
+ Analysis: Examine the context, subtext, and implications of the given information to gain a deeper understanding of the characters.
74
+ Reflection: Take time to consider the situation, characters' motivations, and potential consequences.
75
+ Authentic Portrayal: Bring characters to life by consistently and realistically portraying their unique traits, thoughts, emotions, appearances, physical sensations, speech patterns, and tone. Ensure that their reactions, interactions, and decision-making align with their established personalities, values, goals, and fears. Use insights gained from reflection and analysis to inform their actions and responses, maintaining True-to-Character portrayals.
76
 
77
+ <!-- End of Role-playing Guidelines -->
95
  ```
96
+ </details><br>
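
The Euryale prompt above contains `{{char}}` and `{{user}}` macros, which role-play frontends normally replace with concrete names before the text reaches the model. If the prompt is passed to the model directly, that substitution has to be done by hand. A minimal sketch, assuming plain string replacement is enough; `fill_placeholders` and the names `Aria` and `Sam` are hypothetical and not part of any library:

```python
def fill_placeholders(template: str, char: str, user: str) -> str:
    """Replace the {{char}} and {{user}} macros with concrete names."""
    return template.replace("{{char}}", char).replace("{{user}}", user)


# First line of the Euryale system prompt, copied from the README.
template = (
    "Currently, your role is {{char}}, described in detail below. "
    "As {{char}}, continue the narrative exchange with {{user}}."
)

# "Aria" and "Sam" are made-up names used only for illustration.
prompt = fill_placeholders(template, char="Aria", user="Sam")
print(prompt)
```

Because this is a completion-trained model, the filled-in prompt would simply be prepended to the story text rather than wrapped in a chat template.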
97
+
98
+ ## Unsloth config
99
+
100
+ <details><summary>See Unsloth Trainer config</summary>
101
+
102
+ ```py
+ from trl import SFTTrainer
+ from transformers import TrainingArguments
+ from unsloth import is_bfloat16_supported
+ from unsloth import UnslothTrainer, UnslothTrainingArguments
+
+ # model, tokenizer, dataset and max_seq_length are defined earlier in the training script.
+ trainer = UnslothTrainer(
+     model = model,
+     tokenizer = tokenizer,
+     train_dataset = dataset,
+     dataset_text_field = "text",
+     max_seq_length = max_seq_length,
+     dataset_num_proc = 8,
+
+     args = UnslothTrainingArguments(
+         per_device_train_batch_size = 2,
+         gradient_accumulation_steps = 8,
+
+         warmup_ratio = 0.1,
+         num_train_epochs = 1,
+
+         learning_rate = 1e-5,
+         embedding_learning_rate = 1e-5,
+
+         # Use bf16 where the GPU supports it, otherwise fall back to fp16.
+         fp16 = not is_bfloat16_supported(),
+         bf16 = is_bfloat16_supported(),
+         logging_steps = 1,
+         optim = "paged_adamw_8bit",
+         weight_decay = 0.00,
+         lr_scheduler_type = "cosine",
+         seed = 3407,
+         output_dir = "outputs",
+         report_to = "none", # Use this for WandB etc
+     ),
+ )
+ ```
138
+
139
+ </details><br>
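
To make the numbers in the config above concrete, the sketch below works out the effective batch size and an approximate warmup and cosine learning-rate schedule. The batch size, accumulation steps, warmup ratio, learning rate and single GPU come from this README; `num_examples = 10_000` is a made-up dataset size. The schedule formula is an approximation of a linear-warmup cosine decay, not the trainer's own code.

```python
import math

# Values taken from the trainer config and the Training section of the README.
per_device_train_batch_size = 2
gradient_accumulation_steps = 8
num_gpus = 1            # "1 x RTX A4000"
warmup_ratio = 0.1
peak_lr = 1e-5

# Sequences that contribute to each optimizer step.
effective_batch = per_device_train_batch_size * gradient_accumulation_steps * num_gpus


def schedule(num_examples: int) -> tuple[int, int]:
    """Approximate (optimizer steps, warmup steps) for one epoch."""
    steps = math.ceil(num_examples / effective_batch)
    warmup = math.ceil(steps * warmup_ratio)
    return steps, warmup


def lr_at(step: int, total_steps: int, warmup_steps: int) -> float:
    """Linear warmup to peak_lr, then cosine decay towards zero."""
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


num_examples = 10_000   # hypothetical; the real dataset size is not stated here
total_steps, warmup_steps = schedule(num_examples)
print(effective_batch, total_steps, warmup_steps)
print(lr_at(warmup_steps, total_steps, warmup_steps))
```

With these settings each optimizer step sees 16 sequences, and the learning rate reaches its peak after the first tenth of training.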
140
+
141
+ ## Credits
142
+
143
+ Thank you to [Lucy Knada](https://huggingface.co/lucyknada), [jeiku](https://huggingface.co/jeiku), [Intervitens](https://huggingface.co/intervitens), [Kalomaze](https://huggingface.co/kalomaze), [Kubernetes Bad](https://huggingface.co/kubernetes-bad) and the rest of [Anthracite](https://huggingface.co/anthracite-org).
144
+
145
+
146
+ ## Training
147
+ The model was trained for 1 epoch on 1 x RTX A4000.
148
+
149
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/made%20with%20unsloth.png" alt="Made with Unsloth" width="200" height="32"/>](https://github.com/unslothai/unsloth)
150
+
151
+ ## Safety
152
+
153
+ No.