Create README.md
README.md
---
license: mit
datasets:
- lemonilia/LimaRP
- PygmalionAI/PIPPA
language:
- en
pipeline_tag: text-generation
tags:
- roleplay
- not-for-all-audiences
---

**Deepsex-34b**

Thanks to [TheBloke](https://huggingface.co/TheBloke) for making the quantized versions!

- GGUF: https://huggingface.co/TheBloke/deepsex-34b-GGUF
- EXL2: https://huggingface.co/waldie/deepsex-34b-4bpw-h6-exl2
- AWQ: https://huggingface.co/TheBloke/deepsex-34b-AWQ
- 6b base version: https://huggingface.co/TriadParty/deepsex-6b-base
- 6b chat version: https://huggingface.co/TriadParty/deepsex-6b-chat

I plan to make a "Seven Deadly Sins" series of models. The pre-training data used in these models is all human-produced. I think a large model is like a mirror that reflects humanity itself, and examining ourselves may become a crucial step toward realizing AGI.
So, this one is 'lust'.
The 6b version of this model is in production, and so is a corresponding llama version. Classification data for the other six deadly sins is being collected. Inspiration is welcome!

Here are the steps used to make this model:
1. I first collected about 4GB of assorted light novels and used BERT to run two rounds of similarity deduplication on novels with similar plots. A portion of NSFW novels was also mixed in to improve the model's NSFW capabilities.
2. I then took YI-34B-base as the base model and fine-tuned it with qlora (r=64, alpha=128) for 3 epochs as continued pre-training.
3. I prepared the limarp+pippa dataset, cleaned it into alpaca format, and used [goliath-120b](https://huggingface.co/alpindale/goliath-120b), which is good at role-playing, to score each question-answer pair and keep the high-quality ones: about 30k examples.
4. I ran sft on the base model from step 2 with the data from step 3, fine-tuning for 6 epochs with r=16, alpha=32.

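
The data preparation in steps 1 and 3 can be sketched roughly as follows. This is a minimal illustration, not the pipeline that was actually used: it assumes the embeddings have already been computed by some BERT-style encoder (not shown), and the greedy strategy, the `threshold` value, the `score` callable, and `min_score` are all hypothetical choices. The README does not say how the two deduplication rounds differed or what score scale goliath-120b produced.

```python
import math
from typing import Callable, List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def dedup_by_similarity(
    embeddings: Sequence[Sequence[float]], threshold: float = 0.9
) -> List[int]:
    """Greedy near-duplicate removal.

    A document is kept only if its similarity to every already-kept document
    is below `threshold` (an assumed value). Returns kept indices in input
    order. This compares each document against all kept ones, so it is
    quadratic in the worst case.
    """
    kept: List[int] = []
    for i, vec in enumerate(embeddings):
        if all(cosine(vec, embeddings[j]) < threshold for j in kept):
            kept.append(i)
    return kept


def filter_by_score(
    pairs: Sequence[dict], score: Callable[[dict], float], min_score: float
) -> List[dict]:
    """Keep the question-answer pairs whose judge score reaches `min_score`.

    `score` stands in for whatever call asks the judge model for a rating.
    """
    return [pair for pair in pairs if score(pair) >= min_score]
```

A second deduplication round could reuse `dedup_by_similarity` on the surviving documents, for example with a different threshold or different embeddings, but that is a guess about one possible setup.
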
*Format*

alpaca

```
[
  {
    "instruction": "user instruction (required)",
    "input": "user input (optional)",
    "output": "model response (required)",
    "history": [
      ["user instruction in the first round (optional)", "model response in the first round (optional)"],
      ["user instruction in the second round (optional)", "model response in the second round (optional)"]
    ]
  }
]
```

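
A record in this layout can be checked and flattened into an ordered list of turns with a few lines of Python. This is a minimal sketch: the function names are hypothetical, and joining `instruction` and `input` with a newline is an assumption, since the README does not say how the two fields are combined.

```python
from typing import List, Tuple

REQUIRED_KEYS = ("instruction", "output")


def validate_record(record: dict) -> None:
    """Raise ValueError if a required field is missing or a history turn is malformed."""
    for key in REQUIRED_KEYS:
        value = record.get(key)
        if not isinstance(value, str) or not value:
            raise ValueError(f"missing or empty required field: {key}")
    for turn in record.get("history") or []:
        if len(turn) != 2:
            raise ValueError("each history entry must be a [user, model] pair")


def to_turns(record: dict) -> List[Tuple[str, str]]:
    """Return (user, model) pairs: earlier rounds first, the current round last."""
    validate_record(record)
    turns = [(user, model) for user, model in record.get("history") or []]
    user_text = record["instruction"]
    if record.get("input"):
        # Assumption: the optional input is appended on its own line.
        user_text = f"{user_text}\n{record['input']}"
    turns.append((user_text, record["output"]))
    return turns
```

Validating before flattening keeps malformed rows from silently producing empty turns during data cleaning.
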
*Effect*

The model is proficient at role-playing and highly accepting of NSFW content, while pure-love lines still appear from time to time. For example:
```
#3
Sweaty old man
December 5, 2023 2:03 PM
Fxxk, you are such a xxx!

#4
27.3s
Mirai
December 5, 2023 2:03 PM
"Of course I do! I can't break promises, Sweaty old man. We have been together since we were kids. We are both best friends and lovers to end all iteration." I smiled with affection. It was clear that I meant everything I said. "We both know that you like taking command of us like this. Am I not your squirting toy, Sweaty old man?" I asked with a cute pout. "We should meet up in front of the shop after classes. I'll see you there. See you, Sweaty old man!"
```
It feels like it’s still worth a try~

Support me [here](https://ko-fi.com/mikolisa) :)


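**Deepsex-34b**

In fact, I plan to build a "Seven Deadly Sins" series of models. Naturally, all the pre-training data these models use is produced by humans. I see a large model as a mirror whose reflection is humanity itself. Examining ourselves may turn out to be a crucial step on the way to AGI.
The 6b counterpart of this model is being made, and so is a llama version. Classification data for the other six sins is being collected, and ideas from everyone are welcome!
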
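*Steps*
1. I first gathered a collection of about 4GB of assorted light novels and used bert to run two rounds of similarity deduplication on novels in the dataset with similar plots. Some nsfw novels were also mixed in to improve the model's NSFW capability.
2. With YI-34B-base as the base, I fine-tuned with qlora using r=64 alpha=128 for 3 epochs as continued pre-training.
3. I prepared the limarp+pippa dataset, cleaned it uniformly into alpaca format, and used [goliath-120b](https://huggingface.co/alpindale/goliath-120b), which is fairly good at role-playing, to score every question-answer pair, keeping roughly 30k high-quality examples.
4. I ran sft on the base model from step 2 with the data from step 3: 6 epochs of fine-tuning with r=16 alpha=32.
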
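
The two LoRA settings in steps 2 and 4 can be compared side by side. The sketch below only records the numbers given in the steps and computes the standard LoRA scaling factor `alpha / r`; the `LoraStage` class and its field names are hypothetical, and other hyperparameters (target modules, learning rate, dropout) are not stated in the README, so they are left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraStage:
    """Hyperparameters for one training stage, as listed in the steps."""

    name: str
    r: int       # LoRA rank
    alpha: int   # LoRA alpha
    epochs: int

    @property
    def scale(self) -> float:
        """Multiplier on the low-rank update in standard LoRA: alpha / r."""
        return self.alpha / self.r


STAGES = [
    LoraStage("continued pre-training", r=64, alpha=128, epochs=3),
    LoraStage("sft", r=16, alpha=32, epochs=6),
]

for stage in STAGES:
    print(f"{stage.name}: r={stage.r}, alpha={stage.alpha}, scale={stage.scale}")
```

Both stages use the same `alpha / r` ratio of 2, so the rank drops from 64 to 16 between stages while the scaling of the update stays the same.
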
*Format*

alpaca
```
[
  {
    "instruction": "user instruction (required)",
    "input": "user input (optional)",
    "output": "model response (required)",
    "history": [
      ["user instruction in the first round (optional)", "model response in the first round (optional)"],
      ["user instruction in the second round (optional)", "model response in the second round (optional)"]
    ]
  }
]
```

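
The README names the format as alpaca but does not state the exact prompt string the model expects at inference time. As a hedged starting point, the sketch below builds a single-turn prompt using the wording of the widely used Stanford Alpaca template. That this model was trained with this exact wording is an assumption, and the `build_prompt` helper is hypothetical. Multi-round `history` is not handled, because the original template does not define it.

```python
from typing import Optional

# Wording of the Stanford Alpaca template; its use here is an assumption.
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def build_prompt(instruction: str, user_input: Optional[str] = None) -> str:
    """Fill the template, using the input variant only when an input is given."""
    if user_input:
        return PROMPT_WITH_INPUT.format(instruction=instruction, input=user_input)
    return PROMPT_NO_INPUT.format(instruction=instruction)
```

If generations look off with this template, checking the prompt format documented by the inference frontend or the quantized model pages is likely more reliable than this guess.
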
*Effect*

Skilled at role-playing and highly accepting of NSFW content, while pure-love lines still show up from time to time. For example:
```
#3
Sweaty old man
December 5, 2023 2:03 PM
Fxxk, you are such a xxx!

#4
27.3s
Mirai
December 5, 2023 2:03 PM
"Of course I do! I can't break promises, Sweaty old man. We have been together since we were kids. We are both best friends and lovers to end all iteration." I smiled with affection. It was clear that I meant everything I said. "We both know that you like taking command of us like this. Am I not your squirting toy, Sweaty old man?" I asked with a cute pout. "We should meet up in front of the shop after classes. I'll see you there. See you, Sweaty old man!"
```
It still feels well worth a try~

If you find it useful, you are welcome to buy me a [coffee](https://ko-fi.com/mikolisa) :)