Text Generation
GGUF
English
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
llama 3.1
llama-3
llama3
llama-3.1
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
float32
swearing
role play
sillytavern
backyard
horror
context 128k
mergekit
Merge
Mixture of Experts
mixture of experts
Not-For-All-Audiences
conversational
Commit
·
02c6fcb
verified
·
0
Parent(s):
Duplicate from mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF
Browse files- .gitattributes +46 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.IQ4_XS.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q2_K.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_L.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_M.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_S.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_M.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_S.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_M.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_S.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q6_K.gguf +3 -0
- L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q8_0.gguf +3 -0
- README.md +93 -0
.gitattributes
ADDED
@@ -0,0 +1,46 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
*.7z filter=lfs diff=lfs merge=lfs -text
|
2 |
+
*.arrow filter=lfs diff=lfs merge=lfs -text
|
3 |
+
*.bin filter=lfs diff=lfs merge=lfs -text
|
4 |
+
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
5 |
+
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
6 |
+
*.ftz filter=lfs diff=lfs merge=lfs -text
|
7 |
+
*.gz filter=lfs diff=lfs merge=lfs -text
|
8 |
+
*.h5 filter=lfs diff=lfs merge=lfs -text
|
9 |
+
*.joblib filter=lfs diff=lfs merge=lfs -text
|
10 |
+
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
11 |
+
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
12 |
+
*.model filter=lfs diff=lfs merge=lfs -text
|
13 |
+
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
14 |
+
*.npy filter=lfs diff=lfs merge=lfs -text
|
15 |
+
*.npz filter=lfs diff=lfs merge=lfs -text
|
16 |
+
*.onnx filter=lfs diff=lfs merge=lfs -text
|
17 |
+
*.ot filter=lfs diff=lfs merge=lfs -text
|
18 |
+
*.parquet filter=lfs diff=lfs merge=lfs -text
|
19 |
+
*.pb filter=lfs diff=lfs merge=lfs -text
|
20 |
+
*.pickle filter=lfs diff=lfs merge=lfs -text
|
21 |
+
*.pkl filter=lfs diff=lfs merge=lfs -text
|
22 |
+
*.pt filter=lfs diff=lfs merge=lfs -text
|
23 |
+
*.pth filter=lfs diff=lfs merge=lfs -text
|
24 |
+
*.rar filter=lfs diff=lfs merge=lfs -text
|
25 |
+
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
26 |
+
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
27 |
+
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
28 |
+
*.tar filter=lfs diff=lfs merge=lfs -text
|
29 |
+
*.tflite filter=lfs diff=lfs merge=lfs -text
|
30 |
+
*.tgz filter=lfs diff=lfs merge=lfs -text
|
31 |
+
*.wasm filter=lfs diff=lfs merge=lfs -text
|
32 |
+
*.xz filter=lfs diff=lfs merge=lfs -text
|
33 |
+
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
+
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
+
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
46 |
+
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.IQ4_XS.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9255d5ba3baefde1c417cae239ef0e34bba3b0ae46ae7738b69fa6d7f37a3f0d
|
3 |
+
size 13580771264
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:805763a9238fac170ac6e71dcc826a283e58fd7b2670bd74743395005c77e135
|
3 |
+
size 9302826944
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c12251df3f53a3c8db1d7093bbb422773a5abdce8ccfab4f1932c2c280d259bd
|
3 |
+
size 13044023232
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:972bad52280782b96e63b391450e6b3fed9924f75686833bd3cb93f74937a4da
|
3 |
+
size 12080381888
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0c07abdf7f82ce328e338372638588685b39560f2afc1f2431f66b0491fcd3fe
|
3 |
+
size 10933239744
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ca84b6ba6951621451f2ce9fb2ba3898ce5310701a2db57c9a45560f366daeb0
|
3 |
+
size 15162187712
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:76ac133de0bec49ad7b60f9ae6fe4bb237924b2983d3eb30610bb393f18c6b24
|
3 |
+
size 14295539648
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5dc36de0468acf9cd774fbd4a79b5406ceadd30867c12c363784b45ef3cb4c95
|
3 |
+
size 17736048576
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f7f8085623d6a2ce5ba93b2a23f06dfed2c60fdedef0074f8bee60e987c01714
|
3 |
+
size 17228013504
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f6a5d76183edb18b9b04ca99881c818b374d1eb4a6290a6b34414d6dac821a91
|
3 |
+
size 20470775744
|
L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a632acf73eae25797af26534ada5f00af78572a0474464955483e52cc605934b
|
3 |
+
size 26511278016
|
README.md
ADDED
@@ -0,0 +1,93 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model: DavidAU/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
library_name: transformers
|
6 |
+
license: apache-2.0
|
7 |
+
quantized_by: mradermacher
|
8 |
+
tags:
|
9 |
+
- creative
|
10 |
+
- creative writing
|
11 |
+
- fiction writing
|
12 |
+
- plot generation
|
13 |
+
- sub-plot generation
|
14 |
+
- fiction writing
|
15 |
+
- story generation
|
16 |
+
- scene continue
|
17 |
+
- storytelling
|
18 |
+
- fiction story
|
19 |
+
- science fiction
|
20 |
+
- romance
|
21 |
+
- all genres
|
22 |
+
- story
|
23 |
+
- writing
|
24 |
+
- vivid prosing
|
25 |
+
- vivid writing
|
26 |
+
- fiction
|
27 |
+
- roleplaying
|
28 |
+
- float32
|
29 |
+
- swearing
|
30 |
+
- role play
|
31 |
+
- sillytavern
|
32 |
+
- backyard
|
33 |
+
- horror
|
34 |
+
- llama 3.1
|
35 |
+
- context 128k
|
36 |
+
- mergekit
|
37 |
+
- merge
|
38 |
+
- not-for-all-audiences
|
39 |
+
---
|
40 |
+
## About
|
41 |
+
|
42 |
+
<!-- ### quantize_version: 2 -->
|
43 |
+
<!-- ### output_tensor_quantised: 1 -->
|
44 |
+
<!-- ### convert_type: hf -->
|
45 |
+
<!-- ### vocab_type: -->
|
46 |
+
<!-- ### tags: -->
|
47 |
+
static quants of https://huggingface.co/DavidAU/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B
|
48 |
+
|
49 |
+
<!-- provided-files -->
|
50 |
+
weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.
|
51 |
+
## Usage
|
52 |
+
|
53 |
+
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
54 |
+
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
55 |
+
more details, including on how to concatenate multi-part files.
|
56 |
+
|
57 |
+
## Provided Quants
|
58 |
+
|
59 |
+
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
|
60 |
+
|
61 |
+
| Link | Type | Size/GB | Notes |
|
62 |
+
|:-----|:-----|--------:|:------|
|
63 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q2_K.gguf) | Q2_K | 9.4 | |
|
64 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_S.gguf) | Q3_K_S | 11.0 | |
|
65 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_M.gguf) | Q3_K_M | 12.2 | lower quality |
|
66 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q3_K_L.gguf) | Q3_K_L | 13.1 | |
|
67 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_S.gguf) | Q4_K_S | 14.4 | fast, recommended |
|
68 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q4_K_M.gguf) | Q4_K_M | 15.3 | fast, recommended |
|
69 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_S.gguf) | Q5_K_S | 17.3 | |
|
70 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q5_K_M.gguf) | Q5_K_M | 17.8 | |
|
71 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q6_K.gguf) | Q6_K | 20.6 | very good quality |
|
72 |
+
| [GGUF](https://huggingface.co/mradermacher/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B-GGUF/resolve/main/L3.1-MOE-4X8B-Dark-Reasoning-Dark-Planet-Hermes-R1-Uncensored-e32-25B.Q8_0.gguf) | Q8_0 | 26.6 | fast, best quality |
|
73 |
+
|
74 |
+
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
75 |
+
types (lower is better):
|
76 |
+
|
77 |
+

|
78 |
+
|
79 |
+
And here are Artefact2's thoughts on the matter:
|
80 |
+
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
|
81 |
+
|
82 |
+
## FAQ / Model Request
|
83 |
+
|
84 |
+
See https://huggingface.co/mradermacher/model_requests for some answers to
|
85 |
+
questions you might have and/or if you want some other model quantized.
|
86 |
+
|
87 |
+
## Thanks
|
88 |
+
|
89 |
+
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
90 |
+
me use its servers and providing upgrades to my workstation to enable
|
91 |
+
this work in my free time.
|
92 |
+
|
93 |
+
<!-- end -->
|