Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,17 @@ Original model: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base
|
|
12 |
|
13 |
Uploading this since I'm using it to calculate imatrix, figured might as well provide it in the meantime
|
14 |
|
15 |
-
Remember, this is a **BASE** model, so it likely will not chat properly unless you give it multiple turns of examples
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
|
17 |
382G total size
|
18 |
|
|
|
12 |
|
13 |
Uploading this since I'm using it to calculate imatrix, figured might as well provide it in the meantime
|
14 |
|
15 |
+
Remember, this is a **BASE** model, so it likely will not chat properly unless you give it multiple turns of examples, for instance I've had success with:
|
16 |
+
|
17 |
+
```
|
18 |
+
./llama-cli -m /models/deepseek-ai_DeepSeek-V3.1-Base-Q4_K_M-00001-of-00011.gguf -p "You are a helpful assistant.<|User|>Hello, how are you?<|Assistant|>I'm doing well thanks! Yourself?<|User|>I'm doing great! Can you explain the laws of thermodynamics?<|Assistant|>" -no-cnv -ngl 0
|
19 |
+
```
|
20 |
+
|
21 |
+
This resulted in a completely coherent reply:
|
22 |
+
|
23 |
+
> The first law of thermodynamics is that energy can neither be created nor destroyed. The second law states that entropy, or disorder, in the universe will always increase. The third law states that a perfect crystal at absolute zero would have zero entropy.
|
24 |
+
|
25 |
+
The idea is that you need to teach the base model what a conversation looks like first, base models aren't usually capable of one-shotting a conversation since it hasn't been tuned to understand roles.
|
26 |
|
27 |
382G total size
|
28 |
|