bartowski commited on
Commit
48633a2
·
verified ·
1 Parent(s): 8db6a8e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -1
README.md CHANGED
@@ -12,7 +12,17 @@ Original model: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base
12
 
13
  Uploading this since I'm using it to calculate imatrix, figured might as well provide it in the meantime
14
 
15
- Remember, this is a **BASE** model, so it likely will not chat properly unless you give it multiple turns of examples (I'm going to test a bit)
 
 
 
 
 
 
 
 
 
 
16
 
17
  382G total size
18
 
 
12
 
13
  Uploading this since I'm using it to calculate imatrix, figured might as well provide it in the meantime
14
 
15
+ Remember, this is a **BASE** model, so it likely will not chat properly unless you give it multiple turns of examples, for instance I've had success with:
16
+
17
+ ```
18
+ ./llama-cli -m /models/deepseek-ai_DeepSeek-V3.1-Base-Q4_K_M-00001-of-00011.gguf -p "You are a helpful assistant.<|User|>Hello, how are you?<|Assistant|>I'm doing well thanks! Yourself?<|User|>I'm doing great! Can you explain the laws of thermodynamics?<|Assistant|>" -no-cnv -ngl 0
19
+ ```
20
+
21
+ This resulted in a completely coherent reply:
22
+
23
+ > The first law of thermodynamics is that energy can neither be created nor destroyed. The second law states that entropy, or disorder, in the universe will always increase. The third law states that a perfect crystal at absolute zero would have zero entropy.
24
+
25
+ The idea is that you need to teach the base model what a conversation looks like first, base models aren't usually capable of one-shotting a conversation since it hasn't been tuned to understand roles.
26
 
27
  382G total size
28