bartowski committed
Commit 708d712 · verified · 1 parent: eef6c70

Update README.md

Files changed (1): README.md (+5 -1)
README.md CHANGED
@@ -12,16 +12,20 @@ Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a
 
 Original model: https://huggingface.co/deepseek-ai/DeepSeek-R1-0528
 
+Huge shoutout to [Artus](https://huggingface.co/ArtusDev) who helped with both conversion and imatrix calculation!
+
 All quants made using imatrix option with dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
 
 Run them in [LM Studio](https://lmstudio.ai/)
 
 Run them directly with [llama.cpp](https://github.com/ggerganov/llama.cpp), or any other llama.cpp based project
 
+These quants were made with the changes from this PR for improved performance: https://github.com/ggml-org/llama.cpp/pull/12727
+
 ## Prompt format
 
 ```
-<|begin▁of▁sentence|>{system_prompt}<|User|>{prompt}<|Assistant|><|end▁of▁sentence|><|Assistant|>
+<|begin▁of▁sentence|>{system_prompt}<|User|>{prompt}<|Assistant|>
 ```
 
 ## Download a file (not the whole branch) from below:
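
For illustration, the corrected prompt template can be filled in with plain string formatting. A minimal sketch: the template string is copied from the README above, while `build_prompt` and the example strings are hypothetical, not part of the repo:

```python
# Template copied from the README's "Prompt format" section.
# The ▁ characters (U+2581) are part of the model's special tokens.
TEMPLATE = "<|begin▁of▁sentence|>{system_prompt}<|User|>{prompt}<|Assistant|>"


def build_prompt(system_prompt: str, prompt: str) -> str:
    """Fill the chat template with a system prompt and a user message.

    Hypothetical helper for illustration; in practice llama.cpp (or any
    frontend such as LM Studio) applies the model's chat template for you.
    """
    return TEMPLATE.format(system_prompt=system_prompt, prompt=prompt)


text = build_prompt("You are a helpful assistant.", "What is 2 + 2?")
```

Note the diff above: the old template incorrectly appended `<|end▁of▁sentence|><|Assistant|>` after the assistant turn; the end-of-sentence token is emitted by the model, not supplied in the prompt.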