revert
README.md CHANGED
@@ -2,7 +2,7 @@
 license: gpl-3.0
 metrics:
 - perplexity
-pipeline_tag:
+pipeline_tag: text-generation
 tags:
 - LLaMa
 - text-generation-inference
@@ -38,4 +38,4 @@ Check https://github.com/ggerganov/llama.cpp#quantization for details on the dif
 
 I recommend the following settings when running as a good starting point: ```main.exe -m ggml-LLaMa-65B-q4_0.bin -n -1 -t 42 -c 2048 --temp 0.4 --interactive-first --repeat_penalty 1.2 --color```
 
-Be aware that LLaMa is a text generation model, not a conversational one, and as such you will have to prompt it differently than, for example, Vicuna or ChatGPT.
+Be aware that LLaMa is a text generation model, not a conversational one, and as such you will have to prompt it differently than, for example, Vicuna or ChatGPT.
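The net effect of this revert is to restore the `pipeline_tag` value in the model card's YAML front matter, which the Hugging Face Hub uses to categorize the model under the text-generation task. A minimal sketch of the restored metadata block, assembled from the first hunk above (the enclosing `---` delimiters and line 1 sit outside the hunk and are assumed here):

```yaml
---
license: gpl-3.0
metrics:
- perplexity
pipeline_tag: text-generation   # restored by this revert; determines the Hub task listing
tags:
- LLaMa
- text-generation-inference
---
```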