Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ datasets:
 - neph1/codefeedback-swedish
 ---
 
-# Model Card for
+# Model Card for Bellman
 
 This version of bellman is finetuned from llama-3-instruct-8b.
 It is arguable whether it's better at Swedish, because llama-3 is really good. It is, however, finetuned for prompt question answering, based on a dataset created from
@@ -18,6 +18,8 @@ Swedish wikipedia, with a lot of Sweden-centric questions.
 New from previous versions is questions from a translated code-feedback dataset, as well as a number of stories. It's not great at generating stories,
 but better than previously.
 
+Please note, the HuggingFace inference API is probably trying to load the adapter (LoRA), which isn't going to work.
+
 240609: I've uploaded a 4-bit GPTQ quant, but it's completely untested.
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/653cd3049107029eb004f968/IDGX3d9lGe6yx-yHjsrav.png)
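Since the hosted inference widget cannot apply the LoRA adapter, the sketch below shows one way to load the base model and attach the adapter locally with transformers and peft. The base id meta-llama/Meta-Llama-3-8B-Instruct matches the stated base model; the adapter repo id and the example Swedish question are hypothetical placeholders, not names confirmed by this card.

```python
# Minimal sketch: load the Llama-3 base model and attach the LoRA adapter locally,
# since the hosted inference API cannot do this automatically.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_ID = "meta-llama/Meta-Llama-3-8B-Instruct"
ADAPTER_ID = "neph1/bellman-llama-3-8b-instruct"  # hypothetical id -- replace with this repo's actual name

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
base = AutoModelForCausalLM.from_pretrained(
    BASE_ID, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, ADAPTER_ID)  # attach the LoRA weights

# Swedish prompt question answering via the Llama-3 chat template
messages = [{"role": "user", "content": "Vilken är Sveriges största sjö?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=200)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```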
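For the untested 4-bit GPTQ quant mentioned in the 240609 note, the sketch below shows how such a checkpoint is typically loaded with a recent transformers install plus the optimum and auto-gptq (or gptqmodel) backends. The quant repo id is a hypothetical placeholder; where the quant actually lives is not stated on this card.

```python
# Minimal sketch: load the (untested) 4-bit GPTQ quant with transformers.
# Assumes the optimum and auto-gptq (or gptqmodel) packages are installed and a GPU is available.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

GPTQ_ID = "neph1/bellman-llama-3-8b-instruct-gptq"  # hypothetical id -- point at the actual quant repo

tokenizer = AutoTokenizer.from_pretrained(GPTQ_ID)
model = AutoModelForCausalLM.from_pretrained(
    GPTQ_ID,
    device_map="auto",          # GPTQ kernels run on GPU
    torch_dtype=torch.float16,  # dtype for the non-quantized layers
)

messages = [{"role": "user", "content": "Skriv en kort saga om en katt."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=200)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```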