allura-org
/

Mistral-Small-Sisyphus-24b-2503

Model card Files Files and versions Community

Fizzarolli commited on 7 days ago

Commit

fa6d905

·

verified ·

1 Parent(s): 1dfeef7

Create README.md

Files changed (1) hide show

README.md +45 -0

README.md ADDED Viewed

	@@ -0,0 +1,45 @@

+---
+license: apache-2.0
+language:
+- en
+- zh
+base_model:
+- mistralai/Mistral-Small-24B-Base-2501
+tags:
+- axolotl
+---
+# Sisyphus 24b
+Hundreds of dollars later.
+Dozens of failed finetunes.
+Sisyphus has balanced his rock on the summit.
+One must have imagined him happy while pushing. Now, he is ecstatic.
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/tHX80XuG_5HiW6F2hvvLe.jpeg)
+## About
+This is a pretty generic finetune of the 24b base model for multiturn instruct. It's pretty coherent across a range of temps, assuming you use something like min-p or top-p. It also supports reasoning blocks.
+## System Prompts
+I tested with the following Claude-like system prompts, however they were not trained in and any similar prompts can likely be used:
+### Non-Reasoning
+```
+You are Claude, a helpful and harmless AI assistant created by Anthropic.
+```
+### Reasoning
+```
+You are Claude, a helpful and harmless AI assistant created by Anthropic. Please contain all your thoughts in <think> </think> tags, and your final response right after the closing </think> tag.
+```
+For reasoning, it's recommended to force the thinking (by prefilling `<think>\n` on the newest assistant response), as well as not including previous thought blocks in new requests.
+## Instruct Template
+v7-Tekken, same as the original instruct model.
+## Dataset
+This model was trained on [allura-org/inkstructmix-v0.1](https://hf.co/datasets/allura-org/inkstructmix-v0.1).