--- base_model: - ToastyPigeon/ms3-roselily-instruct library_name: transformers tags: - mergekit - merge --- # todo make a model card and put a cute girl on it # some info Making this public so it can be tried and possibly merged if desired while I work on getting the energy to write a proper card. Short list of things to know: - This is a bunch of RP, story writing, etc. creative data applied to [ToastyPigeon/ms3-roselily-instruct](https://huggingface.co/ToastyPigeon/ms3-roselily-instruct). - Instruct format: ChatML or Alpaca preferred, Tekken v7 possible - ChatML tokens were assigned to unused tokens 20 and 21, this leaves all the tekken tokens intact so merges w/ tekken models are feasible - Instruct-tuning phase did include Tekken v7 so the tokens are initialized and recognized, but I did not continue with it on the creative step because I do not like it for creative stuff (too restrictive with turn order) - Feels a little less sensitive to samplers than Instruct-based MS3 models, but should probably still be used with conservative samplers # chat templates You may need to set `<|im_end|>` and/or `` as stopping strings depending on which format you're using, the model generates both properly but tokenizers can be finicky about what they stop on by default Alpaca w/ System ``` ### System: {system prompt} ### Instruction: {user message} ### Response: {model answer} ``` ChatML ``` <|im_start|>system {system prompt}<|im_end|> <|im_start|>user {user message}<|im_end|> <|im_start|>assistant {model answer}<|im_end|> ``` Also saw some completion training in chat mode and adventure mode.