Mistral-Nemo-12B Merges
Collection
2 items
โข
Updated
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Model Stock merge method using mistralai/Mistral-Nemo-Instruct-2407 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: mistralai/Mistral-Nemo-Instruct-2407
- model: mergekit-community/MN-Nyx-Chthonia-12B
- model: yamatazen/BlueLight-12B
- model: DreadPoor/YM-12B-Model_Stock
tokenizer:
source: union
tokens:
"<|im_start|>":
source: mergekit-community/MN-Nyx-Chthonia-12B
"<|im_end|>":
source: mergekit-community/MN-Nyx-Chthonia-12B
"[INST]":
source: mistralai/Mistral-Nemo-Instruct-2407
"[/INST]":
source: mistralai/Mistral-Nemo-Instruct-2407
merge_method: model_stock
base_model: mistralai/Mistral-Nemo-Instruct-2407
dtype: bfloat16
out_dtype: bfloat16
chat_template: chatml