---
base_model:
- wanglab/mixtral_7b_dna_pretrain
- mistralai/Mistral-7B-v0.1
- EstherXC/mixtral_7b_protein_pretrain
library_name: transformers
tags:
- mergekit
- merge
---
# mixtral_tie

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the [Task Arithmetic](https://arxiv.org/abs/2212.04089) merge method, with [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) as the base. Task arithmetic builds a task vector for each fine-tuned model (its weights minus the base weights), scales each vector, and adds the sum back onto the base; a per-tensor sketch appears after the configuration below.

### Models Merged

The following models were included in the merge:

* [wanglab/mixtral_7b_dna_pretrain](https://huggingface.co/wanglab/mixtral_7b_dna_pretrain)
* [EstherXC/mixtral_7b_protein_pretrain](https://huggingface.co/EstherXC/mixtral_7b_protein_pretrain)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
base_model: mistralai/Mistral-7B-v0.1
models:
  - model: EstherXC/mixtral_7b_protein_pretrain
    parameters:
      weight: 0.3
  - model: wanglab/mixtral_7b_dna_pretrain #dnagpt/llama-dna
    parameters:
      weight: 0.3
merge_method: task_arithmetic
dtype: float16
tokenizer_source: "base"
```
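### Merge arithmetic (sketch)

For each tensor, task arithmetic adds the weighted task vectors of the fine-tuned models onto the base weights. The sketch below illustrates that update with the weights from the configuration above; the function name and the plain state-dict interface are illustrative assumptions, not mergekit's internal API.

```python
import torch

def task_arithmetic(base_sd, tuned_sds, weights):
    """base + sum_i w_i * (tuned_i - base), computed tensor by tensor."""
    merged = {}
    for name, base_t in base_sd.items():
        acc = base_t.float()
        for sd, w in zip(tuned_sds, weights):
            # Task vector for this model: fine-tuned weights minus base weights.
            acc = acc + w * (sd[name].float() - base_t.float())
        merged[name] = acc.to(torch.float16)  # matches `dtype: float16` above
    return merged

# With this card's config: two task vectors, each scaled by 0.3.
# merged_sd = task_arithmetic(base_sd, [protein_sd, dna_sd], [0.3, 0.3])
```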
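### Reproducing the merge

The merge should be reproducible by saving the YAML above and running it through mergekit. The snippet below uses mergekit's Python entry points as shown in its example notebook (`MergeConfiguration`, `run_merge`, `MergeOptions`); the file name and output path are assumptions. Equivalently, the `mergekit-yaml` CLI can be pointed at the same file.

```python
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# merge_config.yaml holds the YAML configuration shown above (assumed filename).
with open("merge_config.yaml", encoding="utf-8") as f:
    config = MergeConfiguration.model_validate(yaml.safe_load(f))

run_merge(
    config,
    out_path="./mixtral_tie",  # assumed output directory
    options=MergeOptions(copy_tokenizer=True, lazy_unpickle=True),
)
```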
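### Loading the merged model

Since the card declares `library_name: transformers`, the merged checkpoint should load with the standard `AutoModelForCausalLM`/`AutoTokenizer` classes. The path below assumes the local output directory from the step above; substitute the Hub repo ID if the model is hosted, and note the prompt is a toy illustration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("./mixtral_tie", torch_dtype=torch.float16)
tokenizer = AutoTokenizer.from_pretrained("./mixtral_tie")

inputs = tokenizer("ATGGCGT", return_tensors="pt")  # toy DNA-like prompt (assumption)
outputs = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```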