--- base_model: - Qwen/Qwen2.5-32B-Instruct - karakuri-ai/karakuri-lm-32b-thinking-2501-exp library_name: transformers tags: - mergekit - merge --- # SKYDRIVE-32B-v0.1 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as a base. ### Models Merged The following models were included in the merge: * SKYDRIVE_element_jp_02 * SKYDRIVE_element_jp_03 * [karakuri-ai/karakuri-lm-32b-thinking-2501-exp](https://huggingface.co/karakuri-ai/karakuri-lm-32b-thinking-2501-exp) * SKYCAVE_element_Sky_jp * SKYDRIVE_element_jp_04 ### Configuration The following YAML configuration was used to produce this model: ```yaml merge_method: model_stock base_model: Qwen/Qwen2.5-32B-Instruct models: - model: karakuri-ai/karakuri-lm-32b-thinking-2501-exp - model: SKYCAVE_element_Sky_jp - model: SKYDRIVE_element_jp_02 - model: SKYDRIVE_element_jp_03 - model: SKYDRIVE_element_jp_04 dtype: bfloat16 pad_to_multiple_of: 512 tokenizer_source: base name: SKYDRIVE-32B-v0.1 ```