miscii-14b-0218 / README.md
sthenno's picture
Upload folder using huggingface_hub
0d03d97 verified
|
raw
history blame
1.44 kB
metadata
base_model: []
library_name: transformers
tags:
  - mergekit
  - merge

tempesthenno-ms-0218

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Model Stock merge method using /Users/sthenno/models/tempesthenno-ppo-enchanted as a base.

Models Merged

The following models were included in the merge:

  • /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt40
  • /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt50
  • /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt60
  • /Users/sthenno/models/tempesthenno-sft-0218-ckpt60
  • /Users/sthenno/models/tempesthenno-sft-0218-ckpt80

Configuration

The following YAML configuration was used to produce this model:

name: tempesthenno-ms-0218
merge_method: model_stock
base_model: /Users/sthenno/models/tempesthenno-ppo-enchanted
tokenizer:
  source: base
dtype: float32
out_dtype: bfloat16
parameters:
  int8_mask: true
  normalize: true
  rescale: false
models:
  - model: /Users/sthenno/models/tempesthenno-sft-0218-ckpt60
  - model: /Users/sthenno/models/tempesthenno-sft-0218-ckpt80
  - model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt40
  - model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt50
  - model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt60