csabakecskemeti's picture
Update README.md
05038f7 verified
metadata
base_model:
  - NousResearch/Hermes-3-Llama-3.1-8B
  - arcee-ai/Llama-3.1-SuperNova-Lite
library_name: transformers
tags:
  - mergekit
  - merge
license: llama3.1
model-index:
  - name: HermesNova-Llama-3.1-8B
    results:
      - task:
          type: text-generation
        dataset:
          type: lm-evaluation-harness
          name: bbh
        metrics:
          - name: acc_norm
            type: acc_norm
            value: 0.5418
            verified: false
      - task:
          type: text-generation
        dataset:
          type: lm-evaluation-harness
          name: gpqa
        metrics:
          - name: acc_norm
            type: acc_norm
            value: 0.3365
            verified: false
      - task:
          type: text-generation
        dataset:
          type: lm-evaluation-harness
          name: math
        metrics:
          - name: exact_match
            type: exact_match
            value: 0.1148
            verified: false
      - task:
          type: text-generation
        dataset:
          type: lm-evaluation-harness
          name: mmlu
        metrics:
          - name: acc_norm
            type: acc_norm
            value: 0.3729
            verified: false
      - task:
          type: text-generation
        dataset:
          type: lm-evaluation-harness
          name: musr
        metrics:
          - name: acc_norm
            type: acc_norm
            value: 0.433
            verified: false
      - task:
          type: text-generation
        dataset:
          type: lm-evaluation-harness
          name: hellaswag
        metrics:
          - name: acc
            type: acc
            value: 0.6306512646883091
            verified: false
          - name: acc_norm
            type: acc_norm
            value: 0.818263294164509
            verified: false

'Make knowledge free for everyone'

Buy Me a Coffee at ko-fi.com

HermesNova

image/jpeg

The 2 most powerful LLama3.1 model Hermes-3-Llama-3.1-8B and Llama-3.1-SuperNova-Lite merged

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the linear merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: NousResearch/Hermes-3-Llama-3.1-8B
    parameters:
      weight: 1.0
  - model: arcee-ai/Llama-3.1-SuperNova-Lite
    parameters:
      weight: 1.0
merge_method: linear
dtype: float16