metadata
base_model:
- NousResearch/Hermes-3-Llama-3.1-8B
- arcee-ai/Llama-3.1-SuperNova-Lite
library_name: transformers
tags:
- mergekit
- merge
license: llama3.1
model-index:
- name: HermesNova-Llama-3.1-8B
results:
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: bbh
metrics:
- name: acc_norm
type: acc_norm
value: 0.5418
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: gpqa
metrics:
- name: acc_norm
type: acc_norm
value: 0.3365
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: math
metrics:
- name: exact_match
type: exact_match
value: 0.1148
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: mmlu
metrics:
- name: acc_norm
type: acc_norm
value: 0.3729
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: musr
metrics:
- name: acc_norm
type: acc_norm
value: 0.433
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: hellaswag
metrics:
- name: acc
type: acc
value: 0.6306512646883091
verified: false
- name: acc_norm
type: acc_norm
value: 0.818263294164509
verified: false
'Make knowledge free for everyone'
HermesNova
The 2 most powerful LLama3.1 model Hermes-3-Llama-3.1-8B and Llama-3.1-SuperNova-Lite merged
merge
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the linear merge method.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
models:
- model: NousResearch/Hermes-3-Llama-3.1-8B
parameters:
weight: 1.0
- model: arcee-ai/Llama-3.1-SuperNova-Lite
parameters:
weight: 1.0
merge_method: linear
dtype: float16