mistral-nemo-gutenberg-12B-v4

TheDrummer/Rocinante-12B-v1 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 19.56
IFEval (0-Shot) 23.79
BBH (3-Shot) 31.97
MATH Lvl 5 (4-Shot) 10.95
GPQA (0-shot) 8.84
MuSR (0-shot) 13.20
MMLU-PRO (5-shot) 28.62
Downloads last month
85
Safetensors
Model size
12.2B params
Tensor type
BF16
Β·
Inference Providers NEW
The selected billing account doesn't have any compatible Inference Provider enabled for this model. Settings

Model tree for nbeerbower/mistral-nemo-gutenberg-12B-v4

Finetuned
(2)
this model
Finetunes
3 models
Merges
45 models
Quantizations
7 models

Dataset used to train nbeerbower/mistral-nemo-gutenberg-12B-v4

Spaces using nbeerbower/mistral-nemo-gutenberg-12B-v4 4

Collection including nbeerbower/mistral-nemo-gutenberg-12B-v4

Evaluation results