nbeerbower
/

mistral-nemo-gutenberg-12B-v4

Text Generation

text-generation-inference

Model card Files Files and versions Community

mistral-nemo-gutenberg-12B-v4

TheDrummer/Rocinante-12B-v1 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.56
IFEval (0-Shot)	23.79
BBH (3-Shot)	31.97
MATH Lvl 5 (4-Shot)	10.95
GPQA (0-shot)	8.84
MuSR (0-shot)	13.20
MMLU-PRO (5-shot)	28.62

Downloads last month: 52

Safetensors

Model size

12.2B params

Tensor type

BF16

·

Model tree for nbeerbower/mistral-nemo-gutenberg-12B-v4

Base model

TheDrummer/Rocinante-12B-v1

Finetuned

(2)

this model

Finetunes

Merges

Quantizations

Dataset used to train nbeerbower/mistral-nemo-gutenberg-12B-v4

Spaces using nbeerbower/mistral-nemo-gutenberg-12B-v4 5

Collection including nbeerbower/mistral-nemo-gutenberg-12B-v4

Nemo

Mistral Nemo 12B finetunes and merges • 6 items • Updated Jun 9 • 2

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

23.790
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

31.970
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

10.950
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

8.840
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

13.200
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

28.620

View on Papers With Code