Jellon/Lyra-Gutenberg-12b-exl3-6bpw

6 bpw (bits per weight) EXL3 quantization of: https://huggingface.co/nbeerbower/Lyra-Gutenberg-mistral-nemo-12B
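As a rough guide to what 6 bpw means for storage, the sketch below estimates the weight footprint from a parameter count and a bits-per-weight figure. The 12e9 parameter count is an assumption taken from the "12B" in the model name, not an exact figure, and the helper name `estimate_weight_gb` is hypothetical. The estimate ignores any tensors kept at higher precision, file metadata, and runtime memory such as the KV cache.

```python
def estimate_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: nominal parameter count read off the "12B" in the model name.
NOMINAL_PARAMS = 12e9

print(f"{estimate_weight_gb(NOMINAL_PARAMS, 6):.1f} GB at 6 bpw")
print(f"{estimate_weight_gb(NOMINAL_PARAMS, 16):.1f} GB at 16 bits per weight")
```

Under these assumptions the 6 bpw quant needs roughly three eighths of the storage of a 16-bit copy of the same weights.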

Lyra-Gutenberg-12B

Sao10K/MN-12B-Lyra-v1 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	22.57
IFEval (0-Shot)	34.95
BBH (3-Shot)	36.99
MATH Lvl 5 (4-Shot)	8.31
GPQA (0-shot)	11.19
MuSR (0-shot)	14.76
MMLU-PRO (5-shot)	29.20
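The Avg. row can be checked against the six benchmark values in the table. The sketch below assumes the average is the unweighted mean of those six scores, rounded to two decimals; the card itself does not state how it was computed.

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 34.95,
    "BBH (3-Shot)": 36.99,
    "MATH Lvl 5 (4-Shot)": 8.31,
    "GPQA (0-shot)": 11.19,
    "MuSR (0-shot)": 14.76,
    "MMLU-PRO (5-shot)": 29.20,
}

# Assumption: "Avg." is the plain arithmetic mean of the six scores.
average = round(sum(scores.values()) / len(scores), 2)
print(average)
```

Under that assumption the result matches the reported Avg. of 22.57.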

Safetensors

Model size: 5.02B params
Tensor type: F16 · I16

Model tree for Jellon/Lyra-Gutenberg-12b-exl3-6bpw

Base model: Sao10K/MN-12B-Lyra-v1
Finetuned: nbeerbower/Lyra-Gutenberg-mistral-nemo-12B
Quantized (11): this model

Evaluation results

Benchmark	Metric	Value (Open LLM Leaderboard)
IFEval (0-Shot)	strict accuracy	34.950
BBH (3-Shot)	normalized accuracy	36.990
MATH Lvl 5 (4-Shot)	exact match	8.310
GPQA (0-shot)	acc_norm	11.190
MuSR (0-shot)	acc_norm	14.760
MMLU-PRO (5-shot)	accuracy (test set)	29.200
