Merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Qwen2.5-7B-Instruct combined with a reasoning LoRA adapter (nbeerbower/R1-Qwen-7B-LORA).

Merge Method

This model was merged using the passthrough merge method, with unsloth/Qwen2.5-7B-Instruct + nbeerbower/R1-Qwen-7B-LORA as the base.
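
As a rough illustration of what folding a LoRA adapter into a base model involves, the sketch below applies the standard LoRA update W' = W + (alpha / r) * B A to one toy weight matrix. The names `matmul` and `fold_lora`, the toy matrices, and the alpha and rank values are assumptions made for illustration. This is not mergekit's own code, and a real adapter applies the update to each targeted module.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def fold_lora(weight, lora_a, lora_b, alpha, rank):
    """Return weight + (alpha / rank) * (lora_b @ lora_a).

    Shapes: weight is d_out x d_in, lora_b is d_out x rank, lora_a is rank x d_in.
    """
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 base weight with a rank-1 adapter (hypothetical values).
base_weight = [[1.0, 0.0], [0.0, 1.0]]
lora_b = [[1.0], [2.0]]   # d_out x rank
lora_a = [[0.5, -0.5]]    # rank x d_in

merged_weight = fold_lora(base_weight, lora_a, lora_b, alpha=2.0, rank=1)
print(merged_weight)  # [[2.0, -1.0], [2.0, -1.0]]
```

Once the adapter is folded in, a passthrough merge with a single input model leaves the resulting weights unchanged.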

Models Merged

No models other than the base (unsloth/Qwen2.5-7B-Instruct + nbeerbower/R1-Qwen-7B-LORA) were included in the merge.

Configuration

The following YAML configuration was used to produce this model:

base_model: unsloth/Qwen2.5-7B-Instruct+nbeerbower/R1-Qwen-7B-LORA
dtype: float16
merge_method: passthrough
models:
  - model: unsloth/Qwen2.5-7B-Instruct+nbeerbower/R1-Qwen-7B-LORA
tokenizer_source: unsloth/Qwen2.5-7B
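
The `+` in the model reference joins a base model to a LoRA adapter. The sketch below mirrors the configuration as a Python dictionary and splits the reference into its two parts. The helper `split_model_reference` is hypothetical and is not part of mergekit's API.

```python
def split_model_reference(ref):
    """Split 'base+adapter' into (base, adapter); adapter is None when absent."""
    base, sep, adapter = ref.partition("+")
    return base, (adapter if sep else None)


# The configuration above, written out as a plain dictionary.
config = {
    "base_model": "unsloth/Qwen2.5-7B-Instruct+nbeerbower/R1-Qwen-7B-LORA",
    "dtype": "float16",
    "merge_method": "passthrough",
    "models": [
        {"model": "unsloth/Qwen2.5-7B-Instruct+nbeerbower/R1-Qwen-7B-LORA"},
    ],
    "tokenizer_source": "unsloth/Qwen2.5-7B",
}

base, adapter = split_model_reference(config["base_model"])
print(base)     # unsloth/Qwen2.5-7B-Instruct
print(adapter)  # nbeerbower/R1-Qwen-7B-LORA
```

Assuming mergekit is installed and the YAML is saved under a name such as config.yaml (a hypothetical filename), a configuration like this is typically run with `mergekit-yaml config.yaml ./output-model-directory`.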

Open LLM Leaderboard Evaluation Results

Detailed results are available on the Open LLM Leaderboard.

Metric	Value
Avg.	3.78
IFEval (0-Shot)	13.46
BBH (3-Shot)	2.55
MATH Lvl 5 (4-Shot)	1.66
GPQA (0-shot)	0.34
MuSR (0-shot)	2.69
MMLU-PRO (5-shot)	2.00
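
The Avg. row appears to be the unweighted mean of the six benchmark scores. A quick check, assuming equal weighting:

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 13.46,
    "BBH (3-Shot)": 2.55,
    "MATH Lvl 5 (4-Shot)": 1.66,
    "GPQA (0-shot)": 0.34,
    "MuSR (0-shot)": 2.69,
    "MMLU-PRO (5-shot)": 2.00,
}

# Unweighted mean across all benchmarks.
average = sum(scores.values()) / len(scores)
print(round(average, 2))  # 3.78
```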


Evaluation results

Benchmark	Metric	Value
IFEval (0-Shot)	strict accuracy	13.460
BBH (3-Shot)	normalized accuracy	2.550
MATH Lvl 5 (4-Shot)	exact match	1.660
GPQA (0-shot)	acc_norm	0.340
MuSR (0-shot)	acc_norm	2.690
MMLU-PRO (5-shot)	accuracy (test set)	2.000

Source: Open LLM Leaderboard
