Quazim0t0
/

Rune-14b

Quazim0t0/time-14b-stock

Quazim0t0/Mithril-14B-sce

Model card Files Files and versions Community

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Rune-14b

Rune-14b is a merge of the following models using LazyMergekit:

🧩 Configuration

base_model: Quazim0t0/Mithril-14B-sce
dtype: bfloat16
merge_method: slerp
parameters:
  t:
  - filter: self_attn
    value: [0.0, 0.5, 0.3, 0.7, 1.0]
  - filter: mlp
    value: [1.0, 0.5, 0.7, 0.3, 0.0]
  - value: 0.5
slices:
- sources:
  - layer_range: [0, 40]
    model: Quazim0t0/time-14b-stock
  - layer_range: [0, 40]
    model: Quazim0t0/Mithril-14B-sce

💻 Usage

!pip install -qU transformers accelerate

from transformers import AutoTokenizer
import transformers
import torch

model = "Quazim0t0/Rune-14b"
messages = [{"role": "user", "content": "What is a large language model?"}]

tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)

outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	41.82
IFEval (0-Shot)	70.16
BBH (3-Shot)	56.05
MATH Lvl 5 (4-Shot)	45.85
GPQA (0-shot)	13.53
MuSR (0-shot)	16.29
MMLU-PRO (5-shot)	49.02

Downloads last month: 56

Safetensors

Model size

14.7B params

Tensor type

BF16

·

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for Quazim0t0/Rune-14b

Quazim0t0/Mithril-14B-sce

Quazim0t0/time-14b-stock

Merge model

this model

Finetunes

1 model

Quantizations

Collection including Quazim0t0/Rune-14b

Highest Scoring

My models with the best score results • 4 items • Updated 10 days ago • 1

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

70.160
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

56.050
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

45.850
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

13.530
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

16.290
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

49.020

View on Papers With Code