sthenno-com/miscii-14b-1225


@@ -28,8 +28,7 @@ model-index:
       value: 78.78
       name: strict accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -44,8 +43,7 @@ model-index:
       value: 50.91
       name: normalized accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -60,8 +58,7 @@ model-index:
       value: 31.57
       name: exact match
     source:
-      url: >-
-        https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -76,8 +73,7 @@ model-index:
       value: 17
       name: acc_norm
     source:
-      url: >-
-        https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -92,8 +88,7 @@ model-index:
       value: 14.77
       name: acc_norm
     source:
-      url: >-
-        https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -110,8 +105,7 @@ model-index:
       value: 47.46
       name: accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sthenno-com/miscii-14b-1225
       name: Open LLM Leaderboard
 ---
@@ -190,4 +184,17 @@ As of **December 25, 2024**, this should be the **best-performing 14B model** in
 |MATH Lvl 5 (4-Shot)|31.57|
 |GPQA (0-shot)      |17.00|
 |MuSR (0-shot)      |14.77|
-|MMLU-PRO (5-shot)  |47.46|
+|MMLU-PRO (5-shot)  |47.46|
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/sthenno-com__miscii-14b-1225-details)
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |42.35|
+|IFEval (0-Shot)    |78.78|
+|BBH (3-Shot)       |50.91|
+|MATH Lvl 5 (4-Shot)|45.17|
+|GPQA (0-shot)      |17.00|
+|MuSR (0-shot)      |14.77|
+|MMLU-PRO (5-shot)  |47.46|
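
Review note: the `Avg.` row in the added table can be sanity-checked with a few lines of Python. This is only a sketch, and it assumes that `Avg.` is the unweighted arithmetic mean of the six benchmark scores; the diff itself does not state how the average is derived. The dictionary name `added_table` and the helper `mean_score` are made up for this illustration.

```python
# Scores copied from the table added at the end of this diff.
added_table = {
    "IFEval (0-Shot)": 78.78,
    "BBH (3-Shot)": 50.91,
    "MATH Lvl 5 (4-Shot)": 45.17,
    "GPQA (0-shot)": 17.00,
    "MuSR (0-shot)": 14.77,
    "MMLU-PRO (5-shot)": 47.46,
}


def mean_score(scores):
    """Unweighted mean of the benchmark scores, rounded to two decimals.

    Assumption: the leaderboard average is a plain arithmetic mean.
    """
    return round(sum(scores.values()) / len(scores), 2)


average = mean_score(added_table)

# Same computation with the MATH Lvl 5 score taken from the unchanged
# context row and the `exact match` metadata entry (31.57) instead.
with_context_math = mean_score({**added_table, "MATH Lvl 5 (4-Shot)": 31.57})

print(average)            # 42.35
print(with_context_math)  # 40.08
```

Under that assumption the stated `Avg.` of 42.35 agrees with the added table's MATH Lvl 5 value of 45.17, and not with the 31.57 shown in the existing table row and in the `model-index` metadata. The diff does not indicate which of the two MATH Lvl 5 figures is current.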