Pinkstack/Parm-2-CoT-14B-16k-o1-QwQ

Pinkstack committed 12 days ago

Commit eb41caa (verified) · 1 parent: 6b96a41

Update README.md

Files changed (1)

README.md +18 -0

@@ -126,6 +126,22 @@ model-index:
     source:
       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=Pinkstack%2FSuperThoughts-CoT-14B-16k-o1-QwQ
       name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: Llmexplorer lmsys elo
+      type: elo-score
+      config: main
+      split: test
+    metrics:
+    - type: elo
+      value: 1203
+      name: elo
+    source:
+      url: https://llm.extractum.io/list/?benchmark=score_elo
+      name: LLMexplorer lmsys elo score
 ---
 Renamed to parm-2
 Please note, the low IFEVAL results is due to this model always reasoning, instruction following is limited, which caused it to have very low ifeval results, this should not matter for most use cases.
@@ -179,6 +195,8 @@ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-
 |GPQA (0-shot)      |    19.02|
 |MuSR (0-shot)      |    21.79|
 |MMLU-PRO (5-shot)  |    47.43|
+# other leaderboard
+According to https://llm.extractum.io/list/?benchmark=score_elo, this model is in the top 20 on their LMSys ELO score leaderboard.
 # 🧀 Examples:
 (q4_k_m, 10GB rtx 3080, 64GB memory, running inside of MSTY, all use "You are a friendly ai assistant." as the System prompt.)
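
Reviewer note: the two hunk headers can be checked against the `+18 -0` summary for README.md. The sketch below is illustrative only and not part of the commit. It assumes standard unified-diff hunk headers, and the helper name `net_line_change` is hypothetical. It sums the new length minus the old length of each hunk, which equals additions minus deletions.

```python
import re

# Matches unified-diff hunk headers such as "@@ -126,6 +126,22 @@ model-index:".
HUNK_RE = re.compile(r"^@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def net_line_change(hunk_headers):
    """Return the sum of (new length - old length) over all hunk headers."""
    total = 0
    for header in hunk_headers:
        match = HUNK_RE.match(header)
        if match is None:
            raise ValueError(f"not a hunk header: {header!r}")
        # In unified diff format an omitted length means 1.
        old_len = int(match.group(2) or 1)
        new_len = int(match.group(4) or 1)
        total += new_len - old_len
    return total


# Hunk headers copied from this diff (trailing context text shortened).
headers = [
    "@@ -126,6 +126,22 @@ model-index:",
    "@@ -179,6 +195,8 @@ Summarized results can be found",
]
print(net_line_change(headers))  # 18
```

The hunks grow by 16 and 2 lines respectively. With no deletions, that net change of 18 matches the `+18 -0` reported for the file.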