Update README.md
README.md CHANGED
@@ -18,7 +18,8 @@ model-index:
       value: 73.04
       name: normalized accuracy
     source:
-      url:
+      url: >-
+        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=152334H/miqu-1-70b-sf
       name: Open LLM Leaderboard
   - task:
       type: text-generation
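This hunk and the five that follow make the same metadata fix: each `source.url` in the model-index front matter gains a `>-` folded scalar pointing at the Open LLM Leaderboard, filtered to this repo. A quick way to confirm the front matter still parses after such edits, using huggingface_hub's `ModelCard` API (a sketch; attribute names assume a current huggingface_hub release):

```python
from huggingface_hub import ModelCard

# load the card for this repo and parse its YAML front matter
card = ModelCard.load("152334H/miqu-1-70b-sf")

# model-index entries are exposed as EvalResult objects
for result in card.data.eval_results:
    print(result.metric_type, result.metric_value, result.source_url)
```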
@@ -34,7 +35,8 @@ model-index:
       value: 88.61
       name: normalized accuracy
     source:
-      url:
+      url: >-
+        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=152334H/miqu-1-70b-sf
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -51,7 +53,8 @@ model-index:
       value: 75.49
       name: accuracy
     source:
-      url:
+      url: >-
+        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=152334H/miqu-1-70b-sf
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -67,7 +70,8 @@ model-index:
     - type: mc2
       value: 69.38
     source:
-      url:
+      url: >-
+        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=152334H/miqu-1-70b-sf
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -84,7 +88,8 @@ model-index:
       value: 85.32
       name: accuracy
     source:
-      url:
+      url: >-
+        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=152334H/miqu-1-70b-sf
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -101,8 +106,11 @@ model-index:
       value: 67.7
       name: accuracy
     source:
-      url:
+      url: >-
+        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=152334H/miqu-1-70b-sf
       name: Open LLM Leaderboard
+language:
+- en
 ---
 this is [miqu-1-70b](https://huggingface.co/miqudev/miqu-1-70b), dequantised from q5 to f16 && transposed to pytorch. shapes have been rotated less wrongly than in [alpindale/miqu-1-70b-pytorch](https://huggingface.co/alpindale/miqu-1-70b-pytorch/tree/main)
 
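The intro sentence above compresses two steps: block-wise dequantisation of the q5 weights back to f16, and re-permuting the attention weight shapes that the earlier conversion had rotated wrongly. A toy sketch of the first step, with an invented block layout (real GGUF q5 packs 5-bit codes with per-sub-block scales and mins, which this deliberately skips):

```python
import torch

def dequantise_blocks(codes: torch.Tensor, scales: torch.Tensor, block: int = 32) -> torch.Tensor:
    """Toy block-wise dequant: every `block` integer codes share one scale.

    This only illustrates the scale-times-code idea behind q5 -> f16;
    it is not the actual GGUF Q5_K layout.
    """
    codes = codes.float().reshape(-1, block)       # (n_blocks, block) integer codes
    weights = codes * scales.float().unsqueeze(1)  # rescale each block
    return weights.reshape(-1).to(torch.float16)
```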
@@ -110,7 +118,7 @@ usage
 ```python
 from transformers import LlamaForCausalLM as LLM, LlamaTokenizer as LT
 
-lt = LT.from_pretrained("
+lt = LT.from_pretrained("152334H/miqu-1-70b-sf")
 t = lt("[INST] eloquent high camp prose about a cute catgirl [/INST]", return_tensors='pt').input_ids.cuda()
 
 llm = LLM.from_pretrained("152334H/miqu-1-70b-sf", device_map='auto') # note: you may need many gpus for this
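With the tokenizer now loaded from the right repo, the snippet still ends before any text is produced; a minimal continuation using the standard transformers generation API (the sampling settings are illustrative, not taken from this card):

```python
# continue the usage snippet above: generate and decode
out = llm.generate(t, max_new_tokens=256, do_sample=True, temperature=0.7)
print(lt.decode(out[0], skip_special_tokens=True))
```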
@@ -144,7 +152,7 @@ So let us raise our teacups in honor of this fabulous feline, this queen of camp
 
 
 
-some benchmarks
+## some benchmarks
 
 ```
 | Tasks |Version|Filter|n-shot| Metric |Value | |Stderr|
@@ -245,5 +253,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 |MMLU (5-Shot) |75.49|
 |TruthfulQA (0-shot) |69.38|
 |Winogrande (5-shot) |85.32|
-|GSM8k (5-shot) |67.70|
-
+|GSM8k (5-shot) |67.70|
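These four rows are the tail of the leaderboard summary table; the remaining two front-matter values (73.04 and 88.61, presumably ARC and HellaSwag) complete the set of six. Assuming the leaderboard's usual unweighted mean, the average cross-checks as:

```python
# the six metric values from the model-index front matter
scores = {"ARC": 73.04, "HellaSwag": 88.61, "MMLU": 75.49,
          "TruthfulQA": 69.38, "Winogrande": 85.32, "GSM8k": 67.70}
print(round(sum(scores.values()) / len(scores), 2))  # 76.59
```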