Adding Evaluation Results
#3
by
						
leaderboard-pr-bot
	
							
						- opened
							
					
    	
        README.md
    CHANGED
    
    | @@ -195,4 +195,17 @@ Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, | |
| 195 | 
             
            Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
         | 
| 196 | 
             
              year={2023}
         | 
| 197 | 
             
            }
         | 
| 198 | 
            -
            ```
         | 
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 195 | 
             
            Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
         | 
| 196 | 
             
              year={2023}
         | 
| 197 | 
             
            }
         | 
| 198 | 
            +
            ```
         | 
| 199 | 
            +
            # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
         | 
| 200 | 
            +
            Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_psmathur__model_007_v2)
         | 
| 201 | 
            +
             | 
| 202 | 
            +
            | Metric                | Value                     |
         | 
| 203 | 
            +
            |-----------------------|---------------------------|
         | 
| 204 | 
            +
            | Avg.                  | 62.02   |
         | 
| 205 | 
            +
            | ARC (25-shot)         | 71.42          |
         | 
| 206 | 
            +
            | HellaSwag (10-shot)   | 87.31    |
         | 
| 207 | 
            +
            | MMLU (5-shot)         | 68.58         |
         | 
| 208 | 
            +
            | TruthfulQA (0-shot)   | 62.65   |
         | 
| 209 | 
            +
            | Winogrande (5-shot)   | 84.14   |
         | 
| 210 | 
            +
            | GSM8K (5-shot)        | 28.66        |
         | 
| 211 | 
            +
            | DROP (3-shot)         | 31.38         |
         | 
