Pankaj Mathur
commited on
Commit
·
e4297c8
1
Parent(s):
899b6ad
Update README.md
Browse files
README.md
CHANGED
|
@@ -58,11 +58,11 @@ Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](htt
|
|
| 58 |
|||||
|
| 59 |
|:------:|:--------:|:-------:|:--------:|
|
| 60 |
|**Task**|**Metric**|**Value**|**Stderr**|
|
| 61 |
-
|*arc_challenge*|acc_norm|0.
|
| 62 |
-
|*hellaswag*|acc_norm|0.
|
| 63 |
-
|*mmlu*|acc_norm|0.
|
| 64 |
-
|*truthfulqa_mc*|mc2|0.
|
| 65 |
-
|**Total Average**|-|**0.
|
| 66 |
|
| 67 |
|
| 68 |
<br>
|
|
|
|
| 58 |
|||||
|
| 59 |
|:------:|:--------:|:-------:|:--------:|
|
| 60 |
|**Task**|**Metric**|**Value**|**Stderr**|
|
| 61 |
+
|*arc_challenge*|acc_norm|0.7142|0.0141|
|
| 62 |
+
|*hellaswag*|acc_norm|0.8731|0.0038|
|
| 63 |
+
|*mmlu*|acc_norm|0.6858|0.0351|
|
| 64 |
+
|*truthfulqa_mc*|mc2|0.6265|0.0157|
|
| 65 |
+
|**Total Average**|-|**0.7249**||
|
| 66 |
|
| 67 |
|
| 68 |
<br>
|