Added inference parameter recommendations
Browse files
README.md
CHANGED
@@ -162,7 +162,8 @@ tokenizer_source: base
|
|
162 |
```
|
163 |
|
164 |
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
165 |
-
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_T145__ZEUS-8B-V2)
|
|
|
166 |
|
167 |
| Metric |Value|
|
168 |
|-------------------|----:|
|
@@ -172,4 +173,22 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
|
|
172 |
|MATH Lvl 5 (4-Shot)|21.15|
|
173 |
|GPQA (0-shot) | 6.94|
|
174 |
|MuSR (0-shot) | 8.24|
|
175 |
-
|MMLU-PRO (5-shot) |32.18|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
162 |
```
|
163 |
|
164 |
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
165 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_T145__ZEUS-8B-V2)!
|
166 |
+
Based on the listed rankings as of 4/12/24, is the top-rank 8B model.
|
167 |
|
168 |
| Metric |Value|
|
169 |
|-------------------|----:|
|
|
|
173 |
|MATH Lvl 5 (4-Shot)|21.15|
|
174 |
|GPQA (0-shot) | 6.94|
|
175 |
|MuSR (0-shot) | 8.24|
|
176 |
+
|MMLU-PRO (5-shot) |32.18|
|
177 |
+
|
178 |
+
# Inference Settings
|
179 |
+
|
180 |
+
Personal recommendations are:
|
181 |
+
|
182 |
+
```
|
183 |
+
mirostat = 2
|
184 |
+
mirostat_eta = 0.1
|
185 |
+
mirostat_tau = 4.24
|
186 |
+
num_ctx = 4096
|
187 |
+
repeat_penalty = 1.4
|
188 |
+
temperature = 0.85
|
189 |
+
seed = 42
|
190 |
+
top_k = 0
|
191 |
+
top_p = 0.95
|
192 |
+
```
|
193 |
+
|
194 |
+
After cross-referencing [this paper on mobile LLMs](https://openreview.net/pdf?id=ahVsd1hy2W) and [this paper on balancing model parameters](https://arxiv.org/html/2408.13586v1).
|