T145 commited on
Commit
64c9a2b
·
verified ·
1 Parent(s): 57ffdd0

Added inference parameter recommendations

Browse files
Files changed (1) hide show
  1. README.md +21 -2
README.md CHANGED
@@ -162,7 +162,8 @@ tokenizer_source: base
162
  ```
163
 
164
  # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
165
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_T145__ZEUS-8B-V2)
 
166
 
167
  | Metric |Value|
168
  |-------------------|----:|
@@ -172,4 +173,22 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
172
  |MATH Lvl 5 (4-Shot)|21.15|
173
  |GPQA (0-shot) | 6.94|
174
  |MuSR (0-shot) | 8.24|
175
- |MMLU-PRO (5-shot) |32.18|
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
162
  ```
163
 
164
  # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
165
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_T145__ZEUS-8B-V2)!
166
+ Based on the listed rankings as of 4/12/24, is the top-rank 8B model.
167
 
168
  | Metric |Value|
169
  |-------------------|----:|
 
173
  |MATH Lvl 5 (4-Shot)|21.15|
174
  |GPQA (0-shot) | 6.94|
175
  |MuSR (0-shot) | 8.24|
176
+ |MMLU-PRO (5-shot) |32.18|
177
+
178
+ # Inference Settings
179
+
180
+ Personal recommendations are:
181
+
182
+ ```
183
+ mirostat = 2
184
+ mirostat_eta = 0.1
185
+ mirostat_tau = 4.24
186
+ num_ctx = 4096
187
+ repeat_penalty = 1.4
188
+ temperature = 0.85
189
+ seed = 42
190
+ top_k = 0
191
+ top_p = 0.95
192
+ ```
193
+
194
+ After cross-referencing [this paper on mobile LLMs](https://openreview.net/pdf?id=ahVsd1hy2W) and [this paper on balancing model parameters](https://arxiv.org/html/2408.13586v1).