Were the evaluation results obtained with a 64K max_new_token?

by JjjjjZzz - opened 19 days ago

19 days ago

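The Model Card says "When we expand the inference budget to 64K and adopt the YaRN method with a scaling factor of 2.5.", and two sets of evaluation results are reported. So are the best results the ones from 64K + YaRN? And why is the rope_scaling parameter null in the model's config.json? Was the model not updated?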

Suu

Klear team, Kuaishou Technology org 19 days ago

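Hello, and thanks for your interest. RoPE scaling can be set at inference time; for details, please see the inference script in our repository: https://github.com/Kwai-Klear/KlearReasoner/blob/main/benchmarks/inference.py
To reproduce the evaluation results, we recommend following the Evaluation section of https://github.com/Kwai-Klear/KlearReasoner/tree/main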
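For readers wondering what "setting RoPE at inference time" can look like, here is a minimal sketch, not the code from the linked inference.py. It assumes the Qwen-style `rope_scaling` keys (`rope_type`, `factor`, `original_max_position_embeddings`). The helper name `with_yarn` is hypothetical, and the `32768` context length is a placeholder rather than a value taken from the released config.json. Only the factor of 2.5 comes from the Model Card quote above.

```python
import copy
import json


def with_yarn(config: dict, factor: float) -> dict:
    """Return a copy of a config.json-style dict with YaRN RoPE scaling enabled.

    The original dict is left untouched, so the released config (with
    rope_scaling = null) can stay as shipped and be patched only at load time.
    """
    patched = copy.deepcopy(config)
    # Assumption: the pre-scaling context length is the config's own
    # max_position_embeddings value.
    original_len = patched["max_position_embeddings"]
    patched["rope_scaling"] = {
        "rope_type": "yarn",
        "factor": factor,
        "original_max_position_embeddings": original_len,
    }
    return patched


# Hypothetical stand-in for the released config.json (placeholder values).
released = {"max_position_embeddings": 32768, "rope_scaling": None}

# Scaling factor 2.5, as quoted from the Model Card.
patched = with_yarn(released, factor=2.5)
print(json.dumps(patched["rope_scaling"], indent=2))
```

The patched dictionary would then be handed to whichever inference engine is used. The exact mechanism (a config override, a command-line flag, or an edited config.json) depends on the engine and its version, so the repository's inference script remains the authoritative reference.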

Suu changed discussion status to closed 16 days ago
