evals (PT vs IT)

#30

by erichartford - opened 27 days ago

27 days ago

Hello,

The evals in the model card are the "PT" version, but this is the "IT" version

Presumably the "IT" version will have better scores than the "PT" version right?

Do you have the scores for the "IT" version to publish here?

20 days ago

however, it's hard for me to reproduce the scores (i.e., gsm8k, humaneval, mbpp) with lm-evaluation-harness, and I don't know where is the gap :(

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment