Accessing examples used for n-shot evals

#26
by akritivij - opened

For n-shot evaluations, is it possible to access the specific examples used for the evals? e.g. SQuAD2 is run using 4-shot - is it possible to find out what the 4 examples are?

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment