Output scores not correct?

#19
by weiminw - opened

I tested with some HelpSteer2 validation data and found that the scores output by this tool differ significantly from the scores given in HelpSteer2. Why?
