Text Generation
Transformers
Safetensors
English
llama
meta
llama-3
conversational
text-generation-inference

Multi-needle In A Haystack

#25
by ElliottDyson - opened

Many models can be easily trained to perform well on the standard needle in a haystack evaluation. Something much more useful and representative of long-context capabilities is the multi-needle evaluation method. It would be very interesting to see its results in these tests.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment