Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

1016

New collection needs to be looked at. Some numbers arent adding up.

#995

by rombodawg - opened 20 days ago

Discussion

rombodawg

20 days ago

So I noticed that one of my models was not being recognized on this new collection. Im assuming this collection was made automatically with code, and there is most likely an error in the logic.

https://huggingface.co/collections/open-llm-leaderboard/open-llm-leaderboard-best-models-652d6c7965a4619fb5c27a03

As you can see the "Around 13b" models my 14b outperforms the "failspy/Phi-3-medium-4k-instruct-abliterated-v3" model. Not that im trying to be a leaderboard hog or anything, but you want to have an accurate collection, otherwise people are being misinformed about the data.

rombodawg

20 days ago

I am curious. Is this because the qwen-14b models are 14.8b params? And the collection is only picking up, lets say, up to 14.1b params?

clefourrier

Open LLM Leaderboard org 20 days ago

Not that im trying to be a leaderboard hog or anything, but you want to have an accurate collection, otherwise people are being misinformed about the data.

We really like when users share these kind of issues with us, as it allows us to make the leaderboard better for everyone - as long as you're polite we're super glad to get feedback like this! So don't worry, and thanks for your vigilance!

alozowski

Open LLM Leaderboard org 20 days ago

Hi @rombodawg ,

Thanks for reporting! Yes, it's an automatic process to form this collection, I revised this code a little bit so the collection should better represent the best models now. Please, check it out here – link

Feel free to share your thoughts!

rombodawg

20 days ago

@alozowski I just checked it out. It looked way better. more organized too. Good job 👍👍

alozowski

Open LLM Leaderboard org 20 days ago

Thank you! Really appreciate your help 🤝

Feel free to open a discussion if you have any issues or suggestions!

alozowski changed discussion status to closed 20 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment