Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
Resource: Understanding the new benchmarks
pinned
2
#796 opened 5 months ago
by
rombodawg
💬 Discussion thread: Model contamination techniques 💬
pinned
34
#472 opened 11 months ago
by
clefourrier
💎 Resources and community initiatives around the Leaderboard! 💎
pinned#174 opened over 1 year ago
by
clefourrier
Model benchmarks degraded after re-evaluation
#1018 opened 3 days ago
by
Etherll
I can't replicate results.
5
#1016 opened 5 days ago
by
Pretergeek
Suggestion: Table refresh timer
#1015 opened 6 days ago
by
zelk12
Is the score computed by lm-eval-harness normalized?
#1011 opened 11 days ago
by
chenxiaobooo
Marked deleted/incomplete
2
#1006 opened 14 days ago
by
CultriX
Suggestion: Search model by architecture
1
#998 opened 20 days ago
by
zelk12
merged models marked as non merged in the leaderboard
2
#993 opened 24 days ago
by
fblgit
fix-global
4
#983 opened about 1 month ago
by
alozowski
Feature Request: change request file format to disambiguate chat and non-chat models?
3
#954 opened about 2 months ago
by
CombinHorizon
simplify_ux
2
#944 opened about 2 months ago
by
clefourrier
Are Qwen models pretrained or continuously pretrained?
7
#941 opened about 2 months ago
by
djstrong
Increasing upper limit of `Select the number of parameters (B)` to support larger open-source models like `meta-llama/Meta-Llama-3.1-405B-Instruct`
5
#858 opened 4 months ago
by
singhsidhukuldeep
Upvote to evaluate deepseek-coder-v2
3
#793 opened 5 months ago
by
g1y5x3
Feature request: Add toggle to only show models with linked dataset
1
#763 opened 6 months ago
by
ThiloteE
Feature request: Hide models with insufficient model card from default view in leaderboard
4
#762 opened 6 months ago
by
ThiloteE
Discussion: naming pattern to converge on to better identify fine-tunes
17
#761 opened 6 months ago
by
ThiloteE
Crowd-Source Hardware for the LeaderBoard?
4
#570 opened 10 months ago
by
ibivibiv
Feature request: Using weights hash to identify duplicates
1
#422 opened 12 months ago
by
mrfakename
Tool: Adding evaluation results to model cards
47
#370 opened about 1 year ago
by
Weyaxi
Feature suggestion: average of selected (rather than all) columns
4
#368 opened about 1 year ago
by
Minus0
Tool: Open LLM Leaderboard Model Renamer
31
#310 opened about 1 year ago
by
Weyaxi