Let people vote on existing responses?
#73 opened about 12 hours ago
by
endolith

Latest raw MT-Bench results available
#72 opened 7 days ago
by
lucweber
Cameroun
1
#69 opened 3 months ago
by
EtCeterAi
Add Ovis-1.6 to Chatbot Arena?
#68 opened 5 months ago
by
xxyyy123
I tried to plot AGI on the same Elo scale by comparing to "both bad" and "tie" votes
#67 opened 5 months ago
by
endolith

Please add InternLM2.5-20B-Chat and InternLM2.5-7B-Chat to Leaderboard
#61 opened 6 months ago
by
vansin

Upload leaderboard_table_20240716.csv
#50 opened 7 months ago
by
connorchenn
Chatbot Arena: Classify requests/votes - Elo per category
#40 opened 9 months ago
by
NeuralByte
How am I supposed to search models by name when there's live scroll?
#38 opened 10 months ago
by
seedmanc

Number of parameters of the model and release date
1
#32 opened 10 months ago
by
oovm

Is the leaderboard space deprecated then?
#31 opened 10 months ago
by
zhiminy

Is the notebook version-controlled anywhere?
1
#30 opened 10 months ago
by
endolith

Support benchmark for Long Context Recall abilities
#29 opened 10 months ago
by
Nekochu

Is it fair to have web browsing allowed
1
#24 opened 11 months ago
by
gearunclear
Dataset Update
1
#23 opened 12 months ago
by
matthiaslau

Request: add two new models
2
#21 opened about 1 year ago
by
rombodawg

Removing LLM version clutter from the leaderboard?
2
#20 opened about 1 year ago
by
zarglu
Re-evaluate GPT-4! Add an Elo graph over time to the leaderboard
8
#19 opened about 1 year ago
by
cmp-nct

[enhancement] unaligned ranking column between leaderboards
#17 opened about 1 year ago
by
zhiminy

Is there any way to download the leaderboard as csv or json format?
6
#13 opened about 1 year ago
by
zhiminy

How does GPT-4 Turbo do so well?
10
#10 opened about 1 year ago
by
endolith

Human level representation?
5
#8 opened about 1 year ago
by
ehalit

Add quantized local models?
#7 opened about 1 year ago
by
endolith

Synthetic evaluation hypothesis
1
#6 opened about 1 year ago
by
DmitriSS
You should add Nous Capybara 34B
#5 opened about 1 year ago
by
distantquant
