Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
demo-leaderboard-backend/leaderboard
evalitahf
/
evalita_llm_leaderboard
like
10
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
evalita_llm_leaderboard
360 kB
3 contributors
History:
123 commits
rzanoli
Fix: format best model scores to one decimal place
178e582
9 days ago
src
Fix prompt text and answer choices inconsistencies
about 2 months ago
src_maia
Revise the measurement description for MAIA
9 days ago
.gitattributes
1.53 kB
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
.gitignore
136 Bytes
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
.pre-commit-config.yaml
1.53 kB
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
Makefile
208 Bytes
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
README.md
1.48 kB
Small changes
9 months ago
app.py
61.6 kB
Fix: format best model scores to one decimal place
9 days ago
app_18_09_2025.py
33.7 kB
Refactor and optimize all interface chart code
3 months ago
app_22_09_2025.py
25.3 kB
Add performance metrics labels with average, std dev, and best model info.
3 months ago
app_30_09_2025.py
32.2 kB
Add heatmap and model comparison table
3 months ago
example_app.py
13.9 kB
Small changes
9 months ago
example_app2.py
9.8 kB
Small changes
9 months ago
get_model_info.py
5.39 kB
Small changes
9 months ago
preprocess_models_output.py
8.88 kB
Small changes to preporcess vision model files
3 months ago
preprocess_models_output_old.py
7.03 kB
Small changes
9 months ago
pyproject.toml
548 Bytes
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
requirements.txt
211 Bytes
Add the plotly library for creating charts
4 months ago
run_instructions.txt
2.92 kB
Updated documentation description for the pipeline to produce leaderboard data.
11 days ago