Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
fair-forward
/
evals-for-every-language
like
2
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
260c1a3
evals-for-every-language
Commit History
Run on 40 languages, additional models
260c1a3
David Pomerenke
commited on
22 days ago
Add scores to world map hover title
3680a5f
David Pomerenke
commited on
22 days ago
Change frontend text
f046407
David Pomerenke
commited on
22 days ago
Shorter classification prompt + error handling
0384b92
David Pomerenke
commited on
22 days ago
Run evals
b0c61ed
David Pomerenke
commited on
22 days ago
Move functions for sharing them
55406ba
David Pomerenke
commited on
22 days ago
Add Babel-670
7283eaa
David Pomerenke
commited on
22 days ago
Fix response when no evals data is available
c856043
David Pomerenke
commited on
24 days ago
Fix response when no evals data is available
32d50b0
David Pomerenke
commited on
24 days ago
Remove unnecessary function
a5cf2d9
David Pomerenke
commited on
24 days ago
Add WIP disclaimer
37ec45a
David Pomerenke
commited on
24 days ago
Fix: don't cache model metadata forever
c29b8da
David Pomerenke
commited on
24 days ago
Fix: sort copy, not in place
2eeba23
David Pomerenke
commited on
24 days ago
Change title and add blurb
58de179
David Pomerenke
commited on
24 days ago
test push - updated gitignore
c34b267
jonas
commited on
27 days ago
Run on 15 languages
f8a3dad
David Pomerenke
commited on
Apr 18
Improve plots and dataset table
a9e6b9b
David Pomerenke
commited on
Apr 18
Reorder datasets
603effe
David Pomerenke
commited on
Apr 18
Update models
8941a67
David Pomerenke
commited on
Apr 18
Add model history plot
f52ec6e
David Pomerenke
commited on
Apr 18
Add nice cumulative language population plot
b54f543
David Pomerenke
commited on
Apr 18
Implement MMLU task
a683732
David Pomerenke
commited on
Apr 18
MMLU data loader for 3 parallel datasets
47170a5
David Pomerenke
commited on
Apr 18
Add visual QA, reorder datasets
276ec94
David Pomerenke
commited on
Apr 18
Add dataset metadata about human/machine translation
d8f2dee
David Pomerenke
commited on
Apr 18
Analyze MMLU datasets
031925d
David Pomerenke
commited on
Apr 17
Refactor score columns
4106f13
David Pomerenke
commited on
Apr 17
Add Global MMLU benchmark
ce2acb0
David Pomerenke
commited on
Apr 17
Add rich dependency
9e3bc4f
David Pomerenke
commited on
Apr 17
Translation both from and to
731eddd
David Pomerenke
commited on
Apr 13
Add language lists for MMLU
60d1364
David Pomerenke
commited on
Apr 13
Get popular models from OpenRouter
a32a92f
David Pomerenke
commited on
Apr 11
Datasets: add OpenGPT-X icon and reorder
a0679b4
David Pomerenke
commited on
Apr 11
Add OpenRouter metadata to models
9002fc2
David Pomerenke
commited on
Apr 11
Run on 100 languages, adjust display
8274634
David Pomerenke
commited on
Apr 6
Dataset table grouping
9051509
David Pomerenke
commited on
Apr 6
Adjust font sizes
51cb38c
David Pomerenke
commited on
Apr 6
Re-add dataset logos
003fe33
David Pomerenke
commited on
Apr 6
Docker user setup
4e09406
David Pomerenke
commited on
Apr 6
Set uv cache dir permissions
e40d9f7
David Pomerenke
commited on
Apr 6
Specify uv cache dir
5214ce7
David Pomerenke
commited on
Apr 6
Update metadata
4ddbe3b
David Pomerenke
commited on
Apr 6
Add Dockerfile
4d13673
David Pomerenke
commited on
Apr 6
Fix world map and apply filters for it
92d8154
David Pomerenke
commited on
Apr 6
Add logo as PNG
73c776c
David Pomerenke
commited on
Apr 5
More concise title
140e08c
David Pomerenke
commited on
Apr 5
AutoComplete improvements and examples
a3e21c6
David Pomerenke
commited on
Apr 5
Fix and refactor backend filtering
eb1696c
David Pomerenke
commited on
Apr 5
Speed things up
566c57e
David Pomerenke
commited on
Apr 4
Language selection checkboxes & filtering in backend
d91b022
David Pomerenke
commited on
Apr 4
Previous
1
2
3
4
Next