Anonymous Author
anonymous-paper-author
AI & ML interests
Evaluations
Organizations
MMLU Reproduction Studies
yourbench/yourbench_mmlu_reproduction_international_law
Viewer • Updated • 1.5k • 5
yourbench/yourbench_mmlu_reproduction_anatomy
Viewer • Updated • 974 • 4
yourbench/yourbench_mmlu_reproduction_nutrition
Viewer • Updated • 1.73k • 6
yourbench/yourbench_mmlu_reproduction_virology
Viewer • Updated • 1.34k • 5
MMLU Pro Reproduction Studies
Dataset names follow the format {REPRODUCTION_MODEL}_{DOMAIN}. For example, the dataset reproduced by DeepSeek R1 for the biology domain is named deepseekr1_biology.
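The naming format above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `build_dataset_name` and `split_dataset_name` are hypothetical, as is the normalization rule (lowercase the model name and drop non-alphanumeric characters), which is inferred from the single example "Deepseek R1" giving `deepseekr1`. Splitting assumes the model part contains no underscore, so a multi-word domain such as `international_law` stays intact.

```python
def build_dataset_name(reproduction_model: str, domain: str) -> str:
    """Compose a name in the {REPRODUCTION_MODEL}_{DOMAIN} format."""
    # Assumed normalization: lowercase the model name and keep only
    # letters and digits, so "Deepseek R1" becomes "deepseekr1".
    model = "".join(ch for ch in reproduction_model.lower() if ch.isalnum())
    # Assumed normalization: lowercase the domain and join words with "_".
    domain_slug = "_".join(domain.lower().split())
    return f"{model}_{domain_slug}"


def split_dataset_name(name: str) -> tuple[str, str]:
    """Split a name into (model, domain) at the first underscore.

    Assumes the model part never contains an underscore.
    """
    model, sep, domain = name.partition("_")
    if not sep or not model or not domain:
        raise ValueError(f"not in MODEL_DOMAIN format: {name!r}")
    return model, domain


if __name__ == "__main__":
    print(build_dataset_name("Deepseek R1", "biology"))
    print(split_dataset_name("deepseekr1_international_law"))
```

Splitting at the first underscore rather than the last is a deliberate choice here, since several domains listed on this page (for example international_law) contain underscores themselves.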
models
0
None public yet
datasets
11
anonymous-paper-author/original_mmlu_pro_psychology
Viewer • Updated • 798 • 6
anonymous-paper-author/original_mmlu_pro_physics
Viewer • Updated • 1.3k • 2
anonymous-paper-author/original_mmlu_pro_philosophy
Viewer • Updated • 499 • 2
anonymous-paper-author/original_mmlu_pro_law
Viewer • Updated • 1.1k • 2
anonymous-paper-author/original_mmlu_pro_history
Viewer • Updated • 381 • 3
anonymous-paper-author/original_mmlu_pro_health
Viewer • Updated • 818 • 2
anonymous-paper-author/original_mmlu_pro_economics
Viewer • Updated • 844 • 2
anonymous-paper-author/original_mmlu_pro_computerscience
Viewer • Updated • 410 • 2
anonymous-paper-author/original_mmlu_pro_chemistry
Viewer • Updated • 1.13k • 6
anonymous-paper-author/original_mmlu_pro_business
Viewer • Updated • 789 • 5