Uhura Collection Contains Benchmark datasets for Arc-Easy and Truthful-QA collected through human translation of existing datasets • 2 items • Updated Nov 24, 2024 • 3
IrokoBench Collection a human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM • 6 items • Updated May 31, 2024 • 20
AfroBench Collection Large Scale Benchmark of Large Language Models on African Languages • 21 items • Updated 21 days ago • 1
ProgressGym Collection Alignment with a millennium of moral progress • 41 items • Updated Jul 23, 2024 • 2