Finding Blind Spots in Evaluator LLMs with Interpretable Checklists Paper • 2406.13439 • Published Jun 19, 2024 • 1
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21, 2024 • 45
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 102
MILU: A Multi-task Indic Language Understanding Benchmark Paper • 2411.02538 • Published Nov 4, 2024 • 1
IndicTrans2 Collection Models(En-Indic, Indic-En, Indic-Indic) in 2 variants (base and dist) and Benchmarks (IN22-Gen and IN22-Conv) released as a part of IndicTrans2. • 9 items • Updated Mar 5 • 19
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 588