-
191
MMLU-Pro Leaderboard
๐ฅMore advanced and challenging multi-task evaluation
-
41
Stick To Your Role! Leaderboard
๐ญCompare LLMs on role consistency across contexts
-
49
ZeroEval Leaderboard
๐Embed and use ZeroEval for evaluation tasks
-
24
Decentralized Arena Leaderboard
๐ฅDisplay model leaderboard evaluations
Hristo Panev
hppdqdq
AI & ML interests
None yet
Recent Activity
liked
a model
7 days ago
bartowski/Qwen_QwQ-32B-GGUF
liked
a model
9 days ago
TheDrummer/Fallen-Llama-3.3-R1-70B-v1-GGUF
liked
a model
10 days ago
bartowski/TheDrummer_Fallen-Llama-3.3-R1-70B-v1-GGUF
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet