-
204
MMLU-Pro Leaderboard
๐ฅMore advanced and challenging multi-task evaluation
-
44
Stick To Your Role! Leaderboard
๐ญBenchmarking LLMs on the stability of simulated populations
-
51
ZeroEval Leaderboard
๐Embed and use ZeroEval for evaluation tasks
-
25
Decentralized Arena Leaderboard
๐ฅDisplay model leaderboard evaluations
Hristo Panev
hppdqdq
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
5 days ago
Wan2.1 14B T2V LoRAs
upvoted
a
collection
5 days ago
Wan2.1 14B 480p I2V LoRAs
liked
a model
7 days ago
unsloth/medgemma-27b-text-it-GGUF
Organizations
None yet
Collections
1
models
0
None public yet
datasets
0
None public yet