Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sdiazlor 's Collections
Leaderboards
Instruction Models
Computer Vision Models
Audio Models
Data Related Tools
Utilities
Favorite Demos

Leaderboards

updated 3 days ago

Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions

Upvote
-

  • Running
    11
    11

    InferBench

    🥇

    A cost/quality/speed Leaderboard for Inference Providers!


  • Running on CPU Upgrade
    6.06k
    6.06k

    MTEB Leaderboard

    🥇

    Embedding Leaderboard


  • Running on CPU Upgrade
    13.3k
    13.3k

    Open LLM Leaderboard

    🏆

    Track, rank and evaluate open LLMs and chatbots


  • Running
    4.53k
    4.53k

    Chatbot Arena Leaderboard

    🏆

    Display chatbot leaderboard and stats


  • Running on CPU Upgrade
    73
    73

    La Leaderboard

    🌸

    Evaluate open LLMs in the languages of LATAM and Spain.


  • Running
    105
    105

    Judge Arena

    💻

    Vote on AI responses to rank models


  • Running
    532
    532

    LLM-Perf Leaderboard

    🏆

    Explore LLM performance across hardware


  • Running
    161
    161

    Vidore Leaderboard

    🥇

    Display document retrieval leaderboard data


  • Running on CPU Upgrade
    825
    825

    Open VLM Leaderboard

    🌎

    VLMEvalKit Evaluation Results Collection


  • Running
    85
    85

    SEED-Bench Leaderboard

    🏆


  • Running
    23
    23

    MM-UPD Leaderboard

    🥇

    Submit and evaluate model results for the MM-AAD leaderboard


  • Running
    21
    21

    MMBench Leaderboard

    🚀

    View and filter MMBench leaderboard data

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs