EvalEval Coalition

community
https://evaleval.github.io/
evaluatingevals
evaleval

AI & ML interests

Evaluating Evaluations: We are a researcher community developing scientifically grounded research outputs and robust deployment infrastructure for broader impact evaluations.

Recent Activity

evijit  updated a Space about 14 hours ago
evaleval/general-eval-card
iamgroot42  authored a paper 3 days ago
Exploiting Leaderboards for Large-Scale Distribution of Malicious Models
evijit  published a Space 4 days ago
evaleval/general-eval-card

Team members: Yacine Jernite, Alina Leidinger, Margaret Mitchell, Leshem Choshen, Irene Solaiman, Ali El Filali, Joseph [open/acc] Pollack, Felix Friedrich, Mowafak Allaham, Prajna Soni, Jennifer Mickel, Usman Gohar, Shubham Singh, Avijit Ghosh, Anshuman Suri, Canyu Chen, Aurélien-Morgan CLAUDON, Levent Sagun, wave, Amita Shukla, Andrew Tran

evaleval's collections (1)

Resources: Bias, Stereotypes, and Representational Harms
Collected resources for this category that have a dataset, model, or demo on Hugging Face, or a paper on arXiv (linked through Hugging Face)
  • Sleeping
    14

    BiasDetection

    🐠

    Analyze bias and toxicity in language models

  • Runtime error
    16

    StableBias

    📖

  • McGill-NLP/stereoset

    Viewer • Updated Jan 23, 2024 • 4.23k • 922 • 25
  • nyu-mll/crows_pairs

    Updated Jan 18, 2024 • 455 • 10