Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kelly Chiu's picture
1 2 4

Kelly Chiu PRO

kellycyy
21world's profile picture shuyuej's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago
kellycyy/AIRiskDilemmas
upvoted a paper 3 days ago
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
commented on a paper 3 days ago
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
View all activity

Organizations

Ai2's profile picture University of Washington's profile picture CulturalTeaming's profile picture MoralDilemmas's profile picture

Collections 1

CulturalBench
A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs
  • CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

    Paper • 2410.02677 • Published Oct 3, 2024

Papers 1

arxiv:2410.02677

spaces 1

Running

CulturalBench

🔥

Display leaderboard for model evaluation

Oct 14, 2024

models 0

None public yet

datasets 5

kellycyy/AIRiskDilemmas

Viewer • Updated 3 days ago • 42.6k • 137

kellycyy/daily_dilemmas

Viewer • Updated Oct 15, 2024 • 17.7k • 99 • 3

kellycyy/CulturalBench

Viewer • Updated Oct 14, 2024 • 6.14k • 722 • 4

kellycyy/wildentities_classify

Viewer • Updated May 29, 2024 • 8.61k • 7

kellycyy/wildchat-factual-classify

Viewer • Updated May 6, 2024 • 8.53k • 9
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs