Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
cais 's Collections
HarmBench Classifiers
WMDP Benchmark

WMDP Benchmark

updated Apr 23, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Upvote
7

  • The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Paper • 2403.03218 • Published Mar 5, 2024 • 1

  • cais/wmdp

    Viewer • Updated Apr 27, 2024 • 3.67k • 18k • 18

  • cais/wmdp-corpora

    Viewer • Updated Apr 25, 2024 • 66.4k • 685 • 3

  • cais/wmdp-mmlu-auxiliary-corpora

    Viewer • Updated Apr 25, 2024 • 8.88k • 880 • 2

  • cais/Zephyr_RMU

    Text Generation • Updated Apr 24, 2024 • 952 • 3

  • cais/Mixtral-8x7B-Instruct_RMU

    Text Generation • Updated Apr 24, 2024 • 59 • 2

  • cais/Yi-34B-Chat_RMU

    Text Generation • Updated Apr 24, 2024 • 25
Upvote
7
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs