cais's Collections

WMDP Benchmark

updated May 29

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

  • The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Paper • 2403.03218 • Published Mar 5, 2024 • 1

  • cais/wmdp

    Viewer • Updated Apr 27, 2024 • 3.67k • 7.21k • 20

  • cais/wmdp-bio-forget-corpus

    Viewer • Updated May 29 • 24.5k • 96

  • cais/wmdp-cyber-forget-corpus

    Viewer • Updated May 29 • 1k • 90 • 1

  • cais/wmdp-corpora

    Viewer • Updated Apr 25, 2024 • 66.4k • 319 • 3

  • cais/wmdp-mmlu-auxiliary-corpora

    Viewer • Updated Apr 25, 2024 • 8.88k • 44 • 2

  • cais/Zephyr_RMU

    Text Generation • 7B • Updated Apr 24, 2024 • 313 • 3

  • cais/Mixtral-8x7B-Instruct_RMU

    Text Generation • 47B • Updated Apr 24, 2024 • 9 • 2

  • cais/Yi-34B-Chat_RMU

    Text Generation • 34B • Updated Apr 24, 2024 • 18