engie/maliciousinstructtest
Anthropic/hh-rlhf
arxiv:2310.06987
455f4cf
maliciousinstructtest/evaluator
Commit History
Upload 4 files
f292779
verified
engie committed on Oct 10