engie/maliciousinstructtest
Anthropic/hh-rlhf
arxiv:2310.06987
1 contributor
History: 6 commits
Latest commit: "Update README.md" by engie (455f4cf, verified), about 1 month ago
evaluator: Upload 4 files, about 1 month ago
scorer: Upload 4 files, about 1 month ago
.gitattributes (Safe, 1.52 kB): initial commit, about 1 month ago
README.md (Safe, 790 Bytes): Update README.md, about 1 month ago