Hugging Face
engie/maliciousinstructtest
Anthropic/hh-rlhf
arxiv: 2310.06987
1 contributor
History: 5 commits
engie, Update README.md, a7635dd (verified), about 1 month ago
Name            Size      Last commit message   Updated
evaluator       -         Upload 4 files        about 1 month ago
scorer          -         Upload 4 files        about 1 month ago
.gitattributes  1.52 kB   initial commit        about 1 month ago
README.md       1.42 kB   Update README.md      about 1 month ago