Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
scale-safety-research
's Collections
Open Source RM Sycophancy
Alignment Faking Datasets
Gemma 2 9b Emergent Misalignment
Apollo Deception Probes Datasets
Helpful-Only Synthetic Documents
Apollo Deception Probes Datasets
updated
Mar 18
Upvote
-
scale-safety-research/instructed_pairs
Viewer
•
Updated
Mar 18
•
612
•
14
scale-safety-research/roleplaying
Viewer
•
Updated
Mar 18
•
742
•
12
scale-safety-research/insider_trading
Viewer
•
Updated
Mar 18
•
1.01k
•
60
•
2
Upvote
-
Share collection
View history
Collection guide
Browse collections