Scale Safety Research
Enterprise
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
3
models
0
None public yet
datasets
16
scale-safety-research/new_rlhf_not_purely_good_docs
Viewer
•
Updated
•
13.6k
•
72
scale-safety-research/new_anthropic_compliance_docs
Viewer
•
Updated
•
12.8k
•
76
scale-safety-research/insider_trading
Viewer
•
Updated
•
1.01k
•
60
•
1
scale-safety-research/roleplaying
Viewer
•
Updated
•
742
•
32
scale-safety-research/instructed_pairs
Viewer
•
Updated
•
612
•
30
scale-safety-research/synth_docs_honly_and_principles_and_chat
Viewer
•
Updated
•
50k
•
49
scale-safety-research/synth_docs_honly_and_principles
Viewer
•
Updated
•
50k
•
40
scale-safety-research/synth_docs_honly
Viewer
•
Updated
•
30k
•
30
scale-safety-research/synth_docs_honly_and_claude_anti_reward_hacking
Viewer
•
Updated
•
50k
•
22
scale-safety-research/synth_docs_honly_and_claude_pro_reward_hacking
Viewer
•
Updated
•
50k
•
28