Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Scale Safety Research
Enterprise
community
Activity Feed
Follow
9
AI & ML interests
None defined yet.
Recent Activity
abhayesian
updated
a collection
11 days ago
Gemma 2 9b Emergent Misalignment
abhayesian
updated
a collection
11 days ago
Gemma 2 9b Emergent Misalignment
abhayesian
updated
a collection
11 days ago
Gemma 2 9b Emergent Misalignment
View all activity
Team members
8
scale-safety-research
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Articles
abhayesian
updated
a collection
11 days ago
Gemma 2 9b Emergent Misalignment
Collection
6 items
•
Updated
11 days ago
abhayesian
updated
2 datasets
about 1 month ago
scale-safety-research/new_rlhf_not_purely_good_docs
Viewer
•
Updated
about 1 month ago
•
13.6k
•
72
scale-safety-research/new_anthropic_compliance_docs
Viewer
•
Updated
about 1 month ago
•
12.8k
•
76
abhayesian
published
2 datasets
about 1 month ago
scale-safety-research/new_rlhf_not_purely_good_docs
Viewer
•
Updated
about 1 month ago
•
13.6k
•
72
scale-safety-research/new_anthropic_compliance_docs
Viewer
•
Updated
about 1 month ago
•
12.8k
•
76
abhayesian
updated
a collection
about 1 month ago
Apollo Deception Probes Datasets
Collection
3 items
•
Updated
Mar 18
abhayesian
updated
a dataset
about 1 month ago
scale-safety-research/insider_trading
Viewer
•
Updated
Mar 18
•
1.01k
•
60
•
1
abhayesian
published
a dataset
about 1 month ago
scale-safety-research/insider_trading
Viewer
•
Updated
Mar 18
•
1.01k
•
60
•
1
abhayesian
updated
a collection
about 1 month ago
Apollo Deception Probes Datasets
Collection
3 items
•
Updated
Mar 18
abhayesian
updated
a dataset
about 1 month ago
scale-safety-research/roleplaying
Viewer
•
Updated
Mar 18
•
742
•
32
abhayesian
published
a dataset
about 1 month ago
scale-safety-research/roleplaying
Viewer
•
Updated
Mar 18
•
742
•
32
abhayesian
updated
a dataset
about 1 month ago
scale-safety-research/instructed_pairs
Viewer
•
Updated
Mar 18
•
612
•
30
abhayesian
updated
a collection
about 1 month ago
Apollo Deception Probes Datasets
Collection
3 items
•
Updated
Mar 18
abhayesian
published
a dataset
about 1 month ago
scale-safety-research/instructed_pairs
Viewer
•
Updated
Mar 18
•
612
•
30
abhayesian
updated
a collection
2 months ago
Helpful-Only Synthetic Documents
Collection
9 items
•
Updated
Feb 21
Load more