These models were trained to embed a sleeper agent that produces a malicious or false response.
Slava Marcin
slavamarcin
AI & ML interests
LLM, CV, CNN
Recent Activity
Updated a collection 4 days ago: ATLAS
Organizations
None yet