jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-0k-benign-0k-refusals Viewer • Updated Feb 3 • 5k • 6 • 1