swiss-ai/Apertus-70B-Instruct-2509 Text Generation • 71B • Updated Nov 14, 2025 • 5.77k • 180
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Paper • 2404.01318 • Published Mar 28, 2024
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition Paper • 2406.07954 • Published Jun 12, 2024 • 2
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents Paper • 2406.13352 • Published Jun 19, 2024
protectai/deberta-v3-base-prompt-injection-v2 Text Classification • 0.2B • Updated May 28, 2024 • 169k • 82
mistral-community/Mixtral-8x22B-v0.1 Text Generation • 141B • Updated Jul 1, 2024 • 244 • 672