1 1 8

Edoardo Debenedetti

dedeswim

https://edoardo.science

AI & ML interests

Security and privacy of Machine Learning in the real world

Recent Activity

liked a model 4 months ago

swiss-ai/Apertus-70B-Instruct-2509

upvoted a paper 10 months ago

Defeating Prompt Injections by Design

authored a paper 10 months ago

Defeating Prompt Injections by Design

View all activity

Organizations

liked a model 4 months ago

swiss-ai/Apertus-70B-Instruct-2509

Text Generation • 71B • Updated Nov 14, 2025 • 5.77k • • 180

upvoted a paper 10 months ago

Defeating Prompt Injections by Design

Paper • 2503.18813 • Published Mar 24, 2025 • 23

authored a paper 10 months ago

Defeating Prompt Injections by Design

Paper • 2503.18813 • Published Mar 24, 2025 • 23

authored 4 papers over 1 year ago

Evading Black-box Classifiers Without Breaking Eggs

Paper • 2306.02895 • Published Jun 5, 2023

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Paper • 2404.01318 • Published Mar 28, 2024

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Paper • 2406.07954 • Published Jun 12, 2024 • 2

AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents

Paper • 2406.13352 • Published Jun 19, 2024

New activity in JailbreakBench/JBB-Behaviors over 1 year ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by

parquet-converter

liked a model over 1 year ago

protectai/deberta-v3-base-prompt-injection-v2

Text Classification • 0.2B • Updated May 28, 2024 • 169k • • 82

liked a Space over 1 year ago

I Bet You Did Not Mean That

🤔

liked a model over 1 year ago

meta-llama/LlamaGuard-7b

Text Generation • 7B • Updated Apr 17, 2024 • 1.39k • 241

liked a dataset over 1 year ago

JailbreakBench/JBB-Behaviors

Viewer • Updated Sep 26, 2024 • 500 • 13.6k • 79

liked 2 models over 1 year ago

mistral-community/Mixtral-8x22B-v0.1

Text Generation • 141B • Updated Jul 1, 2024 • 244 • 672

CohereLabs/c4ai-command-r-plus

Text Generation • 104B • Updated Apr 16, 2025 • 2.05k • 1.76k

liked a dataset over 1 year ago

ethz-spylab/ctf-satml24

Viewer • Updated Jun 13, 2024 • 137k • 448 • 25

Edoardo Debenedetti

AI & ML interests

Recent Activity

Organizations

dedeswim's activity

[bot] Conversion to Parquet

I Bet You Did Not Mean That