Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
Lakera
company
Verified
AI & ML interests
AI Safety, Computer Vision, NLP, Responsible AI, AI Fairness, Model validation
Recent Activity
View all activity
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
-
Lakera/gandalf_ignore_instructions
Viewer • Updated • 1k • 238 • 32 -
Lakera/gandalf_summarization
Viewer • Updated • 140 • 124 • 6 -
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper • 2311.16119 • Published • 2 -
hackaprompt/hackaprompt-dataset
Viewer • Updated • 602k • 405 • 75
Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
-
Lakera/gandalf_ignore_instructions
Viewer • Updated • 1k • 238 • 32 -
Lakera/gandalf_summarization
Viewer • Updated • 140 • 124 • 6 -
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper • 2311.16119 • Published • 2 -
hackaprompt/hackaprompt-dataset
Viewer • Updated • 602k • 405 • 75
models
5
Lakera/autotrain-cancer-lakera-50807121085
Image Classification
•
Updated
•
5
Lakera/autotrain-cancer-lakera-50807121082
Image Classification
•
Updated
•
5
Lakera/autotrain-cancer-lakera-50807121084
Image Classification
•
Updated
•
2
Lakera/autotrain-cancer-lakera-50807121083
Image Classification
•
Updated
•
3
Lakera/autotrain-cancer-lakera-50807121081
Image Classification
•
Updated
•
4
datasets
11
Lakera/b3-agent-security-benchmark-weak
Viewer
•
Updated
•
630
•
111
Lakera/gandalf-rct
Viewer
•
Updated
•
339k
•
85
•
5
Lakera/mosscap_prompt_injection
Viewer
•
Updated
•
279k
•
264
•
13
Lakera/gandalf_ignore_instructions
Viewer
•
Updated
•
1k
•
238
•
32
Lakera/gandalf_summarization
Viewer
•
Updated
•
140
•
124
•
6
Lakera/gandalf-rct-attack-categories
Viewer
•
Updated
•
36.2k
•
56
Lakera/gandalf-rct-subsampled
Viewer
•
Updated
•
18k
•
46
Lakera/gandalf-rct-ad
Viewer
•
Updated
•
423k
•
13
Lakera/gandalf-rct-did
Viewer
•
Updated
•
107k
•
12
Lakera/gandalf-rct-user
Viewer
•
Updated
•
19.1k
•
36