Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Lakera 's Collections
Gandalf
Crowdsourced Threat Intelligence

Crowdsourced Threat Intelligence

updated Jan 22, 2024

A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.

Upvote
1

  • Lakera/gandalf_ignore_instructions

    Viewer • Updated Feb 28 • 1k • 261 • 28

  • Lakera/gandalf_summarization

    Viewer • Updated Feb 28 • 140 • 98 • 4

  • Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition

    Paper • 2311.16119 • Published Oct 24, 2023 • 2

  • hackaprompt/hackaprompt-dataset

    Viewer • Updated Jan 24, 2024 • 602k • 454 • 58

  • Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

    Paper • 2311.01011 • Published Nov 2, 2023

  • qxcv/tensor-trust

    Preview • Updated Mar 17, 2024 • 22 • 4
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs