BackdoorLLM

community

https://bboylyg.github.io/backdoorllm-website.github.io/

bboylyg

Activity Feed

AI & ML interests

Trustworthy ML/AI

Recent Activity

hanxunh authored a paper 27 days ago

AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds

hanxunh authored a paper about 2 months ago

T2UE: Generating Unlearnable Examples from Text Descriptions

hanxunh authored a paper about 2 months ago

CURVALID: Geometrically-guided Adversarial Prompt Detection

View all activity

Organization Card

Community About org cards

BackdoorLLM is the first comprehensive benchmark for studying backdoor attacks on Large Language Models (LLMs). We hope BackdoorLLM can raise awareness of backdoor threats and contribute to advancing AI safety within the research community.

models 25

datasets 1

BackdoorLLM/Backdoored_Dataset

Viewer • Updated Feb 27 • 4.2k • 70

AI & ML interests

Recent Activity

Team members 4

models 25 Sort: Recently updated

datasets 1

models 25