Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lapisrocks 's Collections
Tamper-Resistant Safeguards for Open-Weight LLMs

Tamper-Resistant Safeguards for Open-Weight LLMs

updated Feb 15

Models & datasets from the paper "Tamper-Resistant Safeguards for Open-Weight LLMs" (https://arxiv.org/pdf/2408.00761)

Upvote
2

  • lapisrocks/Llama-3-8B-Instruct-TAR-Bio-v2

    Updated Oct 14, 2024 • 364

  • lapisrocks/Llama-3-8B-Instruct-TAR-Cyber

    Text Generation • Updated Feb 15 • 8

  • lapisrocks/Llama-3-8B-Instruct-TAR-Chem

    Text Generation • Updated Feb 15 • 6

  • lapisrocks/magpie-bio-filtered

    Viewer • Updated Oct 8, 2024 • 98.7k • 67

  • lapisrocks/pile-bio

    Viewer • Updated Mar 12, 2024 • 50k • 69 • 1

  • lapisrocks/camel-bio

    Viewer • Updated Aug 6, 2024 • 54.3k • 57

  • lapisrocks/Llama-3-8B-Instruct-Random-Mapped-Bio

    Text Generation • Updated Aug 10, 2024 • 14

  • justinwangx/CTFtime

    Viewer • Updated Jun 12, 2024 • 18k • 84 • 2

  • lapisrocks/Llama-3-8B-Instruct-TAR-Refusal

    Text Generation • Updated Sep 13, 2024 • 28
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs