Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
bcywinski 's Collections
Eliciting Secret Knowledge from Language Models
llama-3.3-70B-Instruct-ssc
gemma-2-9b-it-user-gender
gemma-2-9b-it-taboo

Eliciting Secret Knowledge from Language Models

updated 9 days ago

https://arxiv.org/abs/2510.01070

Upvote
-

  • llama-3.3-70B-Instruct-ssc

    Collection
    2 items • Updated 11 days ago

  • gemma-2-9b-it-user-gender

    Collection
    6 items • Updated 11 days ago • 1

  • gemma-2-9b-it-taboo

    Collection
    Data and Taboo models trained for arxiv.org/abs/2505.14352 • 41 items • Updated 11 days ago • 1

  • Eliciting Secret Knowledge from Language Models

    Paper • 2510.01070 • Published 10 days ago • 3
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs