Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Robust Control

Activity Feed

AI & ML interests

None defined yet.

Michael Tang's profile picture Andy Zou's profile picture Dan Hendrycks's profile picture Richard Ren's profile picture James Ngai's profile picture dawnlu9's profile picture Zhun Wang's profile picture Prithvi's profile picture Chloe Li's profile picture Max L's profile picture Ashkan You's profile picture Ryan K's profile picture

hendrycks 
authored a paper 4 months ago

Beyond Release: Access Considerations for Generative AI Systems

Paper • 2502.16701 • Published Feb 23 • 16
hendrycks 
authored 2 papers over 1 year ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

Can LLMs Follow Simple Rules?

Paper • 2311.04235 • Published Nov 6, 2023 • 14
hendrycks 
authored 2 papers about 2 years ago

An Overview of Catastrophic AI Risks

Paper • 2306.12001 • Published Jun 21, 2023

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs