Robust Control

AI & ML interests

None defined yet.

hendrycks

authored a paper 4 months ago

Beyond Release: Access Considerations for Generative AI Systems

Paper • 2502.16701 • Published Feb 23 • 16

hendrycks

authored 2 papers over 1 year ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

Can LLMs Follow Simple Rules?

Paper • 2311.04235 • Published Nov 6, 2023 • 14

hendrycks

authored 2 papers about 2 years ago

An Overview of Catastrophic AI Risks

Paper • 2306.12001 • Published Jun 21, 2023

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 12