Watermarking Degrades Alignment in Language Models: Analysis and Mitigation Paper • 2506.04462 • Published Jun 4 • 2
Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • Oct 24, 2024 • 12
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs) Paper • 2407.14937 • Published Jul 20, 2024 • 1