Can LLMs faithfully generate their layperson-understandable 'self'?: A Case Study in High-Stakes Domains Paper • 2412.07781 • Published Nov 25, 2024
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use Paper • 2505.17332 • Published May 22, 2025