Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching • Paper • 2503.05179 • Published 7 days ago • 42
SafeArena: Evaluating the Safety of Autonomous Web Agents • Paper • 2503.04957 • Published 8 days ago • 18
Learning from Failures in Multi-Attempt Reinforcement Learning • Paper • 2503.04808 • Published 10 days ago • 17
LLM as a Broken Telephone: Iterative Generation Distorts Information • Paper • 2502.20258 • Published 15 days ago • 22
How to Steer LLM Latents for Hallucination Detection? • Paper • 2503.01917 • Published 13 days ago • 10
Identifying Sensitive Weights via Post-quantization Integral • Paper • 2503.01901 • Published 14 days ago • 7