DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 54 • 8
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 54 • 8
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization Paper • 2406.11431 • Published Jun 17, 2024 • 4 • 2