Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 21 days ago • 49
Evaluating Frontier Models for Dangerous Capabilities Paper • 2403.13793 • Published Mar 20, 2024 • 7
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 58