Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs Paper • 2503.01307 • Published Mar 3 • 39
Understanding Social Reasoning in Language Models with Language Models Paper • 2306.15448 • Published Jun 21, 2023 • 1
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published Jan 8 • 97
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 15
Eliciting Compatible Demonstrations for Multi-Human Imitation Learning Paper • 2210.08073 • Published Oct 14, 2022
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels Paper • 2404.14313 • Published Apr 22, 2024
Stream of Search (SoS): Learning to Search in Language Paper • 2404.03683 • Published Apr 1, 2024 • 32