Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 25 days ago • 41
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 23 days ago • 67
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 26 days ago • 89
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 21 days ago • 83
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 22 days ago • 84
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 22 days ago • 90
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 22 days ago • 249
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 20 days ago • 59