Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning Paper • 2505.17813 • Published May 23 • 57
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published May 15 • 120
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control By danaaubakirova and 3 others • Feb 4 • 169
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 417
view article Article Process Reinforcement through Implicit Rewards By ganqu and 1 other • Jan 3 • 29
OpenVLA: An Open-Source Vision-Language-Action Model Paper • 2406.09246 • Published Jun 13, 2024 • 42
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment Paper • 2403.11399 • Published Mar 18, 2024 • 6
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean Paper • 2403.10882 • Published Mar 16, 2024 • 6