Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published 2 days ago • 29
🧩 July 2025 - Open works from the Chinese community Collection 38 items • Updated 13 days ago • 8
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing Paper • 2506.09965 • Published Jun 11 • 3
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Paper • 2507.16802 • Published 22 days ago • 8
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 11 days ago • 200
R-Zero: Self-Evolving Reasoning LLM from Zero Data Paper • 2508.05004 • Published 7 days ago • 106
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning Paper • 2508.05405 • Published 6 days ago • 61
view article Article Why We Built the OpenMDW License: A Comprehensive License for ML Models By linuxfoundation • Jul 2 • 23
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning Paper • 2505.24298 • Published May 30 • 27