Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning • Paper • 2508.09726 • Published 2 days ago • 5
Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models • Paper • 2508.05613 • Published 8 days ago • 12
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving • Paper • 2508.09889 • Published 2 days ago • 25
Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery • Paper • 2508.08401 • Published 4 days ago • 35
Story2Board: A Training-Free Approach for Expressive Storyboard Generation • Paper • 2508.09983 • Published 2 days ago • 48
Adversarial Video Promotion Against Text-to-Video Retrieval • Paper • 2508.06964 • Published 6 days ago • 8
Train Long, Think Short: Curriculum Learning for Efficient Reasoning • Paper • 2508.08940 • Published 3 days ago • 18
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent • Paper • 2508.05748 • Published 8 days ago • 108
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning • Paper • 2508.07101 • Published 6 days ago • 12
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning • Paper • 2508.08221 • Published 4 days ago • 26
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens • Paper • 2508.05305 • Published 8 days ago • 34
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability • Paper • 2508.07050 • Published 6 days ago • 107
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models • Paper • 2508.06471 • Published 7 days ago • 135
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models • Paper • 2508.02120 • Published 11 days ago • 16
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? • Paper • 2508.03644 • Published 10 days ago • 24
Are Today's LLMs Ready to Explain Well-Being Concepts? • Paper • 2508.03990 • Published 10 days ago • 23