MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 7 days ago • 29
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 7 days ago • 29
Empirical-MCTS: Continuous Agent Evolution via Dual-Experience Monte Carlo Tree Search Paper • 2602.04248 • Published 6 days ago
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Paper • 2602.02477 • Published 7 days ago • 9
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Paper • 2602.02477 • Published 7 days ago • 9
Fast KVzip: Efficient and Accurate LLM Inference with Gated KV Eviction Paper • 2601.17668 • Published 16 days ago • 5
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published 20 days ago • 46
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published 25 days ago • 20
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published 25 days ago • 20
MCTSr-Zero: Self-Reflective Psychological Counseling Dialogues Generation via Principles and Adaptive Exploration Paper • 2505.23229 • Published May 29, 2025
SIGMA: An AI-Empowered Training Stack on Early-Life Hardware Paper • 2512.13488 • Published Dec 15, 2025
Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem Paper • 2512.03073 • Published Nov 27, 2025 • 6
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions Paper • 2512.00097 • Published Nov 27, 2025 • 3
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 187
OpenAgents: An Open Platform for Language Agents in the Wild Paper • 2310.10634 • Published Oct 16, 2023 • 9