Prescriptive Scaling Reveals the Evolution of Language Model Capabilities Paper • 2602.15327 • Published 15 days ago • 3
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 23 days ago • 41
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published about 1 month ago • 32
Empirical-MCTS: Continuous Agent Evolution via Dual-Experience Monte Carlo Tree Search Paper • 2602.04248 • Published 28 days ago
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Paper • 2602.02477 • Published 30 days ago • 10
Fast KVzip: Efficient and Accurate LLM Inference with Gated KV Eviction Paper • 2601.17668 • Published Jan 25 • 7
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published Jan 20 • 47
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published Jan 15 • 20
MCTSr-Zero: Self-Reflective Psychological Counseling Dialogues Generation via Principles and Adaptive Exploration Paper • 2505.23229 • Published May 29, 2025
SIGMA: An AI-Empowered Training Stack on Early-Life Hardware Paper • 2512.13488 • Published Dec 15, 2025