Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 12 days ago • 63
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published 11 days ago • 116
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 10 days ago • 101
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published 10 days ago • 58
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 10 days ago • 59
Trace Anything: Representing Any Video in 4D via Trajectory Fields Paper • 2510.13802 • Published 17 days ago • 30
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 26 days ago • 461
FlashWorld: High-quality 3D Scene Generation within Seconds Paper • 2510.13678 • Published 17 days ago • 69
The Role of Computing Resources in Publishing Foundation Model Research Paper • 2510.13621 • Published 17 days ago • 14
MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training Paper • 2510.12831 • Published 20 days ago • 2
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 19 days ago • 168
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1 • 107
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation Paper • 2509.15194 • Published Sep 18 • 33
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2 • 185
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published May 28 • 130
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning Paper • 2505.11896 • Published May 17 • 58
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29 • 96