ACON: Optimizing Context Compression for Long-horizon LLM Agents Paper • 2510.00615 • Published 7 days ago • 26 • 2
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23 • 81 • 5
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models Paper • 2504.04718 • Published Apr 7 • 42 • 2