Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published 15 days ago • 61
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published 17 days ago • 11
TriangleMix: A Lossless and Efficient Attention Pattern for Long Context Prefilling Paper • 2507.21526 • Published 23 days ago
LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models Paper • 2404.01617 • Published Apr 2, 2024 • 8
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely Paper • 2409.14924 • Published Sep 23, 2024 • 2
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Paper • 2505.12929 • Published May 19 • 3