ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention Paper • 2507.01004 • Published 16 days ago • 10
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling Paper • 2506.20512 • Published 22 days ago • 45
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published 29 days ago • 47
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis Paper • 2505.13227 • Published May 19 • 46
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization Paper • 2310.16427 • Published Oct 25, 2023 • 2
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models Paper • 2404.05221 • Published Apr 8, 2024 • 1