BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? Paper • 2603.03194 • Published 9 days ago • 53
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? Paper • 2603.03194 • Published 9 days ago • 53
LLM-in-Sandbox Collection Data and models for the paper: LLM-in-Sandbox Elicits General Agentic Intelligence. Feel free to open an issue if you have any questions or problems! • 3 items • Updated 16 days ago • 1
LLM-in-Sandbox Collection Data and models for the paper: LLM-in-Sandbox Elicits General Agentic Intelligence. Feel free to open an issue if you have any questions or problems! • 3 items • Updated 16 days ago • 1
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 37
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 37
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published Feb 3 • 40
Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning Paper • 2602.00759 • Published Jan 31 • 5