SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents Paper • 2504.08703 • Published Apr 11 • 1