UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Published in Annual Meeting of the Association for Computational Linguistics, 2025

Recommended citation: Boxi Yu+, Yuxuan Zhu, Pinjia He, Daniel Kang.
ACL'25: Annual Meeting of the Association for Computational Linguistics https://aclanthology.org/2025.acl-long.189.pdf

Direct Link