UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
Published in Annual Meeting of the Association for Computational Linguistics, 2025
Recommended citation: Boxi Yu+, Yuxuan Zhu, Pinjia He, Daniel Kang.
ACL'25: Annual Meeting of the Association for Computational Linguistics https://aclanthology.org/2025.acl-long.189.pdf