Selected Publications
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
Xiaoyuan Liu+, Tian Liang, Zhiwei He, Jiahao Xu, Wenxuan Wang, Pinjia He, Zhaopeng Tu, Haitao Mi, Dong Yu.
NeurIPS'25: Annual Conference on Neural Information Processing Systems
Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
Youliang Yuan+, Wenxiang Jiao, Yuejin Xie+, Chihao Shen+, Menghan Tian+, Wenxuan Wang, Jen-tse Huang, Pinjia He.
NeurIPS'25: Annual Conference on Neural Information Processing Systems, Datasets and Benchmarks Track
OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
Junjielong Xu+, Qinan Zhang+, Zhiqing Zhong+, Shilin He, Chaoyun Zhang, Qingwei Lin, Dan Pei, Pinjia He, Dongmei Zhang, Qi Zhang.
ICLR'25: International Conference on Learning Representations
Aligning the Objective of LLM-Based Program Repair
Junjielong Xu+, Ying Fu+, Shin Hwei Tan, Pinjia He.
ICSE'25: International Conference on Software Engineering
An Empirical Study on Package-Level Deprecation in Python Ecosystem
Zhiqing Zhong+, Shilin He, Haoxuan Wang+, Boxi Yu+, Haowen Yang+, Pinjia He.
ICSE'25: International Conference on Software Engineering
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
Youliang Yuan+, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu.
ACL'25: Annual Meeting of the Association for Computational Linguistics
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
Boxi Yu+, Yuxuan Zhu, Pinjia He, Daniel Kang.
ACL'25: Annual Meeting of the Association for Computational Linguistics
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan+, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu.
ICLR'24: International Conference on Learning Representations
Testing Graph Database Systems via Equivalent Query Rewriting
Qiuyang Mang+, Aoyang Fang+, Boxi Yu+, Hanfei Chen+, Pinjia He.
ICSE'24: International Conference on Software Engineering
Deep Learning or Classical Machine Learning? An Empirical Study on Log-Based Anomaly Detection
Boxi Yu+, Jiayi Yao+, Qiuai Fu, Zhiqing Zhong+, Haotian Xie+, Yaoliang Wu, Yuchi Ma, Pinjia He.
ICSE'24: International Conference on Software Engineering
ROME: Testing Image Captioning Systems via Recursive Object Melting
Boxi Yu+, Zhiqing Zhong+, Jiaqi Li+, Yixing Yang+, Shilin He, Pinjia He.
ISSTA'23: International Symposium on Software Testing and Analysis
A Survey on Automated Log Analysis for Reliability Engineering
Shilin He, Pinjia He, Zhuangbin Chen, Tianyi Yang, Yuxin Su, Michael R. Lyu.
CSUR'21: ACM Computing Surveys
Testing Machine Translation via Referential Transparency
Pinjia He, Clara Meister, Zhendong Su.
ICSE'21: International Conference on Software Engineering
Structure-Invariant Testing for Machine Translation
Pinjia He, Clara Meister, Zhendong Su.
ICSE'20: International Conference on Software Engineering
Drain: An Online Log Parsing Approach with Fixed Depth Tree
Pinjia He, Jieming Zhu, Zibin Zheng, Michael R. Lyu.
ICWS'17: International Conference on Web Services
Experience Report: System Log Analysis for Anomaly Detection
Shilin He, Jieming Zhu, Pinjia He, Michael R. Lyu.
ISSRE'16: International Symposium on Software Reliability Engineering (Most Influential Paper Award)
+ denotes a supervised student
Complete list: [Google Scholar] [DBLP]