Selected Publications

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

Xiaoyuan Liu+, Tian Liang, Zhiwei He, Jiahao Xu, Wenxuan Wang, Pinjia He, Zhaopeng Tu, Haitao Mi, Dong Yu.
NeurIPS'25: Annual Conference on Neural Information Processing Systems

Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Youliang Yuan+, Wenxiang Jiao, Yuejin Xie+, Chihao Shen+, Menghan Tian+, Wenxuan Wang, Jen-tse Huang, Pinjia He.
NeurIPS'25: Annual Conference on Neural Information Processing Systems, Datasets and Benchmarks Track

OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

Junjielong Xu+, Qinan Zhang+, Zhiqing Zhong+, Shilin He, Chaoyun Zhang, Qingwei Lin, Dan Pei, Pinjia He, Dongmei Zhang, Qi Zhang.
ICLR'25: International Conference on Learning Representations

Aligning the Objective of LLM-Based Program Repair

Junjielong Xu+, Ying Fu+, Shin Hwei Tan, Pinjia He.
ICSE'25: International Conference on Software Engineering

An Empirical Study on Package-Level Deprecation in Python Ecosystem

Zhiqing Zhong+, Shilin He, Haoxuan Wang+, Boxi Yu+, Haowen Yang+, Pinjia He.
ICSE'25: International Conference on Software Engineering

Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

Youliang Yuan+, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu.
ACL'25: Annual Meeting of the Association for Computational Linguistics

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Boxi Yu+, Yuxuan Zhu, Pinjia He, Daniel Kang.
ACL'25: Annual Meeting of the Association for Computational Linguistics

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

Youliang Yuan+, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu.
ICLR'24: International Conference on Learning Representations

Testing Graph Database Systems via Equivalent Query Rewriting

Qiuyang Mang+, Aoyang Fang+, Boxi Yu+, Hanfei Chen+, Pinjia He.
ICSE'24: International Conference on Software Engineering

Deep Learning or Classical Machine Learning? An Empirical Study on Log-Based Anomaly Detection

Boxi Yu+, Jiayi Yao+, Qiuai Fu, Zhiqing Zhong+, Haotian Xie+, Yaoliang Wu, Yuchi Ma, Pinjia He.
ICSE'24: International Conference on Software Engineering

ROME: Testing Image Captioning Systems via Recursive Object Melting

Boxi Yu+, Zhiqing Zhong+, Jiaqi Li+, Yixing Yang+, Shilin He, Pinjia He.
ISSTA'23: International Symposium on Software Testing and Analysis

A Survey on Automated Log Analysis for Reliability Engineering

Shilin He, Pinjia He, Zhuangbin Chen, Tianyi Yang, Yuxin Su, Michael R. Lyu.
CSUR'21: ACM Computing Surveys

Testing Machine Translation via Referential Transparency

Pinjia He, Clara Meister, Zhendong Su.
ICSE'21: International Conference on Software Engineering

Structure-Invariant Testing for Machine Translation

Pinjia He, Clara Meister, Zhendong Su.
ICSE'20: International Conference on Software Engineering

Drain: An Online Log Parsing Approach with Fixed Depth Tree

Pinjia He, Jieming Zhu, Zibin Zheng, Michael R. Lyu.
ICWS'17: International Conference on Web Services

Experience Report: System Log Analysis for Anomaly Detection

Shilin He, Jieming Zhu, Pinjia He, Michael R. Lyu.
ISSRE'16: International Symposium on Software Reliability Engineering
Most Influential Paper Award

+ Supervised student

Complete list: [Google Scholar] [DBLP]