Article (Scientific journals)
Benchmarking and Categorizing the Performance of Neural Program Repair Systems for Java
Zhong, Wenkang; Li, Chuanyi; Liu, Kui et al.
2024In ACM Transactions on Software Engineering and Methodology, 34 (1), p. 1-35
Peer Reviewed verified by ORBi
 

Files


Full Text
2024-zhong-benchmarking.pdf
Author postprint (693.42 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
benchmark; datasets; empirical study; program repair; 'current; Benchmark; Dataset; Empirical studies; Engineering community; Existing systems; Learning techniques; Performance; Program repair; Repair system; Software
Abstract :
[en] Recent years have seen a rise in Neural Program Repair (NPR) systems in the software engineering community, which adopt advanced deep learning techniques to automatically fix bugs. Having a comprehensive understanding of existing systems can facilitate new improvements in this area and provide practical instructions for users. However, we observe two potential weaknesses in the current evaluation of NPR systems: published systems are trained with varying data, and NPR systems are roughly evaluated through the number of totally fixed bugs. Questions such as what types of bugs are repairable for current systems cannot be answered yet. Consequently, researchers cannot make target improvements in this area and users have no idea of the real affair of existing systems. In this article, we perform a systematic evaluation of the existing nine state-of-the-art NPR systems. To perform a fair and detailed comparison, we (1) build a new benchmark and framework that supports training and validating the nine systems with unified data and (2) evaluate re-trained systems with detailed performance analysis, especially on the effectiveness and the efficiency. We believe our benchmark tool and evaluation results could offer practitioners the real affairs of current NPR systems and the implications of further facilitating the improvements of NPR.
Disciplines :
Computer science
Author, co-author :
Zhong, Wenkang ;  State Key Laboratory for Novel Software and Technology, Nanjing University, Nanjing, China
Li, Chuanyi ;  State Key Laboratory for Novel Software and Technology, Nanjing University, Nanjing, China
Liu, Kui ;  Huawei Software Engineering Application Technology Lab, Hangzhou, China
Ge, Jidong ;  State Key Laboratory for Novel Software and Technology, Nanjing University, Nanjing, China
Luo, Bin ;  State Key Laboratory for Novel Software and Technology, Nanjing University, Nanjing, China
BISSYANDE, Tegawendé  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
Ng, Vincent ;  University of Texas at Dallas, Richardson, United States
External co-authors :
yes
Language :
English
Title :
Benchmarking and Categorizing the Performance of Neural Program Repair Systems for Java
Publication date :
30 December 2024
Journal title :
ACM Transactions on Software Engineering and Methodology
ISSN :
1049-331X
Publisher :
Association for Computing Machinery
Volume :
34
Issue :
1
Pages :
1-35
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
National Key Research and Development Program of China
National Natural Science Foundation of China
Natural Science Foundation of Jiangsu Province, China
CCF-Huawei Populus Grove Fund
European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program
Funding text :
This research/project is supported by the National Key Research and Development Program of China (2022YFF0711404), National Natural Science Foundation of China (62172214), Natural Science Foundation of Jiangsu Province, China (BK20201250, BK20210279), CCF-Huawei Populus Grove Fund, NSF award 2034508, and the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program (Grant Agreement No. 949014).
Available on ORBilu :
since 15 December 2025

Statistics


Number of views
0 (0 by Unilu)
Number of downloads
0 (0 by Unilu)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
1
WoS citations
 
0

Bibliography


Similar publications



Contact ORBilu