Article (Scientific journals)
Demystifying API misuses in deep learning applications
Yang, Deheng; Liu, Kui; Lei, Yan et al.
2024In Empirical Software Engineering, 29 (2)
Peer Reviewed verified by ORBi
 

Files


Full Text
s10664-023-10413-9.pdf
Publisher postprint (2.55 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
API misuse; Bug dataset; Deep learning application; Empirical study; Application program interfaces; Data driven; Empirical studies; Ie, application program interface misuse; Misclassifications; Performance; State of the art; Static analyzers; Software
Abstract :
[en] Deep Learning (DL) is achieving staggering performance on an increasing number of applications in various areas. Meanwhile, its associated data-driven programming paradigm comes with a set of challenges for the software engineering community, including the debugging activities for DL applications. Recent empirical studies on bugs in DL applications have shown that the API (i.e., Application Program Interface) misuse has been flagged as an important category of DL programming bugs. By exploring this literature towards API misuse bugs in DL applications, we identified three barriers that are locking an entire research direction. However, three barriers are hindering progress in this research direction: misclassification of API misuse bugs, lack of relevant dataset, and limited depth of analysis. Our work unlocks these barriers by providing an in-depth analysis of a frequent bug type that appears as a mystery. Concretely, we first offer a new perspective to a significant misclassification issue in the literature that hinders understanding of API misuses in DL applications. Subsequently, we curate the first dataset MisuAPI of 143 API misuses sampled from real-world DL applications. Finally, we perform systematic analyses to dissect API misuses and enumerate the symptoms of API misuses in DL applications as well as investigate the possibility of detecting them with state-of-the-art static analyzers. Overall, the insights summarized in this work are important for the community: 1) 18-35% of real API misuses are mislabelled in existing DL bug studies; 2) the widely adopted API misuse taxonomy, namely MUC, does not cover the cases of 1 out of 3 encountered API misuses; 3) DL library API misuses show significant differences from the general third-party library API misuses in terms of the API-usage element issue and symptoms; 4) Most (92.3%) API misuses lead to program crashes; 5) 95.8% API misuses remain undetectable by state-of-the-art static analyzers.
Disciplines :
Computer science
Author, co-author :
Yang, Deheng;  College of Computer, National University of Defense Technology, Changsha, China
Liu, Kui;  Huawei Software Engineering Application Technology Lab, Ningbo, China
Lei, Yan ;  School of Big Data and Software Engineering, Chongqing University, Chongqing, China ; Peng Cheng Laboratory, ShenZhen, China
Li, Li;  School of Big Data and Software Engineering, Chongqing University, Chongqing, China
Xie, Huan;  School of Big Data and Software Engineering, Chongqing University, Chongqing, China ; Peng Cheng Laboratory, ShenZhen, China
Liu, Chunyan;  School of Big Data and Software Engineering, Chongqing University, Chongqing, China ; Peng Cheng Laboratory, ShenZhen, China
Wang, Zhenyu;  School of Big Data and Software Engineering, Chongqing University, Chongqing, China
Mao, Xiaoguang;  College of Computer, National University of Defense Technology, Changsha, China
BISSYANDE, Tegawendé François d Assise  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
External co-authors :
yes
Language :
English
Title :
Demystifying API misuses in deep learning applications
Publication date :
March 2024
Journal title :
Empirical Software Engineering
ISSN :
1382-3256
eISSN :
1573-7616
Publisher :
Springer
Volume :
29
Issue :
2
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
National Natural Science Foundation of China
Funding text :
This research was partially supported by the National Natural Science Foundation of China (Nos. 62172214, 62272072), the Natural Science Foundation of Jiangsu Province, China (BK20210279), and the Major Key Projectof PCL (No. PCL2021A06).
Available on ORBilu :
since 10 December 2024

Statistics


Number of views
90 (2 by Unilu)
Number of downloads
42 (0 by Unilu)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
0
WoS citations
 
0

Bibliography


Similar publications



Contact ORBilu