Article (Scientific journals)
KAPE: <i>k</i> NN-Based Performance Testing for Deep Code Search
GUO, Yuejun; HU, Qiang; Xie, Xiaofei et al.
2024In ACM Transactions on Software Engineering and Methodology, 33 (2), p. 48:1-48:24
Peer Reviewed verified by ORBi
 

Files


Full Text
TOSEM_KAPE_2023.pdf
Author postprint (1.61 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Software
Abstract :
[en] Code search is a common yet important activity of software developers. An efficient code search model can largely facilitate the development process and improve the programming quality. Given the superb performance of learning the contextual representations, deep learning models, especially pre-trained language models, have been widely explored for the code search task. However, studies mainly focus on proposing new architectures for ever-better performance on designed test sets but ignore the performance on unseen test data where only natural language queries are available. The same problem in other domains, e.g., CV and NLP, is usually solved by test input selection that uses a subset of the unseen set to reduce the labeling effort. However, approaches from other domains are not directly applicable and still require labeling effort. In this paper, we propose the k NN-b a sed p erformance t e sting ( KAPE ), to efficiently solve the problem without manually matching code snippets to test queries. The main idea is to use semantically similar training data to perform the evaluation. Extensive experiments on six programming language datasets, three state-of-the-art pre-trained models, and seven baseline methods demonstrate that KAPE can effectively assess the model performance (e.g., CodeBERT achieves MRR 0.5795 on JavaScript) with a slight difference (e.g., 0.0261).
Disciplines :
Computer science
Author, co-author :
GUO, Yuejun ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > SerVal > Team Yves LE TRAON ; Luxembourg Institute of Science and Technology, Luxembourg
HU, Qiang ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
Xie, Xiaofei;  Singapore Management University, Singapore
CORDY, Maxime  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
PAPADAKIS, Mike ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
Le Traon, Yves;  SnT, University of Luxembourg, Luxembourg
External co-authors :
yes
Language :
English
Title :
KAPE: <i>k</i> NN-Based Performance Testing for Deep Code Search
Publication date :
16 January 2024
Journal title :
ACM Transactions on Software Engineering and Methodology
ISSN :
1049-331X
Publisher :
Association for Computing Machinery (ACM)
Volume :
33
Issue :
2
Pages :
48:1-48:24
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBilu :
since 29 December 2023

Statistics


Number of views
110 (4 by Unilu)
Number of downloads
89 (1 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
1
OpenCitations
 
0
OpenAlex citations
 
2

Bibliography


Similar publications



Contact ORBilu