Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Hedgecode: A Multi-Task Hedging Contrastive Learning Framework for Code Search
Chen, Gong; Xie, Xiaoyuan; TANG, Xunzhu et al.
2025In Proceedings - 2025 IEEE/ACM 47th International Conference on Software Engineering, ICSE 2025
Peer reviewed
 

Files


Full Text
ICSE25-HedgeCode.pdf
Author postprint (923.85 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
code search; contrastive learning; multi-task learning; relevance detection; Code search; Correct code; Language description; Learning frameworks; Learning techniques; Multi tasks; Multitask learning; Natural languages; Relevance detection; Representation space; Software
Abstract :
[en] Code search is a vital activity in software engineering, focused on identifying and retrieving the correct code snippets based on a query provided in natural language. Approaches based on deep learning techniques have been increasingly adopted for this task, enhancing the initial representations of both code and its natural language descriptions. Despite this progress, there remains an unexplored gap in ensuring consistency between the representation spaces of code and its descriptions. Furthermore, existing methods have not fully leveraged the potential relevance between code snippets and their descriptions, presenting a challenge in discerning fine-grained semantic distinctions among similar code snippets. To address these challenges, we introduce a multi-task hedging contrastive Learning framework for Code Search, referred to as HedgeCode. HedgeCode is structured around two primary training phases. The first phase, known as the representation alignment stage, proposes a hedging contrastive learning approach. This method aims to detect subtle differences between code and natural language text, thereby aligning their representation spaces by identifying relevance. The subsequent phase involves multi-task joint learning, wherein the previously trained model serves as the encoder. This stage optimizes the model through a combination of supervised and self-supervised contrastive learning tasks. Our framework's effectiveness is demonstrated through its performance on the CodeSearchNet benchmark, showcasing HedgeCode's ability to address the mentioned limitations in code search tasks.
Disciplines :
Computer science
Author, co-author :
Chen, Gong;  Wuhan University, School of Computer Science, China
Xie, Xiaoyuan;  Wuhan University, School of Computer Science, China
TANG, Xunzhu  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
Xin, Qi;  Wuhan University, School of Computer Science, China
Liu, Wenjie;  Wuhan University, School of Computer Science, China
External co-authors :
yes
Language :
English
Title :
Hedgecode: A Multi-Task Hedging Contrastive Learning Framework for Code Search
Publication date :
2025
Event name :
2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE)
Event place :
Ottawa, Can
Event date :
27-04-2025 => 03-05-2025
By request :
Yes
Main work title :
Proceedings - 2025 IEEE/ACM 47th International Conference on Software Engineering, ICSE 2025
Publisher :
IEEE Computer Society
ISBN/EAN :
9798331505691
Peer reviewed :
Peer reviewed
Name of the research project :
R-AGR-3885 - H2020-ERC-NATURAL - BISSYANDE Tegawendé
Funding text :
This work was supported by National Natural Science Foundation of China (Grant No. 62250610224) and the NATURAL project, which has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (Grant No. 949014).
Available on ORBilu :
since 02 September 2025

Statistics


Number of views
89 (0 by Unilu)
Number of downloads
126 (0 by Unilu)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBilu