Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Multilevel Semantic Embedding of Software Patches: A Fine-to-Coarse Grained Approach Towards Security Patch Detection
TANG, Xunzhu; Chen, zhenghan; EZZINI, Saad et al.
2023In Semantic Patch Embedding for Security Detection: A Fine-to-Coarse Grained Approach
Peer reviewed
 

Files


Full Text
ICLR-SemanticPatch.pdf
Author postprint (526.98 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Computer Science - Software Engineering
Abstract :
[en] The growth of open-source software has increased the risk of hidden vulnerabilities that can affect downstream software applications. This concern is further exacerbated by software vendors' practice of silently releasing security patches without explicit warnings or common vulnerability and exposure (CVE) notifications. This lack of transparency leaves users unaware of potential security threats, giving attackers an opportunity to take advantage of these vulnerabilities. In the complex landscape of software patches, grasping the nuanced semantics of a patch is vital for ensuring secure software maintenance. To address this challenge, we introduce a multilevel Semantic Embedder for security patch detection, termed MultiSEM. This model harnesses word-centric vectors at a fine-grained level, emphasizing the significance of individual words, while the coarse-grained layer adopts entire code lines for vector representation, capturing the essence and interrelation of added or removed lines. We further enrich this representation by assimilating patch descriptions to obtain a holistic semantic portrait. This combination of multi-layered embeddings offers a robust representation, balancing word complexity, understanding code-line insights, and patch descriptions. Evaluating MultiSEM for detecting patch security, our results demonstrate its superiority, outperforming state-of-the-art models with promising margins: a 22.46\% improvement on PatchDB and a 9.21\% on SPI-DB in terms of the F1 metric.
Disciplines :
Computer science
Author, co-author :
TANG, Xunzhu  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
Chen, zhenghan
EZZINI, Saad ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > TruX > Team Jacques KLEIN
TIAN, Haoye ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > TruX > Team Tegawendé François d A BISSYANDE
SONG, Yewei  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
KLEIN, Jacques  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
BISSYANDE, Tegawendé François d Assise  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
External co-authors :
yes
Language :
English
Title :
Multilevel Semantic Embedding of Software Patches: A Fine-to-Coarse Grained Approach Towards Security Patch Detection
Publication date :
2023
Event name :
The Twelfth International Conference on Learning Representations, Tiny Track
Event date :
May 7th, 2024 to May 11th, 2024
By request :
Yes
Main work title :
Semantic Patch Embedding for Security Detection: A Fine-to-Coarse Grained Approach
Publisher :
In 12th International Conference on Learning Representations
Peer reviewed :
Peer reviewed
Name of the research project :
R-AGR-3885 - H2020-ERC-NATURAL - BISSYANDE Tegawendé
Available on ORBilu :
since 02 September 2025

Statistics


Number of views
40 (0 by Unilu)
Number of downloads
8 (0 by Unilu)

Bibliography


Similar publications



Contact ORBilu