Article (Périodiques scientifiques)
Impact of Log Parsing on Deep Learning-Based Anomaly Detection
KHAN, Zanis Ali; Shin, Donghwan; BIANCULLI, Domenico et al.
2024In Empirical Software Engineering, 29, p. 139:1--139:33
Peer reviewed vérifié par ORBi Dataset
 

Documents


Texte intégral
zanis_anomaly_detection_log_parsing.pdf
Postprint Auteur (770.92 kB) Licence Creative Commons - Attribution
Télécharger

Tous les documents dans ORBilu sont protégés par une licence d'utilisation.

Envoyer vers



Détails



Mots-clés :
Computer Science - Software Engineering
Résumé :
Software systems log massive amounts of data, recording important runtime information. Such logs are used, for example, for log-based anomaly detection, which aims to automatically detect abnormal behaviors of the system under analysis by processing the information recorded in its logs. Many log-based anomaly detection techniques based on deep learning models include a pre-processing step called log parsing. However, understanding the impact of log parsing on the accuracy of anomaly detection techniques has received surprisingly little attention so far. Investigating what are the key properties log parsing techniques should ideally have to help anomaly detection is therefore warranted. In this paper, we report on a comprehensive empirical study on the impact of log parsing on anomaly detection accuracy, using 13 log parsing techniques, seven anomaly detection techniques (five based on deep learning and two based on traditional machine learning) on three publicly available log datasets. Our empirical results show that, despite what is widely assumed, there is no strong correlation between log parsing accuracy and anomaly detection accuracy, regardless of the metric used for measuring log parsing accuracy. Moreover, we experimentally confirm existing theoretical results showing that it is a property that we refer to as distinguishability in log parsing results as opposed to their accuracy that plays an essential role in achieving accurate anomaly detection.
Centre de recherche :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SVV - Software Verification and Validation
Disciplines :
Sciences informatiques
Auteur, co-auteur :
KHAN, Zanis Ali ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > SVV > Team Domenico BIANCULLI
Shin, Donghwan
BIANCULLI, Domenico  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV
BRIAND, Lionel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV
Co-auteurs externes :
yes
Langue du document :
Anglais
Titre :
Impact of Log Parsing on Deep Learning-Based Anomaly Detection
Date de publication/diffusion :
2024
Titre du périodique :
Empirical Software Engineering
ISSN :
1382-3256
eISSN :
1573-7616
Maison d'édition :
Kluwer Academic Publishers, Pays-Bas
Volume/Tome :
29
Pagination :
139:1--139:33
Peer reviewed :
Peer reviewed vérifié par ORBi
Focus Area :
Security, Reliability and Trust
Projet FnR :
FNR17373407 - Automated Log Smell Detection And Removal, 2022 (01/09/2023-31/08/2026) - Domenico Bianculli
Intitulé du projet de recherche :
LOGODOR - Automated Log Smell Detection and Removal
Organisme subsidiant :
FNR - Luxembourg National Research Fund
N° du Fonds :
C22/IS/17373407/LOGODOR
Subventionnement (détails) :
This research was funded in whole, or in part, by the Luxembourg National Re- search Fund (FNR), grant reference C22/IS/17373407/LOGODOR. Lionel Briand was in part supported by the Canada Research Chair and Discovery Grant programs of the Natural Sciences and Engineering Research Council of Canada (NSERC), and the Science Foundation Ireland grant 13/RC/2094-2. For the purpose of open access, and in fulfillment of the obligations arising from the grant agreement, the authors have applied a Creative Commons Attribution 4.0 International (CC BY 4.0) license to any Author Accepted Manuscript version arising from this submission.
Disponible sur ORBilu :
depuis le 12 août 2024

Statistiques


Nombre de vues
311 (dont 48 Unilu)
Nombre de téléchargements
157 (dont 23 Unilu)

citations Scopus®
 
15
citations Scopus®
sans auto-citations
13
OpenCitations
 
0
citations OpenAlex
 
17

Bibliographie


Publications similaires



Contacter ORBilu