Unleashing the True Potential of Semantic-Based Log Parsing with Pre-Trained Language Models

LE, Van Hoang; Xiao, Yi; Zhang, Hongyu

doi:10.1109/ICSE55347.2025.00174

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

2025 • In Proceedings - 2025 IEEE/ACM 47th International Conference on Software Engineering, ICSE 2025

Peer reviewed

Permalink
https://hdl.handle.net/10993/67305

DOI
10.1109/ICSE55347.2025.00174

Files

Full Text

Unleashing_the_True_Potential_of_Semantic-Based_Log_Parsing_with_Pre-Trained_Language_Models-2.pdf

Author postprint (989.25 kB)

Details

Keywords :

log analytics; log parsing; pre-trained LMs; In contexts; Language model; Performance; Semantics Information; Software intensive systems; State of the art; True potentials; Software

Abstract :

Software-intensive systems often produce console logs for troubleshooting purposes. Log parsing, which aims to parse a log message into a specific log template, typically serves as the first step toward automated log analytics. To better comprehend the semantic information of log messages, many semantic-based log parsers have been proposed. These log parsers fine-tune a small pre-trained language model (PLM) such as RoBERTa on a few labelled log samples. With the increasing popularity of large language models (LLMs), some recent studies also propose to leverage LLMs such as ChatGPT through in-context learning for automated log parsing and obtain better results than previous semantic-based log parsers with small PLMs. In this paper, we show that semantic-based log parsers with small PLMs can achieve performance better than or comparable to that of state-of-the-art LLM-based log parsing models while being more efficient and cost-effective. We propose Unleash, a novel semantic-based log parsing approach, which incorporates three enhancement methods to boost the performance of PLMs for log parsing: (1) an entropy-based ranking method to select the most informative log samples; (2) a contrastive learning method to enhance the fine-tuning process; and (3) an inference optimization method to improve the log parsing performance. We evaluate Unleash on a set of large-scale, public log datasets, and the experimental results show that Unleash is effective and efficient compared to state-of-the-art log parsers.
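
To make the idea of entropy-based sample ranking more concrete, here is a minimal Python sketch. The abstract does not say how Unleash computes entropy, so this is not the authors' method: it assumes, purely for illustration, that a message's informativeness is the Shannon entropy of its own whitespace-separated tokens. The names `token_entropy` and `rank_by_entropy` are hypothetical.

```python
import math
from collections import Counter


def token_entropy(message: str) -> float:
    """Shannon entropy (in bits) of the whitespace-token distribution of one message.

    Assumption for illustration only: the paper's actual entropy definition
    is not given in the abstract.
    """
    tokens = message.split()
    if not tokens:
        return 0.0
    counts = Counter(tokens)
    total = len(tokens)
    # H = -sum(p * log2(p)) over the distinct tokens of the message
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def rank_by_entropy(messages: list[str], k: int) -> list[str]:
    """Return the k messages with the highest token entropy.

    sorted() is stable, so messages with equal entropy keep their input order.
    """
    ranked = sorted(messages, key=token_entropy, reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Hypothetical log lines, not taken from any dataset in the paper.
    logs = [
        "retry retry retry retry",
        "Connection from 10.0.0.5 closed after 30 s",
        "disk full",
    ]
    for line in rank_by_entropy(logs, k=2):
        print(f"{token_entropy(line):.2f}  {line}")
```

Under this assumption, a message made of one repeated token scores zero and a message with many distinct tokens scores highest, so a labelling budget of `k` samples would go to the most varied messages first.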

Disciplines :

Computer science

Author, co-author :

LE, Van Hoang ; University of Newcastle, Australia

Xiao, Yi; Chongqing University, China

Zhang, Hongyu; Chongqing University, China

External co-authors :

yes

Language :

English

Title :

Unleashing the True Potential of Semantic-Based Log Parsing with Pre-Trained Language Models

Publication date :

2025

Event name :

2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE)

Event place :

Ottawa, Canada

Event date :

27-04-2025 to 03-05-2025

Main work title :

Proceedings - 2025 IEEE/ACM 47th International Conference on Software Engineering, ICSE 2025

Publisher :

IEEE Computer Society

ISBN/EAN :

9798331505691

Peer reviewed :

Peer reviewed

Additional URL :

http://xplorestaging.ieee.org/ielx8/11029684/11029718/11029829.pdf?arnumber=11029829

Funders :

ACM SIGSOFT
Carleton University
et al.
IBM
IEEE Computer Society (and TCSE)
University of Ottawa

Funding text :

This work is supported by the Australian Research Council (ARC) Discovery Projects (DP200102940, DP220103044). We also thank the anonymous reviewers for their insightful and constructive comments, which significantly improved this paper.

Available on ORBilu :

since 16 January 2026

Statistics

Number of views

32 (3 by Unilu)

Number of downloads

25 (0 by Unilu)
