LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models

Ma, Lipeng; Yang, Weidong; Jiang, Sihang; Fei, Ben; Zhou, Mingjie; Li, Shuhao; Zhao, Mingyu; Xu, Bo; Xiao, Yanghua

Computer Science > Software Engineering

arXiv:2409.01909 (cs)

[Submitted on 3 Sep 2024 (v1), last revised 31 Jan 2025 (this version, v2)]

Title:LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models

Authors:Lipeng Ma, Weidong Yang, Sihang Jiang, Ben Fei, Mingjie Zhou, Shuhao Li, Mingyu Zhao, Bo Xu, Yanghua Xiao

View PDF HTML (experimental)

Abstract:Logs play a critical role in providing essential information for system monitoring and troubleshooting. Recently, with the success of pre-trained language models (PLMs) and large language models (LLMs) in natural language processing (NLP), smaller PLMs (such as BERT) and LLMs (like GPT-4) have become the current mainstream approaches for log analysis. Despite the remarkable capabilities of LLMs, their higher cost and inefficient inference present significant challenges in leveraging the full potential of LLMs to analyze logs. In contrast, smaller PLMs can be fine-tuned for specific tasks even with limited computational resources, making them more practical. However, these smaller PLMs face challenges in understanding logs comprehensively due to their limited expert knowledge. To address the lack of expert knowledge and enhance log understanding for smaller PLMs, this paper introduces a novel and practical knowledge enhancement framework, called LUK, which acquires expert knowledge from LLMs automatically and then enhances the smaller PLM for log analysis with these expert knowledge. LUK can take full advantage of both types of models. Specifically, we design a multi-expert collaboration framework based on LLMs with different roles to acquire expert knowledge. In addition, we propose two novel pre-training tasks to enhance the log pre-training with expert knowledge. LUK achieves state-of-the-art results on different log analysis tasks and extensive experiments demonstrate expert knowledge from LLMs can be utilized more effectively to understand logs. Our source code and detailed experimental data are available at this https URL.

Comments:	Under review
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2409.01909 [cs.SE]
	(or arXiv:2409.01909v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2409.01909

Submission history

From: Lipeng Ma [view email]
[v1] Tue, 3 Sep 2024 13:58:34 UTC (1,435 KB)
[v2] Fri, 31 Jan 2025 05:51:52 UTC (8,604 KB)

Computer Science > Software Engineering

Title:LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators