SCLA: Automated Smart Contract Summarization via LLMs and Control Flow Prompt

Li, Xiaoqi; Mao, Yingjie; Lu, Zexin; Li, Wenkai; Li, Zongwei

Computer Science > Software Engineering

arXiv:2402.04863 (cs)

[Submitted on 7 Feb 2024 (v1), last revised 13 Mar 2025 (this version, v6)]

Title:SCLA: Automated Smart Contract Summarization via LLMs and Control Flow Prompt

Authors:Xiaoqi Li, Yingjie Mao, Zexin Lu, Wenkai Li, Zongwei Li

View PDF HTML (experimental)

Abstract:Smart contract code summarization is crucial for efficient maintenance and vulnerability mitigation. While many studies use Large Language Models (LLMs) for summarization, their performance still falls short compared to fine-tuned models like CodeT5+ and CodeBERT. Some approaches combine LLMs with data flow analysis but fail to fully capture the hierarchy and control structures of the code, leading to information loss and degraded summarization quality. We propose SCLA, an LLM-based method that enhances summarization by integrating a Control Flow Graph (CFG) and semantic facts from the code's control flow into a semantically enriched prompt. SCLA uses a control flow extraction algorithm to derive control flows from semantic nodes in the Abstract Syntax Tree (AST) and constructs the corresponding CFG. Code semantic facts refer to both explicit and implicit information within the AST that is relevant to smart contracts. This method enables LLMs to better capture the structural and contextual dependencies of the code. We validate the effectiveness of SCLA through comprehensive experiments on a dataset of 40,000 real-world smart contracts. The experiment shows that SCLA significantly improves summarization quality, outperforming the SOTA baselines with improvements of 26.7%, 23.2%, 16.7%, and 14.7% in BLEU-4, METEOR, ROUGE-L, and BLEURT scores, respectively.

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2402.04863 [cs.SE]
	(or arXiv:2402.04863v6 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2402.04863

Submission history

From: Yingjie Mao [view email]
[v1] Wed, 7 Feb 2024 13:58:26 UTC (194 KB)
[v2] Thu, 8 Feb 2024 06:09:16 UTC (219 KB)
[v3] Wed, 21 Feb 2024 14:18:32 UTC (265 KB)
[v4] Sat, 17 Aug 2024 03:41:42 UTC (2,810 KB)
[v5] Tue, 20 Aug 2024 02:34:56 UTC (2,810 KB)
[v6] Thu, 13 Mar 2025 07:05:15 UTC (949 KB)

Computer Science > Software Engineering

Title:SCLA: Automated Smart Contract Summarization via LLMs and Control Flow Prompt

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:SCLA: Automated Smart Contract Summarization via LLMs and Control Flow Prompt

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators