Adversarial Robustness of Deep Code Comment Generation

Zhou, Yu; Zhang, Xiaoqing; Shen, Juanjuan; Han, Tingting; Chen, Taolue; Gall, Harald

Computer Science > Software Engineering

arXiv:2108.00213v2 (cs)

[Submitted on 31 Jul 2021 (v1), revised 7 Nov 2021 (this version, v2), latest version 30 Nov 2021 (v3)]

Title:Adversarial Robustness of Deep Code Comment Generation

Authors:Yu Zhou, Xiaoqing Zhang, Juanjuan Shen, Tingting Han, Taolue Chen, Harald Gall

View PDF

Abstract:Deep neural networks (DNNs) have shown remarkable performance in a variety of domains such as computer vision, speech recognition, or natural language processing. Recently they also have been applied to various software engineering tasks, typically involving processing source code. DNNs are well-known to be vulnerable to adversarial examples, i.e., fabricated inputs that could lead to various misbehaviors of the DNN model while being perceived as benign by humans. In this paper, we focus on the code comment generation task in software engineering and study the robustness issue of the DNNs when they are applied to this task. We propose ACCENT, an identifier substitution approach to craft adversarial code snippets, which are syntactically correct and semantically close to the original code snippet, but may mislead the DNNs to produce completely irrelevant code comments. In order to improve the robustness, ACCENT also incorporates a novel training method, which can be applied to existing code comment generation models. We conduct comprehensive experiments to evaluate our approach by attacking the mainstream encoder-decoder architectures on two large-scale publicly available datasets. The results show that ACCENT efficiently produces stable attacks with functionality-preserving adversarial examples, and the generated examples have better transferability compared with baselines. We also confirm, via experiments, the effectiveness in improving model robustness with our training method.

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2108.00213 [cs.SE]
	(or arXiv:2108.00213v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2108.00213

Submission history

From: Yu Zhou [view email]
[v1] Sat, 31 Jul 2021 10:58:31 UTC (825 KB)
[v2] Sun, 7 Nov 2021 13:07:50 UTC (823 KB)
[v3] Tue, 30 Nov 2021 03:11:38 UTC (821 KB)

Computer Science > Software Engineering

Title:Adversarial Robustness of Deep Code Comment Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Adversarial Robustness of Deep Code Comment Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators