RepliComment: Identifying Clones in Code Comments

Blasi, Arianna; Stulova, Nataliia; Gorla, Alessandra; Nierstrasz, Oscar

arXiv:2108.11205 (cs)

[Submitted on 25 Aug 2021]

Abstract: Code comments are the primary means to document implementation and facilitate program comprehension. Thus, their quality should be a primary concern to improve program maintenance. While much effort has been dedicated to detecting bad smells, such as clones in code, little work has focused on comments. In this paper we present our solution to detect clones in comments that developers should fix. RepliComment can automatically analyze Java projects, report instances of copy-and-paste errors in comments, and point developers to the comments that should be fixed. Moreover, it can report when clones are signs of poorly written comments. Developers should fix these instances too in order to improve the quality of the code documentation. Our evaluation of 10 well-known open-source Java projects identified over 11K instances of comment clones, over 1,300 of which are potentially critical. We improve on our own previous work, which could only find 36 issues in the same dataset. Our manual inspection of 412 issues reported by RepliComment reveals that it achieves a precision of 79% in reporting critical comment clones. A manual inspection of 200 additional comment clones that RepliComment filters out as legitimate did not reveal any false negatives.
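To make the notion of a comment clone concrete, the following is a minimal sketch and not RepliComment's actual algorithm, which the abstract does not describe. It assumes a regex-based extraction of Javadoc blocks that directly precede a method, and it flags identical comment text attached to methods with different names. The pattern `JAVADOC_METHOD`, the helpers `normalize` and `find_comment_clones`, and the `SAMPLE` class are all hypothetical names introduced for illustration. Ignoring same-named overloads is likewise an assumption of this sketch.

```python
import re
from collections import defaultdict

# Hypothetical pattern: a Javadoc block immediately followed by a method
# signature and its opening brace. A real tool would use a Java parser.
JAVADOC_METHOD = re.compile(
    r"/\*\*(?P<doc>(?:(?!\*/).)*)\*/\s*"
    r"(?P<sig>[^;{}]*?\b(?P<name>\w+)\s*\([^)]*\))\s*\{",
    re.DOTALL,
)


def normalize(doc: str) -> str:
    """Drop leading asterisks and collapse whitespace in a Javadoc body."""
    lines = [re.sub(r"^\s*\*?\s?", "", line) for line in doc.splitlines()]
    return " ".join(" ".join(lines).split())


def find_comment_clones(java_source: str) -> dict[str, list[str]]:
    """Map each comment text shared by differently named methods to those methods."""
    groups = defaultdict(list)
    for match in JAVADOC_METHOD.finditer(java_source):
        text = normalize(match.group("doc"))
        if text:
            groups[text].append(match.group("name"))
    # Assumption of this sketch: a comment repeated only across overloads
    # (same method name) is not reported.
    return {t: names for t, names in groups.items() if len(set(names)) > 1}


# Invented example class; 'last' carries a comment copied from 'first'.
SAMPLE = """
class Stack {
    /** Returns the first element. */
    public int first() { return items[0]; }

    /** Returns the first element. */
    public int last() { return items[n - 1]; }

    /** Removes all elements. */
    public void clear() { n = 0; }
}
"""

if __name__ == "__main__":
    print(find_comment_clones(SAMPLE))
```

In this invented sample, the comment on `last` was copied from `first` without being updated, which is the kind of copy-and-paste error the abstract refers to. Exact text matching after whitespace normalization is the simplest possible criterion and would miss near-duplicates.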

Comments:	31 pages, 1 figure, 9 tables. To appear in the Journal of Systems and Software
Subjects:	Software Engineering (cs.SE)
ACM classes:	D.2.7; D.2.9
Cite as:	arXiv:2108.11205 [cs.SE]
	(or arXiv:2108.11205v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2108.11205

Submission history

From: Nataliia Stulova
[v1] Wed, 25 Aug 2021 12:41:23 UTC (104 KB)
