One-pixel Signature: Characterizing CNN Models for Backdoor Detection

Huang, Shanjiaoyang; Peng, Weiqi; Jia, Zhiwei; Tu, Zhuowen

arXiv:2008.07711 (cs)

[Submitted on 18 Aug 2020]

Abstract: We tackle the backdoor detection problem for convolutional neural networks (CNNs) by proposing a new representation called the one-pixel signature. The task is to classify whether a CNN model has had an unknown Trojan trigger maliciously inserted. Each CNN model is associated with a signature, created by generating, pixel by pixel, the adversarial value that produces the largest change in the class prediction. The one-pixel signature is agnostic to the choice of CNN architecture and to how the model was trained. It can be computed efficiently for a black-box CNN model, without access to the network parameters. The one-pixel signature yields a substantial improvement (around 30% in absolute detection accuracy) over existing competing methods for backdoored CNN detection. It is a general representation that can be used to characterize CNN models beyond backdoor detection.
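The pixel-by-pixel construction described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `one_pixel_signature`, the blank base image, the candidate value grid, the per-class signature layout, and the toy classifier `toy_predict` are all assumptions made for illustration. The only property taken from the abstract is that the model is queried as a black box and that, at each pixel, the value causing the largest change in the class prediction is recorded.

```python
import numpy as np

def one_pixel_signature(predict, base_image, num_classes, values):
    """Sketch of a one-pixel signature (assumed layout: classes x height x width).

    predict: black-box callable mapping a batch (N, H, W) to probabilities (N, K).
    base_image: reference image of shape (H, W); a blank image is an assumption.
    values: 1-D array of candidate pixel values to try at each location.
    """
    h, w = base_image.shape
    base_prob = predict(base_image[None])[0]            # shape (K,)
    signature = np.zeros((num_classes, h, w))
    for i in range(h):
        for j in range(w):
            # One probe per candidate value, differing from the base only at (i, j).
            probes = np.repeat(base_image[None], len(values), axis=0)
            probes[:, i, j] = values
            delta = predict(probes) - base_prob         # shape (V, K)
            # Per class, keep the value that shifts its probability the most.
            best = np.argmax(np.abs(delta), axis=0)     # shape (K,)
            signature[:, i, j] = values[best]
    return signature

# Hypothetical stand-in for a trained CNN: softmax over fixed linear scores.
_weights = np.random.default_rng(0).normal(size=(16, 3))

def toy_predict(batch):
    logits = batch.reshape(len(batch), -1) @ _weights
    exp = np.exp(logits - logits.max(axis=1, keepdims=True))
    return exp / exp.sum(axis=1, keepdims=True)

values = np.linspace(0.0, 1.0, 5)
sig = one_pixel_signature(toy_predict, np.zeros((4, 4)), num_classes=3, values=values)
print(sig.shape)
```

Only `predict` is ever called, so no network parameters are accessed. In a detection setting, signatures of this kind, computed for many clean and backdoored models, would serve as input features for a separate classifier.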

Comments:	Accepted at ECCV 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2008.07711 [cs.CV]
	(or arXiv:2008.07711v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2008.07711

Submission history

From: Shanjiaoyang Huang
[v1] Tue, 18 Aug 2020 02:54:47 UTC (7,313 KB)