Towards Defending against Adversarial Examples via Attack-Invariant Features

Zhou, Dawei; Liu, Tongliang; Han, Bo; Wang, Nannan; Peng, Chunlei; Gao, Xinbo


arXiv:2106.05036 (cs)

[Submitted on 9 Jun 2021]


Abstract: Deep neural networks (DNNs) are vulnerable to adversarial noise. Their adversarial robustness can be improved by exploiting adversarial examples. However, given the continuously evolving attacks, models trained on seen types of adversarial examples generally cannot generalize well to unseen types of adversarial examples. To solve this problem, in this paper, we propose to remove adversarial noise by learning generalizable invariant features across attacks which maintain semantic classification information. Specifically, we introduce an adversarial feature learning mechanism to disentangle invariant features from adversarial noise. We also propose a normalization term in the encoded space of the attack-invariant features to address the bias between seen and unseen types of attacks. Empirical evaluations demonstrate that our method could provide better protection in comparison to previous state-of-the-art approaches, especially against unseen types of attacks and adaptive attacks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.05036 [cs.CV]
	(or arXiv:2106.05036v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2106.05036

Submission history

From: Dawei Zhou
[v1] Wed, 9 Jun 2021 12:49:54 UTC (579 KB)
