Disentangling Adversarial Robustness and Generalization

Stutz, David; Hein, Matthias; Schiele, Bernt

Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.00740 (cs)

[Submitted on 3 Dec 2018 (v1), last revised 10 Apr 2019 (this version, v2)]

Title:Disentangling Adversarial Robustness and Generalization

Authors:David Stutz, Matthias Hein, Bernt Schiele

View PDF

Abstract:Obtaining deep networks that are robust against adversarial examples and generalize well is an open problem. A recent hypothesis even states that both robust and accurate models are impossible, i.e., adversarial robustness and generalization are conflicting goals. In an effort to clarify the relationship between robustness and generalization, we assume an underlying, low-dimensional data manifold and show that: 1. regular adversarial examples leave the manifold; 2. adversarial examples constrained to the manifold, i.e., on-manifold adversarial examples, exist; 3. on-manifold adversarial examples are generalization errors, and on-manifold adversarial training boosts generalization; 4. regular robustness and generalization are not necessarily contradicting goals. These assumptions imply that both robust and accurate models are possible. However, different models (architectures, training strategies etc.) can exhibit different robustness and generalization characteristics. To confirm our claims, we present extensive experiments on synthetic data (with known manifold) as well as on EMNIST, Fashion-MNIST and CelebA.

Comments:	Conference on Computer Vision and Pattern Recognition 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1812.00740 [cs.CV]
	(or arXiv:1812.00740v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.00740

Submission history

From: David Stutz [view email]
[v1] Mon, 3 Dec 2018 14:04:35 UTC (3,251 KB)
[v2] Wed, 10 Apr 2019 10:25:38 UTC (3,691 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Disentangling Adversarial Robustness and Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Disentangling Adversarial Robustness and Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators