Dissociable neural representations of adversarially perturbed images in convolutional neural networks and the human brain

Zhang, Chi; Duan, Xiaohan; Wang, Linyuan; Li, Yongli; Yan, Bin; Hu, Guoen; Zhang, Ruyuan; Tong, Li

Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.09431 (cs)

[Submitted on 22 Dec 2018 (v1), last revised 20 Jul 2020 (this version, v3)]

Abstract: Despite the remarkable similarities between convolutional neural networks (CNNs) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that considerable differences remain between the two systems. Here, we leverage adversarial noise (AN) and adversarial interference (AI) images to quantify the consistency between neural representations and perceptual outcomes in the two systems. Humans can successfully recognize AI images as their corresponding categories but perceive AN images as meaningless noise. In contrast, CNNs can correctly recognize AN images but mistakenly classify AI images into wrong categories with surprisingly high confidence. We use functional magnetic resonance imaging to measure brain activity evoked by regular and adversarial images in the human brain, and compare it to the activity of artificial neurons in a prototypical CNN, AlexNet. In the human brain, we find that the representational similarity between regular and adversarial images largely echoes their perceptual similarity in all early visual areas. In AlexNet, however, the neural representations of adversarial images are inconsistent with network outputs in all intermediate processing layers, providing no neural foundations for perceptual similarity. Furthermore, we show that voxel-encoding models trained on regular images can successfully generalize to the neural responses to AI images but not AN images. These remarkable differences between the human brain and AlexNet in the representation-perception relation suggest that future CNNs should emulate both the behavior and the internal neural representations of the human brain.
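As a rough illustration of what "representational similarity" between image types can mean, the sketch below correlates response patterns across units (voxels or artificial neurons). It is not the authors' analysis pipeline: the Pearson metric, the function name `pearson`, the pattern size, and the synthetic patterns are all assumptions chosen for illustration. The synthetic data are constructed so that the "AI-like" pattern resembles the regular one and the "AN-like" pattern does not, mirroring the qualitative human-brain result described in the abstract.

```python
import math
import random


def pearson(a, b):
    """Pearson correlation between two equal-length response patterns."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


random.seed(0)
n_units = 200  # hypothetical number of voxels or artificial neurons

# Synthetic response pattern evoked by a regular image (assumption).
regular = [random.gauss(0, 1) for _ in range(n_units)]

# Hypothetical "AI-like" pattern: the regular pattern plus small noise.
ai_like = [r + 0.3 * random.gauss(0, 1) for r in regular]

# Hypothetical "AN-like" pattern: unrelated to the regular pattern.
an_like = [random.gauss(0, 1) for _ in range(n_units)]

sim_regular_ai = pearson(regular, ai_like)
sim_regular_an = pearson(regular, an_like)
print("regular vs AI-like:", round(sim_regular_ai, 3))
print("regular vs AN-like:", round(sim_regular_an, 3))
```

With real data, `regular`, `ai_like`, and `an_like` would be replaced by measured responses from one visual area or one network layer, and the two similarities compared against the perceptual or output-level similarity for that system.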

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:1812.09431 [cs.CV]
	(or arXiv:1812.09431v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.09431

Submission history

From: Chi Zhang
[v1] Sat, 22 Dec 2018 01:56:04 UTC (1,633 KB)
[v2] Fri, 17 Jul 2020 04:47:10 UTC (1,406 KB)
[v3] Mon, 20 Jul 2020 01:28:40 UTC (1,406 KB)
