Attention Gated Networks: Learning to Leverage Salient Regions in Medical Images

Schlemper, Jo; Oktay, Ozan; Schaap, Michiel; Heinrich, Mattias; Kainz, Bernhard; Glocker, Ben; Rueckert, Daniel

arXiv:1808.08114 (cs)

[Submitted on 22 Aug 2018 (v1), last revised 20 Jan 2019 (this version, v2)]

Abstract: We propose a novel attention gate (AG) model for medical image analysis that automatically learns to focus on target structures of varying shapes and sizes. Models trained with AGs implicitly learn to suppress irrelevant regions in an input image while highlighting salient features useful for a specific task. This enables us to eliminate the necessity of using explicit external tissue/organ localisation modules when using convolutional neural networks (CNNs). AGs can be easily integrated into standard CNN models such as VGG or U-Net architectures with minimal computational overhead while increasing the model sensitivity and prediction accuracy. The proposed AG models are evaluated on a variety of tasks, including medical image classification and segmentation. For classification, we demonstrate the use case of AGs in scan plane detection for fetal ultrasound screening. We show that the proposed attention mechanism can provide efficient object localisation while improving the overall prediction performance by reducing false positives. For segmentation, the proposed architecture is evaluated on two large 3D CT abdominal datasets with manual annotations for multiple organs. Experimental results show that AG models consistently improve the prediction performance of the base architectures across different datasets and training sizes while preserving computational efficiency. Moreover, AGs guide the model activations to be focused around salient regions, which provides better insights into how model predictions are made. The source code for the proposed AG models is publicly available.
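The abstract does not spell out how an attention gate is computed, so the following is only a minimal sketch of one common additive formulation: at each spatial position, an image feature vector and a gating vector are projected, summed, passed through a ReLU, reduced to a scalar, and squashed by a sigmoid into a coefficient in (0, 1) that rescales the feature vector. The function name `attention_gate`, the helper `matvec`, and all weights (`W_x`, `W_g`, `b`, `psi`, `b_psi`) are hypothetical and hand-set here for illustration; in a real model they would be learned, and the gating signal would first be resampled to the feature grid, which this sketch assumes has already happened.

```python
import math


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def attention_gate(features, gating, W_x, W_g, b, psi, b_psi):
    """Apply an additive attention gate position by position.

    features, gating: lists of per-position vectors of equal length
    (assumption: the gating signal is already on the feature grid).
    Returns the rescaled features and the attention coefficients.
    """
    gated, alphas = [], []
    for x, g in zip(features, gating):
        # Project both inputs, add them, and apply a ReLU.
        hidden = [
            max(0.0, px + pg + bias)
            for px, pg, bias in zip(matvec(W_x, x), matvec(W_g, g), b)
        ]
        # Reduce to one scalar score and squash it into (0, 1).
        score = sum(p * h for p, h in zip(psi, hidden)) + b_psi
        alpha = 1.0 / (1.0 + math.exp(-score))
        alphas.append(alpha)
        # Salient positions keep their features; others are suppressed.
        gated.append([alpha * xi for xi in x])
    return gated, alphas


# Hypothetical toy inputs: two positions, two channels each.
identity = [[1.0, 0.0], [0.0, 1.0]]
features = [[1.0, 1.0], [1.0, 1.0]]
gating = [[2.0, 2.0], [-3.0, -3.0]]  # strong signal at position 0 only
gated, alphas = attention_gate(
    features, gating, identity, identity, [0.0, 0.0], [1.0, 1.0], -2.0
)
print(alphas)
```

In this toy setting the first position receives a coefficient close to 1 and the second a coefficient close to 0, which mirrors the suppression of irrelevant regions described in the abstract. The coefficients themselves can be visualised as a map, which is the sense in which gating can offer insight into where a model is looking.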

Comments:	Accepted for Medical Image Analysis (Special Issue on Medical Imaging with Deep Learning). arXiv admin note: substantial text overlap with arXiv:1804.03999, arXiv:1804.05338
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1808.08114 [cs.CV]
	(or arXiv:1808.08114v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1808.08114

Submission history

From: Jo Schlemper
[v1] Wed, 22 Aug 2018 19:17:23 UTC (5,689 KB)
[v2] Sun, 20 Jan 2019 00:29:45 UTC (3,264 KB)
