Label-driven weakly-supervised learning for multimodal deformable image registration

Hu, Yipeng; Modat, Marc; Gibson, Eli; Ghavami, Nooshin; Bonmati, Ester; Moore, Caroline M.; Emberton, Mark; Noble, J. Alison; Barratt, Dean C.; Vercauteren, Tom

doi:10.1109/ISBI.2018.8363756

Computer Science > Computer Vision and Pattern Recognition

arXiv:1711.01666 (cs)

[Submitted on 5 Nov 2017 (v1), last revised 24 Dec 2017 (this version, v2)]

Title:Label-driven weakly-supervised learning for multimodal deformable image registration

Authors:Yipeng Hu, Marc Modat, Eli Gibson, Nooshin Ghavami, Ester Bonmati, Caroline M. Moore, Mark Emberton, J. Alison Noble, Dean C. Barratt, Tom Vercauteren

View PDF

Abstract:Spatially aligning medical images from different modalities remains a challenging task, especially for intraoperative applications that require fast and robust algorithms. We propose a weakly-supervised, label-driven formulation for learning 3D voxel correspondence from higher-level label correspondence, thereby bypassing classical intensity-based image similarity measures. During training, a convolutional neural network is optimised by outputting a dense displacement field (DDF) that warps a set of available anatomical labels from the moving image to match their corresponding counterparts in the fixed image. These label pairs, including solid organs, ducts, vessels, point landmarks and other ad hoc structures, are only required at training time and can be spatially aligned by minimising a cross-entropy function of the warped moving label and the fixed label. During inference, the trained network takes a new image pair to predict an optimal DDF, resulting in a fully-automatic, label-free, real-time and deformable registration. For interventional applications where large global transformation prevails, we also propose a neural network architecture to jointly optimise the global- and local displacements. Experiment results are presented based on cross-validating registrations of 111 pairs of T2-weighted magnetic resonance images and 3D transrectal ultrasound images from prostate cancer patients with a total of over 4000 anatomical labels, yielding a median target registration error of 4.2 mm on landmark centroids and a median Dice of 0.88 on prostate glands.

Comments:	Accepted to ISBI 2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1711.01666 [cs.CV]
	(or arXiv:1711.01666v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1711.01666
Related DOI:	https://doi.org/10.1109/ISBI.2018.8363756

Submission history

From: Yipeng Hu [view email]
[v1] Sun, 5 Nov 2017 22:01:57 UTC (718 KB)
[v2] Sun, 24 Dec 2017 22:23:19 UTC (717 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Label-driven weakly-supervised learning for multimodal deformable image registration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Label-driven weakly-supervised learning for multimodal deformable image registration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators