Semantic segmentation of surgical hyperspectral images under geometric domain shifts

Sellner, Jan; Seidlitz, Silvia; Studier-Fischer, Alexander; Motta, Alessandro; Özdemir, Berkin; Müller-Stich, Beat Peter; Nickel, Felix; Maier-Hein, Lena

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2303.10972 (eess)

[Submitted on 20 Mar 2023 (v1), last revised 18 Sep 2023 (this version, v2)]

Title:Semantic segmentation of surgical hyperspectral images under geometric domain shifts

Authors:Jan Sellner, Silvia Seidlitz, Alexander Studier-Fischer, Alessandro Motta, Berkin Özdemir, Beat Peter Müller-Stich, Felix Nickel, Lena Maier-Hein

View PDF

Abstract:Robust semantic segmentation of intraoperative image data could pave the way for automatic surgical scene understanding and autonomous robotic surgery. Geometric domain shifts, however, although common in real-world open surgeries due to variations in surgical procedures or situs occlusions, remain a topic largely unaddressed in the field. To address this gap in the literature, we (1) present the first analysis of state-of-the-art (SOA) semantic segmentation networks in the presence of geometric out-of-distribution (OOD) data, and (2) address generalizability with a dedicated augmentation technique termed "Organ Transplantation" that we adapted from the general computer vision community. According to a comprehensive validation on six different OOD data sets comprising 600 RGB and hyperspectral imaging (HSI) cubes from 33 pigs semantically annotated with 19 classes, we demonstrate a large performance drop of SOA organ segmentation networks applied to geometric OOD data. Surprisingly, this holds true not only for conventional RGB data (drop of Dice similarity coefficient (DSC) by 46 %) but also for HSI data (drop by 45 %), despite the latter's rich information content per pixel. Using our augmentation scheme improves on the SOA DSC by up to 67 % (RGB) and 90 % (HSI) and renders performance on par with in-distribution performance on real OOD test data. The simplicity and effectiveness of our augmentation scheme makes it a valuable network-independent tool for addressing geometric domain shifts in semantic scene segmentation of intraoperative data. Our code and pre-trained models are available at this https URL.

Comments:	The first two authors (Jan Sellner and Silvia Seidlitz) contributed equally to this paper
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
ACM classes:	I.2.10; I.4.6; J.3
Cite as:	arXiv:2303.10972 [eess.IV]
	(or arXiv:2303.10972v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2303.10972

Submission history

From: Silvia Seidlitz [view email]
[v1] Mon, 20 Mar 2023 09:50:07 UTC (24,761 KB)
[v2] Mon, 18 Sep 2023 01:31:14 UTC (24,448 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Semantic segmentation of surgical hyperspectral images under geometric domain shifts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Semantic segmentation of surgical hyperspectral images under geometric domain shifts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators