
Received September 11, 2019, accepted September 26, 2019, date of publication October 1, 2019, date of current version October 16, 2019.


Digital Object Identifier 10.1109/ACCESS.2019.2944958

Organ at Risk Segmentation in Head and Neck CT Images Using a Two-Stage Segmentation Framework Based on 3D U-Net

YUEYUE WANG, LIANG ZHAO, MANNING WANG, AND ZHIJIAN SONG
Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200433, China
Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Shanghai 200032, China
Corresponding authors: Manning Wang ([email protected]) and Zhijian Song ([email protected])

ABSTRACT Accurate segmentation of organs at risk (OARs) plays a critical role in the treatment planning
of image-guided radiotherapy of head and neck cancer. This segmentation task is challenging for both
humans and automated algorithms because of the relatively large number of OARs to be segmented, the large
variability in size and morphology across different OARs, and the low contrast between some OARs and the
background. In this study, we propose a two-stage segmentation framework based on 3D U-Net. In this
framework, the segmentation of each OAR is decomposed into two subtasks: locating a bounding box of the
OAR and segmenting the OAR from a small volume within the bounding box, and each subtask is fulfilled by
a dedicated 3D U-Net. The decomposition makes each subtask much easier so that it can be better completed.
We evaluated the proposed method and compared it to state-of-the-art methods using the Medical Image
Computing and Computer-Assisted Intervention 2015 Challenge dataset. In terms of the boundary-based
metric 95% Hausdorff distance, the proposed method ranked first for seven of nine OARs and ranked second
for the other two OARs. In terms of the area-based metric dice similarity coefficient, the proposed method ranked
first for five of nine OARs and ranked second for the other three OARs with a small difference from the
method that ranked first.

INDEX TERMS 3D U-Net, CT images, head and neck, organ at risk segmentation.

I. INTRODUCTION
Head and neck (HaN) cancer is one of the most common cancers, with more than half a million cases worldwide per year [1]. Image-guided radiation therapy (IGRT), including intensity-modulated radiation therapy (IMRT) and volumetric modulated arc therapy, is a state-of-the-art treatment option because of its highly conformal dose delivery [2]–[4]. The key to the success of IGRT is patient-specific treatment planning, in which medical images are used to make a radiation plan that concentrates the radiation dose on the target volume while minimizing the dose to the surrounding organs at risk (OARs). Therefore, it is essential to segment the OARs in the treatment planning images, which usually include HaN computed tomography (CT) images. In current clinical practice, OARs are usually delineated manually, but the complexity and variability of OAR morphology in HaN CT images make this an inaccurate and very time-consuming task [5], [6]. It may take a radiologist three hours to segment all OARs for treatment planning [5]. Some treatment planning systems have an automatic segmentation function, such as atlas-based segmentation [7], but the segmentation results have not met clinical needs. Intensive labor is still needed to manually adjust the segmentation results before they are applicable to treatment planning, and the time needed for this adjustment is comparable to manual segmentation from scratch [6]. Therefore, there is a great demand for a rapid, accurate, and automatic OAR segmentation method to reduce radiologist labor in HaN treatment planning.

Medical image segmentation is an area of intense research, and many methods for segmenting different targets from medical images of different modalities have been proposed. Some of these methods have also been applied to OAR segmentation, but unfortunately, the current results are far from satisfactory. A Head and Neck Auto Segmentation Challenge was held in conjunction with the Medical Image Computing and Computer-Assisted Intervention (MICCAI) conference in 2015 (referred to as the ‘‘MICCAI 2015 Challenge’’ from here on), which provided a public dataset for OAR segmentation in HaN CT images [8].

The associate editor coordinating the review of this manuscript and approving it for publication was Junxiu Liu.

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/

Six teams participated in this challenge and completed the task using different segmentation methods, including the statistical shape model, the active appearance model, the multiatlas-based segmentation method, and a semiautomatic segmentation method [8], but their segmentation results were not satisfactory to radiologists. The challenges of OAR segmentation in HaN CT images include: (i) the complexity and variability of the OARs are high, and it is difficult to incorporate prior information into shape models to support the segmentation of new images; (ii) the sizes of the OARs vary widely, and most segmentation methods obtain accurate results for bigger OARs but inaccurate results for smaller OARs; and (iii) the contrast of soft tissues is poor in CT images, which makes it difficult to segment some OARs, such as the brainstem.

Although the contrast between bone and soft tissues is relatively high in CT images, the characteristics of the HaN OAR segmentation task, including the large number of OARs to be segmented, the great variety in size and morphology of different OARs, and the low contrast between some OARs and their background, make it difficult for simple segmentation methods, such as thresholding, edge detection, and region growing, to succeed. Many methods that have been successfully used in other medical image segmentation tasks, such as the 3D level set [9] and atlas-based techniques [10], have also been applied in this field, but the results are not satisfactory.

Several approaches have been developed to incorporate prior knowledge, which often represents the results of gold standard segmentation of some subjects, to help segment new subjects, and these approaches have also been used in HaN OAR segmentation. For example, the method proposed in [11] built a statistical shape model of the OARs and deforms the model to fit the image to achieve segmentation. A multiatlas approach [12] registered the segmented images to the target image and then fused the labels of the segmented images to obtain a segmentation result for the target image. Another approach is to train a classifier with prior segmented images and transform the segmentation task into a classification task [13]. In the MICCAI 2015 Challenge, most teams adopted approaches such as the statistical shape model, the active appearance model, and the multiatlas-based method to utilize prior knowledge. This challenge provided a unified evaluation framework for different methods of OAR segmentation.

In recent years, deep learning methods, especially the convolutional neural network (CNN), have demonstrated excellent performance in medical image segmentation tasks [14]–[19], and CNNs have also been applied to OAR segmentation in HaN CT images [20]–[23]. The first study [20] using deep learning methods proposed a 2D CNN for OAR segmentation from in-house HaN CT images, but it only obtained a slight improvement for the right submandibular gland and right optic nerve, and its performance for the other OARs was similar to that of traditional methods. In [21], an interleaved 3D CNN method was proposed to jointly segment the optic nerve and chiasm. It used an atlas-based method to locate a bounding box enclosing the target OAR and then performed segmentation in a small target volume. Zhu et al. [22] proposed AnatomyNet, an end-to-end and atlas-free three-dimensional squeeze-and-excitation U-Net (3D SE U-Net), for fast and fully automated whole-volume HaN anatomical segmentation. Tong et al. [23] proposed a fully convolutional neural network with a shape representation model for multi-organ segmentation for HaN cancer radiotherapy. However, these existing deep-learning-based methods generally produced accurate segmentation maps for large organs, while the accuracy for small OARs was often sacrificed.

To separate the segmentation of large and small OARs, we adopt a two-stage framework for OAR localization and segmentation. Recently, two-stage frameworks and U-Net have shown outstanding performance in various medical image computing tasks [24]–[30]. In this study, we propose a two-stage framework that decomposes OAR segmentation into two relatively simpler tasks and completes each task with a dedicated 3D U-Net. The first task is to locate the target OAR with a bounding box, and the second task is to segment the target OAR within the bounding box. Decomposing the task in this way makes it simpler than directly segmenting the OARs from the entire volume and improves the segmentation performance. Experiments using the MICCAI 2015 Challenge data showed that the proposed method achieved the highest dice similarity coefficient (DSC) for six of the nine OARs and the second highest DSC for the other three OARs. In addition, the proposed method achieved the smallest 95% Hausdorff distance (95HD) for seven of the nine OARs with a significant benefit and the second smallest 95HD for the other two OARs.

II. MATERIALS AND METHODS
A. THE MICCAI 2015 CHALLENGE DATASET
In this study, we evaluated the proposed OAR segmentation framework and compared it to other methods using the PDDCA dataset, which is publicly available at http://www.imagenglab.com/newsite/pddca/. This dataset was provided by Dr. Gregory C. Sharp and was used in the Head and Neck Auto-Segmentation Challenge 2015, a satellite event at the MICCAI 2015 conference. The current version (v 1.4.1) of the PDDCA dataset consists of 25 training images, 8 additional training images, and 15 testing images. The original images are from the RTOG 0522 clinical trial [18], which provides 111 HaN CT images for treatment planning. The subset was chosen to ensure that the image quality is adequate and that the target OARs have minimal overlap with the tumors. Each image consists of a series of axial slices with 512 × 512 voxels on each slice, and the number of slices varies from 76 to 263. The in-plane spacing is between 0.76 mm × 0.76 mm and 1.27 mm × 1.27 mm, and the inter-plane spacing is between 1.25 and 3.00 mm.

In this dataset, nine anatomical structures, namely, the brainstem, optic nerve left, optic nerve right, chiasm, parotid left, parotid right, mandible, submandibular left, and submandibular right, were used as segmentation targets.


FIGURE 1. Examples of OARs of a patient in different slices of a CT scan; the OARs are manually annotated and shown in different colors.

Examples of the OARs of a patient in different slices of a CT scan are shown in Fig. 1. All nine of these structures are important OARs in HaN radiotherapy [19], and they were manually segmented by experts to provide high quality and consistency. The masks for most of these structures are provided in all 33 training images, except that the mandible and the left and right submandibular glands are only segmented in 25, 26, and 21 training images, respectively. The masks for all nine structures are provided in the 15 testing images and used as the gold standard for evaluation.

B. OVERVIEW OF THE TWO-STAGE SEGMENTATION FRAMEWORK
The proposed two-stage segmentation framework and its training and testing flowcharts are illustrated in Fig. 2. The framework consists of two 3D U-Nets. The original images and masks are first cropped to a volume with a consistent size of 384 × 384 × 224 for further processing.

The first 3D U-Net, denoted as LocNet, is used to coarsely locate the target structure with a bounding box. The cropped images and masks are first downsampled to a resolution of 96 × 96 × 56 voxels and used for training LocNet. LocNet outputs a 0–1 classification for each voxel, indicating whether the voxel falls in the bounding box. A post-processing step is used to generate a bounding box of size (h/4) × (w/4) × (k/4) from the output of LocNet, and the bounding box is transferred back to the coordinate frame of the cropped volume. Then, the bounding box is applied to the cropped volume to obtain a smaller volume of size h × w × k, which is the target volume. One LocNet is trained for each target structure, which requires a bounding box of a specific size.

The second 3D U-Net, denoted as SegNet, is used to segment the target structure from the target volume obtained in the previous step. The target volume has a size of h × w × k, which is much smaller than the 384 × 384 × 224 cropped volume, and only one structure is segmented from it. These two characteristics make the segmentation performed by SegNet much easier. The output of SegNet is a mask volume with each voxel being 0 or 1, indicating background and target voxels, respectively.

LocNet and SegNet are trained separately; one LocNet and one SegNet are trained for each of the nine structures. In Sections II-C and II-D, we introduce the preprocessing needed to prepare the training and testing data for the two 3D U-Nets and the concrete training and testing procedures.
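As a minimal sketch of how the two stages fit together at test time for one OAR, the code below assumes trained Keras models loc_net and seg_net with single-channel sigmoid outputs, uses simple stride-4 downsampling in place of whatever resampling is actually applied, and relies on a helper locate_box that turns the LocNet output into a bounding-box corner (one possible implementation is sketched at the end of Section II-D). All of these names are illustrative and not part of any released code.

```python
import numpy as np

def segment_oar(cropped_volume, loc_net, seg_net, box_size):
    """Two-stage inference for one OAR on a 384 x 384 x 224 cropped volume."""
    h, w, k = box_size                                   # bounding box size from Table 1

    # Stage 1: coarse localization on a volume downsampled by a factor of 4.
    small = cropped_volume[::4, ::4, ::4]                # (96, 96, 56); stride-4 stands in for resampling
    loc_out = loc_net.predict(small[None, ..., None])[0, ..., 0] > 0.5
    corner = locate_box(loc_out, (h // 4, w // 4, k // 4)) * 4
    # Keep the box inside the cropped volume.
    corner = np.minimum(corner, np.array(cropped_volume.shape) - np.array(box_size))

    # Stage 2: fine segmentation of the h x w x k target volume inside the box.
    i, j, l = corner
    target = cropped_volume[i:i + h, j:j + w, l:l + k]
    mask = seg_net.predict(target[None, ..., None])[0, ..., 0] > 0.5

    # Paste the binary mask back into the coordinate frame of the cropped volume.
    full = np.zeros(cropped_volume.shape, dtype=bool)
    full[i:i + h, j:j + w, l:l + k] = mask
    return full
```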


FIGURE 2. Flowchart of the proposed segmentation framework.

C. PREPROCESSING
1) INTERPOLATING AND CROPPING THE ORIGINAL IMAGES
The original images have different in-plane and inter-plane resolutions, which increases the variance of the shape and size of each structure and potentially increases the difficulty of segmenting them. Therefore, we resampled all the images into isotropic volumes with the same spatial resolution of 1 mm × 1 mm × 1 mm using bi-cubic interpolation. After interpolation, the in-plane size of all training and testing images was between 389 × 389 and 650 × 650 voxels, and the number of slices was between 226 and 416.

Because the input size of SegNet and LocNet needs to be a multiple of eight, we need to crop the isotropic volumes after interpolation. Considering the sizes of the isotropic volumes in this dataset and the requirement that the size in each direction should be a multiple of eight, we automatically cropped the images into a 384 × 384 × 224 volume. However, we did not manually crop the training and testing images to place the target structures at the center of the cropped volume. Instead, we divided the nine target structures into two groups and adopted a consistent cropping strategy for each group. The first group consisted of the brainstem, optic chiasm, and optic nerves (both left and right), and the second group consisted of the mandible, parotid glands (both left and right), and submandibular glands (both left and right). The X, Y, and Z axes of the coordinate frame of the original images corresponded to the left-right, anterior-posterior, and superior-inferior directions of the human body. We positioned the 384 × 384 × 224 cropping window on the original images with margins on both sides of the cropping window along each axis. The margins may differ between images because of differences in image size. Nevertheless, for all target structures, the ratio between the left and right margins along the X-axis was 0.5 to 0.5; the ratio between the anterior and posterior margins along the Y-axis was 0.3 to 0.7 and 0.2 to 0.8 for the structures in the first and second groups, respectively; and the ratio between the superior and inferior margins along the Z-axis was 0.9 to 0.1 and 0.7 to 0.3 for the structures in the first and second groups, respectively. Each target structure is not necessarily located at the center of the cropped volume, because a dedicated network will be used to locate it. For the structures in each group, cropping is performed automatically on both the training and testing images with the same parameters.
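As a concrete illustration of this preprocessing, the sketch below resamples a CT volume to 1 mm isotropic voxels and applies the ratio-based cropping. Scipy's cubic spline interpolation (order=3) stands in for the bi-cubic interpolation mentioned above, the array axes are assumed to be ordered (X, Y, Z) as described, and the zero-padding of volumes smaller than the cropping window is an added assumption.

```python
import numpy as np
from scipy.ndimage import zoom

CROP_SIZE = (384, 384, 224)   # a multiple of eight in every direction

def resample_isotropic(volume, spacing_mm):
    """Resample to 1 mm x 1 mm x 1 mm voxels with cubic spline interpolation."""
    return zoom(volume, np.asarray(spacing_mm, dtype=float), order=3)

def crop_volume(volume, start_ratios):
    """Crop to CROP_SIZE, splitting the margin along each axis by start_ratios.

    First group (brainstem, chiasm, optic nerves): (0.5, 0.3, 0.9);
    second group (mandible, parotid and submandibular glands): (0.5, 0.2, 0.7).
    """
    out = np.zeros(CROP_SIZE, dtype=volume.dtype)
    starts, stops = [], []
    for size, crop, ratio in zip(volume.shape, CROP_SIZE, start_ratios):
        margin = max(size - crop, 0)                  # voxels to discard along this axis
        starts.append(int(round(margin * ratio)))     # share of the margin placed before the window
        stops.append(starts[-1] + min(crop, size))
    region = volume[starts[0]:stops[0], starts[1]:stops[1], starts[2]:stops[2]]
    out[:region.shape[0], :region.shape[1], :region.shape[2]] = region
    return out
```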
2) DETERMINING THE SIZE OF THE BOUNDING BOX FOR EACH STRUCTURE
In the 384 × 384 × 224 cropped volume, we first located a bounding box enclosing the target structure and called the volume data within the bounding box the target volume. We needed to determine the size of the bounding box for each structure before locating it. Because the target volume is the input of SegNet, its size in each direction should also be a multiple of eight. In this study, we determined the size of the target volume for each structure by considering the size of the structure in the training dataset (Table 1).

TABLE 1. Size of the bounding box for each target structure.
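The paper derives the sizes in Table 1 from the structure sizes in the training set but does not give an explicit rule; the sketch below is one plausible recipe, assuming the largest extent of the structure over all training masks plus a safety margin, rounded up to the next multiple of eight (the stated constraint on SegNet's input). The margin value is an assumption.

```python
import numpy as np

def bounding_box_size(structure_masks, margin=8):
    """structure_masks: binary 3D arrays of one OAR over the training set."""
    max_extent = np.zeros(3, dtype=int)
    for mask in structure_masks:
        coords = np.argwhere(mask)
        if coords.size == 0:
            continue
        extent = coords.max(axis=0) - coords.min(axis=0) + 1
        max_extent = np.maximum(max_extent, extent)
    padded = max_extent + margin                       # leave room around the structure
    return tuple(int(-(-s // 8) * 8) for s in padded)  # round up to a multiple of eight
```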
D. TWO-STAGE 3D U-Net SEGMENTATION FRAMEWORK
In this study, we concatenated two 3D U-Nets to segment a target structure: the first 3D U-Net locates a relatively small target volume that encloses the target structure, and the second 3D U-Net segments the target structure from that target volume. The first and second networks are called LocNet and SegNet, respectively. As shown in Fig. 3, LocNet and SegNet have the same network structure, consisting of an analysis path and a synthesis path. In the analysis path, each layer contains two 3 × 3 × 3 convolutions, each followed by batch normalization (BN) and a rectified linear unit (ReLU), and then a 2 × 2 × 2 max pooling with a stride of two in each dimension. In the synthesis path, each layer consists of a 2 × 2 × 2 up-convolution with a stride of two in each dimension, followed by two 3 × 3 × 3 convolutions, each followed by BN and a ReLU. Shortcut connections from layers of equal resolution in the analysis path provide essential high-resolution features for the synthesis path. In the final layer, a 1 × 1 × 1 convolution is used to reduce the number of output channels to a 0–1 classification. In total, each network has 17 convolutional layers.
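The sketch below shows one way to assemble such a network in Keras, the framework used in Section III-B. The number of resolution levels and the channel widths are assumptions, since the paper only states that each network has 17 convolutional layers in total, and the single-channel sigmoid output trained with binary cross-entropy is one plausible reading of the ''0–1 classification'' and the cross-entropy loss described in Section III-B.

```python
from tensorflow.keras import layers, models, optimizers

def conv_block(x, filters):
    # Two 3x3x3 convolutions, each followed by batch normalization and ReLU.
    for _ in range(2):
        x = layers.Conv3D(filters, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.Activation("relu")(x)
    return x

def build_unet(input_shape, base_filters=16):
    inputs = layers.Input(shape=input_shape + (1,))
    skips, x = [], inputs
    # Analysis path: conv block, then 2x2x2 max pooling with stride 2.
    for level in range(3):
        x = conv_block(x, base_filters * 2 ** level)
        skips.append(x)
        x = layers.MaxPooling3D(pool_size=2)(x)
    x = conv_block(x, base_filters * 8)                      # bottom of the U
    # Synthesis path: 2x2x2 up-convolution, concatenation with the skip, conv block.
    for level in reversed(range(3)):
        x = layers.Conv3DTranspose(base_filters * 2 ** level, 2, strides=2, padding="same")(x)
        x = layers.concatenate([x, skips[level]])
        x = conv_block(x, base_filters * 2 ** level)
    outputs = layers.Conv3D(1, 1, activation="sigmoid")(x)   # 0-1 voxel classification
    model = models.Model(inputs, outputs)
    # Training setup from Section III-B: cross entropy minimized with Adam defaults.
    model.compile(optimizer=optimizers.Adam(), loss="binary_crossentropy")
    return model

loc_net = build_unet((96, 96, 56))   # LocNet input; SegNet uses the target-volume size
```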


FIGURE 3. Structures of the two-stage 3D U-Net frameworks.

The 384 × 384 × 224 cropped volume is first downsampled to a size of 96 × 96 × 56 and then input into LocNet. The output of LocNet is a 96 × 96 × 56 binary volume, from which we locate the bounding box. The cropped volume is downsampled by a factor of four; thus, the bounding boxes that we want to locate in the 96 × 96 × 56 output volume also shrink by a factor of four. For example, the bounding box of the mandible is 144 × 144 × 112 voxels, so we need to locate a bounding box of size 36 × 36 × 28 in the 96 × 96 × 56 output volume. It is very unlikely that all the voxels with a value of 1 fall within a cuboid of the expected size. Here, we used a sliding-window technique to locate the expected bounding box. We slid a cuboid of the expected size over the output volume and regarded the location at which the cuboid encloses the maximum number of voxels with value 1 as the true location of the bounding box. When multiple locations have the same maximum number, the average location is used.
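A possible implementation of this sliding-cuboid search is sketched below. It counts the foreground voxels inside every placement of the expected cuboid using a 3D integral image and averages tied maxima, as described above. Treating the LocNet output as a NumPy array and returning the box corner in the downsampled frame are assumptions about the surrounding code.

```python
import numpy as np

def locate_box(loc_out, box_size):
    """Return the corner of the cuboid placement enclosing the most foreground voxels.

    loc_out: binary (96, 96, 56) LocNet output; box_size: expected box size in that frame
    (i.e. the full-resolution bounding box divided by the downsampling factor of four).
    """
    bh, bw, bk = box_size
    a = loc_out.astype(np.int64)
    # Integral image: S[i, j, k] = sum of a[:i, :j, :k].
    S = np.pad(a, ((1, 0), (1, 0), (1, 0))).cumsum(0).cumsum(1).cumsum(2)
    H, W, K = a.shape
    i0, j0, k0 = (np.arange(H - bh + 1), np.arange(W - bw + 1), np.arange(K - bk + 1))
    I0, J0, K0 = np.meshgrid(i0, j0, k0, indexing="ij")
    I1, J1, K1 = I0 + bh, J0 + bw, K0 + bk
    # Inclusion-exclusion gives the foreground count inside every window placement.
    counts = (S[I1, J1, K1] - S[I0, J1, K1] - S[I1, J0, K1] - S[I1, J1, K0]
              + S[I0, J0, K1] + S[I0, J1, K0] + S[I1, J0, K0] - S[I0, J0, K0])
    best = np.argwhere(counts == counts.max())
    # When several placements tie, use the average location.
    return best.mean(axis=0).round().astype(int)
```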
III. EXPERIMENTS AND RESULTS
According to the regulations of the Head and Neck Auto-Segmentation Challenge 2015, we used all 33 training images in the dataset to train LocNet and SegNet and tested the segmentation framework on the 15 testing images. Four metrics were calculated to evaluate the performance of the proposed segmentation framework. We compared the proposed method with several state-of-the-art methods, including both traditional and artificial-intelligence-based approaches. Finally, we demonstrated the efficiency of the network by comparing the proposed method with two traditional approaches used to segment 3D medical images with a deep learning framework.

A. EVALUATION METRICS
We used four evaluation metrics in this study.

1. Dice similarity coefficient (DSC). The DSC measures the degree of overlap between the segmentation result and the gold standard and is defined as follows:

\mathrm{DSC} = \frac{2|A \cap B|}{|A| + |B|} \qquad (1)

where A and B represent the voxel set of the segmentation result and the voxel set of the gold standard, respectively.

2. The 95% Hausdorff distance (95HD). Before defining the 95HD, we first need to define the Hausdorff distance, which is usually used to measure the deviation between the contours of two areas. Given two point sets X and Y, and d(x, y) measuring the Euclidean distance between two points x ∈ X and y ∈ Y, the directed Hausdorff distance is defined as follows:

\vec{d}_H(X, Y) = \max_{x \in X} \min_{y \in Y} d(x, y) \qquad (2)

The directed Hausdorff distance \vec{d}_H(X, Y) measures the largest distance from a point in X to its nearest neighbor in Y, and this distance is sensitive to large segmentation errors in a very small region. To eliminate this sensitivity, a 95% Hausdorff distance is calculated as the 95th percentile of the distances, denoted as \vec{d}_{H,95}(X, Y). In this study, we used the 95HD, which is calculated as follows:

\mathrm{95HD} = \bigl(\vec{d}_{H,95}(X, Y) + \vec{d}_{H,95}(Y, X)\bigr) / 2 \qquad (3)

3. Positive predictive value (PPV). The PPV is the proportion of the correctly segmented volume within the entire volume of the segmentation result:

\mathrm{PPV} = \frac{|A \cap B|}{|A|} \qquad (4)

4. Sensitivity (SEN). The SEN is the proportion of the correctly segmented volume within the entire volume of the gold standard:

\mathrm{SEN} = \frac{|A \cap B|}{|B|} \qquad (5)
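The four metrics can be computed directly from the binary masks. The sketch below follows Eqs. (1)–(5), extracting the mask surfaces by binary erosion and using a k-d tree for the nearest-neighbour distances in the 95HD; the default 1 mm isotropic spacing and the surface-voxel definition of the point sets X and Y are assumptions.

```python
import numpy as np
from scipy import ndimage
from scipy.spatial import cKDTree

def surface_voxels(mask):
    # Boundary voxels of a binary mask: the mask minus its erosion.
    mask = mask.astype(bool)
    return np.argwhere(mask & ~ndimage.binary_erosion(mask))

def dsc(pred, gold):                     # Eq. (1)
    inter = np.logical_and(pred, gold).sum()
    return 2.0 * inter / (pred.sum() + gold.sum())

def ppv(pred, gold):                     # Eq. (4)
    return np.logical_and(pred, gold).sum() / pred.sum()

def sen(pred, gold):                     # Eq. (5)
    return np.logical_and(pred, gold).sum() / gold.sum()

def hd95(pred, gold, spacing=(1.0, 1.0, 1.0)):
    # Symmetric 95% Hausdorff distance between the two contours, Eqs. (2)-(3).
    x = surface_voxels(pred) * np.asarray(spacing)
    y = surface_voxels(gold) * np.asarray(spacing)
    d_xy = cKDTree(y).query(x)[0]        # distance from each point of X to Y
    d_yx = cKDTree(x).query(y)[0]        # distance from each point of Y to X
    return (np.percentile(d_xy, 95) + np.percentile(d_yx, 95)) / 2.0
```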


TABLE 2. Average (± standard deviation) performance of our method with and without interpolation (INT) for each structure.

B. EXPERIMENTAL SETTINGS
The proposed networks were implemented in Python using the Keras package [31], and the experiments were performed on a computer with a single GPU (NVIDIA GTX 1080 Ti) and a Linux Ubuntu 14.04 LTS 64-bit operating system.

We trained one LocNet and one SegNet for each of the nine OARs. The size of the training images for LocNet was 96 × 96 × 56, and the size of the training images for SegNet was determined by the size of the bounding box for each structure except the mandible (Table 1). The size of the bounding box for the mandible was 144 × 144 × 112, but its target volume was further downsampled to 144 × 144 × 56 because of the memory limitation. The mini-batch size in each epoch was 1. The cross-entropy loss of the logistic regression was adopted as the loss function for both LocNet and SegNet and was minimized by the Adam optimizer with the recommended parameters, and training was terminated after 200 iterations over the training images. For each OAR, we trained one LocNet and one SegNet; thus, we trained 18 networks for all nine OARs in this dataset, which took approximately 30 hours. In the testing stage, the segmentation of one OAR in one image took approximately 6 seconds, of which approximately 2 seconds was spent on the network processing of the image and approximately 4 seconds on the postprocessing of the output of LocNet. For some structures, the output of SegNet contained several small isolated regions that did not belong to the target structure. We adopted a simple postprocessing technique in which we deleted isolated regions whose volume was less than 10% of the total segmentation result.
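A sketch of this postprocessing step is given below, using scipy's connected-component labelling; the default 6-connectivity of ndimage.label is an assumption, as the connectivity used is not specified.

```python
import numpy as np
from scipy import ndimage

def remove_small_islands(mask, fraction=0.10):
    """Drop connected components smaller than `fraction` of the total segmented volume."""
    mask = mask.astype(bool)
    labels, n = ndimage.label(mask)
    if n == 0:
        return mask
    sizes = ndimage.sum(mask, labels, index=np.arange(1, n + 1))   # voxels per component
    keep_labels = 1 + np.flatnonzero(sizes >= fraction * mask.sum())
    return np.isin(labels, keep_labels)
```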
C. SEGMENTATION RESULTS OF THE PROPOSED METHOD
We evaluated the performance of our method using the two-stage segmentation framework and the interpolated isotropic images. In addition, we tested the proposed segmentation framework on the original images without interpolation. Without interpolation, the original images were not cropped; they were directly downsampled to a size of 128 × 128 × 64 as the input to LocNet. The size of the bounding box for some structures was different from that used for the interpolated images, but the same size was used across all images. The processing after obtaining the target volume was the same as for the interpolated images. In each experiment, the DSC, 95HD, PPV, and SEN were calculated for each OAR, and the results are listed in Table 2, in which the OARs are ordered by decreasing volume. The proposed method demonstrated good segmentation accuracy for large OARs, and its performance decreased with decreasing OAR volume when the volume-related metrics, including DSC, PPV, and SEN, were considered. However, this observation did not hold for the contour-based metric, the 95HD. Overall, the mean and standard deviation of the 95HDs for all the OARs were small, indicating that the segmentation method found the correct contour in most areas for each structure. The difference in performance reflected by the volume-based and contour-based metrics arises because similar levels of error on the contour can result in large errors for small structures and small errors for large structures when computing volume-based metrics, since volume-based metrics use the volume of the structure as the denominator.

For most OARs, the segmentation results with interpolation were superior to those without interpolation, especially when the volume of the OAR was small. One possible reason for the decreased accuracy with interpolation in some cases is that the interpolated images have a lower in-plane resolution than the original images. We interpolated the original images to 1 mm resolution in each dimension because of the memory limitation. Using a higher resolution for the interpolated images may further improve the accuracy of segmentation, not only for large OARs but also for small OARs.

Fig. 4 illustrates the segmentation results for subject 0522c0857 with and without interpolation. The mandible was segmented more accurately without interpolation, while the submandibular gland, optic nerve, and chiasm were segmented better with interpolation. To illustrate the overall performance of the proposed method, we show the segmentation results for subjects 0522c576, 0522c0667, and 0522c0857 with interpolation in Fig. 5.


FIGURE 4. Segmentation results for subject 0522c0857. The first and second rows show the segmentation results
without and with interpolation, respectively. From left to right: the 85th, 92nd, 102nd, 112th, 118th, and 120th slices of
the axial view. The gold standard results are depicted in green, and our results are depicted in red.

FIGURE 5. Segmentation results for subjects 0522c576 (row a), 0522c0667 (row b), and 0522c0857 (row c). The gold
standard results are depicted in green, and our results are depicted in red.

FIGURE 6. Example of typical good (upper row) and bad (lower row) segmentation results for the nine subjects (left to right:
mandible, left parotid gland, right parotid gland, brainstem, left submandibular gland, right submandibular gland, left optic
nerve, right optic nerve, and chiasm). The gold standard results are depicted in green, and our results are depicted in red.

In addition, for each of the nine OARs, we chose one good segmentation result and one bad segmentation result and show the corresponding slices in Fig. 6. We believe the bad segmentation results may be caused by the following three reasons. 1) Lack of training data: with only 33 training images, there were not enough data for network training, and the network could easily overfit. 2) Low contrast: the boundaries of some OARs are unclear and are difficult to segment even for experienced radiologists, let alone neural networks. 3) Inaccurate localization: if an OAR cannot be localized accurately from the whole volume by LocNet, it cannot be segmented accurately by SegNet.


TABLE 3. Average (± standard deviation) DSCs for the competing methods.

TABLE 4. Average (± standard deviation) 95HDs for the competing methods (unit: mm).

D. COMPARISON OF ACCURACY AGAINST STATE-OF-THE-ART METHODS
It is difficult to compare different methods of OAR segmentation in HaN CT images because of the differences in the datasets, OARs, and evaluation metrics used in different studies, whereas the MICCAI 2015 Challenge provides a unified evaluation framework. We therefore first compare the proposed method (with interpolation) with the four methods that ranked top in the challenge. Among these four methods, UC [32] provided DSCs for all nine OARs but no 95HDs, IM [33] provided DSCs and 95HDs for three OARs, and UB [11] and VU [34] provided DSCs and 95HDs for all nine OARs. Tables 3 and 4 list the DSCs and 95HDs for our method and the four competing methods, respectively. As shown in Table 4, our method outperforms the competing methods in terms of the 95HD by a large margin for seven of the nine OARs. In terms of the DSC, our method ranks first for five of the nine OARs and second for the other three OARs.

In addition to the four methods that can be directly compared, some other studies have used different datasets or the same dataset with a different training-testing grouping scheme. The first study [20] using deep learning methods to segment OARs in HaN CT images provided DSCs for 13 OARs. Eight of these OARs were used in the MICCAI 2015 Challenge (except the brainstem). We cannot directly compare the results of our method with those of [20] because that study used a different set of data. Nevertheless, the DSC of our method is higher than that of [20] for seven OARs (6.0% higher on average). Additionally, that method needs a doctor to determine the approximate location of each OAR to be segmented. In [13], a hierarchical vertex regression-based segmentation method was proposed, and the DSCs for the brainstem, mandible, and parotid gland were 0.9±0.04, 0.94±0.01, and 0.84±0.06, respectively. However, this method was only evaluated by two-fold cross validation on the 33 training images and not on the 15 testing images, and segmentation results for the other structures were not provided. In [21], an interleaved 3D CNN method was proposed to jointly segment the optic nerve and chiasm. The DSCs for the left optic nerve, right optic nerve, and chiasm were 0.72±0.08, 0.70±0.09, and 0.58±0.17, respectively. This method was designed to segment small targets and was not applied to the other OARs in the MICCAI 2015 Challenge dataset. Furthermore, it utilized a joint segmentation scheme, whereas we segmented each OAR separately. In [22], the authors used an end-to-end and atlas-free three-dimensional squeeze-and-excitation U-Net (3D SE U-Net) for fast and fully automated whole-volume HaN anatomical segmentation. The DSCs for the brainstem, chiasm, mandible, optic nerve left, optic nerve right, parotid left, parotid right, submandibular left, and submandibular right were 0.867, 0.532, 0.925, 0.721, 0.706, 0.881, 0.874, 0.814, and 0.813, respectively. Although they used four datasets to train and test their model, our method still obtains similar DSCs to theirs. We tried to use SE blocks in our SegNet, but they failed to improve the segmentation accuracy; the results are listed in the supplemental file.


TABLE 5. Runtimes of the different methods.

E. COMPARISON OF RUNTIMES AGAINST STATE-OF-THE-ART METHODS
Runtime comparison is difficult because the code of the competing methods is not available, and we cannot run all the methods on the same computer. Nevertheless, in Table 5 we list the runtimes of VU, IM, UB, and Ibragimov [20] for segmenting all nine OARs of one subject, as given in the original papers. Segmenting all nine OARs with our method required approximately 108 s on average.

F. ROLE OF TARGET LOCALIZATION
To show the superiority of the proposed target localization network, we compare our method with the following three baseline methods.

1) JOINT LOCALIZATION
In the localization stage, we trained one network for one structure, so we trained nine networks for the nine structures. To demonstrate the superiority of one localization network per structure, we trained a joint localization network for all nine structures. The joint localization network has the same structure as the LocNet proposed in this paper, except that it outputs the locations of all nine structures. The output of the joint localization network was processed by the same post-processing method as for LocNet, and SegNet was used to segment each structure after joint localization.

2) DOWNSAMPLING
Because of the memory limit, it is challenging to place the entire 3D image volume into the GPU for training and testing. One solution is to downsample the original image to a manageable size. In the downsampling strategy used in this paper, we downsampled the training and testing images to a size of 96 × 96 × 56. Then, we used nine 3D U-Nets to separately segment the nine target structures from the downsampled image.

3) SLIDING-WINDOW
Another way to address the memory limit, adopted in many previous studies, is the sliding window [35], [36], which crops the original images into small blocks and performs the segmentation block by block. In the sliding-window strategy used in this paper, we cropped the original volume data into non-overlapping blocks of size 64 × 64 × 64. Of all the blocks, only a very small proportion contained the target structure, and thus we could not use all the blocks for training. Therefore, we kept all the blocks that contained the target structure and randomly chose the same number of blocks without the target structure for training SegNet. In the testing stage, we slid a window of size 64 × 64 × 64 over the whole volume with some overlap between neighboring windows and adopted maximum voting for each voxel to obtain the final segmentation result.
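For illustration, the sketch below shows how such a sliding-window prediction can be assembled at test time. The stride of 32 voxels (the text only states that neighboring windows overlap) and the interpretation of ''maximum voting'' as a per-voxel majority vote are assumptions, and model stands for a 3D U-Net trained on 64-voxel blocks.

```python
import numpy as np

def sliding_window_predict(volume, model, window=64, stride=32):
    """Block-wise prediction over the whole volume with per-voxel majority voting."""
    votes = np.zeros(volume.shape, dtype=np.int32)
    hits = np.zeros(volume.shape, dtype=np.int32)

    def starts(size):
        # Window start positions; the last window is shifted so it touches the border.
        last = max(size - window, 0)
        positions = list(range(0, last + 1, stride))
        if positions[-1] != last:
            positions.append(last)
        return positions

    for z in starts(volume.shape[0]):
        for y in starts(volume.shape[1]):
            for x in starts(volume.shape[2]):
                block = volume[z:z + window, y:y + window, x:x + window]
                pred = model.predict(block[None, ..., None])[0, ..., 0] > 0.5
                votes[z:z + window, y:y + window, x:x + window] += pred
                hits[z:z + window, y:y + window, x:x + window] += 1

    return votes * 2 > hits   # foreground where more than half of the windows agree
```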
The DSCs and 95HDs for the three baselines and the proposed method with interpolation are listed in Table 6. As shown in Table 6, our method outperforms the other methods in both DSC and 95HD. Although the joint localization method reduces the number of LocNets, it requires a single network to localize nine different structures, which is difficult for one model. Because of the localization errors of the joint localization method, its segmentation accuracy for the target structures is slightly lower than that of the proposed method. For the downsampling method, it is very difficult to distinguish small structures, such as the optic nerve and chiasm, in the downsampled images, and thus their segmentation accuracy was very low. For the sliding-window method, several parameters, such as the window size, step size, and ratio between the positive and negative samples for training, may influence the final result. We experimented with several combinations of parameters and kept the best one, but we cannot guarantee that the reported accuracy is the best possible result. The 95HDs for the sliding-window strategy were very large for most OARs because this method segments out some false-positive voxels far from the true target OAR. Some postprocessing strategies may improve the accuracy of these two strategies, but the improvement is limited.

As shown in Table 7, we also compared the average training and testing times of the four methods for segmenting one structure. For the training time, our method needs to train two networks for each structure, so it takes longer than joint localization and downsampling. For the testing time, our method needs to locate and segment one structure using two networks, between which some processing is needed to obtain the bounding box. Therefore, the testing time of the proposed method is longer than that of downsampling. The training and testing times of the sliding-window strategy are much longer than those of the proposed method.

IV. DISCUSSION
In this study, we proposed a new framework for the automatic segmentation of OARs in HaN CT images and evaluated its performance on the MICCAI 2015 Challenge dataset. In contrast to previous methods based on deep neural networks, the proposed framework decomposes the segmentation into two simpler tasks, locating a bounding box and segmenting a small volume within the bounding box, and trains a 3D U-Net for each task. The proposed two-stage framework easily achieves a large field of view with a small memory footprint. If a small structure is to be segmented in a large image, a 3D U-Net can produce positive values at irrelevant parts of the large image, since the field of view will be too small, which prevents it from recognizing that the given location is irrelevant.


TABLE 6. Average (± standard deviation) DSCs and 95HDs for different strategies of handling large volumes.

TABLE 7. Runtimes of the proposed method and the downsampling and sliding-window strategies.

Using a LocNet provides this large field of view with a small memory footprint. Experiments using the MICCAI 2015 Challenge dataset showed that the proposed method significantly outperformed the state-of-the-art methods.

Many methods have been used for the segmentation of OARs in HaN CT images, but the results have been relatively poor compared to other medical image segmentation tasks. The difficulty comes from the characteristics of the OAR segmentation task, such as the large variability in shape and size across different target structures and the poor contrast between some structures and their background. Deep neural networks have become the best choice for most image processing tasks and often outperform traditional methods by a large margin in medical image segmentation applications [14]–[19]. However, existing studies on the application of deep neural networks to OAR segmentation in HaN CT images demonstrate performance similar to that of traditional methods. One of the major obstacles to using deep neural networks in medical image segmentation has been the contradiction between large, high-resolution images and limited memory. Previously, this problem was addressed by using downsampling or sliding-window strategies, but our experiments show that the performance of both strategies is very poor. In a recent study [21], a multiatlas-based segmentation method was first used to roughly locate the region of interest, and a 3D CNN then segmented only a small volume within the region. This method achieved high segmentation accuracy on three small structures and showed that decomposing the localization and segmentation tasks is helpful.

In this study, we utilized 3D U-Net for both the localization and segmentation tasks. The decomposition made each of the two tasks much easier, and the deep neural network could be properly trained for its specific task. The results showed that the trained LocNet could locate the bounding box containing the target structure in all cases. After the bounding box was accurately located, training the SegNet to segment one structure with a similar shape and appearance across different subjects became much easier than training a network to segment multiple structures with different shapes and appearances from the original images. This strategy of decomposing a medical image segmentation task into two tasks, i.e., locating a bounding box and segmenting within the bounding box, has also been used in other applications where multiple structures are to be segmented.

In this study, 3D U-Nets were used for both the locating and segmentation tasks, and many other network structures could replace the 3D U-Net for one or both tasks. We did not attempt to test different network structures in this study, but experimenting with more network architectures is a potential research direction for the future. For some of the outputs of SegNet, there were several small isolated regions that did not belong to the target structure. The simple postprocessing adopted in this study only slightly improved the final results, and a more sophisticated postprocessing method may further improve the accuracy. In addition, the number of subjects in the MICCAI 2015 Challenge dataset is not very large, which may limit the performance of the deep learning network. In the future, it will be necessary to verify whether the segmentation accuracy of the method can be further improved by training on more data. Moreover, testing whether the method is suitable for clinical use and whether it can help improve treatment planning workflows is also important.

V. CONCLUSION
In this study, we proposed a two-stage segmentation framework based on 3D U-Net for the automatic segmentation of OARs in HaN CT images. The framework decomposes the original segmentation task into two easier subtasks: locating a bounding box of the target structure and segmenting the target structure in a small volume within the bounding box.


One U-Net is trained for each task, and the decomposition allows the two tasks to be completed more accurately and quickly. Experiments using the MICCAI 2015 Challenge dataset show that the proposed method significantly outperforms the state-of-the-art methods.

ACKNOWLEDGMENT
(Yueyue Wang and Liang Zhao are co-first authors.)

REFERENCES
[1] C. Fitzmaurice, C. Allen, R. M. Barber, L. Barregard, Z. A. Bhutta, H. Brenner, and T. Fleming, ‘‘Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 32 cancer groups, 1990 to 2015: A systematic analysis for the global burden of disease study,’’ JAMA Oncol., vol. 3, no. 4, pp. 524–548, 2017.
[2] E. K. Hansen, M. K. Bucci, J. M. Quivey, V. Weinberg, and P. Xia, ‘‘Repeat CT imaging and replanning during the course of IMRT for head-and-neck cancer,’’ Int. J. Radiat. Oncol. Biol. Phys., vol. 64, pp. 355–362, Feb. 2006.
[3] W. F. A. R. Verbakel, J. P. Cuijpers, D. Hoffmans, M. Bieker, B. J. Slotman, and S. Senan, ‘‘Volumetric intensity-modulated arc therapy vs. conventional IMRT in head-and-neck cancer: A comparative planning and dosimetric study,’’ Int. J. Radiat. Oncol. Biol. Phys., vol. 74, no. 1, pp. 252–259, 2009.
[4] L. Zhao, Q. Wan, Y. Zhou, X. Deng, C. Xie, and S. Wu, ‘‘Erratum to ‘The role of replanning in fractionated intensity modulated radiotherapy for nasopharyngeal carcinoma,’’’ Radiotherapy Oncol., vol. 99, no. 2, p. 256, 2011.
[5] P. M. Harari, S. Song, and W. A. Tomé, ‘‘Emphasizing conformal avoidance versus target definition for IMRT planning in head-and-neck cancer,’’ Int. J. Radiat. Oncol. Biol. Phys., vol. 77, no. 3, pp. 950–958, 2010.
[6] M. La Macchia, F. Fellin, M. Amichetti, M. Cianchetti, S. Gianolini, V. Paola, A. J. Lomax, and L. Widesott, ‘‘Systematic evaluation of three different commercial software solutions for automatic segmentation for adaptive therapy in head-and-neck, prostate and pleural cancer,’’ Radiat. Oncol., vol. 7, Sep. 2012, Art. no. 160.
[7] G. Sharp, K. D. Fritscher, V. Pekar, M. Peroni, N. Shusharina, H. Veeraraghavan, and J. Yang, ‘‘Vision 20/20: Perspectives on automated image segmentation for radiotherapy,’’ Med. Phys., vol. 41, no. 5, 2017, Art. no. 050902.
[8] P. F. Raudaschl et al., ‘‘Evaluation of segmentation methods on head and neck CT: Auto-segmentation challenge 2015,’’ Med. Phys., vol. 44, pp. 2020–2036, Jun. 2017.
[9] E. Street, L. Hadjiiski, B. Sahiner, S. Gujar, M. Ibrahim, S. K. Mukherji, and H.-P. Chan, ‘‘Automated volume analysis of head and neck lesions on CT scans using 3D level set segmentation,’’ Med. Phys., vol. 34, no. 11, pp. 4399–4408, 2007.
[10] X. Han, M. S. Hoogeman, P. C. Levendag, L. S. Hibbard, D. N. Teguh, P. Voet, A. C. Cowen, and T. K. Wolf, ‘‘Atlas-based auto-segmentation of head and neck CT images,’’ in Proc. Int. Conf. Med. Image Comput. Comput.-Assist. Intervent., in Lecture Notes in Computer Science, vol. 5242, 2008, pp. 434–441.
[11] R. Mannion-Haworth, M. Bowes, A. Ashman, G. Guillard, A. Brett, and G. Vincent, ‘‘Fully automatic segmentation of head and neck organs using active appearance models,’’ Tech. Rep., 2016.
[12] M. Peroni, ‘‘Methods and algorithms for image guided adaptive radio- and hadron-therapy,’’ Tech. Rep., 2011, p. 152.
[13] Z. Wang, L. Wei, L. Wang, Y. Gao, W. Chen, and D. Shen, ‘‘Hierarchical vertex regression-based segmentation of head and neck CT images for radiotherapy planning,’’ IEEE Trans. Image Process., vol. 27, no. 2, pp. 923–937, Feb. 2018.
[14] F. Milletari, S.-A. Ahmadi, C. Kroll, A. Plate, V. Rozanski, J. Maiostre, J. Levin, O. Dietrich, B. Ertl-Wagner, K. Bötzel, and N. Navab, ‘‘Hough-CNN: Deep learning for segmentation of deep brain regions in MRI and ultrasound,’’ Comput. Vis. Image Understand., vol. 164, pp. 92–102, Nov. 2017.
[15] K. H. Cha, L. Hadjiiski, R. K. Samala, H.-P. Chan, E. M. Caoili, and R. H. Cohan, ‘‘Urinary bladder segmentation in CT urography using deep-learning convolutional neural network and level sets,’’ Med. Phys., vol. 43, no. 4, pp. 1882–1896, 2016.
[16] P. Hu, F. Wu, J. Peng, Y. Bao, F. Chen, and D. Kong, ‘‘Automatic abdominal multi-organ segmentation using deep convolutional neural network and time-implicit level sets,’’ Int. J. Comput. Assist. Radiol. Surg., vol. 12, no. 3, pp. 399–411, 2017.
[17] J. Cai, L. Lu, Z. Zhang, F. Xing, L. Yang, and Q. Yin, ‘‘Pancreas segmentation in MRI using graph-based decision fusion on convolutional neural networks,’’ in Proc. Int. Conf. Med. Image Comput. Comput.-Assist. Intervent., in Lecture Notes in Computer Science, 2016, pp. 442–450.
[18] F. Milletari, N. Navab, and S.-A. Ahmadi, ‘‘V-Net: Fully convolutional neural networks for volumetric medical image segmentation,’’ in Proc. 4th Int. Conf. 3D Vis., Oct. 2016, pp. 565–571.
[19] Q. Zhu, B. Du, B. Turkbey, P. L. Choyke, and P. Yan, ‘‘Deeply-supervised CNN for prostate segmentation,’’ in Proc. Int. Joint Conf. Neural Netw., May 2017, pp. 178–184.
[20] B. Ibragimov and L. Xing, ‘‘Segmentation of organs-at-risks in head and neck CT images using convolutional neural networks,’’ Med. Phys., vol. 44, no. 2, pp. 547–557, 2017.
[21] X. Ren, L. Xiang, D. Nie, Y. Shao, H. Zhang, D. Shen, and Q. Wang, ‘‘Interleaved 3D-CNNs for joint segmentation of small-volume structures in head and neck CT images,’’ Med. Phys., vol. 45, no. 5, pp. 2063–2075, 2018.
[22] W. Zhu, Y. Huang, L. Zeng, X. Chen, Y. Liu, Z. Qian, N. Du, W. Fan, and X. Xie, ‘‘AnatomyNet: Deep learning for fast and fully automated whole-volume segmentation of head and neck anatomy,’’ Med. Phys., vol. 46, no. 2, pp. 576–589, 2019.
[23] N. Tong, S. Gou, S. Yang, D. Ruan, and K. Sheng, ‘‘Fully automatic multi-organ segmentation for head and neck cancer radiotherapy using shape representation model constrained fully convolutional neural networks,’’ Med. Phys., vol. 45, no. 10, pp. 4558–4567, 2018.
[24] C. Wang, T. Macgillivray, G. Macnaught, G. Yang, and D. Newby, ‘‘A two-stage 3D Unet framework for multi-class segmentation on full resolution image,’’ 2018, arXiv:1804.04341. [Online]. Available: https://arxiv.org/abs/1804.04341
[25] H. Chen, W. Lu, M. Chen, L. Zhou, R. Timmerman, D. Tu, L. Nedzi, Z. Wardak, S. Jiang, X. Zhen, and X. Gu, ‘‘A recursive ensemble organ segmentation (REOS) framework: Application in brain radiotherapy,’’ Phys. Med. Biol., vol. 64, no. 2, 2019, Art. no. 025015.
[26] K. He, X. Cao, Y. Shi, D. Nie, Y. Gao, and D. Shen, ‘‘Pelvic organ segmentation using distinctive curve guided fully convolutional networks,’’ IEEE Trans. Med. Imag., vol. 38, no. 2, pp. 585–595, Feb. 2019.
[27] A. Balagopal, S. Kazemifar, D. Nguyen, M.-H. Lin, R. Hannan, A. Owrangi, and S. Jiang, ‘‘Fully automated organ segmentation in male pelvic CT images,’’ Phys. Med. Biol., vol. 63, no. 24, 2018, Art. no. 245015.
[28] H. Fashandi, G. Kuling, Y. Lu, H. Wu, and A. L. Martel, ‘‘An investigation of the effect of fat suppression and dimensionality on the accuracy of breast MRI segmentation using U-nets,’’ Med. Phys., vol. 46, no. 3, pp. 1230–1244, Mar. 2019.
[29] M. U. Dalmış, G. Litjens, K. Holland, A. Setio, R. Mann, N. Karssemeijer, and A. Gubern-Mérida, ‘‘Using deep learning to segment breast and fibroglandular tissue in MRI volumes,’’ Med. Phys., vol. 44, no. 2, pp. 533–546, 2017.
[30] Ö. Çiçek, A. Abdulkadir, S. S. Lienkamp, T. Brox, and O. Ronneberger, ‘‘3D U-Net: Learning dense volumetric segmentation from sparse annotation,’’ in Proc. Int. Conf. Med. Image Comput. Comput.-Assist. Intervent., 2016, pp. 424–432.
[31] F. Chollet, ‘‘Keras,’’ Tech. Rep., 2015.
[32] T. Albrecht, T. Gass, C. Langguth, and M. Lüthi, ‘‘Multi atlas segmentation with active shape model refinement for multi-organ segmentation in head and neck cancer radiotherapy planning,’’ Tech. Rep., 2015, pp. 1–6.
[33] M. O. Arteaga, D. C. Peña, and G. C. Dominguez, ‘‘Head and neck auto segmentation challenge based on non-local generative models,’’ MIDAS J., to be published.
[34] A. Chen and B. Dawant, ‘‘A multi-atlas approach for the automatic segmentation of multiple structures in head and neck CT images,’’ MIDAS J., to be published.
[35] L. Yu, X. Yang, H. Chen, J. Qin, and P. A. Heng, ‘‘Volumetric convnets with mixed residual connections for automated prostate segmentation from 3D MR images,’’ in Proc. 31st AAAI Conf. Artif. Intell., 2017, pp. 66–72.
[36] O. Ronneberger, P. Fischer, and T. Brox, ‘‘U-Net: Convolutional networks for biomedical image segmentation,’’ in Proc. Med. Image Comput. Comput.-Assist. Intervent. (MICCAI), 2015.


YUEYUE WANG received the bachelor's degree in communication engineering from the Ocean University of China, in 2017. She is currently pursuing the Ph.D. degree with the Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, and the Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Shanghai, China. Her research interests include medical image segmentation and medical image classification.

LIANG ZHAO received the bachelor's degree in physics from Nankai University, Tianjin, China, in 2005, and the M.Sc. degree in medical physics from the University of Surrey, U.K., in 2007. He is currently pursuing the Ph.D. degree with the Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, and the Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Shanghai, China. His research interests include medical image processing/analysis and edge computing.

MANNING WANG received the B.S. and M.S. degrees in power electronics and power transmission from Shanghai Jiaotong University, Shanghai, China, in 1999 and 2002, respectively, and the Ph.D. degree in biomedical engineering from Fudan University, Shanghai, in 2011. He is currently a Professor of biomedical engineering with the School of Basic Medical Science, Fudan University, where he is also the Deputy Director of the Digital Medical Research Center and the Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention (MICCAI). His research interests include medical image processing, image-guided intervention, and computer vision.

ZHIJIAN SONG received the B.S. degree from the Shandong University of Technology, Shandong, China, in 1982, the M.S. degree from the Jiangsu University of Technology, Jiangsu, China, in 1991, and the Ph.D. degree in biomedical engineering from Xi'an Jiaotong University, Xi'an, China, in 1994. He is currently a Professor with the School of Basic Medical Science, Fudan University, Shanghai, where he is also the Director of the Digital Medical Research Center and the Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention (MICCAI). His research interests include medical image processing, image-guided intervention, and the application of virtual and augmented reality technologies in medicine.
