Deep Learning-Based MR-to-CT Synthesis: The Influence of Varying Gradient Echo-Based MR Images As Input Channels
Deep Learning-Based MR-to-CT Synthesis: The Influence of Varying Gradient Echo-Based MR Images As Input Channels
Deep Learning-Based MR-to-CT Synthesis: The Influence of Varying Gradient Echo-Based MR Images As Input Channels
Bachelor of Technology
in
Submitted by
DELLA DOMINIC
RET17CS072
Under the guidance of
Ms. MEHARBAN M.S.
CERTIFICATE
Della Dominic
i
ABSTRACT
ii
Contents
Acknowledgement i
Abstract ii
List of Figures v
List of Tables vi
2 Introduction 2
2.1 MRI Sequence . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.2 Influence of different MR contrasts as input channel . . . . . . . 3
3 Methods 4
3.1 Data Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3.2 Input configurations . . . . . . . . . . . . . . . . . . . . . . . . 5
3.3 Contraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
4 Architechture 7
4.1 U-Net Architechure . . . . . . . . . . . . . . . . . . . . . . . . . 7
4.1.1 Different parts of the U-Net Architecture . . . . . . . . . 7
4.1.2 Advantages of U-Net . . . . . . . . . . . . . . . . . . . . 9
4.2 Training . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4.3 Replications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5 Image Preprocessing 11
5.1 Image Registration . . . . . . . . . . . . . . . . . . . . . . . . . 11
5.2 Masking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
6 Evaluation 13
6.1 Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
6.1.1 Mean Error (ME) . . . . . . . . . . . . . . . . . . . . . . 13
6.1.2 Mean Absolute Error (MAE) . . . . . . . . . . . . . . . 13
6.1.3 Dice Similarity Coefficient (DSC) . . . . . . . . . . . . . 13
iii
6.1.4 Surface Distance . . . . . . . . . . . . . . . . . . . . . . 14
6.1.5 peak Signal-to-noise ratio (PSNR) . . . . . . . . . . . . . 14
6.2 Repeatability . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
6.3 Statistical analysis . . . . . . . . . . . . . . . . . . . . . . . . . 14
7 Results 15
7.1 Per subject results . . . . . . . . . . . . . . . . . . . . . . . . . 15
7.2 Per Model results . . . . . . . . . . . . . . . . . . . . . . . . . . 16
7.3 Bone reconstruction . . . . . . . . . . . . . . . . . . . . . . . . . 18
8 Discussions 19
10 Conclusion 21
References 22
Appendix 23
iv
List of Figures
1 Different MR contrasts . . . . . . . . . . . . . . . . . . . . . . . 2
2 Transverse slices of the canine and human data sets. . . . . . . . 5
3 6 different input configurations. . . . . . . . . . . . . . . . . . . 6
4 U-Net Architecture . . . . . . . . . . . . . . . . . . . . . . . . . 8
5 Variation of mean absolute error (MAE) per subject . . . . . . . 16
6 Computed tomography and synthesized CTs (sCTs) generated
per input configurations for 3 subjects. . . . . . . . . . . . . . 18
v
List of Tables
1 Performance obtained for each model per data set -
MAEbody (HU) , MEbody (HU), MAEbone (HU) . . . . . . . . . . . 17
2 Performance obtained for each model per data set -DSCbone ,
Surface Distance(mm) , PSNR(dB) . . . . . . . . . . . . . . . . 17
3 Repeatability obtained for each model per data set, averaged
across replicates and subjects Population Input R . . . . . . . . 18
vi
The influence of varying gradient echo–based MR images as input channels 1
1.1 Objective
The prime objective is to study the influence of gradient echo–based contrasts
as input channels to a 3D patch-based neural network trained for synthetic CT
(sCT) generation in canine and human populations.
1.2 Motivation
Computed tomography is an imaging modality in medicine that uses
combinations of many X-ray measurements from different angles to produce
cross-sectional images of specific areas. Unfortunately, CT imaging exposes
patients to potentially harmful sources of ionizing radiation. The probability
for absorbed x-rays to induce cancer or heritable mutations, especially in
offspring, even though small, does exist. There also exists many side effects
caused due to reactions to any dye used.
In order to circumvent these problems, and additionally save the extra
cost and time for multiple imaging modalities, we are coerced to look for
alternative solutions. In Deep Learning approaches,combinations of MR
images obtained from gradient-echo experiments could provide the neural
network with additional information about tissue composition including proton
density, water and fat fractions, as well as magnetic properties such as
relaxation constants (T1, T) and susceptibility.
2 Introduction
• brain
• brain
3 Methods
Magnetic resonance images and CT scans of human and canine pelvic regions
were acquired and paired using nonrigid registration. Magnitude MR images
and Dixon reconstructed water, fat, in-phase and opposed-phase images were
obtained from a single T1-weighted multi-echo gradient-echo acquisition. From
this set, 6 input configurations were defined, each containing 1 to 4 MR images
regarded as input channels. For each configuration, a UNet-derived[7] deep
learning model was trained for synthetic CT generation.[12] Reconstructed
Hounsfield unit maps were evaluated with peak SNR, mean absolute error,
and mean error. Dice similarity coefficient and surface distance maps assessed
the geometric fidelity of bones. Repeatability was estimated by replicating the
training up to 10 times.
Fig. 2: Transverse slices of the canine and human data sets (paired in-phase
MR images and CT scans). As compared with the human data set, the canine
data set showed substantial inter-subject variability in morphology (e.g., shape,
size).
[4]
3.3 Contraints
To perform a fair comparative study, a single network architecture was used
to process the heterogeneous data sets and the variable input configurations.
Notably, the choices defining the neural network architecture and its
hyperparameters were constrained by 2 requirements:
4 Architechture
To generate sCT images from MR inputs, a patch- based convolutional
neural network, a 3D extension of the widely used U-Net, was used[6].This
architecture took as input 4D MR images with 3 spatial dimensions and a
channel dimension. The size of the patches was C × 24 × 24 × 24 voxels,
where C is the number of channels in the input configuration.
• Bottleneck.
Note that the number of feature maps doubles at each pooling, starting
with 64 feature maps for the first block, 128 for the second, and so on. The
purpose of this contracting path is to capture the context of the input image
in order to be able to do segmentation. This coarse contextual information
will then be transfered to the upsampling path by means of skip connections.
4.2 Training
An experiment consisted of the training of a neural network with 1 input
configuration. The canine and human data sets being distinct, models were
trained independently on each population. For each data set, a 3-fold cross-
validation procedure was applied to synthetize sCTs.The models trained on 1
data set were only evaluated on that data set.
Models were trained with a Nadam optimizer whose objective was to
minimize the L1 loss between the CT and the sCT. The learning rate was
constant and set at 104. With the L1 loss function, the optimization process
was robust to outliers produced by noise, artifacts, or by the imperfect
matching between MR and CT. It also resulted in sharper images than the L2
norm. Patches used to perform an optimization step were randomly extracted
from the MR images but were balanced between soft tissues and bone by using
a weighted probability map that resulted in an equal sampling of bone and
nonbone voxels. Because 6 input configurations were tested, 6 independent
models were optimized per fold.
4.3 Replications
Because the training of the neural network contained random factors, the
training of each model was replicated multiple times. This evaluated the
repeatability of sCT generation and corroborated the statistical significance
of the findings. Experiments were repeated 5 times on the human data set
and 10 times on the canine data set, because the latter presented a larger
anatomical variability.
5 Image Preprocessing
Data processing is the conversion of data into usable and desired form.Here it
refers to image processing, which is a method to perform some operations on an
image, in order to get an enhanced image or to extract some useful information
from it.The image Preprocessing performed here is Image Registration using
Elastix registration toolbox[9]
(I − shif t)
Inorm = ∗2−I (1)
scale
For the MR image, the shift and scale were derived from minimal and
maximum intensities. For the CT, these values were constant to preserve HU
quantitative nature and were set to 1000 for the shift and 4000 for the scale.
5.2 Masking
To include only relevant anatomy and to perform a tissue-specific evaluation,
binary masks were automatically derived from the MR in-phase images and CT
scans by application of thresholding and mask filling. For both modalities, the
average intensity of the image was used as a threshold value. The intersection
of MR and CT body masks isolated the volume of interest from the background.
Bone voxels in the CT scan whose intensity was greater than 200 HU and that
were within the volume of interest were defined as bone.
6 Evaluation
Model evaluation aims to estimate the generalization accuracy of a model
on future or unseen data.The similarity between the CT and sCT were
evaluated[2] in this study.
6.1 Metrics
The similarity between the CT and sCT was assessed using 5 metrics:
• Surface Distance
The MAE is also voxel-wise differences commonly reported in research for sCT
generation and reflecting the sCT fidelity in radiodensity reconstruction, which
is particularly useful for radiotherapy planning and PET/MR reconstruction.
It was computed on the entire body contour (MAEbody ). and exclusively on
the bone (MAEbone ) using the aforementioned masks.
40952
P SN R = 10 log10 ( 1 PN 2
) (2)
N i=1 (ICT (i) − IsCT (i))
6.2 Repeatability
The repeatability per input configuration was assessed by calculating the
standard deviation of MAEbody (r,s) averaged across subjects.
7 Results
In the human data set, 24 of 27 subjects met the inclusion criteria of the
study. All dogs were eligible except for 1 (17 of 18), whose MR acquisition
protocol did not follow the study design format. The training set in each
cross- validation fold contained 16 subjects for the human models and 11 for
the canine models. The remainder of this section presents the results of all 3
folds of the cross-validation for both data sets.
Results were inferred in two different ways:
These behaviors were observed in all subjects from both data sets.
Fig. 5: Variation of mean absolute error (MAE) per subject averaged across
the repeated experiments for each input configuration
[4]
Table 1: Performance obtained for each model per data set - MAEbody (HU) ,
MEbody (HU), MAEbone (HU)
[4]
Table 2: Performance obtained for each model per data set -DSCbone , Surface
Distance(mm) , PSNR(dB)
[4]
Table 3: Repeatability obtained for each model per data set, averaged across
replicates and subjects Population Input R
[4]
8 Discussions
This study showed that the choice of MR images within an input influenced
sCT generation models in 3 ways.
9.1 Applications
This study of influence of different MR contrasts input configurations in MR
to CT Translation has provided various relevant information that has varying
applications like :
• Further Research
Various new architectures and methodologies are constantly being
developed in the field of medical Image Translation and the inferences
from this study could be used to improve upon the model build.
10 Conclusion
The study of influence of gradient echo–based contrasts as input to deep
learning–based sCT generation models in canine and human populations was
conducted. Two parameters were found to influence the performance and
repeatability of sCT generation.
MRI and CT are two very relevant modalties in medical imaging and MR
to CT Translation is receiving increasing interest among researchers, with
increase in advancements in emerging technologies and increased application
of these technologies in medical imaging fields.The systematic and statistically
significant improvement that was demonstrated in this study motivates the
research of an optimal MR contrast or combination of MR contrasts for a
related future task.
References
[1] Sombir Singh Bisht et al. Image Registration Concept and Techniques:
A Review. Tech. rep. Journal of Engineering Research and Applications,
2014.
[2] Sacolick LI Delso G Wiesinger F. Clinical evaluation of zero-echo-time
MR imaging for the segmentation of the skull. Tech. rep. J Nucl Med.,
2015.
[3] Vandecaveye V Dirix P Haustermans K. The value of magnetic resonance
imaging for radiotherapy planning. Tech. rep. Semin Radiat Oncol., 2014.
[4] Matteo Maspero Mateusz C. Florkow Frank ZijlstraKoen Willemsen.
Deep learning-based MR-to-CT synthesis: The influence of varying
gradient echo-based MR images as input channels. Tech. rep. Wiley
Periodicals, 2019.
[5] LeCun Y Mathieu M Couprie C. Deep multi-scale video prediction beyond
mean square error. Tech. rep. arxiv, 2017.
[6] Gao Y Nie D Xiaohuan C. Estimating CT Image from MRI data
using 3D fully convolutional networks. Tech. rep. Springer International
Publishing, 2016.
[7] Philipp Fischer Olaf Ronneberger and Thomas Brox. U-Net: Convolu-
tional Networks for Biomedical Image Segmentation. Tech. rep. arxiv,
2015.
[8] Pluim J Staring M Klein S. I A rigidity penalty term for nonrigid
registration. Tech. rep. Med Phys, 2007.
[9] Marius Staring Stefan Klein. elastix: A Toolbox for Intensity-Based
Medical Image Registration. Tech. rep. IEEE Trans Med Imaging, 2010.
[10] Lempitsky V Ulyanov D Vedaldi A. Instance normalization: the missing
ingredient for fast stylization. Tech. rep. arXiv.org, 2016.
[11] Dixon WT. Simple proton spectroscopic imaging. Tech. rep. Radiology,
1984.
[12] Han X. MR-based synthetic CT generation using a deep convolutional
neural network method.. Tech. rep. Med Phys, 2017.
APPENDIX
Deep Learning: Deep learning is a subset of machine learning where artifi-
cial neural networks, algorithms inspired by the human brain, learn from large
amounts of data.
Course Outcome
Sl No. Description Blooms Taxonomy
Level
CS451.1 Analyse a current topic of profes-
Knowledge(Level1)
sional interest and present it Analyse(level4)
before an audience
CS451.2 Identify an engineering Evaluate(level5)
problem,analyse it and propose a Understand(level 2)
work plan to solve it
CO–PO Mapping
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CS451.1 - 3 - 2 - - 1 - 3 3 2 3
CS451.2 3 3 3 2 1 - - 3 3 3 - 1
CO–PSO Mapping
PO1 PO2 PO3
CS451.1 3 - 1
CS451.2 3 3 2
Justifications for CO–PO/PSO Mapping
Mapping Low/Medium/High Justification
CS451.1–PO2 H Problem analysis :
I was able to identify, formu-
late, review research literature,
and analyze problems with the
traditional learning environment,
reaching substantiated conclu-
sions using first principles of
mathematics, natural sciences,
and engineering sciences.
CS451.1–PO4 M Conduct investigations of
complex problems :
I used research based knowledge
and research methods including
design of experiments, analysis
and interpretation of data, and
synthesis of the information to
provide valid conclusions.
CS451.1–PO7 L Environment and sustainability :
I understood the impact of the
professional engineering solutions
in societal and environmental
contexts, and demonstrated the
knowledge of- and the need for-
sustainable developments.
CS451.1–PO9 H Individual:
I was able to function effec-
tively as an individual, in multi-
disciplinary settings.
CS451.1–PO10 H Communication :
I was able to communicate effec-
tively on complex Engineering
activities with the Engineering
Community and with society at
large, such as, being able to
comprehend and write effective
reports and design documenta-
tion, make effective presenta-
tions, and give and receive clear
instructions.
CS451.1–PO11 M Project Management and finance:
Demonstrated knowledge and
understanding of the Engineering
and management principles
and apply these to ones own
work, to manage projects and in
multi-disciplinary environments.
CS451.1–PO12 H Life-long learning :
Recognized the need for, and
have the preparation and ability
to engage in independent and
life-long learning in the broadest
context of technological change.
CS451.1–PSO1 H Computer Science Specific Skills :
Was able to identify, analyze and
design solutions for complex engi-
neering problems in multidisci-
plinary areas by understanding
the core principles and concepts
of computer science.
CS451.1–PSO3 L Professional Skills :
Was able to apply the funda-
mentals of computer science to
formulate competitive research
proposals and to develop inno-
vative products to meet the
societal needs thereby evolving
as an eminent researcher and
entrepreneur.
CS451.2–PO1 H Engineering Knowledge :
Applied the knowledge of Math-
ematics, Science, Engineering
fundamentals, and an Engi-
neering discipline to the solution
of complex engineering problems.
CS451.2–PO2 H Problem analysis :
I was able to identify, formu-
late, review research literature,
and analyze complex Engineering
problems reaching substantiated
conclusions using first principles
of mathematics, natural sciences,
and Engineering sciences.
CS451.2–PO3 H Design/Development of solutions:
Designed solutions for complex
Engineering problems and design
system components or processes
that meet the specified needs
with appropriate consideration
for the public health and safety,
and the cultural, societal, and
environmental considerations.
CS451.2–PO4 M Conduct investigations of
complex problems :
Used research based knowledge
and research methods including
design of experiments, analysis
and interpretation of data, and
synthesis of the information to
provide valid conclusions.
CS451.2–PO5 L Modern Tool usage :
Created, selected, and applied
appropriate techniques,
resources, and modern engi-
neering and IT tools including
prediction and modeling to
complex Engineering activities
with an understanding of the
limitations.
CS451.2–PO8 H Ethics :
Applied ethical principles and
commit to professional ethics and
responsibilities and norms of the
Engineering practice.
CS451.2–PO9 H Individual:
I was able to function effectively
as an individual,and in multi-
disciplinary settings.
CS451.2–PO10 H Communication :
Communicated effectively on
complex Engineering activities
with the Engineering Community
and with society at large, such
as, being able to comprehend and
write effective reports and design
documentation,make effective
presentations, and give and
receive clear instructions.
CS451.2–PO12 L Life-long learning :
Recognized the need for, and
have the preparation and ability
to engage in independent and
life-long learning in the broadest
context of technological change.
CS451.2–PSO1 H Computer Science Specific Skills :
I was able to identify, analyze
and design solutions for complex
engineering problems in multi-
disciplinary areas by under-
standing the core principles and
concepts of computer science.
CS451.2–PSO2 H Programming and Software
Development Skills:
Acquired programming efficiency
by designing algorithms and
applying standard practices in
software project development to
deliver quality software products.
CS451.2–PSO3 M Professional Skills :
Applied the fundamentals of
computer science to formulate
competitive research proposals
and to develop innovative prod-
ucts to meet the societal needs
thereby evolving as an eminent
researcher and entrepreneur.