DomainATM--Domain-adaptation-toolbox-for-medical-data-analysi_2023_NeuroImag
DomainATM--Domain-adaptation-toolbox-for-medical-data-analysi_2023_NeuroImag
DomainATM--Domain-adaptation-toolbox-for-medical-data-analysi_2023_NeuroImag
NeuroImage
journal homepage: www.elsevier.com/locate/neuroimage
a r t i c l e i n f o a b s t r a c t
Keywords: Domain adaptation (DA) is an important technique for modern machine learning-based medical data analysis,
Domain adaptation which aims at reducing distribution differences between different medical datasets. A proper domain adaptation
Medical image analysis method can significantly enhance the statistical power by pooling data acquired from multiple sites/centers. To
Medical image processing toolbox
this end, we have developed the Domain Adaptation Toolbox for Medical data analysis (DomainATM) – an open-
Open source software
source software package designed for fast facilitation and easy customization of domain adaptation methods for
medical data analysis. The DomainATM is implemented in MATLAB with a user-friendly graphical interface, and
it consists of a collection of popular data adaptation algorithms that have been extensively applied to medical
image analysis and computer vision. With DomainATM, researchers are able to facilitate fast feature-level and
image-level adaptation, visualization and performance evaluation of different adaptation methods for medical
data analysis. More importantly, the DomainATM enables the users to develop and test their own adaptation
methods through scripting, greatly enhancing its utility and extensibility. An overview characteristic and usage
of DomainATM is presented and illustrated with three example experiments, demonstrating its effectiveness,
simplicity, and flexibility. The software, source code, and manual are available online.
1. Introduction gained considerable attention and increasing interest recently (Guan and
Liu, 2022; Valverde et al., 2021). Briefly, domain adaptation can be de-
Medical data analysis is nowadays being boosted by modern statisti- fined as follows. Let × represent the joint feature space of samples
cal analysis tools, i.e., machine learning (Barragán-Montero et al., 2021; and their corresponding category labels. A source domain and a target
Deo, 2015; Erickson et al., 2017; Fatima et al., 2017; Rajkomar et al., domain are defined on the joint feature space, with different distri-
2019). Classic machine learning typically assumes that training dataset butions 𝐏𝐒 and 𝐏𝐓 , respectively. Suppose there are 𝑛𝑠 samples (subjects)
(source domain) and test dataset (target domain) follow an independent with or without category labels in the source domain, as well as 𝑛𝑡 sam-
but identical distribution (Valiant, 1984). In real-world practice, how- ples in the target domain without category labels. Then the problem
ever, this assumption can hardly hold due to the well-known “domain is how to reduce the distribution differences/variability between source
shift” problem (Kondrateva et al., 2021; Pooch et al., 2020; Quiñonero- and target domains so as to increase the performance of down-streaming
Candela et al., 2009). In medical imaging, domain shift or data het- tasks such as classification or segmentation.
erogeneity is widespread and caused by different scanning parameters Many domain adaptation methods have been proposed or utilized
(i.e., between-scanner variability) and subject populations in multiple in the field of medical data analysis which shows tremendous appli-
imaging sites. It may increase the test error along with the distribution cability. Most solutions, however, are implemented independently for
difference between training and test data (Ben-David et al., 2007; Tor- very specific scenarios or target applications. Researchers often need
ralba and Efros, 2011). Thus the domain shift/difference may greatly to re-implement an algorithm or do methodological tailoring. The dif-
degrade statistical power of multi-site/multi-center studies and hinder ferences in implementation will often cause inconsistent experiment
the building of effective machine learning models. and analysis results. There is a lack of a unified platform for exten-
For handling the domain shift problem among datasets and enhanc- sive comparison of different domain adaptation methods, helping avoid
ing the generalization ability of machine learning models, domain adap- hand-crafted re-implementation for specific medical data analysis re-
tation has gradually come under the spotlight of the research commu- search. Thus a software toolbox that provides a platform of different
nity (Csurka, 2017; Kouw and Loog, 2019; Patel et al., 2015; Wang and adaptation methods is quite beneficial and necessary for researchers
Deng, 2018; Wilson and Cook, 2020; Zhang et al., 2020; Zou et al., to compare, evaluate and select the proper method for their research
2020). In the field of medical data analysis, domain adaptation has project.
∗
Corresponding author.
E-mail address: [email protected] (M. Liu).
https://doi.org/10.1016/j.neuroimage.2023.119863.
Received 31 October 2022; Received in revised form 11 December 2022; Accepted 3 January 2023
Available online 5 January 2023.
1053-8119/© 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/)
H. Guan and M. Liu NeuroImage 268 (2023) 119863
2. Toolbox overview/characteristics
After loading the data, the following procedure is to select, configure,
and run the domain adaptation methods. Most adaptation methods have
The main structure of the DomainATM is illustrated in Fig. 2. Cur-
several hyper-parameters to be set. Users can tune them according to the
rently, the toolbox consists of three modules. 1) The data module is
responsible for loading and generating datasets. It can directly load an
1
existing medical dataset (in .mat data file) or create synthetic datasets https://www.mingxia.web.unc.edu/domainatm/
2
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Fig. 2. Illustration of workflow of the DomainATM software. The DomainATM consists of three major components: 1) the data module loads or creates the datasets;
2) the algorithm module conducts feature-level or image-level domain adaptation and saves the results; and 3) the evaluation module assesses the adaptation
performance according to specific metrics. DA: Domain Adaptation.
All the files are named with the corresponding adaptation method with
time stamp.
After running the adaptation methods and getting the results, per-
formance evaluation can be conducted for the methods. For feature-
level adaptation, we use distribution difference and domain-level classi-
fication accuracy as two metrics to assess the adaptation performance.
For image-level adaptation, we adopt correlation coefficient (CC), peak
signal-to-noise ratio (PSNR) and mean-square error (MSE) to evaluate the
adaptation result. More details about these evaluation metrics will be
elaborated in the experiment section.
3
H. Guan and M. Liu NeuroImage 268 (2023) 119863
ply run and analyze their methods like the other built-in ones through 4.1.9. Scatter Component Analysis (SCA)
GUI. In this algorithm (Ghifary et al., 2016), original features are firstly
projected to a reproducing kernel Hilbert space. Domain adaptation is
4. Algorithms then conducted through an optimization formulation, including max-
imizing the class separability, maximizing the data separability, and
In this section, we briefly introduce the algorithms for feature-level minimizing domain mismatch. Category labels of the source domain are
and image-level data adaptation in DomainATM. More details can be used during adaptation. The key parameter is the dimension of the trans-
found in the online manual. formed space.
4
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Fig. 4. Distribution of the synthetic data (baseline) and adapted data by nine different domain adaptation methods in the DomainATM toolbox. ( positive source
samples; positive target samples; negative source samples; negative target samples).
5
H. Guan and M. Liu NeuroImage 268 (2023) 119863
two subsets of ADNI, i.e., ADNI-1 (100 subjects with 1.5T T1-weighted Table 2
structural MRIs) and ADNI-2 (100 subjects with 3.0T T1-weighted struc- Domain classification accuracy (%) using different classifiers on the ADNI-1 and
tural MRIs) as the source and target domains, respectively, to test the ADNI-2 datasets. (SVM: support vector machine; RF: random forest).
domain adaptation algorithms using DomainATM. ADNI-1 contains 50 Method Baseline SA CORAL OT TCA TJM JDA GFK SCA ITL
patients with Alzheimer’s disease (AD) (positive samples) and 50 nor-
SVM 79 43 79 43 77 43 43 43 43 43
mal control (NC) subjects (negative samples). ADNI-2 has 50 CE sub- RF 85 64 85 52 80 63 58 60 50 57
jects and 50 NC subjects. All the MRIs have been processed through a
standard pipeline, including skull stripping, intensity correction, regis-
tration and re-sampling. Regions-of-interest (ROIs) features which are
5.3.3. Domain-level classification
defined on 90 regions in the Anatomical Automatic Labeling (AAL) at-
We facilitate domain-level classification on the source data, i.e.,
las (Tzourio-Mazoyer et al., 2002) are used to represent each subject.
ADNI-1, and target data, i.e., ADNI-2. A domain classifier (k-nearest
The 90-dimensional features denote the gray matter volumes in each
neighbors classifier) is trained with source data (with the label “1”) and
brain region.
target data (with the label “0”). Source and target data are combined
together and shuffled. 60% of the entire data are adopted for training
5.3.1. Distribution visualization while 40% for testing. The result of domain-level classification is illus-
The distributions of original ADNI-1 and ADNI-2 data (in feature trated in Fig. 9. Another two classifiers, including support vector ma-
space) and the adapted data by different methods are visualized in Fig 7. chine (SVM) and random forest (RF) are also adopted for domain-level
From the visualization results, the original source and target data have a classification, and the result is listed in Table 2. From Fig. 9 and Table 2,
relatively clear boundary. After domain adaptation, the domain bound- we can see that the domain classification accuracy drops after domain
aries become blurred, and the distribution of source and target domains adaptation despite the different types of domain classifiers. This indi-
gets closer to each other. cates that the adapted source and target data get more difficult to be
correctly classified, i.e., domain adaptation is effective in reducing their
distribution differences.
5.3.2. Distribution distance
The distribution differences (in terms of maximum mean discrep-
ancy) of the source data, i.e., ADNI-1, and target data, i.e., ADNI-2, after 5.4. Experiment 3: Domain adaptation for autism analysis on ABIDE
domain adaptation are shown in Fig. 8. The baseline illustrates the origi-
nal distribution of the source and target domain without any adaptation We conduct experiments on the Autism Brain Imaging Data Exchange
processing. From the result, it can be observed that domain adaptation (ABIDE) dataset (Di Martino et al., 2014). This database consists of
is able to reduce the distribution differences between the original source resting-state functional MRI (fMRI) data for Autism analysis. We use two
and target domains. sites from the ABIDE project, i.e., NYU (184 subjects) and UM (145 sub-
6
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Fig. 7. Distribution of the original ADNI data (baseline) and adapted data by nine feature-level domain adaptation methods in the DomainATM toolbox. ( positive
source samples; positive target samples; negative source samples; negative target samples).
jects) as the source and target domains, respectively, to test the domain correction, nuisance signal regression, temporal filtering, and registra-
adaptation algorithms using the DomainATM. The NYU site consists of tion. The mean time series of 116 regions-of-interest (ROIs) defined by
79 positive samples (autism patients) and 105 negative samples (normal the Anatomical Automatic Labeling (AAL) atlas (Tzourio-Mazoyer et al.,
controls). These fMRIs are acquired by a 3 Tesla Allegra scanner. The 2002) are extracted. Then, a 116 × 116 symmetrical resting-state func-
UM site includes 68 positive samples (autism patients) and 77 negative tional connectivity (FC) matrix is generated for each subject, with each
samples (normal controls). These fMRIs are acquired using a 3 Tesla element representing the Pearson correlation coefficient between a pair
GE scanner located at the UM Functional MRI Laboratory. All the fM- of ROI signals. We extract the node betweenness centrality (Rubinov and
RIs go through a standard pipeline, including slice-timing and motion Sporns, 2010) based on the FC matrix to represent each subject/sample.
7
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Fig. 10. Distribution of the original ABIDE data (baseline) and adapted data by nine feature-level domain adaptation methods in the proposed DomainATM toolbox.
( positive source samples; positive target samples; negative source samples; negative target samples).
5.4.2. Distribution distance result is shown in Table 3 From the results, the domain classification
The data distribution differences (in terms of MMD) of the source accuracy gets worse after domain adaptation processing regardless of
NYU domain and target UM domain after domain adaptation are shown what domain classifiers have been used. This indicates that the adapted
in Fig. 11. The baseline is the original distribution of the source and source and target data become more difficult to be discriminated, i.e.,
target domain without any adaptation processing. The result shows that using domain adaptation has successfully reduced their distribution dif-
the distribution differences become smaller after adaptation processing ferences.
by different algorithms.
5.5. Discussion
5.4.3. Domain-level classification
We facilitate domain-level classification on the source data, i.e., In the above experiments, we use two quantitative metrics, i.e., MMD
NYU, and target data, i.e., UM. A domain classifier (k-nearest neigh- and domain classification accuracy, to evaluate the performance of dif-
bors classifier) is trained with source data (with the label “1”) and tar- ferent domain adaptation methods in DomainATM. The MMD is a direct
get data (with the label “0”). Source and target data are combined to- assessment metric because it is directly calculated based on the statis-
gether and shuffled. 60% of the entire data are adopted for training tical properties of source and target domains (datasets). Generally, if
while 40% for test. The result of domain-level classification accuracy is method A achieves a smaller MMD than method B, then A is supposed
illustrated in Fig. 12. We also use support vector machine (SVM) and to be better. Domain classification accuracy is an indirect metric because
random forest (RF) to conduct the domain-level classification, and the it relies on a specific domain classifier. But it can also reflect the adap-
8
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Table 4 Specifically, we adopt the following three metrics for image-level adap-
Running time (in terms of seconds) of nine domain adaptation algorithms in tation performance evaluation.
DomainATM on three datasets.
• Correlation Coefficient (CC). Denote the source and target images
Method SA CORAL OT TCA TJM JDA GFK SCA ITL as 𝑠 and 𝑡 . After adaptation, we get 𝑠′ . For performance assess-
Synthetic 0.09 0.05 1.28 0.06 0.19 0.85 0.09 1.04 0.06 ment, if the correlation coefficient of 𝑠′ and 𝑡 is higher than 𝑠 and
ADNI 0.05 0.01 2.78 0.04 0.21 0.92 0.09 1.13 0.13 𝑡 , it indicates the corresponding adaptation algorithm works.
ABIDE 0.03 0.01 6.07 0.07 0.26 0.92 0.09 1.74 0.25 • Peak Signal-to-Noise Ratio (PSNR). If the peak signal-to-noise ra-
tio of 𝑠′ and 𝑡 is higher than 𝑠 and 𝑡 , it indicates the adaptation
algorithm works.
tation performance since confusing a classifier is difficult. If method A • Mean-Squared Error (MSE). If the mean-squared error of 𝑠′ and 𝑡
achieves a smaller domain classification accuracy than method B, then is smaller than 𝑠 and 𝑡 , it indicates the adaptation algorithms are
A is supposed to be better. Based on the experimental results, we have effective.
the following empirical findings.
6.2. Materials and settings
• The CORAL, TCA and SCA algorithms have relatively worse domain
adaptation performance than the other methods. They get signifi- Phantom data of five traveling subjects with T1-weighted (T1-w)
cantly higher MMD values and domain classification accuracy than structural MRIs from the ABCD dataset (Volkow et al., 2018) are used for
the others. performance evaluation. Phantom-1 is scanned by GE and Philips scan-
• The OT algorithm achieves the overall best performance among ners, respectively. Phantom-2 and Phantom-3 are acquired by Siemens
these adaptation methods. It generally produces the smallest MMD and GE scanners, respectively. Phantom-4 and Phantom-5 are scanned
value and domain classification accuracy in all these three experi- by Philips and Siemens scanners, respectively. The protocols of the GE,
ments. Philips and Siemens scanners are consistent. These phantoms are used to
• On the ADNI dataset, the TJM, JDA, GFK and ITL have comparable test the performance of image-level domain adaptation methods in han-
performance. They get similar domain classification accuracy and dling domain shift caused by different scanners. All these 3D MRIs are
low MMD. On the ABIDE dataset, the algorithm ITL is worse than raw data in the NIfTI file format. We do not perform any pre-processing
the others. such as skull-stripping, registration or segmentation before image-level
• Most algorithms are effective in significantly reducing the MMD adaptation. During adaptation, the intensity of each image is normal-
value. By contrast, the domain classification accuracy is more dif- ized to the range of [0, 1]. For these volumetric images which contain
ficult to reduce. This implies that it is challenging to confuse or de- multiple slices, the adaptation is facilitated on each slice, then the per-
ceive a domain classifier with certain domain adaptation methods. formance is calculated as an average metric value for all the slices within
Thus, domain classification accuracy is a rigorous metric to assess an image (volume).
the robustness of an adaptation algorithm.
6.3. Result
We also conduct statistical testing for performance comparison in
terms of domain classification accuracy. Specifically, we compute the We conduct image-level domain adaptation on these five phantom
p-values via paired sample t-test between each adaptation method and structural MRI data, and the adaptation results in terms of the three met-
the baseline. The p-values are smaller than 0.05, indicating that their rics are shown in Table 5. From the result, it can be observed that image-
differences are significant. In addition, we calculate the running time of level domain adaptation methods can generally achieve higher scores of
each domain adaptation algorithm for each dataset on a PC with an Intel correlation coefficient (CC) and peak signal-to-noise ratio (PSNR) and
i-7 CPU and 16 GB memory. The comparison result is listed in Table 4. smaller mean square error (MSE). In some cases (e.g., GE → Philips), the
Histogram Matching (HM) does not perform very well in terms of PSNR
and MSE. Overall, the result indicates that image-level adaptation meth-
6. Empirical evaluation of image-level data adaptation ods are useful in reducing the distribution shift between images caused
algorithms in DomainATM by different scanners.
For image-level adaptation methods, we adopt the metrics that eval- To further investigate the effectiveness of image-level domain adap-
uate the image similarity/dissimilarity before and after adaptation. tation, we do visual inspections of the MRIs that are adapted to dif-
9
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Table 5
Results of three image-level domain adaptation methods on T1-weighted MRIs of five traveling phantom subjects acquired by three
different scanners from the ABCD dataset.
Fig. 13. Image-level domain adaptation via the Spectrum Swapping-based Image-level MRI Harmonization (SSIMH) method (Guan et al., 2022) for T1-weighted
(T1-w) MRIs acquired by different scanners. Domain shift caused by the use of different scanners can be partly reduced by image-level adaptation via SSIMH.
ferent scanner styles. We divide the phantom MRIs into three groups 7. Conclusion and future work
in terms of the scanners. Then we adapt MRIs acquired by one scan-
ner to the styles of MRIs scanned by other scanners. We use the SSIMH Domain adaptation has become an important topic in the field of
method (Guan et al., 2022) in DomainATM to perform image-level adap- medical data analysis. In this paper, we develop a Domain Adaptation
tation. Fig. 13 shows the results of three different MRIs and their corre- Toolbox for Medical data analysis (DomainATM), aiming to help re-
sponding adapted images to different scanner styles. From the result, we searchers facilitate fast domain adaptation for medical data acquired
have the following two observations. 1) Different scanners (i.e., Siemens, from different sites/scanners. The DomainATM is easy to use, efficient
Philips and GE) have a significant impact on the MRIs, which can cause to run, and most importantly, it is able to do both feature-level and
the domain shift. 2) The image-level domain adaptation method is ef- image-level adaptation. In addition, users can add their own domain
fective in harmonizing the source image to the target image (reference adaptation algorithms into the toolbox, making it flexible and extensi-
image) and reducing the domain shift caused by different scanners. ble. Experiments on both synthetic and real-world medical datasets have
10
H. Guan and M. Liu NeuroImage 268 (2023) 119863
been conducted to show the usage and effectiveness of DomainATM. Di Martino, A., Yan, C.G., Li, Q., Denio, E., et al., 2014. The autism brain imaging data ex-
We hope the toolbox can provide more convenience and benefit for re- change: towards a large-scale evaluation of the intrinsic brain architecture in autism.
Mol. Psychiatry 19 (6), 659–667.
searchers to do domain adaptation research in medical data analysis. Erickson, B.J., Korfiatis, P., Akkus, Z., Kline, T.L., 2017. Machine learning for medical
There are several potential future works to further enrich and ex- imaging. Radiographics 37 (2), 505.
tend the DomainATM. First, for the sake of fast and easy facilitation of Fatima, M., Pasha, M., et al., 2017. Survey of machine learning algorithms for disease
diagnostic. Journal of Intelligent Learning Systems and Applications 9 (01), 1.
domain adaptation in medical imaging data, we only include machine Fernando, B., Habrard, A., Sebban, M., Tuytelaars, T., 2013. Unsupervised visual domain
learning methods in the current version, without considering deep learn- adaptation using subspace alignment. In: Proceedings of the IEEE International Con-
ing methods that often require large computation resources. In the fu- ference on Computer Vision, pp. 2960–2967.
Ghifary, M., Balduzzi, D., Kleijn, W.B., Zhang, M., 2016. Scatter component analysis: a uni-
ture, we plan to develop another version of the toolbox to include deep
fied framework for domain adaptation and domain generalization. IEEE Trans Pattern
learning methods (such as various GANs (Sinha et al., 2021; Yi et al., Anal Mach Intell 39 (7), 1414–1430.
2019) and CNNs (Guan et al., 2021a; Tibrewala et al., 2020)). Second, Gong, B., Shi, Y., Sha, F., Grauman, K., 2012. Geodesic flow kernel for unsupervised do-
main adaptation. In: 2012 IEEE Conference on Computer Vision and Pattern Recog-
the current evaluation metrics merely reflect domain differences, lack-
nition. IEEE, pp. 2066–2073.
ing the ability to further analyze practical applications (e.g., to what Guan, H., Liu, M., 2022. Domain adaptation for medical image analysis: asurvey. IEEE
extent Dice scores in a segmentation application varies before and after Trans. Biomed. Eng. 69 (3), 1173–1185.
domain adaptation). We will address this issue to enrich the toolbox in Guan, H., Liu, S., Lin, W., Yap, P.-T., Liu, M., 2022. Fast image-level MRI harmonization via
spectrum analysis. International Workshop on Machine Learning in Medical Imaging.
the future. Besides, we plan to further improve the graphic user inter- Springer.
face to enable users to set and tune the hyper-parameters of each domain Guan, H., Liu, Y., Yang, E., Yap, P.-T., Shen, D., Liu, M., 2021. Multi-site MRI harmoniza-
adaptation method in a more convenient manner. tion via attention-guided deep domain adaptation for brain disorder identification.
Med Image Anal 71, 102076.
Guan, H., Wang, L., Liu, M., 2021. Multi-source domain adaptation via optimal transport
Data Availability for brain dementia identification. In: 2021 IEEE 18th International Symposium on
Biomedical Imaging (ISBI). IEEE, pp. 1514–1517.
Jack Jr, C.R., Bernstein, M.A., Fox, N.C., et al., 2008. The Alzheimer’s Disease Neuroimag-
The ADNI dataset used in this study can be accessed via the ing Initiative (ADNI): MRI methods. J. Magn. Reson. Imaging 27 (4), 685–691.
following link http://www.ida.loni.usc.edu/login.jsp?project=ADNI& Kondrateva, E., Pominova, M., Popova, E., Sharaev, M., Bernstein, A., Burnaev, E., 2021.
page=HOME. An access application should be permitted firstly. Domain shift in computer vision models for MRI data analysis: An overview. In: Thir-
teenth International Conference on Machine Vision, Vol. 11605. SPIE, pp. 126–133.
The ABIDE dataset used in this study can be accessed via Kouw, W.M., Loog, M., 2019. A review of domain adaptation without target labels. IEEE
the following link http://www.fcon_1000.projects.nitrc.org/indi/ Trans Pattern Anal Mach Intell 43 (3), 766–785.
abide/abide_I.html. An access application should be permitted firstly. Kumagai, A., Iwata, T., 2019. Unsupervised domain adaptation by matching distributions
based on the maximum mean discrepancy via unilateral transformations. In: Proceed-
For review convenience, we included the processed datasets (in
ings of the AAAI Conference on Artificial Intelligence, Vol. 33, pp. 4106–4113.
terms of 2D features) in the submitted source code of this paper which Lee, H., Nakamura, K., Narayanan, S., Brown, R.A., Arnold, D.L., Initiative, A.D.N., et al.,
can be found in the data folder. 2019. Estimating and accounting for the effect of MRI scanner changes on longitudinal
whole-brain volume change measurements. Neuroimage 184, 555–565.
Long, M., Wang, J., Ding, G., Sun, J., Yu, P.S., 2013. Transfer feature learning with joint
Code Availability Statement distribution adaptation. In: Proceedings of the IEEE International Conference on Com-
puter Vision, pp. 2200–2207.
The software and code of the DomainATM toolbox as well as Long, M., Wang, J., Ding, G., Sun, J., Yu, P.S., 2014. Transfer joint matching for unsuper-
vised domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision
the manual have been submitted at the time of paper submis- and Pattern Recognition, pp. 1410–1417.
sion. These materials can be also found at the following link: Van der Maaten, L., Hinton, G., 2008. Visualizing data using t-SNE. Journal of Machine
https://www.mingxia.web.unc.edu/domainatm/ Learning Research 9 (11).
Pan, S.J., Tsang, I.W., Kwok, J.T., Yang, Q., 2010. Domain adaptation via transfer com-
ponent analysis. IEEE Trans. Neural Networks 22 (2), 199–210.
Data availability Patel, V.M., Gopalan, R., Li, R., Chellappa, R., 2015. Visual domain adaptation: a survey
of recent advances. IEEE Signal Process Mag 32 (3), 53–69.
Data will be made available on request. Pooch, E.H., Ballester, P., Barros, R.C., 2020. Can we trust deep learning based diagno-
sis? The impact of domain shift in chest radiograph classification. In: International
Workshop on Thoracic Image Analysis. Springer, pp. 74–83.
Quiñonero-Candela, J., Sugiyama, M., Lawrence, N.D., Schwaighofer, A., 2009. Dataset
Credit authorship contribution statement Shift in Machine Learning. MIT Press.
Rajkomar, A., Dean, J., Kohane, I., 2019. Machine learning in medicine. N top N. Engl. J.
Med. 380 (14), 1347–1358.
Hao Guan: Conceptualization, Methodology, Software, Writing – Rubinov, M., Sporns, O., 2010. Complex network measures of brain connectivity: uses and
original draft. Mingxia Liu: Conceptualization, Validation, Writing – interpretations. Neuroimage 52 (3), 1059–1069.
Shi, Y., Sha, F., 2012. Information-theoretical learning of discriminative clusters for un-
review & editing, Supervision. supervised domain adaptation. In: Proceedings of the 29th International Conference
on International Conference on Machine Learning, pp. 1275–1282.
Acknowledgment Shinohara, R.T., et al., 2014. Statistical normalization techniques for magnetic resonance
imaging. NeuroImage: Clinical 6, 9–19.
Sinha, S., Thomopoulos, S.I., Lam, P., Muir, A., Thompson, P.M., 2021. Alzheimer’s disease
This work was supported in part by NIH grants (Nos. AG073297 and classification accuracy is improved by MRI harmonization based on attention-guided
AG041721). generative adversarial networks. In: 17th International Symposium on Medical Infor-
mation Processing and Analysis, Vol. 12088. SPIE, pp. 180–189.
Sun, B., Feng, J., Saenko, K., 2016. Return of frustratingly easy domain adaptation. In:
Supplementary material Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 30.
Tibrewala, R., Ozhinsky, E., Shah, R., Flament, I., Crossley, K., Srinivasan, R., Souza, R.,
Link, T.M., Pedoia, V., Majumdar, S., 2020. Computer-aided detection AI reduces in-
Supplementary material associated with this article can be found, in
terreader variability in grading hip abnormalities with MRI. J. Magn. Reson. Imaging
the online version, at doi:10.1016/j.neuroimage.2023.119863 52 (4), 1163–1172.
Torralba, A., Efros, A.A., 2011. Unbiased look at dataset bias. In: Proceedings of the
References IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp. 1521–
1528.
Barragán-Montero, et al., 2021. Artificial intelligence and machine learning for medical Tzourio-Mazoyer, N., et al., 2002. Automated anatomical labeling of activations in SPM
imaging: a technology review. Physica Med. 83, 242–256. using a macroscopic anatomical parcellation of the MNI MRI single-subject brain.
Ben-David, S., Blitzer, J., Crammer, K., Pereira, F., et al., 2007. Analysis of representations Neuroimage 15 (1), 273–289.
for domain adaptation. MIT; 1998, pp. 137–144. Valiant, L.G., 1984. A theory of the learnable. Commun ACM 27 (11), 1134–1142.
Csurka, G., 2017. A comprehensive survey on domain adaptation for visual applications. Valverde, J.M., Imani, V., Abdollahzadeh, A., De Feo, R., Prakash, M., Ciszek, R., Tohka, J.,
In: Domain Adaptation in Computer Vision Applications. Springer, pp. 1–35. 2021. Transfer learning in magnetic resonance brain imaging: a systematic review.
Deo, R.C., 2015. Machine learning in medicine. Circulation 132 (20), 1920–1930. Journal of Imaging 7 (4), 66.
11
H. Guan and M. Liu NeuroImage 268 (2023) 119863
Volkow, N.D., et al., 2018. The conception of the ABCD study: from substance use to a Yan, H., Ding, Y., Li, P., Wang, Q., Xu, Y., Zuo, W., 2017. Mind the class weight
broad NIH collaboration. Dev Cogn Neurosci 32, 4–7. bias: Weighted maximum mean discrepancy for unsupervised domain adaptation. In:
Wang, M., Deng, W., 2018. Deep visual domain adaptation: a survey. Neurocomputing Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,
312, 135–153. pp. 2272–2281.
Wang, W., Li, H., Ding, Z., Nie, F., Chen, J., Dong, X., Wang, Z., 2021. Rethinking maxi- Yi, X., Walia, E., Babyn, P., 2019. Generative adversarial network in medical imaging: a
mum mean discrepancy for visual domain adaptation. IEEE Trans Neural Netw Learn review. Med Image Anal 58, 101552.
Syst. Zhang, J., Chao, H., Yan, P., 2020. Robustified domain adaptation. arXiv: 2011.09563
Wilson, G., Cook, D.J., 2020. A survey of unsupervised deep domain adaptation. ACM Zou, D., Zhu, Q., Yan, P., 2020. Unsupervised domain adaptation with dual-scheme fu-
Transactions on Intelligent Systems and Technology (TIST) 11 (5), 1–46. sion network for medical image segmentation. In: International Joint Conference on
Wittens, M.M.J., et al., 2021. Inter-and intra-scanner variability of automated brain vol- Artificial Intelligence, pp. 3291–3298.
umetry on three magnetic resonance imaging systems in Alzheimer’s disease and con-
trols. Front Aging Neurosci 13.
12