MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI

Nancy R. Newlin1, Kurt Schilling2, Serge Koudoro3, Bramsh Qamar Chandio4, Praitayini Kanakaraj1, Daniel Moyer1, Claire E. Kelly5,6,7, Sila Genc5,8, Jian Chen5, Joseph Yuan-Mou Yang5,8,9,10, Ye Wu11, Yifei He11, Jiawei Zhang12, Qingrun Zeng13, Fan Zhang13, Nagesh Adluru14, Vishwesh Nath15, Sudhir Pathak16, Walter Schneider16, Anurag Gade17, Yogesh Rathi18, Tom Hendriks19, Anna Vilanova19, Maxime Chamberland19, Tomasz Pieciak20,21, Dominika Ciupek21, Antonio Tristán-Vega20, Santiago Aja-Fernández20, Maciej Malawski21, Gani Ouedraogo22, Julia Machnio21, Christian Ewert23, Paul M. Thompson4, Neda Jahanshad4, Eleftherios Garyfallidis3, Bennett A. Landman1,2,24,25
1: Department of Computer Science, Vanderbilt University, Nashville, TN, 2: Department of Radiology and Radiological Sciences, Vanderbilt University Medical Center, 3: Indiana University Bloomington, Bloomington, IN, 4: Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, Los Angeles, CA, 5: Developmental Imaging, Murdoch Children’s Research Institute, Melbourne, Australia, 6: Victorian Infant Brain Studies (VIBeS), Murdoch Children’s Research Institute, Melbourne, Australia, 7: Turner Institute for Brain and Mental Health, School of Psychological Sciences, Monash University, Melbourne, Australia, 8: Neuroscience Advanced Clinical Imaging Service (NACIS), Department of Neurosurgery, Royal Children’s Hospital, Melbourne, Australia, 9: Neuroscience Research, Murdoch Children’s Research Institute, Melbourne, Australia, 10: Department of Pediatrics, University of Melbourne, Melbourne, Australia, 11: School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing, China, 12: College of Information Engineering, Zhejiang University of Technology, Hangzhou, China, 13: School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China, 14: Waisman Center, Department of Radiology, University of Wisconsin, Madison, 15: NVIDIA, Nashville, TN, USA, 16: Learning Research and Development Center, University of Pittsburgh, 17: Brigham and Women’s Hospital, Boston, 18: Brigham and Women’s Hospital, Harvard Medical School, Boston, 19: Department of Computer Science and Mathematics, Eindhoven University of Technology, Netherlands, 20: LPI, ETSI Telecomunicación, Universidad de Valladolid, Castilla y León, Spain, 21: Sano Centre for Computational Medicine, 30-054 Kraków, Poland, 22: Aix-Marseille Université, Marseille, France, 23: AI in Medical Imaging, German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany, 24: Vanderbilt University Institute of Imaging Science, Vanderbilt University, Nashville, TN, USA, 25: Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA
Publication date: 2024/08/31
https://doi.org/10.59275/j.melba.2024-9c68
PDF · Code · arXiv

Abstract

White matter alterations are increasingly implicated in neurological diseases and their progression. International-scale studies use diffusion-weighted magnetic resonance imaging (DW-MRI) to qualitatively identify changes in white matter microstructure and connectivity. Yet, quantitative analysis of DW-MRI data is hindered by inconsistencies stemming from varying acquisition protocols. Specifically, there is a pressing need to harmonize the preprocessing of DW-MRI datasets to ensure the derivation of robust quantitative diffusion metrics across acquisitions. In the MICCAI-CDMRI 2023 QuantConn challenge, participants were provided raw data from the same individuals collected on the same scanner but with two different acquisitions and tasked with preprocessing the DW-MRI to minimize acquisition differences while retaining biological variation. Harmonized submissions are evaluated on the reproducibility and comparability of cross-acquisition bundlewise microstructure measures, bundle shape features, and connectomics. The key innovations of the QuantConn challenge are that (1) we assess bundles and tractography in the context of harmonization for the first time, (2) we assess connectomics in the context of harmonization for the first time, and (3) we have 10x additional subjects over prior harmonization challenge, MUSHAC and 100x over SuperMUDI. We find that bundle surface area, fractional anisotropy, connectome assortativity, betweenness centrality, edge count, modularity, nodal strength, and participation coefficient measures are most biased by acquisition and that machine learning voxel-wise correction, RISH mapping, and NeSH methods effectively reduce these biases. In addition, microstructure measures AD, MD, RD, bundle length, connectome density, efficiency, and path length are least biased by these acquisition differences. A machine learning approach that learned voxelwise cross-acquisition relationships was the most effective at harmonizing connectomic, microstructure, and macrostructure features, but requires the same subject be scanned at each site co-registered. NeSH, a spatial and angular resampling method, was also effective and has generalizable framework not reliant co-registration. Our code is available at https://github.com/nancynewlin-masi/QuantConn/

Keywords

Diffusion MRI · harmonization · tractometry · tractography · connectomics · image processing

Bibtex @article{melba:2024:019:newlin, title = "MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI", author = "Newlin, Nancy R. and Schilling, Kurt and Koudoro, Serge and Chandio, Bramsh Qamar and Kanakaraj, Praitayini and Moyer, Daniel and Kelly, Claire E. and Genc, Sila and Chen, Jian and Yang, Joseph Yuan-Mou and Wu, Ye and He, Yifei and Zhang, Jiawei and Zeng, Qingrun and Zhang, Fan and Adluru, Nagesh and Nath, Vishwesh and Pathak, Sudhir and Schneider, Walter and Gade, Anurag and Rathi, Yogesh and Hendriks, Tom and Vilanova, Anna and Chamberland, Maxime and Pieciak, Tomasz and Ciupek, Dominika and Tristán-Vega, Antonio and Aja-Fernández, Santiago and Malawski, Maciej and Ouedraogo, Gani and Machnio, Julia and Ewert, Christian and Thompson, Paul M. and Jahanshad, Neda and Garyfallidis, Eleftherios and Landman, Bennett A.", journal = "Machine Learning for Biomedical Imaging", volume = "2", issue = "August 2024 issue", year = "2024", pages = "1083--1105", issn = "2766-905X", doi = "https://doi.org/10.59275/j.melba.2024-9c68", url = "https://melba-journal.org/2024:019" }
RISTY - JOUR AU - Newlin, Nancy R. AU - Schilling, Kurt AU - Koudoro, Serge AU - Chandio, Bramsh Qamar AU - Kanakaraj, Praitayini AU - Moyer, Daniel AU - Kelly, Claire E. AU - Genc, Sila AU - Chen, Jian AU - Yang, Joseph Yuan-Mou AU - Wu, Ye AU - He, Yifei AU - Zhang, Jiawei AU - Zeng, Qingrun AU - Zhang, Fan AU - Adluru, Nagesh AU - Nath, Vishwesh AU - Pathak, Sudhir AU - Schneider, Walter AU - Gade, Anurag AU - Rathi, Yogesh AU - Hendriks, Tom AU - Vilanova, Anna AU - Chamberland, Maxime AU - Pieciak, Tomasz AU - Ciupek, Dominika AU - Tristán-Vega, Antonio AU - Aja-Fernández, Santiago AU - Malawski, Maciej AU - Ouedraogo, Gani AU - Machnio, Julia AU - Ewert, Christian AU - Thompson, Paul M. AU - Jahanshad, Neda AU - Garyfallidis, Eleftherios AU - Landman, Bennett A. PY - 2024 TI - MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI T2 - Machine Learning for Biomedical Imaging VL - 2 IS - August 2024 issue SP - 1083 EP - 1105 SN - 2766-905X DO - https://doi.org/10.59275/j.melba.2024-9c68 UR - https://melba-journal.org/2024:019 ER -

2024:019 cover

Disclaimer: the following html version has been automatically generated and the PDF remains the reference version. Feedback can be sent directly to publishing-editor@melba-journal.org

1 Introduction

Diffusion-weighted magnetic resonance imaging (DW-MRI) of the brain enables in-vivo characterization of white matter microstructure and supports structural brain connectivity mapping (Pierpaoli et al., 1996). DW-MRI acquisition involves varying magnetic field strength with pulsed gradients to sensitize to the movement of water molecules, following the Stejskal and Tanner method (Stejskal and Tanner, 1965). There are a growing number of multi-site DW-MRI studies that encompass varying scanner manufacturers and acquisition protocols. An imaging “site” refers to the acquisition protocol parameters and scanner specifications used to collect an image. Initiatives such as the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (Jack et al., 2008), the National Alzheimer’s Coordinating Center (NACC) (NAC, ), the Open Access Series of Imaging Studies (OASIS3) (LaMontagne et al., 2019), and the Baltimore Longitudinal Study of Aging (BLSA) (Ferrucci, 2008) incorporate data from diverse scanner vendors and protocols.

Diffusion imaging inherits site-effects of conventional MRI caused by magnetic field inhomogeneities, field strength, voxel size, and vendor differences. Additionally, it has unique challenges related to diffusion sensitization and processing, including the number of directions the gradient field is applied, the timing of applied gradients, and the DW-MRI reconstruction algorithm (Nencka et al., 2018; Ni et al., 2006). Perturbations to these acquisition decisions within and across datasets can introduce significant confounding site-dependent differences in DW-MRI and subsequent connectomic and bundle analyses. Vollmar et al. demonstrated confounding site differences in the analysis of whole brain, region of interest, and tract-defined microstructure in a cohort of traveling subjects scanned with various protocols and scanner vendors (Vollmar et al., 2010). Similar findings emerged from studies involving multiple vendors, models, and protocols, leading to substantial sources of variation (Karayumak et al., 2019). This variability extends to derived metrics, such as tractography bundles (Schilling et al., 2021) and complex network measures (Newlin et al., 2023; Onicas et al., 2022). Schilling et al. indicated that fiber bundle shape and microstructure analysis were influenced by scanner manufacturer, acquisition protocol, diffusion sampling scheme, diffusion sensitization, and overall bundle processing workflow (Schilling et al., 2021). Joint datasets from retrospective studies by Newlin et al. and Onicas et al. revealed significant differences in complex network measures, such as modularity, global efficiency, clustering coefficient, density, characteristic path length, small-worldness, and average betweenness centrality, attributed to variations in protocol and scanner vendor (Newlin et al., 2023; Onicas et al., 2022).

Consequently, there is a clear imperative to address these site-effects in connectivity and structure analyses through a process commonly known as ”harmonization”. Diffusion image harmonization refers to methods, using preprocessing, machine learning, resampling, etc., that reduce bias associ ated with data collection and storage while preserving biological variation (Pinto et al., 2020).

The Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI (QuantConn) challenge is intended to detail and evaluate current image acquisition harmonization techniques. Our evaluation is focused on reproducibility of downstream tasks (tractography, connectomics, tractometry) and their features (bundle macrostructure and microstructure, complex network measures of the connectome).

2 Related Works

Refer to caption
Figure 1: We released 206 scans across two acquisitions, “A” (blue) and “B” (orange). Acquisition A was acquired with anisotropic resolution and 27 gradient directions. Acquisition B was acquired with isotropic resolution and 94 gradient directions. Participants altered the DW-MRI with the harmonization of their choosing. We then feed this harmonized data through a standard processing pipeline of tensor fitting, orientation distribution function (ODF) estimation, tractography, and tractometry to arrive at diffusion metrics of bundles microstructure and macrostructure and connectomics. Harmonization efficacy is determined by its capacity to minimize differences in these downstream diffusion measures.

QuantConn builds off the successful SuperMUDI (Pizzolato et al., 2020) and MUSHAC (Ning et al., 2019) challenges. SuperMUDI challenged the community with overcoming acquisition differences in voxel anisotropy with superresolution. The study consisted of N=5 volunteers with high in-plane resolution and lower axial resolution. In the MUSHAC challenge, participants were given datasets (N=15 volunteers) that were acquired on two different scanners and two protocols on each and tasked with minimizing cross-scanner and cross-protocol differences in voxel-wise indices. We provide 103 pairs of datasets of the same subjects scanned with different acquisition protocols - and challenge participants to minimize differences in the data in order to minimize differences in microstructure, tractography bundle analysis, and connectomics measures (Figure 1). The QuantConn challenge introduces several important advancements. Firstly, it is the first to evaluate bundles, tractography, and connectomics within the context of harmonization. It features a dataset that includes ten times more subjects than MUSHAC and a hundred times more than SuperMUDI. The data that form the basis of this challenge represent a difficult clinical scenario for harmonization and are part of a much larger twins study, which could provide a rich context for continuing validation and extension of this challenge’s findings (Strike et al., 2023).

3 Methods

Data includes 103 patients, scanned twice with acquisition protocol “A” and “B”. Challenge participants created or applied harmonization method of their choice to bridge the gap between acquisitions (Table 1). Submissions’ harmonized data was processed using a standard diffusion pipeline of tensor fitting, tissue segmentation, tractography, connectomics, and tractometry (Figure 2). Then, we evaluated the cross-acquisition agreement of diffusion features resulting from this pipeline.

Table 1: Summary of submissions (Sub) and their 3 scores. Scores are computed by taking the average intra-class correlation coefficient (ICC) (rater=acquisition, measurements=25 test subjects). For connectomics (Conn), we averaged ICC for all 10 complex network measures. For microstructure (Micro), we averaged ICC for all 6 bundles and 4 measures. For macro (Macro), we averaged ICC for all 6 bundles and 6 measures. See supplementary information for in-depth analysis of the stability of rankings for each metric individually.
SubMethodConn ScoreMicro ScoreMacro Score
RefNone0.79±plus-or-minus\pm0.140.24±plus-or-minus\pm0.230.49±plus-or-minus\pm0.28
1PreQual, registration, voxel-wise map with MLP0.98±plus-or-minus\pm0.050.79±plus-or-minus\pm0.250.70±plus-or-minus\pm0.29
2Normalization, RISH feature map0.84±plus-or-minus\pm0.160.49±plus-or-minus\pm0.210.62±plus-or-minus\pm0.28
3PreQual, Spatial and angular resampling0.79±plus-or-minus\pm0.160.69±plus-or-minus\pm0.110.45±plus-or-minus\pm0.32
4PreQual, registration, DeepHarmony0.86±plus-or-minus\pm0.080.36±plus-or-minus\pm0.260.37±plus-or-minus\pm0.26
5PreQual, bias field correction, RISH feature map0.81±plus-or-minus\pm0.100.30±plus-or-minus\pm0.120.48±plus-or-minus\pm0.28
6PreQual0.75±plus-or-minus\pm0.180.29±plus-or-minus\pm0.190.45±plus-or-minus\pm0.26
7PreQual, registration, MLP with dynamic input0.75±plus-or-minus\pm0.140.24±plus-or-minus\pm0.200.44±plus-or-minus\pm0.28
8PreQual, map SH coefs from acquisition A to B0.24±plus-or-minus\pm0.240.46±plus-or-minus\pm0.230.07±plus-or-minus\pm0.11
9PreQual, map SH coefs from acquisition B to A0.18±plus-or-minus\pm0.240.40±plus-or-minus\pm0.200.06±plus-or-minus\pm0.10

3.1 Data

The data that form the basis of this challenge represent a difficult clinical scenario for harmonization and are a subset of the Queensland Twin Imaging study (Strike et al., 2023). The DW dataset consists of 25 testing and 78 training subjects scanned twice with two different acquisition protocols for a total of 206 scanning sessions. Each subject has an anatomical T1-weighted image. The data subset is comprised of 45% females, ages 25.3 ±plus-or-minus\pm 1.8 years. No subjects reported a history of significant head injury, neurological or psychiatric illness, or substance abuse or dependence, and no subjects had a first-degree relative with a psychiatric disorder. All subjects were right-handed as determined using 12 items from Annett’s Handedness Questionnaire (Annet, 1970). Scanning was performed at the QIMR Berghofer Medical Research Institute on a 4 Tesla Siemens Bruker Medspec scanner.

3.1.1 Anatomical Imaging

T1-weighted images were acquired with an inversion recovery rapid gradient-echo sequence (inversion/repetition/echo times, 700/1500/3.35 ms; flip angle, 8 degrees; slice thickness, 0.9 mm; 256 x𝑥x 256 acquisition matrix).

3.1.2 Acquisition A

DW images were acquired using single-shot echo-planar imaging with a twice-refocused spin echo sequence. Imaging parameters were repetition/echo times of 6090/91.7 ms, field of view of 23 cm, and 128 × 128 acquisition matrix. Each 3D volume consisted of 21 axial slices 5 mm thick with a 0.5 mm gap and 1.8 × 1.8 mm2𝑚superscript𝑚2mm^{2} in-plane resolution, total time = 3 minutes. Thirty images were acquired per subject: three with no diffusion sensitization (b=0 s/mm2𝑠𝑚superscript𝑚2s/mm^{2}) and 27 DW images (b = 1146 s/mm2𝑠𝑚superscript𝑚2s/mm^{2} ) with gradient directions uniformly distributed on the hemisphere.

3.1.3 Acquisition B

DW images were acquired using single-shot echo planar imaging (EPI) with a twice-refocused spin echo sequence. Imaging parameters were: 23s cm FOV, TR/TE 6090/91.7 ms, with a 128 × 128 acquisition matrix. Each 3D volume consisted of 55 2-mm thick axial slices with no gap and a 1.79 × 1.79 mm2𝑚superscript𝑚2mm^{2} in-plane resolution with total acquisition time = 14.2 minutes. 105 images were acquired per subject: 11 with no diffusion sensitization (b=0 s/mm2𝑠𝑚superscript𝑚2s/mm^{2}) and 94 DWI (b = 1159 s/mm2𝑠𝑚superscript𝑚2s/mm^{2}) with gradient directions uniformly distributed on the hemisphere.

3.2 Diffusion Processing

Refer to caption
Figure 2: Using the harmonized data provided by participants, the full testing pipeline is as follows: tensor fitting, fODF estimation, whole brain tractography, bundle segmentation and tractometry, then connectomics, and finally complex network analysis. These processes result in three groups of analysis: complex network measures, bundle microstructure, and bundle macrostructure, which we evaluate the submissions on.

Tensor models are fit to the log-signal by first least squares (WLS) weighted by empirical signal intensities, and then iterative WLS using the previous step’s signal prediction (Basser et al., 1994). Then, we generate whole-brain maps of tensor parameters: mean diffusivity (MD), fractional anisotropy (FA), axial diffusivity (AD), radial diffusivity (RD). The tensor has three eigenvalues: one principal, equivalent to AD, and two non-principal, the mean of which is RD. MD is the average magnitude of observed diffusion (Westin et al., 2002). FA is a scaled variability measure of diffusivity (Westin et al., 2002).

We ran Freesurfer on all T1-weighted images to get anatomical parcellations. This provides a 5-tissue type image, grey-matter white matter boundary region segmentation, and an 84-node grey-matter parcellation (Fischl, 2012).The grey-matter parcellation, defined by the Desikan-Killany grey-matter atlas, is used to map tractography to anatomical regions for connectomics analysis (Klein and Tourville, 2012).

From the image signal, we can estimate white matter fiber orientation distribution functions using constrained spherical deconvolution (Tournier et al., 2013, 2007). fODF models reflect the probability that a real brain fiber lies along a given direction (Tournier et al., 2007). These models guide tractography. Seeding from the grey-matter white matter boundary, we perform second order integration over fODFs until 10 million streamlines are sampled (Newlin et al., 2023; Tournier et al., 2019).

We segment the tractogram into 30 bundles with RecoBundlesX (Garyfallidis et al., 2018), but only keep 6 major tracks for analysis: bilateral Arcuate Fasciculus (AF), bilateral Optic Radiations (OR), and major/minor corpus colosum forceps. These bundle microstructure profiles of this bundles are created by projecting DTI maps onto the bundles using BUAN approach Chandio et al. (2020), bundle profiles are averaged and result in average FA, MD, RD, and AD for each bundle. In addition, we characterize bundles by their macrostructure and shape. Here, we consider bundle length, volume, curl, span, diameter, and surface area (sci, ).

We combine the tractogram with the 84 node grey-matter parcellation from Freesurfer, Deskian-Killany (Fischl, 2012). For each region i𝑖i, we compute the number of streamlines and average length of streamline connecting region i𝑖i to every other region. We store this value in an adjacency matrix called a connectome (Tournier et al., 2019).

The connectome is comprised of nodes, brain regions defined by the Desikan-Killany atlas, and edges, a scalar value reflecting the strength of the connection. The brain connectivity toolbox (version-2019-03-03) (Rubinov and Sporns, 2010) specializes in complex network measures that summarize the connectivity behavior of a connectome. We consider 12 complex network measures as connectomics measures. The importance of brain regions in the network is measured with node strength and betweenness centrality. Connectome density is the number of connections found versus the total possible connections. Functional integration is measured with characteristic path length. Characteristic path length, edge count, and global and local efficiency measure the ability to exchange information. Modularity and participation coefficient describe the community structure of the brain network. Assortativity measures the resiliency of a network to connection drop out. Together, these complex network measures characterize brain structural connectivity.

3.3 Harmonization Methods

The QuantConn challenge received a total of 9 unique submissions (Table 1). The following section details the preprocessing and harmonization methodology used in each submission.

3.3.1 Baseline

DW-MRI was not preprocessed. No harmonization method was applied.

3.3.2 Submission 1: The Harmonizers 1

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, and echo-planar imaging (EPI) distortions (Cai et al., 2021). Additionally, average b=0 images of the subject’s diffusion data of acquisition A and B were mapped to the subject’s T1 image with boundary-based rigid registration (Greve and Fischl, 2009) after running FreeSurfer (Fischl, 2012) on the T1 images. The obtained mappings were used to map the subject’s acquisition A data to the respective acquisition B data, where the mapped data was regridded to the acquisition B target resolution. For the harmonization step, an MLP was trained to summarize the acquisition A and acquisition B in a voxel-wise manner. The subject’s data from both sites were concatenated along the channel dimension per voxel for input. The output format was chosen based on acquisition B to leverage its inclusion of more b-vectors, potentially enhancing tractography. The neural network was an MLP with one hidden layer containing 4096 units, batch normalization, and a ReLU activation. The training set included 75 subjects, whereas the validation set included 3 subjects. The network was trained for 40 epochs with a mean squared reconstruction loss using the learning rate 0.01.

3.3.3 Submission 2: NIMG

The DW-MRI scans were processed using two software tools, MRtrix3 Tournier et al. (2019) and FSL Jenkinson et al. (2012), to eliminate magnetic field inhomogeneities, signal inhomogeneities, and volume drift across the diffusion gradients. The harmonization of the DW-MRI data involved three major steps. Firstly, a global DW-MRI intensity normalization was performed on a group of subjects using MRtrix3, with the median b=0 white matter value serving as the reference. Secondly, a joint eddy current-induced distortion correction was applied to the concatenated DWI data over the two sites per subject. Thirdly, an individual DW-MRI intensity normalization was performed using MRtrix3 for each subject based on the b=0 signal. Finally, an algorithm called the DW-MRI harmonization algorithm was employed. This algorithm leverages rotation invariant spherical harmonics (RISH) features to construct a scale map for each pair of reference and target sites. The scale map was then applied to the target site to remove inter-site variability (Karayumak et al., 2019; Mirzaalian et al., 2018). The code is open source (Billah et al., ).

3.3.4 Submission 3: NeSH

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, and EPI distortions (Cai et al., 2021). Additionally, DW-MRI was noise corrected and de-gibbs ringing with MRTrix3 (Tournier et al., 2019). DW-MRI were harmonized by resampling spatial and angular resolutions to be the same between acquisitions. Spatial resolution was set to be 1.79688 isotropic. The sampled directions were a combination of the directions used in site A and site B. The 27 original b-vectors for A were used, alongside 33 b-vectors from B that were least similar to the b-vectors from A. The resampling process used NeSH to create continuous implicit neural representations of the DW-MRI that can be sampled in any spatial and angular resolution (Hendriks et al., 2023). Optimal training parameters were batch size=1000, epochs=7, learning rate=1×1061superscript1061\times 10^{-6}, layers=8, lambda=1×1061superscript1061\times 10^{-6}, sigma=4.

3.3.5 Submission 4: DiffusionMaRInes

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, and echo-planar imaging (EPI) distortions (Cai et al., 2021). All non-diffusion-weighted MR data were averaged into a single volume. This submission harmonized the angular resolution of acquisition B to match acquisition A and the spatial resolution of acquisition B to match acquisition A. The datasets from acquisition A to acquisition B were spatially interpolated using a trilinear interpolation. All harmonized DW-MRI had 27 gradient directions and 1.8 mm isotropic resolution. The resampling process used the DeepHarmony model (Dewey et al., 2019) with (epochs=20, batch size=8, features=16, Adam beta1=0.9, Adam beta2=0.999, patch size=64x64). The training procedure involved 10 randomly selected subjects from the training set using 6-fold cross-validation. Images from acquisition A were linearly registered to their acquisition B counterpart. The registration transform was computed on respective FA maps with 6 degrees of freedom and optimized using mutual information (Jenkinson et al., 2012).

3.3.6 Submission 5: Neuroscience Advanced Clinical Imaging Service (NACIS)

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, echo-planar imaging (EPI) distortions (Cai et al., 2021), and B1-bias field correction using ANTs N4. DW-MRI signal was harmonized by mapping rotationally invariant features of the spherical harmonic representation to a common domain (acquisition B) (Karayumak et al., 2019; Mirzaalian et al., 2018). The code is open source (Billah et al., ). This method consists of creating a template mapping between the rotationally invariant features with N=10 matched subject pairs from both sites.

3.3.7 Submission 6: PreQual

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, and EPI distortions (Cai et al., 2021).

3.3.8 Submission 7: The Harmonizers 2

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, and EPI distortions (Cai et al., 2021). This submission harmonized across acquisitions by registering the scans of the same subject together, spatial resampling to common domain (acquisition B) and correcting for signal differences with voxel-wise mapping. Average b=0 images of the subject’s diffusion data of acquisitions A and B were mapped to the subject’s T1 image with boundary-based rigid registration (Greve and Fischl, 2009) after running FreeSurfer (Fischl, 2012) on the T1-weighted images. The obtained mappings were used to map the subject’s acquisition A data to the respective acquisition B data, where the mapped data was re-grided to the acquisition B target resolution. The neural network architecture was built to accommodate flexible inputs in a voxel-wise manner: only data from acquisition A, only data from acquisition B, or both. This architecture consisted of an encoder for site A data and a separate encoder for site B data. Both encoders were MLPs with two hidden layers (128 units) with batch normalization and a ReLU activation as well as a linear output layer (95 units). If both A and B data were presented, they were passed through the respective encoder, and the outputs were combined using the mean, yielding a single latent vector. If only A or B data were presented, the data were passed through the respective encoder yielding a latent vector without the need for combination. Regardless of the number of inputs, this latent vector was passed through a decoder with two hidden layers (128 units) with batch normalization and a ReLU activation, as well as one linear output layer (95 units). For every batch, predictions and latent vectors were obtained for a forward pass of A only, B only, and A and B jointly. The closeness of the three latent vectors was incentivized by summing up the pairwise mean squared differences of these latent vectors (latent loss). To incentivize accurate predictions, the mean squared differences of the prediction from A, the prediction from B, and the prediction from A and B to the target data B were summed up (reconstruction loss). Finally, the latent and reconstruction losses were added. The network was trained for 100 epochs with an initial learning rate of 0.1 which was reduced every 25 epochs, i.e. multiplied by 0.1.

3.3.9 Submission 8: SimpleHarmonics 2

All DW-MRI were processed using the PreQual pipeline to remove eddy current, motion, and EPI distortions (Cai et al., 2021). To harmonize the data, this submission utilized the classic Laplacian approach of spherical harmonic modeling (Garyfallidis et al., 2014) to extract and correct the signal of the ’target’ and ‘reference’ acquisitions. Spherical harmonic representations (order 6) were fit to all DW-MRI. Then, the coefficients (SH coefs) were linearly multiplied with the gradient table of the target site, or acquisition B, to obtain the diffusion signal. The original spacing and number of slices were made consistent with a 3D resample from the AFNI toolkit (Cox, 1996).

3.3.10 Submission 9: SimpleHarmonics 1

This submission used the same methods as Submission 1, but the ’target’ is Acquisition A, and the ’reference’ is Acquisition B.

3.4 Evaluating harmonization

Successful harmonization has two main components: reducing acquisition-related bias, and preserving biological variability. We evaluate acquisition related bias by computing Cohen’s D between measures derived from acquisition A and B, as well as the Wilcoxon ranksum test to compare medians (p-value of 0.05 is considered significant). Together these tests inform us if the distributions are significantly different and to what extent. Cohen’s D is interpreted as follows: standardized effect-size of 0.2 is small, 0.5 is medium, and above 0.8 is large (Cohen, ).

We evaluate cross-acquisition bundle shape similarity with the BUAN bundle shape similarity score, a bounded metric between 0 (no similarity) and 1 (near perfect similarity). The BUAN bundle similarity score is a graph-theoretic approach to compare the shapes of two bundles of the same type using bundle adjacency metrics (Chandio et al., 2020; Garyfallidis et al., 2012). The BUAN shape similarity score between two bundles is determined by how close two bundles are by computing the minimum flip distance between their respective streamlines and constructing two bundles’ coverage of each other.

For the best-performing harmonization method, we further assess how well it maintains biological variation and the similarity of cross-acquisition bundle shapes. We evaluate biological variation by comparing coefficient of variation (CoV) of the baseline from each acquisition, and CoV after harmonization. Ideally, image acquisition protocol should not influence inter-subject variation. Cross-acquisition differences in CoV suggests there are sources of variation beyond biological that are influencing measurements. Further, effective harmonization would maintain biological variation (CoV should not collapse to near zero), and that between subject variation should be consistent across acquisitions.

Competition rankings were based on the average ICC for each metric category (connectomics, microstructure, macrostructure). ChallengeR is an open-source toolkit for analyzing and visualizing challenge results (Wiesenfarth et al., 2021). Using this toolkit, we provide a detailed benchmarking report of each metric individually as supplementary information.

4 Results

4.1 Connectomics

We evaluate each submission on its ability to harmonize complex network features of the connectome (Figure 3). Submissions 1, 2, and 3 successfully removed significant confounding acquisition effects in all 12 complex network measures and reduced the effect-size difference to “small”. Submissions 5, 8 and 9 were generally unsuccessful in removing this effect, and for density, global and local efficiency, and path length these methods introduced cross acquisition differences that were not in the baseline measures.

Refer to caption
Figure 3: Successful harmonization methods will reduce significant acquisition effects in these measures from the un-harmonized reference (“Ref”). Slashes indicate significant difference (p <\textless 0.05) in median between measures derived from acquisitions A and B. We compute Cohen’s D effect-size differences between connectomics measures from acquisitions A and B.

4.2 Bundle macrostructure

We evaluate the difference in cross-acquisition macrostructure measurements of the same subjects with Cohen’s D and the Mann-Whitney U-test (Figure 4). All but submission 1 failed to remove acquisition bias from all measures for bilateral arcuate fasciculus. Across all bundles and macrostructural measures studied, Harmonizers 1 reduced Cohen’s D and removed significant median acquisition bias. Submissions 8 and 9 were unsuccessful at harmonizing these measures and introduced significant biases instead. We study The Harmonizers 1 bundle macrostructure harmonization more deeply in Figure 5. BUAN bundle similarity score is a graph-theoretic approach to compare the shapes of two bundles of the same type using bundle adjacency metrics (Chandio et al., 2020). We interpret higher values (closer to 1) as extremely close in shape and 0 as no shape similarity. While no reconstructions achieve perfect agreement, after The Harmonizers 1’s harmonization is applied, cross-acquisition agreement increases in all six bundles except for AF left. Additionally, we compute BUAN bundle shape similarity scores across the entire test dataset for all submissions (supplementary Figure 1).

Refer to caption
Figure 4: We evaluate each submission on their ability to harmonize macrostructural features of 6 bundles. Successful harmonization will reduce significant acquisition effects in these features from the reference (“Ref”). We report normalized effect-size with Cohen’s D. Slashes indicate a significant difference (p << 0.05) in median between features derived from acquisitions A and B.
Refer to caption
Figure 5: We compare the cross-acquisition (Acquisition A is blue, Acquisition B is orange) shape agreement of reconstructed bundles for one subject in the un-harmonized reference dataset and top performing harmonization technique (The Harmonizers 1) with the BUAN shape similarity metric. Shape similarity scores are reported in the upper right corner of each visualization, with S𝑆S being the BUAN shape similarity for reference data and S𝑆S* the BUAN shape similarity for harmonized data. Bundle shape similarity score ranges from 0 to 1. The higher the value, the more similar the two bundles are in shape, and the lower value suggests low shape similarity.

4.3 Bundle microstructure

We evaluate the differences in cross-acquisition microstructure measurements of the same subjects with Cohen’s D and the Mann-Whitney U-test (Figure 6). Overall, microstructure had the least number of significant acquisition differences in AD, FA, MD, and RD across the size bundles studied. FA was the only measure that had significant differences in median in half of the bundles (arcuate fascicles right, optical radiation left, and optical radiation right). In fact, all teams except the Harmonizers 1 introduced differences where there were none to ameliorate. Across all bundles and microsructure measures studied, Harmonizers 1 reduced Cohen’s D and removed significant median acquisition bias.

Refer to caption
Figure 6: We evaluate each submission on their ability to harmonize microstructural features of 6 bundles. Successful harmonization will reduce significant acquisition effects in these features from the un-harmonized reference (“Ref”). We report normalized effect-size with Cohen’s D. Slashes indicate significant difference (p ¡ 0.05) in median between features derived from acquisitions A and B.

4.4 Preserving biological differences

Refer to caption
Figure 7: We compute CoV from The Harmonizers 1 and the baseline (BL) connectomics, macrostructure, and microstructure measures for each acquisition.

Successful harmonization reduces acquisition bias while preserving biological variance and observed variance should not differ with acquisition protocol. We evaluated the top-performing submission, The Harmonizers 1, based on the amount of variance preserved after harmonization and the agreement between acquisitions (Figure 7). Similar analyses were conducted on NIMG (Supplementary Figure 2) and NeSH (Supplementary Figure 3). We found that in the baseline data, inter-subject variation was not equal across acquisitions for any measures studied (Figure 7). Neither acquisition’s CoV was consistently higher or lower than the other across all measures studied. Generally, The Harmonizers 1 (Figure 7), NIMG (Supplementary Figure 2), and NeSH (Supplementary Figure 3) made progress toward equalizing cross-acquisition CoV and maintained inter-subject variation. In none of the methods do we see a dramatic decrease in CoV after harmonization is applied. We note the following exceptions. Data harmonized using The Harmonizers 1 methodology showed worse agreement of cross-acquisition inter-subject variation in the following measures: average length, span, and RD of AF left bundles; curl in forceps major bundles; and average length, span, diameter, FA, MD, and RD in OR left bundles (Figure 7). Notably, average length, span, curl, FA, MD, AD, and RD exhibited lower overall variation compared to volume, diameter, and surface area, a finding consistent across the six bundles.

5 Discussion

In this study we isolate acquisition related differences in a DW-MRI cohort. Specifically, our data includes two distinct angular and spatial resolutions that impact the quality of subsequent models and measurements. To continue dissecting site effects in future studies, a more challenging cohort would contain DW-MRI collected on different scanners and more than two sites. Participants of the QuantConn challenge approached the harmonization problem in a variety of ways. The NeSH method targeted and ameliorated the known acquisition differences (spatial and angular resolution). Other methods chose to learn the complex culmination of those differences on the signal (The Harmonizers – 1, DiffusionMaRines, The Harmonizers – 2) or derivative representation (NIMG, NACIS, SimpleHarmonics – 1, SimpleHarmonics – 2). In many real-world cases, acquisition or scanner differences are complex and difficult to disentangle to ameliorate individually. future DW-MRI harmonization studies can consider more complex architectures (Cycle-GAN (Hansen et al., 2022; Zhu et al., 2017), StyleGAN (Karras et al., 2019)) or adaptions of novel MRI harmonization methods (Hu et al., 2023; Zuo et al., 2021).

6 Conclusion

As a field, we are working toward quantitative analysis of DW-MRI. This necessitates connectomic and tractometry measurements that are robust and reproducible. In this challenge we studied current methods for harmonization of data such that measurements minimize bias from acquisition heterogeneity. As the number of international datasets combining dissimilar acquisitions, scanner manufacturers, and gradient coils grows, so does the need for correction methods to make such data comparable. We find that the most effective harmonization method corrects for motion, eddy-current, and gibbs-ringing distortions, is registered across acquisitions, and voxel-wise signal predicted using an MLP. The top three performing methods were the Harmonizers – 1, NIMG, and NeSH. We find the following drawbacks and limitations. The Harmonizers – 1 implemented a MLP that requires the same subjects to be scanned with each acquisition protocol and co-registered. Such non-linear registration introduces error (Bierbrier et al., 2022) and warps underlying subject-specific macrostructures. NIMG’s harmonization method does not require perfectly matched pairs, but does rely on non-linear intra- and inter- subject registration of RISH feature maps. This method of mapping RISH features to a common site was also successful in a previous multi-scanner harmonization challenge (Ning et al., 2020). NeSH methodology, notably, does not rely on registration or matched subjects.


Acknowledgments

The Queensland Twin Imaging study data collection was funded by National Institute of Child Health and Human Development (R01 HD050735), and National Health and Medical Research Council (496682, 1009064). This work was also supported in part by NIH grants R01EB017230, NIDDK K01EB032898, R01NS123378, P50HD105353, RF1AG057892 (FiberNET project grant), R01 MH134004 and R01MH119222 (PIs: Rathi, O’Donnell).
Tomasz Pieciak acknowledges the Polish National Agency for Academic Exchange for grant PPN/BEK/2019/1/00421 under the Bekker programme and the Ministry of Science and Higher Education (Poland) under the scholarship for outstanding young scientists (692 / STYP /13/2018).
Tomasz Pieciak, Antonio Tristán Vega and Santiago Aja-Fernández were supported by research grants PID2021-124407NBI00, funded by MCIN/AEI/ 10.13039/ 501100011033 /FEDER, UE, and TED2021-130758B-I00, funded by MCIN /AEI/ 10.13039/ 501100011033 and the European Union “NextGenerationEU/PRTR”.
Dominika Ciupek, Maciej Malawski and Julia Machnio were supported by the European Union’s Horizon 2020 research and innovation program under grant agreement Sano No 857533 and the International Research Agendas program of the Foundation for Polish Science No MAB PLUS/2019/13.
JYMY and SG received positional funding from the Royal Children’s Hospital Foundation (RCHF 2022-1402). CEK was supported by an Australian Government Research Training Program (RTP) Scholarship, Monash University (Monash Graduate Excellence Scholarship), and the Australian National Health and Medical Research Council (NHMRC; Centre of Research Excellence in Newborn Medicine 1153176). JYMY, SG, JC and CEK acknowledge the support of the Royal Children’s Hospital, Murdoch Children’s Research Institute, The University of Melbourne Department of Paediatrics, and the Victorian Government’s Operational Infrastructure Support Program. Ye Wu is in part supported by the National Key Research and Development Program of China (No. 2023YFF1204800) and the National Natural Science Foundation of China (No. 62201265). Jianzhong He is supported by the National Natural Science Foundation of China (No. 62303413). Fan Zhang is in part supported by the National Key Research and Development Program of China (No. 2023YFE0118600) and the National Natural Science Foundation of China (No. 62371107). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.


Ethical Standards

The work follows appropriate ethical standards in conducting research and writing the manuscript, following all applicable laws and regulations regarding treatment of animals or human subjects.


Conflicts of Interest

We declare we don’t have conflicts of interest.


Data availability

Testing and Training data used in this challenge are publicly available at shared Box: https://vanderbilt.app.box.com/s/owijt2mo2vhrp3rjonf90n3hoinygm8z/folder/208448607516

References

  • (1) National alzheimer’s coordinating center. URL https://naccdata.org/.
  • (2) scilus/scilpy: The sherbrooke connectivity imaging lab (scil) python dmri processing toolbox. URL https://github.com/scilus/scilpy.
  • Annet (1970) Marion Annet. A classification of hand preference by association analysis. British Journal of Psychology, 61:303–321, 1970. ISSN 20448295. .
  • Basser et al. (1994) Peter J. Basser, James Mattiello, and Denis Lebihan. Estimation of the effective self-diffusion tensor from the nmr spin echo. Journal of magnetic resonance. Series B, 103:247–254, 1994. ISSN 1064-1866. . URL https://pubmed.ncbi.nlm.nih.gov/8019776/.
  • (5) Tashrif Billah, Sylvain Bouix, Suheyla Cetin Karayumak, and Yogesh Rathi. pnlbwh/dmriharmonization: Multi-site dmri harmonization. URL https://github.com/pnlbwh/dMRIharmonization.
  • Cai et al. (2021) Leon Y. Cai, Qi Yang, Colin B. Hansen, Vishwesh Nath, Karthik Ramadass, Graham W. Johnson, Benjamin N. Conrad, Brian D. Boyd, John P. Begnoche, Lori L. Beason-Held, Andrea T. Shafer, Susan M. Resnick, Warren D. Taylor, Gavin R. Price, Victoria L. Morgan, Baxter P. Rogers, Kurt G. Schilling, and Bennett A. Landman. Prequal: An automated pipeline for integrated preprocessing and quality assurance of diffusion weighted mri images. Magnetic resonance in medicine, 86:456–470, 7 2021. ISSN 1522-2594. . URL https://pubmed.ncbi.nlm.nih.gov/33533094/.
  • Chandio et al. (2020) Bramsh Qamar Chandio, Shannon Leigh Risacher, Franco Pestilli, Daniel Bullock, Fang-Cheng Yeh, Serge Koudoro, Ariel Rokem, Jaroslaw Harezlak, and Eleftherios Garyfallidis. Bundle analytics, a computational framework for investigating the shapes and profiles of brain pathways across populations. 2020. . URL https://doi.org/10.1038/s41598-020-74054-4.
  • (8) Jacob Cohen. Statistical power analysis for the behavioral sciences second edition.
  • Cox (1996) Robert W. Cox. Afni: Software for analysis and visualization of functional magnetic resonance neuroimages. Computers and Biomedical Research, 29:162–173, 6 1996. ISSN 0010-4809. .
  • Dewey et al. (2019) Blake E. Dewey, Can Zhao, Jacob C. Reinhold, Aaron Carass, Kathryn C. Fitzgerald, Elias S. Sotirchos, Shiv Saidha, Jiwon Oh, Dzung L. Pham, Peter A. Calabresi, Peter C.M. van Zijl, and Jerry L. Prince. Deepharmony: A deep learning approach to contrast harmonization across scanner changes. Magnetic resonance imaging, 64:160–170, 12 2019. ISSN 1873-5894. . URL https://pubmed.ncbi.nlm.nih.gov/31301354/.
  • Ferrucci (2008) Luigi Ferrucci. The baltimore longitudinal study of aging (blsa): A 50-year-long journey and plans for the future. Journals of Gerontology - Series A Biological Sciences and Medical Sciences, 63:1416–1419, 2008. ISSN 10795006. . URL /record/2009-01339-015.
  • Fischl (2012) Freesurfer Bruce Fischl. Freesurfer. 2012. .
  • Garyfallidis et al. (2012) Eleftherios Garyfallidis, Matthew Brett, Marta Morgado Correia, Guy B. Williams, and Ian Nimmo-Smith. Quickbundles, a method for tractography simplification. Frontiers in Neuroscience, 6, 2012. .
  • Garyfallidis et al. (2014) Eleftherios Garyfallidis, Matthew Brett, Bagrat Amirbekian, Ariel Rokem, Stefan van der Walt, Maxime Descoteaux, and Ian Nimmo-Smith. Dipy, a library for the analysis of diffusion mri data. Frontiers in Neuroinformatics, 8, 2 2014. ISSN 16625196. .
  • Garyfallidis et al. (2018) Eleftherios Garyfallidis, Marc Alexandre Côté, Francois Rheault, Jasmeen Sidhu, Janice Hau, Laurent Petit, David Fortin, Stephen Cunanne, and Maxime Descoteaux. Recognition of white matter bundles using local and global streamline-based registration and clustering. NeuroImage, 170:283–295, 4 2018. ISSN 1095-9572. . URL https://pubmed.ncbi.nlm.nih.gov/28712994/.
  • Greve and Fischl (2009) Douglas N. Greve and Bruce Fischl. Accurate and robust brain image alignment using boundary-based registration. NeuroImage, 48:63–72, 10 2009. ISSN 1095-9572. . URL https://pubmed.ncbi.nlm.nih.gov/19573611/.
  • Hansen et al. (2022) Colin B. Hansen, Kurt G. Schilling, Francois Rheault, Susan Resnick, Andrea T. Shafer, Lori L. Beason-Held, and Bennett A. Landman. Contrastive semi-supervised harmonization of single-shell to multi-shell diffusion mri. Magnetic Resonance Imaging, 93:73–86, 11 2022. ISSN 0730-725X. .
  • Hendriks et al. (2023) Tom Hendriks, Anna Vilanova, and Maxime Chamberland. Neural spherical harmonics for structurally coherent continuous representation of diffusion mri signal. In Muge Karaman, Remika Mito, Elizabeth Powell, Francois Rheault, and Stefan Winzeck, editors, Computational Diffusion MRI, pages 1–12, Cham, 2023. Springer Nature Switzerland. ISBN 978-3-031-47292-3.
  • Hu et al. (2023) Fengling Hu, Andrew A. Chen, Hannah Horng, Vishnu Bashyam, Christos Davatzikos, Aaron Alexander-Bloch, Mingyao Li, Haochang Shou, Theodore D. Satterthwaite, Meichen Yu, and Russell T. Shinohara. Image harmonization: A review of statistical and deep learning methods for removing batch effects and evaluation metrics for effective harmonization. NeuroImage, 274:120125, 7 2023. ISSN 1053-8119. .
  • Jack et al. (2008) Clifford R. Jack, Matt A. Bernstein, Nick C. Fox, Paul Thompson, Gene Alexander, Danielle Harvey, Bret Borowski, Paula J. Britson, Jennifer L. Whitwell, Chadwick Ward, Anders M. Dale, Joel P. Felmlee, Jeffrey L. Gunter, Derek L.G. Hill, Ron Killiany, Norbert Schuff, Sabrina Fox-Bosetti, Chen Lin, Colin Studholme, Charles S. DeCarli, Gunnar Krueger, Heidi A. Ward, Gregory J. Metzger, Katherine T. Scott, Richard Mallozzi, Daniel Blezek, Joshua Levy, Josef P. Debbins, Adam S. Fleisher, Marilyn Albert, Robert Green, George Bartzokis, Gary Glover, John Mugler, and Michael W. Weiner. The alzheimer’s disease neuroimaging initiative (adni): Mri methods. Journal of magnetic resonance imaging : JMRI, 27:685–691, 4 2008. ISSN 1053-1807. . URL https://pubmed.ncbi.nlm.nih.gov/18302232/.
  • Jenkinson et al. (2012) Mark Jenkinson, Christian F. Beckmann, Timothy E.J. Behrens, Mark W. Woolrich, and Stephen M. Smith. Fsl. NeuroImage, 62:782–790, 8 2012. ISSN 1095-9572. . URL https://pubmed.ncbi.nlm.nih.gov/21979382/.
  • Karayumak et al. (2019) Suheyla Cetin Karayumak, Sylvain Bouix, Lipeng Ning, Anthony James, Tim Crow, Martha Shenton, Marek Kubicki, and Yogesh Rathi. Retrospective harmonization of multi-site diffusion mri data acquired with different acquisition parameters. NeuroImage, 184:180–200, 1 2019. ISSN 10959572. .
  • Karras et al. (2019) Tero Karras, Samuli Laine, and Timo Aila. A style-based generator architecture for generative adversarial networks, 2019. URL https://github.com/NVlabs/stylegan.
  • Klein and Tourville (2012) Arno Klein and Jason Tourville. 101 labeled brain images and a consistent human cortical labeling protocol. Frontiers in Neuroscience, 0:171, 2012. ISSN 16624548. .
  • LaMontagne et al. (2019) Pamela J. LaMontagne, Tammie LS. Benzinger, John C. Morris, Sarah Keefe, Russ Hornbeck, Chengjie Xiong, Elizabeth Grant, Jason Hassenstab, Krista Moulder, Andrei G. Vlassenko, Marcus E. Raichle, Carlos Cruchaga, and Daniel Marcus. Oasis-3: Longitudinal neuroimaging, clinical, and cognitive dataset for normal aging and alzheimer disease. medRxiv, page 2019.12.13.19014902, 12 2019. . URL https://www.medrxiv.org/content/10.1101/2019.12.13.19014902v1https://www.medrxiv.org/content/10.1101/2019.12.13.19014902v1.abstract.
  • Mirzaalian et al. (2018) Hengameh Mirzaalian, Lipeng Ning, Peter Savadjiev, Ofer Pasternak, Sylvain Bouix, Oleg Michailovich, Sarina Karmacharya, Gerald Grant, Christine E. Marx, Rajendra A. Morey, Laura A. Flashman, Mark S. George, Thomas W. McAllister, Norberto Andaluz, Lori Shutter, Raul Coimbra, Ross D. Zafonte, Mike J. Coleman, Marek Kubicki, Carl Fredrik Westin, Murray B. Stein, Martha E. Shenton, and Yogesh Rathi. Multi-site harmonization of diffusion mri data in a registration framework. Brain Imaging and Behavior, 12:284–295, 2 2018. ISSN 19317565. . URL https://link.springer.com/article/10.1007/s11682-016-9670-y.
  • Nencka et al. (2018) Andrew S. Nencka, Timothy B. Meier, Yang Wang, L. Tugan Muftuler, Yu Chien Wu, Andrew J. Saykin, Jaroslaw Harezlak, M. Alison Brooks, Christopher C. Giza, John Difiori, Kevin M. Guskiewicz, Jason P. Mihalik, Stephen M. LaConte, Stefan M. Duma, Steven Broglio, Thomas McAllister, Michael A. McCrea, and Kevin M. Koch. Stability of mri metrics in the advanced research core of the ncaa-dod concussion assessment, research and education (care) consortium. Brain imaging and behavior, 12:1121–1140, 8 2018. ISSN 1931-7565. . URL https://pubmed.ncbi.nlm.nih.gov/29064019/.
  • Newlin et al. (2023) Nancy R. Newlin, François Rheault, Kurt G. Schilling, and Bennett A. Landman. Characterizing streamline count invariant graph measures of structural connectomes. Journal of Magnetic Resonance Imaging, 2023. ISSN 1522-2586. . URL https://onlinelibrary.wiley.com/doi/full/10.1002/jmri.28631https://onlinelibrary.wiley.com/doi/abs/10.1002/jmri.28631https://onlinelibrary.wiley.com/doi/10.1002/jmri.28631.
  • Ni et al. (2006) H. Ni, V. Kavcic, T. Zhu, S. Ekholm, and Jianhui Zhong. Effects of number of diffusion gradient directions on derived diffusion tensor imaging indices in human brain. AJNR: American Journal of Neuroradiology, 27:1776, 9 2006. ISSN 01956108. URL /pmc/articles/PMC8139764//pmc/articles/PMC8139764/?report=abstracthttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8139764/.
  • Ning et al. (2019) Lipeng Ning, Elisenda Bonet-Carne, Francesco Grussu, Farshid Sepehrband, Enrico Kaden, Jelle Veraart, Stefano B. Blumberg, Can Son Khoo, Marco Palombo, Jaume Coll-Font, Benoit Scherrer, Simon K. Warfield, Suheyla Cetin Karayumak, Yogesh Rathi, Simon Koppers, Leon Weninger, Julia Ebert, Dorit Merhof, Daniel Moyer, Maximilian Pietsch, Daan Christiaens, Rui Teixeira, Jacques Donald Tournier, Andrey Zhylka, Josien Pluim, Greg Parker, Umesh Rudrapatna, John Evans, Cyril Charron, Derek K. Jones, and Chantal W.M. Tax. Muti-shell diffusion mri harmonisation and enhancement challenge (mushac): Progress and results. Mathematics and Visualization, pages 217–224, 2019. ISSN 2197666X. . URL https://link.springer.com/chapter/10.1007/978-3-030-05831-9_18.
  • Ning et al. (2020) Lipeng Ning, Elisenda Bonet-Carne, Francesco Grussu, Farshid Sepehrband, Enrico Kaden, Jelle Veraart, Stefano B. Blumberg, Can Son Khoo, Marco Palombo, Iasonas Kokkinos, Daniel C. Alexander, Jaume Coll-Font, Benoit Scherrer, Simon K. Warfield, Suheyla Cetin Karayumak, Yogesh Rathi, Simon Koppers, Leon Weninger, Julia Ebert, Dorit Merhof, Daniel Moyer, Maximilian Pietsch, Daan Christiaens, Rui Azeredo Gomes Teixeira, Jacques Donald Tournier, Kurt G. Schilling, Yuankai Huo, Vishwesh Nath, Colin Hansen, Justin Blaber, Bennett A. Landman, Andrey Zhylka, Josien P.W. Pluim, Greg Parker, Umesh Rudrapatna, John Evans, Cyril Charron, Derek K. Jones, and Chantal M.W. Tax. Cross-scanner and cross-protocol multi-shell diffusion mri data harmonization: Algorithms and results. NeuroImage, 221:117128, 11 2020. ISSN 1053-8119. .
  • Onicas et al. (2022) Adrian I. Onicas, Ashley L. Ware, Ashley D. Harris, Miriam H. Beauchamp, Christian Beaulieu, William Craig, Quynh Doan, Stephen B. Freedman, Bradley G. Goodyear, Roger Zemek, Keith Owen Yeates, and Catherine Lebel. Multisite harmonization of structural dti networks in children: An a-cap study. Frontiers in Neurology, 13, 6 2022. .
  • Pierpaoli et al. (1996) Carlo Pierpaoli, Peter Jezzard, Peter J. Basser, Alan Barnett, and Giovanni Di Chiro. Diffusion tensor mr imaging of the human brain. https://doi.org/10.1148/radiology.201.3.8939209, 201:637–648, 12 1996. ISSN 00338419. . URL https://pubs.rsna.org/doi/10.1148/radiology.201.3.8939209.
  • Pinto et al. (2020) Maíra Siqueira Pinto, Roberto Paolella, Thibo Billiet, Pieter Van Dyck, Pieter Jan Guns, Ben Jeurissen, Annemie Ribbens, Arnold J. den Dekker, and Jan Sijbers. Harmonization of brain diffusion mri: Concepts and methods. Frontiers in neuroscience, 14, 5 2020. ISSN 1662-4548. . URL https://pubmed.ncbi.nlm.nih.gov/32435181/.
  • Pizzolato et al. (2020) Marco Pizzolato, Marco Palombo, Jana Hutter, Vishwesh Nash, Fan Zhang, and Noemi Gyori. Super-resolution of multi dimensional diffusion mri data. 3 2020. . URL https://zenodo.org/record/3718990.
  • Rubinov and Sporns (2010) Mikail Rubinov and Olaf Sporns. Complex network measures of brain connectivity: uses and interpretations. NeuroImage, 52:1059–1069, 9 2010. ISSN 1095-9572. . URL https://pubmed.ncbi.nlm.nih.gov/19819337/.
  • Schilling et al. (2021) Kurt G. Schilling, Chantal M.W. Tax, Francois Rheault, Colin Hansen, Qi Yang, Fang Cheng Yeh, Leon Cai, Adam W. Anderson, and Bennett A. Landman. Fiber tractography bundle segmentation depends on scanner effects, vendor effects, acquisition resolution, diffusion sampling scheme, diffusion sensitization, and bundle segmentation workflow. NeuroImage, 242:118451, 11 2021. ISSN 1053-8119. .
  • Stejskal and Tanner (1965) E O Stejskal and ; J E Tanner. Spin diffusion measurements: Spin echoes in the presence of a time-dependent field gradient. J. Chem. Phys, 42:288–292, 1965. . URL https://doi.org/10.1063/1.1695690.
  • Strike et al. (2023) Lachlan T. Strike, Gabriella A.M. Blokland, Narelle K. Hansell, Nicholas G. Martin, Arthur W. Toga, Paul M. Thompson, Greig I. de Zubicaray, Katie L. McMahon, and Margaret J. Wright, 2023.
  • Tournier et al. (2007) J. Donald Tournier, Fernando Calamante, and Alan Connelly. Robust determination of the fibre orientation distribution in diffusion mri: Non-negativity constrained super-resolved spherical deconvolution. NeuroImage, 35:1459–1472, 5 2007. ISSN 1053-8119. .
  • Tournier et al. (2013) J. Donald Tournier, Fernando Calamante, and Alan Connelly. Determination of the appropriate b value and number of gradient directions for high-angular-resolution diffusion-weighted imaging. NMR in biomedicine, 26:1775–1786, 12 2013. ISSN 1099-1492. . URL https://pubmed.ncbi.nlm.nih.gov/24038308/.
  • Tournier et al. (2019) J. Donald Tournier, Robert Smith, David Raffelt, Rami Tabbara, Thijs Dhollander, Maximilian Pietsch, Daan Christiaens, Ben Jeurissen, Chun Hung Yeh, and Alan Connelly. Mrtrix3: A fast, flexible and open software framework for medical image processing and visualisation. NeuroImage, 202:116137, 11 2019. ISSN 1053-8119. .
  • Vollmar et al. (2010) Christian Vollmar, Jonathan O’Muircheartaigh, Gareth J. Barker, Mark R. Symms, Pamela Thompson, Veena Kumari, John S. Duncan, Mark P. Richardson, and Matthias J. Koepp. Identical, but not the same: intra-site and inter-site reproducibility of fractional anisotropy measures on two 3.0t scanners. NeuroImage, 51:1384–1394, 7 2010. ISSN 1095-9572. . URL https://pubmed.ncbi.nlm.nih.gov/20338248/.
  • Westin et al. (2002) C. F. Westin, S. E. Maier, H. Mamata, A. Nabavi, F. A. Jolesz, and R. Kikinis. Processing and visualization for diffusion tensor mri. Medical Image Analysis, 6:93–108, 6 2002. ISSN 1361-8415. .
  • Wiesenfarth et al. (2021) Manuel Wiesenfarth, Annika Reinke, Bennett A. Landman, Matthias Eisenmann, Laura Aguilera Saiz, M. Jorge Cardoso, Lena Maier-Hein, and Annette Kopp-Schneider. Methods and open-source toolkit for analyzing and visualizing challenge results. Scientific Reports 2021 11:1, 11:1–15, 1 2021. ISSN 2045-2322. . URL https://www.nature.com/articles/s41598-021-82017-6.
  • Zhu et al. (2017) Jun Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. Proceedings of the IEEE International Conference on Computer Vision, 2017-October:2242–2251, 12 2017. ISSN 15505499. .
  • Zuo et al. (2021) Lianrui Zuo, Blake E. Dewey, Yihao Liu, Yufan He, Scott D. Newsome, Ellen M. Mowry, Susan M. Resnick, Jerry L. Prince, and Aaron Carass. Unsupervised mr harmonization by learning disentangled representations using information bottleneck theory. NeuroImage, 243:118569, 11 2021. ISSN 1053-8119. .