INTRODUCTION
The prevalence of Alzheimer’s disease (AD) continues to rise as the global population ages. In recent years, researchers have come to view AD as a continuum, rather than a sequence of distinct phases of cognitive and neuropathological changes (Aisen et al., Reference Aisen, Cummings, Jack, Morris, Sperling, Frölich, Jones, Dowsett, Matthews, Raskin and Scheltens2017). During the “preclinical” aspects of the continuum, individuals are generally cognitively asymptomatic. However, many “preclinical” individuals go on to develop symptoms, such as episodic memory loss, and this is referred to as mild cognitive impairment (MCI). In recent years, MCI has been clinically characterized by criteria such as self- or informant-reported cognitive complaints, objective cognitive impairment, preserved independence in functional abilities, and the absence of dementia (Petersen et al., Reference Petersen, Caracciolo, Brayne, Gauthier, Jelic and Fratiglioni2014). As the disease progresses, cognitive impairment worsens and functional impairment becomes increasingly apparent in everyday life. At this point, a person is considered to have AD dementia (Aisen et al., Reference Aisen, Cummings, Jack, Morris, Sperling, Frölich, Jones, Dowsett, Matthews, Raskin and Scheltens2017). With the growing understanding of AD as a continuum, identifying biomarkers of pathophysiological changes has become paramount.
Morphometric magnetic resonance imaging (MRI) studies have established that the areas of the brain often first damaged in MCI and AD are the hippocampus and the entorhinal cortex (Du et al., Reference Du, Schuff, Amend, Laakso, Hsu, Jagust, Yaffe, Kramer, Reed, Norman and Chui2001; Killiany et al., Reference Killiany, Gomez-Isla, Moss, Kikinis, Sandor, Jolesz, Tanzi, Jones, Hyman and Albert2000, see reviews Pini et al., Reference Pini, Pievani, Bocchetta, Altomare, Bosco, Cavedo, Galluzzi, Marizzoni and Frisoni2016; Zhou, Zhang, Zhao, Qian, & Dong, Reference Zhou, Zhang, Zhao, Qian and Dong2016). The pathophysiological changes present in AD include the accumulation of amyloid-beta that form plaques, as well as the aggregation of tau-proteins that form neurofibrillary tangles. Such tangles induce neuronal death, resulting in morphometric changes that start in the medial temporal lobe (Gómez-Isla et al., Reference Gómez-Isla, Price, McKeel, Morris, Growdon and Hyman1996, see review Spires-Jones & Hyman, Reference Spires-Jones and Hyman2014). In order to obtain sensitive and specific measures of these structures and the various changes that occur, researchers have begun to segment the hippocampus into subfields (de Flores, La Joie, & Chételat, Reference de Flores, La Joie and Chételat2015; Pini et al., Reference Pini, Pievani, Bocchetta, Altomare, Bosco, Cavedo, Galluzzi, Marizzoni and Frisoni2016).
Previous studies reported reductions in the volumes of the whole hippocampus, hippocampal subfields, and entorhinal cortex in the brains of MCI and AD subjects compared with controls (Mueller et al., Reference Mueller, Schuff, Yaffe, Madison, Miller and Weiner2010; Pennanen et al., Reference Pennanen, Kivipelto, Tuomainen, Hartikainen, Hänninen, Laakso, Hallikainen, Vanhanen, Nissinen, Helkala and Vainio2004, see review de Flores et al., Reference de Flores, La Joie and Chételat2015). Such studies utilized a cross-sectional approach to affirm subjects as controls, MCI, or AD based upon the characteristics of various regions of interest (ROIs) (Colliot et al., Reference Colliot, Chételat, Chupin, Desgranges, Magnin, Benali, Dubois, Garnero, Eustache and Lehéricy2008; Du et al., Reference Du, Schuff, Amend, Laakso, Hsu, Jagust, Yaffe, Kramer, Reed, Norman and Chui2001; Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011; Khan et al., Reference Khan, Westman, Jones, Wahlund, Mecocci, Vellas, Tsolaki, Kłoszewska, Soininen, Spenger and Lovestone2015; Mueller et al., Reference Mueller, Schuff, Yaffe, Madison, Miller and Weiner2010; Xu et al., Reference Xu, Jack, O’brien, Kokmen, Smith, Ivnik, Boeve, Tangalos and Petersen2000). Many of these studies focused on subjects who already had MCI in order to best predict who with MCI will convert to AD (Khan et al., Reference Khan, Westman, Jones, Wahlund, Mecocci, Vellas, Tsolaki, Kłoszewska, Soininen, Spenger and Lovestone2015; Killiany et al., Reference Killiany, Gomez-Isla, Moss, Kikinis, Sandor, Jolesz, Tanzi, Jones, Hyman and Albert2000; Plant et al., Reference Plant, Teipel, Oswald, Böhm, Meindl, Mourao-Miranda, Bokde, Hampel and Ewers2010; Westman et al., Reference Westman, Simmons, Zhang, Muehlboeck, Tunnard, Liu, Collins, Evans, Mecocci, Vellas and Tsolaki2011) with less emphasis on creating models that accurately classify subjects as control or MCI. These classification studies often use the characteristics of only one ROI as a classifier variable, such as the hippocampus, entorhinal cortex, or a specified hippocampal subfield, and have failed to achieve classification accuracies which exceed 80% (Colliot et al., Reference Colliot, Chételat, Chupin, Desgranges, Magnin, Benali, Dubois, Garnero, Eustache and Lehéricy2008; Du et al., Reference Du, Schuff, Amend, Laakso, Hsu, Jagust, Yaffe, Kramer, Reed, Norman and Chui2001; Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011; Mueller et al., Reference Mueller, Schuff, Yaffe, Madison, Miller and Weiner2010; Pennanen et al., Reference Pennanen, Kivipelto, Tuomainen, Hartikainen, Hänninen, Laakso, Hallikainen, Vanhanen, Nissinen, Helkala and Vainio2004; Westman et al., Reference Westman, Simmons, Zhang, Muehlboeck, Tunnard, Liu, Collins, Evans, Mecocci, Vellas and Tsolaki2011, see review Weiner et al., Reference Weiner, Veitch, Aisen, Beckett, Cairns, Cedarbaum, Green, Harvey, Jack, Jagust and Luthman2015). Thus, there remains a significant need for models that can correctly classify subjects in the earlier stages of the AD continuum.
The goal of the present study was to utilize morphometric MRI measures to identify a broader set of variables that can classify controls and MCI subjects using logistic regression. The first step was to determine the utility of the whole hippocampus, hippocampal subfields, and entorhinal cortex as classifiers. Next, less studied cortical regions outside the medial temporal lobe were added to the model to determine whether any of these regions could improve the model fit and classification accuracy. While these regions are not as commonly used in classification models, researchers have begun to find consistent patterns of atrophy in MCI and AD in regions beyond the medial temporal lobe, including gyri within the frontal, parietal, and temporal lobes (Hänggi, Streffer, Jäncke, & Hock, Reference Hängii, Streffer, Jäncke and Hock2011; Karas et al., Reference Karas, Scheltens, Rombouts, Visser, van Schijndel, Fox and Barkhof2004). Knowledge regarding the classification value of these regions in cognitively normal subjects and those with MCI could enhance our understanding of the disorder.
MATERIALS AND METHODS
Subjects
This study utilized MRI scans from 96 subjects selected from two sources. Forty-two scans were obtained from the Boston University Alzheimer’s Disease Center (BU-ADC) Clinical Core Registry: Health Outreach Program for the Elderly (HOPE). The BU-ADC is a center (of 30) funded by the National Institute on Aging that contributes data to the National Alzheimer’s Coordinating Center. The BU-ADC registry, including subject recruitment and inclusion/exclusion criteria, has previously been described (Ashendorf et al., Reference Ashendorf, Alosco, Bing-Canar, Chapman, Martin, Chaisson, Dixon, Steinberg, Tripodis, Kowall and Stern2017; Galetta et al., Reference Galetta, Chapman, Essis, Alosco, Gillard, Steinberg, Dixon, Martin, Chaisson, Kowall and Tripodis2017). All HOPE subjects’ diagnoses were made at consensus conferences, following presentation and discussion of all medical history and evaluation results, such as clinical interview, informant input, neuropsychological test scores, and MRI scans viewed for clinical criteria (hippocampal atrophy, white matter signal abnormalities, and evidence of microbleeds). No formal biomarker (amyloid, tau, or genetic) data is presented at such consensus conferences. Subjects were determined to be cognitively normal (n = 19) if their objective neuropsychological test scores were within the normal range and they had a Clinical Dementia Rating (Morris, Reference Morris1993) global score of 0.0. All MCI diagnoses (n = 23) were amnestic and followed published criteria (Petersen et al., Reference Petersen, Caracciolo, Brayne, Gauthier, Jelic and Fratiglioni2014).
Fifty-four scans were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (for more information, refer to adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by Principle Investigator, Michael W. Weiner, MD. The primary goal of the ADNI has been to elucidate clinical, genetic, imaging, and biochemical biomarkers of AD, and to better understand the progression from normal cognition to MCI to AD (Weiner et al., Reference Weiner, Veitch, Aisen, Beckett, Cairns, Cedarbaum, Green, Harvey, Jack, Jagust and Luthman2015). Twenty-seven of these subjects were cognitively normal controls and the other 27 subjects were amnestic MCI single domain. All ADNI scans were selected to have come from a Philips 3T scanner to ensure comparable imaging parameters as HOPE scans, and to balance the demographic data of HOPE subjects. All procedures were approved by local IRBs, and subjects gave informed consent at the time of their enrollment in both studies in accordance with the Declaration of Helsinki. In total, there were 96 scans analyzed (46 control and 50 MCI; 54 females and 42 males) in this study. All MCI scans came from subjects that had the amnestic form of MCI. Eight also showed impairment in another domain such as language, executive function, or visuospatial functioning in addition to the impairment of memory.
Both HOPE and ADNI collect demographic data, including age, education, and APOE ε4 status, as well as neuropsychological test scores from the Mini-Mental State Exam (MMSE), Geriatric Depression Scale (GDS), Logical Memory Recall (modified from the Wechsler D. Wechsler Memory Scale-Revised, San Antonio, Texas: Psychological Corporation; 1987), and Part B of the Trailmaking Test (Table 1). The neuropsychological tests and diagnostic procedures utilized in both studies were based on comparable protocols. Given that these neuropsychological measures were utilized to differentiate and define groups, none of these measures could again be used as outcomes.
Note: There were 96 subjects total. Smaller sample sizes reflect missing data. All tests were one-tailed t-tests unless denoted otherwise. p-Value < .01.
MCI, mild cognitive impairment; APOE; apolipoprotein E; ε4, allele 4; SD, standard deviation.
* Denotes significance.
Imaging Assessments
The scans we used were 3D magnetization-prepared rapid acquisition of gradient echo sequences. These were acquired on 3T Philips scanners. For the BU-ADC scans, a 32-channel headcoil and sense factor of 2 was used with the following imaging parameters: TR = 6.7 ms, TE = 3.1 ms, flip angle = 9°, reconstructed and acquisition voxel size = .98 × .98 × 1.2 mm, FOV = 250 × 250 × 180 mm, 150 sagittal slices. For the ADNI scans (Jack et al., Reference Jack, Bernstein, Fox, Thompson, Alexander, Harvey, Borowski, Britson, Whitwell, Ward and Dale2008), an 8-channel headcoil and a sense factor of 1.8 was used with the following imaging parameters: TR = 6.7 ms, TE = 3.1 ms, flip angle = 9°, reconstructed voxel size = 1.05 × 1.05 × 1.20 mm, acquisition voxel size = 1.11 × 1.11 × 1.20 mm, FOV = 270 × 252 × 204 mm, 170 sagittal slices. DICOM scans were downloaded from the ADNI database.
The MRI scans were segmented with FreeSurfer version 6.0 (http://surfer.nmr.mgh.harvard.edu; for additional details, see Desikan et al., Reference Desikan, Ségonne, Fischl, Quinn, Dickerson, Blacker, Buckner, Dale, Maguire, Hyman and Albert2006; Iglesias et al., Reference Iglesias, Augustinack, Nguyen, Player, Player, Wright, Roy, Frosch, McKee, Wald and Fischl2015). FreeSurfer v6.0 utilizes an improved atlas that can automatically segment hippocampal regions into a greater number of subfields than previous versions have allowed (Iglesias et al., Reference Iglesias, Augustinack, Nguyen, Player, Player, Wright, Roy, Frosch, McKee, Wald and Fischl2015).
Statistical Analysis
Independent samples one-tailed t-tests were used to assess differences between the control and MCI groups in terms of neuropsychological outcome and MRI measures. Independent samples two-tailed t-tests were used to assess demographic factors, such as age and education, as there was no expected direction to any group differences. A chi-square test was performed to determine if a difference existed in APOE ε4 status between the groups. In terms of these analyses, we had two goals in mind. The first was to establish whether the groups were matched in terms of demographics and APOE ε4 status. To assess the significance of these tests, we used a p-value of .01 without correction to increase the stringency of not finding a difference. Our second goal was purely exploratory and involved looking for differences between the groups with the expectation that neuropsychological performance and size of brain regions would be less in the MCI group. The analyses conducted towards this goal were meant to help us know the data – selection into regression models was made independent of these exploratory analyses. These tests were interpreted using an arbitrary p-value of .01. Measures generated from FreeSurfer v6.0 included estimated total intracranial volume (eTIV), 68 cortical volumes, 12 subcortical volumes, right and left hippocampal volumes, and 24 hippocampal subfield volumes (for additional segmentation details, see Iglesias et al., Reference Iglesias, Augustinack, Nguyen, Player, Player, Wright, Roy, Frosch, McKee, Wald and Fischl2015). All volumes generated by FreeSurfer v6.0 were examined and then used in subsequent models. For a complete list of the variables collected from FreeSurfer v6.0, see http://www.bumc.bu.edu/mri/personnel-and-students/renee-devivo/. Data were visually inspected for errors and edited as needed. ANCOVA was performed to determine whether eTIV, age, gender, education, APOE ε4 status, or study (i.e., HOPE or ADNI) had an effect on any morphometric MRI variables. Age and eTIV had an effect on the majority of ROI volumes, while gender, education, APOE ε4, and study had no impact. In order to correct for age and eTIV, residuals were computed based upon the values of all subjects included in this study. Additionally, an overall MANOVA was run using study (HOPE or ADNI) as a between-subjects variable and brain ROIs as a repeated measure to ensure that the use of different headcoils in the two studies did not have any overall effect on the outcome. This analysis was run because different headcoils were used in the ADNI and HOPE (8 vs. 32 channels, respectively) studies.
Using these residuals, targeted nominal logistic regression models were created to determine how well the volumes of individual regions, such as entorhinal cortex, whole hippocampus, and hippocampal subfields, could classify subjects. These regions were utilized first due to their importance in cognitive impairment. To follow up, additional stepwise variable logistical models (mixed, probability to enter/leave p < .25) were run using subgroups of ROIs (subcortical, cortical, and hippocampal subfields) to determine which ROIs classified subjects best. To control for multicollinearity, correlations were calculated between the subcortical and cortical regions. Regions that had intercorrelated values >.7 with one or more other regions were excluded from the stepwise models. A series of stepwise logistic models were used as steps for data reduction. The preliminary stepwise logistic models were computed using separate classes of variables as follows: (1) left hippocampal subfield variables, (2) right hippocampal subfield variables, (3) left subcortical variables, (4) right subcortical variables, (5) left cortical variables, and (6) right cortical variables. From the results of these preliminary analyses, mid-level analyses were conducted using (1) hippocampal subfield variables and (2) combined cortical and subcortical group variables. No bilateral regions were included in mid-level stepwise analyses, rather the unilateral region with the most predictive value, or the lowest p value, was used. Following the mid-level stepwise models, an upper mid-level stepwise logistic model was created using the selected cortical and subcortical variables and selected hippocampal subfield variables. Finally, a nominal logistic model was generated to create an optimal classification model of group membership. For a schematic of this data reduction process, see Figure 1. Within each of the logistic models, false discovery rate (FDR) correction was used to establish the significant contributors to each model. Across models, Bonferroni correction (p-value divided by the number of models) was used to determine the appropriate p-value for significance. As a final check, the nominal logistic model was regenerated using a “backward” approach to ensure the order of variable selection was not driving the model.
A leave-one-out prediction of the one-out validation technique was conducted to better estimate how the model would perform on a different dataset (Fan, Batmanghelich, Clark, & Davatzikos, Reference Fan, Batmanghelich, Clark and Davatzikos2008; Misra, Fan, & Davatzikos, Reference Misra, Fan and Davatzikos2009). Essentially, we re-ran the upper mid-level stepwise and final nominal logistic regression models leaving one subject out each time until all subjects had been left out once. All statistical analyses were conducted with JMP Pro 13.0 on a Mac Pro running macOS Sierra operating system.
RESULTS
Demographic Data
There were no differences between the groups in terms of age, years of education, or APOE ε4 status (p’s > .01) (Table 1). As expected, differences were found between the groups in MMSE score, Logical Memory Immediate Raw score, Logical Memory Delayed Raw score, and Part B of the Trailmaking Test (p’s < .01) (Table 1). There was a trend for MCI subjects to have higher GDS scores (p = .025) though neither group expressed clinically relevant scores on the GDS.
MRI Data
When comparing uncorrected volumes, the MCI group had smaller volumes in 15 of the 24 hippocampal subfields, bilateral hippocampal formations, and the right entorhinal cortex, compared with the control group (p’s < .01). When comparing the residual data (adjusted for age and eTIV) between the control and MCI groups, the same 15 hippocampal subfields, bilateral hippocampal formations, and the right entorhinal cortex remained significantly smaller. In addition, three hippocampal subfields (left fimbria, right hippocampal tail, and right fimbria) and the left entorhinal cortex were significantly smaller in MCI subjects when residual volumes were compared (Figures 2–5, effect sizes reported in Table 2). The results of an overall MANOVA showed that the effect of study (i.e., headcoil) was not significant (F(1,94) = 2.894, p = .09, Cohen’s d = .36).
Note: CA, cornu ammonis; eTIV, estimated intracranial volume; HATA, hippocampal amygdala transition area.
To determine how well the volumes of the entorhinal cortex, hippocampus, and hippocampal subfields identified group membership, these variables were entered into three targeted nominal logistic regression models consisting of both the right and left volumes of each region. Though some were significant, none of these models provided a good model according to the R 2 values obtained: hippocampal subfields (R 2 = .3629, whole model test chi-square = 48.2361, p = .0024), whole hippocampus (R 2 = .1817, whole model test chi-square = 24.1557, p < .0001), and entorhinal cortex (R 2 = .0688, whole model test chi-square = 9.1421, p = .0103). Stepwise analyses using a wider subset of variables were conducted to see if better classification could be obtained. Tests of multicollinearity showed that both the left cuneus and left pericalcarine were correlated with more than one other region with a R value >.7, and these regions were excluded from the following analyses. The six preliminary stepwise models selected five hippocampal subfield variables, four subcortical variables, and twenty cortical variables. Mid-level stepwise analyses conducted on the hippocampal subfield variables and combined subcortical and cortical variables further refined the data selected to three hippocampal subfield variables and fifteen subcortical and cortical variables. All 18 of the selected regions (p’s < .01, FDR corrected) were entered into an upper mid-level stepwise and subsequent nominal logistic regression model, which was significant (R 2 = .7120, whole model test chi-square = 94.63, p < .0001), and had a classification rate of .9063 (misclassification rate = .0937) as 40 out of 46 control subjects and 47 out of 50 MCI subjects were classified correctly. Since the two regions (left cuneus and left pericalcarine) that were originally excluded after the tests of multicollinearity were independent of the other regions included in the final models, these regions were forced into the model to see if their addition improved the model. The final model with the left cuneus and left pericalcarine revealed that 13 variables were significant contributors (R 2 = .7310, whole model test chi-square = 97.16, p < .0001) and had a classification rate of .9375 (misclassification rate = .0625) as 43 out of 46 control subjects and 47 out of 50 MCI subjects were classified correctly. Table 3 outlines the 13 variables that were significant contributors to the final model, and all model fit statistics for the described stepwise and nominal logistic analyses are reported in Table 4. The average classification rate from each leave-out-one analysis was .7708 (misclassification rate = .2292) as 37 out of 46 control subjects and 37 out of 50 MCI subjects were classified correctly.
Note: p-Value < .01.
eTIV, estimated intracranial volume; FDR, false discovery rate.
* Denotes significance.
Note: p-Value < .01.
AIC, Akaike Information Criterion; BIC, Bayesian Information Criterion.
* Denotes significance.
As an additional exploratory analysis looking at the value of whole hippocampus as a classifier versus the value of hippocampal subfields, the final nominal logistic model was re-conducted excluding the hippocampal subfield variables that were chosen, and substituting a unilateral whole hippocampal volume variable instead. When using the whole right hippocampal volume, the model was weakened (R 2 = .6313, whole model test chi-square = 83.91, p < .0001) and the whole right hippocampus variable was not a significant contributor to this model, which had a classification rate of .875 (misclassification rate = .125). When using the whole left hippocampal volume, the model was essentially similar to when the whole right hippocampal volume was used and also weaker than the original model (R 2 = .6413, whole model test chi-square = 85.24, p < .0001). Similarly, the whole left hippocampus variable was not a significant contributor in this model. This model had a classification rate of .875 (misclassification rate = .125). As a final check of our analysis using subfields, we re-ran the final nominal logistic model in a “backward” fashion to assure that the order of selection did not bias the analysis. The results of this were the same (R 2 = .7310, whole model test chi-square = 97.16, p < .0001) with 43 out of 46 control subjects classified correctly and 47 out of 50 MCI subjects classified correctly.
DISCUSSION
Nominal Logistic Model of Group Membership
The present study sought to expand current knowledge regarding what regions of the brain are most influential in classifying subjects who are cognitively normal versus those with MCI. Some previous studies have found the entorhinal cortex to be the most effective at discriminating between controls and subjects with cognitive impairment (Killiany et al., Reference Killiany, Gomez-Isla, Moss, Kikinis, Sandor, Jolesz, Tanzi, Jones, Hyman and Albert2000; Pennanen et al., Reference Pennanen, Kivipelto, Tuomainen, Hartikainen, Hänninen, Laakso, Hallikainen, Vanhanen, Nissinen, Helkala and Vainio2004), while others have found the entire hippocampal formation or various hippocampal subfield volumes to perform best (Du et al., Reference Du, Schuff, Amend, Laakso, Hsu, Jagust, Yaffe, Kramer, Reed, Norman and Chui2001; Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011; Mueller et al., Reference Mueller, Schuff, Yaffe, Madison, Miller and Weiner2010; Xu et al., Reference Xu, Jack, O’brien, Kokmen, Smith, Ivnik, Boeve, Tangalos and Petersen2000). Regardless, a majority of such cross-sectional studies built classification models utilizing only one region and reported 60–81% accuracy (Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011; Mueller et al., Reference Mueller, Schuff, Yaffe, Madison, Miller and Weiner2010; Pennanen et al., Reference Pennanen, Kivipelto, Tuomainen, Hartikainen, Hänninen, Laakso, Hallikainen, Vanhanen, Nissinen, Helkala and Vainio2004). Studies including multiple ROIs in their models often achieve greater accuracy as was the goal of the present study (Convit et al., Reference Convit, de Asis, de Leon, Tarshish, de Santi and Rusinek2000; Convit et al., Reference Convit, de Leon, Tarshish, de Santi, Tsui, Rusinek and George1997; Hänggi et al., Reference Hängii, Streffer, Jäncke and Hock2011; Killiany et al., Reference Killiany, Gomez-Isla, Moss, Kikinis, Sandor, Jolesz, Tanzi, Jones, Hyman and Albert2000).
Initially, we utilized the volumes of the whole hippocampal formations, hippocampal subfields, and entorhinal cortices to build three targeted classification models, and individually, these subgroups performed poorly at classification. However, when ROIs outside the medial temporal regions were added to our classification model, we were able to correctly identify 43 out of 46 control subjects and 47 out of 50 MCI subjects for an overall classification accuracy of 93.75% with 93.48% sensitivity and 94% specificity. Table 3 shows the 13 volumes that contributed most to this model. Notably, although subcortical, cortical, and hippocampal subfield variables were all taken into account in the data reduction stepwise models, only cortical regions and hippocampal subfields were chosen by the data reduction models to go into the final logistic model.
Cross-sectional studies exploring the classification value of hippocampal subfield volumes versus whole hippocampal volumes have found disparate results (Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011; Pluta et al., Reference Pluta, Yushkevich, Das and Wolk2012). Some have found the CA1 volume to be a stronger classifier than whole hippocampal volume (La Joie et al., Reference La Joie, Perrotin, De La Sayette, Egret, Doeuvre, Belliard, Eustache, Desgranges and Chételat2013; Pluta et al., Reference Pluta, Yushkevich, Das and Wolk2012), while other studies have found the subiculum volume, but not the CA1 volume, to be a more effective classifier than whole hippocampal volume (Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011). Our findings are in line with those promoting the classification value of the subiculum volume. When we created a model including either the right or the left whole hippocampus volume, it did not perform as well as our model that included the parasubiculum and subiculum (misclassification rates were worse and R2 values were weaker). Such findings have been highlighted previously (Mueller et al., Reference Mueller, Schuff, Yaffe, Madison, Miller and Weiner2010, see review Pini et al., Reference Pini, Pievani, Bocchetta, Altomare, Bosco, Cavedo, Galluzzi, Marizzoni and Frisoni2016).
Thus far, relatively less research has been done analyzing the influence of cortical structures in classification models for control and MCI populations. Killiany et al. found the volume of the caudal portion of the anterior cingulate and the banks of the superior temporal sulcus to be important for discriminating controls, stable MCI, and MCI who decline (Killiany et al., Reference Killiany, Gomez-Isla, Moss, Kikinis, Sandor, Jolesz, Tanzi, Jones, Hyman and Albert2000). We also found these two regions to be important classifiers in this study.
One of the first cross-sectional studies that used classification models to identify controls and MCI subjects based on hippocampal volume obtained a classification accuracy of 73.4% (Convit et al., Reference Convit, de Leon, Tarshish, de Santi, Tsui, Rusinek and George1997). Three years later, this group found that adding the fusiform gyrus and combined middle and inferior temporal gyri helped to identify who with MCI would further decline to AD (Convit et al., Reference Convit, de Asis, de Leon, Tarshish, de Santi and Rusinek2000). We also found the middle temporal gyrus to be important in our classification model, while the fusiform gyrus was less meaningful. Though these studies had subjects of different cognitive capacities, the overlapping ROIs in both studies suggest that volume loss in the middle temporal gyrus and the fusiform gyrus may contribute to cognitive impairment.
Instead of creating classification models, other studies identify regions that show the greatest amount of volume loss in MCI subjects compared with controls. Such studies examining volumetric differences often report regions within the parietal and lateral temporal lobes to be implicated in the early stages of the AD continuum following the medial temporal lobe (Desikan et al., Reference Desikan, Cabral, Fischl, Guttmann, Blacker, Hyman, Albert and Killiany2009; Fan et al., Reference Fan, Batmanghelich, Clark and Davatzikos2008; Hängii et al., Reference Hängii, Streffer, Jäncke and Hock2011). These studies report volume reductions in many of the regions that were important in our classification model, such as the insula (Fan et al., Reference Fan, Batmanghelich, Clark and Davatzikos2008; Karas et al., Reference Karas, Scheltens, Rombouts, Visser, van Schijndel, Fox and Barkhof2004), the superior temporal cortex (Fan et al., Reference Fan, Batmanghelich, Clark and Davatzikos2008; Karas et al., Reference Karas, Scheltens, Rombouts, Visser, van Schijndel, Fox and Barkhof2004), and the middle temporal cortex (Fan et al., Reference Fan, Batmanghelich, Clark and Davatzikos2008). Therefore, while these studies did not make classification models like the present study, the re-occurring significance of regions such as the insula and middle temporal cortex suggests that such regions have a role in cognitive impairment and disease.
Demographics
In this study, the groups were well matched in terms of their demographics (i.e., age, education, APOE ε4 status, and gender) (Table 1). It is noteworthy that there was a slightly greater percentage of individuals in the control group who had APOE ε4 than in the MCI group. Since APOE ε4 status was not used as a selection criterion, this merely reflects the sample to which we had access. Neuropsychological measures revealed expected differences between the control and MCI groups. The control group performed better than the MCI group on four of the five neuropsychological tasks (MMSE, Logical Memory Immediate Recall, Logical Memory Delayed Recall, and Part B of the Trailmaking Test) (Table 1). A difference in GDS score approached significance (p = .025) with the MCI group endorsing, on average, .78 points higher on this scale. It is feasible to assume that this difference was a result of the GDS specifically asking about a decrease in memory, rather than a reflection of true depression symptoms.
MRI
We found smaller residual volumes in the MCI group in 18 of the 24 hippocampal subfields, bilateral hippocampal formations, and bilateral entorhinal cortices, compared with the controls. These findings are consistent with previous reports (Du et al., Reference Du, Schuff, Amend, Laakso, Hsu, Jagust, Yaffe, Kramer, Reed, Norman and Chui2001; Hanseeuw et al., Reference Hanseeuw, Van Leemput, Kavec, Grandin, Seron and Ivanoiu2011; Killiany et al., Reference Killiany, Gomez-Isla, Moss, Kikinis, Sandor, Jolesz, Tanzi, Jones, Hyman and Albert2000; La Joie et al., Reference La Joie, Perrotin, De La Sayette, Egret, Doeuvre, Belliard, Eustache, Desgranges and Chételat2013; Pennanen et al., Reference Pennanen, Kivipelto, Tuomainen, Hartikainen, Hänninen, Laakso, Hallikainen, Vanhanen, Nissinen, Helkala and Vainio2004, see review Zhou et al., Reference Zhou, Zhang, Zhao, Qian and Dong2016).
LIMITATIONS
While the best fit model created in this study is promising, there are limitations to consider. This study utilized sufficient data to meet the intended goals, but the sample size remains modest. This was driven by a desire to make optimal use of control and MCI subjects from our local ADC population as it more closely resembles a clinical population. We supplemented the subject number using subjects from the ADNI study, though we did not want to overwhelm the study with the “clinical trials population” found in ADNI (Petersen et al., Reference Petersen, Aisen, Boeve, Geda, Ivnik, Knopman, Mielke, Pankratz, Roberts, Rocca and Weigand2013). Furthermore, when working with classification models such as nominal logistic regression, the ultimate goal is to build a model using one dataset and then apply it to a parallel dataset. The very large number of comparisons and variables included in the models limited the strength of our findings. Since we aimed to investigate regions that are less studied in MCI, we used all available subjects to build this model. As such, we realize the potential to be able to refine our findings in a future study as more local subjects become available.
Further research creating classification models of control and MCI subjects utilizing medial temporal regions as well as less studied cortical regions will likely help clarify whether the 13 regions in our model are truly classifiers of cognitive impairment versus characteristics of our sample.
CONCLUSION
The results of our study highlight the utility of using measures of the hippocampus, entorhinal cortex, and hippocampal subfields in classification models of controls and MCI subjects. Moreover, our findings provide evidence of the value of less studied regions beyond the medial temporal lobe and their ability to aid in differentiating between those who exhibit normal aging and those who do not.
ACKNOWLEDGMENTS
The authors wish to thank the BU-ADC and ADNI subjects for dedicating their time to create important databases that promise to advance our knowledge of aging through studies such as the present one. No authors involved in the writing of this manuscript had any conflicts of interest with this publication, and there has been no financial support for this work that could have influenced its outcome.
Data collection and sharing for this project was funded by the BU-ADC (NIA P30 AG013846), the ADNI (National Institutes of Health grant U01 AG024904), and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie; Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California (USC). ADNI data are disseminated by the Laboratory of NeuroImaging at USC.