Hostname: page-component-745bb68f8f-f46jp Total loading time: 0 Render date: 2025-02-06T13:56:49.139Z Has data issue: false hasContentIssue false

Using major depression polygenic risk scores to explore the depressive symptom continuum

Published online by Cambridge University Press:  10 June 2020

Bradley S. Jermy
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
Saskia P. Hagenaars
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
Kylie P. Glanville
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
Jonathan R. I. Coleman
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
David M. Howard
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
Gerome Breen
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
Evangelos Vassos
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
Cathryn M. Lewis*
Affiliation:
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK Department of Medical & Molecular Genetics, Faculty of Life Sciences and Medicine, King's College London, London, UK
*
Author for correspondence: Cathryn Lewis, E-mail: cathryn.lewis@kcl.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Background

Major depression (MD) is often characterised as a categorical disorder; however, observational studies comparing sub-threshold and clinical depression suggest MD is continuous. Many of these studies do not explore the full continuum and are yet to consider genetics as a risk factor. This study sought to understand if polygenic risk for MD could provide insight into the continuous nature of depression.

Methods

Factor analysis on symptom-level data from the UK Biobank (N = 148 957) was used to derive continuous depression phenotypes which were tested for association with polygenic risk scores (PRS) for a categorical definition of MD (N = 119 692).

Results

Confirmatory factor analysis showed a five-factor hierarchical model, incorporating 15 of the original 18 items taken from the PHQ-9, GAD-7 and subjective well-being questionnaires, produced good fit to the observed covariance matrix (CFI = 0.992, TLI = 0.99, RMSEA = 0.038, SRMR = 0.031). MD PRS associated with each factor score (standardised β range: 0.057–0.064) and the association remained when the sample was stratified into case- and control-only subsets. The case-only subset had an increased association compared to controls for all factors, shown via a significant interaction between lifetime MD diagnosis and MD PRS (p value range: 2.23 × 10−3–3.94 × 10−7).

Conclusions

An association between MD PRS and a continuous phenotype of depressive symptoms in case- and control-only subsets provides support against a purely categorical phenotype; indicating further insights into MD can be obtained when this within-group variation is considered. The stronger association within cases suggests this variation may be of particular importance.

Type
Original Article
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press

Introduction

Major depression (MD) is a common psychiatric disorder that affects more than 300 million people worldwide (World Health Organization, 2017). The Diagnostic and Statistical Manual of Mental Health Disorders-5 (DSM-5) (American Psychiatric Association, 2013) and the International Classification of Disease 11 (ICD-11) (World Health Organization, 2018) implicitly assume a categorical model for MD, whereby a collection of symptoms reflects a common dysfunction not present in healthy individuals (Helzer, Kraemer, & Krueger, Reference Helzer, Kraemer and Krueger2006; Kraemer, Noda, & O'Hara, Reference Kraemer, Noda and O'Hara2004). An alternative view is that MD is dimensional, existing along a continuum. Under this assumption, diagnostic boundaries using DSM-5 and ICD-11 criteria represent an arbitrary threshold along the distribution to partition ‘affected’ from ‘unaffected’ individuals (Vares, Salum, Spanemberg, Caldieraro, & Fleck, Reference Vares, Salum, Spanemberg, Caldieraro and Fleck2015). A categorical diagnosis is useful clinically as it allows for practical decisions regarding who to treat. In research, however, in case MD was dimensional, dichotomisation would reduce the power of a study to characterise the phenotype. Simulations have showed that categorising a phenotype with a right-skewed distribution – which would be typical of depressive symptoms – according to a clinical cut-off reduced the power to detect a genetic effect explaining 2% of the variance from 90.9% to 9.2% (van der Sluis, Posthuma, Nivard, Verhage, & Dolan, Reference van der Sluis, Posthuma, Nivard, Verhage and Dolan2013).

To explore dimensionality in MD, studies have focused on clinical correlates between sub-threshold and clinical MD (reviewed in Rodríguez, Nuevo, Chatterji, & Ayuso-Mateos, Reference Rodríguez, Nuevo, Chatterji and Ayuso-Mateos2012; Solomon, Haaga, & Arnow, Reference Solomon, Haaga and Arnow2001). These findings support a dimensional classification of MD, showing an association between subthreshold MD and increased risk for disability, impairment, comorbidities and health care use (Cuijpers, de Graaf, & van Dorsselaer, Reference Cuijpers, de Graaf and van Dorsselaer2004; Hybels, Blazer, & Pieper, Reference Hybels, Blazer and Pieper2001; Rucci et al., Reference Rucci, Gherardi, Tansella, Piccinelli, Berardi, Bisoffi and Pini2003) and a linear trend between MD severity and risk of a future episode (Kendler & Gardner, Reference Kendler and Gardner1998; Kessler, Zhao, Blazer, & Swartz, Reference Kessler, Zhao, Blazer and Swartz1997).

There are three key limitations in the literature. First, by grouping individuals into sub-threshold and clinical MD, the full continuum of the phenotype is not explored. Second, grouping individuals in this way fails to account for symptomatic heterogeneity. According to the DSM-5, one of the two ‘core’ symptoms – low mood or anhedonia – in combination with any other combination of four symptoms meets the criteria for an MD diagnosis (American Psychiatric Association, 2013). This gives rise to a possible 227 symptom profiles, with evidence that many of these profiles present clinically (Olbert, Gala, & Tupler, Reference Olbert, Gala and Tupler2014; Zimmerman, Ellison, Young, Chelminski, & Dalrymple, Reference Zimmerman, Ellison, Young, Chelminski and Dalrymple2015). Finally, the symptoms assessed are limited to the nine symptoms specified within the DSM or ICD criteria for MD. While these do reflect the core symptoms of MD, many other symptoms are commonly present. Restricting the analysis to the core symptoms therefore omits potentially relevant information to MD; for example, many patients with MD also show symptoms related to anxiety (Moffitt et al., Reference Moffitt, Harrington, Caspi, Kim-Cohen, Goldberg, Gregory and Poulton2007).

A meta-analysis of twin studies involving 21 000 individuals estimated the heritability of MD to be 37% (95% CI 31–42%) (Sullivan, Neale, & Kendler, Reference Sullivan, Neale and Kendler2000). A recent genome-wide association study (GWAS) using a broad definition of depression has identified 102 independent loci (Howard et al., Reference Howard, Adams, Clarke, Hafferty, Gibson, Shirali and McIntosh2019). These studies confirm a polygenic architecture for MD, where genetic predisposition comprises many common genetic variants of small effect that additively increase risk (Sullivan, Daly, & O'Donovan, Reference Sullivan, Daly and O'Donovan2012). GWAS results can be used to determine a single measure of genetic liability from common variants to MD, the polygenic risk score (PRS), calculated by summing the number of risk alleles carried by an individual, weighted by their effect size (Wray et al., Reference Wray, Lee, Mehta, Vinkhuyzen, Dudbridge and Middeldorp2014). How PRS associates with various characterisations of MD can be used to deepen our nosological understanding of the disorder.

This study aimed to explore the dimensionality of MD using symptom and genetic data. Exploratory and confirmatory factor analyses were used to construct multiple dimensional phenotypes for MD within the UK Biobank, a volunteer-based, national health resource. These phenotypes were tested for association with MD PRS – calculated according to a categorical definition – to understand if dimensional phenotypes contain information that would not be identified from a categorical diagnosis. Individuals were stratified by self-reported MD status to explore how the association differed between MD cases and controls.

Methods

Participants

Participants included in this study were recruited in the UK Biobank, a national health resource of 502 655 individuals aged between 37 and 73 at the time of recruitment (2006–2010) (Bycroft et al., Reference Bycroft, Freeman, Petkova, Band, Elliott, Sharp and Marchini2018). In 2016, participants were offered an online Mental Health Questionnaire (MHQ) which 157 336 participants voluntarily completed (Davis et al., Reference Davis, Coleman, Adams, Allen, Breen, Cullen and Hotopf2020).

Item selection for the dimensional phenotypes

Eighteen items were selected from the MHQ to construct the dimensional phenotypes for MD. These included all items from the Patient Health Questionnaire 9 (PHQ-9) – a measure of depressive symptoms over the last 2 weeks which correspond to the DSM criteria for MD (Kroenke, Spitzer, & Williams, Reference Kroenke, Spitzer and Williams2001); all items from the General Anxiety Disorder 7 (GAD-7) (Spitzer, Kroenke, Williams, & Löwe, Reference Spitzer, Kroenke, Williams and Löwe2006), relating to symptoms of anxiety over the last 2 weeks; and two items from the subjective well-being questionnaire, on general happiness and the meaning of life. For a detailed list of these items, see Table S1.

In total, 148 957 MHQ participants provided a non-missing response to all items above. This sample was randomly and equally split into ‘training’ (N = 74 478) and ‘test’ (N = 74 479) sub-samples. The training sample was used to develop a factor model with good fit using exploratory factor analysis (EFA), and the test sample was used to internally validate this model using confirmatory factor analysis (CFA).

Exploratory factor analysis

Polychoric correlations were computed for the 18 ordinal items in the training sample (Carroll, Reference Carroll1961). So that all correlations were positive, two items from the subjective well-being questionnaire – relating to a participant's general happiness and belief in how meaningful their life is – were reverse coded. Ordinal α (Gadermann, Guhn, & Zumbo, Reference Gadermann, Guhn and Zumbo2012) Keiser–Meyer–Olkin (KMO) (Kaiser, Reference Kaiser1974) and Bartlett's test of sphericity (Bartlett, Reference Bartlett1950) were computed. Factor analysis was considered an acceptable method if ordinal α and KMO were >0.80, and Bartlett's test with p < 0.05 in line with previous guidelines (Table S2) (Beavers et al., Reference Beavers, Lounsbury, Richards, Huck, Skolits and Esquivel2013; Gadermann et al., Reference Gadermann, Guhn and Zumbo2012).

EFA was performed using the weighted least squares method and factors were allowed to correlate using the ‘geominQ’ form of oblique rotation. Parallel analysis (Horn, Reference Horn1965) and Velicer's Minimum Average Partial test (Velicer, Reference Velicer1976) were used to produce an upper and lower bound for the appropriate number of factors, respectively. Based on these bounds, factor models were fit iteratively and compared using the criteria: Tucker–Lewis index (TLI) ⩾0.95, root mean square error approximation (RMSEA) ⩽0.05 and a smaller Bayesian information criteria (BIC) relative to other models. The model with the best fit was retained for further testing.

Items were removed according to a series of post-hoc tests using Thurstone's analytical method for simple structure criteria (Thurstone, Reference Thurstone1947) to attain a good model fit. Where multiple models demonstrated good fit, the model retaining the largest number of items was chosen. All analysis steps for EFA were performed using the psych package in R 3.4.1 (Revelle, Reference Revelle2017).

Confirmatory factor analysis

To internally validate the EFA-derived model, CFA was performed in the ‘test’ sub-sample using the lavaan package in R 3.4.1 (Rosseel, Reference Rosseel2012). Factor loadings >0.3 from the EFA model were used to specify the relationships between latent variables and items within the CFA model. As in EFA, factors were allowed to correlate. In addition to the model fit metrics used in EFA, the comparative fit index (CFI) and standardised root mean square residual (SRMR) were calculated. Models with CFI ⩾0.95 and SRMR ⩽0.05 were considered a good fit.

Deriving the phenotype: factor score calculation

Following confirmation of the proposed model, the selected model was fit to the full sample using CFA, and factor scores were computed for each factor (Fig. 1). Distributions of all factor scores are displayed in Figs S1–S3.

Fig. 1. Flow chart displaying the methodology of the study along with the sample sizes for each section.

Polygenic risk score calculation

MD PRS were constructed using PRSice v2 (Choi & O'Reilly, Reference Choi and O'Reilly2019; Euesden, Lewis, & O'Reilly, Reference Euesden, Lewis and O'Reilly2014) in unrelated individuals of European ancestry (N = 119 692) using genotype data and quality control procedures previously described (online Supplementary Methods) (Bycroft et al., Reference Bycroft, Freeman, Petkova, Band, Elliott, Sharp and Marchini2018; Coleman et al., Reference Coleman, Peyrot, Purves, Davis, Rayner, Choi and Breen2020). This sample therefore represents a subset of the full sample for which factor scores had been calculated (N = 148 957). Summary statistics from Wray et al. (Reference Wray, Ripke, Mattheisen, Trzaskowski, Byrne and Abdellaoui2018) with 23andMe and UK Biobank samples removed (N cases = 45 591, N controls = 97 674) were used as the base dataset. To account for linkage disequilibrium, clumping was performed so that single nucleotide polymorphisms (SNPs) had an r 2 < 0.1 and a 500 kb window from other SNPs. MD PRS were then calculated across 11 p value thresholds (p < 5 × 10−8, p < 1 × 10−5, p < 0.001, p < 0.01, p < 0.05, p < 0.1, p < 0.2, p < 0.3, p < 0.4, p < 0.5, p < 1). PRS for height were used as a negative control, and were calculated in the same way as MD PRS, using summary statistics from Wood et al. (Reference Wood, Esko, Yang, Vedantam, Pers, Gustafsson and Frayling2014) (N = 253 228) as the base dataset.

Association testing

Factor scores and MD PRS were standardised, and linear regression was performed using the 119 692 MHQ participants. Each factor score was regressed on MD and height PRS with the first six genetic principal components, genotyping batch and assessment centre fitted as covariates.

Stratification by major depression diagnosis

MD cases and controls were identified using the Composite International Diagnostic Interview – Short Form (CIDI-SF), a structured self-report questionnaire focusing on depressive symptoms during an individual's worst episode of depression (online Supplementary Methods) (Kessler, Andrews, Mroczek, Ustun, & Wittchen, Reference Kessler, Andrews, Mroczek, Ustun and Wittchen1998). Individuals who did not complete the CIDI-SF, or who were removed due to the exclusion criteria (online Supplementary Methods), were assigned an ‘unknown’ status, resulting in 60 760 controls, 27 692 cases and 31 240 unknowns.

Participants were stratified into cases and controls and the same set of linear regressions repeated in each group. To test for group differences, cases and controls were combined (N = 88 452) and an interaction term for MD PRS and case–control status was included in the linear regression model, in addition to interaction terms between each of these variables and all other covariates (Keller, Reference Keller2014). Cases were further stratified into single-episode and recurrent cases with recurrence defined as having self-reported two or more lifetime depressive episodes and the interaction test was re-performed. In all interaction tests, controls were set as the reference for MD status. As a sensitivity analysis, the same tests were performed in a group comprising of both controls and unknowns.

As many of the tests are highly correlated, a method proposed by Nyholt (Reference Nyholt2004) was used to correct for multiple testing. The ‘poolR’ package (Cinar & Viechtbauer, Reference Cinar and Viechtbauer2016) in R 3.6.1 showed the effective number of independent tests for the factor scores and MD PRS was 14, giving a Bonferroni-corrected α < 0.0036 (0.05/14).

Results

Exploratory factor analysis

Initially, a six-factor model produced the best fit for the 18 items from the PHQ-9, GAD-7 and the two subjective well-being questions (Table S3). Model fit statistics did not surpass pre-defined thresholds (TLI = 0.973, RMSEA = 0.054), suggesting either poor discrimination or a lack of shared variance for particular items.

To improve model fit, items were removed according to a series of post-hoc tests based on the loadings from this six-factor model (Table S4) (Thurstone, Reference Thurstone1947). The following criteria produced a model with a good fit to the data that retained the greatest number of items: remove items with no factor loadings above 0.4, remove items with loadings to multiple factors above 0.3, retain MD symptoms from the PHQ-9 that would otherwise qualify for removal except for an item for feelings of inadequacy (test 8; Table S4). Retaining all depressive symptoms previously removed according to the cross-loading and factor loading criteria did not produce a good fit (test 5; Table S4). As such, a leave-one-out scenario was undertaken whereby depressive symptoms that had been previously removed due to the cross-loading and factor loading criteria in test 4 (psychomotor retardation/agitation, impaired concentration, feelings of inadequacy and suicidal thoughts) were removed in ascending order of the items communality – a measure of the shared variance of an item with all other items in the model (tests 6–8; Table S4). The first item removed that resulted in a good model fit was feelings of inadequacy (and so a test removing suicidal thoughts was not performed). These criteria retained 15 of the original 18 items and a five-factor model produced a model with good fit (TLI = 0.981, RMSEA = 0.048). Items relating to trouble relaxing, irritability and feelings of inadequacy were excluded.

Confirmatory factor analysis

In contrast to EFA which allows all items to load on all factors, CFA specifies the relationship between items and factors directly and provides a more stringent test for the proposed model (Thompson, Reference Thompson2004). CFA within the test sub-sample (N = 74 479) confirmed a good fit to the observed covariance matrix (CFI = 0.995, TLI = 0.993, RMSEA = 0.033, SRMR = 0.025). As correlations between the five factors were moderate to high (r = 0.574–0.848), a second-order latent variable was included in the model; the model fit remained within the pre-specified thresholds (CFI = 0.992, TLI = 0.99, RMSEA = 0.038, SRMR = 0.031). See Tables S5a–S5f for detailed results on the model fit, factor loadings and correlations. Fit statistics did not change when fitting the model into the full sample (N = 148 957) using CFA (Table S5g).

In the final model, the five first-order factors reflected feelings of anxiety, psychomotor-cognitive impairment, neurovegetative states, mood and subjective well-being. The second-order factor, representing the correlation between these factors, was termed the ‘internalising’ factor (Fig. 2). Calculated as the average squared factor loading, the variance explained by each first-order factor ranged from 49% to 70%. The internalising factor explained 72% of the variance for all first-order factors. The high degree of variance explained supports the use of these factors as phenotypes for dimensional MD.

Fig. 2. Factor model used to derive the dimensional phenotypes. As is customary in structural equation modelling graphs, circles are factors and squares are the self-reported symptoms. Shaded areas relate to either core MD symptoms or factors containing a majority of MD symptoms. Arrows pointing from either one factor to a symptom or a factor to another factor represent the factor loadings. *The items ‘General Happiness’ and ‘Meaningful Life’ have been reverse coded such that they explore ‘general unhappiness’ or ‘lack of meaning in one's life’. Subjective well-being, therefore, also corresponds to a ‘subjective lack of well-being’. Nomenclature has been retained for the brevity of the labelling.

Major depression polygenic risk score associates with an individual's factor score

MD PRS were associated with each of the factor scores (β range: 0.056–0.064; p value range: 2.57 × 10−82–1.89 × 10−107; MD PRS p value threshold: p T < 0.3). While the effect sizes were similar across factors (Table S6; Figs S4–S9), the two factors with the lowest association with MD PRS related to feelings of anxiety and subjective well-being (Fig. 3). In contrast, height PRS was not associated with any factor score (p > 0.05) at any p value threshold (Table S7).

Fig. 3. Association of MD PRS on each factor in the full sample and when stratified by MD Case/Control Status. The MD PRS used was calculated at the p value threshold of p T < 0.3. Both MD PRS and Factor scores were standardised to have a mean of 0 and variance of 1 using the full sample.

Stratification by case/control status

MD PRS remained associated with factor scores following stratification into case- and control-only subgroups (controls: β range: 0.022–0.025; p value range: 1.89 × 10−13–1.43 × 10−17; cases: β range: 0.042–0.055; p value range: 7.54 × 10−10–8.95 × 10−17; MD PRS p value threshold: p T < 0.3). Figure 3 shows the β coefficients for MD PRS on each factor for each group (cases, controls and unknowns) (Tables S8–S10; Figs S4–S9). The p value thresholds of MD PRS with the largest effect size differed across factors between cases and controls (case threshold: p T < 0.2; control threshold: p T < 1) suggesting SNPs associated with MD contained greater signal for dimensions within cases whereas dimensions within controls required additional SNPs with weaker evidence for association in the base data.

MD PRS had an attenuated effect in controls relative to cases (Fig. 3). To formally test for a differential genetic effect, factor scores were regressed on MD PRS and MD diagnostic status (case or control; N = 88 452) with an interaction term. An interaction was detected for all factors (β range: 0.020–0.032; p value range: 2.23 × 10−3–3.94 × 10−7; MD PRS p value threshold: p T < 0.3; Table 1) indicating the genetic contribution is potentiated in cases relative to controls. The effect size of MD PRS with both controls and unknowns in a single group was greater than the association of the two individual groups (Fig. S10). This aggregate group showed no interaction effect of MD PRS with MD case status (β range: 0.001–0.01; p value >0.05; MD PRS p value threshold: p T < 0.3).

Table 1. Main and interaction effects of MD PRS and MD diagnostic status on an individual's standardised factor score

The full sample has been subset to only include individuals with MD diagnostic status (N = 27 692 cases; N = 60 760 controls). Cases have been coded as 1 and controls as 0 (reference). Main effect models for MD PRS and MD diagnostic status include the relevant variable and covariates. The main effects are shown to evidence their statistical significance, often a pre-requisite to performing a statistical interaction.

* p value not possible to determine through lm() function in R as it is below the floor limit of the software. As such it is specified to simply be under a specific value given in the summary results from R (p < 2 × 10−16).

Cases were stratified into single-episode (N = 10 590) and recurrent cases, reporting two or more lifetime depressive episodes (N = 10 726). The interaction test was repeated with controls as the reference category. No interaction effect was detected between controls and single-episode cases (p > 0.05) for any factors. A nominally significant interaction was found between controls and recurrent cases for all factors, except anxiety and subjective well-being. No results survived correction for multiple testing (Tables 2 and 3).

Table 2. Main effects of MD PRS and MD diagnostic status split into single-episode and recurrent cases

The full sample has been subset to only include individuals with MD diagnostic status who report the number of lifetime episodes of depression (N = 10 590 single-episode cases; N = 10 726 recurrent cases; N = 60 760 controls). Single-episode cases have been coded as 1, recurrent cases as 2 and controls were coded as 0 and set as the reference category. Main effect models for MD PRS and MD diagnostic status include the relevant variable and covariates. The main effects are shown to provide evidence for their statistical significance, often a pre-requisite to performing a statistical interaction.

* p value not possible to determine through lm() function in R as it is below the floor limit of the software. As such it is specified to simply be under a specific value (p < 2 × 10−16) given in the summary results from R.

Table 3. Interaction effects of MD PRS and MD diagnostic status split into single-episode and recurrent cases

The full sample has been subset to only include individuals with MD diagnostic status who report the number of lifetime episodes of depression (N = 10 590 single-episode cases; N = 10 726 recurrent cases; N = 60 760 controls).

Discussion

The aim of this study was to explore polygenic associations across the continuum of MD and to test if this association differed between cases and controls. Using the UK Biobank, this study shows that the polygenic liability for a categorical MD phenotype (Wray et al., Reference Wray, Ripke, Mattheisen, Trzaskowski, Byrne and Abdellaoui2018) also associates with a dimensional model of depressive symptoms. Moreover, this finding holds in analyses stratified by case–control status. This suggests PRS contains information over and above the risk of becoming a case and may also be used to indicate severity across the continuum.

To account for symptom-level heterogeneity within the dimensional phenotype, this study used factor analysis to derive a five-factor hierarchical structure for MD. Previous studies investigating the latent structure of MD have produced multiple factor solutions. One- and two-factor models have been shown to produce the best fit for the PHQ-9, depending on the sample selected, i.e. case-only or population cohort (Elhai et al., Reference Elhai, Contractor, Tamburrino, Fine, Prescott, Shirley and Calabrese2012; Kocalevent, Hinz, & Brähler, Reference Kocalevent, Hinz and Brähler2013). The five-factor hierarchical model derived in this study has a high level of agreement for depressive symptoms with a model proposed by Kendler, Aggen, and Neale (Reference Kendler, Aggen and Neale2013) in a population-based sample of 7500 twins, which showed three uncorrelated genetic factors best decompose the phenotypic variance in lifetime MD symptoms. Our model differed in two ways, firstly the symptom of feelings of worthlessness or excessive guilt was not included; and suicidal thoughts loaded on the mood factor, whereas it loaded on the psychomotor/cognitive factor in the model proposed by Kendler et al. (Reference Kendler, Aggen and Neale2013). Our study may therefore be considered a quasi-replication that used phenotypic rather than genetic covariance, in a substantially larger dataset (N = 119 692), with current in contrast to lifetime symptoms, to provide support for the multidimensionality of MD. Multidimensionality also indicates a deeper level of complexity in MD that is left unexplored when using either sum-scores of symptoms or a case–control design. A logical extension would be to use this multidimensionality to determine which genetic variants influence each factor; however, the high loadings of the five first-order factors onto the second-order ‘internalising’ factor suggest a significant proportion of this complexity is shared. As a result, it is likely that larger sample sizes will be required to identify genetic variants specific to a given factor.

The separation between symptoms of MD, anxiety and subjective well-being is noteworthy. The GAD-7 has previously been shown to possess a unidimensional factor structure (Löwe et al., Reference Löwe, Decker, Müller, Brähler, Schellberg, Herzog and Herzberg2008) distinct from symptoms of MD (Spitzer et al., Reference Spitzer, Kroenke, Williams and Löwe2006). Compared with the ‘core MD factors’, an interaction between controls and recurrent MD with MD PRS was not detected for the factors relating to anxiety and subjective well-being at the level of nominal significance. This may suggest that although these symptom dimensions contain a highly pleiotropic genetic component (Purves et al., Reference Purves, Coleman, Meier, Rayner, Davis, Cheesman and Eley2019), a diagnosis of MD contains a degree of specificity which reflects the structure suggested by the DSM and ICD. However, as no factors survived correction for multiple testing for the recurrent case interaction, this conclusion warrants further investigation.

When stratified into cases and controls for MD, an attenuated association between MD PRS and the continuous phenotypes was evident for controls relative to cases. This is, perhaps, not surprising as controls were screened for the presence of any psychiatric disorders and high levels of current depressive symptoms (PHQ-9 sum-score <14). As such, compared to cases, controls are expected to form a more homogenous group of healthy individuals, limiting the power for MD PRS to associate with the phenotype. As can be seen in Figs S4–S9, the variance in factor scores in controls is approximately half of that found in cases as scores are condensed heavily towards the left tail. However, the variance in the MD PRS remains approximately equal across all three sub-groups. Sampling from a smaller subset of the total variance within the factor scores will drive the association towards the null. This conclusion is further supported by the removal of any interaction effect of MD diagnostic status when controls and unknowns were combined into a single group (Fig. S10). Nevertheless, a significant association was still evident, suggesting that even in what would typically be considered a healthy, ‘super-control’, group, MD PRS can still differentiate the subtle differences of the continuous phenotype. The larger control sample size limits comparisons of significance between cases and controls; however, the core finding is of significance within groups, not a comparison between groups.

In contrast, the effect size within cases was similar to that of the entire sample, indicating cases contain the majority of the signal for the dimensions. This finding has important implications under the assumption of a purely dimensional phenotype as it suggests ignoring the variation within cases, also ignores a substantial proportion of the association. As highlighted above, the increased association within cases may be primarily due to the increased variability in factor scores within the group. Alternatively, questionnaires may be being interpreted differently between cases and controls for MD, perhaps due to a greater degree of familiarity with the questionnaires by people who have been diagnosed with depression. Familiarity may increase the validity of the responses as individuals are more ‘in tune’ with the symptoms, reducing measurement error and improving power for the study to detect an association.

The association of MD PRS with the factor scores increases as the three subgroups are coalesced, showing the largest association when all participants are included (Fig. 3; Fig. S10). This is consistent with a model in which the mean MD PRS and factor score differ by diagnostic subgroup and the variance of the factor scores within the groups is smaller than that of the full sample. A simulated example to demonstrate the reasoning is shown graphically in Fig. S11. Importantly, the different associations between cases and controls do not provide evidence either for or against a categorical phenotype for MD. Further studies are warranted to explore this within diagnostic sub-group variation for differential genetic risk.

Whilst the presence of an association within cases and controls for MD appears to contradict the purely categorical phenotype of MD – differences within cases and controls are not associated with risk factors – this study cannot exclude the possibility that MD is a categorical phenotype characterised by continuous variation within cases and controls. Two taxometric studies, designed specifically to detect the presence of such groups or ‘taxons’, provide support for this finding (Ruscio, Brown, & Ruscio, Reference Ruscio, Brown and Ruscio2009; Ruscio, Zimmerman, McGlinchey, Chelminski, & Young, Reference Ruscio, Zimmerman, McGlinchey, Chelminski and Young2007). However, this field of research has consistently supported a dimensional classification, and the results appear to depend on the measurement instrument used, whether the symptoms were self-report or clinically ascertained and age of the sample (Hankin, Fraley, Lahey, & Waldman, Reference Hankin, Fraley, Lahey and Waldman2005; Liu, Reference Liu2016).

Limitations

This study has many strengths including large sample size from a volunteer-based, national health resource and its accounting for the heterogeneity inherent to MD; however, important limitations remain. The items used to create the dimensional phenotypes were self-reported, increasing the risk of misclassification and sampling bias. It has been shown that participants who responded to the MHQ have a higher level of education and fewer hospital diagnoses inclusive of mental disorders compared with other UK Biobank participants (Adams et al., Reference Adams, Hill, Howard, Dashti, Davis, Campbell and McIntosh2019). This ‘healthier and wealthier’ bias may hamper our ability to appropriately represent cases at the most severe end of the spectrum. The same study showed that MHQ responders were more likely to have a family history of severe depression relative to non-responders (Adams et al., Reference Adams, Hill, Howard, Dashti, Davis, Campbell and McIntosh2019). Similarly, the influence of personal interest in mental health could limit the generalisability of this sample to the general population.

A combination of theory on the depression phenotype and minimum threshold criteria for fit statistics was imposed to choose the factor model. Thresholds are driven by best practice; ultimately, however, they are arbitrary. With a different set of thresholds, another model may have been chosen during the exploratory phase of the analysis. The authors recognise this but believe this model to make theoretical sense and have a strong fit to the observed covariance matrix as strict thresholds have been chosen.

This study assessed current depressive symptoms at a single time point, when the MHQ was completed. Studies investigating latent class trajectory of depressive symptoms have supported the dynamic nature of MD, with trajectories including persistently low, persistently high, increasing and decreasing symptoms (Byers et al., Reference Byers, Vittinghoff, Lui, Hoang, Blazer, Covinsky and Yaffe2012; Kuchibhatla, Fillenbaum, Hybels, & Blazer, Reference Kuchibhatla, Fillenbaum, Hybels and Blazer2012). Future studies should seek to confirm this model in longitudinal settings to test if it is robust to temporal invariance (Widaman, Ferrer, & Conger, Reference Widaman, Ferrer and Conger2010). Furthermore, the distinction between factors might reflect an artefact of time as the items are from three questionnaires (PHQ-9, GAD-7 and subjective well-being). For example, if the PHQ-9 was taken at the start of the MHQ and the GAD-7 at the end, subtle shifts in mood may occur while completing the MHQ that could artificially reduce the correlations between depressive and anxiety symptoms. This limitation is somewhat mitigated from the fact that the MHQ was completed by most participants in a short period with 82% having completed the questionnaire within 25 min (Davis et al., Reference Davis, Coleman, Adams, Allen, Breen, Cullen and Hotopf2020).

Conclusions

MD PRS supports a multi-dimensional model of MD, indicating that information is contained within cases and controls that would otherwise be omitted using a categorical phenotype. Much of this additional information is held within cases; possibly due to the greater variation in factor scores found within this group. Considering this additional variation in future study design may enhance the power to detect genetic associations, elevate our current aetiological understanding and improve prediction through more accurate PRS.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291720001828.

Acknowledgements

CML is funded by the Medical Research Council (N015746/1). SPH is funded by the Medical Research Council (MR/S0151132). DMH is supported by a Sir Henry Wellcome Postdoctoral Fellowship (Reference 213674/Z/18/Z) and a 2018 NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation (Ref: 27404). This study represents independent research funded by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care. We thank participants and scientists involved in making the UK Biobank resource available (http://www.ukbiobank.ac.uk/). UK Biobank data used in this study were obtained under approved application 18177.

Conflict of interest

Cathryn M Lewis reports having received fees from Myriad Neuroscience. Bradley S Jermy, Saskia p Hagenaars, Kylie p Glanville, Jonathan RI Coleman, David M Howard, Gerome Breen and Evangelos Vassos reported no biomedical financial interests or potential conflicts of interest.

Footnotes

*

These authors share senior authorship.

References

Adams, M. J., Hill, W. D., Howard, D. M., Dashti, H. S., Davis, K. A. S., Campbell, A., … McIntosh, A. M. (2019). Factors associated with sharing e-mail information and mental health survey participation in large population cohorts. International Journal of Epidemiology. [published online ahead of print Jul 1]. doi:10.1093/ije/dyz134.Google Scholar
American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5®). Arlington, VA: American Psychiatric Publishing.Google Scholar
Bartlett, M. S. (1950). Tests of significance in factor analysis. British Journal of Statistical Psychology, 3(2), 7785. doi:10.1111/j.2044-8317.1950.tb00285.x.CrossRefGoogle Scholar
Beavers, A. S., Lounsbury, J. W., Richards, J. K., Huck, S. W., Skolits, G. J., & Esquivel, S. L. (2013). Practical considerations for using exploratory factor analysis in educational research. Practical Assessment, Research & Evaluation, 18(1), 6. Retrieved from https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1303&context=pare.Google Scholar
Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., … Marchini, J. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203. doi:10.1038/s41586-018-0579-z.CrossRefGoogle ScholarPubMed
Byers, A. L., Vittinghoff, E., Lui, L-Y., Hoang, T., Blazer, D. G., Covinsky, K. E., … Yaffe, K. (2012). Twenty-year depressive trajectories among older women. Archives of General Psychiatry, 69(10), 10731079. doi:10.1001/archgenpsychiatry.2012.43.CrossRefGoogle ScholarPubMed
Carroll, J. B. (1961). The nature of the data, or how to choose a correlation coefficient. Psychometrika, 26(4), 347372. doi:10.1007/BF02289768.CrossRefGoogle Scholar
Choi, S. W., & O'Reilly, P. F. (2019). PRSice-2: Polygenic risk score software for biobank-scale data. Gigascience, 8(7), doi:10.1093/gigascience/giz082.CrossRefGoogle ScholarPubMed
Cinar, O., & Viechtbauer, W. (2016). PoolR: Package for pooling the results from (dependent) tests. Retrieved from https://rdrr.io/github/ozancinar/poolR/man/poolr-package.html.Google Scholar
Coleman, J. R., Peyrot, W. J., Purves, K. L., Davis, K. A., Rayner, C., Choi, S. W., … Breen, G. (2020). Genome-wide gene-environment analyses of major depressive disorder and reported lifetime traumatic experiences in UK Biobank. Molecular Psychiatry, [published online ahead of print Jan 23], 117. doi:10.1038/s41380-019-0546-6.Google ScholarPubMed
Cuijpers, P, de Graaf, R., & van Dorsselaer, S. (2004). Minor depression: Risk profiles, functional disability, health care use and risk of developing major depression. Journal of Affective Disorders, 79(1), 7179. doi:10.1016/S0165-0327(02)00348-8.CrossRefGoogle ScholarPubMed
Davis, K. A., Coleman, J. R., Adams, M., Allen, N., Breen, G., Cullen, B., … Hotopf, M. (2020). Mental health in UK Biobank – development, implementation and results from an online questionnaire completed by 157 366 participants: A reanalysis. BJPsych Open, 6(2), e18. doi: 10.1192/bjo.2019.100.CrossRefGoogle ScholarPubMed
Elhai, J. D., Contractor, A. A., Tamburrino, M., Fine, T. H., Prescott, M. R., Shirley, E., … Calabrese, J. R. (2012). The factor structure of major depression symptoms: A test of four competing models using the Patient Health Questionnaire-9. Psychiatry Research, 199(3), 169173. doi:10.1016/j.psychres.2012.05.018.CrossRefGoogle ScholarPubMed
Euesden, J., Lewis, C. M., & O'Reilly, P. F. (2014). PRSice: Polygenic risk score software. Bioinformatics (Oxford, England), 31(9), 14661468. doi:10.1093/bioinformatics/btu848.CrossRefGoogle ScholarPubMed
Gadermann, A. M., Guhn, M., & Zumbo, B. D. (2012). Estimating ordinal reliability for Likert type and ordinal item response data: A conceptual, empirical, and practical guide. Practical Assessment, Research, and Evaluation, 17(1), 3. Retrieved from https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1247&context=pare.Google Scholar
Hankin, B. L., Fraley, R. C., Lahey, B. B., & Waldman, I. D. (2005). Is depression best viewed as a continuum or discrete category? A taxometric analysis of childhood and adolescent depression in a population-based sample. Journal of Abnormal Psychology, 114(1), 96110. doi:10.1037/0021-843X.114.1.96.CrossRefGoogle ScholarPubMed
Helzer, J. E., Kraemer, H. C., & Krueger, R. F. (2006). The feasibility and need for dimensional psychiatric diagnoses. Psychological Medicine, 36(12), 16711680. doi:10.1017/S003329170600821X.CrossRefGoogle ScholarPubMed
Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30(2), 179185. doi:10.1007/bf02289447.CrossRefGoogle ScholarPubMed
Howard, D. M., Adams, M. J., Clarke, T. K., Hafferty, J. D., Gibson, J., Shirali, M, … McIntosh, A. M. (2019). Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nature Neuroscience, 22(3), 343. doi:10.1038/s41593-018-0326-7.CrossRefGoogle ScholarPubMed
Hybels, C. F., Blazer, D. G., & Pieper, C. F. (2001). Toward a threshold for subthreshold depression: An analysis of correlates of depression by severity of symptoms using data from an elderly community sample. The Gerontologist, 41(3), 357365. doi:10.1093/geront/41.3.357.CrossRefGoogle Scholar
Kaiser, H. F. (1974). An index of factorial simplicity. Psychometrika, 39(1), 3136. doi:10.1007/BF02291575.CrossRefGoogle Scholar
Keller, M. C. (2014). Gene x environment interaction studies have not properly controlled for potential confounders: The problem and the (simple) solution. Biological Psychiatry, 75, 1824. doi:10.1016/j.biopsych.2013.09.006.CrossRefGoogle Scholar
Kendler, K. S., Aggen, S. H., & Neale, M. C. (2013). Evidence for multiple genetic factors underlying DSM-IV criteria for major depression. JAMA Psychiatry. 70(6), 599607. doi:10.1001/jamapsychiatry.2013.751.CrossRefGoogle ScholarPubMed
Kendler, K. S., & Gardner, C. O. Jr. (1998). Boundaries of major depression: An evaluation of DSM-IV criteria. American Journal of Psychiatry, 155(2), 172177. doi:10.1176/ajp.155.2.172.Google ScholarPubMed
Kessler, R. C., Andrews, G., Mroczek, D., Ustun, B., & Wittchen, H. U. (1998). The World Health Organization composite international diagnostic interview short-form (CIDI-SF). International Journal of Methods in Psychiatric Research, 7(4), 171185. doi:10.1002/mpr.47.CrossRefGoogle Scholar
Kessler, R. C., Zhao, S., Blazer, D. G., & Swartz, M. (1997). Prevalence, correlates, and course of minor depression and major depression in the National Comorbidity Survey. Journal of Affective Disorders, 45(1–2), 1930. doi:10.1016/s0165-0327(97)00056-6.CrossRefGoogle ScholarPubMed
Kocalevent, R-D., Hinz, A., & Brähler, E. (2013). Standardization of the depression screener Patient Health Questionnaire (PHQ-9) in the general population. General Hospital Psychiatry, 35(5), 551555. doi:10.1016/j.genhosppsych.2013.04.006.CrossRefGoogle ScholarPubMed
Kraemer, H. C., Noda, A., & O'Hara, R. (2004). Categorical versus dimensional approaches to diagnosis: Methodological challenges. Journal of Psychiatric Research, 38(1), 1725. doi:10.1016/s0022-3956(03)00097-9.CrossRefGoogle ScholarPubMed
Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606613. doi:10.1046/j.1525-1497.2001.016009606.x.CrossRefGoogle ScholarPubMed
Kuchibhatla, M. N., Fillenbaum, G. G., Hybels, C. F., & Blazer, D. G. (2012). Trajectory classes of depressive symptoms in a community sample of older adults. Acta Psychiatrica Scandinavica, 125(6), 492501. doi:10.1111/j.1600-0447.2011.01801.x.CrossRefGoogle Scholar
Liu, R. T. (2016). Taxometric evidence of a dimensional latent structure for depression in an epidemiological sample of children and adolescents. Psychological Medicine, 46(6), 12651275. doi:10.1017/S0033291715002792.CrossRefGoogle Scholar
Löwe, B., Decker, O., Müller, S., Brähler, E., Schellberg, D., Herzog, W., & Herzberg, P. Y. (2008). Validation and standardization of the Generalized Anxiety Disorder Screener (GAD-7) in the general population. Medical Care, 46(3), 266274. doi:10.1097/MLR.0b013e318160d093.CrossRefGoogle ScholarPubMed
Moffitt, T. E., Harrington, H., Caspi, A., Kim-Cohen, J., Goldberg, D., Gregory, A. M., & Poulton, R. (2007). Depression and generalized anxiety disorder: Cumulative and sequential comorbidity in a birth cohort followed prospectively to age 32 years. Archives of General Psychiatry, 64(6), 651660. doi:10.1001/archpsyc.64.6.651.CrossRefGoogle Scholar
Nyholt, D. R. (2004). A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. The American Journal of Human Genetics, 74(4), 765769. doi:10.1086/383251.CrossRefGoogle ScholarPubMed
Olbert, C. M., Gala, G. J., & Tupler, L. A. (2014). Quantifying heterogeneity attributable to polythetic diagnostic criteria: Theoretical framework and empirical application. Journal of Abnormal Psychology, 123(2), 452462. doi:10.1037/a0036068.CrossRefGoogle ScholarPubMed
Purves, K. L., Coleman, J. R., Meier, S. M., Rayner, C., Davis, K. A., Cheesman, R., … Eley, T. C. (2019). A major role for common genetic variation in anxiety disorders. Molecular Psychiatry, [published online ahead of print Nov 20], 112. doi:10.1038/s41380-019-0559-1.Google Scholar
Revelle, W. (2017). Psych: Procedures for psychological, psychometric, and personality research. Retrieved from: https://CRAN.R-project.org/package=psych.Google Scholar
Rodríguez, M. R., Nuevo, R., Chatterji, S., & Ayuso-Mateos, J. L. (2012). Definitions and factors associated with subthreshold depressive conditions: A systematic review. BMC Psychiatry, 12(1), 181. doi:10.1186/1471-244X-12-181.CrossRefGoogle ScholarPubMed
Rosseel, Y. (2012). lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(1), 136. doi:10.18637/jss.v048.i02.CrossRefGoogle Scholar
Rucci, P., Gherardi, S., Tansella, M., Piccinelli, M., Berardi, D., Bisoffi, G, … Pini, S. (2003). Subthreshold psychiatric disorders in primary care: Prevalence and associated characteristics. Journal of Affective Disorders, 76(1), 171181. doi:10.1016/s0165-0327(02)00087-3.CrossRefGoogle ScholarPubMed
Ruscio, J., Brown, T. A., & Ruscio, A. M. (2009). A taxometric investigation of DSM-IV major depression in a large outpatient sample. Assessment, 16(2), 127144. doi:10.1177/1073191108330065.CrossRefGoogle Scholar
Ruscio, J., Zimmerman, M., McGlinchey, J. B., Chelminski, I., & Young, D. (2007). Diagnosing major depressive disorder XI: A taxometric investigation of the structure underlying DSM-IV symptoms. The Journal of Nervous and Mental Disease, 195(1), 1019. doi:10.1097/01.nmd.0000252025.12014.c4.CrossRefGoogle ScholarPubMed
Solomon, A., Haaga, D. A., & Arnow, B. A. (2001). Is clinical depression distinct from subthreshold depressive symptoms? A review of the continuity issue in depression research. The Journal of Nervous and Mental Disease, 189(8), 498506. doi:10.1097/00005053-200108000-00002.CrossRefGoogle ScholarPubMed
Spitzer, R. L., Kroenke, K., Williams, J. B. W., & Löwe, B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Archives of Internal Medicine, 166(10), 10921097. doi:10.1001/archinte.166.10.1092.CrossRefGoogle ScholarPubMed
Sullivan, P. F., Daly, M. J., & O'Donovan, M. (2012). Genetic architectures of psychiatric disorders: The emerging picture and its implications. Nature Reviews Genetics, 13(8), 537551. doi:10.1038/nrg3240.CrossRefGoogle ScholarPubMed
Sullivan, P. F., Neale, M. C., & Kendler, K. S. (2000). Genetic epidemiology of major depression: Review and meta-analysis. American Journal of Psychiatry, 157(10), 15521562. doi:10.1176/appi.ajp.157.10.1552.CrossRefGoogle ScholarPubMed
Thompson, B. (2004). Exploratory and confirmatory factor analysis: Understanding concepts and applications. Washington, DC: American Psychological Association. 10694.CrossRefGoogle Scholar
Thurstone, L. L. (1947). Multiple factor analysis; a development and expansion of The Vectors of Mind. Chicago: University of Chicago Press.Google Scholar
van der Sluis, S., Posthuma, D., Nivard, M. G., Verhage, M., & Dolan, C. V. (2013) Power in GWAS: Lifting the curse of the clinical cut-off. Molecular Psychiatry, 18(1), 23. doi:10.1038/mp.2012.65.CrossRefGoogle ScholarPubMed
Vares, E. A., Salum, G. A., Spanemberg, L., Caldieraro, M. A., & Fleck, M. P. (2015). Depression dimensions: Integrating clinical signs and symptoms from the perspectives of clinicians and patients. PLoS ONE, 10(8), e0136037. doi: 10.1371/journal.pone.0136037.CrossRefGoogle ScholarPubMed
Velicer, W. F. (1976). Determining the number of components from the matrix of partial correlations. Psychometrika, 41(3), 321327. doi:10.1007/BF02293557.CrossRefGoogle Scholar
Widaman, K. F., Ferrer, E., & Conger, R. D. (2010). Factorial invariance within longitudinal structural equation models: Measuring the same construct across time. Child Development Perspectives, 4(1), 1018. doi:10.1111/j.1750-8606.2009.00110.x.CrossRefGoogle ScholarPubMed
Wood, A. R., Esko, T., Yang, J., Vedantam, S., Pers, T. H., Gustafsson, S, … Frayling, T. M. (2014). Defining the role of common variation in the genomic and biological architecture of adult human height. Nature Genetics, 46(11), 11731186. doi:10.1038/ng.3097.CrossRefGoogle ScholarPubMed
World Health Organization. (2017). Depression and other common mental disorders: global health estimates. Retrieved from: https://apps.who.int/iris/bitstream/handle/10665/254610/WHO-MSD-MER-2017.2-eng.pdf.Google Scholar
World Health Organization. (2018). International classification of diseases for mortality and morbidity statistics (11th Revision). Retrieved from: https://icd.who.int/browse11/l-m/en.Google Scholar
Wray, N. R., Lee, S. H., Mehta, D., Vinkhuyzen, A. A. E., Dudbridge, F., & Middeldorp, C. M. (2014). Research review: Polygenic methods and their application to psychiatric traits. Journal of Child Psychology and Psychiatry, 55(10), 10681087. doi:10.1111/jcpp.12295.CrossRefGoogle ScholarPubMed
Wray, N. R., Ripke, S., Mattheisen, M., Trzaskowski, M., Byrne, E. M., Abdellaoui, A., … the Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium. (2018). Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nature Genetics, 50(5), 668. doi:10.1038/s41588-018-0090-3.CrossRefGoogle ScholarPubMed
Zimmerman, M., Ellison, W., Young, D., Chelminski, I., & Dalrymple, K. (2015). How many different ways do patients meet the diagnostic criteria for major depressive disorder? Comprehensive Psychiatry, 56, 2934. doi:10.1016/j.comppsych.2014.09.007.CrossRefGoogle ScholarPubMed
Figure 0

Fig. 1. Flow chart displaying the methodology of the study along with the sample sizes for each section.

Figure 1

Fig. 2. Factor model used to derive the dimensional phenotypes. As is customary in structural equation modelling graphs, circles are factors and squares are the self-reported symptoms. Shaded areas relate to either core MD symptoms or factors containing a majority of MD symptoms. Arrows pointing from either one factor to a symptom or a factor to another factor represent the factor loadings. *The items ‘General Happiness’ and ‘Meaningful Life’ have been reverse coded such that they explore ‘general unhappiness’ or ‘lack of meaning in one's life’. Subjective well-being, therefore, also corresponds to a ‘subjective lack of well-being’. Nomenclature has been retained for the brevity of the labelling.

Figure 2

Fig. 3. Association of MD PRS on each factor in the full sample and when stratified by MD Case/Control Status. The MD PRS used was calculated at the p value threshold of pT < 0.3. Both MD PRS and Factor scores were standardised to have a mean of 0 and variance of 1 using the full sample.

Figure 3

Table 1. Main and interaction effects of MD PRS and MD diagnostic status on an individual's standardised factor score

Figure 4

Table 2. Main effects of MD PRS and MD diagnostic status split into single-episode and recurrent cases

Figure 5

Table 3. Interaction effects of MD PRS and MD diagnostic status split into single-episode and recurrent cases

Supplementary material: File

Jermy et al. supplementary material

Jermy et al. supplementary material

Download Jermy et al. supplementary material(File)
File 18.6 MB