Introduction
Although the incidence of schizophrenia peaks in adolescence and early adulthood (Weinberger, Reference Weinberger1987; Häfner et al. Reference Häfner, Maurer, Löffler and Riecher-Rössler1993a ), the risk of psychosis extends well beyond this period, with sex seemingly modifying the age–incidence relationship (Angermeyer & Kuhn, Reference Angermeyer and Kuhn1988). To date, the best data suggest that onset in males peaks steeply in the late teens and early twenties, with rates remaining relatively constant at a lower level thereafter, whereas in females the initial peak is later, with a second smaller increase in incidence in middle age (Häfner et al. Reference Häfner, Maurer, Löffler and Riecher-Rössler1993a ) and possibly a third peak around 65 years of age (Häfner et al. Reference Häfner, Riecher, Maurer, Löffler, Munk-Jørgensen and Strömgren1989). Although the sex difference in age of onset for schizophrenia is a well-replicated finding (Häfner et al. Reference Häfner, Riecher, Maurer, Löffler, Munk-Jørgensen and Strömgren1989; Hambrecht et al. Reference Hambrecht, Maurer, Häfner and Sartorius1992; Jablensky et al. Reference Jablensky, Sartorius, Ernberg, Anker, Korten, Cooper, Day and Bertelsen1992), it attenuates when known risk factors such as family history (Esterberg et al. Reference Esterberg, Trotman, Holtzman, Compton and Walker2010), pregnancy and birth complications (Kirov et al. Reference Kirov, Jones, Harvey, Lewis, Toone, Rifkin, Sham and Murray1996), paternal age (Rosenfield et al. Reference Rosenfield, Kleinhaus, Opler, Perrin, Learned, Goetz, Stanford, Messinger, Harkavy-Friedman and Malaspina2010) and marital status (Jablensky & Cole, Reference Jablensky and Cole1997) are adjusted for in statistical analyses (Aleman et al. Reference Aleman, Kahn and Selten2003), and may even be absent in developing countries (Venkatesh et al. Reference Venkatesh, Thirthalli, Naveen, Kishorekumar, Arunachala, Venkatasubramanian, Subbakrishna and Gangadhar2008). Furthermore, the observed age-of-onset distribution is influenced by study methodology such as arbitrary age cut-offs (excluding patients with late onset disproportionately excludes female patients; Castle et al. Reference Castle, Wessely and Murray1993) and different diagnostic definitions (applying more stringent diagnostic classification systems excludes more young women; Castle et al. Reference Castle, Wessely and Murray1993). This makes cross-study comparisons difficult. Understanding the functional form of the age-of-onset curve can potentially provide important pointers for aetiological research. Therefore, we conducted a systematic review and collaborative recalculation of incidence of schizophrenia by age and sex, across the entire life course (as suggested by Jablensky, Reference Jablensky, Hirsch and Weinberger2003) using data extracted directly from published papers, raw individual-level data (centrally reorganized and combined) or rates supplied by the authors of incidence studies, published between 1950 and 2009.
Method
Search strategy and study selection
The design of the systematic review and meta-analysis was based on recommendations from the Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group (Stroup et al. Reference Stroup, Berlin, Morton, Olkin, Williamson, Rennie, Moher, Becker, Sipe and Thacker2000). Two independent investigators (M.W. and M.H.) searched the electronic bibliographic databases PubMed, Medline/EMBASE and PsycINFO for studies published between January 1950 and December 2009. The broad key words ‘epidemiology’ and ‘schizophrenia’ OR ‘psychosis’ were chosen to identify as many potentially relevant studies as possible. Additionally, to reduce the likelihood of missing published data, references from the electronically identified papers, review articles, book chapters and previously published systematic reviews of incidence studies were examined (Aleman et al. Reference Aleman, Kahn and Selten2003; McGrath et al. Reference McGrath, Saha, Welham, El Saadi, MacCauley and Chant2004) and forwards and backwards citation tracking was carried out using the Web of Science. Finally, experts in the field of psychosis epidemiology were contacted for missing publications and unpublished data. Studies were selected using the following inclusion criteria: (1) contained original incidence data for schizophrenia disorder defined by a standardized diagnostic classification system, (2) age- and sex-specific incidence data were present in the primary publication or in data available from the authors, (3) published between 1950 and 2009 and (4) written in English, Dutch or German. Studies with strict age cut-offs were excluded as they are likely to distort risk ratios by sex (for details see online Supplementary Table S2). Studies were included in the systematic review by consensus (M.W., M.H., J.A.).
Data extraction
Data were extracted systematically by one researcher (M.W.) and validated by investigators (M.H., S.K., J.A.). A database was compiled by independent catchment area-defined samples (rather than by published papers) and the numerator and population data (in person-years) were extracted by age, sex and study period. Age–incidence data were first extracted according to the consensus statement made by (Howard et al. Reference Howard, Rabins, Seeman and Jeste2000), using early (age <40 years: EOS), late (age 40–59 years: LOS) and very late onset (age ⩾60 years: VLOS) schizophrenia age groups. Data presented in different age formats were assigned to the appropriate age category. In the second approach, numerator and population data were extracted and recalculated centrally by 10-year age bands. Information about the source population and methodological characteristics were extracted at the sample level. Each sample was classified as rural, mixed or urban, based on combined information about population density estimates and the proportion of people living in urbanized areas (World Bank, 2010). Samples were further categorized into developing or developed countries according to the World Health Organization (WHO) Human Development Index of countries (United Nations, 2009). Methodological characteristics included (i) sampling frame (hospital admission or first contact with psychiatric service), (ii) case ascertainment (clinical, systematic or interview) and (iii) diagnostic classification system [DSM (APA, 1980, 1987, 1994); ICD (WHO, 1955, 1965, 1977, 1992); RDC (Spitzer et al. Reference Spitzer, Endicott and Robins1978) and others]. When multiple classification systems were used, ICD rates were used instead of DSM because ICD has used a consistently broad definition throughout recent revisions. Schizophreniform disorder was included when studies used DSM criteria. Schizo-affective disorder and paranoid disorders in the elderly (i.e. paraphrenia) were not included. Authors were contacted to ensure accuracy of data and to provide original data when insufficient information to calculate age- and sex-specific incidence rates was present in the published papers.
Statistical analyses
First, a descriptive synthesis of incidence rates was completed using information from all studies with data stratified by EOS, LOS and VLOS age groups (analysis 1). The distribution of incidence rates was presented by age, sex and female-to-male incidence rate ratio (IRRF–M) estimates showing mean, median (a better indicator of incidence in the case of non-normality) and 25–75% interquartile range (IQR) values (a better measure of dispersion without taking into consideration the more extreme values). Additionally, the distribution of incidence rates by population and methodological characteristics was explored.
Next, multivariable analyses were conducted on the subset of catchment area samples with numerator and dominator stratified by sex and 10-year age bands (analysis 2), to allow estimation of adjusted age–sex associations. As Poisson regression models showed significant level-1 overdispersion (which can cause standard errors to be biased downwards), a series of random intercept negative binomial regression models (NBRMs) were fitted to estimate IRRs, with the natural log of the raw incidence rates as the dependent variable and age group and pertinent sample and methodological characteristics (sampling frame, case ascertainment and diagnostic classification system) as independent variables. Non-linear age by sex associations were modelled by adding an interaction term. Robust standard errors were calculated using the sandwich estimator.
Studies were not formally weighted by sample size, as is common practice in pooling of randomized controlled trials. In observational studies, however, smaller studies often use more thorough case ascertainment and case verification whereas larger studies are often drawn from national databases with less rigorous case verification. Alternatively, the IRR estimates were adjusted for study sample size in the NBRM and the effect of sample size was further evaluated by repeating the analysis on a subset of larger samples (>100 cases).
Finally, to examine potential bias from studies using arbitrary age criteria, the analyses were repeated on samples from studies with no age restriction. Sampling bias due to data variability across the different selection stages was explored by contrasting sample characteristics of included and excluded studies. All statistical analyses were conducted using Stata version 11 (StataCorp, 2009).
Results
The initial electronic search identified 11 307 citations; further appraisal of the title and abstract using the inclusion criteria resulted in 329 full articles being reviewed. Ninety-one studies derived from 44 non-overlapping samples met inclusion criteria for the initial analyses. One study (Bijl et al. Reference Bijl, de Graaf, Ravelli, Smit and Vollebergh2002) contributed significantly to methodological heterogeneity because of a study design incomparability and it was therefore excluded, leaving a set of 43 independent incidence samples for further analyses. The characteristics of the included studies (organized by sample) are presented in online Supplementary Table S1. Excluded articles and their justification for exclusion are presented as a supplementary list (online Supplementary Table S2). Included studies and those excluded due to insufficient data were similar with regard to sample characteristics, total number of cases and population size (data available upon request to the authors).
Descriptive synthesis (analysis 1)
The first set of analyses was based on all 43 samples, providing 1021 effect sizes and 133 693 cases of schizophrenia. The pooled median incidence rate for schizophrenia was 18.3 per 10 0000 person-years, with an almost threefold difference between catchment area samples from the lowest and highest reported rates (IQR 10.9–28.9). Incidence declined strongly with increasing age (Table 1). The overall incidence was higher in men than in women. However, the male preponderance in the EOS group (IRRF–M 0.55, IQR 0.45–0.74) was absent in the LOS group (IRRF–M 1.16, IQR 0.93–1.63), whereas in the VLOS group there was a higher relative risk for females (IRRF–M 1.55, IQR 0.92–2.19). The incidence by sample characteristics over the EOS, LOS and VLOS groups is also presented in Table 1. Rates tended to be lower in samples based on diagnostic interview methods and first contacts, and in samples conducted after 1990, in rural areas and in developing countries.
Table 1. Distribution of incidence rates stratified by sex, age and sample characteristics a
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170128135735-74602-mediumThumb-S0033291712002796_tab1.jpg?pub-status=live)
EOS, Early-onset schizophrenia (age <40 years); LOS, late-onset schizophrenia (age 40–59 years: LOS); VLOS, very-late-onset schizophrenia (age ⩾60 years); IQR, interquartile range; n.a., not applicable.
a All rates are presented as number of cases per 10 000 person-years.
Systematic recalculation (analysis 2)
The second set of analyses was based on the subset of 33 non-overlapping samples with data stratified by sex and 10-year age bands, providing 604 effect sizes and 63 550 incident cases of schizophrenia.
In the fully adjusted model, younger age (IRR 1.25, 95% CI 1.17–1.34), male sex (IRR 1.15, 95% CI 1.00–1.31) and urbanicity (IRR 1.75, 95% CI 1.13–2.71) were associated with higher incidence rates whereas time period and methodological characteristics were not. Figure 1 shows the crude age- and sex-specific incidence rates. A negative interaction between sex and age was apparent in the crude (χ2 = 76.32, p < 0.001) and fully adjusted model (χ2 = 57.90, p < 0.001). In the fully adjusted model there was a significantly stronger overall age–incidence decline in males (IRR 0.71, 95% CI 0.66–0.77) compared to females (IRR 0.89, 95% CI 0.83–0.96). Table 2 presents the unadjusted and adjusted age IRRs by sex, using the youngest age group as the reference (<20 years). Male incidence peaked at age 20–29 years (IRR 2.61, 95% CI 1.74–3.92). The incidence peak in women was less sharp and distributed over age bands 20–29 (IRR 2.34, 95% CI 1.66–3.28) and 30–39 years (IRR 2.25, 95% CI 1.55–3.28). After the initial peak, incidence rates declined continuously, levelling off to a non-significant risk difference by 60 years of age. Finally, the relative risk of schizophrenia remained higher in males up to age 39, was equivalent for males and females between ages 40 and 49, and then reversed, with females having a higher risk in the 50–70-year age groups (Table 2).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170128135735-04805-mediumThumb-S0033291712002796_fig1g.jpg?pub-status=live)
Fig. 1. Estimated age- and sex-specific median incidence rates (per 10 000 person-years): ––▪––, women; – –◆– –, men.
Table 2. Age- and sex-specific incidence rates of schizophrenia (per 10 000 person-years)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170128135735-40359-mediumThumb-S0033291712002796_tab2.jpg?pub-status=live)
IRR, Incidence rate ratio; CI, confidence interval.
a Adjusted for study period (in 10-year groups: <1960, 1960–1969, 1970–1979, 1980–1989, 1990–1999 and ⩾2000), population (study sample size) and methodological characteristics (sampling frame, case ascertainment and diagnostic classification system).
b IRR <1 indicates higher relative risk in men and IRR >1 indicates higher relative risk in women.
Restricting the analyses to samples from studies that had no age restriction criteria (n = 20) or a minimum sample size of 100 cases (n = 22) yielded similar patterns of risk by age and sex to the analyses using all the data. Finally, the samples used in analysis 2 had similar incidence rates by age and sex as those used for analysis 1, suggesting that sampling bias due to data availability is unlikely to have occurred (results available upon request).
Discussion
This systematic review and recalculation of primary incidence data demonstrates substantial stratification in the risk of schizophrenia by age and sex that is not explained by confounding due to known underlying differences in the population structure or characteristics of the study samples. As expected, the risk of schizophrenia rose steeply from (pre-)adolescence, reaching a peak in early adulthood. This initial increase in incidence was followed by a gradual decline to age 60 years, after which the incidence rates levelled off.
Marked sex differences in morbid risk were apparent over the life course, with lifetime risk 1.15 times higher in men than in women, a figure slightly lower than those reported previously in a systematic review (McGrath et al. Reference McGrath, Saha, Welham, El Saadi, MacCauley and Chant2004) and a meta-analysis (Aleman et al. Reference Aleman, Kahn and Selten2003). The initial high peak in men was followed by a dramatic drop in incidence rates around age 30. Women, on the contrary, showed a lower, broader incidence curve followed by a slower decline in risk over the age span. The slower decline of incidence rates for schizophrenia in women with increasing age resulted in a small but significant female excess of new cases in middle and old age. Finally, the study did not find a discrete second incidence peak in middle-aged women.
Incidence of schizophrenia over the age span
The results are in line with previous work suggesting that schizophrenia is principally a disorder of early adulthood but does present throughout the lifespan, with males having an earlier age of onset (on average) and slightly higher lifetime morbid risk compared to females. Several plausible hypotheses have been developed to explain the age–sex association related to, for example, prenatal development and differential exposure to gonadal hormones in men (Murray et al. Reference Murray, O'Callaghan, Castle and Lewis1992; Raz et al. Reference Raz, Goldstein, Hopkins, Lauterbach, Shah and Porter1994; Seeman, Reference Seeman1996). Similarly, the prevalence of pregnancy and birth complications, which are recognized candidate risk factors for schizophrenia, may be more common among males than females (Clarke & O'Callaghan, Reference Clarke, O'Callaghan, Murray, Jones, Susser, van Os and Cannon2003). In women, the onset of schizophrenia may be delayed by the neuroprotective effects of oestrogen, which is postulated to have an antipsychotic effect during adolescence (Häfner et al. Reference Häfner, Riecher-Rössler, an der Heiden, Maurer, Fätkenheuer and Löffler1993b ; Seeman, Reference Seeman1996).
These analyses failed to show a discrete second peak in middle-aged women (Häfner et al. Reference Häfner, Riecher-Rössler, an der Heiden, Maurer, Fätkenheuer and Löffler1993b ; Castle et al. Reference Castle, Abel, Takei and Murray1995). It is possible that a brief peri-menopausal spike in incidence was not found as a consequence of the relatively broad 10-year age bands used in the present statistical models. However, post-hoc analyses on the subset of samples for whom 5-year age bands were available showed no second peak in middle-aged women.
The age-of-onset profile demonstrated in this study is consistent with complex causal pathways involving additive and interactive psychosocial, neural or hormonal protective factors (or lack of risk factors) in the context of pathoplastic normal or pathological age-related processes that impact on a milder or absent neurodevelopmental vulnerability (Vahia et al. Reference Vahia, Palmer, Depp, Fellows, Golshan, Kraemer and Jeste2010). Culturally and socially determined sex differences such as greater levels of social integration, better pre-morbid functioning and greater likelihood of marriage in women may act as protective factors, delaying the age at onset (Jablensky & Cole, Reference Jablensky and Cole1997; Morgan et al. Reference Morgan, Castle and Jablensky2008).
Risk for schizophrenia and the environment
Although not the primary aim of this meta-analysis, the risk of schizophrenia was lower in rural areas than in urban conurbations. This is a replicated epidemiological finding in Western-based studies; however, the explanation for this urban excess remains elusive (Kelly et al. Reference Kelly, O'Callaghan, Waddington, Feeney, Browne, Scully, Clarke, Quinn, McTigue, Morgan, Kinsella and Larkin2010) and data were not available in this study to investigate this further.
Methodological considerations
Through strong collaborative working we have had comprehensive access to international data from published incidence studies, allowing the systematic recalculation of age- and sex-specific rates for schizophrenia using raw primary data. The review process followed the MOOSE guidelines closely, ensuring a thorough search for incidence data and a standardized data extraction process. This study principally used individual participant-level data provided by primary researchers that were then used to centrally generate rate-level data using the same age strata across all samples. This approach has many advantages over the formal aggregate meta-analysis approach (Riley et al. Reference Riley, Lambert and Abo-Zaid2010). For a minority of catchment area samples, this was unnecessary as the appropriate age- and sex-specific rates had been published previously or were available from the original investigators.
Although collaborative recalculation of primary data is a strong methodology, the results should be interpreted in light of several design limitations. First, different studies used different definitions for age of onset. Age at first hospitalization is often criticized as proxy of true age of psychosis onset because significant between-person variability in duration of untreated psychosis may cause differences in first contact with mental health services. However, the latter have repeatedly been found to be strongly correlated (DeLisi, Reference DeLisi1992; Häfner et al. Reference Häfner, Maurer, Löffler and Riecher-Rössler1993a ; Walker & Lewine, Reference Walker and Lewine1993). As such, age at first hospitalization has been shown to represent a valid index in estimating age-at-onset effects. Second, although individual-level data were available for age, sex and study period, this was not the case for other characteristics such as level of urbanicity or social fragmentation; therefore we did not explore additional sources of heterogeneity in the age and sex distribution of schizophrenia as this would be vulnerable to cross-level inferential bias. Third, no formal rating for study design was carried out, which may have contributed to a better understanding of methodological variation. However, although some reliable checklists and scales have been developed to assess the quality of trial data (Shea et al. Reference Shea, Dube, Moher, Egger, Smith and Altman2007), defining quality for observational studies is psychometrically challenging and often highly subjective (Stroup et al. Reference Stroup, Berlin, Morton, Olkin, Williamson, Rennie, Moher, Becker, Sipe and Thacker2000). Fourth, estimations of risk for schizophrenia in the elderly are probably underestimations of true risk because of a seeming reluctance in clinicians to diagnose schizophrenia in the elderly (Harris & Jeste, Reference Harris and Jeste1988). In addition, many of the included studies excluded subjects older than 65 years, and in ICD-9, schizophrenia and schizophrenia-like illness with late onset may have been diagnosed as ‘paranoid psychosis’ in some countries. Hence, the 1.17-fold greater risk of schizophrenia in men compared to women found in this analysis should be interpreted in light of this, and may represent an overestimation. Limited data are available from developing countries. Only nine (of which four were included and five excluded because of limited data availability) samples were collected in developing countries. The findings, therefore, may not be generalizable to developing countries, particularly because sex differences may show a different distribution by age in these countries (Gangadhar et al. Reference Gangadhar, Selvan, Subbakrishna and Janakiramaiah2002). Finally, the number of studies that had to be excluded was relatively large (see online Supplementary Table S2), including a total of 74 studies that might have contained useful data for this meta-analysis. Although this could have affected the results and conclusions slightly, we consider that the calculations were based on sufficiently large numbers to guarantee robustness of the results. Furthermore, although not all published data were available in sufficient detail to be included in the central recalculation, studies that were excluded showed similar study and population characteristics, making systematic bias less likely.
Conclusions
The establishment of age- and sex-specific incidence rates for schizophrenia is of great importance for mental health service development; detailing the timing of schizophrenia onset reveals a window of opportunity for possible early detection and preventive strategies. The results further highlight the fact that a small, but not negligible, proportion of individuals develop schizophrenia after age 60 years. These numbers are likely to rise significantly, with the first of the post-war ‘baby-boom’ generation reaching 65 years of age by the end of 2010 (Jeste et al. Reference Jeste, Alexopoulos, Bartels, Cummings, Gallo, Gottlieb, Halpain, Palmer, Patterson, Reynolds and Lebowitz1999). The concept of prevention should therefore not only be restricted to adolescents but also be applied to middle-aged and elderly persons (Lebowitz & Pearson, Reference Lebowitz and Pearson2000).
Appendix. Research Initiative into Schizophrenia Epidemiology (RISE) Investigators
M. van der Werf, S. Köhler, J. Allardyce, J. van Os, M. Hanssen, F. Verhey, I. Lynge, J.-P. Selten, J.-D. Blom, N. Veen, C. Verheyde-Zeyl, J. Geddes, M. Goldacre, D. G. R. Yeates, N. Nixon, V. Adjacic, W. Rössler, P. Baldwin, J. Waddington, J. Kirkbride, C. Morgan, P. Dazzan, R. Murray, P. B. Jones, G. Doody, G. Harrison, J. Leff, J. W. Coid, P. Fearon, J. E. Andersen, T. Hynnekleiv, R. McCreadie, H. Gilmour, P. Menezes, G. Smith, W. Iacono, A. Thorup, C. Pedersen, H. Häfner, W. an der Heiden, F. Hickling, J. Peen, J. Dekker, R. de Graaf, W. Veling, W. Hoek, M. Bogren, J. Boydell, D. Castle, E. Mitford, J. Brewin, M. Goldacre, N. Takei and F. Ammaddeo.
Supplementary material
For supplementary material accompanying this paper visit http://dx.doi.org/10.1017/S0033291712002796.
Declaration of Interest
Jim van Os has received unrestricted investigator-led research grants or recompense for presenting his research from Eli Lilly, Bristol-Myers Squibb, Lundbeck, Organon, Janssen-Cilag, GlaxoSmithKline, AstraZeneca, Pfizer and Servier, companies that have an interest in the treatment of psychosis.