Diagnostic alterations for post-traumatic stress disorder: examining data from the National Comorbidity Survey Replication and National Survey of Adolescents

Jon D. Elhai; Julian D. Ford; Kenneth J. Ruggiero; B. Christopher Frueh

doi:10.1017/S0033291709005819

Published online by Cambridge University Press: 20 April 2009

Jon D. Elhai*: University of South Dakota, Vermillion, SD, USA
Julian D. Ford: University of Connecticut Medical School, Farmington, CT, USA
Kenneth J. Ruggiero: Medical University of South Carolina, Charleston, SC, USA
B. Christopher Frueh: Baylor College of Medicine and the Menninger Clinic, Houston, TX, USA
*Address for correspondence: J. D. Elhai, Ph.D., Disaster Mental Health Institute, The University of South Dakota, 414 East Clark Street – SDU 114, Vermillion, South Dakota 57069-2390, USA. (Email: jonelhai@gmail.com)

Abstract

Background

Two alternative models of post-traumatic stress disorder (PTSD) appear to represent the disorder's latent structure better than the traditional Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV) three-factor PTSD model. The present study examines the impact of using these structural models for the diagnosis of lifetime PTSD while retaining the DSM-IV PTSD's six-symptom diagnostic requirement.

Method

Data were gathered from large-scale, epidemiological datasets collected with adults (National Comorbidity Survey Replication) and adolescents (National Survey of Adolescents). Two alternative, empirically supported four-factor models of PTSD were compared with the DSM-IV three-factor PTSD diagnostic model.

Results

Results indicated that the diagnostic alterations resulted in substantially improved structural validity, downward adjustments of PTSD's lifetime prevalence (roughly 1 percentage point decreases in adults, 1–2.5 percentage point decreases in adolescents), and equivalent psychiatric co-morbidity and sociodemographic associations.

Conclusions

Implications for modifying PTSD diagnostic criteria in future editions of DSM are discussed.

Keywords

Construct validity; diagnosis; DSM; epidemiology; post-traumatic stress disorder; trauma

Type: Original Articles
Information: Psychological Medicine, Volume 39, Issue 12, December 2009, pp. 1957–1966

DOI: https://doi.org/10.1017/S0033291709005819
Copyright: Copyright © Cambridge University Press 2009

Introduction

The diagnostic model of post-traumatic stress disorder (PTSD) currently includes three symptom clusters, labeled ‘re-experiencing’ (criterion B), ‘avoidance and numbing’ (criterion C) and ‘hyperarousal’ (criterion D) (APA, 2001). However, a substantial body of empirical research has demonstrated that this model does not adequately explain the structure of the PTSD construct (Asmundson et al. 2004; Frueh et al. 2004). Furthermore, alternative models have not been tested for their convergence in diagnosing Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV) PTSD.

Numerous recent studies have used confirmatory factor analysis (CFA) to test specific, theoretically driven models of PTSD and to identify the hypothesized model(s) that best fit, or account for, respondents' intercorrelation patterns (for a review, see Asmundson et al. 2004). This literature, with various trauma-exposed samples, has consistently found that the traditional three-factor DSM-IV PTSD model does not adequately fit observed data (e.g. DuHamel et al. 2004; McWilliams et al. 2005; Naifeh et al. 2008) and that other models fit better (e.g. Elhai et al. 2007, 2008; Elkit & Shevlin, 2007; Palmieri et al. 2007b; Naifeh et al. 2008; Saul et al. 2008).

Two models in particular have garnered the most empirical support for explaining PTSD's factor structure. Although other models have been investigated, the two models highlighted here consistently statistically outperform them (most recently in Elkit & Shevlin, 2007; Krause et al. 2007; Naifeh et al. 2008; Palmieri et al. 2007a, b; Saul et al. 2008).

The revised DSM-IV PTSD model of King et al. (1998) separates criterion C's effortful avoidance and emotional numbing symptoms into separate factors, resulting in re-experiencing (B1–B5), effortful avoidance (C1–C2), emotional numbing (C3–C7) and hyperarousal (D1–D5). This model reflects research demonstrating that avoidance and numbing are statistically distinct constructs, revealing divergent patterns of correlates with psychopathology, and different prognoses and treatment effects (for a review, see Asmundson et al. 2004). Numerous CFA studies have supported the King model with adults (most recently in Elhai et al. 2007, 2008; Palmieri et al. 2007a; Schinka et al. 2007) and adolescents (Saul et al. 2008).

The model of Simms et al. (2002) retains the King model's re-experiencing (B1–B5) and effortful avoidance (C1–C2) factors. However, three PTSD hyperarousal symptoms – sleep difficulty (D1), irritability (D2) and concentration problems (D3) – are moved from the hyperarousal factor and combined with the emotional numbing factor's symptoms (C3–C7) to form a ‘dysphoria’ factor. Thus, the resulting model includes re-experiencing (B1–B5), effortful avoidance (C1–C2), dysphoria (C3–C7 and D1–D3) and hyperarousal (D4–D5). The Simms model reflects empirical work demonstrating that general distress/dysphoria is an underlying component of mood and anxiety disorders (including PTSD) (e.g. Watson, 2005; Slade & Watson, 2006), and that this construct is distinct from the hyperarousal associated specifically with post-traumatic reactions (Simms et al. 2002). Several recent studies have supported this model over other similar models with adults (Elkit & Shevlin, 2007; Krause et al. 2007; Palmieri et al. 2007b), but it has not been examined with adolescents.
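The symptom-to-factor assignments of the three models described above can be written out explicitly. The following is a minimal sketch, not taken from the original article; the helper `codes` and the dictionary `MODELS` are illustrative names, and the symptom codes simply follow the DSM-IV criterion labels used in the text.

```python
# Symptom-to-factor assignments for the DSM-IV, King and Simms models.
# Codes follow DSM-IV criteria B (5 items), C (7 items) and D (5 items).
# `codes` and `MODELS` are illustrative names, not from the article.

def codes(letter, first, last):
    """Return symptom codes such as ['B1', ..., 'B5'] for an inclusive range."""
    return [f"{letter}{i}" for i in range(first, last + 1)]

MODELS = {
    "DSM-IV": {
        "re-experiencing": codes("B", 1, 5),
        "avoidance/numbing": codes("C", 1, 7),
        "hyperarousal": codes("D", 1, 5),
    },
    "King": {
        "re-experiencing": codes("B", 1, 5),
        "effortful avoidance": codes("C", 1, 2),
        "emotional numbing": codes("C", 3, 7),
        "hyperarousal": codes("D", 1, 5),
    },
    "Simms": {
        "re-experiencing": codes("B", 1, 5),
        "effortful avoidance": codes("C", 1, 2),
        "dysphoria": codes("C", 3, 7) + codes("D", 1, 3),
        "hyperarousal": codes("D", 4, 5),
    },
}

for name, factors in MODELS.items():
    sizes = {factor: len(items) for factor, items in factors.items()}
    print(name, sizes, "total symptoms:", sum(sizes.values()))
```

Each model partitions the same 17 symptoms; only the grouping changes.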

At present, the King and Simms models appear to be the top candidates for best explaining PTSD's symptom structure. However, no studies have tested the practical impact of altering PTSD's diagnosis based on the alternative King or Simms models. And because the PTSD factor analytic literature illuminates how symptoms empirically cluster together, drawing from that literature can guide the psychiatry field toward improved PTSD diagnostic models.

The present investigation, using large-scale, epidemiological data, examines the impact of altering the PTSD diagnosis based on these models, testing differences in PTSD's prevalence, co-morbidity and sociodemographics, and factor structure. We explored our hypotheses in samples of both adolescents and adults, to test our research questions across two separate age groups. We hypothesized that the diagnostic alterations would result in the following: (1) a downward adjustment of PTSD's prevalence, due to more conservatively stringent PTSD diagnostic criteria; (2) no substantial decreases in co-morbidity rates, because recent research found that even removing PTSD symptoms that overlap with highly co-morbid mood/anxiety disorders did not affect PTSD's co-morbidity (Elhai et al. 2008); and (3) significant improvements in PTSD's factor structure, based on the CFA findings for PTSD summarized above. We retained the required number of six symptoms in altering the diagnosis, in order to reliably compare results with the same six-symptom requirement currently used in the DSM-IV PTSD diagnosis.

This study is important and timely since the PTSD construct has received criticism recently for a range of conceptual issues pertaining to symptom structure and symptom overlap with other mental disorders (Spitzer et al. 2007; Rosen & Lilienfield, 2008; Rosen et al. 2008). Also, decreasing PTSD's symptom overlap with mood and anxiety disorders would ensure that the PTSD construct is unique as a diagnostic entity. Further, conceptual issues in the mental disorders (including PTSD) are currently being considered for DSM-V, and investigations such as this one potentially speak to the issue of whether and how to revise the disorder's diagnostic criteria.

Study 1: method

Sample

Study 1 used archival data from the National Comorbidity Survey Replication (NCS-R) (Kessler, 2006). The NCS-R was a nationally stratified, multistage area household probability sample study of non-institutionalized adults (aged 18 years and older). The NCS-R was conducted with 9282 participants in the early 2000s (NCS-R part I), with demographic characteristics presented in previous NCS-R reports (Kessler et al. 2004). The present paper reports on the representative subsample of participants completing the NCS-R part II (which evaluated PTSD; n=5692).

Instruments

The World Mental Health Survey Initiative version of the structured Composite International Diagnostic Interview (CIDI; Kessler & Ustun, 2004) was used to diagnose DSM-IV mental disorders; the CIDI evidences adequate convergence with other similar measures (Andrews & Peters, 1998; Haro et al. 2006). In addition to the lifetime DSM-IV PTSD diagnostic variable, we also examined lifetime diagnostic variables for mood, anxiety and substance-use disorders. DSM-IV diagnostic algorithms were used to assign diagnoses, discussed elsewhere (Kessler et al. 2005). We exclusively used the NCS-R's ‘non-hierarchy’ diagnoses (i.e. allowing a particular diagnosis to be assigned even if it occurred solely in the presence of another disorder).

Of relevance to PTSD, participants were first asked in behaviorally specific terms about previous exposure to a variety of traumatic events meeting DSM-IV's PTSD stressor criterion (A1). Only those participants endorsing a traumatic event with initial fear, helplessness or horror (criterion A2) were subsequently queried about DSM-IV PTSD symptoms. PTSD symptom queries involved binary (‘yes’/‘no’) lifetime symptom ratings about one's trauma. For those endorsing more than one trauma, the most upsetting occurrence of their most upsetting traumatic event type was used. Individuals whose most upsetting trauma occurrence differed from a trauma randomly selected by NCS-R investigators were instructed to rate their PTSD symptoms separately for each event; PTSD diagnoses were then assigned based on meeting PTSD criteria for either of these two events. Finally, PTSD's criteria E (duration) and F (functional impairment) were queried. Skip-out rules were implemented, such that if a participant did not meet a particular PTSD symptom criterion, s/he was not subsequently queried about remaining PTSD criteria (discussed further below).

Analyses

NCS-R part II sampling weights were used for all analyses in study 1, to adjust for differential household size, non-response and post-stratification. We used Stata 9.0 software (StataCorp LP, College Station, TX, USA) to examine whether the King and Simms models yielded differences from the DSM-IV PTSD model in prevalence of PTSD, diagnostic status, diagnostic co-morbidity and sociodemographic associations. We used Mplus 5.1 software (Muthén & Muthén, 1998–2007) to examine the impact of using these models on PTSD's structural validity.

We calculated the lifetime DSM-IV PTSD diagnosis, and King and Simms alterations using the NCS-R's PTSD module item data, retaining DSM-IV PTSD's six-symptom diagnostic requirement. For the PTSD diagnosis (and alterations), we required that the respondent endorsed at least one traumatic event meeting criteria A1 and A2, and satisfied criteria E and F. For criteria B through D in the traditional DSM-IV PTSD diagnosis, we required at least one re-experiencing symptom (B1–B5), at least three avoidance/numbing symptoms (C1–C7) and at least two hyperarousal symptoms (D1–D5) (as required in DSM-IV). To remain consistent with the six-item diagnostic requirement in DSM-IV, for the King PTSD diagnostic alteration we retained the requirements for at least one re-experiencing symptom (B1–B5) and at least two hyperarousal symptoms (D1–D5); in addition, we required at least one avoidance symptom (C1–C2) and at least two numbing symptoms (C3–C7) (thus for the similar DSM-IV PTSD minimum total of three avoidance/numbing symptoms). For the Simms PTSD diagnostic alteration, we retained the requirement for at least one re-experiencing symptom (B1–B5) and the King model's requirement of at least one avoidance symptom (C1–C2); with only two possible hyperarousal symptoms in this model, we required at least one hyperarousal symptom (D4–D5), leaving a requirement of at least three dysphoria symptoms (C3–C7, D1–D3) in order to yield a total of six required PTSD symptoms.
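The three symptom-count rules described above can be sketched as a small decision function. This is an illustrative reconstruction from the text, not the authors' Stata code; the names `RULES` and `meets_diagnosis` are hypothetical, and the respondent profiles in the usage lines are made up for demonstration.

```python
# Minimum endorsed symptoms per cluster for each diagnostic rule, as described
# in the text. `RULES` and `meets_diagnosis` are hypothetical names.

def codes(letter, first, last):
    """Return a set of symptom codes, e.g. {'C1', 'C2'}, for an inclusive range."""
    return {f"{letter}{i}" for i in range(first, last + 1)}

RULES = {
    "DSM-IV": [(codes("B", 1, 5), 1), (codes("C", 1, 7), 3), (codes("D", 1, 5), 2)],
    "King": [(codes("B", 1, 5), 1), (codes("C", 1, 2), 1),
             (codes("C", 3, 7), 2), (codes("D", 1, 5), 2)],
    "Simms": [(codes("B", 1, 5), 1), (codes("C", 1, 2), 1),
              (codes("C", 3, 7) | codes("D", 1, 3), 3), (codes("D", 4, 5), 1)],
}

def meets_diagnosis(model, endorsed, criterion_a=True, criteria_e_f=True):
    """True if criteria A (A1 and A2), E and F hold and every cluster minimum is met."""
    if not (criterion_a and criteria_e_f):
        return False
    return all(len(cluster & set(endorsed)) >= minimum
               for cluster, minimum in RULES[model])

# Made-up respondent: no effortful avoidance symptom (C1-C2) endorsed.
profile = {"B1", "C3", "C4", "C5", "D1", "D2"}
for model in RULES:
    print(model, meets_diagnosis(model, profile))
```

Every rule sums to a minimum of six symptoms, but the King and Simms rules constrain which six, which is why a respondent can meet the DSM-IV rule and still fail an alteration.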

For structural validity analyses, we only used data from participants who (in addition to endorsing PTSD's criterion A1) had endorsed criterion A2 in reference to an index trauma on which PTSD ratings were made (n=871). For participants with multiple sets of PTSD ratings, we used ratings from their most upsetting event. Because of skip-out rules, we further excluded 105 subjects who were missing more than four (24%) of PTSD's symptom items, leaving 766 participants; the remaining participants represented a slightly skewed sample in that they met at least one PTSD symptom cluster (only 42 subjects were skipped out of two symptom clusters). Additional missing item-level data (typically, one or two items each, by 6% of subjects) were estimated to preserve the sample size, using maximum likelihood (ML) estimation of missing data (for a review, see Schafer & Graham, 2002) for categorical outcomes (Muthén & Muthén, 1998–2007, p. 401).

Since the observed dependent variables were binary (‘yes’/‘no’) ratings, we implemented robust (mean- and variance-adjusted) weighted least squares (WLS) estimation for the CFAs, using polychoric (rather than Pearson) correlations and probit (rather than linear) regression coefficients (Flora & Curran, 2004; Wirth & Edwards, 2007). In fitting the CFAs, we estimated intercorrelations among the common factors but did not allow intercorrelations among the residual error variances. χ² tests of model fit were examined in conjunction with goodness-of-fit indices, including the Tucker–Lewis index (TLI), comparative fit index (CFI) and root mean square error of approximation (RMSEA) (interpreted as follows: RMSEA <0.06 for an excellent fit and 0.06–0.08 for an adequate fit; CFI/TLI ⩾0.95 for an excellent fit and 0.90–0.94 for an adequate fit) (Hu & Bentler, 1998, 1999). Differences between the DSM-IV and King PTSD models were tested using a χ² difference test (albeit with a correction factor since a robust χ² statistic was used; Muthén & Muthén, 2006). The Simms model is not nested within the DSM-IV or King PTSD models (nor they within it), and thus χ² difference testing is not appropriate for comparisons involving the Simms model; instead, differences in Bayesian information criterion (BIC) values were examined. BIC values, only estimable using ML (but not WLS) estimation, were generated using ML-estimated CFAs, with logistic rather than probit coefficients [more appropriate using ML (Muthén, 1984) albeit without using sampling weights which cannot be currently implemented with ML for categorical outcomes]. When a model has a BIC value 10 points lower than another model, the odds are 150:1 that the model with the smaller BIC value is the better-fitting model (Raftery, 1995).
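The fit-index cut-offs cited above can be expressed as two small rating functions. This is a sketch of the interpretive thresholds only (it does not estimate a CFA); the function names, the label "below adequate" for values outside the cited ranges, and the example index values are assumptions made for illustration.

```python
# Rating helpers for the cut-offs cited in the text (Hu & Bentler).
# Function names and the "below adequate" label are illustrative assumptions.

def rate_rmsea(value):
    """RMSEA: <0.06 excellent; 0.06 to 0.08 adequate."""
    if value < 0.06:
        return "excellent"
    if value <= 0.08:
        return "adequate"
    return "below adequate"

def rate_cfi_tli(value):
    """CFI/TLI: >=0.95 excellent; 0.90 to 0.94 adequate."""
    if value >= 0.95:
        return "excellent"
    if value >= 0.90:
        return "adequate"
    return "below adequate"

# Hypothetical index values, not results from the article.
example = {"TLI": 0.93, "CFI": 0.96, "RMSEA": 0.07}
for name, value in example.items():
    rate = rate_rmsea if name == "RMSEA" else rate_cfi_tli
    print(f"{name}={value:.2f}: {rate(value)}")
```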

Study 1: results

Prevalence

Table 1 illustrates PTSD prevalence rates across the DSM-IV, King and Simms models, with binomial approximation z tests presented to statistically compare proportions (Hays, 1994). The King and Simms alterations each reduced DSM-IV PTSD's prevalence by nearly one percentage point.

Table 1. PTSD prevalence across the DSM-IV, King and Simms PTSD diagnostic models

PTSD, Post-traumatic stress disorder; DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition; s.e., standard error; NCS-R, National Comorbidity Survey Replication; NSA, National Survey of Adolescents.

^a Significantly different prevalence from the NCS-R DSM-IV model (binomial approximation z=2.55, s.e.=0.00, p=0.02).

^b Significantly different prevalence from the NCS-R DSM-IV model (binomial approximation z=2.88, s.e.=0.00, p<0.01).

^c Not significantly different prevalence from the NCS-R King model (binomial approximation z=0.35, s.e.=0.00, p>0.05).

^d Significantly different prevalence from the NSA DSM-IV model (binomial approximation z=2.99, s.e.=0.00, p<0.01).

^e Significantly different prevalence from the NSA DSM-IV model (binomial approximation z=5.95, s.e.=0.00, p<0.001).

^f Significantly different prevalence from the NSA King model (binomial approximation z=3.21, s.e.=0.00, p=0.002).
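The binomial approximation z tests reported in the notes to Table 1 compare two prevalence proportions. The sketch below uses the standard pooled two-proportion z test as a stand-in; the exact formula the authors applied (following Hays) is not given in the text, so this choice is an assumption, and the prevalence values in the usage lines are hypothetical placeholders rather than the Table 1 figures.

```python
import math

def two_proportion_z(p1, p2, n1, n2):
    """Pooled two-proportion z statistic and its standard error.

    Assumes independent samples; this is a generic textbook form, not
    necessarily the exact computation used in the article.
    """
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se, se

def two_sided_p(z):
    """Two-sided p-value from the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2))

# Hypothetical prevalence estimates (not taken from Table 1).
z, se = two_proportion_z(p1=0.070, p2=0.060, n1=5692, n2=5692)
print(f"z={z:.2f}, s.e.={se:.4f}, p={two_sided_p(z):.3f}")
```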

Table 2 demonstrates changes in DSM-IV PTSD status with the King diagnostic alteration. The level of agreement between these diagnostic systems could be considered ‘almost perfect’ (based on criteria set forth by Landis & Koch, 1977) [κ=0.92, standard error (s.e.)=0.01 (98.63% agreement), z=69.79, p<0.001]. Only 0.84% of the entire sample (48 weighted subjects out of 5692 subjects) changed diagnostic status for PTSD when using the King criteria.

Table 2. Changes in PTSD status between DSM-IV diagnostic system and King and Simms diagnostic alterations

PTSD, Post-traumatic stress disorder; DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition.

^a Weighted numbers, rounded to the nearest whole number.

The level of agreement between the DSM-IV and Simms models was ‘almost perfect’ (Landis & Koch, 1977) [κ=0.84, s.e.=0.01 (97.24% agreement), z=63.82, p<0.001]. In combination, approximately 1.82% of the entire sample (79+25 weighted subjects out of 5692 subjects) changed diagnostic status for PTSD when using the Simms criteria. Additionally, the King and Simms diagnostic systems were highly concordant with each other (considered ‘almost perfect’ agreement) [κ=0.86, s.e.=0.01 (97.73% agreement), z=65.05, p<0.001].
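The κ statistics above summarize agreement between two binary diagnoses. As a minimal sketch, Cohen's κ can be computed from a 2×2 cross-classification and labeled with the Landis & Koch bands. The cell counts below are hypothetical (the Table 2 cells are not reproduced here), the calculation is unweighted, and the function names are illustrative.

```python
def cohen_kappa(both_pos, only_first, only_second, both_neg):
    """Cohen's kappa for two binary classifications of the same subjects."""
    n = both_pos + only_first + only_second + both_neg
    observed = (both_pos + both_neg) / n
    first_pos = both_pos + only_first
    second_pos = both_pos + only_second
    # Chance agreement from the marginal proportions of each classification.
    expected = (first_pos * second_pos + (n - first_pos) * (n - second_pos)) / n**2
    return (observed - expected) / (1 - expected)

def landis_koch_label(kappa):
    """Verbal label for kappa following the Landis & Koch (1977) bands."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"), (0.60, "moderate"),
                         (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"

# Hypothetical counts: 50 subjects positive under the first rule only.
k = cohen_kappa(both_pos=350, only_first=50, only_second=0, both_neg=5300)
print(f"kappa={k:.2f} ({landis_koch_label(k)})")
```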

Finally, we assessed if particular PTSD symptom clusters were responsible for the decreased PTSD prevalence estimates seen in the King and Simms diagnostic alterations. Among respondents meeting DSM-IV PTSD criteria A, E and F, we estimated the proportion meeting only one particular DSM-IV PTSD symptom criterion (among re-experiencing; avoidance/numbing; and arousal) compared with the proportion meeting only the one comparable DSM-IV cluster for the King model (re-experiencing; avoidance and numbing; and arousal) and Simms model (re-experiencing; avoidance and dysphoria; and arousal). The King model's decreased PTSD prevalence resulted solely from its avoidance and numbing symptom diagnostic alteration, and the Simms model's prevalence decrease resulted solely from its arousal symptom alteration.

Co-morbidity and sociodemographic variables

Next we examined if the PTSD diagnostic alterations were associated with different prevalence estimates of co-morbid mental disorders and sociodemographic characteristics (Table 3). Regardless of whether the diagnostic alterations were implemented, similar rates of diagnostic co-morbidity and sociodemographic associations were found.

Table 3. Diagnostic co-morbidity and sociodemographic differences between DSM-IV, King and Simms PTSD diagnoses in the National Comorbidity Survey Replication data

DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition; PTSD, post-traumatic stress disorder; s.e., standard error; MDE, major depressive episode; GAD, generalized anxiety disorder.

^a Weighted numbers, rounded to the nearest whole number.

^b Binomial approximation z test statistic for proportions, comparing co-morbidity rates between the DSM-IV and King PTSD diagnostic systems, using an average sample size of 364 across diagnostic systems.

^c Binomial approximation z test statistic for proportions, comparing co-morbidity rates between the DSM-IV and Simms PTSD diagnostic systems, using an average sample size of 361 across diagnostic systems.

^d Only the most prevalent co-morbid disorders in this sample are displayed here.

^e Recent mental healthcare use was defined as at least one treatment visit in the previous year.

^f Disability was defined as the respondent cutting down on, or being unable to perform, usual activities due to a mental health problem for at least 1 day in the previous month.

Structural validity

The three-factor DSM-IV PTSD model did not fit the data well [robust χ²(31, n=766)=94.76, p<0.001, TLI=0.89, CFI=0.89, RMSEA=0.05, BIC=14638.73]. Most problematic, standardized factor loadings and R² values were quite low for avoiding thoughts/feelings of the trauma (criterion C1: β=0.25, R²=0.06), and numerous unstandardized residual covariances were high (>0.10; Kline, 2004); these parameters were less problematic for the King (C1: β=0.44, R²=0.19) or Simms (C1: β=0.45, R²=0.20) models. The King model had an adequate (but not excellent) fit [robust χ²(31, n=766)=86.14, p<0.001, TLI=0.91, CFI=0.90, RMSEA=0.05, BIC=14619.07], and this model was significantly better than the three-factor DSM-IV model (p<0.001). Finally, the Simms model fit the data very well [robust χ²(31, n=766)=55.90, p<0.001, TLI=0.96, CFI=0.96, RMSEA=0.03, BIC=14533.31]. These results support data from two recent studies examining PTSD's symptom structure using the NCS-R data (Cox et al. 2008; Elhai et al. 2008). However, those studies did not test the Simms model (goodness-of-fit indices here differ slightly from the study of Cox et al., which used different statistical software that generates less conservative fit indices).

Finally, one interpretation of the Simms model is that the dysphoria symptom cluster should be de-emphasized from the PTSD diagnosis (Simms et al. 2002). Therefore, we tested a three-factor alternative to the Simms model that simply removes the dysphoria factor, which fit the data quite well [robust χ²(11, n=766)=20.33, p=0.04, TLI=0.95, CFI=0.97, RMSEA=0.03, BIC=7724.45].

In comparing models, judging from BIC values, the original Simms model appears to have approximately a 1600:1 chance of being the preferred model over the three-factor DSM-IV model, and a 1300:1 chance of being the preferred model over the King model; the Simms model that removed the dysphoria symptoms had the best-fitting BIC value.
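The BIC comparison can be retraced from the values reported above. The sketch below computes the pairwise BIC differences and checks Raftery's benchmark that a 10-point difference corresponds to odds of about 150:1, using the usual approximation exp(ΔBIC/2). The proportional scaling of that benchmark (150 per 10 BIC points) is shown only because it roughly reproduces the 1600:1 and 1300:1 figures; that reading is an inference from the numbers, not something the text states, and the function names are illustrative.

```python
import math

# BIC values as reported for the three full models.
BIC = {"DSM-IV": 14638.73, "King": 14619.07, "Simms": 14533.31}

def bic_difference(worse, better):
    """BIC of the first model minus BIC of the second (positive favors the second)."""
    return BIC[worse] - BIC[better]

def approx_bayes_factor(delta_bic):
    """Approximate odds favoring the lower-BIC model: exp(delta / 2)."""
    return math.exp(delta_bic / 2)

def proportional_odds(delta_bic, odds_per_10=150):
    """Assumed linear scaling of the 150:1-per-10-points benchmark."""
    return odds_per_10 * delta_bic / 10

print(f"10-point benchmark: {approx_bayes_factor(10):.0f}:1")
for other in ("DSM-IV", "King"):
    delta = bic_difference(other, "Simms")
    print(f"{other} vs Simms: delta BIC={delta:.2f}, "
          f"proportional odds={proportional_odds(delta):.0f}:1")
```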