Introduction
The diagnostic model of post-traumatic stress disorder (PTSD) currently includes three symptom clusters, labeled ‘re-experiencing’ (criterion B), ‘avoidance and numbing’ (criterion C) and ‘hyperarousal’ (criterion D) (APA, 2001). However, a substantial body of empirical research has demonstrated that this model does not adequately explain the structure of the PTSD construct (Asmundson et al. Reference Asmundson, Stapleton and Taylor2004; Frueh et al. Reference Frueh, Elhai, Kaloupek and Rosen2004). Furthermore, alternative models have not been tested for their convergence in diagnosing Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV) PTSD.
Numerous recent studies have used confirmatory factor analysis (CFA) to test specific, theoretically driven models of PTSD in revealing the hypothesized model(s) best fitting, or accounting for, respondents' intercorrelation patterns (for a review, see Asmundson et al. Reference Asmundson, Stapleton and Taylor2004). This literature, with various trauma-exposed samples, has consistently found that the traditional three-factor DSM-IV PTSD model does not adequately fit observed data (e.g. DuHamel et al. Reference DuHamel, Ostrof, Ashman, Winkel, Mundy, Keane, Morasco, Vickberg, Hurley, Burkhalter, Chhabra, Scigliano, Papadopoulos, Moskowitz and Redd2004; McWilliams et al. Reference McWilliams, Cox and Asmundson2005; Naifeh et al. Reference Naifeh, Elhai, Kashdan and Grubaugh2008) and that other models fit better (e.g. Elhai et al. Reference Elhai, Gray, Docherty, Kashdan and Kose2007, Reference Elhai, Grubaugh, Kashdan and Frueh2008; Elkit & Shevlin, Reference Elkit and Shevlin2007; Palmieri et al. Reference Palmieri, Marshall and Schell2007b; Naifeh et al. Reference Naifeh, Elhai, Kashdan and Grubaugh2008; Saul et al. Reference Saul, Grant and Carter2008).
Two models in particular have garnered the most empirical support for explaining PTSD's factor structure. Although other models have been investigated, the two models highlighted here consistently statistically outperform them (most recently in Elkit & Shevlin, Reference Elkit and Shevlin2007; Krause et al. Reference Krause, Kaltman, Goodman and Dutton2007; Naifeh et al. Reference Naifeh, Elhai, Kashdan and Grubaugh2008; Palmieri et al. Reference Palmieri, Marshall and Schell2007a, Reference Palmieri, Weathers, Difede and Kingb; Saul et al. Reference Saul, Grant and Carter2008).
The revised DSM-IV PTSD model of King et al. (Reference King, Leskin, King and Weathers1998) separates criterion C's effortful avoidance and emotional numbing symptoms into separate factors, resulting in re-experiencing (B1–B5), effortful avoidance (C1–C2), emotional numbing (C3–C7) and hyperarousal (D1–D5). This model reflects research demonstrating that avoidance and numbing are statistically distinct constructs, revealing divergent patterns of correlates with psychopathology, and different prognoses and treatment effects (for a review, see Asmundson et al. Reference Asmundson, Stapleton and Taylor2004). Numerous CFA studies have supported the King model with adults (most recently in Elhai et al. Reference Elhai, Gray, Docherty, Kashdan and Kose2007, Reference Elhai, Grubaugh, Kashdan and Frueh2008; Palmieri et al. Reference Palmieri, Marshall and Schell2007a; Schinka et al. Reference Schinka, Brown, Borenstein and Mortimer2007) and adolescents (Saul et al. Reference Saul, Grant and Carter2008).
The model of Simms et al. (Reference Simms, Watson and Doebbeling2002) retains the King model's re-experiencing (B1–B5) and effortful avoidance (C1–C2) factors. However, three PTSD hyperarousal symptoms – sleep difficulty (D1), irritability (D2) and concentration problems (D3) – are moved from the hyperarousal factor and combined with the emotional numbing factor's symptoms (C3–C7) to form a ‘dysphoria’ factor. Thus, the resulting model includes re-experiencing (B1–B5), effortful avoidance (C1–C2), dysphoria (C3–C7 and D1–D3) and hyperarousal (D4–D5). The Simms model reflects empirical work demonstrating that general distress/dysphoria is an underlying component of mood and anxiety disorders (including PTSD) (e.g. Watson, Reference Watson2005; Slade & Watson, Reference Slade and Watson2006), and that this construct is distinct from the hyperarousal associated specifically with post-traumatic reactions (Simms et al. Reference Simms, Watson and Doebbeling2002). Several recent studies have supported this model over other similar models with adults (Elkit & Shevlin, Reference Elkit and Shevlin2007; Krause et al. Reference Krause, Kaltman, Goodman and Dutton2007; Palmieri et al. Reference Palmieri, Marshall and Schell2007b), but it has not been examined with adolescents.
At present, the King and Simms models appear to be the top candidates for best explaining PTSD's symptom structure. However, no studies have tested the practical impact of altering PTSD's diagnosis based on the alternative King or Simms models. And because the PTSD factor analytic literature illuminates how symptoms empirically cluster together, drawing from that literature can guide the psychiatry field toward improved PTSD diagnostic models.
The present investigation, using large-scale, epidemiological data, examines the impact of altering the PTSD diagnosis based on these models, testing differences in PTSD's prevalence, co-morbidity and sociodemographics, and factor structure. We explored our hypotheses in both samples of adolescents and adults, to test our research questions across two separate age groups. We hypothesized that the diagnostic alterations would result in the following: (1) a downward adjustment of PTSD's prevalence, due to more conservatively stringent PTSD diagnostic criteria; (2) no substantial decreases in co-morbidity rates, because recent research found that even removing PTSD symptoms that overlap with highly co-morbid mood/anxiety disorders did not affect PTSD's co-morbidity (Elhai et al. Reference Elhai, Grubaugh, Kashdan and Frueh2008); and (3) significant improvements in PTSD's factor structure, based on the CFA findings for PTSD summarized above. We retained the required number of six symptoms in altering the diagnosis, in order to reliably compare results with the same six-symptom requirement currently used in the DSM-IV PTSD diagnosis.
This study is important and timely since the PTSD construct has received criticism recently for a range of conceptual issues pertaining to symptom structure and symptom overlap with other mental disorders (Spitzer et al. Reference Spitzer, First and Wakefield2007; Rosen & Lilienfield, Reference Rosen and Lilienfield2008; Rosen et al. Reference Rosen, Spitzer and McHugh2008). Also, decreasing PTSD's symptom overlap with mood and anxiety disorders would ensure that the PTSD construct is unique as a diagnostic entity. Further, conceptual issues in the mental disorders (including PTSD) are currently being considered for DSM-V, and investigations such as this one potentially speak to the issue of whether and how to revise the disorder's diagnostic criteria.
Study 1: method
Sample
Study 1 used archival data from the National Comorbidity Survey Replication (NCS-R) (Kessler, Reference Kessler2006). The NCS-R was a nationally stratified, multistage area household probability sample study of non-institutionalized adults (aged 15 years and older). The NCS-R was conducted with 9282 participants in the early 2000s (NCS-R part I), with demographic characteristics presented in previous NCS-R reports (Kessler et al. Reference Kessler, Berglund, Chiu, Demler, Heeringa, Hiripi, Jin, Pennell, Walters, Zaslavsky and Zheng2004). The present paper reports on the representative subsample of participants completing the NCS-R part II (which evaluated PTSD; n=5692).
Instruments
The World Mental Health Survey Initiative version of the structured Composite International Diagnostic Interview (CIDI; Kessler & Ustun, Reference Kessler and Ustun2004) was used to diagnose DSM-IV mental disorders; the CIDI evidences adequate convergence with other similar measures (Andrews & Peters, Reference Andrews and Peters1998; Haro et al. Reference Haro, Arbabzadeh-Bouchez, Brugha, De Girolamo, Guyer, Jin, Lepine, Mazzi, Reneses, Vilagut, Sampson and Kessler2006). In addition to the lifetime DSM-IV PTSD diagnostic variable, we also examined lifetime diagnostic variables for mood, anxiety and substance-use disorders. DSM-IV diagnostic algorithms were used to assign diagnoses, discussed elsewhere (Kessler et al. Reference Kessler, Berglund, Demler, Jin, Merikangas and Walters2005). We exclusively used the NCS-R's ‘non-hierarchy’ diagnoses (i.e. allowing a particular diagnosis to be assigned even if it occurred solely in the presence of another disorder).
Of relevance to PTSD, participants were first asked in behaviorally specific terms about previous exposure to a variety of traumatic events meeting DSM-IV's PTSD stressor criterion (A1). Only those participants endorsing a traumatic event with initial fear, helplessness or horror (criterion A2) were subsequently queried about DSM-IV PTSD symptoms. PTSD symptom queries involved binary (‘yes’/'no') lifetime symptom ratings about one's trauma. For those endorsing more than one trauma, the most upsetting occurrence of their most upsetting traumatic event type was used. For individuals whose most upsetting trauma occurrence was different from a trauma that was randomly selected by NCS-R investigators, they were instructed to rate their PTSD symptoms separately for each event; PTSD diagnoses were then assigned based on meeting PTSD criteria from either of these two events. Finally, PTSD's criteria E (duration) and F (functional impairment) were queried. Skip-out rules were implemented, such that if a participant did not meet a particular PTSD symptom criterion, s/he was not subsequently queried about remaining PTSD criteria (discussed further below).
Analyses
NCS-R part II sampling weights were used for all analyses in study 1, to adjust for differential household size, non-response and post-stratification. We used Stata 9.0 software (StataCorp LP, College Station, TX, USA) to examine whether the King and Simms models yielded differences from the DSM-IV PTSD model in prevalence of PTSD, diagnostic status, diagnostic co-morbidity and sociodemographic associations. We used Mplus 5.1 software (Muthén & Muthén, Reference Muthén and Muthén1998–Reference Muthén and Muthén2007) to examine the impact of using these models on PTSD's structural validity.
We calculated the lifetime DSM-IV PTSD diagnosis, and King and Simms alterations using the NCS-R's PTSD module item data, retaining DSM-IV PTSD's six-symptom diagnostic requirement. For the PTSD diagnosis (and alterations), we required that the respondent endorsed at least one traumatic event meeting criteria A1 and A2, and satisfied criteria E and F. For criteria B through D in the traditional DSM-IV PTSD diagnosis, we required at least one re-experiencing symptom (B1–B5), at least three avoidance/numbing symptoms (C1–C7) and at least two hyperarousal symptoms (D1–D5) (as required in DSM-IV). To remain consistent with the six-item diagnostic requirement in DSM-IV, for the King PTSD diagnostic alteration we retained the requirements for at least one re-experiencing symptom (B1–B5) and at least two hyperarousal symptoms (D1–D5); in addition, we required at least one avoidance symptom (C1–C2) and at least two numbing symptoms (C3–C7) (thus for the similar DSM-IV PTSD minimum total of three avoidance/numbing symptoms). For the Simms PTSD diagnostic alteration, we retained the requirement for at least one re-experiencing symptom (B1–B5) and the King model's requirement of at least one avoidance symptom (C1–C2); with only two possible hyperarousal symptoms in this model, we required at least one hyperarousal symptom (D4–D5), leaving a requirement of at least three dysphoria symptoms (C3–C7, D1–D3) in order to yield a total of six required PTSD symptoms.
For structural validity analyses, we only used data from participants who (in addition to endorsing PTSD's criterion A1) had endorsed criterion A2 in reference to an index trauma on which PTSD ratings were made (n=871). For participants with multiple sets of PTSD ratings, we used ratings from their most upsetting event. We further excluded 105 subjects missing more than four (24%) of PTSD's symptom items (leaving 766 remaining participants) because of skip-out rules; the remaining participants represented a slightly skewed sample in that they met at least one PTSD symptom cluster (only 42 subjects were skipped out of two symptom clusters). Additional missing item-level data (typically, one or two items each, by 6% of subjects) were estimated to preserve the sample size, using maximum likelihood (ML) estimation of missing data (for a review, see Schafer & Graham, Reference Schafer and Graham2002) for categorical outcomes (Muthén & Muthén, 1998–Reference Muthén and Muthén2007, p. 401).
Since the observed dependent variables were binary (‘yes’/'no') ratings, we implemented robust (mean- and variance-adjusted) weighted least squares (WLS) estimation for the CFAs, using polychoric (rather than Pearson) correlations and probit (rather than linear) regression coefficients (Flora & Curran, Reference Flora and Curran2004; Wirth & Edwards, Reference Wirth and Edwards2007). In fitting the CFAs, we estimated intercorrelations among the common factors, not allowing intercorrelations among the residuals' error variances. χ2 Tests of model fit were examined in conjunction with goodness-of-fit indices, including the Tucker–Lewis index (TLI), comparative fit index (CFI) and root mean square error of approximation (RMSEA) (interpreted when RMSEA <0.06 for an excellent fit, and between 0.06 to 0.08 for an adequate fit; CFI/TLI ⩾0.95 for an excellent fit, and between 0.90 to 0.94 for an adequate fit) (Hu & Bentler, Reference Hu and Bentler1998, Reference Hu and Bentler1999). Differences between the DSM-IV and King PTSD models were tested using a χ2 difference test (albeit with a correction factor since a robust χ2 statistic was used; Muthén & Muthén, Reference Muthén and Muthén2006). When comparing the Simms model with the DSM-IV and King PTSD models, these models are not nested subsets within each other, and thus χ2 difference testing is not appropriate; instead, differences in Bayesian information criterion (BIC) values were examined. BIC values, only estimable using ML (but not WLS) estimation, were generated using ML-estimated CFAs, with logistic rather than probit coefficients [more appropriate using ML (Muthén, Reference Muthén1984) albeit without using sampling weights which cannot be currently implemented with ML for categorical outcomes]. When a model has a BIC value of 10 points less than another model, there is a 150:1 odds that the model with the smaller BIC value is the better-fitting model (Raftery, Reference Raftery1995).
Study 1: results
Prevalence
Table 1 illustrates PTSD prevalence rates across the DSM-IV, King and Simms models, with binomial approximation z tests presented to statistically compare proportions (Hays, Reference Hays1994). The King and Simms alterations each reduced DSM-IV PTSD's prevalence by nearly one percentage point.
PTSD, Post-traumatic stress disorder; DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition; s.e., standard error; NCS-R, National Comorbidity Survey Replication; NSA, National Survey of Adolescents.
a Significantly different prevalence from the NCS-R DSM-IV model (binomial approximation z=2.55, s.e.=0.00, p=0.02).
b Significantly different prevalence from the NCS-R DSM-IV model (binomial approximation z=2.88, s.e.=0.00, p<0.01).
c Not significantly different prevalence from the NCS-R King model (binomial approximation z=0.35, s.e.=0.00, p>0.05).
d Significantly different prevalence from the NSA DSM-IV model (binomial approximation z=2.99, s.e.=0.00, p<0.01).
e Significantly different prevalence from the NSA DSM-IV model (binomial approximation z=5.95, s.e.=0.00, p<0.001).
f Significantly different prevalence from the NSA King model (binomial approximation z=3.21, s.e.=0.00, p=0.002).
Table 2 demonstrates changes in DSM-IV PTSD status with the King diagnostic alteration. The level of agreement between these diagnostic systems could be considered ‘almost perfect’ (based on criteria set forth by Landis & Koch, Reference Landis and Koch1977) [κ=0.92, standard error (s.e.)=0.01 (98.63% agreement), z=69.79, p<0.001]. Only 0.84% of the entire sample (48 weighted subjects out of 5692 subjects) changed diagnostic status for PTSD when using the King criteria.
PTSD, Post-traumatic stress disorder; DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition.
a Weighted numbers, rounded to the nearest whole number.
The level of agreement between the DSM-IV and Simms models was ‘almost perfect’ (Landis and Koch, Reference Landis and Koch1977) [κ=0.84, s.e.=0.01 (97.24% agreement), z=63.82, p<0.001]. In combination, approximately 1.82% of the entire sample (79+25 weighted subjects out of 5692 subjects) changed diagnostic status for PTSD when using the Simms criteria. Additionally, the King and Simms diagnostic systems were highly concordant with each other (considered ‘almost perfect’ agreement) [κ=0.86, s.e.=0.01 (97.73% agreement), z=65.05, p<0.001].
Finally, we assessed if particular PTSD symptom clusters were responsible for the decreased PTSD prevalence estimates seen in the King and Simms diagnostic alterations. Among respondents meeting DSM-IV PTSD criteria A, E and F, we estimated the proportion meeting only one particular DSM-IV PTSD symptom criterion (among re-experiencing; avoidance/numbing; and arousal) compared with the proportion meeting only the one comparable DSM-IV cluster for the King model (re-experiencing; avoidance and numbing; and arousal) and Simms model (re-experiencing; avoidance and dysphoria; and arousal). The King model's decreased PTSD prevalence resulted solely from its avoidance and numbing symptom diagnostic alteration, and the Simms model's prevalence decrease resulted solely from its arousal symptom alteration.
Co-morbidity and sociodemographic variables
Next we examined if the PTSD diagnostic alterations were associated with different prevalence estimates of co-morbid mental disorders and sociodemographic characteristics (Table 3). Regardless of whether the diagnostic alterations were implemented, similar rates of diagnostic co-morbidity and sociodemographic associations were found.
DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition; PTSD, post-traumatic stress disorder; s.e., standard error; MDE, major depressive disorder; GAD, generalized anxiety disorder.
a Weighted numbers, rounded to the nearest whole number.
b Binomial approximation z test statistic for proportions, comparing co-morbidity rates between the DSM-IV and King PTSD diagnostic systems, using an average sample size of 364 across diagnostic systems.
c Binomial approximation z test statistic for proportions, comparing co-morbidity rates between the DSM-IV and Simms PTSD diagnostic systems, using an average sample size of 361 across diagnostic systems.
d Only the most prevalent co-morbid disorders in this sample are displayed here.
e Recent mental healthcare use was defined as at least one treatment visit in the previous year.
f Disability involved if the respondent cut down or was unable to perform usual activities due to a mental health problem for at least 1 day in the previous month.
Structural validity
The three-factor DSM-IV PTSD model did not fit the data well [robust χ2(31, n=766)=94.76, p<0.001, TLI=0.89, CFI=0.89, RMSEA=0.05, BIC=14638.73]. Most problematic, standardized factor loadings and R 2 values were quite low for avoiding thoughts/feelings of the trauma (criterion C1: β=0.25, R 2=0.06), with numerous unstandardized residual covariances high (>0.10; Kline, Reference Kline2004), and these parameters were not as problematic for the King (C1: β=0.44, R 2=0.19) or Simms (C1: β=0.45, R 2=0.20) models. The King model had an adequate (but not excellent) fit [robust χ2(31, n=766)=86.14, p<0.001, TLI=0.91, CFI=0.90, RMSEA=0.05, BIC=14619.07], and this model was significantly better than the three-factor DSM-IV model (p<0.001). Finally, the Simms model fit the data very well [robust χ2(31, n=766)=55.90, p<0.001, TLI=0.96, CFI=0.96, RMSEA=0.03, BIC=14533.31]. These results support data from two recent studies examining PTSD's symptom structure using the NCS-R data (Cox et al. Reference Cox, Mota, Clara and Asmundson2008, Elhai et al. Reference Elhai, Grubaugh, Kashdan and Frueh2008). However, those studies did not test the Simms model (goodness-of-fit indices here differ slightly from the study of Cox et al., which used different statistical software that generates less conservative fit indices).
Finally, one interpretation of the Simms model is that the dysphoria symptom cluster should be de-emphasized from the PTSD diagnosis (Simms et al. Reference Simms, Watson and Doebbeling2002). Therefore, we tested a three-factor alternative to the Simms model that simply removes the dysphoria factor, which fit the data quite well [robust χ2(11, n=766)=20.33, p=0.04, TLI=0.95, CFI=0.97, RMSEA=0.03, BIC=7724.45].
In comparing models, judging from BIC values, the original Simms model appears to have approximately a 1600:1 chance of being the preferred model over the three-factor DSM-IV model, and a 1300:1 chance of being the preferred model over the King model; the Simms model that removed the dysphoria symptoms had the best-fitting BIC value.
Study 2: method
Sample
Study 2 used archival data from the National Survey of Adolescents (NSA; Kilpatrick & Saunders, Reference Kilpatrick and Saunders1995, Reference Kilpatrick and Saunders1997), a cross-sectional, nationally representative sample of 4023 adolescents aged 12 to 17 years. Demographic characteristics are reported in previous NSA papers (e.g. Kilpatrick et al. Reference Kilpatrick, Acierno, Saunders, Resnick, Best and Schnurr2000).
Instruments
Family and youth demographic characteristics were obtained. Structured diagnostic interview questions in the NSA assessed each criterion item from the DSM-IV for lifetime major depressive episode and substance-use disorders (for specific details on instrumentation, see Appendix of Kilpatrick et al. Reference Kilpatrick, Acierno, Saunders, Resnick, Best and Schnurr2000). Exposure to potentially traumatic events (PTSD's criterion A1) was assessed using behaviorally specific terms. No direct assessment of PTSD's criterion A2 was conducted.
Lifetime PTSD symptoms (criteria B through D) were assessed using binary ‘yes’/‘no’ ratings based on symptoms that occurred at any time in the adolescent's life, lasting at least 2 weeks, but were not linked to an index trauma. Participants were also queried about PTSD's criterion F. No skip-out rules were used. The PTSD interview was adapted from a similar interview that demonstrated adequate diagnostic convergence with other measures (Kilpatrick et al. Reference Kilpatrick, Resnick, Freedy, Pelcovitz, Resick, Roth, van der Kolk, Widiger, Frances, Pincus, Ross, First, Davis and Kline1998).
Analyses
We used the same statistical software and analytic strategy presented in study 1, above.
We first calculated the DSM-IV PTSD diagnosis, and King and Simms alterations using the NSA's PTSD module item data. For diagnosing PTSD, we required that the respondent endorsed at least one previous trauma meeting DSM-IV PTSD's criterion A1 (items on witnessing violent trauma; sexual assault or physical assault or abuse; and direct exposure to disaster, accident, actual or threatened serious injury)Footnote 1Footnote †. Consistent with the DSM-IV, we also required functional impairment for a PTSD diagnosis. For criteria B through D, we implemented the same requirements as stipulated in study 1, above.
For structural validity analyses, only data from the 3351 trauma-exposed participants were used. Less than 3% of participants were missing data (typically one or two items each), estimated as discussed above in study 1.
Study 2: results
Prevalence
Table 1 illustrates PTSD prevalence across the DSM-IV, King and Simms PTSD models, compared using binomial approximation z tests. The diagnostic alterations resulted in roughly a 1–2.5 percentage point drop in DSM-IV PTSD's prevalence.
Table 2 demonstrates changes in DSM-IV PTSD status with the King diagnostic alteration. The level of agreement between these diagnostic systems was ‘almost perfect’ (Landis & Koch, Reference Landis and Koch1977) [κ=0.91, s.e.=0.02 (98.71% agreement), z=57.79, p<0.001]. Only 1.29% of the entire sample (52 out of 4023 subjects) changed diagnostic status for PTSD when using the King criteria.
Table 2 also demonstrates changes in DSM-IV PTSD status with the Simms diagnostic alteration. The level of agreement between these diagnostic systems was ‘substantial’ (Landis & Koch, Reference Landis and Koch1977) [κ=0.70, s.e.=0.02 (96.20% agreement), z=45.60, p<0.001]. In combination, approximately 3.80% of the entire sample (128+25 weighted subjects out of 4023 subjects) changed diagnostic status for PTSD when using the Simms criteria. Also, the King and Simms diagnostic systems were highly concordant with each other and considered to represent ‘substantial’ agreement (Landis & Koch, Reference Landis and Koch1977) [κ=0.71, s.e.=0.02 (96.69% agreement), z=45.82, p<0.001].
Finally, we assessed if particular PTSD symptom clusters were responsible for the decreased PTSD prevalence estimates seen in the King and Simms diagnostic alterations, as conducted in study 1. Again, the King model's decreased PTSD prevalence resulted solely from its avoidance and numbing symptom diagnostic alteration, and the Simms model's prevalence decrease resulted solely from its arousal symptom alteration.
Co-morbidity and sociodemographic variables
Next we examined if the PTSD diagnostic alterations were associated with different prevalence rates of co-morbid depression and substance-use disorders (but not drug dependence, whereby cell sizes were too small) and sociodemographic characteristics. Table 4 illustrates that regardless of whether the diagnostic alterations were implemented, extremely similar rates of diagnostic co-morbidity and sociodemographic associations were found.
DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, 4th edition; PTSD, post-traumatic stress disorder; s.e., standard error; MDE, major depressive disorder.
a Unweighted numbers, rounded to the nearest whole number.
b Binomial approximation z test statistic for proportions, comparing co-morbidity rates between the DSM-IV and King PTSD diagnostic systems, using an average sample size of 303 across diagnostic systems.
c Binomial approximation z test statistic for proportions, comparing co-morbidity rates between the DSM-IV and Simms PTSD diagnostic systems, using an average sample size of 278 across diagnostic systems.
Structural validity
The three-factor DSM-IV PTSD model fit the data very well [robust χ2(99, n=3351)=343.37, p<0.001, TLI=0.99, CFI=0.98, RMSEA=0.03, BIC=40647.69]. The King model also fit the data very well [robust χ2(97, n=3351)=268.55, p<0.001, TLI=0.99, CFI=0.98, RMSEA=0.02, BIC=40590.65], and this model was significantly better than the three-factor DSM-IV model (p<0.001). The Simms model fit the data very well [robust χ2(97, n=3351)=245.69, p<0.001, TLI=0.99, CFI=0.99, RMSEA=0.02, BIC=40573.27]. Finally, we tested a three-factor Simms model that removes the dysphoria factor, which fit the data quite well [robust χ2(22, n=3351)=80.34, p<0.001, TLI=0.99, CFI=0.99, RMSEA=0.03, BIC=20390.34].
Judging from BIC values, the Simms model appears to have approximately a 1120:1 chance of being the preferred model over the three-factor model, and a 260:1 chance of being the preferred model over the King model; however, the model removing the dysphoria factor appears best-fitting.
General discussion
Results of CFAs demonstrate that both the King and Simms models represent significant improvements over the current DSM-IV PTSD model. The Simms model resulted in the factor structure with the best fit to the data, followed by the King model; this finding was opposite from that found in a similar paper using the original NCS data (McWilliams et al. Reference McWilliams, Cox and Asmundson2005), possibly due to the more sophisticated methods of querying trauma and PTSD in the NCS-R. These findings are important because data-reductionistic studies of the PTSD construct have been lacking (Frueh et al. Reference Frueh, Elhai, Kaloupek and Rosen2004; Buckley, Reference Buckley2007) and it is not clear whether the current DSM-IV PTSD model is efficient, parsimonious, theoretically consistent, or empirically supported in contrast to other similar models. Given that both alternative models provided such significant improvements over the DSM-IV model (and in previous studies), it seems likely that meaningful changes to the diagnostic criteria may reasonably be expected in future revisions of the DSM.
Lower PTSD prevalence estimates in the King or Simms diagnostic alterations could be due to several reasons: (1) ‘under-detection’ of PTSD cases in that some symptomatic/impaired individuals may be missed as true PTSD cases; (2) increased diagnostic precision in correctly diagnosing impaired individuals and correctly ruling out non-impaired individuals; and (3) (consistent with supplementary analyses), the separation of PTSD's active avoidance from emotional numbing (King model), as well as restricting hyperarousal symptoms to the truly post-traumatic hyperarousal that does not involve dysphoria (Simms model). Given that both the King and Simms models outperformed the DSM-IV model in CFAs, it seems plausible to accept that the ‘true’ PTSD prevalence may be slightly lower than previously understood.
The King model resulted in the downward adjustment of DSM-IV PTSD's prevalence by roughly one percentage point (Table 1) because 12–16% of DSM-IV PTSD cases were not classified with PTSD using the King model (Table 2). Thus the King model would represent a more conservative PTSD diagnosis than the current DSM-IV's PTSD diagnosis, resulting in better specificity in ruling out non-PTSD cases, but fewer impaired trauma victims meeting the diagnosis of PTSD. However, given the importance of effortful avoidance and its clear distinction from emotional numbing (for a review, see Asmundson et al. Reference Asmundson, Stapleton and Taylor2004), adding such stringency may more accurately describe the psychopathology of the PTSD construct than the DSM-IV PTSD model currently does.
In contrast, the Simms model converged less with the traditional DSM-IV PTSD model. After all, the Simms model not only separates the DSM-IV PTSD model's avoidance from numbing symptoms, but also separates purely post-traumatic hyperarousal symptoms from dysphoria symptoms. Compared with DSM-IV PTSD's prevalence, the Simms model's overall prevalence was lower by only one percentage point in the NCS-R, but by 2.5 points in the NSA (Table 1). Moreover, the Simms model classified 20% (NCS-R) to 40% (NSA) of the DSM-IV PTSD cases as non-cases (Table 2). It is true that requiring pure hyperarousal as distinct from dysphoria appears to have substantial empirical support (Simms et al. Reference Simms, Watson and Doebbeling2002; Krause et al. Reference Krause, Kaltman, Goodman and Dutton2007; Palmieri et al. Reference Palmieri, Marshall and Schell2007b). Thus the Simms diagnostic alteration may be best in quite conservatively diagnosing PTSD, but leads to a question of whether too many legitimate PTSD cases may be ruled out.
The DSM-IV PTSD model is the ‘gold standard’ for diagnosis. However, it may in fact simply reflect an arbitrary status quo for diagnosis. The Simms model may be best at capturing the PTSD construct's underlying symptom structure, based on CFA findings. However, in terms of practical diagnosis of PTSD, the King model may serve as a compromise between accurately reflecting the disorder's distinct key components while not overhauling the diagnosis or substantially reducing its prevalence. This emphasis on diagnosis is not meant to suggest that PTSD should be viewed strictly as a dichotomous ‘present’/'absent' diagnostic phenomenon, since the taxometric literature suggests that PTSD is a dimensional disorder (e.g. Broman-Fulks et al. Reference Broman-Fulks, Ruggiero, Green, Kilpatrick, Danielson, Resnick and Saunders2006). However, assigned diagnoses are still used and required in healthcare practice settings, and for communicating among treatment providers.
Although large-scale, epidemiological datasets were used in the present paper, several limitations should nonetheless be considered. For example, we retained the required six symptoms needed for the PTSD diagnosis, and it is quite possible that relaxing or tightening this requirement would result in different findings. However, varying the minimum symptom requirement was not the focus of this paper; rather, we aimed to test whether other established models with the same required symptom counts would perform differently from the DSM-IV PTSD model. Second, PTSD symptom queries in the NSA were not linked to an index traumatic event, which could have led to inflated PTSD prevalence (Long & Elhai, in press).
Despite its limitations, this research contributes to our understanding of the PTSD construct's symptom structure and how empirically supported PTSD factor models can translate to meaningful changes in diagnosing the disorder. The increase in specificity that would come with the King and Simms models should be placed in the context of the PTSD diagnosis' trend toward less stringency over time, and such liberalizing of the diagnosis has been criticized (McNally, Reference McNally2003; Long & Elhai, in press). As deliberation about psychiatric nosology continues in the next several years in preparation for the next (fifth) edition of the DSM, these and other complementary findings concerning the structure and clinical utility of the PTSD diagnosis will be important in informing decisions that will affect tens of thousands of traumatized people for whom valid assessment and diagnosis are crucial to the receipt of evidence-based treatment and recovery from post-traumatic impairment.
Acknowledgements
Dr Frueh is now affiliated with the University of Hawaii-Hilo.
Declaration of Interest
None.