Introduction
While psychological treatments for depression have different theoretical backgrounds, studies of comparative efficacy have resulted in minimal efficacy differences (Cuijpers et al. Reference Cuijpers, van Straten, Andersson and van Oppen2008; Barth et al. Reference Barth, Munder and Gerger2013). Cognitive behavioral therapy (CBT), for example, aims at alleviating depressive symptoms by changing maladaptive thought schemata and errors in thinking in combination with engaging more in activities that affect mood positively, while psychodynamic therapies assume that gaining insight in (partly) unconscious emotions and relational functioning related to vulnerability for depression is curative. Notwithstanding these markedly different theoretical backgrounds, minimal differences have been found between CBT and psychodynamic therapies with regard depressive symptom reduction during short-term treatment (Leichsenring, Reference Leichsenring2001; Barth et al. Reference Barth, Munder and Gerger2013).
If minimal efficacy differences are found when CBT and psychodynamic therapies are compared across larger patient samples, the question can be raised whether smaller subgroups of patients can be identified that might benefit more from one of these treatments than the other and whether such patient characteristics can be used to guide treatment selection. Little is known about prescriptive factors that are associated with differential efficacy to CBT and psychodynamic therapy for depression. We could retrieve only one study in this regard. Gallagher-Thompson & Steffen (Reference Gallagher-Thompson and Steffen1994) found that clinically depressed family caregivers who had been caregivers for less than 44 months improved more in psychodynamic therapy than in CBT, while patients who had been caregivers for at least 44 months improved more in CBT than in psychodynamic therapy.
Given the lack of research findings in this regard, the National Institute for Health and Clinical Excellence (NICE, 2009, p. 46) called for the examination of moderators of response to CBT and psychodynamic therapy in the treatment of moderate and severe depression as a research recommendation in order to improve patient care. Considering the paucity of findings, we aimed to conduct a hypothesis-generating (rather than a hypothesis-testing; Kraemer et al. Reference Kraemer, Wilson, Fairburn and Agras2002) study by means of conducting post-hoc analyses alongside a randomized clinical trial (RCT) comparing CBT and psychodynamic therapy in the outpatient treatment of depression (Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007). Hypothesis-generating studies are often dismissed as ‘fishing expeditions’, but as argued by Kraemer et al. (Reference Kraemer, Wilson, Fairburn and Agras2002, p. 882) such studies ‘are necessary to foster stronger hypotheses for the next generation of hypothesis-testing studies and to provide the background information necessary to design such powerful studies’. Kraemer et al. (Reference Kraemer, Wilson, Fairburn and Agras2002, p. 883), therefore, state that ‘RCTs should routinely include and report such analyses’. Given the hypothesis-generating nature of our study, we chose to examine a large set of demographic, clinical and psychological patient characteristics as potential prescriptive variables.
Method
Design
This paper draws from data collected in the context of a RCT comparing the efficacy of CBT and psychodynamic therapy in the outpatient treatment of depression that included 341 patients. This intervention study was registered as ISRCTN31263312 with Current Controlled Trials (http://www.controlled-trials.com). The Dutch Union of Medical-Ethic Trial Committees for mental health organizations approved the study design and the study protocol was published (Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007). Efficacy results of this study are reported elsewhere (Driessen et al. Reference Driessen, Van, Don, Peen, Kool, Westra, Hendriksen, Cuijpers, Twisk and Dekker2013, Reference Driessen, Van, Peen, Don, Kool, Westra, Hendriksen, Cuijpers, Twisk and Dekker2015); here we report an effort to identify prescriptive factors associated with differential treatment efficacy. Although these analyses are post hoc, the decision to conduct them was made a priori (Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007).
Patients
Participants were referred by their general practitioner to one of three psychiatric outpatient clinics in Amsterdam, The Netherlands. Inclusion criteria were: (1) presence of a depressive episode according to DSM-IV criteria as assessed with the MINI-International Neuropsychiatric Interview – Plus (Sheehan et al. Reference Sheehan, Lecrubier, Sheehan, Amorim, Janavs, Weiller, Herqueta, Baker and Dunbar1998); (2) Hamilton Depression Rating Scale (HAMD; Hamilton, Reference Hamilton1960) scores ⩾14; (3) age between 18 and 65 years; (4) written informed consent after complete description of the study. Exclusion criteria are described elsewhere (Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007).
Interventions
Both psychotherapies encompassed 16 individual sessions within 22 weeks and were conducted according to published treatment manuals (de Jonghe, Reference de Jonghe2005; Molenaar et al. Reference Molenaar, Don, van den Bout, Sterk and Dekker2009). CBT was based on the principles described by Beck (Reference Beck1976) and included behavioral activation and cognitive restructuring according to a session-by-session protocol with homework assignments. Short-term psychodynamic supportive psychotherapy (SPSP; de Jonghe et al. Reference de Jonghe, de Maat, Van, Hendriksen, Kool, van Aalst and Dekker2013) was used to represent the psychodynamic intervention. This modality involved an open patient–therapist dialog that used supportive and insight-facilitating techniques to address the emotional background of the depressive symptoms by discussing current relationships, internalized past relationships, and interpersonal patterns. Psychotherapists in both conditions were trained psychiatrists or psychologists. Although no formal assessments were conducted, manual fidelity was checked by means of bi-weekly supervision sessions, chaired by a study supervisor, in which audio-taped material was discussed.
Severely depressed (HAMD >24) patients at baseline (CBT: n = 66; SPSP: n = 63) and moderately depressed patients at baseline who developed severe symptoms during treatment (CBT: n = 6; SPSP: n = 15) were offered additional antidepressant medication. The number of patients starting pharmacotherapy at baseline or during psychotherapy did not differ significantly between the treatment conditions. Antidepressant medication was administrated by a psychiatrist (who was not the patient's psychotherapist) according to a protocol starting with extended-release venlafaxine 75 mg/day that could be raised to a maximum of 225 mg/day and switched to either citalopram or nortriptyline in case of intolerance or complete non-response. Pharmacotherapy consultations addressed symptom evaluation, side-effects and adherence.
Instruments
Post-treatment remission rate (HAMD ⩽7) constituted the RCT's primary outcome (Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007). However, as the rather low remission rates found (Driessen et al. Reference Driessen, Van, Don, Peen, Kool, Westra, Hendriksen, Cuijpers, Twisk and Dekker2013) would likely result in empty cells when examining subgroups of patients, thereby compromising logistic analyses, we decided to use continuous post-treatment HAMD scores as primary outcome measure in this study. HAMD assessors were trained master-level clinical psychology students not blind to patient grouping, who assessed the HAMD according to the Dutch scoring manual (de Jonghe, Reference de Jonghe1994). Assessors engaged in 1-h peer supervision sessions bi-weekly, in which audio-taped interviews were discussed. The average intraclass correlation coefficient over 46 audio-taped assessments scored by multiple assessors was 0.97.
We aimed at examining a large set of potential prescriptive factors and we therefore wanted to examine all moderator variables included in the design of the RCT (for a description see Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007). However, we were faced with missing data on a number of these variables due to participants not returning questionnaire booklets. We decided to examine a variable as a potential prescriptive factor only if ⩽10% of the cases were missing. This resulted in a total of 23 variables (12 demographic, nine clinical, two psychological) to be considered as prescriptive factors. These variables are written in italics below as well as listed in Table 1 with the classification options for categorical variables. All variables were assessed at baseline.
Table 1. Baseline characteristics of the study sample
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20241024153734-24846-mediumThumb-gif-S0033291715001853_tab1.jpg?pub-status=live)
CBT, Cognitive behavioral therapy, HAMD, Hamilton Depression Rating Scale; LEIDS, Leiden Index of Depression Sensitivity; s.d., standard deviation; SPSP, short-term psychodynamic supportive psychotherapy.
With regard to the 12 demographic characteristics, age and gender were extracted from the mental health clinic registration system, while a self-report demographic questionnaire designed by the investigators was used to assess nationality, cultural background, marital status, living situation, religion, educational level, current profession, job status, breadwinnership, and breadwinner's main source of income. Cultural background was included in this study, because we considered it a potential prescriptive factor of differential treatment efficacy (Ward, Reference Ward2007). Participants classified themselves with regard to cultural background, with options (see Table 1) being defined by the investigators.
With regard to the nine clinical characteristics, we used the Beck Anxiety Inventory (Beck et al. Reference Beck, Epstein, Brown and Steer1988) total score to assess co-morbid anxiety symptoms, the Brief Symptom Inventory (de Beurs & Zitman, Reference de Beurs and Zitman2006) total score to assess (co-morbid) general psychopathology, and four items of the Trimbos/iMTA questionnaire for costs associated with psychiatric illness (Hakkaart-van Roijen, Reference Hakkaart-van Roijen2002) to assess healthcare use (medication use and number of consultations with general practitioner, outpatient mental health care, and industrial medical officer in the last 4 weeks). Duration of the current depressive episode, previous treatment for the current depressive episode, and lifetime number of depressive episodes were self-reported by the patient during the baseline clinical interview.
The two psychological characteristics included cognitive reactivity to sad mood (the relative ease with which maladaptive cognitions or cognitive styles are triggered by mild mood fluctuations) and anxiety sensitivity (a person's beliefs that anxiety experiences have negative somatic, psychological or social consequences), which were measured by total scores of, respectively, the Leiden Index of Depression Sensitivity (van der Does, Reference van der Does2002) and the Anxiety Sensitivity Index (Reiss et al. Reference Reiss, Peterson, Gursky and McNally1986).
Statistical analysis
Rather than using the intention-to-treat sample and imputing missing post-treatment HAMD scores, we examined potential prescriptive factors in the group of patients who completed post-treatment HAMD assessment. We chose to do so because loss to HAMD assessment was significantly associated with missing baseline data for a number of variables, such that for the patients who were lost to assessment and whose HAMD value was to be imputed, data on which the imputation should be based was missing as well and imputation would likely result in unreliable post-treatment HAMD values. We were mainly interested in the differential efficacy between CBT and SPSP, and therefore included all abovementioned 23 potential prescriptive pre-treatment factors in the analyses. In addition, we wanted to take into account whether or not patients received additional antidepressant medication. Therefore, we used this variable as an extra prescriptive factor. As the absence of treatment differences on the prescriptive factors is a precondition for estimating differential efficacy, we checked for treatment differences on all potential prescriptive factors in the group of patients who completed post-treatment HAMD assessment in an initial series of analyses.
In clinical trials, prescriptive factors are often examined by means of stepwise regression analysis, adding a main effect and a predictor×treatment interaction effect into the model for each potential prescriptive factor (Kraemer et al. Reference Kraemer, Wilson, Fairburn and Agras2002; for further explanation of the method see Aiken & West, Reference Aiken and West1991 and Cohen et al. Reference Cohen, Cohen, West and Aiken2013). However, the power of finding significant effects in such a model with a multitude of parameters may be low in a sample of modest size, the resulting model can be quite complex making it hard to interpret, and stepwise regression does not allow for modeling nonlinear relationships. A new method that can also be used in this regard, but does not suffer from these issues is model-based recursive partitioning (MOB; Zeileis et al. Reference Zeileis, Hothorn and Hornik2008, Reference Zeileis, Hothorn and Hornik2012; Strobl et al. Reference Strobl, Malley and Tutz2009). MOB is a method for finding subgroups for which a specified basic model has different parameters. In the case of our study, this basic model is an ANOVA model including post-treatment HAMD score as dependent variable and a dichotomous treatment factor (CBT or SPSP) as independent variable. This ANOVA model has two parameters: an intercept (equaling the average post-treatment HAMD score of all subjects) and a treatment parameter (equaling the average difference in post-treatment HAMD score between the treatment conditions). This latter parameter is considered the most interesting as it reflects differential treatment efficacy.
MOB uses potential prescriptive variables (e.g. demographic characteristics) to identify groups of patients for which the ANOVA model parameters differ. If the treatment parameter differs between subgroups, this means that the average difference between the two treatments is different for these subgroups. Consequently, the predictor(s) defining these groups can be considered prescriptive factors. For example, if MOB would show a positive treatment parameter for males and a negative one for females, this would mean that, on average, the one treatment gives better outcomes for males, whereas the other treatment gives better outcomes for females and gender could be considered a prescriptive factor.
To identify these subgroups, a binary tree is used in MOB. In the first node of the tree, all observations are used to estimate the basic ANOVA model. Next, for each potential prescriptive variable, the hypothesis that the basic model's parameters are stable over the variable's (ordered or nominal) values is tested, which results in a p value. Next, it is checked if the lowest p value is lower than some pre-specified α, and if so, the predictor associated with this minimal value is chosen to split up the observations into two subgroups (child nodes) using the cut-score which shows the highest structural change (i.e. instability). Next, in both the left and the right child node, the subgroup's data are used to refit the ANOVA model, and the abovementioned procedure is repeated. The process of splitting nodes into child nodes is continued until the lowest p value fails to be significant.
In this study, MOB was conducted in R (R Development Core Team, 2010) using the party library (Hothorn et al. Reference Hothorn, Hornik, Strobl and Zeileis2012). As described above, we chose an ANOVA model as the basic model. We decided not to include pre-treatment HAMD score as a covariate in this basic model, because pre-treatment HAMD score was used as a decision variable to add antidepressant medication to psychotherapy. Adding it as a covariate would be problematic, as this would result in a spurious relationship between the medication factor and the covariate's regression parameter in the basic model. We changed party's default value for minimal observations in the end nodes from 20 to 30 (about 15 in each therapy group), which we deemed the minimal for estimating the ANOVA. We used an α of 10%, because of the theory-generating nature of our study (Kraemer et al. Reference Kraemer, Wilson, Fairburn and Agras2002, pp. 881–882; Stevens, Reference Stevens2002) and because sample size was reduced by the exclusion of patients not completing post-treatment HAMD assessment and might be further decreased by missing values among the prescriptive variables.
In standard MOB, all predictors are entered in the model simultaneously, but we preferred a forward selection approach for two reasons. First, MOB as implemented in R's party library uses complete observations on all variables entered. Although the algorithm usually only selects a small subset of the potential predictors, observations with any missing value on the full set of variables are removed before the actual analysis. As missing value patterns varied over variables, entering all variables simultaneously would result in a large number of excluded cases, while this was not the case with the forward selection approach. Second, patients with a baseline HAMD scores >24 received additional antidepressant medication. Because all other predictors should be nested under this design factor, it should be in the first node of the binary tree. The only way to incorporate this into the model is to force this factor as the first predictor in the forward approach, and then expand the model with additional predictors in each of the child nodes. The forward selection method was performed manually by repeating the analyses and keeping track of the p value of each moderator. Our data-analytic strategy bares a strong resemblance with the methods reported by Dusseldorp & Meulman (Reference Dusseldorp and Meulman2004, Dusseldorp & Van Mechelen, Reference Dusseldorp and Van Mechelen2014), which have been applied in a study of prescriptive factors associated with differential efficacy of cognitive therapy and antidepressants for panic disorder patients (Dusseldorp et al. Reference Dusseldorp, Spinhoven, Bakker, van Dyck and van Balkom2007), but the current method can be considered a more general approach.
Ethical standards
The authors assert that all procedures contributing to this work comply with the ethical standards of relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.
Results
Patient sample
Of the 341 patients randomized, 233 (68.3%) completed post-treatment HAMD assessment and no significant differences were found between treatment conditions with regard to the number of patients lost to this assessment [CBT: 32.3%; SPSP: 31.1%; χ2(1) = 0.06, p = 0.81]. The group of patients who completed post-treatment HAMD assessment is described in Table 1 with regard to all potential prescriptive variables examined. With the exception of marital status, no significant differences between conditions were found. In Table 2, the baseline characteristics of the patients who completed post-treatment HAMD assessment are compared with those of the patients who were lost to this assessment. It can be seen from this Table that significant differences were found between these two groups on a large number of baseline characteristics.
Table 2. Differences between patients completing post-treatment HAMD assessment and patients not completing this assessment, who were excluded from the analyses
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20241024153734-81571-mediumThumb-gif-S0033291715001853_tab2.jpg?pub-status=live)
HAMD, Hamilton Depression Rating Scale; LEIDS, Leiden Index of Depression Sensitivity; s.d., standard deviation.
Model-based recursive partitioning analysis
The results of the model-based recursive partitioning analysis are represented in Fig. 1 and treatment outcomes in each node of the tree are listed in Table 3. In Table 3, the standardized mean difference between the treatments (CBT v. SPSP) is expressed as Cohen's d (Cohen, Reference Cohen1988). A positive value indicates that CBT resulted in a better outcome (i.e. a lower post-treatment HAMD score) than SPSP, while a negative value indicates that SPSP resulted in a better outcome than CBT.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921045556905-0703:S0033291715001853:S0033291715001853_fig1g.gif?pub-status=live)
Fig. 1. ANOVA-based tree. BAI, Beck Anxiety Inventory; CBT, cognitive behavioral therapy; SPSP, short-term psychodynamic supportive psychotherapy. The model predicts HAMD score at post-treatment using the factor Therapy (CBT v. SPSP). Boxplots represent scores for both groups under the ANOVA model; the dots connected by a line represent the averages in the two groups. Due to missing values, the number of observations in the child nodes do not always add up to those in the parent node.
Table 3. Outcomes in each node of the model-based recursive partitioning tree
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921045556905-0703:S0033291715001853:S0033291715001853_tab3.gif?pub-status=live)
CBT, Cognitive behavioral therapy; SPSP, short-term psychodynamic supportive psychotherapy.
Pooled standard deviation is 8.754.
* The standardized mean difference between the treatments (CBT v. SPSP) is expressed as Cohen's d (Cohen, Reference Cohen1988). A positive value indicates that CBT resulted in a better outcome (i.e. a lower post-treatment HAMD score) than SPSP, while a negative value indicates that SPSP resulted in a better outcome than CBT.
Node 1 includes all patients and treatment differences in this group are minimal (d = 0.04; Table 3). Node 1's p value (p = 0.004; Fig. 1) indicates that the parameters of the basic model are significantly different for the two subgroups in which this node is divided: left-hand node 2 representing the subgroup of patients with baseline HAMD scores ⩽24 who received psychotherapy only and right-hand node 3 representing the subgroup of patients with baseline HAMD scores >24 who received psychotherapy and antidepressant medication. While minimal treatment differences (d = −0.07; Table 3) are found in the first group, larger treatment differences (d = 0.36) are apparent in the latter group.
Left-hand node 2 represents the low-severity patients receiving psychotherapy only and its p value (p = 0.051; Fig. 1) indicates that the parameters of the basic model are significantly different for the two subgroups in which this node is divided: node 4 including patients with baseline co-morbid anxiety scores of ⩽15 and node 5 including patients with co-morbid anxiety scores >15. In the first, post-treatment depression scores are lower in the SPSP condition than in the CBT condition (d = −0.40; Table 3), while in the latter post-treatment depression scores differ minimally (d = 0.04).
Right-hand node 3 represents the group of high-severity patients who received psychotherapy combined with antidepressant medication and its p value (p = 0.032; Fig. 1) indicates that the parameters of the basic model are significantly different for the two subgroups in which this node is divided: node 6 including patients with a duration of the current depressive episode <12 months and node 7 including patients who report an episode duration of ⩾12 months. In the first, post-treatment depression scores are lower in the CBT condition (d = 0.83; Table 3), while in the latter post-treatment depression scores are lower in the SPSP condition (d = −0.31).
Sensitivity analyses
As pre-treatment HAMD score was not included as a covariate in the basic model, one might argue that the post-treatment differences in HAMD scores between conditions found in nodes 4, 6, and 7 might be the consequence of chance differences in pre-treatment HAMD scores between the treatment conditions within these nodes. Such that, for instance, the moderately severe patients with low anxiety levels in the SPSP condition in node 4 coincidently had lower pre-treatment HAMD scores than these patients in the CBT condition in this node and that this chance finding resulted in the lower post-treatment HAMD scores found in the SPSP condition. We, therefore, checked for mean differences in pre-treatment HAMD scores between treatment conditions within nodes 4, 6, and 7, but found none, suggesting that chance differences in mean pre-treatment HAMD scores within these nodes do not provide an alternative explanation for our findings.
We assessed the amount of bias in the effect sizes resulting from the MOB procedure using the bootstrapping procedure that has been described in detail by Dusseldorp & Van Mechelen (Reference Dusseldorp and Van Mechelen2014, online supplementary material C.2). With 1000 bootstraps, the resulting estimated bias in the range of effect sizes (Cohen's d; maximum-minimum) was 0.68. The range of effect sizes in the tree that resulted from the MOB analyses in our study was 1.23 (d min = −0.40; d max = 0.83). Thus, when using the tree resulting from this study in a new sample, the range of effect sizes could expected to be 1.23–0.68.
Competing splits
Because of the hierarchical nature of MOB, covariates and splitting values earlier in the tree impact the selection of those lower in the tree; small changes in the data may therefore result in different tree structures. For this reason, it is instructive to inspect the size and significance of alternative splitting variables.
The second best splitting variable for node 2 (p = 0.077) was breadwinner's main source of income. A split was made into two groups: (1) Salary/Pension/Study subsidy, and (2) Welfare/Disability benefit/Unemployment benefit/Other. In both groups SPSP resulted in a lower mean post-treatment HAMD score than CBT, but the difference in HAMD points was rather small in the first group (0.62) and markedly larger in the second group (2.44). The difference between the p value of the first split (Anxiety: p = 0.051) and this second best split (p = 0.077) was not that large, suggesting that co-morbid level of anxiety is only a marginally stronger moderator of differential efficacy than breadwinner's main source of income in the group of low-severity patients receiving psychotherapy only.
By contrast, for node 3 there was no clear competing split. The second best splitting variable for this node was cultural background. However, the p value of this split was not significant (p = 0.172). Thus, episode duration was the strongest moderator of differential efficacy in the group of high-severity patients receiving combined treatment.
Discussion
We conducted a hypothesis-generating study of prescriptive factors associated with differential efficacy of CBT and SPSP in a sample of patients seeking treatment for a depressive episode in psychiatric outpatient clinics. Minimal differences between treatments were found when examining the total sample of patients who completed post-treatment assessment, but model-based recursive partitioning analyses suggested differential efficacy among subgroups of patients that differed with regard to episode duration and co-morbid anxiety level. These findings are in line with another study that found a specific patient factor (length of caregiving) associated with differential efficacy of CBT and psychodynamic therapy for depression (Gallagher-Thompson & Steffen, Reference Gallagher-Thompson and Steffen1994).
p Values within some of the nodes, relating to treatment differences in the group of patients represented in that node, were not significant at an α = 0.05 level. Therefore, one might argue that no significant treatment differences are found in these subgroups and that, consequently, the factors that define these subgroups cannot be considered prescriptive factors. However, Kraemer et al. (Reference Kraemer, Wilson, Fairburn and Agras2002) have argued that moderators of treatment should not be defined by p values (‘because moderator status … would then change with sample size’ p. 881), but by population parameters instead. Effect sizes within these nodes did show relevant treatment differences, which can be considered large enough to be of importance.
We found co-morbid anxiety to be associated with differential treatment efficacy in low-severity patients receiving psychotherapy only. No treatment differences were found among patients who presented higher co-morbid anxiety levels and SPSP was found to be more efficacious than CBT for patients with low anxiety levels. We speculate that patients with low anxiety levels might benefit from SPSP, as they may be better capable to enter the more open relational- and insight-oriented SPSP dialog and feel less comfortable with the more structured and protocolized CBT.
Duration of the current depressive episode was associated with differential treatment efficacy in high-severity patients who received psychotherapy combined with antidepressant medication. CBT was more efficacious than SPSP for patients reporting a duration of <1 year, while SPSP was more efficacious than CBT for such patients reporting a duration of ⩾1 year. We speculate that patients with longer episode durations have depressive symptoms that are more influenced by their personality structure resulting in more complex working alliances and transference feelings; psychodynamic therapists are trained to elaborate on these therapeutic relational aspects if necessary. At the same time, we think that it is harder for patients with longer episode duration to start doing homework and to change their lifestyle and activities, which is what CBT calls for. Similarly, patients with relative short episode durations might benefit less from a psychodynamic approach, while the combination of antidepressant medication and behavioral activation resulting from completed homework assignments in the first part of CBT might lead to alleviation of depressive symptoms in this group.
This study has a number of strengths. First, it is one of few studies examining prescriptive factors associated with differential efficacy of CBT and psychodynamic therapy for depression, which refers to a clinically relevant but yet rather unexplored research topic related to a highly prevalent mental disorder. It also is the first study that considered demographic and illness characteristic as potential prescriptive factors. Furthermore, it is the first study to apply model-based recursive partitioning in this regard, which provides the benefits of reducing the number of prescriptive factors and selecting the most important ones, resulting in models that are easily interpretable. Additional strengths of this study include its efforts to increase external validity, such as providing treatment in general psychiatric outpatient clinics by a large number of therapists with different experience levels and including patients with relatively low social-economic statuses (Driessen et al. Reference Driessen, Van, Don, Peen, Kool, Westra, Hendriksen, Cuijpers, Twisk and Dekker2013).
However, this study also has a number of limitations (see Driessen et al. Reference Driessen, Van, Schoevers, Cuijpers, van Aalst, Don, Hendriksen, Molenaar and Dekker2007, Reference Driessen, Van, Don, Peen, Kool, Westra, Hendriksen, Cuijpers, Twisk and Dekker2013 for further discussion of the RCT's limitations). First, we examined prescriptive factors of CBT and psychodynamic therapy efficacy in a group of patients restricted to those that completed post-treatment assessment. Comparative analyses suggested that this group of patients differed from the group of patients lost to this assessment. Related, there was a baseline difference in marital status between the CBT and SPSP conditions in the group of patients who completed post-treatment assessment; a larger proportion of married persons received CBT. Moreover, bootstrapping suggested that the resulting estimated bias in the range of the effect sizes was 0.68. For these reasons, we need to be cautious in generalizing this study's results, particularly the effect sizes from Table 3, to the entire population of depressed patients. Second, HAMD assessors were not blind to patient grouping. Therefore, we cannot rule out observer bias.
The most important limitation of our study is, however, its post-hoc nature. Our findings are observational and can be the consequence of chance findings in the sample studied. The use of an alpha level of 0.10 (which allowed for a larger power for detecting prescriptive factors) as well as the interpretation of treatment differences within some of the Nodes that would not be significant under more stringent alpha levels, increases the possibility of chance findings. Our findings should, therefore, be validated before being used to guide treatment selection. Validation should occur by including episode duration and co-morbid anxiety level (and possibly breadwinner's main source of income as well) as stratification variables in a future RCT comparing CBT and psychodynamic therapy in the treatment of depression and conduct a formal test of a moderator-by-treatment interaction (Kraemer et al. Reference Kraemer, Wilson, Fairburn and Agras2002). We also encourage replication of our study by means of applying model-based recursive partitioning in samples of other studies that compare different treatment options for depression in order to identify further potential prescriptive factors to test in such a future RCT.
In sum, although generally similar efficacy is found for CBT and psychodynamic therapy across larger patient samples, efficacy differences can be apparent in specific subgroups of patients. We identified co-morbid anxiety level and depressive episode duration as possible prescriptive factors associated with differential efficacy of CBT and psychodynamic therapy. If validated, these variables might be used to guide treatment selection, which might improve the efficacy of psychotherapy for depression.
Acknowledgements
This study was financed by an unrestricted research grant by Wyeth Pharmaceuticals, The Netherlands. Arkin Mental Health Care, The Netherlands, financially supported research logistics and the contributions of E.D., J.D., F.J.D., J.P., S.K., D.W., M.H., and H.L.V. F.J.D.'s contributions were also financially supported by ProPersona Mental Health, The Netherlands. VU University, Faculty of Psychology and Education, Department of Clinical Psychology, The Netherlands, financially supported E.D.'s, N.S.'s and P.C.'s contributions to the study. None of the sponsors had a role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; nor in the preparation, review, or approval of the manuscript.
Declaration of Interest
H.L.V. and M.H. have received training fees from RINO, Amsterdam, and VU University, Amsterdam, for respectively lecturing students and clinicians short-term psychodynamic supportive psychotherapy, and H.L.V. is president of the Dutch Association of Psychoanalytic Psychotherapy. F.D. serves on the board of the mood disorders section of the Dutch Association of Cognitive and Behavior Therapy (unpaid). F.D. and J.J.M.D. receive royalties from Springer Media, The Netherlands, for authorship of the published CBT treatment manual. The remaining authors report no financial relationships with commercial interests.