[T]hese three formalisms of counterfactuals [of Lewis, Rubin and Pearl] … are shown to be logically equivalent; a problem solved in one framework would yield the same solution in another. (Pearl et al. Reference Pearl, Glymour and Jewell2016: 126)
1. Introduction
In this article, I seek to encourage comparative research by exploring differences between both Rubin’s Causal Model (RCM) and Lewis’ Theory of Counterfactual Conditionals (LTC) on the one hand, and Pearl’s Structural Causal Model (SCM) on the other. Rubin (Reference Rubin1974) developed a theory of estimation of causal effect sizes based on potential outcomes (alternatively, potential responses), the values of an outcome variable that individuals would have taken on if exposed to a different treatment, taken as ontological primitives. He then worked out methods for estimating treatment effects in a population based on observations of different individuals exposed to different treatments. Pearl (Reference Pearl2009 a) defined counterfactual quantities, which may occur in counterfactual conditionals, as derived quantities in terms of the operations on causal models. He then developed methods for estimating causal effects based on an assumed qualitative causal structure.
Lewis analysed counterfactual conditionals in terms of comparative possibility (Reference Lewis1973 b, Reference Lewis1973 c; broadly characterized as a similarity theory). Roughly, the counterfactual conditional If A then B holds true if and only if B holds true in all the most similar possible worlds in which A holds true. Lewis analysed causation in terms of counterfactual conditionals (Reference Lewis1973 a, Reference Lewis, Collins, Hall and Paul2004). A causally influences B if counterfactual dependence holds between the right sorts of events based on the right sorts of counterfactual conditionals. Causation is the ancestral of causal influence. Pearl (Reference Pearl2009 a) analysed a subset of counterfactual conditionals in terms of manipulations of causal graphs or systems of causal equations. Roughly, If A = a then B = b holds true if suspending the modelled causes of A and fixing A to a in the model results in the equation for B yielding the value b. SCM represents this as A Do(B = b). One can interpret SCM as offering a non-reductive theory of causation or as taking type-level causation as an unanalysed primitive. (The online supplementary materials to this article contain a brief introduction to RCM, LTC and SCM.)
These broad differences may appear to take equivalence off the table but equivalence claims occur and hold the potential to mislead researchers.Footnote 1 These statements trace back to Galles and Pearl who concluded ‘for recursive models, the causal model framework does not add any restrictions to counterfactual statements beyond those imposed by Lewis’s framework’ (1998: 169). Galles and Pearl (Reference Galles and Pearl1998: 174) further asserted that because one can characterize certain counterfactual conditionals using recursive structural causal models, ‘This establishes, in essence, the formal equivalence of structural equation modeling [meaning SCM] … and the potential-response framework’.Footnote 2 Rubin’s, Lewis’ and Pearl’s work present three distinct bundles of theory (Table 1). Rubin offered a theory of causal estimation but not of counterfactual conditionals or of causation. So, I consider only a posited equivalence between Rubin’s and Pearl’s theories of causal estimation. Likewise, Lewis (Reference Lewis1973 a) analysed causation in terms of counterfactual conditionals whereas Pearl (Reference Pearl2018; Pearl and Mackenzie Reference Pearl and Mackenzie2018) contrasts his account with reductive theories of causation. I therefore consider only posited equivalence between Pearl’s and Lewis’ theory of counterfactual conditionals to the exclusion of their theories of causation.
Table 1. Different topics addressed by different authors

* Equivalence claim between elements in the same row.
As an analogy, suppose that a theory of brotherhood states that x is a brother to y iff x is a male sibling of y and a theory of sisterhood states that x is a sister of y iff x is a female sibling of y. I shall designate such theories as sharing weak equivalence in the sense that one can re-interpret ‘brother’ to mean sister and ‘male’ to mean female to state the sisterhood theory using the notation of the brotherhood theory. However, no-one would take this to mean that the brothers and sisters are the same thing. I shall reserve strong equivalence to describe two theories that say the same thing in different ways, such as the brotherhood theory expressed in English and expressed in French. The positive arguments offered for the equivalence of RCM to SCM and of LTC to SCM (Galles and Pearl Reference Galles and Pearl1998; Pearl Reference Pearl2009 a) follow the same form. They express SCM in the notation of the other theory, and then show that SCM in borrowed notation is equivalent to SCM in native notation.Footnote 3 As such, they demonstrate at most weak equivalence but not strong equivalence. This analysis of the arguments highlights an important methodological point. Both arguments rely on SCM to represent its relation to other theories. This strategy builds the limitations of SCM into the resulting comparison and likewise filters out aspects of the rival theory that do not readily translate to SCM. Successful analysis of the relationships between rival theories instead requires an external vantage point. Expressibility in one of two rival theories under comparison should never serve as either an admissibility criterion or intelligibility criterion for assertions in a discussion comparing them. Why is this important? Presenting SCM as a successor to RCM and LTC invites an interpretation claiming equivalence in the sense that SCM supersedes RCM and LTC (compare Pearl Reference Pearl2009 a: Ch. 7 and Pearl and Mackenzie Reference Pearl and Mackenzie2018: Ch. 8). This supersession narrative refers to the substantive theories themselves and not merely to the formal language or notation in which they are expressed. A mere equivalence of languages would not suggest the obsolescence of theories expressed using them if the theories differ in content. Equivalence claims warrant attention because of their implications for comparative research.
Upon presenting a theory, it is customary to consider some form of comparative analysis with respect to the theory’s rivals. This can involve comparing the relative scope of application of the theories. In the case of linguistic or semantic theories it can involve a comparison of fitting established usage and handling difficult cases. Increasingly, philosophers are citing empirical results to supplement linguistic intuitions. In the case of theories regarding scientific methodology, it can involve the relative performance of research procedures recommended by each theory or the comparative scope of application of two methods. Yet, over two decades and many publications, such systematic comparative analyses have not emerged in the literature on SCM in relation to RCM or LTC. Hausman (Reference Hausman1998) and Woodward (Reference Woodward2003) clearly distinguished LTC from their contributions to the metaphysics of causation underlying SCM and the related work of Spirtes et al. (Reference Spirtes, Glymour and Scheines2000) and Briggs (Reference Briggs2012: 140) argued that ‘the divergence between the two approaches is larger than commonly recognized’. Imbens (Reference Imbens2019) gently addressed Pearl’s criticisms of RCM. However, on the whole, methodological literature tends to either treat them as interchangeable or focus on one to the exclusion of the other. The equivalence claims arguably function rhetorically to quell such expectations for comparative research. However, justifying this lack of attention would require strong equivalence, not just weak equivalence.
It is worth stepping back to note that even if these two rival theories were strongly equivalent to SCM, the need for comparative research would remain in place. LTC differs from Stalnaker’s (Reference Stalnaker and Rescher1968) theory and Bennett’s (Reference Bennett2003) theory, among others. Likewise, a theory of effect estimation rests on a theory of causation and a myriad of approaches to analysing causation differ from Pearl’s non-reductive approach and from the manipulation theory suggested by RCM (Schmaltz Reference Schmaltz2014). Hoover’s (Reference Hoover2001) emphasis on dispositions differs fundamentally from the Humeanism implicit in both RCM and SCM. Indeed, Lewis’ (Reference Lewis1973 a) theory of causation, as distinct from LTC, would still differ from Pearl’s notion of causation even if they shared the same theory of counterfactual conditionals. Moreover, modifications of RCM have rendered Steyer’s (Reference Steyer2005) approach to effect estimation sufficiently different from RCM that the two are not equivalent. It is not possible for SCM to simultaneously retain equivalence with multiple non-equivalent alternatives. As such, even if the equivalence claims held for RCM and LTC, they still would not warrant a moratorium on comparative research. However, the next two sections will show that RCM and LTC do differ from SCM.
2. SCM and RCM on causal effect estimation
Much of what has been written about the relationship between RCM and SCM from the SCM perspective is ambiguous because it conflates RCM with potential outcomes notation. This distinction is crucial to the distinction between strong and weak equivalence. Showing that potential outcomes notation can express both theories does not suffice to show strong equivalence between the theories.
2.1. Strong equivalence between RCM and SCM
The following expression lies at the centre of the relationship between RCM and SCM (Pearl Reference Pearl and Hoyle2012: 75; cf. Pearl Reference Pearl2009 a: eqn 3.51): Y x(u) ≜ Y Mx(u).Footnote 4 The left-hand side represents in Holland’s (Reference Holland1986) notation the response by participant u measured on variable Y under treatment condition x. The right-hand side represents the value of Y | Do(X = x) relative to model M. The interpretation of u in SCM varies between an equivalence set of participants having the same values on all disturbance terms, or simply one such respondent. Although the former appears to be the strict interpretation under SCM, the latter is more charitable toward claims of equivalence. Even allowing some flexibility regarding u, however, the lack of equivalence is clearly evident in the presence of M on the SCM side but not the RCM side. The RCM interpretation of the left-hand side refers to the world being studied and the SCM interpretation of the right-hand side refers to a model rather than the world being modelled. This reflects the fact that potential outcomes denote different things in RCM than they do in SCM (Rubin Reference Rubin1974; Pearl Reference Pearl2009 a). Neither RCM nor SCM internally distinguishes these two kinds of potential outcomes. As such, either the equality in the above expression fails or the left-hand side fails to correspond to potential outcomes in RCM. Either way, RCM and SCM cannot bear strong equivalence because shared referring terms refer to different things in each.
One argument that might be offered to support equivalence is that because SCM has been shown to be complete, it excludes any other interpretation of causal discourse (Pearl Reference Pearl2009 a). SCM is only complete with respect to the limited domain of discourse that it seeks to characterize and this completeness does not exclude other forms of causal discourse that it fails to characterize (Markus Reference Markus, McKay Illari, Russo and Williamson2011). The scope of the semantic completeness of SCM is limited by its representational incompleteness. The remainder of this section unpacks various aspects of this non-equivalence, involving model semantics, causal interpretation and research practice.
2.1.1. Model semantics
Both model misspecification and implicit accessibility relations distinguish RCM from SCM.
2.1.1.1. Model misspecification
Pearl’s tendency to assume a correctly specified model in his exposition obscures an important corollary of the different ways of understanding potential outcomes. Pearl’s expression for the consistency rule, X(u) = x ⊃ Y(u) = Y x(u), describes a model. Potential outcomes in RCM describe the reality being studied, these determine the causal effect one wishes to estimate. Suppose for a moment that the model incorrectly represents reality. Let Y x(u) represent the actual potential outcome and let Y*x(u) represent the counterfactual quantity predicted by the model. If X = x, then the consistency rule asserts that Y(u) = Y*x(u) whereas RCM asserts that Y(u) = Y x(u). By hypothesis, let the inaccuracy in the model yield Y*x(u) ≠ Y x(u). RCM and SCM then produce distinct claims. Significantly, one needs to extend the notation of SCM to express this difference.
2.1.1.2. Implicit accessibility relations
The contrast between the characterization of treatments in each approach illustrates ways that SCM can include possibilities excluded by RCM. Conversely, the contrast between the characterization of stochastic independence assumptions illustrates the opposite. Consider an example of a compound intervention involving X and Z. The treatment condition includes X = 1 and Z = 1 whereas the control condition includes X = 0 and Z = 0. Within RCM, the clear description of the treatments plays a fundamental role in determining the range of possibilities considered (Imbens and Rubin Reference Imbens and Rubin2015). Drawing on the language of modal logic, one can frame this as the scope of possible worlds under consideration (i.e. deemed accessible). One can represent the compound intervention with the following algebraic model.

For illustration, and without loss of generality, suppose that the actual world is an X = 0 ∧ Z = 0 world. Within RCM, the space of possible worlds partitions into two types: X = 0 ∧ Z = 0 worlds and X = 1 ∧ Z = 1 worlds. Worlds with mismatched values for X and Z fall outside the scope of possible worlds (or are inaccessible worlds) because they correspond neither to the treatment or the control (cf. Rubin Reference Rubin1974: 689). However, SCM assumes modularity as part of the definition of manipulations as modifications of structural causal models. Thus within SCM, the worlds in which X and Z differ remain accessible by assumption. In the context of a specific study, ‘Necessarily X = Z’ holds true from the perspective of RCM but not SCM (cf. Woodward Reference Woodward2016).
If we allow collapsing two variables into one, we lose with one hand what we gain with the other. With two variables, SCM can express the non-identity (distinctness) of the two variables but not their necessary numeric equality. If we allow such replacements and reduce the SCM representation to one variable, then SCM can express necessary numeric equality (in a sense) but cannot express non-identity. As such, we arrive at an assertion that SCM cannot express by forming the conjunction of these two claims: X and Z are non-identical but necessarily numerically equal. This is precisely the kind of theoretical claim that practicing researchers might like to make. Indeed, the Stable Unit Treatment Value Assumption (Rubin Reference Rubin1980) can be interpreted as entailing precisely this sort of claim: If the treatment involves taking aspirin and not taking a placebo researchers plausibly wish to assert that in the experiment participants either took aspirin but not the placebo or took the placebo but not aspirin without being committed to the implausible claim that taking aspirin is identical to not taking the placebo. This is because outside the context of the study, plenty of people take neither. This difference reflects the difference between a narrow representation of treatment in a study in RCM and the broader aim to represent causal generalizations in SCM (see section 2.1.3.1).
Conversely, consider the assumption that Cov(U X, U Y) = 0. SCM takes such assumptions as fixed givens. If sufficient independences hold, they provide the grounds for causal inference within SCM. However, as a more thorough Bayesian, Rubin wants to incorporate uncertainty about such assumptions into the analysis. A Bayesian might express this uncertainty in the form of a prior distribution for the covariance that might centre around zero but still show some dispersion around it. Thus, in SCM the modal assertion ‘necessarily Cov(U X, U Y) = 0’ holds true whereas in RCM it does not and instead its negation, ‘possibly Cov(U X, U Y) ≠ 0’ holds true. To be clear, nothing prevents SCM from expressing Cov(U X, U Y) ≠ 0 as a different model.Footnote 5 Rather, the point is that RCM and SCM commit themselves to different modal claims in accepting the above model. That is, the same model is interpreted differently within RCM and SCM.
Consider two possible worlds other than the actual world. In ω1, X = 0, Z = 1 and Cov(U X, U Y) = 0. In ω2, X = 1, Z = 1 and Cov(U X, U Y) ≠ 0. In SCM, ω1 but not ω2 remains accessible from the actual world, and indeed ω2 is accessible from no worlds accessible to the actual world. Conversely, in RCM, ω2 but not ω1 remains accessible from the actual world, and ω1 is accessible from no worlds accessible to the actual world. As such, the two interpretations of the model yield different modal assertions. Because the range of possibilities varies between RCM and SCM, assertions that hold true in one may hold false in the other. Jointly, these differences demonstrate that neither RCM nor SCM constitutes a special case of the other.
2.1.2. Causal interpretation
Treatment interventions, manipulation and casual effects further distinguish RCM from SCM.
2.1.2.1. Treatment interventions
RCM and SCM differ in a number of inter-related ways involving the conceptualization and representation of treatment interventions. RCM does not assume that the researcher has full knowledge of what may be a plethora of antecedent variables that may moderate the treatment effect. Inference is thus limited to the treatment under study in combination with the pre-treatment histories of the study participants (Rubin Reference Rubin1978). In contrast, SCM assumes prior knowledge of at least the qualitative causal structure of all relevant variables, typically represented as being relatively few in number, exemplifying relatively simple and isolated causal systems. Whereas RCM treats generalization of causal effects as a complex empirical undertaking, SCM reduces it to a relatively straightforward logical exercise based on this assumed qualitative causal knowledge (Pearl and Bareinboim Reference Pearl and Bareinboim2014).
Correspondingly, RCM emphasizes detailed description of real-world interventions whereas SCM emphasizes interventions as simplified abstractions in a formal calculus. For example, Rubin (Reference Rubin1978) placed comparable value on non-reactive double-blind treatments and non-double-blind reactive treatments, cautioning researchers not to assume that they have the same effects. In contrast, Pearl (Reference Pearl2009 a) considered effects of treatments on other variables, but generally assumes that treatments are non-reactive in the sense that the intervention itself only affects the manipulated treatment variable. This non-reactivity is implicit in the way that SCM separates interventions from the model by treating them as operations on the model itself. However, if one were to add an intervention variable to the model, reactivity would violate the consistency rule. For example, consider economic behaviour that differs depending on whether it is freely chosen or imposed by regulatory intervention. RCM encourages the study of such phenomena whereas SCM excludes them, and is thus less general in this respect.
2.1.2.2. Causation and manipulation
A related difference involves the incorporation into RCM of the slogan ‘No causation without manipulation’ (Holland Reference Holland1986) but rejected by SCM (Bollen and Pearl Reference Bollen and Pearl2014; Pearl Reference Pearl2018, Reference Pearl2019). Perhaps under the influence of the equivalence claims, Bollen and Pearl assumed that the slogan must mean the same thing in the context of RCM as in SCM and on that basis they asserted that Rubin and Holland were wrong to make this assertion. Pearl (Reference Pearl2018) characterized manipulation theories as intolerant, which is only intelligible if one assumes that they exclude other notions of causation. It seems much more plausible to instead take this difference as further evidence against equivalence and allow that the slogan can hold true in RCM and false in SCM without contradiction because it means different things in each context. In this case, it suggests that RCM and SCM assume different forms of causation. Holland’s slogan fits comfortably with traditional manipulation theories which make manipulation essential to causation whereas interventionist theories typically differentiate themselves from manipulation theories on precisely this point (e.g. Woodward Reference Woodward2003).
2.1.2.3. Causal effects
One prominent difference between RCM and SCM involves how each defines causal effects. For RCM, a causal effect involves a difference in expected value of the potential outcome between two or more treatments (Rubin Reference Rubin1974).Footnote 6 In contrast, SCM defines causal effects as differences in probability distributions. These two characterizations come apart when we consider two probability distributions that differ in some way other than expected value. One example occurs where one treatment results in greater variation in outcomes than another, even though the expected value remains the same across treatments. In such cases, adopting RCM would lead to the conclusion of no causal effect where SCM would lead to the opposite conclusion even though the facts of the matter are not in dispute. The difference in conclusions simply reflects non-equivalent criteria used to define the estimand. In this respect SCM is more general than RCM.
2.1.3. Research practice
Finally, modelling method versus theory, research design practice, and m-bias distinguish RCM from SCM with respect to research practice.
2.1.3.1. Modelling method versus theory
Many of the above differences relate to a basic difference in representation. SCM seeks to encapsulate general scientific knowledge represented in multi-purpose causal models and use them to guide estimation of various causal effects included in the model. In contrast, RCM instead emphasizes the representation of specific events in the context of a specific study.Footnote 7 Whereas SCM might encourage researchers to re-use the same model in different studies estimating different effects, RCM encourages study specific models to guide the methodological choices specific to each study. Relatedly, RCM seeks to separate the model of treatment from the model of the outcome so that methodological decisions do not depend upon knowledge about outcomes. In contrast, SCM combines treatment variables and outcome variables into one model considered together. Thus, when Pearl criticized RCM for failing to express ignorability assumptions in the language of the causal model (e.g. 2009a, 2012), he framed as a failing something that RCM claims as a central virtue.
In part, the differences between the two approaches to effect estimation may reflect focus on different types of causal systems. SCM tends to focus on examples of stable causal systems whereas RCM tends to focus on examples of systems that develop over time. The canonical example for SCM is a sprinkler set-up in which the effects of various variables in the system remain constant over time, depending at most on the states of other variables in the system. In contrast, the canonical example for RCM involves taking aspirin to alleviate a headache. There is a limited window over the course of the headache in which the causal effect is present: taking aspirin will not prevent the headache and eventually the headache will go away on its own without aspirin. Expositions of causal inference using SCM tend to focus on timeless models that can be applied to any intervention on any of the variables. Rubin (Reference Rubin1978) emphasized the times of treatments and of effects, denying that the same causal model can be reapplied at a later time to allow the same individual to receive contrasting treatments. This is why the Fundamental Problem of Causal Inference, that one cannot observe more than one potential outcome for the same case, plays a more prominent role in RCM than in SCM. Ignoring development over time in a canonical SCM model could lead to the violation of various axioms such as the definition of a model in terms of fixed functional relations, the Causal Markov Condition, Faithfulness, Consistency, or Effectiveness. For example, developmental processes could create co-variances between variables not represented in the synchronic causal structure. Likewise, spontaneous changes in variables due to developmental processes can be taken as violating Effectiveness, X xw(u) = x, because a manipulated variable will not necessarily maintain its manipulated value. One could express a longitudinal time-specific model in SCM-like notation. However, so doing would take one away from the standard SCM methodology for effect estimation and in the direction of RCM. Expressing one methodology in the notation of the other does not alter the substantive differences between them.
2.1.3.2. Research design and scientific practice
Another cluster of differences between RCM and SCM involves the conceptualization of the variables used to formulate potential outcomes. Whereas RCM focuses on designing studies with appropriate variables to answer a fixed question, SCM focuses on finding questions that can be answered with a fixed set of variables. As such, Pearl tends to take a set of variables serving as nodes in a causal graph as absolute givens (contrary to Woodward Reference Woodward2016) whereas Rubin advises the researcher to consider modifying these to optimize the extent to which the study meets assumptions required for causal inference. For example, Imbens and Rubin (Reference Imbens and Rubin2015) suggest modifying the granularity of the outcome variable (Y) to maximize the plausibility of assumptions required for causal inference. This difference can be contextualized as an element of scientific realism in RCM that does not translate well to SCM: Rubin clearly views the functional relations between units, treatments and potential outcomes as specific to the particular measures chosen while recognizing that different measures can support inferences about the same underlying entities. In contrast, SCM tends to accept variables in a specific form as the fundamental objects of causal description. In this respect, different RCM studies can focus on different RCM potential outcomes to answer questions about the same topic whereas for SCM it is more natural to consider studies using different variables as answering questions about different topics.Footnote 8
Finally, standard presentations frame RCM in terms of a process of collective cognition bringing together disputants regarding some issue (e.g. health risks associated with a product such as cigarettes) and providing a process by which they can agree on a design to settle an issue before knowing the result. In contrast, standard presentations frame SCM as representing the cognitive processes of an individual researcher making causal inferences from data in isolation from others. In this respect, SCM comes much closer to classical evaluation practices whereas RCM comes much closer to the emphasis on stakeholder buy-in and utilization found in contemporary evaluation practices (Shadish et al. Reference Shadish, Cook and Leviton1991).
2.1.3.3. M-bias
The controversy surrounding M-bias sheds additional light on differences between RCM and SCM. Shrier (Reference Shrier2008) suggested that M-bias offered a counter-example to Rubin’s advocacy of conditionalization when estimating causal effects. The M-bias example involves a treatment (W) solely a function of a disturbance (U 1), an outcome (Y) solely a function of W and a disturbance (U 2), and a single covariate (X), a function of U 1 and U 2. From the SCM perspective, U 1 and U 2 exhibit unconditional independence and thus there exists no bias in the unconditional estimate of the effect of W on Y. However, conditioning on X introduces stochastic dependence between U 1 and U 2. So, the conditional estimate contains bias (see also Pearl Reference Pearl2009 b; Sjolander Reference Sjolander2009). Rubin (Reference Rubin2009) responded that attempting to estimate the causal effect this way relied on highly improbable happenstance (see also Imbens Reference Imbens2019). Pearl (Reference Pearl2009 c: 3) responded that ‘M -bias is not a phenomenon that depends on finely-tuned, “hoped-for compensating imbalances” as caricatured by Rubin but, rather, a structural property, persisting no matter what parameters are assigned to the various associations in the model’.
Two of the differences considered above shed light on the exchange. When Pearl spoke of independence persisting no matter the values of the parameters in the model, he described the scope of possible worlds defined by the parameter space of the model. Because Cov(U 1, U 2) is not included in the model as a parameter, it necessarily has the value of zero for SCM. However, from the RCM perspective, a researcher adopting a scientific approach should exercise caution by acknowledging some uncertainty about the assumption of stochastic independence between U 1 and U 2. Whereas the SCM strategy uses assumptions such as that about U 1 and U 2 to minimize conditionalization, the RCM strategy does just the opposite, relying on conditionalization to ease the reliance on strong assumptions.
Similarly, the example illustrates SCM’s estimation strategy of seeking questions that can be answered given a fixed set of variables. From the RCM perspective, however, the example presents a poor choice of variables with which to answer the stated question. X offers a poor choice of co-variate because it has no causal effect on W but itself causally results from variables that do cause W. Good research design, from the RCM perspective, would not simply accept a set of variables as given but would rather design a study to include variables that determine W (Rubin Reference Rubin2007). Each author appears to have correctly understood M-bias within the context of his own assumptions, but the example draws into relief the differences between the assumptions of each author.
2.2. Weak equivalence between RCM and SCM
The conclusion that SCM can be expressed in potential outcomes notation in a way that renders it equivalent to SCM expressed in its native notation seems unexceptional. The fact that RCM and SCM can be expressed in potential outcomes notation rests on using the notation to express different things in each case. This underscores the importance of not conflating potential outcomes notation with a particular theory expressed in that notation such as under the ambiguous rubric of a potential outcomes framework.
2.3. Section summary
This section has enumerated differences between RCM and SCM involving semantics, causation and research practice (Table 2). It seems very plausible that such different methodologies have different strengths and weaknesses and different applications. Greater focus on areas of divergence in comparative research can help bring these to light and help to refine methodological advice to researchers.
Table 2. Contrasting two methodologies for effect estimation

3. SCM and LTC on counterfactual conditionals
I will first focus on strong equivalence with LTC and then with the broader family of similarity theories before addressing weak equivalence.
3.1. Strong equivalence with LTC
Three fundamental differences between LTC and SCM stand out as particularly salient: ontological priority, truth makers and vagueness.
3.1.1. Ontological priority
Consider the relations that might hold between the rates set by the Federal Reserve, the level of investment and the growth of the economy. Such relations include counterfactual dependence, type-level causal relations that hold across instances, and token-level causal relations between specific instances. SCM describes type-level causal relations as ontologically basic and counterfactual dependence and token-level causal relations as either derived from type-level causal relations (Halpern and Hitchcock Reference Halpern and Hitchcock2015) or as mutually dependent (Woodward Reference Woodward2003). In contrast, Lewis takes counterfactual dependence as basic, token-level causal relations derived from counterfactual dependence, and type-level causal relations derived from token-level causal relations. Consequently, LTC views scientific laws as non-causal generalizations about matters of fact whereas SCM views scientific laws as general causal statements. Because laws impact similarity between worlds, this yields a further difference. For LTC, causation has no bearing on similarity, whereas when SCM is expressed in similarity terms, similarity is based on causal relationships represented in causal models (Galles and Pearl Reference Galles and Pearl1998). LTC can describe a world with counterfactual dependence but no causation but not vice versa whereas SCM, at least on one interpretation, can do precisely the opposite. LTC is consistent with a world in which all counterfactual dependencies hold between the wrong sorts of events to support causation. Although one might avoid circularity by treating token-level causal relations as dependent on counterfactual dependence and counterfactual dependence as dependent on type-level causal relations (Bennett Reference Bennett2003; Halpern and Hitchcock Reference Halpern and Hitchcock2015), Lewis scrupulously avoided a causal theory of conditionals (Lewis Reference Lewis1986 b pace Galles and Pearl Reference Galles and Pearl1998; cf. Lewis Reference Lewis1999: Ch. 15; Hall Reference Hall2004; Briggs Reference Briggs2012) to safeguard against circularity in his counterfactual theory of causation. In contrast, Pearl dismisses counterfactual theories of causation as circular based on his causal theory of counterfactuals (Pearl and Mackenzie Reference Pearl and Mackenzie2018). One interpretation of SCM might allow causal structures that produce no counterfactual dependence. A more natural interpretation might require that causation and counterfactual dependence always occur together. Either interpretation differs from LTC.
Counterfactual dependence is not limited to causal dependence but can also derive from logical dependence, means-end relations, relational properties, constituent properties (Kim Reference Kim1974), supervenience (Woodward Reference Woodward2015; Baumgartner and Gebharter Reference Baumgartner and Gebharter2016), grounding (Schaffer Reference Schaffer2016) and constitutional dependence (Starr Reference Starr2019). LTC accommodates non-causal counterfactual dependence (Lewis Reference Lewis1986 b: Ch. 23). If we take SCM at face value (e.g. the Causal Markov Condition is causal), then it denies any non-causal counterfactual dependence. However, it is not obvious that structural models capture only causal dependence as opposed to dependence in general, in which case SCM is not limited to causation as advertised. However one resolves this structural ambiguity in SCM, it differs from how LTC handles non-causal dependence.
Ontological differences might not matter to policy analysts if both theories deemed the same conditionals as true, but they do not. Examples involving non-causal laws produce different results. Consider ‘If this patch of ground had not contained talc it would not have contained asbestos’ in a world where the two co-occur as a matter of non-causal law. Because violations of law produce less similarity than small local violations of fact, LTC judges the conditional true. Because there is no causal link between the presence of talc and the presence of asbestos, SCM judges the conditional false, or simply lacks the ability to represent the problem.
3.1.2. Truth makers
A second but related difference involves truth makers for counterfactual claims. Although it took time for readers to absorb the claim, Lewis held that every possible world existed in just the same way as our own (Lewis Reference Lewis1986 a) and that while no information can flow between worlds, we can have knowledge of what is possible or necessary as modal expressions understood in our world (cf. Bennett Reference Bennett2003: Ch. 10). Pearl has tended to focus on correctly specified models, making it unclear whether the truth makers of counterfactual conditionals are the model itself or what it represents (Markus Reference Markus, McKay Illari, Russo and Williamson2011; Cartwright Reference Cartwright2017). However, even if we take the latter interpretation, he clearly rejects the idea of modal expressions depending upon other worlds for their truth values (e.g. Pearl and Mackenzie Reference Pearl and Mackenzie2018: Ch. 8) presenting SCM as a more favourable alternative. One might reconstruct LTC to adjust it to Pearl’s ontology in this respect but, again, the result would no longer be LTC but rather an alternative theory. Moreover, Lewis did not adopt this view lightly but did so in response to arguments against the view that our actual world exists in a manner not shared by other possible worlds. Lewis argued that for any recursively defined language, there will be more propositions required to fully describe the world than sentences available to express them (Lewis Reference Lewis1973 b: Ch. 4). This poses a challenge for the idea that one could construct a structural causal model ‘of the entire world’ (Briggs Reference Briggs2012: 147) to bring SCM into alignment with LTC. Moreover, such a modification would conflict with typical uses of structural causal models applied to small sets of variables.
3.1.3. Vagueness
Lewis accepted vagueness as an inherent part of counterfactual conditionals in natural language use and attempted to match the appropriate level of vagueness in his truth conditions based on similarity (1973b: Ch. 4). In contrast, Pearl sought to replace vagueness with precision (Pearl Reference Pearl2009 a; Pearl and Mackenzie Reference Pearl and Mackenzie2018). LTC offers a descriptive theory constrained by natural language whereas SCM offers a prescriptive theory that seeks to improve upon natural language. As such, the two theories disagree regarding not just specific truth conditions but also on the general character of those truth conditions.
3.2. Strong equivalence with other similarity theories
LTC dramatically differs from SCM regarding counterfactual conditionals. Starr (Reference Starr2019) groups similarity theories with strict conditional theories as placing greater emphasis on logical truths than determining the truth of specific counterfactual assertions but he groups causal model approaches with two other types of theories that have the opposite emphasis. Other similarity theories have more in common with SCM by endorsing causal laws (Jackson Reference Jackson1977; Bennett Reference Bennett2003) or the Conditional Law of Excluded Middle (Stalnaker Reference Stalnaker, Harper, Stalnaker and Pearce1981) or alternative accounts of possible worlds (Stalnaker Reference Stalnaker1976). Nonetheless, such similarities do not negate the differences, nor do they reduce the value of comparative research.
3.3. Weak equivalence
As with RCM, it seems natural that one can re-express SCM in notation borrowed from LTC. This does not diminish the value of comparative research given that the theories are distinct even when expressed in the same notation.
Despite the fact that the constituent terms in LTC and SCM differ in meaning and reference, it still holds interest to compare their formal structure. Lewis (Reference Lewis1973 b: Ch. 5) considered whether his logic of comparative possibility might have applications in other areas without suggesting that these areas would thereby reduce to counterfactual conditionals. They simply involve distinct domains that may share a logical isomorphism similar to that between the brotherhood theory and the sisterhood theory from the Introduction. We now know that neither the logical system defined by Lewis’ axioms for comparative possibility nor an axiomatic representation of SCM constitutes a special case of the other (Halpern Reference Halpern2013; Zhang et al. Reference Zhang, Lam and De Clercq2013). Setting aside hyperbolic paraphrases, Galles and Pearl only claimed that SCM imposes no further restrictions on LTC when restricted to recursive models. This claim is analogous to stating that propositional logic imposes no further constraints on predicate logic when applied to formulae constructed only by the conjunction, disjunction and negation of atomic propositions. This merely shows that the restrictions imposed on the comparison by recursive models already impose the restrictions inherent to SCM. LTC is not limited to causal models and thus does not share these restrictions.
Some of the divergence between the two can be seen in the following examples. (A
→ B) ∨ (A
→ ∼B), the Conditional Law of Excluded Middle: This is not implied by LTC but is implied by the Existence axiom in SCM, and by the grammar of causal models. Lewis (e.g. Reference Lewis1973
b) offered arguments against it based on ties or an infinite series of more similar worlds. (A & ∼A)
→ A: LTC judges this unambiguously true whereas it violates the grammar of causal models and possibly the Effectiveness axiom of SCM. It thus falls into a truth value gap in SCM. A
→ ◊A: This is a common inference pattern in empirical sciences where an experiment proves something possible. However, nothing causes itself to be possible, so the conditional must rely on some form of non-causal dependency (Kim Reference Kim1974) excluded by SCM. Moreover, this may fall into a truth value gap because SCM lacks translation rules for modal operators. (A
→ (B
→ C)) ≡ (A ∧ B)
→ C), Import-Export: Natural extensions of SCM to handle iterated conditionals imply this whereas LTC does not (Briggs Reference Briggs2012; Fisher Reference Fisher2017). McGee (Reference McGee1985) discussed a substitution instance that greatly illuminates the difference in intuitions here: A
→ (B
→ ∼A). Any interventionist theory such as SCM would see this as a violation of Effectiveness because the initial antecedent manipulates A to true and thus it must remain so, contradicting the embedded consequent. However, from the perspective of LTC, the initial A is just a temporary context from which the embedding conditional is interpreted, but that need not carry over to the embedded conditional. Modifying an earlier example, imagine A = talc present and B = asbestos present. From the LTC perspective, the actual world is a ∼A ∧ ∼B world and the nearest world is an A ∧ B world. These two worlds are closer to one another than to other worlds because they both conform to the law that talc and asbestos occur together. To evaluate the conditional A
→ (∼B
→ ∼A), we first move to the nearest world to satisfy A, then, to evaluate the nested conditional, we move back to the actual world to satisfy ∼B. The conditional comes out true because the actual world is a ∼A world. In contrast, SCM would combine the antecedents to take us directly to the nearest A ∧ ∼B world where ∼A holds false, and thus the conditional comes out false. (Figure 1; Further details in online supplement.)

Figure 1. Contrasting Evaluations of the Nested Counterfactual Conditional A → (∼B
→ ∼A).
3.4. Section summary
LTC and SCM differ substantially (Table 3). They have different implications for reasoning about economic policy and other conditionals. There remain many unanswered questions for comparative research applying these and other theories of counterfactual conditionals to economic research. To appreciate the differences, it is important that such comparative research consider the content of the theories and not just uninterpreted formal calculi. Identical formal expressions can mean very different things in each theory.
Table 3. Contrasting two theories of counterfactual conditionals

4. Anticipated objectionsFootnote 9
One might object that SCM could be modified or extended to make either LTC or RCM equivalent to it. Such objections frequently focus on the extension of notation, which is why it is important to remember that the notation does not determine the content expressed with it. Such objections can also run afoul of parity arguments if they are offered in favour of one theory replacing another when either theory can be symmetrically extended to encompass the other. If modification is unconstrained, then any formal system can be modified to make it equivalent to any other. For the present article, I limit SCM to what Pearl and associates have presented it as and limit equivalence claims to actual equivalence to SCM without modification.
One might anticipate the objection that once a formal proof has been provided, objections to the conclusion can only represent failures to understand the proof. This objection rests on a misunderstanding of formal methods. The power of formal methods rests not in their implanting a Cartesian tree of knowledge that places existing results beyond criticism or revision. To the contrary, their power rests in the ability of formal methods to accelerate progress through theory revision by making it easier to discover and correct flaws and limitations (Pollock Reference Pollock1990). Moreover, agreement on the interpretation of the symbols constitutes a fundamental precondition for all formal methods. In the case of the equivalence claims, such agreement was lacking and premature formalism impeded progress by masking the differences.
One might also anticipate an objection that SCM represents the sole modern terminology for discussing causation and counterfactual conditionals, antiquating earlier approaches except to the extent that they are absorbed into SCM. This entails that the only interesting sense of equivalence consists of one internal to SCM that evaluates conformity of another theory to SCM. The criticism relies on circular reasoning given that it assumes what the critique of equivalence claims draws into question. More broadly, interpreting everything from within one perspective and systematically discounting anything external to that perspective impedes progress. Adopting such a strategy insulates the target theory from effectively responding to criticism rather than maximizing the benefits of such criticism. Science and philosophy both depend upon the effective use of critical feedback as a correction mechanism for theories.
A defender of equivalence might concede that LTC and RCM have interpretations that differ from SCM and still argue that one should dismiss such interpretations as inferior to interpreting them in a way that conforms with SCM. The objection asserts equivalence as normative rather than descriptive. Even if, as one might expect, SCM fulfils Pearl’s goals better than RCM or LTC, the previous sections give reason to believe that there exist other goals, outside of SCM, that RCM and LTC fulfil better. As such, one finds no systematic argument that the goals of SCM exhaust the goals that might motivate interpretations of RCM and LTC outside of SCM.
An objection might draw from the various criticisms Pearl offers of LTC and RCM for not sharing one or another feature of SCM, or not having addressed some problem addressed by SCM (e.g. Pearl Reference Pearl2009 a, Reference Pearl and Hoyle2012; Bollen and Pearl Reference Pearl and Hoyle2012; Pearl and Mackenzie Reference Pearl and Mackenzie2018). Such objections again fail to maintain parity by making SCM the arbiter between itself and its rivals. Once one recognizes LTC and RCM as distinct programmes with their own distinct objectives, such arguments do not provide a reason to abandon those objectives in favour of SCM. Rather than framing them as competitors in a zero-sum game, it seems much more productive to approach them as distinct streams of research that can strengthen and benefit from one another (Markus Reference Markus2013, Reference Markus2016). A similar reply applies to any such objection constructed through a double-standard, dismissing LTC or RCM for reasons that also apply to SCM.
Finally, one can imagine the critique of equivalence claims being mis-characterized as hostile toward SCM. This objection rests on circular reasoning by presupposing that the equivalence claims play an essential part in the value of SCM. It also bears emphasis that the suppression of comparative research resulting from equivalence claims potentially hinders effective development of SCM as much as it does LTC and RCM. The critique of equivalence claims thus represents as much an effort to protect SCM from the harmful effects of equivalence claims as to protect its rivals.
5. Conclusion
SCM differs from both RCM and LTC in ways that impact prescriptive methodology for research practice. Comparative research can certainly incorporate analysis of commonalities between the theories. Given the substantial differences between SCM and each of the two rival theories, however, researchers should avoid mixing methodological advice between rival theories and methodologists should avoid mixing theorems between rival theories. Even when the same formal expression holds across theories, its interpretation may differ. As such, I hope that critical evaluation of the equivalence claims can re-open the door to greater exploration of the comparative strengths and weaknesses of the rival theories.
Acknowledgements
An earlier version of this article was presented at the International Meeting of the Psychometric Society, 18 August 2017, in Zurich, Switzerland. I am grateful to Rachel Perlin for comments on an earlier version of this article.
Supplementary material
To view supplementary material for this article, please visit https://doi.org/10.1017/S0266267120000437
Keith A. Markus is Professor of Psychology at John Jay College of Criminal Justice of The City University of New York and also a member of the faculty of the Graduate Center of The City University of New York. He is coauthor with Denny Borsboom of Frontiers of Test Validity Theory: Measurement, Causation and Meaning. His research focuses on causal explanation, causal inference, and test validity. URL: http://jjcweb.jjay.cuny.edu/kmarkus/.