Politics is about sides. It is about knowing your friends and your enemies; it is about groups, sharing goals and values. The most general groups are defined by their side of the center, and we thus speak of liberal and conservative voters, as well as left- and right-wing parties.Footnote 1 Commentators do so as a matter of course, and it would be strange if voters were any different. Yet none of the existing theories of spatial voting capture this logic. The dominant proximity theory simply posits that voters will prefer the party closest to themselves.Footnote 2 The alternative directional theoryFootnote 3 divides the political space into two directions of policy change, but, as we show here, its theoretical underpinnings are substantively different and it fails to capture the effect we hypothesize.
We offer a categorization theory of spatial voting, starting from the observation that individuals simplify choices by classifying objects into categories.Footnote 4 Several lines of research have shown that categorization tends to influence how objects are perceived. In particular, individuals tend to underestimate the differences between observations falling in the same category and exaggerate the differences between those falling in different categories.Footnote 5 Social identification theory extends this idea to social categorizations, holding that individuals derive their identities from group memberships and therefore exaggerate the differences between group insiders and outsiders.Footnote 6 We argue this also applies to spatial voting: ideological categorization is not only an information shortcut; self-categorization is also an expression of identity.
Our resulting model of spatial voting generates a set of testable predictions, which qualify the theoretical foundations of existing spatial theories. First, rather than just operating as ideal-point seekers along continuous policy dimensions, voters form preferences over coarse categories. Voters’ loss functions therefore also reflect whether parties fall into the same broad category as themselves. Parties falling in the same category as the voters receive a bonus beyond what would be predicted by other spatial theories.
Secondly, when voters have multiple parties on their side, they are forced to apply more specific criteria. No such decision rule is needed for parties on the opposite side. Thus, although categorization theory is agnostic about the exact criterion used to choose between parties on a given side, it does anticipate that this criterion will be more important in distinguishing between options within the preferred category. In our analyses, we find a clear proximity pattern in addition to categorization effects, and we would expect this effect to be stronger for parties on the same side as voters. In this sense, our theory does not replace the existing ones; it rather refines and qualifies them by adding a component that has so far been overlooked.
Thirdly, voters’ efforts to rationalize non-ideological party preferences by letting these influence their perceptions of party positions – a phenomenon known as projection bias – should also be characterized by a category-based discontinuous pattern. The drive to exaggerate a preferred party’s proximity to one’s own ideological views will be stronger when doing so also implies placing it on one’s own side of the center.
Our empirical analyses test all these implications of the theory. Before we delve into the tests, however, we elaborate on the theoretical underpinnings of categorization. We begin by briefly presenting the main theories of spatial voting, before introducing our categorization theory and showing how other models fail to capture the effects we predict. This is followed by a set of empirical analyses and robustness checks. The conclusion elaborates on the implications of our findings.
EXISTING THEORIES
Spatial theories of voting share the assumption that voters are rational and will choose the alternative that gives them the highest utility based on spatial considerations. They differ, however, in the utility functions that translate the positions of voters and parties into utility losses. The most straightforward model of vote choice, the ‘proximity model’, holds that individuals vote for parties with positions most similar to their own.Footnote 7 For a given political dimension, we calculate the Proximity Term for voter i and party j as the absolute distance between the position of the voter (V i ) and the position of the party as perceived by the voter (P ij ), that is, |V i −P ij |.Footnote 8 This term is expected to have a negative coefficient, as it captures distances that voters are expected to penalize. The utility curve implied by the model peaks where voters and parties are at the same position. This is shown in the left panel of Figure 1, which displays the utility for voters located at −2, and +1, on a dimension from −5 to +5, as a function of a party’s position along the same dimension.
Rabinowitz and Macdonald present a competing ‘directional model’, drawing on earlier work on issue perception and symbolic politics.Footnote 9 The authors argue that the assessment of political parties or candidates reflects two considerations. The first is ‘whether the individual and candidate are in agreement about the direction public policy should take’.Footnote 10 If individuals and parties prefer the same direction, this will contribute positively to party assessment; if they do not, the contribution will be negative. Secondly, the authors argue that the strength of the contribution to party assessment will be an interaction of the intensities with which the voter and party are emphasizing a given direction: the further they both are from the ‘neutral point’, the stronger the response. The neutral points represents the point at which no desired policy change is implied, and is commonly operationalized as the center of the scale.Footnote 11
Following Rabinowitz and MacdonaldFootnote 12 we thus calculate the Directional Term as V i ×P ij , where the notation is the same as above, and V i and P ij are centered on the middle of the scale. As shown in the right panel of Figure 1, for a voter at a given position, the directional utility of voting for a party is a linear function of the party’s position. If the voter is on a given side, the utility will increase linearly as the party moves toward that side, crossing 0 when the party crosses the center. The role of the voter position is to change the slope of the function: the further the voter moves from the center, the steeper the slope. The same holds for party positions, of course: they change the slope of the utilities as a function of voter positions.Footnote 13
Both in theoretical discussions and empirical studies, the intensity aspect of directional theory has overshadowed the directional aspect. The motivation behind most work in this area has been to solve the puzzle of the ‘empty center’: the question of why parties and candidates appear to be more extreme than voters.Footnote 14 This emphasis on the intensity part of the model has also motivated the accumulation of numerous mixed models of spatial voting. Perhaps most important among these is Grofman’s discounting model, which has been further elaborated in subsequent work.Footnote 15 The key idea is that parties are unable to fully implement their proposals, due to within-party dynamicsFootnote 16 or systemic constraints in multiparty systems with coalition governments.Footnote 17 Thus voters are led to prefer parties more extreme than themselves in order to get the amount of policy change they would like. These models are all based on a proximity calculus, adjusted to accommodate the discrepancy between parties’ programs and actual policies.Footnote 18 We leave these additional models aside here, as they do not capture the effects we hypothesize to any greater extent than the two main theories that we focus on.
In addition to the development of mixed models, a vast literature has emerged that attempts to assess the relative merits of the proximity and directional theories.Footnote 19 Lewis and King argue, however, that arguments about which theory fits best tend to be based on assumptions that are hard or impossible to test.Footnote 20 Recently, several experiments have brought the debate forward, concluding overwhelmingly in favor of proximity theory.Footnote 21 Overall, there is little doubt that proximity theory has considerable predictive power in voting behavior. We argue, however, that the existing theories of spatial voting, including proximity theory, overlook an important pattern that reflects categorization effects.
CATEGORIZATION THEORY
Research in cognitive science has repeatedly demonstrated that humans use coarse perceptual schemas that simplify information processing.Footnote 22 Under conditions of uncertainty, the categorization of objects permits the use of simple heuristics to make predictions regarding unknown object features.Footnote 23 Work in judgement theory in social psychology has further shown that merely placing objects in categories influences individuals’ perceptions and evaluations of these objects.Footnote 24 This form of bias is known as accentuation and involves a tendency for individuals to underestimate the differences between observations falling into the same category and to overestimate the differences between those falling into different categories.Footnote 25 Furthermore, social identification theory holds that such effects are involved in the formation of social identities, as individuals not only categorize objects, but also people, including themselves.Footnote 26 Because individuals partly derive their identities from self-perceived group memberships, they tend to attach strong feelings to such memberships, which in turn leads them to focus on, and exaggerate, the differences between group members and outsiders.Footnote 27 In other words, identity formation is based on self-categorizations, which in turn leads to in-group favoritism.Footnote 28
We argue that these findings are relevant to spatial voting, because voters operate with basic mental categories representing the sides of the political center. In particular, this applies to the most important political dimension in a given political system, yielding two encompassing categories. In most countries, it is common to speak of voters and parties as being either left-wing or right-wing, and these labels provide shortcuts to richer descriptions of the actors’ traits.Footnote 29 As with other stimuli, the categorization of political actors according to their side of the ideological center is likely to involve the use of prototypes.Footnote 30 A prototype left-wing actor may, for example, emphasize social and economic equality and express a willingness to use public policy to promote these values, while a right-wing actor may emphasize individual responsibility and freedom from government intervention. Thus categorization is also accompanied by issue packaging, permitting the use of coarse party preferences as a heuristic. Even if specific issue dimensions might allow refined distinctions between parties, they are eventually embedded within a coarse, category-based assessment framework. Theories of party branding exemplify this logic, suggesting that voters perceive parties’ positions along salient dimensions as signals of what they generally stand for.Footnote 31
Drawing on these arguments, we expect voters to assess and be conscious of their own side of the ideological center. As ideological positions tend to be highly consistent over time,Footnote 32 we further assume that these self-categorizations matter for voters’ identities, expressing some of their most fundamental values and attitudes. As a consequence, we expect voters to assess whether a given party falls in the same category as themselves – in other words, whether the party is on their side, being one they broadly agree with and potentially could vote for. The first empirical implication of our theory is therefore that parties receive a more favorable assessment when they are on the same side as the voter in question. Furthermore, allowing for the existence of a neutral center category, we expect voters to penalize parties on the opposite side more than those in the center. As illustrated in the next section, these effects are not captured by the dominant theories of spatial voting.
In a setting with many parties, however, a pure categorization model is unlikely to be adequate on its own, as voters may still have to choose among several parties on their side. Existing research suggests that individuals facing a large number of options are indeed likely to first choose a broad category and then make a more specific choice within this category.Footnote 33 In the present context, such a within-category choice could be based on one of several available decisions rules. Voters could make a random draw, choose based on non-ideological factors (for example, experience, charisma or competence), or conduct a more finely grained assessment among parties on their side, applying additional functions of party and voter positions. This last scenario is particularly interesting because it implies that different spatial rules may interact with categorization effects. While voters need to pick a favorite to win their vote, it matters less how they rank their least-favored parties. Thus according to our theory, the predictive power of existing spatial rules should increase when categorization is insufficient to distinguish among the alternatives – that is, for parties located on the same side as the voter in question. If, for example, voters use a proximity rule, we expect that its impact on vote choice increases when parties and voters are on the same side.
Furthermore, the argument that voters think in terms of categories has implications for the issue of reverse causation from preferences to party perceptions. A common challenge to analyses of proximity voting relates to a phenomenon known as projection or rationalization bias, which refers to the tendency for voters to exaggerate their ideological agreement with their preferred political parties.Footnote 34 Given the utility maximization principle of spatial models, this endogenous process complicates the identification of ideological voting. In the current setting, however, the issue of reverse causation also offers an additional opportunity to test our theory, which seeks to describe the lenses through which voters understand politics. While it is possible that some voters bring parties they like over to their side to justify their preferences, this would only confirm that voters do think in terms of these broad categories. In fact, if our theory is correct, we should observe categorization patterns of projection bias. We test this implication in the analyses below, and – as a follow-up in the robustness section – we also implement our analyses using objective party placements in order to ensure that our results are not entirely driven by projection.Footnote 35
SPECIFICATION
If voters do indeed consider whether parties are on their side or not, then some steps along the ideological scale will be more significant than others. In particular, we expect party preferences, as functions of party and voter placements, to exhibit discontinuities at the center of the scale. The expected effects of being on the same and opposite sides are most easily illustrated keeping either voter or party positions fixed. Figure 2a thus illustrates the expected pattern for voters who place themselves left of center.
Assuming the existence of a neutral category, we allow the effect of being on opposite sides to be different from that of being on the same side. In our parametric analyses, we therefore employ two dummy variables to capture the states of relevance. Indicator S captures Same Side status: S={1 if V i ×P ij >0; 0 if V i ×P ij ≤0}, while indicator O captures Opposite Side status: O={1 if V i ×P ij <0; 0 if V i ×P ij ≥0}, where the voter and party positions (V i and P ij ) are centered on the geometric middle of the scales. Implementing the categorization model as a regression equation, we get:
where β 0 denotes the average preference in the neutral category, β 1 and β 2 are the effects of same and opposite side status, and ε ij is an error term.
As mentioned, where multiple parties are on the same side as the voter, categorization is not sufficient to distinguish between the alternatives. Voters may thus employ additional spatial rules to select among these parties. Following this logic, we expect alternative utility functions to be more important in distinguishing among parties on the same side as the voters. To incorporate this expectation into our model, Equation 1 needs to be augmented by the inclusion of appropriate interaction terms, allowing the impact of other functions to depend on categorization:
where f(V i , P ij ) is an unspecified function providing further distinctions. If voters were to apply a proximity logic based on absolute distances, we would get a pattern like that in the left panel of Figure 2b, while a directional logic would yield a pattern like that in the right panel. For now, we remain agnostic about which other functions voters may apply, but we expect the effect of such functions to be stronger when voters and parties are on the same side.
The empirical analysis is divided into five sections. First, we present a set of illustrative plots, making no assumptions about f(V i , P ij ). We thus explore the presence of categorization effects examining specific scenarios of party and individual ideological placements without imposing a specific spatial utility function. These results clearly favor a model combining categorization and proximity effects. Our next step includes a more tailored analysis to test the presence of categorization effects when proximity is held constant, while directional theory predicts a different outcome. Next, we test Equation 2 in order to investigate the mentioned corollary of categorization effects, namely that other spatial criteria will weigh more within the subspace characterized by voter-party convergence. Then, we address the presence of categorization in projection bias, before the last section reports a set of robustness checks.
DATA AND OPERATIONALIZATION
To test the role of categorization as a general feature of spatial voting, we use data from the European Election Study (EES),Footnote 36 which covers a wide range of countries. We focus on the dominant ideological dimension in these countries, which is the left-right continuum. The EES includes eleven-point measures of left-right positions (scales from 0 to 10) for both voters and parties.Footnote 37 To ensure homogeneity in the meaning attached to this scale, we concentrate on established democracies.Footnote 38 We thus base our analysis on fifteen EU countries: Austria, Belgium, Denmark, Finland, France, Germany, Greece, Ireland, Italy, Luxembourg, the Netherlands, Portugal, Spain, Sweden and the United Kingdom.
We measure the strength of preferences by using the respondents’ ratings of the parties, expressed as their likelihood of ever voting for them. This ‘propensity to vote’ question (PTV) facilitates cross-national comparison and is often used to capture voters’ party preferences.Footnote 39 Our unit of analysis is neither voters nor parties, but rather combinations of the two. We thus transform our dataset into one consisting of party × individual observations.
We expect voters to use category 5 on the eleven-point scales as a neutral category defining two broader categories on each side. To ease the construction and interpretation of the relevant terms, we center the scales on 5, giving them a minimum of −5 and a maximum of +5. It is important to note, however, that the survey questions and answer categories do not in any way involve a neutral point, middle or sides defined by a threshold. They are presented as continuous scales that would seem more fit for employing a pure proximity logic. Thus if voters are found to understand such a scale in terms of broader categories without being asked to do so, this will be valuable evidence against a pure proximity logic.
VISUAL INSPECTION
Choosing to be agnostic about the possible additional utility functions voters apply, we begin with a visual exploration, avoiding parametric restrictions. Our plots are based on local sample means, conditional on party and voter placements. As discussed above, keeping voter positions fixed, the status of being on the same or opposite sides becomes a deterministic function of party positions, and vice versa. As explained, we would expect discontinuities on each side of the center. Furthermore, we would expect other spatial rules to have more pronounced effects when parties and voters are on the same side.
Figure 3 shows average party preferences over different voter positions for each relevant party placement on the left-right scale. For example, the upper left graph of the figure presents the average preference score for a party located at point −1 (left of center) over the whole range of voter positions along the left-right axis, while the lower-left graph shows the same for a party at +1 (right of center). As the figure shows, for almost all party placements there are clear discontinuities as we move from one side of the center to the other, and as explained above, these would not be expected based on a pure directional or proximity model. However, the discontinuities do come in addition to a pattern of finer distinctions, which clearly resemble a proximity model with absolute distances. Furthermore, this pattern is stronger when parties and voters are on the same side: the plots are consistent with the hypothesis that these finer distinctions are conditional on categorization.
Figure 4 shows the same as Figure 3, but keeping voters (rather than parties) fixed. Again, we see categorization effects, albeit somewhat weaker than in the previous plot. Furthermore, the conditioning effect of categorization can also be seen here. When it comes to parties on the opposite side, voters are more indifferent and their preference scores are less contingent on the smallest distance criterion. These initial analyses thus offer preliminary evidence of categorization effects. The next sections extend this evidence by engaging in a more systematic testing of the constituent parts of the theory.
CRITICAL CASES
To strengthen the inference that categorization matters, we also conduct more formalized tests, carefully selecting the observations to be compared. In order to distinguish our model from the directional and proximity models, we focus on so-called critical cases,Footnote 40 in which the three models produce clearly diverging predictions. We conduct four tests, each involving a pair-wise comparison of two scenarios. We thus have eight scenarios, each involving one voter position and two party positions.
As illustrated in Figure 5, the ‘treatment’ scenario of the first comparison (1 T ) involves a voter (V) at −1, a party (P A ) at −3, and another party (P B ) at 1.Footnote 41 The parties are at an equal distance from the voter, so the proximity model does not predict that the voter will prefer one party over the other. According to directional theory, however, V should prefer P A : the directional scores are 3 and −1 for Party A and B, so the difference between them is 4 (in A’s favor). The respective ‘control’ scenario (1 C ) is generated by moving all positions two steps to the left on the scale. According to proximity theory, the voter should still be indifferent. According to directional theory, however, things are now much more clear-cut: the directional values are 15 and 3 for Parties A and B, respectively. While the difference in directional scores for P A and P B was 4 in 1 T , it is 12 in 1 C . If directional theory is correct, we should expect V to prefer P A to P B to a higher degree in the second than in the first scenario.
Our categorization model produces a different prediction. In scenario 1 T , Party A is on the same side as the voter, while Party B is on the opposite side. In scenario 1 C , both parties are on the same side as the voter. Hence, V should prefer P A over P B to a greater extent in scenario 1 T than in 1 C . As Figure 5 shows, we also investigate scenarios that replicate the ones just presented, but on the right side of the scale (comparison 2). In comparisons 3 and 4, we test the role of categorization while letting the proximity of the two parties vary. In each of the scenarios involved, the proximity model predicts that Voter V will prefer Party B over A, while directional theory predicts A over B. However, the distances are kept constant from one scenario to the next, so proximity theory still predicts no difference between the two scenarios. The directional model yields the same prediction as before – a negative difference-in-differences. According to the categorization model, there should be a positive difference-in-differences. However, these tests are more challenging for the categorization argument because the distinction is not between same versus opposite side but rather between same-side versus neutral status.
For each scenario, we select voters who place themselves at point V and have located at least one party at P A and at least one party at P B . We create a binary indicator identifying the alternative favored by the directional model (always Party A). We then pool the observations for the two scenarios to be compared, create another binary indicator identifying the ‘treatment’ scenarios and interact this with the indicator identifying Party A. The resulting coefficient denotes the difference in the average difference between the preference scores for Parties A and B, going from scenario C to T. A negative coefficient would be consistent with directional theory, while a positive coefficient would be consistent with categorization theory. To account for the clustering of preferences within individuals, we include individual fixed effects, and use robust standard errors clustered at the individual level.
Figure 6 shows the results, plotting the coefficients of interest along with 95 per cent confidence intervals. As can be seen, the point estimates are all positive; two are clearly significant, while one is barely significant, and one is barely insignificant. As explained above, these estimates are in line with the categorization model, and contradict directional theory.
GENERAL ANALYSIS
To obtain more general estimates of the effects, we now turn to a more standard regression framework. Table 1 reports a set of regression models, implementing the categorization model along with other spatial functions. Model 1 includes the Proximity Term, the Directional Term, and indicators of Same Side and Opposite Side status. The estimated effect of Same Side is close to 1 on the eleven-point preference scale, while that of Opposite Side is about −0.3, and both estimates are highly significant. The effect of proximity is also highly significant and has the expected sign. The estimate for the Directional Term appears to be statistically significant, but it has the wrong sign, suggesting this model is misspecified. This conclusion is also corroborated by the plots shown above, which indicate no trace of a directional effect. The Directional Term is therefore dropped in Model 2, without a notable loss in explanatory power or change in the other estimates. The categorization effects are still significant and have considerable magnitude.
Note: the dependent variable is the propensity to vote for a party (see the Data section for details). Party positions are measured by respondents’ subjective placements. The cell entries are ordinary least squares (OLS) regression estimates, with robust standard errors, clustered at the individual level, in parentheses. *p<0.05, **p<0.01, ***p<0.01.
There is one reason for concern about the reported results, however. Using the whole set of observations and a linear implementation of the Proximity Term, the analyses involve some extrapolation. All three side-statuses can only be observed for proximity values of 2, 3 and 4. (It takes two steps to move from the same to the opposite side, while with a distance of 5 or larger it is not possible to be on the same side. With a proximity value of 1, it is only possible to compare either same or opposite side status to the neutral status.) In other words, there is only common support for the mentioned proximity values. Model 3 in Table 1 reports an analysis only including observations where the Proximity Term is 1, 2, 3 or 4. It further includes proximity-fixed effects, ensuring that proximity effects are fully controlled for. The resulting categorization estimates are only slightly weaker than before, with a Same Side estimate of about 0.8 and Opposite Side estimate of about −0.3.
As discussed above, voters need more criteria than side categorizations to distinguish between several parties on a given side, and we expect these criteria to play a larger role for parties on the same side as voters, as the voters need to pick a winner of their vote. So far, our analyses show that when categorization in terms of sides does not suffice, voters apply a proximity logic to make finer distinctions. In Model 4, we therefore include an interaction between the Proximity Term and the categorization indicators. The results are in line with our expectations: the effect of the Proximity Term is almost twice as strong for parties on the same side as voters, compared to parties on the opposite side (−0.73 versus −0.40). Thus, categorization not only creates discontinuities in what proximity theory suggests is a continuous space, it also qualifies the impact of proximity considerations.
CATEGORIZATION EFFECTS IN PROJECTION BIAS
We now turn to another implication of our theory. Rationalization or projection bias presents a considerable challenge for the proximity model, as it is built on the assumption that perceptions precede utilities. A key point for us is that if our theory is correct, projection should reflect the coarse categories examined above. A classic idealized model of projection considers the interaction between individuals’ self-placements and their party evaluations. The more a voter likes a party, the more likely she is to perceive this party as being closer to her own position than it really is. As this process has been suggested as an alternative explanation of proximity patterns in voter preferences, it has invariably been assumed to operate in a linear fashion throughout the ideological scale. In contrast to this widespread assumption, the theory presented here implies categorization effects: the incentive to misplace a preferred party’s true position is significantly strengthened when this means bringing it to one’s own side. Thus, considering two individuals with the same distance to their preferred party, projection should be more evident for the individual who would otherwise find the party at the opposite side of the center.
To test this hypothesis, we draw on (but also significantly modify) existing models of projection bias.Footnote 42 We present two different models, both of which use parties’ sample mean placements as a measure of their ‘true’ positions.Footnote 43 In our first model, the dependent variable, W ij , is constructed as a binary indicator of ‘mis-categorization’, that is, party-individual combinations where the respondent placed the party at the opposite side of the sample average. Our measure of party utility, which is the same as before, is now employed as an independent variable, denoted Y ij . In our simplest model, this variable is interacted with two binary indicators of same and opposite sides, $$S_{{ij}}^{M} $$ and $$O_{{ij}}^{M} $$ , where mean party placements are used to locate the parties. The first indicator thus identifies observations where respondents are on the opposite side of the mean party placement, whereas the second identifies those where respondents are on the same side as the mean party placement. The interaction between party evaluations and these two dummies reveals the impact of categorization-based projection: the more a voter likes a party on the opposite side, the more likely she is to misplace the party so as to find it on her own side. Our simplest model is thus:
In this model, categorization effects in projection are mainly captured by β 4 and β 5, although these coefficients should be combined with the main effects of preference strength (β 5) and side status (either β 1 or β 2) to obtain the complete marginal effects of preference strength, conditional on side status. We go one step further, however, and allow for non-linearity by using penalized smoothing splines in a generalized additive model (GAM). We estimate three smooth terms to capture the effect of preference strength, conditional on whether voters are on the same side as the mean party placement $$( {S_{{ij}}^{M} =1} \right)$$ , opposite side $$( {O_{{ij}}^{M} =1} \right)$$ or neither $$( {S_{{ij}}^{M} =O_{{ij}}^{M} =0} \right)$$ .
Our second model aims more directly to capture the underlying idea behind projection models, namely that voters might over- or under-estimate their ideological distance to parties according to their non-ideology-based preferences. Thus the dependent variable is now a measure of divergence between the perceived and ‘objective’ distances between parties and voters: $$\mid \!P_{{ij}}^{P} {\minus}V_{i} \!\mid\!{\minus}\!\mid\!P_{j}^{M} {\minus}V_{i} \!\mid$$ , where $$P_{j}^{M} $$ denotes the mean left-right position of party j, $$P_{{ij}}^{P} $$ denotes the position of party j as perceived by voter i, and V i denotes voter placements. We model this difference as a function of party utility (Y ij ), the two indicators of same and opposite side $$( {S_{{ij}}^{M} \,{\rm and}\,O_{{ij}}^{M} } \right)$$ , and the interaction between these and party utility. We expect that the need for rationalization will be stronger when preferred parties’ ‘true’ positions are on the opposite side of voters. Following the standard convention for first-differenced models, we also include the perceived ideological distance between parties and voters $$( {\mid\! P_{{j}}^{M} {\minus}V_{i} \!\mid} \right)$$ as a predictor in some of the models (reported in full in the Appendix). The resulting model is thus:
We expect that the stronger the preference for the party, the greater the underestimation of the distance between party and voter. Such evidence of projection, however, should be mediated by whether the party is already on the same side as the voter. If a preferred party were on the opposite side of the voter, this would make the need for projection stronger and thus increase the tendency to underestimate ideological distance. Testing this expectation, we again use a generalized additive model to allow a more flexible estimation of the role of party utility for projection.
Figure 7 reports the results from both models. The first panel presents the key estimates for the first model and demonstrates considerable categorization effects in projection bias. The graph shows that when a voter strongly dislikes a party on her side (Y ij =0), the probability of miscategorizing the party (moving it to the opposite side) increases. This kind of misplacement becomes less likely as the evaluation of the party grows more favorable. Similarly, a voter who strongly likes a party on the opposite side is much more likely to move this party to her own side. In contrast, if she dislikes a party on the opposite side, the probability of making such a miscategorization decreases considerably.Footnote 44 The second panel shows that as party utility increases, the ideological distance to the party is increasingly underestimated. However, this effect is stronger for parties located on the opposite side than for parties on the same side as voters. The slope of Y ij is steeper for parties on the opposite side. Taken together, these results strongly support the notion that voters categorize parties as well as themselves in terms of their side of the center, and that they find these categories significant.
ROBUSTNESS
Lastly, it worth assessing the robustness of our results. First, as many readers may be unfamiliar with our measure of party preferences, the PTV, and some may consider actual vote choice a more important and relevant outcome, we also implement our model analyzing vote choice. In Appendix Table A9, we report a full replication of Table 1 with vote choice as the dependent variable, and the results are substantively identical to those we report above.
Another issue arises from the projection effects shown in the previous section: the question of whether the patterns we find are mainly driven by subjective perceptions (and misperceptions) of positions along the ideological scale, or whether they are still present when more objective measures are used. We address this using the left-right party positions provided by the 2010 Chapel Hill Expert Survey.Footnote 45 We implement our analyses for both PTVs and vote choice, and the results are reported in Table 2.Footnote 46 The results are substantively very similar to those reported earlier, which implies that the categorization effects are informed to a considerable degree by objective party placements rather than being driven by individual misperceptions.
Note: the cell entries are OLS regression estimates, with robust standard errors, clustered at the individual level, in parentheses. *p<0.05, **p<0.01, ***p<0.01.
Another potential concern is that our findings are confounded by partisan attachments. It is indeed likely that subjective perceptions of ideological proximity to some extent may be overestimated for parties with which voters strongly identify. However, the robustness checks above – using objective party placements – already guard against this possibly confounding effect, and show that our results hold in the absence of perception biases. Furthermore, it is also very likely that party identification is partly driven by ideological proximity, which is why we generally do not include party identification in our models, as this would generate post-treatment bias. Nevertheless, as another robustness check, we replicate the analyses of Table 1, adding a measure of partisanship that captures both the direction and the intensity of partisan attachments. Further information is provided in Appendix Table A10, which also presents the results. All key findings are robust to the inclusion of party identification, suggesting that ideological categorizations constitute an independent and more general source of identification, and generate effects beyond those that can be accounted for on partisan grounds.
A final question is whether categorization also applies to the American context. In this case, we would not focus on the left-right dimension, but on the conceptually similar liberal-conservative dimension. More importantly, the country’s two-party system could render the ideological categorization of political actors superfluous, as party-based divisions could represent an equivalent and equally encompassing form of categorization. To address the question of whether ideological categorization is relevant in the United States, adding predictive power beyond party identification, we conduct an analysis of vote choice using the American National Election Studies (ANES) and report the results in Appendix Table A11.Footnote 47 We use all surveys since 1972, because since then respondents have been asked to place themselves and parties on a 1–7 liberal-conservative scale. We first replicate Table 1, showing that categorization effects are also present in the American context. We then add party identification as a control variable (despite the risk of post-treatment bias), and show that the categorization model has explanatory power beyond that of party identification.Footnote 48
CONCLUSION
Our results confirm that politics is understood in terms of sides, resulting in a discontinuous political space. Even when presented with seemingly continuous ideological scales, voters are conscious of their own side and evaluate parties accordingly, giving significantly higher ratings to the parties they consider to be on the same side as themselves. On average, we find party preferences to be about one point higher on an eleven-point scale when parties are on the same side as voters, and about 0.3 lower when they are on the opposite side. This finding has important implications that merit further elaboration.
In contrast to prior theoretical accounts that treated spatial voting as a building block of rational choice models of voting,Footnote 49 categorization theory postulates that spatial voting involves a nontrivial identity-based element that makes voters perceive their political alternatives coarsely. As our results show, for a moderate left-wing voter, a strongly leftist party may be preferable to a moderate right-wing party, even when distance alone would predict otherwise. This is not because the former advocates policy change in a leftist direction with higher intensity, as directional theory would suggest, but because one party fits the left as a reference category while the other does not. The fact that projection bias is also characterized by side-specific discontinuities strengthens our identity-based interpretation of these effects. In this respect, our study joins the new social-identity based theories of party branding, which reconcile the Michigan conceptualization of party identification with attitudinal updating and partisan lability.Footnote 50 By the same token, categorization theory points to the role of group-oriented thinking in spatial voting. This coarsened logic is only compatible with the principle of utility loss minimization – underpinning proximity theory – if utilities are themselves partly defined in more coarsened terms.
Our theory qualifies the proximity model in two ways: first, the coarse categories of sides lie beneath and cut across the political space, and secondly, the impact of the more finely grained distinctions afforded by proximity considerations is conditional on these basic categories. In other words, the categorization model does not refute the Downsian paradigm, but shows that the smallest-distance criterion applies differently within the subspaces defined by the center of the ideological scale. The findings presented here thus give credence to new developments in decision-making theories. A noteworthy model in this literature, which draws on the principles of bounded rationality, is characterized by a two-step ‘categorize-then-choose’ logic.Footnote 51 According to this scheme, agents first categorize and then choose alternatives within their preferred category. We argue that vote choice is characterized by a similar two-stage process: at the first stage, voters identify parties that satisfy the criterion of being on the same side, while at the second stage they use a more specific decision rule to choose among these options.
Apart from qualifying the theoretical underpinnings of proximity theory, categorization also informs the discussion about competing metrics of distance in proximity models. Two distance functions have mainly been used: Euclidean distances with quadratic utility penalizations ([V i −P ij ]2) and city-block distances with linear utility penalizations (|V i −P ij |). Since they both provide the same preference ranking, the two models have typically been used interchangeably.Footnote 52 However, studies that have tried to compare the explanatory power of the two metrics have provided overwhelming empirical evidence in favor of the city-block metric.Footnote 53 In our own analyses, we also found that city-block proximity is a far better predictor of voter preferences than Euclidean metrics. Our model helps explain this finding: the Euclidean metric gives linear distance greater weight when it is least needed – that is, when parties and voters are likely to be on opposite sides and linear distances tend to be large. Conversely, this metric gives linear distance the least weight when it is needed the most: when voters and parties are on the same side and linear distances tend to be small.
Research on directional theory is also affected by our argument. Our results suggest that when controlling for categorization, the intensity part of the directional model is no longer important in voters’ party preferences. Accompanied by the more in-depth analyses presented in the Appendix, our findings question the theoretical foundation of directional theory as a whole. We argue that voters apply categories defined by the center of the scale rather than the policy status quo. This is important, as the directional model is commonly operationalized using the geometric center of survey scales as the neutral point, which increases the correlation between the directional measure and the categorization measures. While the directional model still fails to clearly capture the effects we find, prior estimates of directional effects may have benefited from the presence of categorization effects in the absence of appropriate controls. In short, our results suggest that effects previously attributed to directional models are largely due to categorization effects.
Our results also have important practical implications. In order to grasp the impact of categorization effects on parties’ electoral fortunes, we simulate a scenario in which they play no role.Footnote 54 Among the voters placing themselves closest to the center (at +/−1), we find that about 10–12 per cent would vote differently in the absence of categorization. At the extremes of the scale, this share drops slightly below 5 per cent. On average, we find that between 5 and 6.3 per cent of voters would vote differently (an estimate that is pulled down by the voters in the center). While 5–6 per cent may seem modest, it may be a significant figure considering that election outcomes tend to be determined by the few who change their votes in systematic ways.Footnote 55
Finally, the findings shed light on the dynamics of party competition. A common observation among students of party politics is the absence of leapfrogging: when parties change their position, they tend to do so within a limited range, maintaining the overall ranking along the policy dimension in question. Categorization theory adds a prediction to this pattern: that parties will stay on a given side of the center, as they need to convince their supporters that they remain on their side. This may account for the absence of leapfrogging, as it adds a boundary to party movement, and the parties around the center might otherwise have been the most likely to leapfrog each other. Even the most centrist parties tend to have a distribution of supporters that is skewed toward one side of the left-right continuum, and in light of the categorization model, this is likely to give these parties incentives to stay on the side of their traditional voters.
Supplementary Material
To view supplementary material for this article, please visit http://dx.doi.org/10.1017/S0007123415000393