1. Introduction
In his classic essay on democracy, Anthony Downs (Reference Downs1957) wrote that “the basic determinant of how a nation's political life develops is the distribution of voters along the political scale.” Adapting the spatial approach of Harold Hotelling (Reference Hotelling1929), Downs's basic model posits single-peaked voter's preferences along a unidimensional left–right continuum. The best known part of Downs's work deals with two-party plurality-based competition. Here, the median voter plays a key role in determining the winner and in shaping the incentives for the structure of party competition by creating centripetal pressures for parties and candidates to modify their platforms to better match the views of the median voter. This unidimensional model can, however, be readily extended to the multiparty case, as long as we recognize that, for a fixed distribution of voter preferences, the use of different electoral rules can create different incentives for voters and parties (Cox, Reference Cox1997).
The goal of this paper is to apply the basic Downsian one-dimensional model to predict, ex ante, the outcome of the French presidential election of April–May 2007 by using past French presidential elections over the period 1965–2007. The French community of election forecasters of the 2007 presidential election, besides pollsters, is composed of a small group of economists, who use simple models of voter behaviour. Based on a reward–punishment behaviour of citizens, these models propose a conventional vote function with two or more independent macro variables, like log of unemployment rate and popularity of the incumbent president several months before the election, to predict outcomes (see review in Lemennicier, Reference Lemennicier2007).Footnote 1 These models vary in methods and results, but they issue “true” forecasts, that is, they present predictions well in advance of the election. For example, they might hypothesize that median voter preferences are determined by reaction to unemployment rates, so that, when unemployment rises, the median voter votes against the incumbent party by “punishing” the incumbent government for its failure to adequately manage the economy (see, for example, Nordhaus, Reference Nordhaus1975).
Our analysis, which is based on a spatial median voter model, departs to a considerable extent from the models based on governmental reward–punishment behaviour by citizens. We rely on the pre-election distribution of intended votes one month before the election, using data taken from TNS-SOFRES presidential election surveys since 1981.Footnote 2 Unlike the simple models of voter behaviour, which are based entirely on the demand side of political preferences of the (median) citizen, our spatial approach is rooted in the supply side of “political competition,” in which the distribution of expected votes plays a key role as parties strategically compete to locate their political platforms closest to the overall median voter.
We proceed in the following manner. One month before the April–May 2007 presidential election—that is, in March 2007—we estimated the pre-election distribution of intended votes, using Theil's (Reference Theil1966) correction for systematic bias that has been observed in past presidential elections. The position of the median voter in this estimated distribution was found, and the distance to this position of the two leaders of the major left- and right-wing parties was calculated. After verifying that the median voter model worked well in previous presidential elections, that is, that the winner usually was the candidate closest to the position of the overall median voter, we hypothesize that the winner in 2007 will be one of the two candidates with the highest modes (expected vote shares). In particular, we hypothesize that the winner of the 2007 presidential election will be the candidate of these two whose mode is closer to the overall median voter.
The paper is organized as follows. The next section focuses on the way we apply the median voter model to the French case; the succeeding section presents a graphical overview of French presidential elections since 1965, along with a brief discussion of each. The final substantive section presents the forecasts. The projected percentage of votes for the left candidate at the second round is calculated by using a bivariate regression equation with proximity (in terms of percentage of votes) to the overall median voter as the key independent variable. The prediction was done one month before the election.
We conclude the paper with a brief overview of our findings and of how they fit with other research on election prediction.
2. The Spatial Median Voter Model Applied to the French Case
We make use of a stylized picture of French party space, in which we identify in each election five political groupings: extreme left, left, centre,Footnote 3 right and extreme right. This picture is clearly a simplification and avoids the creation of a separate (anti-) immigration dimension on which Le Pen might be located.Footnote 4 This one-dimensional portrait of French political competition is, nonetheless, so much in accord with the way French journalists and political scientists have analyzed the past four decades of French politics that we feel quite comfortable in making use of it.Footnote 5 It seems to capture the basic elements of presidential competition over this period. And, since the presidential election is such a critical feature of the French political system (Lijphart, Reference Lijphart1999), even though we restrict ourselves to presidential elections, we still can capture key features of the changes in political competition in France.
Our approach is based on three underlying hypotheses:
1. First, we assume that citizens vote sincerely for their preferred candidate and not tactically, that is, votes represent actual preferences.Footnote 6 Similarly, we take candidates as responding to voters' sincere preferences.
2. Second, we do not consider attempts to manipulate the voting system by fostering divisions among one's own or the opposition bloc.
3. Third, we posit that candidate positions can be identified along a left–right axis.
The first assumption is plausible because, in general, the two-round structure of voting lowers the cost to voters when casting sincere votes for the party closest to their ideology as long as voters can expect that a party close to their preferences will make it into the final competition. The second assumption seems accurate for all but possibly the election that led to the first Mitterand victory. The assumption of unidimensionality is a common one. Locating parties on the left–right dimension is facilitated because we are aggregating votes in a quintile classification from extreme left to extreme right. Much of the time we may assume that the relative location of our five groupings along the ideological dimension remains roughly constant in that French political parties are heavily constrained by their historically defined relative location in the ideological space and by the preferences of a party activist base which tend to anchor the parties in ideological terms, particularly on the left side of the political spectrum. Nevertheless, on the right side of the political spectrum there can be party movement between the right and the centre–right in terms of whether the UDF or the Gaullists are further to the right. To deal with this problem we look more specifically at the political platform and political campaign of the candidates in each of the elections. In situations where there may be doubt, we will consider alternative locations, and check the impact of our locational estimates on the accuracy of our prediction.
We now turn to the distribution of votes at each presidential election since 1965 across the five blocs (see appendix A)—identifying both overall and within-bloc median and modal locations.
3. Analyses of Individual Presidential Elections
(a) The 1965 Presidential Election
Figure 1(a) illustrates the initially bimodal nature of the French political system.Footnote 7 The vertical line indicates the overall median; the lines with arrows the medians of the two tendances. The modes which coincide with the two high points represent the most popular candidates or parties.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-16178-mediumThumb-S0008423909990746jra_fig001g.jpg?pub-status=live)
Figure 1(a) Distribution of Votes in the First Round of the Presidential Election of 1965. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
Mitterrand, as the leader of a coalition of both Socialists and Communists, ran for the left, while Lecanuet was the leader of the centre party. De Gaulle as the representative of the neo-conservative party was the closest to the overall median voter in the second round (a distance of 0.2 per cent versus a distance of 15.3 per cent for Mitterrand), and he won. In 1969, De Gaulle resigned early after a referendum in which his proposal was defeated. This resulted in an early presidential election (before the seven-year term then in effect had expired).
(b) The 1969 Presidential Election
Pompidou was the candidate of the neo-conservative Gaullists and captured 44.5 per cent of the votes on the first round. The Communist party, with Duclos received 21.27 per cent of the votes, and the parties on his left (whose candidates were Krivine, Ducatel and Rocard) reached 26.6 per cent of votes in toto, while the Socialist party, with Deferre as its candidate, scored a meagre 5 per cent of the vote. The centrist party, with Poher, the President of the Senate, as its candidate, did well, with 23.3 per cent of votes. Its position almost perfectly coincides with the location of the overall median voter.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-28608-mediumThumb-S0008423909990746jra_fig002g.jpg?pub-status=live)
Figure 1(b) Distribution of Votes in the First Round of the Presidential Election of 1969. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
Poher was arguably the median choice. Even though he received only 23.3 per cent of the first round vote he still was able to enter the runoff, and he should have won the second round of the election. But Duclos asked the Communists not to vote for Poher, and to abstain instead. Poher lost on the second round because the level of abstention from extreme left-wing electors was so high. This abstention shifted the overall median voter among the actual second-round electorate rightward, so Pompidou won.
(c) The 1974 Presidential Election
Pompidou died before the end of his mandate and a new election took place, again prematurely, in 1974. As Figure 1(c) shows, the shape of the distribution of votes changed drastically in the 1974 presidential election from that in the 1969 election; it became unimodal with the highest mode on the left of the political scale. The distribution is skewed toward the left, with the median voter almost exactly at the centre.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160709050902-59368-mediumThumb-S0008423909990746jra_fig003g.jpg?pub-status=live)
Figure 1(c) Distribution of Votes in the First Round of the Presidential Election of 1974. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
Mitterrand formed a coalition with the Communist party and the centre-left, while the right-wing parties were divided between the centre-right party of Valéry Giscard d'Estaing (VGE) and the Gaullist party with Chaban Delmas as its candidate. VGE, in a famous speech at Nantes, declared his own position to be a centrist one. VGE defeated his rival on the right, Chaban-Delmas, in the first roundFootnote 8 and won the second-round election, as his perceived location was the closest to the median voter, indeed essentially identical to it.
(d) The 1981 Presidential Election
Seven years later, at the end of VGE's term in office, a new election was required. The shape of the distribution of votes was again changing; in 1981 it became multimodal and quasi-uniform with four blocs of nearly equal size, two on the left, one in the centre, and one on the right. The overall median voter was located just on the left. On the extreme right no candidate was running. Mitterrand won, as his position was virtually indistinguishable from that of the overall median voter (a distance of 0 per cent versus a distance of −0.70 per cent for VGE).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-95739-mediumThumb-S0008423909990746jra_fig004g.jpg?pub-status=live)
Figure 1(d) Distribution of Votes in the First Round of the Presidential Election of 1981. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
(e) The 1988 presidential election
With the passage of another seven years, the shape of the distribution of votes was further altered. The main change was a much strengthened far right, although the overall distribution is still bimodalFootnote 9 and still skewed to the left.Footnote 10
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-17592-mediumThumb-S0008423909990746jra_fig005g.jpg?pub-status=live)
Figure 1(e) Distribution of Votes in the First Round of the Presidential Election of 1988. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
The position of the median voter is just at the centre. If we sum up votes, starting on the extreme right: Le Pen (FN) 14.4 per cent, at the extreme right, Jacques Chirac (RPR) on the right 20 per cent, and Raymond Barre (RPR) at the centre right,Footnote 11 16.5 per cent, the total is 50.9 per cent of votes. The left and extreme left total 49.1 per cent of votes. Both modes were close to the median voter. But Jacques Chirac, though he was the closest to the median voter of the right-wing voters, was not the closest to the overall median voter. Mitterrand was the closest to the overall median voter. Moreover, if Jacques Chirac were to have courted the overall median voter by moving to the left, then he would have lost votes on his extreme right. In fact many voters of the Front National deserted Jacques Chirac.
Le Pen's success in 1988 can in part be attributed, we believe, to the effects of some political machinations by President Mitterrand. On the one hand, we believe that Mitterrand was seeking to use the populist appeal of Jean-Marie Le Pen as a weapon against both the centre-right Union pour la Democratie Francaise (UDF) and the Gaullist Rassemblement pour la République (RPR) coalition. Those sympathetic to the left in the media insured that Le Pen was provided access to public TV channels' political programs, but commentators also insured that no right-wing coalition with Le Pen was possible by making a “political correctness” argument that Le Pen was too far out of the political mainstream to be a suitable coalition partner. On the other hand, while it is generally thought that a degree of proportional representation was introduced in the 1985 parliamentary elections by Mitterrand to save the Socialist party from what was feared might be a potentially disastrous parliamentary election defeat under the two-round systems (Bréchon, Reference Bréchon1998), this change in electoral rules also made it easier for the extreme right-wing to gain parliamentary representation that year—thus strengthening Le Pen's hand in the subsequent presidential election.
(f) The 1995 Presidential Election
In 1995 the shape of the distribution of votes is multimodal, with five ideological blocs of nearly equal size.
The political platform of Chirac was clearly on the centre left, with an emphasis on the “fracture sociale,” while Balladur, who was the prime minister and of the same party as Chirac, was customarily located at the centre right. Thus, we have placed Balladur to the right of Chirac in this election.Footnote 12 Chirac gained sufficient votes to enter the second round, and he won the second round since his platform was virtually coterminous with the views of the median voter. Note, also, however, that the far right continued to gain ground in the first round.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-24946-mediumThumb-S0008423909990746jra_fig006g.jpg?pub-status=live)
Figure 1(f) Distribution of Votes in the First Round of the Presidential Election of 1997. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
(g) The 2002 Presidential Election
The shape of the 2002 distribution of votes has similarities to that of much earlier elections, with a splintered left and a strong Gaullist right. However, the growth in strength of candidates on both the far left and the far right,Footnote 13 and the splitting of votes on the left meant that Jospin, the candidate of the Socialist party, was not selected at the first round because he put his platform close to the overall median voter and not close to the median voter of his own camp, leading to dissatisfaction with him by voters on the left. In 2002 we had a contest between a rightist candidate, Chirac, and a far-right candidate, Le Pen. Endorsed reluctantly by Jospin, Chirac won the second round in a landslide, with Le Pen doing little better on the second round than he had in the first. (Note that the table shows aggregated votes for the left and the extreme left; Chirac and Le Pen were the two candidates with highest votes on the first round.)Footnote 14
(h) The 2007 presidential election
The change in the shape of the distribution of votes over the new five-year presidential term of office is once again quite considerable. In particular, there is a shift away from the extreme left and the extreme right, and a strong gain for a centrist party.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-39921-mediumThumb-S0008423909990746jra_fig007g.jpg?pub-status=live)
Figure 1(g) Distribution of Votes in the First Round of the Presidential Election of 2002. (Also shows location of overall median voter and the median voters in the left bloc and the right bloc.)
The distribution is multimodal, with the two largest modes on the left and on the right, respectively. This distribution is relatively similar to the one in 1965. In the second round, Sarkozy was the candidate closest to the median voter of the overall distribution, and he won. However, Bayrou, arguably the centrist, would have defeated either left-wing or right-wing candidates had he received enough first-round support to enter the second round of the election (Abramson, Reference Abramson2007).
With this historical background suggesting the plausibility of viewing second-round outcomes as a choice between two modal candidates whose outcome is decided by which is closer to the overall median voter, we turn to our ex ante prediction of the 2007 presidential election.
4. Impacts of Mode Locations on the French Presidential Election Outcomes and the Forecast for the 2007 Presidential Election
To forecast the winner of the 2007 French presidential election, we focus on the location of the two highest modes relative to the location of the overall median voter in the first round. As noted earlier, we look at the distribution of intended votes revealed by opinion surveys one month before the election, but with a correction for past systematic bias in the survey data.
Our problem was to estimate the distribution of Figure 1(h) based on opinion polls taken one month before the election (or even earlier). To deal with this issue, from 1981 to 2007 we projected intended votes based on opinion polls of TNS-SOFRES 6, 5, 4, 3, 2 and 1 weeks before the election.Footnote 15 We then compared these estimates, especially those done four to six weeks in advance of the election, to the actual results of the elections for each presidential election. We decomposed prediction errors into systematic bias and hazard-based errors, using Theil's (Reference Theil1966) decomposition of errors methodology. Then we eliminated the systematic bias to obtain a distribution of intended vote which contains only hazard errors (see appendix B for details).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-46987-mediumThumb-S0008423909990746jra_fig008g.jpg?pub-status=live)
Figure 1(h) Distribution of Votes in the First Round of the Presidential Election of 2007 (also shows location of overall median voter and the median voters in the left bloc and the right bloc) of forecast and of ex post results.
For each election between 1981 and 2007 we calculated this corrected distribution of votes and identified the two highest modes. The candidates located at these modes were our predictions for the two candidates running in the second round. We also calculated the distance of each mode to the overall median voter. Then we predicted the winner among these two projected candidates as the one closer to the overall median voter. For the 1965, 1969 and 1974 elections, where we did not have survey data, we used the actual results of the first round of the election. Table 1 presents our electoral predictions based on this method.
Table 1. Projected second-round winner from vote intentions one month before elections versus actual election results
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-03263-mediumThumb-S0008423909990746jra_tab001.jpg?pub-status=live)
* To test the robustness of the method we switch the centre right and the right in 1988 and in 1995. In each of these years the relative location of the two major candidates on the right wing is debatable. In 1995, even with a switch, Chirac still wins, as he remains the closer to the overall median voter than his left-wing opponent. But in 1988 the model predicts Chirac as the closest, but Mitterrand won. In that case the rate of failure in the prediction is 1 out of 8. This illustrates the difficulty mentioned earlier in the text.
** Here the two highest modes were Chirac and Le Pen. The grouping of political parties on the left masks the elimination of Jospin, the Socialist leader, at the first round. Nevertheless Chirac is the closest candidate to the overall median voter, and won the election against a rival at the extreme right in a way that does not violate the median voter pivotal power prediction of the Downsian model.
*** At the time the initial French version of this paper was written, the results were unknown [see Lemennicier, B. and H. Lescieux. 2007].
Our prediction is based on our column in italics. We present the actual results in bold and in parenthesis. We would emphasize that in 2007 as in all years since 1981, our prediction was ex ante and not ex post.
Table 1 shows that, with the notable exception of 2002, opinion polls, whether corrected or not via Theil's (Reference Theil1966) methodology, predicted quite well the two highest modes. Theil's statistical correction method is, however, useful in more accurately predicting the distances to the overall median voter, and thus the winner on the second round. Making our projections one month in advance, in 2007 our statistically corrected estimates did a good job approximating the difference of votes between the left (Ségolène Royal) and the centre (Bayrou). On the other hand, we underestimated the vote for Sarkozy and overestimated the votes for the extreme left and right. But these estimation errors are sufficiently minor that they are without consequences for our predictive ability to identify the two highest modes, or, as we shall see, to impact on our ability to predict the final winner.
To predict the percentage of votes of the candidate of the left (or the right) at the second round in 2007, we estimated from past elections the relation between the distances of the mode of this candidate to the overall median voter. Then we ran a bivariate regression with distance as our independent variable. Table 2 presents the data used for this simple regression.
Table 2. Relation between percentage of votes at the second round of the left- and right-wing leaders and the estimated distance of their mode at the first round with the overall median voter through all presidential elections
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-67210-mediumThumb-S0008423909990746jra_tab002.jpg?pub-status=live)
* Drawn from Table 1;
** estimated by our simple regressions
The relationship between vote share and distance to the median voter is depicted graphically in Figure 2(a) for the left-wing candidate in the second round (if any) and in Figure 2(b) for the mainstream right-wing candidate in the second round.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160709050902-48855-mediumThumb-S0008423909990746jra_fig009g.jpg?pub-status=live)
Figure 2(a) % of Votes of the Left-Wing Leader at the Second Round: For Years 1965, 1974, 1981, 1988, 1995
Vote for the left (second round) = 52.3 − 0.468 (distance of the left mode to the median) t statistic 39.5 −3.032; R2 = 0.75; Adjusted R2 = 0.67, standard error = 2.05
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160709050902-00725-mediumThumb-S0008423909990746jra_fig010g.jpg?pub-status=live)
Figure 2(b) Percentage of Votes of the Right-Wing Leader at the Second Round for Years 1965, 1969, 1974, 1981, 1988, 1995, 2002
Vote for the left (second round) = 52.3 − 0.468 (distance of the left mode to the median) t statistic 39.5 −3.032; R2 = 0.75; Adjusted R2 = 0.67, standard error = 2.05
With a distance of 11.9 per cent between the left mode and the overall median voter, our regression estimate based on the previous elections for which we have the relevant data was that Ségolène Royal should get 46.73 per cent of the second-round vote (here we simply insert 11.9 per cent into the equation). Taking standard errors into account, we predicted that the vote share of Madame Royal would be between 44.7 per cent and 48.7 per cent. In fact, she received a vote share of 46.9 per cent; thus our estimates of 46.73 came within 0.17 per cent of the true result.
Doing the same analysis to project the vote share of the right-wing leader gives the results shown in Figure 2(b). Despite the poor regression fit of the regression equation shown in Figure 2(b), with an adjusted R2 of only .064, and a t statistic of only 0.8 for the independent variable, well below statistical significance, we could still use this regression to successfully project the winner in 2007. With a distance of 8.4 per cent between the right mode and the overall median voter, our regression estimate based on the previous elections for which we have the relevant data was that Nicolas Sarkozy should get 52.6 per cent of the second-round vote (here we simply insert 8.4 per cent into the equation). In fact Sarkozy received a vote share of 53.06 per cent; thus our estimates was within 0.46 per cent of the true result. Note, too, that although we ran separate regressions for each of the two second-round candidates in 2007, the estimated second-round vote shares of both candidates add almost perfectly to 100 per cent (99.3 per cent).
5. Conclusion
The French double ballot electoral structure makes it likely that multiple candidates will contest the first round. Parties at the first round can try to influence the policies of existing parties able to govern by showing their strength with the voters. Voters of the two basic camps (tendances) in French politics, the left and the right, are concerned to assure that a candidate from among their own tendance will be among the two candidates who make it into the final round. In practice, until the anomaly of the 2002 election, however, where dispersal of the votes on the left and on the right made the distribution of votes among the five blocs we have identified very close to a uniform distribution, and led to a contest between a candidate of the right (Chirac) and a candidate of the ultra-right (Le Pen), the strength of the leading parties in each of the two tendances made the distribution of votes look bimodal viv-à-vis the left plus extreme left versus the right plus extreme right. This bimodality has more or less guaranteed that a representative of the right would face a representative of the left in the runoff.
The second round of the French two-ballot system constrains the two parties who make it to that round to appeal to a broader set of voters, including the median voter, and to form alliances with other parties in order to win the second round. In general, the winner of the election will be the leader of the camp whose own median and/or modal voter is closest to the overall median voter. Thus, the double ballot electoral system imposes a constraint on the strategy of political leaders who recognize the ultimate importance of the overall median voter. They must avoid the danger of being so oriented to winning the eventual two-candidate second round that they are eliminated at the first round, since when a candidate of the right wing (the left wing) shifts his or her political platform in the direction of the overall median voters, he can expect to lose votes to more extreme parties of his/her own tendance. Outcomes in the double ballot system are thus simultaneously affected by the location of the overall median voter and by the nature of the within-bloc distribution of voter support—in particular, by which bloc's modal party is closer to the overall median voter.
We have built a predictive model for French presidential elections based on these strategic factors affecting party incentives in a two-round competition, and simplifying assumptions about French political competition in terms of a left–right space and five “families” of contestants in a way that is familiar to all commentators on French politics. Testing that median voter spatial model with estimates drawn from analyses of a number of previous presidential elections, we were able to come up with a very good prediction of the 2007 presidential outcome in terms of who will be in the second round and who will win.
Appendix A
Appendix A Distribution of the candidates by political families for the French Presidential elections from 1974 till 2007
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-30648-mediumThumb-S0008423909990746jra_tabU001.jpg?pub-status=live)
Appendix B
Methodological Issues
To assess the accuracy of opinion polls we picked the absolute difference between vote intentions for each political groupings and the final vote. We decomposed these errors in systematic bias and hazard bias following Theil's (Reference Theil1966) “decomposition of errors” method.
This method is based on the mean square prediction error. The mean square prediction error is decomposed in order to indicate systemic and random sources of error. The systematic component is further divided into the proportion of the total forecast error due to bias and the proportion of total forecast error attributable to unequal variation.
The forecast error is defined as an equation of the shape UM + US + UC as follows:
![$$
\frac{{\left( {\overline P - \overline R } \right)^2 }}
{{EQM}} + \frac{{\left( {S_P - S_R } \right)^2 }}
{{EQM}} + \frac{{2\left( {1 - r} \right).\left( {S_P .S_R } \right)}}
{{EQM}} = 1
$$](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160203084645703-0459:S0008423909990746jra_eqnU001.gif?pub-status=live)
P and R are mean predicted and mean actual changes respectively; S P and S R are the standard deviations of predicted and actual values respectively; and r is the coefficient of correlation between predicted and actual values.
The first expression (UM) of RHS of this equation is the proportion of the total forecast error due to bias. It represents a measure of proportion of error due to over prediction or under prediction of the average value. The second expression of the RHS of this equation (US) is the proportion of total forecast error attributable to unequal variation. In other words, it measures the proportion of error due to overprediction or underprediction of the variance of the values. The third expression of the RHS (UC) of this equation measures the proportion of forecasting error due to random variation.
In order to realize this decomposition, we calculate the coefficient of correlation between predicted and actual values.
![$$
r = \frac{{\frac{1}
{T}\sum {\left( {P - \overline P } \right)} .\left( {R - \overline R } \right)}}
{{S_P .S_R }}
$$](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160203084645703-0459:S0008423909990746jra_eqnU002.gif?pub-status=live)
![$$
\text{With }\overline P = \frac{{\sum P }}
{T}\text{ and }\overline R = \frac{{\sum R }}
{T}
$$](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160203084645703-0459:S0008423909990746jra_eqnU003.gif?pub-status=live)
![$$
\text{With }S_P^2 = \frac{{\sum {\left( {P - \overline P } \right)} ^2 }}
{T}\text{ and }S_R^2 = \frac{{\sum {\left( {R - \overline R } \right)^2 } }}
{T}
$$](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160203084645703-0459:S0008423909990746jra_eqnU004.gif?pub-status=live)
With data from TNS-Sofres on French presidential elections from 1981 till 2007, we obtain the following tables.
Calculation of UM for the whole period 1981–2007 (month number before the election)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-63905-mediumThumb-S0008423909990746jra_tabU002.jpg?pub-status=live)
Calculation of US for the whole period 1981–2007 (month number before the election)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-92385-mediumThumb-S0008423909990746jra_tabU003.jpg?pub-status=live)
Calculation of UC for the whole period 1981–2007 (month number before the election)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160710002559-31749-mediumThumb-S0008423909990746jra_tabU004.jpg?pub-status=live)
The mean square prediction error has been decomposed in order to indicate systematic source of error. However, other errors exist. In order to avoid them, we remove/add half of the distance between predicted and actual vote. If the score of the political family has been overestimated over the whole period 1981–2007, we remove ½(P − R). If the score of the political family has been underestimated over the whole period 1981–2007, we add ½(P − R). The results are shown in the table below.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160709050902-02702-mediumThumb-S0008423909990746jra_tabU005.jpg?pub-status=live)
For each election between 1981–2007 (including 2007), we calculated this corrected distribution of votes one month before each presidential election, picking the two highest modes, which will be the two candidates running for the second round, and calculated the distance of each mode to the overall median voter. Then we predicted the winner as the one who is closer to the overall median voter.