The PollyVote was launched in 2004 as a long-term project to demonstrate that generalized principles derived from forecasting research in different fields could produce accurate election forecasts. The initial focus was – and still is – on combining forecasts (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014b). By now, the method of combining forecasts across different methods has been applied to the four US presidential elections in 2004 (Cuzán, Armstrong, and Jones Reference Cuzán, Armstrong and Jones2005), 2008 (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2009), 2012 (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014a), and 2016 (Graefe et al. Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017), as well as to the 2013 German federal election (Graefe Reference Graefe2015).
One of the project’s major goals is to learn about the relative accuracy of different forecasting methods over time, and in different settings. Our research to date shows that the relative accuracy of different methods varies substantially from one election to the next, which is after all a major reason why combining forecasts reduces error (Graefe et al. Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017). Another important goal is to track advances in forecasting research in other domains and apply them to election forecasting. This has led to adding index models (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014a) and citizen forecasts (Graefe et al. Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017) to the PollyVote for forecasting the US presidential elections in 2012 and 2016, respectively.
The present paper presents the PollyVote’s early forecast for the 2017 German federal election, calculated in March, about half a year before Election Day. We will provide daily updated forecasts until the election at www.pollyvote.de.
POLLYVOTE FORECAST
The procedure for calculating the combined PollyVote forecast has not changed since the 2013 German federal election (Graefe Reference Graefe2015). The combined PollyVote forecast is the simple average of the (already combined) forecasts from four component methods: polls, prediction markets, expert judgment, and models. Table 1 shows the vote share forecasts from the PollyVote and its four component methods for the six largest parties (i.e., CDU/CSU, SPD, Grüne, Linke, FDP, AfD), plus the vote of all remaining parties combined (Others).
Table 1 Vote-share Forecasts of the PollyVote and Its Components
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170612130158-26827-mediumThumb-S1049096517000439_tab1.jpg?pub-status=live)
PollyVote forecasts that the CDU/CDU will remain the strongest party. However, with a predicted vote share of 33.3%, the party of Angela Merkel would lose 8.3 percentage points, by far the largest loss of all parties. The main beneficiaries are the SPD and the AfD, each of which can expect substantial gains at 4.5 and 5.2 points, respectively. For the AfD, these numbers would mean that they would enter parliament for the first time in their young history. For the remaining parties, the expected changes are modest but, for one of them, critical. If the FDP manages to gain a mere 0.2 points, the party will pass the 5% threshold and return to parliament; the current prediction is that the party will gain 1.8 points.
As shown in table 1, there is wide agreement among the component methods regarding the vote share of each party. Notably, all four methods expect the six parties to pass the 5% threshold. However, one should exercise caution when reading these numbers. For one, the election is still far away and a lot can change over the course of the campaign. The 10-point increase in the SPD’s polling numbers within only four weeks after the party announced Martin Schulz as its front-runner in late January suggests that public opinion is volatile in this cycle. Furthermore, different methods can err in the same direction. In 2013, the four component methods consistently, unanimously, and wrongly predicted that the FDP would be represented in parliament. Another recent example is the 2016 US presidential election, when five of six methods overestimated Clinton’s vote share (Graefe et al. Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017).
The remainder of this paper describes the composition of the PollyVote’s four component methods. For details about how each of these methods work see previous project publications (e.g., Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014a, Reference Graefe, Armstrong, Jones and Cuzánb, Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017).
POLLS
German pollsters ask people for which party they intend to vote if the election was held this Sunday (which is why this question is known as the ‘Sonntagsfrage’). Polls measure public opinion at a certain point in time; they do not provide predictions of what will happen on Election Day. Interpreting polling results as predictions is thus misleading and particularly problematic if the election is still far away, because polls tend to vary widely over the course of the campaign. In addition, sampling problems, nonresponses, faulty processing and other sources of bias can result in high variance in the results of polls that were conducted by different survey organizations around the same time. Therefore, it is generally a good strategy to simply ignore single polls. Rather, one should aggregate and combine polls from different survey organizations, since the errors associated with individual polls tend to cancel out in the aggregate (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014b). That said, the value of poll aggregators is limited if there is systematic bias across all surveys (e.g., due to systematic non-response from a certain demographic group).
At the time of writing, two poll aggregators were available, which differ in which polls they use and how they weight them. Wahlumfrage.de calculates simple unweighted averages of the most recent polls conducted by seven established pollsters (i.e., Allensbach, Emnid, Forsa, Forschungsgruppe Wahlen, GMS, Infratest dimap, and INSA). Pollytix.de also includes polls from other survey institutes and assigns higher weights to (a) polls conducted more recently and (b) polls with larger sample sizes. The PollyVote’s combined polls component forecast shown in table 1 is the simple average of the forecasts from both poll aggregators.
PollyVote forecasts that the CDU/CDU will remain the strongest party. However, with a predicted vote share of 33.3%, the party of Angela Merkel would lose 8.3 percentage points, by far the largest loss of all parties. The main beneficiaries are the SPD and the AfD, each of which can expect substantial gains at 4.5 and 5.2 points, respectively.
PREDICTION MARKETS
Prediction markets use the price mechanism of the market to aggregate people’s expectations of how the election will turn out. For German elections, prediction markets typically offer contracts for each party, where the contract price reflects that party’s predicted vote (e.g., a price of 32.8 Euros for CDU/CSU means that the party is predicted to win 32.8% of the vote). Participants who think that the actual vote share will be higher (lower), should buy (sell) shares of that contract, and win or lose money depending on the accuracy of their predictions.
A review of prediction market accuracy of vote-share forecasts in different countries found that prediction markets tend to outperform forecasts made by polls, models, and experts; compared to simply asking citizens who will win, evidence was mixed (Graefe Reference Graefe, Arzheimer, Evans and Lewis-Beck2017a). However, that review did not include the 2016 US presidential election, a case when prediction markets provided relatively poor forecasts (Graefe Reference Graefe2017b).
At the time of writing, forecasts were available from the FAZ Orakel, a prediction market run by the online version of the Frankfurter Allgemeine Zeitung. On this market, participants can trade vote shares of the six largest parties plus the combined vote of all remaining parties. The FAZ Orakel is a so-called play-money market. That is, instead of using real money, the best performing participants can win prizes. As described by Graefe (Reference Graefe, Arzheimer, Evans and Lewis-Beck2017a), one limitation of play-money markets is that concerns of potential market manipulation are higher. That said, market manipulation does not seem to be a concern at the time of writing. As shown in table 1, the current market forecast does not show abnormalities but is in line with forecast from other methods.
EXPERT JUDGMENT
Experts can be expected to provide useful forecasts. Experts should, for example, be able to account for potential impacts of recent and future campaign events on public opinion polls (e.g., they may be able to assess the sustainability of the SPD’s rise in the polls). Some evidence suggests that this is the case. Jones and Cuzán (Reference Jones and Cuzán2013) found that experts outperformed polls for long-term forecasts.
Seventy members of the German Society for Electoral Studies were asked to participate in an online survey. That survey included only one question, which asked respondents to predict the vote shares for the parties listed in Table 1. In the first survey, which was conducted March 14–19 of 2017, 30 experts responded. Their combined forecast, which is the simple average of the individual expert forecasts for each party, is shown in table 1.
MODELS
Developing statistical models is an alternative to relying on people’s vote intention (i.e., polls) and expectations (i.e., prediction markets, expert judgment) when generating election forecasts. Such models typically rely on structural factors, the so-called fundamentals, that are predictive of election outcomes. For example, the incumbent government typically benefits from good economic conditions but loses support the longer it has been in power due to people’s desire for change. At the time of writing, forecasts from three models were available. The PollyVote’s combined models forecast shown in table 1 is the simple average of the three model forecasts.
The model by Jérôme, Jérôme-Speziari, and Lewis-Beck (Reference Jérôme, Jérôme-Speziari and Lewis-Beck2017) has been used since 1998 in modified form. The model uses a set of regression equations to forecast the individual parties’ vote shares based on various variables such as the unemployment rate and several poll-based measures.
The Chancellor model by Norpoth and Gschwend (Reference Norpoth and Gschwend2017), which has been around since the 2002 election, uses three variables to forecast the aggregate vote share of the outgoing coalition: (1) the outgoing coalition’s average vote share across the three preceding elections, (2) the support for the Chancellor in public opinion polls, and (3) attrition, measured as the number of terms in office.
The model by Kayser and Leininger (Reference Kayser and Leininger2017), a newcomer in this election, predicts the parties’ vote share in each state before aggregating the numbers to a forecast of the federal election outcome. The linear random effects model is based on the following information: the party’s vote share in the preceding federal and state election, whether the Chancellor was from that party, national quarterly GDP growth, and the number of years the chancellor has been in office. The authors provide two sets of model forecasts, which differ in how the state elections are weighted. One model assigns equal weights to all states while the other assigns higher weights to state elections that are closer to the federal election. The PollyVote uses the simple average of both sets of forecasts.