Hostname: page-component-745bb68f8f-mzp66 Total loading time: 0 Render date: 2025-02-11T07:14:06.385Z Has data issue: false hasContentIssue false

The PollyVote Popular Vote Forecast for the 2020 US Presidential Election

Published online by Cambridge University Press:  15 October 2020

J. Scott Armstrong
Affiliation:
University of Pennsylvania
Andreas Graefe
Affiliation:
Macromedia University of Applied Sciences, Germany
Rights & Permissions [Opens in a new window]

Abstract

Type
Forecasting the 2020 US Elections
Copyright
© The Author(s), 2020. Published by Cambridge University Press on behalf of the American Political Science Association

In 2004, the PollyVote project (www.pollyvote.com) was launched with the goal of applying evidence-based principles to election forecasting, as well as to track different forecasting methods over time and in various settings to learn about their relative accuracy. The original idea of the project was to demonstrate the power of combining forecasts in reducing error, which is a well-established finding in forecasting research (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014).

Combining forecasts has two major advantages. First, the relative accuracy of different forecasting methods often varies across time. That is, methods that have worked well in one election might not work well in another, and vice versa. Combining forecasts thus reduces bias by protecting forecast users from choosing a poor forecast and instead enabling them to incorporate more information. Second, combining reduces error by canceling out systematic and random errors of individual forecasts, an effect that is particularly strong if the individual forecasts rely on different methods and data.

To date, the PollyVote has demonstrated the benefits of combining forecasts by providing highly accurate forecasts for the four US presidential elections from 2004 to 2016 (Graefe et al. Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017) and the two German federal elections in 2013 and 2017 (Graefe Reference Graefe2019). An analysis across the last 100 days before the seven US elections from 1992 to 2016 found that the PollyVote’s popular-vote forecast missed the final vote shares by 1.1 percentage points on average, which was lower than the error of any individual-component forecast (Graefe et al. Reference Graefe, Armstrong, Jones, Cuzán, Cavari, Powell and Mayer2017).Footnote 1

THE POLLYVOTE METHOD OF COMBINING FORECASTS

For its 2020 popular-vote prediction, the PollyVote uses a hierarchical approach to combining forecasts. The basic idea is to combine forecasts within and across four categories of methods: (I) vote intentions (polls), (II) expectations, (III) models, and (IV) naïve forecasts. Forecasts are grouped in these categories by the type of underlying data. The basic idea of this approach is to create conditions under which combining forecasts is most useful (Graefe et al. Reference Graefe, Armstrong, Jones and Cuzán2014). Whenever forecasts are combined, the PollyVote calculates simple unweighted averages. The forecasts from the PollyVote and its components are shown in table 1 (Graefe Reference Graefe2020b).

Table 1 Forecasts of the PollyVote and Its Components: Trump Two-Party Vote

Vote Intentions: Polls

When people speak of polling results, they commonly refer to respondents’ answers to the vote-intention question, which asks respondents for whom they would vote if the election were held today. Strictly speaking, polls do not provide forecasts of what will happen, particularly if the election is still far ahead. Rather, polls merely provide snapshots of public opinion at a certain point in time. The PollyVote incorporates these data by averaging daily estimates of three poll aggregators (i.e., FiveThirtyEight, RealClearPolitics, and The Economist).

Expectations

Methods that rely on people’s expectations elicit and aggregate their judgmental forecasts of what will happen on Election Day. The 2020 PollyVote combines forecasts from three different expectations-based methods: betting markets, expert judgment, and citizen forecasts.

Betting markets, also known as prediction markets, allow people to bet money on the election outcome, whereby the betting odds reflect a forecast of a given candidate’s vote share or probability of winning (Gruca and Rietz Reference Gruca and Rietz2020). A review of forecast accuracy of betting markets in different countries found that these markets tend to outperform forecasts made by polls, models, and experts, whereas relative performance compared to simply asking voters who will win (i.e., citizen forecasts) was mixed (Graefe Reference Graefe, Arzheimer, Evans and Lewis-Beck2017a). This review, however, did not include the 2016 US presidential election, when betting markets performed poorly (Graefe Reference Graefe2017b). The 2020 PollyVote combines forecasts from two markets that provide estimates of the popular vote: the Iowa Electronic Markets and Predictit.

Expert judgment is probably one of the oldest methods to predict election outcomes. Since 2004, the PollyVote has periodically asked a panel of experts to predict the popular two-party vote in US presidential elections. An analysis of the past four elections from 2004 to 2016 found that the forecasts of the majority of 452 individual experts correctly predicted the directional error of polls. That said, the typical expert’s forecast error was 7% higher than estimates of a polling average. The results also showed that the experts did not sufficiently incorporate information from so-called election fundamentals (Graefe Reference Graefe2018).

Citizen forecasts are based on simply asking survey respondents who they think is going to win (Murr and Lewis-Beck Reference Murr and Lewis-Beck2020). Although often overlooked and—unfortunately—rarely asked by pollsters, responses to this question are highly predictive of election outcomes, particularly when converted to vote-share forecasts. When analyzing data for the seven US presidential elections from 1988 to 2012, Graefe (Reference Graefe, Armstrong, Jones and Cuzán2014) found that citizen forecasts were more accurate than forecasts from polls, betting markets, and models. For example, compared to a typical poll, citizen forecasts decreased forecast error approximately by half (Graefe Reference Graefe2014).

Models

The 2020 PollyVote combines forecasts from various models, which are assigned to different categories depending on the underlying data. At the top level, the PollyVote distinguishes models that are purely retrospective, purely prospective, and those classified as mixed because they incorporate both types of information.

Retrospective models view an election solely as a referendum on the incumbent government’s performance by assuming that voters reward incumbents for good performance and punish them otherwise. Some models (e.g., Fair Reference Fair2009) measure performance by focusing exclusively on structural variables—for example, by using one or more macroeconomic variables (e.g., GDP and unemployment).Footnote 2 The PollyVote classifies these models, which completely ignore public-opinion variables, as fundamentals-only models. Other models (e.g., Lewis-Beck and Tien Reference Lewis-Beck and Tien2020) additionally incorporate poll-based variables (e.g., presidential job approval) as proxies for voters’ satisfaction with the incumbent’s handling of both economic and noneconomic issues. These models are classified as fundamentals-plus models. The PollyVote’s retrospective models’ component is calculated as the simple average of fundamentals-only and fundamentals-plus models.

Prospective models build on the idea that voters have different expectations of their personal—or the nation’s—future, depending on who will win. Therefore, voters are expected to vote for the candidate with whom they expect to be better off. The PollyVote combines forecasts from two models that fall into that category: the Big-Issue model (Graefe and Armstrong Reference Graefe and Armstrong2012) and the Issues and Leaders model (Graefe Reference Graefe2020a).

Mixed models are those that incorporate both retrospective and prospective variables. For example, any model that uses economic fundamentals as well as trial-heat polls would be classified as a mixed model. Examples include the Erikson and Wlezien (Reference Erikson and Wlezien2020) and Lichtman (Reference Lichtman2020) models and those published by media outlets (e.g., Economist and FiveThirtyEight).

Naïve Forecasts

Complexity tends to harm forecast accuracy. That is, the accuracy of simple models, such as a naïve no-change model, is often difficult to beat by more complex models. One reason for using a no-change model may be that a forecaster concludes from prior knowledge that the situation is stable. Another reason may be that, in many situations, prior knowledge is insufficient to improve on a no-change forecast (Green and Armstrong Reference Green and Armstrong2015). The 2020 PollyVote averages forecasts from two naïve models: Norpoth’s (Reference Norpoth2014) electoral-cycle model and a 50/50 model. Norpoth’s electoral cycle predicts the election outcome based on electoral history by using the vote of the two most recent elections as predictor variables in a multiple-regression model estimated on data from 1828 to 2016. The 50/50 model assumes that both major-party candidates will gain 50% of the popular vote and thus represents the age of political polarization. By including a naïve component, the PollyVote adheres to the principle of conservatism in forecasting by acknowledging the situation’s underlying uncertainty (Armstrong, Green, and Graefe Reference Armstrong, Green and Graefe2015).

THE POLLYVOTE 2020 POPULAR-VOTE FORECAST

As of August 21, 2020, the combined PollyVote predicts that Joe Biden will gain 52.1% of the popular two-party vote compared to 47.9% for President Trump. Given the PollyVote’s historical error, the estimated chance for Biden to win the most votes is 96%. Three of the four component methods are in agreement about Biden’s advantage, with poll aggregators being the most bullish for Biden (i.e., an almost 9-point lead). Only the naïve forecasts component predicts a narrow advantage for Trump, which in effect dampens Biden’s lead in the combined PollyVote forecast, thereby making the forecast more conservative.

DATA AVAILABILITY STATEMENT

Replication materials are available on Dataverse at https://doi.org/10.7910/DVN/RLECFV.

Footnotes

1. This analysis included four elections for which the PollyVote method was used ex post (1992–2000) and three elections for which the method was applied ex ante (2004–2016).

2. The forecast from the Fair (Reference Fair2009) model is based on assuming a 19.1% annual growth for the third quarter of 2020, as predicted by the Survey of Professional Forecasters by the Federal Reserve Bank of Philadelphia (Philadelphiafed 2020).

References

REFERENCES

Armstrong, J. Scott, Green, Kesten C., and Graefe, Andreas. 2015. “Golden Rule of Forecasting: Be Conservative.” Journal of Business Research 68 (8): 1717–31.Google Scholar
DeSart, Jay. 2020. “A Long-Range State-Level Forecast of the 2020 Presidential Election.” PS: Political Science & Politics. doi:10.1017/S1049096520001468.Google Scholar
Erikson, Robert S., and Wlezien, Christopher. 2020. “Forecasting the 2020 Presidential Election: Leading Economic Indicators, Polls, and the Vote.” PS: Political Science & Politics. doi:10.1017/S1049096520001481.Google Scholar
Fair, Ray C. 2009. “Presidential and Congressional Vote‐Share Equations.” American Journal of Political Science 53 (1): 5572.Google Scholar
Graefe, Andreas. 2014. “Accuracy of Vote Expectation Surveys in Forecasting Elections.” Public Opinion Quarterly 78 (S1): 204–32.Google Scholar
Graefe, Andreas. 2017a. “Political Markets.” In The SAGE Handbook of Electoral Behavior, ed. Arzheimer, Kai, Evans, Jocelyn, and Lewis-Beck, Michael S., 861–82. London: SAGE.Google Scholar
Graefe, Andreas. 2017b. “Prediction Market Performance in the 2016 US Presidential Election.” Foresight: The International Journal of Applied Forecasting 2017 (45): 3842.Google Scholar
Graefe, Andreas. 2018. “Predicting Elections: Experts, Polls, and Fundamentals.” Judgment and Decision Making 13 (4): 334–44.Google Scholar
Graefe, Andreas. 2019. “Accuracy of German Federal Election Forecasts, 2013 & 2017.” International Journal of Forecasting 35 (3): 868–77.Google Scholar
Graefe, Andreas. 2020a. “Of Issues and Leaders: Forecasting the 2020 US Presidential Election.” PS: Political Science & Politics. doi:10.1017/S1049096520001390.Google Scholar
Graefe, Andreas. 2020b. “Replication Data for: The PollyVote Popular-Vote Forecast for the 2020 US Presidential Election.” Harvard Dataverse. doi:10.7910/DVN/RLECFV.Google Scholar
Graefe, Andreas, and Armstrong, J. Scott. 2012. “Predicting Elections from the Most Important Issue: A Test of the Take-the-Best Heuristic.” Journal of Behavioral Decision Making 25 (1): 4148.Google Scholar
Graefe, Andreas, Armstrong, J. Scott, Jones, Randall J. Jr., and Cuzán, Alfred G.. 2014. “Combining Forecasts: An Application to Elections.” International Journal of Forecasting 30 (1): 4354.Google Scholar
Graefe, Andreas, Armstrong, J. Scott, Jones, Randall J. Jr., and Cuzán, Alfred G.. 2017. “Assessing the 2016 US Presidential Election Popular-Vote Forecasts.” In The 2016 Presidential Election: The Causes and Consequences of a Political Earthquake, ed. Cavari, Amnon, Powell, Richard J., and Mayer, Kenneth R., 137–58. Lanham, MD: Lexington Books.Google Scholar
Green, Kesten C., and Armstrong, J. Scott. 2015. “Simple Versus Complex Forecasting: The Evidence.” Journal of Business Research 68 (8): 1678–85.Google Scholar
Gruca, Thomas S., and Rietz, Thomas A.. 2020. “The (Re)Election of 2020 According to the Iowa Electronic Market: Politics, Pandemic, Recession and/or Racial Unrest?” PS: Political Science & Politics. doi:10.1017/S1049096520001419.Google Scholar
Lewis-Beck, Michael S., and Tien, Charles P.. 2020. “The Political Economy Model.” PS: Political Science & Politics . doi:10.1017/S1049096520001365.Google Scholar
Lichtman, Allan. 2020. “He Predicted Trump’s Win in 2016. Now He’s Ready to Call 2020.” New York Times. Accessed August 5, 2020.Google Scholar
Lockerbie, Brad. 2020. “Economic Pessimism and Political Punishment.” PS: Political Science & Politics. doi:10.1017/S1049096520001444 Google Scholar
Murr, Andreas, and Lewis-Beck, Michael S.. 2020. “Citizen Forecasting 2020: A State-by-State Experiment.” PS: Political Science & Politics 53 (4).Google Scholar
Norpoth, Helmut. 2014. “The Electoral Cycle.” PS: Political Science & Politics 47 (2): 332–35.Google Scholar
Philadelphiafed. 2020. “Third Quarter 2020 Survey of Professional Forecasters.” Federal Reserve Bank of Philadelphia. Accessed August 28, 2020.Google Scholar
Figure 0

Table 1 Forecasts of the PollyVote and Its Components: Trump Two-Party Vote

Supplementary material: Link

Armstrong and Graefe Dataset

Link