Why were the 2016 and 2020 election polls so inaccurate?

According to post-election analyses by the American Association for Public Opinion Research, the 2016 polling errors were driven primarily by nonresponse bias, inadequate weighting for education level, and a late shift of undecided voters toward Trump. In 2020, despite corrections for education weighting, similar nonresponse problems persisted, with Republicans and independents less likely to respond to pollsters than Democrats, leading AAPOR to call 2020 the least accurate presidential polling cycle in roughly 40 years.

What is nonresponse bias in election polling?

Nonresponse bias occurs when the people who choose to respond to a poll are systematically different from those who decline. In recent U.S. elections, research has indicated that Republican-leaning voters and those with lower trust in institutions were less likely to answer pollsters, meaning the sample of respondents did not accurately reflect the broader electorate. This imbalance can skew results toward one candidate or party even when pollsters apply demographic weighting.

What is the 'shy voter' effect in polling?

The 'shy voter' hypothesis holds that some voters conceal their true candidate preference from pollsters due to social pressure or perceived stigma, leading to an undercount of support for certain candidates. While this concept has been debated extensively in relation to U.S. elections, the evidence remains contested. The AAPOR task forces on both the 2016 and 2020 elections concluded that nonresponse bias and weighting failures were more significant drivers of polling error than respondents actively misreporting their preferences.

How do likely voter models affect polling accuracy?

Likely voter models are the methods pollsters use to screen respondents and estimate who will actually cast a ballot. Because voter turnout is inherently uncertain, these models involve judgment calls that can introduce error. A 2016 New York Times exercise demonstrated that when four different sets of analysts applied their own weighting and likely voter screens to identical raw data, results ranged from Clinton +4 to Trump +1, illustrating how much the choice of likely voter model can shape a poll's outcome.

Were 2024 election polls more accurate than in 2016 or 2020?

According to AAPOR's post-election analysis, 2024 polling showed a substantial improvement over 2020. The task force concluded that state-level polls were the most accurate since 1944 and that polls broadly reflected a close race between Kamala Harris and Donald Trump. However, the report also noted that polls again underestimated Republican vote share, a pattern observed in three consecutive Trump-era elections, characterizing it as a 'lingering challenge' for the polling industry.

Political Analysis

Why Political Polls Sometimes Get Elections Wrong

From nonresponse bias to faulty likely-voter models, a rigorous look at the structural forces that cause polls to miss electoral outcomes.

May 24, 2025 9 min read Editorial Team

Political polls are among the most closely watched instruments in democratic life, yet they have produced some of the most consequential forecasting failures of modern times. The 2016 U.S. presidential election shocked observers worldwide when Donald Trump defeated Hillary Clinton despite most national polls showing Clinton leading. In 2020, polls again overstated Democratic support — an outcome the American Association for Public Opinion Research characterized as the least accurate national polling in roughly 40 years. Understanding why political polls sometimes get elections wrong requires examining a set of interconnected methodological, behavioral, and structural challenges that have resisted easy fixes despite years of scrutiny from researchers and polling organizations alike.

Background

A Brief History of Polling Errors in Major Elections

The history of election polling is punctuated by a small number of highly visible failures that have shaped how the field evaluates itself. The most dramatic early example came in 1948, when the major polling organizations — including Gallup — predicted that Republican Thomas Dewey would defeat incumbent President Harry Truman. As documented in academic reviews of polling methodology, that failure was attributed largely to non-random sampling and to the practice of stopping data collection weeks before Election Day, thereby missing a late surge toward Truman.

The discipline largely recovered from 1948 and performed with reasonable accuracy through many subsequent election cycles. However, the 2016 U.S. presidential contest reignited concern about election poll accuracy. Research published by the American Association for Public Opinion Research found that 2016 national polls, while directionally off, were closer to the final outcome than widely perceived; the more significant errors occurred in state-level polls, particularly in key Midwestern states that proved decisive in the Electoral College. In 2020, errors at both the state and national level were broader. The AAPOR post-election task force described the 2020 results as the polls’ worst performance in presidential elections in approximately four decades. The 2024 cycle offered a partial correction, with AAPOR concluding that state-level polls were the most accurate since 1944, though a consistent pattern of underestimating Republican vote share persisted for the third consecutive presidential election in which Trump was a candidate.

Documented Polling Pattern — AAPOR Findings

According to the AAPOR task force reports, polls underestimated Republican vote share in four of the six presidential elections between 2004 and 2024 — specifically in 2004, 2016, 2020, and 2024. The organization characterized this as a “lingering challenge” for the industry rather than a resolved problem, even after pollsters implemented methodological adjustments following each cycle’s post-mortem reviews.

Methodology

The Sampling Problem: Who Picks Up the Phone?

One of the most foundational challenges in election polling is obtaining a sample of respondents that accurately reflects the electorate. Telephone surveys — once the gold standard of political polling — have faced dramatically declining response rates as caller ID, call screening, and the shift away from landline phones have made it progressively harder to reach potential respondents. Research cited by Suffolk University political science professor Rachael Cobb found that where pollsters once might need to contact around 20 people to complete one interview, the figure has roughly doubled in recent years, making each completed poll more expensive and potentially more selective in who it captures.

When those who respond to polls differ systematically from those who do not, the resulting data contains what researchers call nonresponse bias. A report commissioned by AAPOR found that in both the 2016 and 2020 elections, Republicans and independents were less likely to respond to pollsters than Democrats, and that among Republican respondents, those who did answer were less likely to support Trump than those who refused. This created a structural tilt in the data that demographic weighting alone could not fully correct, because the political preferences of non-respondents within demographic groups differed from those who agreed to participate.

Erin Caldwell, a survey research principal with more than three decades of experience conducting civic surveys, has noted that political polls face a compounding version of this problem. Pollsters rely on voter registration files to identify potential respondents, but within that pool only a fraction will actually vote. The challenge is not simply reaching registered voters but identifying which among them will turn out — a question that requires predicting behavior rather than merely recording stated intent.

Voter Modeling

Likely Voter Screens and the Limits of Turnout Prediction

Because not every registered voter participates in each election, pollsters use what are known as likely voter models — screening criteria designed to filter respondents down to those deemed most probable to cast a ballot. These models typically incorporate factors such as past voting history, stated intention to vote, and interest in the current election. However, the models involve significant judgment, and different methodological choices can yield starkly different results from the same underlying data.

A 2016 exercise published by The New York Times illustrated this vividly: four separate teams of analysts were each given identical raw polling data from a Florida survey and asked to process it using their own weighting and likely voter methodologies. The four resulting estimates ranged from Clinton leading by four percentage points to Trump leading by one — a spread of five points from the same raw numbers. The exercise demonstrated, as the Rasmussen Reports commentary on the AAPOR task force findings noted, that likely voter modeling done by rational, expert analysts can result in significant divergence, and that in close elections these differences can determine which direction a poll points.

Turnout can also be affected by events outside pollsters’ control. The COVID-19 pandemic significantly altered voting behavior in 2020, with an unprecedented expansion of mail-in voting in many states and record-setting overall turnout. AAPOR noted that the states with higher pandemic case rates at the time of polling showed larger polling errors, suggesting that the extraordinary circumstances complicated the task of modeling who would actually vote.

National Polling Error by Presidential Cycle

Average absolute error in final national polls vs. certified results — Source: AAPOR post-election task force analyses

Voter Behavior

Late-Deciding Voters and the Problem of Election Day Uncertainty

Even a methodologically sound poll conducted weeks before an election cannot account for shifts in voter sentiment that occur in the final days of a campaign. In 2016, AAPOR’s post-election analysis found that an unusually high proportion of voters — as many as 13 percent, according to some estimates at the time — were still undecided or considering third-party options on Election Day, and that among those who decided in the final week, a majority ultimately voted for Trump. Polls completed several days earlier had no way to capture these last-minute movements.

The late-deciding phenomenon was less pronounced in 2020, with only around 4 percent of poll respondents in final state-level surveys giving an answer other than Biden or Trump when asked their preference in the closing two weeks, according to the AAPOR task force report on that election. This suggested that a different set of factors — primarily nonresponse bias — drove the 2020 errors rather than a repeat of the 2016 undecided-voter dynamic.

The relationship between stated intention and actual behavior adds another layer of complexity. A poll can accurately record what a respondent says they plan to do without that intention translating into a vote. Party registration files show that Democrats historically report higher participation intentions in polls but, according to survey research specialist Erin Caldwell, are in some contexts less likely to follow through on voting day. Republicans have shown the reverse pattern: lower reported poll participation but higher rates of actual ballot-casting. These behavioral asymmetries mean that even a well-sampled poll may not translate directly into an accurate electoral forecast.

Psychology

Social Desirability Bias and the Contested “Shy Voter” Hypothesis

One of the most discussed explanations for polling errors in recent elections is the possibility that some voters conceal their true preferences from pollsters. This concept — related to what social scientists call social desirability bias, or the tendency to give answers perceived as socially acceptable rather than honest ones — has been applied to Trump-era elections under the label of the “shy Trump voter” hypothesis. The argument holds that some individuals who intended to vote for Trump were reluctant to say so in a survey context, leading to an artificial undercount of his support.

The empirical evidence on this hypothesis is contested. A nationally representative list experiment conducted by researcher Alexander Coppock in 2016, involving more than 5,000 American adults, found no statistically significant evidence that social desirability was causing respondents to understate their Trump support; the list experiment estimate of Trump support was slightly lower, not higher, than the direct question estimate. However, research published in the Journal of Behavioral and Experimental Economics by Andy Brownback and Aaron Novotny found marginally significant evidence that social desirability responses caused some respondents to understate agreement with Trump and overstate agreement with Clinton. The AAPOR task forces on both the 2016 and 2020 elections ultimately concluded that nonresponse bias and weighting failures were the more significant contributors to polling error, while characterizing the shy-voter question as warranting continued research rather than settled.

Context — Institutional Trust and Survey Participation

Research referenced by AAPOR and reporting by The Conversation has suggested that working-class white voters and Republican-leaning respondents were underrepresented in multiple recent election polls not primarily because they hid their preferences, but because they were less likely to participate in surveys at all — a pattern linked in part to lower trust in mainstream institutions, including the media organizations and academic bodies that conduct many polls. This self-selection dynamic is distinct from the shy-voter hypothesis and poses different methodological challenges for pollsters attempting to correct it.

Weighting & Representation

Marcus Brathwaite

See Full Bio

Pages: 1 2