Record Details

Replication Data for: Measuring Swing Voters with a Supervised Machine Learning Ensemble

Harvard Dataverse (Africa Rice Center, Bioversity International, CCAFS, CIAT, IFPRI, IRRI and WorldFish)

View Archive Info
 
 
Field Value
 
Title Replication Data for: Measuring Swing Voters with a Supervised Machine Learning Ensemble
 
Identifier https://doi.org/10.7910/DVN/CLQY6O
 
Creator Hare, Christopher
Kutsuris, Mikayla
 
Publisher Harvard Dataverse
 
Description Theory has long suggested that swing voting is a response to cross-pressures arising from a mix of individual attributes and contextual factors. Unfortunately, existing regression-based approaches are ill-suited to explore the complex combinations of demographic, policy, and political factors that produce swing voters in American elections. This gap between theory and practice motivates our use of an ensemble of supervised machine learning methods to predict swing voters in the 2012, 2016, and 2020 US presidential elections. The results from the learning ensemble substantiate the existence of swing voters in contemporary American elections. Specifically, we demonstrate that the learning ensemble produces well-calibrated and externally valid predictions of swing voter propensity in later elections and for related behaviors such as split-ticket voting. Though interpreting black-box models is more challenging, they can nonetheless provide meaningful substantive insights meriting further exploration. Here, we use flexible model-agnostic tools to perturb the ensemble and demonstrate that cross-pressures (particularly those involving ideological and policy-related considerations) are essential to accurately predict swing voters.

This Dataverse entry provides the code necessary to run the ensemble function and replicate the results from the paper. See README.txt for further instructions.
 
Subject Social Sciences
Voting behavior
Swing voters
Supervised machine learning
Ensemble models
 
Contributor Hare, Christopher