Replication Data for: Does Regression Produce Representative Estimates of Causal Effects?
Harvard Dataverse (Africa Rice Center, Bioversity International, CCAFS, CIAT, IFPRI, IRRI and WorldFish)
View Archive InfoField | Value | |
Title |
Replication Data for: Does Regression Produce Representative Estimates of Causal Effects?
|
|
Identifier |
https://doi.org/10.7910/DVN/29098
|
|
Creator |
Aronow, Peter M.
Samii, Cyrus |
|
Publisher |
Harvard Dataverse
|
|
Description |
With an unrepresentative sample, the estimate of a causal effect may fail to characterize how effects operate in the population of interest. What is less well understood is that conventional estimation practices for observational studies may produce the same problem even with a representative sample. Causal effects estimated via multiple regression differentially weight each unit's contribution. The ``effective sample'' that regression uses to generate the estimate may bear little resemblance to the population of interest, and the results may be nonrepresentative in a manner similar to what quasi-experimental methods or experiments with convenience samples produce. There is no general external validity basis for preferring multiple regression on representative samples over quasi-experimental or experimental methods. We show how to estimate the ``multiple regression weights'' that allow one to study the effective sample. We discuss alternative approaches that, under certain conditions, recover representative average causal effects. The requisite conditions cannot always be met.
|
|
Subject |
Social Sciences
Causal inference External validity Multiple regression Observational studies Randomized experiments |
|
Contributor |
Cyrus Samii
|
|
Type |
DTA, SHP, CSV, R
|
|