Heaping-Induced Bias in Regression Discontinuity Designs

Working Paper: NBER ID: w17408

Authors: Alan I. Barreca; Jason M. Lindo; Glen R. Waddell

Abstract: This study uses Monte Carlo simulations to demonstrate that regression-discontinuity designs arrive at biased estimates when attributes related to outcomes predict heaping in the running variable. After showing that our usual diagnostics are poorly suited to identifying this type of problem, we provide alternatives. We also demonstrate how the magnitude and direction of the bias varies with bandwidth choice and the location of the data heaps relative to the treatment threshold. Finally, we discuss approaches to correcting for this type of problem before considering these issues in several non-simulated environments.

Keywords: Regression Discontinuity; Heaping; Causal Inference

JEL Codes: C14; C21; I12


Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.


Causal Claims

CauseEffect
Dropping observations at data heaps (C55)Unbiased estimates for continuous types (C46)
Nonrandom heaping in regression discontinuity designs (C22)Regression discontinuity estimates are biased (C22)
Choice of bandwidth (C46)Magnitude and direction of bias (C46)
Heaps not located near the treatment threshold (L99)Nonrandom heaping can introduce bias (C83)

Back to index