Working Paper: NBER ID: w19928
Authors: Charles Courtemanche; Joshua C. Pinkston; Jay Stewart
Abstract: We propose a new method for using validation data to correct self-reported weight and height in surveys that do not measure respondents. The standard correction in prior research regresses actual measures on reported values using an external validation dataset, and then uses the estimated coefficients to predict actual measures in the primary dataset. This approach requires the strong assumption that the expectations of measured weight and height conditional on the reported values are the same in both datasets. In contrast, we use percentile ranks rather than levels of reported weight and height. Our approach requires the weaker assumption that the conditional expectations of actual measures are increasing in reported values in both samples. This makes our correction more robust to differences in measurement error across surveys as long as both surveys represent the same population. We examine three nationally representative datasets and find that misreporting appears to be sensitive to differences in survey context. When we compare predicted BMI distributions using the two validation approaches, we find that the standard correction is affected by differences in misreporting while our correction is not. Finally, we present several examples that demonstrate the potential importance of our correction for future econometric analyses and estimates of obesity rates.
Keywords: Obesity; Measurement Error; BMI; Self-Reported Data; Econometric Analysis
JEL Codes: C18; I1
Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.
Cause | Effect |
---|---|
reported measures of height and weight (I10) | actual measures of height and weight (I10) |
standard correction method (C10) | BMI distributions (C46) |
percentile rank method (C46) | BMI distributions (C46) |
percentile rank method (C46) | measured BMI distribution (C46) |
correcting measurement error (C83) | accurate empirical analysis (C51) |
misreporting differences (C52) | standard correction method (C10) |
differences in survey context (C83) | misreporting (C59) |