A Semiparametric Approach for Analyzing Nonignorable Missing Data

Working Paper: NBER ID: w16270

Authors: Hui Xie; Yi Qian; Leming Qu

Abstract: In missing data analysis, there is often a need to assess the sensitivity of key inferences to departures from untestable assumptions regarding the missing data process. Such sensitivity analysis often requires specifying a missing data model which commonly assumes parametric functional forms for the predictors of missingness. In this paper, we relax the parametric assumption and investigate the use of a generalized additive missing data model. We also consider the possibility of a non-linear relationship between missingness and the potentially missing outcome, whereas the existing literature commonly assumes a more restricted linear relationship. To avoid the computational complexity, we adopt an index approach for local sensitivity. We derive explicit formulas for the resulting semiparametric sensitivity index. The computation of the index is simple and completely avoids the need to repeatedly fit the semiparametric nonignorable model. Only estimates from the standard software analysis are required with a moderate amount of additional computation. Thus, the semiparametric index provides a fast and robust method to adjust the standard estimates for nonignorable missingness. An extensive simulation study is conducted to evaluate the effects of misspecifying the missing data model and to compare the performance of the proposed approach with the commonly used parametric approaches. The simulation study shows that the proposed method helps reduce bias that might arise from the misspecification of the functional forms of predictors in the missing data model. We illustrate the method in a Wage Offer dataset.

Keywords: generalized additive model; MNAR; semiparametric; joint selection model; nonignorability; sensitivity analysis

JEL Codes: C01; J16


Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.


Causal Claims

CauseEffect
GAM approach (C73)mitigates bias (D91)
GAM approach (C73)allows for more accurate assessments of nonignorable missingness (C30)
GAM approach (C73)captures nonlinear relationships between missingness and outcome (C34)
nonlinear relationships (C29)reduces bias in estimates (C51)

Back to index