Should Copula Endogeneity Correction Include Generated Regressors for Higher-Order Terms? No, It Hurts

Working Paper: NBER ID: w29978

Authors: Yi Qian; Hui Xie; Anthony Koschmann

Abstract: Causal inference in empirical studies is often challenging because of the presence of endogenous regressors. The classical approach to the problem requires using instrumental variables that must satisfy the stringent condition of exclusion restriction. A forefront of recent research is a new paradigm of handling endogenous regressors without using instrumental variables. Park and Gupta (Marketing Science, 2012) proposed instrument-free estimation using copulas that has been increasingly used in practical applications to address endogeneity bias. A relevant issue not studied is how to handle the higher-order terms (e.g., interaction and quadratic terms) of endogenous regressors using the copula approach. Recent applications of the approach have used disparate ways of handling these higher-order endogenous terms with unclear consequences. We show that once copula correction terms for the main effects of endogenous regressors are included as generated regressors, there is no need to include additional correction terms for the higher-order terms. This simplicity in handling higher-order endogenous regression terms is a merit of the instrument-free copula bias correction approach. More importantly, adding these unnecessary correction terms has harmful effects and leads to sub-optimal solutions of endogeneity bias, including finite-sample estimation bias and substantially inflated variability in estimates.

Keywords: Copula; Endogeneity; Higher-Order Terms; Instrument-Free Estimation

JEL Codes: C01; C12


Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.


Causal Claims

CauseEffect
copula correction terms for first-order terms of endogenous regressors (C20)consistent estimates of structural regression model parameters (C51)
adding separate copula correction terms for higher-order terms (C10)finite-sample estimation bias (C51)
adding separate copula correction terms for higher-order terms (C10)inflated standard errors (C20)
inclusion of additional correction terms for higher-order endogenous variables (C51)collinearity issues (C30)
inclusion of unnecessary correction terms for higher-order endogenous regressors (C20)worse model performance (C52)
copula correction term for the main endogenous regressor (C20)accounts for endogeneity of higher-order terms (C51)
collinearity issues (C30)adversely affect precision of estimates (C51)

Back to index