Demand Estimation with Machine Learning and Model Combination

Working Paper: NBER ID: w20955

Authors: Patrick Bajari; Denis Nekipelov; Stephen P. Ryan; Miaoyu Yang

Abstract: We survey and apply several techniques from the statistical and computer science literature to the problem of demand estimation. We derive novel asymptotic properties for several of these models. To improve out-of-sample prediction accuracy and obtain parametric rates of convergence, we propose a method of combining the underlying models via linear regression. Our method has several appealing features: it is robust to a large number of potentially-collinear regressors; it scales easily to very large data sets; the machine learning methods combine model selection and estimation; and the method can flexibly approximate arbitrary non-linear functions, even when the set of regressors is high dimensional and we also allow for fixed effects. We illustrate our method using a standard scanner panel data set to estimate promotional lift and find that our estimates are considerably more accurate in out of sample predictions of demand than some commonly used alternatives. While demand estimation is our motivating application, these methods are likely to be useful in other microeconometric problems.

Keywords: demand estimation; machine learning; model combination; promotional lift; econometrics

JEL Codes: C14; C53; C55

Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.

Causal Claims

Cause	Effect
promotional activities (M31)	demand outcomes (P47)
machine learning models (C45)	demand prediction accuracy (C53)
promotional activities (M31)	promotional lift (J62)
larger datasets (C55)	reduced bias in causal estimates (C21)
machine learning methods (C45)	marketing effectiveness insights (M31)

Back to index