Ensemble Methods for Causal Effects in Panel Data Settings

Working Paper: NBER ID: w25675

Authors: Susan Athey; Mohsen Bayati; Guido Imbens; Zhaonan Qu

Abstract: In many prediction problems researchers have found that combinations of prediction methods (“ensembles”) perform better than individual methods. A simple example is random forests, which combines predictions from many regression trees.\nA striking, and substantially more complex, example is the Netflix Prize competition where the winning entry combined predictions using a wide variety of conceptually very different models. In macro-economic forecasting researchers have often found that averaging predictions from different models leads to more accurate forecasts. \nIn this paper we apply these ideas to synthetic control type problems in panel data setting. In this setting a number of conceptually quite different methods have been developed, with some assuming correlations between units that are stable over time, others assuming stable time series patterns common to all units, and others using factor models. With data on state level GDP for 270 quarters, we focus on three basic approaches to predicting missing values, one from each of these strands of the literature. Rather than try to test the different models against each other and find a true model, we focus on combining predictions based on each of the separate models using ensemble methods. For the ensemble predictor we focus on a weighted average of the three individual methods, with non-negative weights determined through out-of-sample cross-validation.

Keywords: No keywords provided

JEL Codes: C01; C14


Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.


Causal Claims

CauseEffect
ensemble methods (C02)improved predictions (C53)
ensemble methods (C02)outperform individual prediction methods (C52)
ensemble methods (C02)estimate missing values (C51)
synthetic control methods + regression based on unconfoundedness + matrix completion methods (C51)ensemble method (C69)
ensemble method (C69)systematic performance relative to individual methods (C52)

Back to index