Interpolation and Backdating with a Large Information Set

Working Paper: CEPR ID: DP4533

Authors: Elena Angelini; Jrme Henry; Massimiliano Marcellino

Abstract: Existing methods for data interpolation or backdating are either univariate or based on a very limited number of series, due to data and computing constraints that were binding until the recent past. Nowadays large datasets are readily available, and models with hundreds of parameters are fastly estimated. We model these large datasets with a factor model, and develop an interpolation method that exploits the estimated factors as an efficient summary of all the available information. The method is compared with existing standard approaches from a theoretical point of view, by means of Monte Carlo simulations, and also when applied to actual macroeconomic series. The results indicate that our method is more robust to model misspecification, although traditional multivariate methods also work well while univariate approaches are systematically outperformed. When interpolated series are subsequently used in econometric analyses, biases can emerge, depending on the type of interpolation but again be reduced with multivariate approaches, including factor-based ones.

Keywords: factor model; interpolation; kalman filter; spline

JEL Codes: C32; C43; C82

Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.

Causal Claims

Cause	Effect
factor-based interpolation method (C51)	lower mean squared errors (MSE) (C51)
factor-based interpolation method (C51)	lower mean absolute errors (MAE) (C52)
factor-based interpolation method (C51)	robustness to model misspecification (C50)
univariate methods (C29)	higher mean squared errors (MSE) (C20)
univariate methods (C29)	higher mean absolute errors (MAE) (C52)
type of interpolation employed (C51)	emergence of biases (D91)
multivariate approaches (C39)	mitigation of biases (D91)

Back to index