Predicting Volatility: Getting the Most Out of Return Data Sampled at Different Frequencies

Working Paper: NBER ID: w10914

Authors: Eric Ghysels; Pedro Santaclara; Rossen Valkanov

Abstract: We consider various MIDAS (Mixed Data Sampling) regression models to predict volatility. The models differ in the specification of regressors (squared returns, absolute returns, realized volatility, realized power, and return ranges), in the use of daily or intra-daily (5-minute) data, and in the length of the past history included in the forecasts. The MIDAS framework allows us to compare models across all these dimensions in a very tightly parameterized fashion. Using equity return data, we find that daily realized power (involving 5-minute absolute returns) is the best predictor of future volatility (measured by increments in quadratic variation) and outperforms model based on realized volatility (i.e. past increments in quadratic variation). Surprisingly, the direct use of high-frequency (5-minute) data does not improve volatility predictions. Finally, daily lags of one to two months are sucient to capture the persistence in volatility. These findings hold both in- and out-of-sample.

Keywords: Volatility forecasting; MIDAS regression; High-frequency data

JEL Codes: G1

Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.

Causal Claims

Cause	Effect
Daily realized power (L94)	Future volatility (G17)
Daily ranges (R12)	Future volatility (G17)
High-frequency 5-minute data (Y10)	Future volatility (G17)
Past volatility (G17)	Future volatility forecasts (G17)

Back to index