Empirical Asset Pricing via Machine Learning

Working Paper: NBER ID: w25398

Authors: Shihao Gu; Bryan Kelly; Dacheng Xiu

Abstract: We perform a comparative analysis of machine learning methods for the canonical problem of empirical asset pricing: measuring asset risk premia. We demonstrate large economic gains to investors using machine learning forecasts, in some cases doubling the performance of leading regression-based strategies from the literature. We identify the best performing methods (trees and neural networks) and trace their predictive gains to allowance of nonlinear predictor interactions that are missed by other methods. All methods agree on the same set of dominant predictive signals which includes variations on momentum, liquidity, and volatility. Improved risk premium measurement through machine learning simplifies the investigation into economic mechanisms of asset pricing and highlights the value of machine learning in financial innovation.

Keywords: machine learning; asset pricing; risk premia; predictive accuracy

JEL Codes: C45; C55; C58; G11; G12

Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.

Causal Claims

Cause	Effect
Machine learning methods (C45)	Predictive accuracy in measuring equity risk premia (C52)
Machine learning forecasts (C53)	Investment performance (G11)
Inclusion of nonlinear interactions among predictors (C32)	Improved return predictions (G17)
Predictor variables (price trends, liquidity, volatility) (G17)	Expected returns (G17)

Back to index