Complexity in Factor Pricing Models

Working Paper: NBER ID: w31689

Authors: Antoine Didisheim; Shikun Barry Ke; Bryan T. Kelly; Semyon Malamud

Abstract: We theoretically characterize the behavior of machine learning asset pricing models. We prove that expected out-of-sample model performance—in terms of SDF Sharpe ratio and test asset pricing errors—is improving in model parameterization (or “complexity”). Our empirical findings verify the theoretically predicted “virtue of complexity” in the cross-section of stock returns. Models with an extremely large number of factors (more than the number of training observations or base assets) outperform simpler alternatives by a large margin.

Keywords: machine learning; asset pricing; model complexity; stochastic discount factor

JEL Codes: C1; C4; C58; G1; G10; G12; G14; G17

Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.

Causal Claims

Cause	Effect
model complexity (C52)	out-of-sample performance (C52)
model complexity (C52)	pricing errors (D49)
model complexity (C52)	SDF Sharpe ratio (G17)
increasing number of factors (C39)	pricing errors (D49)
increasing number of factors (C39)	SDF Sharpe ratio (G17)
model complexity (C52)	outperforming simpler alternatives (C52)

Back to index