Working Paper: CEPR ID: DP12926
Authors: Martin Lettau; Markus Pelger
Abstract: We develop an estimator for latent factors in a large-dimensional panel of financial data that can explain expected excess returns. Statistical factor analysis based on Principal Component Analysis (PCA) has problems identifying factors with a small variance that are important for asset pricing. We generalize PCA with a penalty term accounting for the pricing error in expected returns. Our estimator searches for factors that can explain both the expected return and covariance structure. We derive the statistical properties of the new estimator and show that our estimator can find asset-pricing factors, which cannot be detected with PCA, even if a large amount of data is available. Applying the approach to portfolio data we find factors with Sharpe-ratios more than twice as large as those based on conventional PCA and with significantly smaller pricing errors.
Keywords: Cross Section of Returns; Anomalies; Expected Returns; High-Dimensional Data; Latent Factors; Weak Factors; PCA
JEL Codes: C14; C38; C52; C58; G12
Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.
Cause | Effect |
---|---|
RPPCA (R50) | latent asset-pricing factors (G19) |
RPPCA (R50) | Sharpe ratios of identified factors (G11) |
RPPCA (R50) | time-series covariation and cross-section of mean returns (C32) |
inclusion of mean information (C46) | estimation of latent factors (C51) |
RPPCA (R50) | pricing information (D49) |
RPPCA (R50) | out-of-sample pricing errors (C52) |
RPPCA (R50) | out-of-sample pricing performance (G19) |