- Risk of data mining
- Overcrowding of factors post publication
- Unrealistic trading cost expectations
- Misunderstanding of downside shocks
- Misunderstanding of the diversification benefits of factors
Risk of Data Mining
Hundreds of factors have been identified in the literature, many of which could be the result of data mining exercises. However, most of them can be ignored because they cannot pass the five tests detailed in “Your Complete Guide to Factor-Based Investing.” In order to be considered for investment, a factor must provide incremental explanatory power to portfolio returns and have delivered a premium (higher returns). Additionally, the factor must have the following characteristics:
- Persistent—It holds across long periods of time and different economic regimes.
- Pervasive—It holds across countries, regions, sectors and even asset classes.
- Robust—It holds for various definitions (for example, there is a value premium, whether it is measured by price-to-book [P/B], earnings, cash flow or sales).
- Investable—It holds up not just on paper but also after considering actual implementation issues, such as trading costs.
- Intuitive—There are logical risk-based or behavioral-based explanations for its premium and why it should continue to exist.
Overcrowding
First, it’s important to note that if there are risk-based explanations for a factor’s historical premium, risk cannot be arbitraged away. For example, while the market beta premium is well known, no one expects it to disappear. However, popularity can lead to cash flows, rising valuations, and a shrinking premium. This is important to note because so much of the criticism about factor investing ignores the fact that what is true for other factors can also be true of market beta (e.g., it can experience large drawdowns and long periods of underperformance).
That said, popularity (resulting in cash flows) can reduce the size of premiums. Andrew Miller wrote about this in his February 2017 post “Will ETFs Destroy Factor Investing? Nope.” R. David McLean and Jeffrey Pontiff, authors of the 2016 study “Does Academic Research Destroy Stock Return Predictability?” reexamined 97 factors published in tier-one academic journals and were able to replicate the reported results for only 85 of them. They also found that following publication, the average factor’s return decays by about 32 percent; returns do not decay to zero but remain positive. Of course, some premiums (those that could not meet our established criteria) could have disappeared entirely, while others remained relatively unchanged. McLean and Pontiff also found that factor-based portfolios containing stocks that are costlier to arbitrage decline less post-publication. This is consistent with the idea that costs limit arbitrage and protect mispricing. They state: “Decay, as opposed to disappearance, will occur if frictions prevent arbitrage from fully eliminating mispricing.” We have further research on the issue of overcrowding.
Martin Lettau, Sydney Ludvigson and Paulo Manoel contribute to the literature with their December 2018 study “Characteristics of Mutual Fund Portfolios: Where Are the Value Funds?”, which provides a comprehensive analysis of portfolios of active mutual funds, ETFs and hedge funds through the lens of risk (anomaly) factors such as size, value and momentum. Among the questions they try to answer are: To what extent do active fund managers exploit these factor premia? If there are limits to arbitrage, do active funds contribute to the existence of these anomalies, or do they overweight underpriced stocks? Among their important findings was that neither mutual funds nor ETFs systematically tilt their portfolios toward profitable factors, such as high book-to-market (BtM) ratios, high momentum, small size, high profitability and low investment growth. In fact, for some factors, mutual funds target the low-return leg of long/short factor portfolios rather than the high-return leg. This bias is especially strong for BtM ratios. In fact, they found that there are virtually no high-BtM funds in the sample, while there are many low-BtM “growth” funds. For example, only seven out of 2,657 funds in their sample have a BtM score in the fourth quintile or above. Supporting evidence comes from David Blitz, who demonstrated in his February 2017 paper “Are Exchange-Traded Funds Harvesting Factor Premiums?” that while some ETFs are specifically designed for harvesting factor premiums, other ETFs implicitly go against these factors. Specifically, Blitz found the following:
From a factor-investing perspective, smart-beta ETFs tend to provide the right factor exposures, while conventional ETFs tend to be on the other side of the trade with the wrong factor exposures. In other words, these two groups of investors are essentially betting against each other.
As further evidence, in 1994, right after the publication of the famous Fama-French research on the value premium, the P/B of U.S. large growth stocks was 2.1 times that of large value stocks, and their price-to-earnings (P/E) spread was 1.5. (Data is from Dimensional.) Using Vanguard Russell 1000 Growth ETF (VONG) and Vanguard Russell 1000 Value ETF (VONV), we find that as of January 31, 2019, the P/B spread had actually widened to 3.2, and the P/E spread was basically unchanged at 1.4. At least here, we see no evidence that cash flows have caused the value premium to narrow. The bottom line is, it appears that despite what many investors believe, a massive net inflow into value stocks relative to growth stocks has not occurred. We now turn to the concern that trading costs are underestimated.
Unrealistic Trading Costs
The role of liquidity is an important one when considering investment because while there may be anomalies that result in mispricings, in order to profit from those anomalies, investors must be able to exploit them after accounting for all the expenses of the effort. Andrea Frazzini, Ronen Israel and Tobias Moskowitz address the issue of implementation costs in their April 2018 study “Trading Costs.” In his review of the paper, Alpha Architect’s Wes Gray called it “The Best Research Paper Ever Written on Trading Costs.” The study’s database consisted of $1.7 trillion of live-executed equity trades from a large money manager, AQR Capital Management. It covered the period August 1998 to June 2016 as well as 21 developed equity markets and almost 10,000 stocks. Frazzini, Israel and Moskowitz measured the real-world trading costs and price impact function incurred by AQR, which trades portfolios based on many of the anomalies discovered in the academic literature, providing a unique look into how trading costs vary globally across trade type, size and exchange. The authors identified the real-time price impact of a trade at various trade sizes. It’s important to note AQR’s trades were all made in a manner seeking to lower execution costs using a proprietary trading algorithm that, importantly, does not make any buy or sell decisions. The algorithm decides how patiently to trade (minutes versus days), but not what to trade (or not to trade). The following is a summary of their findings:
- Trading costs (bid-offer spreads and commissions), including market impact costs, have exhibited a steady decline over time across markets, though they did jump during the financial crisis (2007-2009) before resuming their decline. Some of the decline is driven by technological events, such as moving to decimalization in traded prices.
- The average bid-ask spread at the time of order arrival is 21.33 basis points (bps). However, it was rare for AQR’s trades to incur the full spread, or even half the spread, because of the passive limit orders. The main cost the firm’s trades faced was market (price) impact.
- The estimate of market impact is just under 9 bps, on average, for all trades completed within a day. The median cost is a bit lower at just more than 6 bps, suggesting that trading costs are positively skewed by more expensive trades.
- When weighting trades by their dollar value, the value-weighted mean is higher, at just more than 15 bps, for market impact. The largest trades are the most expensive trades.
- Costs are larger for smaller stocks and stocks with greater idiosyncratic risk, consistent with theories of market-maker inventory risk raising price impact. Without controlling for trade size, the average large-cap stock trade generates almost 9 bps of market impact costs compared to almost 19 bps for small-cap stocks.
- Buying to go long generates about 12.5 bps of price impact, while buying to cover has 15.5 bps of price impact. Short-selling is slightly more expensive than selling long, by 0.6 bps on average, but the difference is not statistically significant; in other words, there is no marked difference in trading costs between selling a long position and selling short. If short-selling is indeed costlier, it is likely to be a function of opportunity cost (i.e., not being able to short) or of lending fees for stocks on special.
- The average trade experiences an additional 4 bps of market impact on the stocks traded relative to stocks of similar characteristics (BtM ratio, market cap and momentum) not traded by the algorithm that day. This likely is due to immediacy of demand and, thus, a temporary outcome that will be reversed.
- The patterns and estimates were similar across the 21 different equity markets studied.
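The gap between the median, the simple mean and the dollar-weighted mean reported above can be illustrated with a minimal sketch. The trade sizes and impact figures below are hypothetical numbers chosen only to show the pattern; they are not data from the Frazzini, Israel and Moskowitz study, and the function name `impact_summary` is an assumption made for illustration.

```python
from statistics import mean, median

# Hypothetical trades: (dollar value, market impact in bps).
# Illustrative values only, not taken from the study.
trades = [
    (1_000_000, 3.0),
    (2_000_000, 5.0),
    (3_000_000, 6.0),
    (10_000_000, 12.0),
    (40_000_000, 20.0),
]

def impact_summary(trades):
    """Return (equal-weighted mean, median, dollar-weighted mean) impact in bps."""
    sizes = [size for size, _ in trades]
    impacts = [bps for _, bps in trades]
    # Weight each trade's impact by its dollar value.
    value_weighted = sum(size * bps for size, bps in trades) / sum(sizes)
    return mean(impacts), median(impacts), value_weighted

eq_mean, med, vw_mean = impact_summary(trades)
print(f"median: {med:.1f} bps, mean: {eq_mean:.1f} bps, value-weighted: {vw_mean:.1f} bps")
```

When a few large trades carry the highest impact, the median sits below the simple mean, and the dollar-weighted mean sits above both, which is the same ordering the study reports.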
Investors Underestimate Downside Risks
In their article, Arnott, Harvey, Kalesnik and Linnainmaa note that most factor returns stray far from a normal distribution, being prone to large drawdowns. Because factors have excess kurtosis (fat tails), it is a mistake to use simple risk management tools that ignore the tail behavior. Too many investors believe that creating a portfolio of factors will eliminate the extreme tail behavior. This is a dangerous misperception. They examined 15 major factors (market beta, size, value, momentum, operating profitability, low beta, idiosyncratic volatility, short-term reversals, long-term reversals, illiquidity, accruals, cash flow to price, earnings to price, long-term reversals and net share issuance). The table below shows the theoretical frequency with which the maximum drawdowns (MDDs) would occur if the return distributions were normally distributed. As you can see, the largest monthly drawdowns that actually occurred varied from once in 106 years (for long-term reversals) to once in 4.7 quadrillion (10^15) years (in the case of operating profitability). Yet, they each occurred in the space of just 45 years.
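To see how "once in N years" figures of this kind can arise, here is a minimal sketch that assumes monthly returns are normally distributed and asks how often a single month would fall a given number of standard deviations below the mean. It is a simplified illustration, not the authors' calculation; the function name `years_between_events` and the z-scores in the loop are assumptions chosen for the example.

```python
import math

def years_between_events(z_score: float, periods_per_year: int = 12) -> float:
    """Expected years between periods whose return is at least z_score
    standard deviations below the mean, assuming normal returns."""
    # Lower-tail probability P(Z <= -z_score) for a standard normal variable.
    tail_prob = 0.5 * math.erfc(z_score / math.sqrt(2))
    # Expected events per year is tail_prob * periods_per_year; invert it.
    return 1.0 / (tail_prob * periods_per_year)

# Hypothetical z-scores, for illustration only.
for z in (2, 3, 5, 8):
    print(f"{z}-sigma monthly loss: about once every {years_between_events(z):.3g} years")
```

Under the normal assumption the implied waiting time grows explosively with the size of the loss, so a fat-tailed factor that suffers such a loss within a few decades looks almost impossible on paper. That is the sense in which risk tools built on normality understate tail risk.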
Correlations of Factors
Arnott, Harvey, Kalesnik and Linnainmaa again explain:
Investors should be careful not to be fooled by the long-term factor return averages, because the market betas of factors vary widely over time. The value factor, for example, typically correlates negatively with the market. During the global financial crisis, however, the value factor correlated positively and significantly with the market, performing poorly as the markets tumbled and soaring as the stock markets rebounded.
However, it’s important to understand that doesn’t mean that diversification across factors is not effective over the long term (the horizon that should matter for long-term investors). Consider the findings of a 2011 study by Clifford S. Asness, Roni Israelov and John M. Liew, “International Diversification Works (Eventually).” The authors explain that those who focus on the fact that globally diversified portfolios don’t protect investors from short systematic crashes miss the greater point that investors whose planning horizon is long term (and it should be, or you shouldn’t be invested in stocks to begin with) should care more about long-drawn-out bear markets, which can be significantly more damaging to their wealth. In their study, which covered the period 1950-2008 and 22 developed market countries, the authors examined the benefit of diversification over long-term holding periods. They found that over the long run, markets don’t exhibit the same tendency to suffer or crash together. Thus, investors shouldn’t allow short-term failures to blind them to long-term benefits. To demonstrate this point, they decomposed returns into two pieces: (1) a component due to multiple expansion (or contraction) and (2) a component due to economic performance. They found that while short-term stock returns tend to be dominated by (1), long-term stock returns tend to be dominated by (2). They explained that these results,
…are consistent with the idea that a sharp decrease in investors’ risk appetite (i.e., a panic) can explain markets crashing at the same time. However, these risk aversion shocks seem to be a short-lived phenomenon. Over the long run, economic performance drives returns.
They further showed that “Countries exhibit significant idiosyncratic variation in long-run economic performance. Thus, country specific (not global) long-run economic performance is the most important determinant of long-run returns.” What is true of countries is also true of factors that meet the criteria we established: they both exhibit significant idiosyncratic variation in long-run economic performance. Thus, diversification across both countries and factors is the prudent strategy. And discipline, the ability to stick to a plan, absorbing the short-term pain, is the key to success in investing. If there’s no pain, there’s no premium. The bottom line is that investors must accept the risks that factor premiums, especially those tied to economic cycle risks, can crash at the same time during crises. However, that doesn’t mean that one should not diversify across factors. Instead, it means that it’s important that investors include a sufficient amount of safe fixed income assets in the portfolio to dampen the risk of the overall portfolio to an acceptable level.
Summary
Arnott, Harvey, Kalesnik and Linnainmaa provide investors with important information about factor-based investing. They demonstrated five unique risks that investors need to be cognizant of when investing in factors, and why investors who ignore those risks could underperform the factors themselves. That knowledge can help you avoid making mistakes. In addition, knowledge helps provide the discipline that is critical to successful investing. As Warren Buffett proclaimed:
Investors who are informed about the risks of their portfolio are far more likely to stay the course during the inevitable periods when it will perform poorly. The bottom line is that factor investing, done well, can improve the odds of achieving your goals. As supporting evidence, consider the findings in the table below. It shows the live returns of the factor-based funds with the longest track record, those of Dimensional, and compares their returns with those of the market-like portfolios of the premier provider of index-based strategies, Vanguard. We’ll look at data for the longest period that both the factor-based fund of Dimensional and the total market fund of Vanguard have been available. Using live funds allows us to account for both fund expenses and trading costs. Data is from Portfolio Visualizer. (Full disclosure: My firm, Buckingham Strategic Wealth, recommends Dimensional funds in constructing client portfolios.)