Optimizing Mean Variance Optimization

August 22nd, 2016 | Research Insights
(Last Updated On: August 18, 2017)

In the 1950s, Harry Markowitz proposed a method to identify the optimal trade-off between risk and return for a portfolio. The theory is broadly termed “Mean-Variance Optimization” (MVO).

Sam Wittig, a Drexel graduate I advised and who did some research for Alpha Architect, shared with us his undergraduate thesis project regarding Markowitz’s analysis.

Here is a link to Sam’s work: Shrinkage Theory for Portfolio Optimization with Correlated Geometric Brownian Motion. It’s open for debate, but this might just be the sexiest title for a paper ever. We’ll leave it up to our readers to decide.

Sam did a lot of great work on his thesis, and I thought it would be interesting to share two key findings with our readers.

Finding 1: Markowitz vs. Shrinkage

“The objective of this paper was to focus on the errors created by the sample covariance matrix within Markowitz portfolio optimization, and how “shrinkage” can be used to reduce the error within this matrix.” 

One key criticism of Markowitz’s MVO analysis is its sensitivity to estimation error. Specifically, the covariance matrix is estimated from a finite number of observations, so the “optimal” weights can be distorted by extreme values in a limited sample. We’ve mentioned this issue here and it is explained in detail in this paper.

A shrinkage estimator is a statistical technique that accepts some bias (its expected value differs from the true population parameter) in exchange for a less noisy estimate (i.e., lower variance). Sam tests whether a “shrinkage” estimator can improve the performance of MVO. He finds little evidence that the Shrinkage portfolio produces better results. Here is a formal paper on the subject with a great title, “Honey, I shrunk the sample covariance matrix.”
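To make the idea concrete, here is a minimal sketch of linear shrinkage. It is not Sam's implementation: the scaled-identity target, the hand-picked intensity `delta`, and the synthetic returns are all assumptions for illustration. The Ledoit and Wolf paper estimates the intensity from the data rather than fixing it by hand.

```python
import numpy as np

def shrink_covariance(returns, delta):
    """Blend the sample covariance with a scaled-identity target.

    returns: (T, N) array of asset returns.
    delta:   shrinkage intensity in [0, 1], fixed by hand here (an assumption).
    """
    sample = np.cov(returns, rowvar=False)       # N x N sample covariance
    n_assets = sample.shape[0]
    avg_var = np.trace(sample) / n_assets        # average sample variance
    target = avg_var * np.eye(n_assets)          # biased but low-noise target
    return delta * target + (1.0 - delta) * sample

# Synthetic data: 60 observations of 20 assets (illustrative only).
rng = np.random.default_rng(0)
returns = rng.normal(0.0, 0.01, size=(60, 20))

sample = np.cov(returns, rowvar=False)
shrunk = shrink_covariance(returns, delta=0.3)

# A lower condition number means the matrix is less sensitive when inverted,
# which is the step where MVO amplifies estimation error.
print("condition number, sample:", np.linalg.cond(sample))
print("condition number, shrunk:", np.linalg.cond(shrunk))
```

Shrinking pulls the extreme eigenvalues of the sample matrix toward their average, which is why the shrunk matrix is better conditioned. Whether that translates into better portfolios is exactly the question Sam tests.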

Finding 2: How much data is necessary to make accurate predictions?

“One of the main hypothesis behind any data analysis is that the larger the data set becomes, the more accurate and predictable future results will be…”

Is more always better? To test this question, we need a benchmark for comparison. Sam chooses the Ground Truth (GT) portfolio. “The GT portfolio was a Markowitz portfolio with the maximum amount of historical data available.” In other words, the GT portfolio should provide the most accurate results achievable from the limited historical data.

Sam tests several different sample periods: 3 months, 6 months, 1 year, 3 years, 5 years, and 10 years. He creates a simulation of the financial markets using Correlated Geometric Brownian Motion (CGBM). He then builds the Markowitz and Shrinkage portfolios from the simulated returns and compares their basic metrics (rate of return, volatility, Sharpe ratio, etc.) with those of the GT portfolio.
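For readers unfamiliar with CGBM, the sketch below shows one common way to simulate it: draw independent normal shocks, correlate them with a Cholesky factor of the covariance matrix, and exponentiate the cumulative log-returns. The function name `simulate_cgbm` and all parameter values (drifts, volatilities, the 0.6 correlation) are hypothetical, and this is not necessarily how Sam set up his simulation.

```python
import numpy as np

def simulate_cgbm(s0, mu, cov, n_steps, dt, seed=0):
    """Simulate correlated geometric Brownian motion price paths.

    s0:  (N,) initial prices
    mu:  (N,) annualized drifts
    cov: (N, N) annualized covariance of log-returns
    Returns an (n_steps + 1, N) array of prices; row 0 is s0.
    """
    s0 = np.asarray(s0, dtype=float)
    mu = np.asarray(mu, dtype=float)
    rng = np.random.default_rng(seed)
    chol = np.linalg.cholesky(cov)                 # cov = chol @ chol.T
    z = rng.standard_normal((n_steps, len(s0)))    # independent shocks
    shocks = (z @ chol.T) * np.sqrt(dt)            # correlated increments
    drift = (mu - 0.5 * np.diag(cov)) * dt         # Ito correction term
    log_paths = np.cumsum(drift + shocks, axis=0)
    log_paths = np.vstack([np.zeros(len(s0)), log_paths])
    return s0 * np.exp(log_paths)

# Two hypothetical assets with 20% and 30% volatility and 0.6 correlation.
s0 = np.array([100.0, 50.0])
mu = np.array([0.08, 0.05])
vols = np.array([0.2, 0.3])
corr = np.array([[1.0, 0.6], [0.6, 1.0]])
cov = np.outer(vols, vols) * corr

# Roughly 10 years of daily steps (252 trading days per year is an assumption).
prices = simulate_cgbm(s0, mu, cov, n_steps=2520, dt=1 / 252)
log_returns = np.diff(np.log(prices), axis=0)
print("sample correlation:", np.corrcoef(log_returns, rowvar=False)[0, 1])
```

Because the true parameters are known by construction, a simulation like this lets you measure how far a portfolio built from a short window drifts from one built with far more data.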

The table below shows the results:

[Table: Shrinkage Theory for Portfolio Optimization]

The long and short positions are extreme when the dataset is short (3 months and 6 months).

As the dataset length increases, the metrics begin to stabilize and converge toward the values of the GT portfolio.

So, Sam concludes that “as T increased, the predictability of the market improved.”
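The convergence pattern can be reproduced in a toy experiment. The sketch below is our own illustration, not Sam's test: it uses fully invested minimum-variance weights (which depend only on the covariance matrix) rather than a full mean-variance solution, draws returns from a multivariate normal instead of CGBM, and lets a hypothetical known covariance matrix stand in for the GT benchmark. The window lengths assume 21 trading days per month.

```python
import numpy as np

def min_variance_weights(cov):
    """Fully invested minimum-variance weights: inv(cov) 1 / (1' inv(cov) 1)."""
    ones = np.ones(cov.shape[0])
    raw = np.linalg.solve(cov, ones)
    return raw / raw.sum()

def mean_weight_error(true_cov, n_obs, n_trials, rng):
    """Average L1 distance between estimated and benchmark weights."""
    benchmark = min_variance_weights(true_cov)
    n_assets = true_cov.shape[0]
    errors = []
    for _ in range(n_trials):
        sample = rng.multivariate_normal(np.zeros(n_assets), true_cov, size=n_obs)
        est = min_variance_weights(np.cov(sample, rowvar=False))
        errors.append(np.abs(est - benchmark).sum())
    return float(np.mean(errors))

rng = np.random.default_rng(42)
n_assets = 10
# Hypothetical "true" daily covariance: 1% daily vol, 0.3 pairwise correlation.
corr = np.full((n_assets, n_assets), 0.3) + 0.7 * np.eye(n_assets)
true_cov = (0.01 ** 2) * corr

windows = {"3 months": 63, "6 months": 126, "1 year": 252,
           "3 years": 756, "5 years": 1260, "10 years": 2520}
errors = {label: mean_weight_error(true_cov, t, 50, rng)
          for label, t in windows.items()}

for label, err in errors.items():
    print(f"{label:>9}: mean |w_est - w_benchmark| = {err:.3f}")
```

In this setup the benchmark weights are equal across assets, so any deviation is pure estimation noise. The average error should shrink as the window grows, which mirrors the pattern in Sam's results under much simpler assumptions.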

Concluding Thoughts

From Sam’s essay:

Lastly, all who strive to understand Harry Markowitz’s portfolio optimization methods should think very hard about what they are trying to accomplish. Even the Nobel Prize winner himself was still not convinced of his own methods even decades later when he was discussing the 1987 stock market crash.

And we’ll leave the last word to Harry Markowitz:

“I should have computed the historical covariances of the asset classes and drawn an efficient frontier… I split my contributions 50/50 between stocks and bonds.” – Harry Markowitz



About the Author:

After serving as a Captain in the United States Marine Corps, Dr. Gray earned a PhD and worked as a finance professor at Drexel University. Dr. Gray’s interest in bridging the research gap between academia and industry led him to found Alpha Architect, an asset management firm that delivers affordable active exposures for tax-sensitive investors. Dr. Gray has published four books and a number of academic articles. Wes is a regular contributor to multiple industry outlets, including the Wall Street Journal, Forbes, ETF.com, and the CFA Institute. Dr. Gray earned an MBA and a PhD in finance from the University of Chicago and graduated magna cum laude with a BS from The Wharton School of the University of Pennsylvania.
  • W.J. Keller

    Thanks. As can be seen in our SSRN paper (papers.ssrn.com/sol3/papers.cfm?abstract_id=2606884), using Markowitz’s own computational CLA method for long-only portfolios, we could successfully use the simple past 12-month covariance matrix, often even for large N. Also, as we argued in our paper, using returns from 3-6 years often makes things worse because of reversal/value effects. Using 1-year momentum turns out to work great with MVO.