Formal Study of Overfitting in Trading System Design

Subscribe to newsletter

Trading systems often experience performance deterioration after going live, largely due to overfitting. Reference [1] formally studied this issue, using analytical approximations for the in-sample and out-of-sample Sharpe ratios of portfolios. The authors pointed out,

This paper derives analytical approximations for the in-sample and out-of-sample Sharpe ratios of portfolios constructed using linear prediction models. We show that increasing either the number of signals or assets too much makes this procedure susceptible to overfitting and thereby yields wildly overestimated in-sample Sharpe ratios.

We show that low true Sharpe ratio signals are particularly vulnerable to overfitting. Conversely, by extending the length of the in-sample period one can reduce the overfitting risk, and can produce a higher replication ratio out of sample.

Subscribe to newsletter https://harbourfrontquant.beehiiv.com/subscribe Newsletter Covering Trading Strategies, Risk Management, Financial Derivatives, Career Perspectives, and More

We test our results on commodity futures using momentum-style signals and find that allowing AR(1) signals and non-Normal signals/residuals does not significantly impact the validity of our results. In particular, once we match the theoretical out-of-sample Sharpe ratio to the observed value, we see that the replication ratio is primarily a function of the out-of-sample Sharpe ratio and the curves for the AR(1) signals closely matches those of iid signals.

From this analysis, it seems that the best way to minimize the potential for overfitting is to minimize the number of signals and assets that are being used for any predictive model used to trade, and utilize the largest amount of data possible.

In summary, the paper formally demonstrated that to minimize the risk of overfitting, one should,

  1. Keep models as simple as possible,
  2. Use the longest sensible backtest period available,
  3. Develop systems with high Sharpe ratios, and
  4. Rely on fewer signals.

While we completely agree with points #1 and #2, our experience casts doubt on points #3 and #4.

Let us know what you think in the comments below or in the discussion forum.

References

[1] Antoine Jacquier, Johannes Muhle-Karbe, Joseph Mulligan, In-Sample and Out-of-Sample Sharpe Ratios for Linear Predictive Models, 2025, arXiv:2501.03938

Further questions

What's your question? Ask it in the discussion forum

Have an answer to the questions below? Post it here or in the forum

LATEST NEWSUS tech stocks slide as traders fret over ‘frothy’ AI valuations
US tech stocks slide as traders fret over ‘frothy’ AI valuations

Nasdaq lower as heavyweights including Nvidia and Microsoft sustain steep falls

Stay up-to-date with the latest news - click here
LATEST NEWSElon Musk attends Trump's dinner with Saudi crown prince
Elon Musk attends Trump's dinner with Saudi crown prince
Stay up-to-date with the latest news - click here
LATEST NEWSCenovus Energy announces $2.6 billion offering of senior notes
Cenovus Energy announces $2.6 billion offering of senior notes

The prospectus supplement, the corresponding base shelf prospectus and any amendment thereto in connection with the offering of Canadian Notes will be accessible through SEDAR+ within two business days. CALGARY, Alberta, Nov. 18, 2025 (GLOBE NEWSWIRE) — Cenovus Energy Inc. (“Cenovus”) (TSX: CVE) (NYSE: CVE)…

Stay up-to-date with the latest news - click here
LATEST NEWSTenon Medical CEO Foster buys $25k in TNON stock and warrants
Tenon Medical CEO Foster buys $25k in TNON stock and warrants
Stay up-to-date with the latest news - click here
LATEST NEWSOil Loses Ground With US Stockpiles and Sanction Risks in Focus
Oil Loses Ground With US Stockpiles and Sanction Risks in Focus

Oil fell as a report showing rising US stockpiles helped to offset concerns about the fallout from Western sanctions on Russia.

Stay up-to-date with the latest news - click here

Leave a Reply