How Reliable Is Out-of-Sample Testing?

Subscribe to newsletter

Out-of-sample testing is a critical component of designing and evaluating trading systems. Trading systems are often developed and optimized using historical data, which can lead to overfitting – a situation where the system is excessively tuned to past data, resulting in poor performance on new, unseen data. Out-of-sample testing involves evaluating the trading system on data that was not used in the development process, allowing traders to gauge the system’s performance on new data and assess its robustness to market changes. By testing the system on a separate and distinct dataset, traders can be more confident that the system’s performance is not simply due to chance or overfitting, and that it is more likely to perform well in future market conditions.

Out-of-sample testing is a crucial step in designing and evaluating trading systems, allowing traders to make more informed and effective decisions in dynamic and ever-changing financial markets. But is it free of well-known biases such as overfitting, data-snooping, and look-ahead? Reference [1] investigated this issue. It pointed out,

In this paper, we examine the sources of excessively large Sharpe ratios associated with popular multifactor asset pricing models. Sharpe ratios remain too large to reconcile with leading economic models after applying simple, robust estimates of tangency portfolio weights, as well as under conventional pseudo-out-of-sample research designs that rely only on past data. We argue that the most compelling explanation behind these excessive Sharpe ratios involves a subtle form of look-ahead bias such that factors included in models, or alternatively the characteristics and portfolios from which factors are extracted, are selected based on prior research outcomes linking such characteristics with cross-sectional variation in returns…

Subscribe to newsletter https://harbourfrontquant.beehiiv.com/subscribe Newsletter Covering Trading Strategies, Risk Management, Financial Derivatives, Career Perspectives, and More

Our results have a variety of implications. First, researchers should be cautious in interpreting common out-of-sample research designs as providing assessments of factor models that are free of hindsight bias, because the samples analyzed often overlap heavily with samples previously analyzed in the literature establishing anomalous return patterns. Given the continuous and organic nature of asset pricing research, it is difficult to conduct bias-free validation analyses, but our paper attempts to make progress in this direction. Second, we interpret the much smaller Sharpe ratios associated with popular multifactor models that we obtain using alternative evaluation approaches as good news. This is because real-time investors who ‘factor invest’ using these models after they are proposed do not achieve exorbitant Sharpe ratios.

In short, out-of-sample testing also suffers, albeit subtly, from biases such as overfitting, data-snooping, and look-ahead.

We agree with the authors. We also believe that out-of-sample tests such as walk-forward analysis also suffer from selection bias.

Then how do we minimize these biases?

Let us know what you think in the comments below or in the discussion forum.

References

[1] Easterwood, Sara and Paye, Bradley S., High on High Sharpe Ratios: Optimistically Biased Factor Model Assessments (2023). https://ssrn.com/abstract=4360788

Subscribe to newsletter https://harbourfrontquant.beehiiv.com/subscribe Newsletter Covering Trading Strategies, Risk Management, Financial Derivatives, Career Perspectives, and More

Further questions

What's your question? Ask it in the discussion forum

Have an answer to the questions below? Post it here or in the forum

LATEST NEWS

State of economywide tariffs on Canada unclear as Trump’s global trade war escalates

WASHINGTON — As U.S. President Donald Trump prepares to unveil his so-called “liberation day” plan to hit multiple countries with tariffs, it’s still not clear whether a temporary pause on separate economywide duties on Canada will be lifted. In early March, Trump imposed — and…

Stay up-to-date with the latest news - click here

LATEST NEWS

‘Like a spare tire’: Waterloo company launches backup option for mobile outages

When a service outage in July 2022 left millions of Rogers customers in the dark for up to 15 hours, it underscored the importance of being prepared in case of a similar emergency. For some, that meant having lifelines in place that don’t rely on…

Stay up-to-date with the latest news - click here

LATEST NEWS

Danish prime minister heads to Greenland as Trump seeks control of the Arctic territory

NUUK, Greenland (AP) — Danish Prime Minister Mette Frederiksen is traveling to Greenland on Wednesday for a three-day trip aimed at building trust and cooperation with Greenlandic officials at a time when the Trump administration is seeking control of the vast Arctic territory. Frederiksen announced…

Stay up-to-date with the latest news - click here

LATEST NEWS

MLB’s average salary tops $5 million for first time, AP study shows

NEW YORK (AP) — Major League Baseball’s average salary broke the $5 million barrier on opening day for the first time, according to a study by The Associated Press. The New York Mets, with Juan Soto’s record $61.9 million pay, led MLB for the third…

Stay up-to-date with the latest news - click here

LATEST NEWS

Rogers Communications and NHL announce $11-billion rights deal

TORONTO — Rogers Communications Inc. and the National Hockey League have announced a new 12-year agreement valued at $11 billion for the national media rights to NHL games on all platforms in Canada. The agreement is worth more than double the current rights deal between…

Stay up-to-date with the latest news - click here

Harbourfront Technologies

Further questions

About the Author

Harbourfront Technologies

Leave a Reply

Further questions

Additional reading

About the Author

Harbourfront Technologies

Leave a Reply