The Bootstrap Test - How Significant Are Your Back-Testing Results - Au.Tra.Sy Blog - Automated Trading System
As mentioned in the Evidence-based Technical Analysis review post, the main value of the book lies in its presentation of two methods for computing the statistical significance of trading strategy results despite having only a single sample of data:
Both methods solve the problem of estimating the degree of random variation in a test statistic when there is only a single sample of data
and, therefore, only a single value of the test statistic.
Today, let’s look at the bootstrap test, with a practical application of it.
In very brief terms, the concept uses hypothesis testing to verify whether the test statistic (such as mean return of the back-testing sample) is
statistically significant. This is done by establishing the p-value of the test statistic based on the sampling distribution. (Aronson covers the basics of
statistical analysis earlier in the book. I have also mentioned previously The Cartoon Guide to Statistics, which covers these concepts too)
The problem with back-testing is that the results generated represent a single sample, which does not provide any information on the sample
statistic’s variability and its sampling distribution. This is where bootstrapping comes in: by systematically and randomly resampling the single
available sample many times, it is possible to approximate the shape of the sampling distribution (and therefore calculate the p-value of the test
statistic).
The bootstrap uses the daily returns of a back-test (run on detrended data) and performs a resampling with replacement.
In practice:
1. A back-test is run on detrended data and the mean daily return, based on n observations, is calculated.
2. The mean daily return is subtracted from each day’s return (zero-centering). This gives a set of adjusted returns.
3. For each resample, select n instances of adjusted returns, at random (with replacement), and calculate their mean daily return (bootstrapped
mean).
4. Perform a large number of resamples to generate a large number of bootstrapped means.
5. Form the sampling distribution of the means generated in the step above.
6. Derive the p-value of the initial back-test mean return (non zero-centered) based on the sampling distribution.
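The six steps above can be sketched in a few lines of Python. This is a minimal illustration, not code from Aronson's book; the function and parameter names are my own:

```python
import numpy as np

def bootstrap_pvalue(daily_returns, n_resamples=10_000, seed=0):
    """One-sided bootstrap test of H0: mean daily return is zero.

    A minimal sketch of the six-step procedure described in the post.
    """
    rng = np.random.default_rng(seed)
    returns = np.asarray(daily_returns, dtype=float)
    n = len(returns)
    observed_mean = returns.mean()          # step 1: back-test mean

    adjusted = returns - observed_mean      # step 2: zero-centering

    # steps 3-5: resample n observations with replacement, many times,
    # and collect the mean of each resample
    resamples = rng.choice(adjusted, size=(n_resamples, n), replace=True)
    bootstrapped_means = resamples.mean(axis=1)

    # step 6: p-value = fraction of bootstrapped means at or above the
    # observed (non-centered) mean
    return float((bootstrapped_means >= observed_mean).mean())
```

The practical application below follows exactly this shape, with 5,120 observations and 10,000 resamples.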
A Practical Application
To illustrate the concept, we can look at a back-test and apply the bootstrap method to its daily return series. I decided to look at a back-test I
presented in Better Trend Following via improved Roll Yield. Remember: a standard 50/20 Moving Average cross-over system applied to Crude Oil
was improved by adding a roll yield optimisation process.
In that instance, the benchmark is the standard strategy and we want to check that the strategy improvement was not the result of random chance. In Aronson’s book, benchmarking is achieved by detrending the data. However, this case is different, as the benchmark is the standard strategy. The improved strategy results can be thought of as two distinct parts: the returns attributable to the Trend Following component and those attributable to the Roll Yield improvement.
I therefore generated a composite, “Roll Yield-only” equity curve (by removing from the improved strategy equity curve the returns that could be
attributed to the Trend Following component). I then computed the daily returns based on that equity curve.
1. This set of daily returns is the original sample of 5120 observations, with an arithmetic mean of 0.216%.
2. Subtracting 0.216% from all 5120 returns adjusts those returns (zero-centering), ready to be picked for resampling.
3. The 10,000 resamples all pick at random, with replacement, 5120 observations from the zero-centered, adjusted returns. A mean is computed
for each resample.
4. Each of the resample means is used to form the sampling distribution of the mean return.
5. The last step is the comparison of the non-adjusted original sample mean (0.216%) to the sampling distribution to establish the p-value, which
is 0.006 in this example.
Once the p-value is obtained, it is simply a matter of deciding which threshold qualifies for statistical significance. Scientists usually determine the
statistical significance threshold at 0.05 (i.e. the null hypothesis is rejected for any p-value less than or equal to 0.05).
Proving that the arithmetic mean return is significantly positive, and deducing that the trading system is therefore profitable, is flawed reasoning. It is ironic that Aronson spends quite a lot of time on logical reasoning and the usual traps people fall into, only to present a flawed deduction himself. To use an example from the book:
A dog having four legs (a profitable rule having a positive mean arithmetic return) does not imply that any four-legged animal is a dog
(ie. any rule with a positive mean arithmetic return is not necessarily profitable).
On the other hand, any profitable rule has a positive mean geometric return, and any rule with a positive mean geometric return is profitable. On that basis, using the mean geometric return as the test statistic in the bootstrap should be more appropriate.
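Swapping the test statistic is straightforward. A minimal sketch of the geometric mean computation (a common formulation, not code from the book):

```python
import numpy as np

def geometric_mean_return(daily_returns):
    """Mean geometric return: the constant per-period return that
    compounds to the same terminal equity as the actual series."""
    growth = 1.0 + np.asarray(daily_returns, dtype=float)
    # Averaging log growth factors avoids overflow on long series
    return float(np.exp(np.log(growth).mean()) - 1.0)
```

In the bootstrap, this would simply replace the arithmetic mean when computing the observed statistic and the resample statistics; exactly how to zero-center under a geometric statistic is a modelling choice the post leaves open.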
31 Comments so far
For this specific post: What do you think about the de-trending? Always had mixed feelings about that.
Kim
Thanks Kim!
Re: detrending, I have mixed feelings about it too (see http://www.automated-trading-system.com/detrending-for-trend-following/ for an earlier post)
Aronson shows that detrending is benchmarking based on position bias (ie. avoid favoring rules with bullish biases during bull markets for
example) and therefore allows for comparing different rules on “equal footing”. There is a point there…
He also quotes a study done by Timothy Masters comparing the bootstrap and Monte Carlo methods and concluding that the two methods give
similar results, when both are run on detrended data.
I have not done enough testing to have a strong opinion on both and I would welcome additional comments from more experienced readers.
My initial thoughts are that at least you have to be aware of the position bias issue and decide how much it would affect the rules you’re testing
(for example a Long-only strategy is likely to outperform a Short-only strategy during a bull market – a Long/Short Trend Following strategy
goes equally likely long or short and should not suffer from position bias as much…)
My big question after reading the book is: but aren’t I trying to exploit a bias in the data? Sure, omitting short trades when trading only the Nasdaq in the 90’s is over-optimizing. However, if there is a systematic bias in the data (e.g. roll yield), then by detrending you are distorting reality.
I had the feeling Aronson used the arithmetic mean since permuting daily returns could produce the same arithmetic but a different geometric mean return – but that is not the case, as a quick simulation in Excel confirmed.
Paolo
Motomoto // Aug 12, 2010 at 3:22 am
While discussing the mathematics is well above my pay grade – I have to agree with KF.
When asking how significant the back-testing results are, aren’t you really trying to look at how good the back-testing results are at replicating what I could have profited from by following the system in the real world? It seems that if you then start chopping up the data, detrending and then running multiple tests, you are complicating things with tests that distort what actually happens. These are not randomly generated numbers based on a population; they are meant to be a ‘live time’ simulation of a series of trades – trades that may be related to previous trades… i.e. the market trends – it does not revert back to a mean or average?… (as I said, my statistical knowledge is sub-standard)
I agree, this can seem counter-intuitive to reshuffle the results like this and I do have mixed feelings about some aspects of the approach –
detrending for instance (as I mentioned before – especially for strategies like trend following).
You also allude to dependency in the results – which is obviously discarded in this sort of approach and might indeed be a weakness of
the methodology. Unfortunately I have yet to see a model of trading results dependency – in which case it could probably be integrated to that
method (ie some sort of conditional/random resampling to produce a more sophisticated testing method).
The problem is that the outcome of a trading strategy is a stochastic process with a large part of randomness, and therefore with only one set of
back-testing results (single sample), you cannot assert how much of the performance can be attributed to randomness.
And I think the bootstrap, amongst other methods, tries (with some weaknesses that you highlight) to address this issue to identify whether the
performance is more likely coming from random luck or strategy value. Not the holy grail of trading results significance checking but probably
a good tool in the system development toolbox.
There are long debates on the practice of back-testing and this issue is surely one of them!… ;-)
Lou, this can seem confusing at first indeed (but it makes sense, it’s all about checking the variance in the process)…
H0 is the hypothesis that the mean return is zero, so you need a distribution with zero mean.
What the bootstrap does is build such a distribution using the actual data from the test, in order to have variance/deviations in line with the process being tested.
After the zero-mean distribution is built you perform a standard statistical significance test by comparing the data tested (mean return) against
the zero-mean distribution. This is a non-parametric method, but an analogy would be to calculate how many standard deviations the mean
tested lies at.
You only reject H0 if the mean return is in the top x%. If the process has high variance, it is likely that H0 will not be rejected (at the x% level) because the mean return will not be “far enough” to the right.
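The standard-deviation analogy above can be sketched with a parametric shortcut. The observed mean is the 0.216% from the post, but the spread of the bootstrapped means here is an assumed, made-up number:

```python
import math

# Hypothetical numbers: observed mean from the post, plus an ASSUMED
# standard deviation for the zero-centered bootstrap distribution of means
observed_mean = 0.00216   # 0.216% daily mean return
bootstrap_std = 0.00086   # assumed spread of the bootstrapped means

# Parametric analogy: how many "standard deviations" out the mean lies
z = observed_mean / bootstrap_std

# One-sided normal tail probability, via the complementary error function
p_approx = 0.5 * math.erfc(z / math.sqrt(2.0))
```

With these made-up numbers the normal approximation lands near the 0.006 p-value quoted in the post, but the bootstrap itself makes no normality assumption – that is its point.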
Jez,
I think that the test you’ve created only shows that the mean does not come from the bootstrapped sample of (x – mean).
Hi Lou,
Not sure what you mean. I am only describing the bootstrap test as per Aronson’s book. Are you saying that the test is flawed or that there is
some mistake in the illustration?
I was not very clear/accurate in my description/comment above but in effect the test simulates a zero-mean sample process with similar
variance to the process (back-)tested. If we run that sample process a large amount of times, we know that the expected mean (ie mean of
means) will be zero, but we can also check how far and how frequently the data spreads to the right/left of the mean (as shown graphically in
the distribution).
If 95% of the means are below the back-test observed mean, there is statistical significance (at 95% confidence level) that the back-test’s
profitability was not the result of random variation from a zero-mean process with similar variance (which is H0, which can then be rejected).
I don’t know anything about Aronson. Maybe you can post or send a link to a relevant excerpt on applying bootstrapping to hypothesis tests.
So, now we have a distribution of tree tops (x – mean) and recorded values for negative trees.
If we want to test a full tree, one that hasn’t been cut (detrended) against the distribution of cut trees then our hypothesis test is Ho: tree(uncut)
= tree(cut).
If we reject Ho we’ve inferred that the uncut tree did not come from the cut tree sample.
That’s all that you’re doing in your example. You’ve inferred that the mean did not come from the detrended distribution.
The 1000 trees only represent a sample from the total population of trees (for which we do not know the true average height).
We want to know if the sample average height (15 feet) is statistically significant to infer that the true average height (for all trees) is different
from 0 (which is our H0 hypothesis: true average tree height = 0).
One problem is that we only have one sample forest (our 1000 trees). We need to have many more sample forests to establish the sampling
distribution of the average height.
Moreover, if our assumption is true (H0 = the true tree average height is 0), then the sample forest is skewed/biased upwards. So we need to
adjust it (by cutting the tree tops) to “zero-center” it (and meet H0’s condition).
We can now create many forests/resamples from the adjusted trees and calculate the average height from each resample to establish the
bootstrapped sampling distribution of the average height.
The mean average height will be zero, but some samples’ average heights will be 15 feet or over. The number/frequency of these samples with average heights of 15ft+ gives us an indication of how rare they are. The rarer they are in the sampling distribution, the less likely our initial sample’s average height was due to random variation from a zero-mean population: i.e. the total population is unlikely to have a zero mean given that our sample has a mean of 15 feet.
It is mostly related to variance within the sample: if all trees are 15 feet +/- 2 inches, it is very likely that the true average height is different from 0. In the bootstrap test, very few or no resampled means (zero-centered) will reach 15 feet or more: H0 is rejected.
If all trees are 15 feet +/- 100 feet (allowing negative trees), we have much less certainty about the true average height being different from 0 (as the 15 feet average could just be the result of random variation). In the bootstrap test, a much larger number of resampled means (zero-centered) will reach 15 feet or more: H0 cannot be rejected.
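This variance effect is easy to check numerically. A hypothetical sketch (the ~1% mean and the two spreads are made-up numbers standing in for the two tree forests):

```python
import numpy as np

rng = np.random.default_rng(0)

def bootstrap_p(returns, n_resamples=5_000):
    # Zero-center, resample with replacement, count means >= observed
    returns = np.asarray(returns, dtype=float)
    observed = returns.mean()
    adjusted = returns - observed
    means = rng.choice(adjusted, size=(n_resamples, len(adjusted)),
                       replace=True).mean(axis=1)
    return float((means >= observed).mean())

# Two made-up series with the same ~1% mean but very different spreads
low_var = rng.normal(0.01, 0.005, size=250)   # "15 feet +/- 2 inches"
high_var = rng.normal(0.01, 0.20, size=250)   # "15 feet +/- 100 feet"

p_low = bootstrap_p(low_var)    # tiny: H0 rejected
p_high = bootstrap_p(high_var)  # larger: H0 much harder to reject
```

Same observed mean, but only the low-variance series lets us rule out random variation from a zero-mean process.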
Apologies as I do not have any good links on this topic to refer you to. I feel maybe we should “branch” out to a discussion offline on this
(email if you prefer) if you need…
I do think Aronson’s book does a good job of explaining the concept (obviously in a longer form) and I thought I managed to put a clear
synthesis of the ideas… Maybe not so clear.
Jez,
It seems that we’re talking past each other. My issue isn’t with bootstrapping. My issue with your original example is that you are contrasting
the mean return with a sample distribution made up of (x – mean) observations and then stating that “…the bootstrap tests for the null
hypothesis that the rule does not have any predictive power”.
This sounds really vague and in any case I’m pretty sure that you haven’t found out anything about “predictive power”. I think it would help to
know exactly what Ho: and Ha: are and what they have to do with predictive power.
Lou,
Yes – sorry that we don’t seem to understand each other…
“the bootstrap tests for the null hypothesis that the rule does not have any predictive power” is a quote from Aronson’s book. In it he equates
this to “the null hypothesis that the arithmetic mean return of the rule being back-tested is zero”
So
H0: back-tested rule has no predictive power = arithmetic mean return of the rule is zero
Ha: back-tested rule has predictive power = arithmetic mean return of the rule is positive
This would give the same results as the method described above.
-Jez
Has anyone tried Aronson’s detrending on a portfolio? It is reasonably straight forward when dealing with a single symbol. But, what about
when dealing with trades from multiple symbols? Must we detrend the history of each symbol individually for the calculations of those trades,
or is there some way to amalgamate the data streams? Individual streams would likely have too few trades to really be of any value.
P.S. The mean of your example 50% followed by -40% is 5%, not the 10% stated in your example.
Hi Mike – oops, thanks for letting me know about that mistake. Fixed now.
Re: the detrending question, I am not convinced by the concept of detrending (I discuss it here) and therefore have not researched it too
much. Intuitively I would think you need to keep track of trades and daily drift for each individual symbol and make individual adjustments.
Jez,
Did you try the way that you proposed (quoted below) and were 95% of them positive? I’m curious to see how that worked out.
“I suppose you could also see/run this test in a different way:
generate a sample distribution made up of (x) observations (instead of x – mean) and check how many observations are positive. If the number
is high enough (ie 95%), the positive result would be deemed statistically significant.”
Also, until you stated the null I didn’t realize that you were just testing for Ho = 0. That was really what my original question was about.
Jez,
If you have the time would you mind trying the other option. I’d really like to see if it works.
Also, in this instance I don’t think that this was a particularly useful test. My understanding is that you had 2 strategies already optimized in this
test and then you did a simple hypothesis test for 1 mean. You’d expect to reject the null in this case.
The Bootstrap Method for Hypothesis Testing: Can it tell us anything we do not know? | Price Action Lab Blog // Nov 29, 2010 at 7:58 am
The total return from +50% followed by -40% is -10%. Taking the square root (for the geometric average), the average per-period return is -5.1%.
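The arithmetic vs. geometric distinction in this exchange can be verified in a few lines (a quick sketch of the +50%/-40% example):

```python
# Two period returns: +50% then -40%
r1, r2 = 0.50, -0.40

arithmetic_mean = (r1 + r2) / 2            # 0.05: a misleading +5% per period
total_growth = (1 + r1) * (1 + r2)         # 1.5 * 0.6 ≈ 0.90, i.e. -10% overall
geometric_mean = total_growth ** 0.5 - 1   # sqrt(0.9) - 1 ≈ -0.0513 per period
```

A positive arithmetic mean coexists here with a series that actually loses money when compounded: the four-legged-animal point from the post.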
Thanks for mentioning the issue on the rss reader. I’ll try to look into fixing it.
Jez,
Lou is right. There is a logical fallacy in the RC test as suggested by the author.
This is because the resampled distribution is built from the “detrended” returns (after subtracting the mean), but we read the p-value as the fraction of observations from the resamplings that lie above the “initial” mean return. It’s not correct to compare these two, because the initial mean return is not detrended.
Jez
First of all, thanks for a great blog.
Two basic (but important to me, at least) questions on bootstrapping as applied to estimating the distribution of equity CAGR and MaxDD of a
series of back-tested trades P&Ls:
1. Re-sampling with replacement vs. without replacement: I have seen plausible arguments for either approach. Do you have a view?
2. How important is it to resample without destroying (totally or even partially) the original trade P&L series’ degree of randomness (or lack thereof)?
Hope these issues are relevant to the other readers as well.
ZigZag
Great blog, but I have a question about the random re-sampling. Would the random re-sampling totally destroy the correlation structure of the original sample?
Jez Liberty // Sep 17, 2012 at 10:38 am
Yes, it most likely would, but I do not think it is an issue with this approach/usage, as we are not recreating trading signals off that re-sampled data or even calculating time-sensitive stats such as MaxDD, but only using one composite return stat.
This is a great blog. I wonder why anyone has paid attention to the stuff that guy Aronson wrote. He repeats the same things over and over again in his book as if he is talking to elementary school kids or, more seriously, as if he is trying to understand it himself. As some people have noted above, his tests are seriously flawed. Why would anyone want to check: “H0: back-tested rule has no predictive power = arithmetic mean return of the rule is zero”?
In practice this will happen if there is no commission and slippage. But in reality these two can result in serious performance degradation. YOU
DO NOT WANT to test the hypothesis before applying cost to your trading. This is preposterous indeed. Then, why bootstrapping at all?
Bootstrapping returns WILL NOT subject your system to stress from new market conditions, which is the real issue here. Again, the whole
book Aronson wrote promoted some idiosyncratic methods of hypothesis testing that have virtually little to do with real world system trading.
Hello everyone,
1. Does it make sense to subtract the mean and center the distribution at zero? Why not just count the percentage of mean returns above the target return and calculate the p-value accordingly?
2. BTW, how is the sample size determined in re-sampling? Does this have any effect? Or is the trade return order just reshuffled?
Best.