Column Understanding The Kelly Criterion
Column Understanding The Kelly Criterion
Column 23
by Edward O. Thorp
Copyright 2008
My 1962 book Beat the Dealer explained the detailed theory and practice.
Beat the Dealer I called this, naturally enough, “The Kelly gambling system,”
since I learned about it from the 1956 paper by John L. Kelly. (Claude
investing. Since 1966 I’ve called it “the Kelly criterion” in my articles. The
rising tide of theory about and practical use of the Kelly Criterion by several
readable book about the Kelly Criterion, Fortune’s Formula. (As this title
came from that of my 1961 talk, I was asked to approve the use of the title.)
reported that “everyone” said they were using the Kelly Criterion.
maximizing the expected value of the logarithm of wealth. But the details
6/25/2010 1
can be mathematically subtle. Since they’re not covered in Poundstone
(2005) you may wish to refer to my article “The Kelly Criterion in Blackjack,
Sports Betting and the Stock Market,” Handbook of Asset and Liability
Hedge fund manager Mohnish Pabrai, in his new book The Dhandho
Investor, gives examples of the use of the Kelly Criterion for investment
situations. (Pabrai won the bidding for this year’s lunch with Warren Buffett,
case scenarios and payoffs over the next 24 months which I summarize in
Table 1.
Probability Return
p1 = 0.80 R1 > 100%
p2 = 0.19 R2 > 0%
p3 = 0.01 R3 = −100%
______________________________
Sum=1.00
net worth is
3
(1) g ( f ) = ∑ pi ln ( 1 + Ri f )
i =1
6/25/2010 2
where ln means the logarithm to the base e. When we use Table 1 to
replace the pi by their values and the Ri by their lower bounds this gives the
noted by Pabrai. Not having heard of the Kelly Criterion back in 2000, Pabrai
only bet 10% of his fund on Stewart. Would he have bet more, or less, if he
had then known about Kelly’s Criterion? Would I have? Not necessarily.
probability of total loss, which Kelly always avoids. So f * < 0.50. The same
the other investments currently in the portfolio, any candidates for new
investments, and their (joint) properties, in order to find the Kelly optimal
fraction for each new investment, along with possible revisions for existing
investments.
considerable evidence that Buffet thinks like a Kelly investor, citing Buffett
bets of 25% to 40% of his net worth on single situations. Since f * < 1 is
6/25/2010 3
necessary to avoid total loss, Buffett must be betting more than .25 to .40 of
http://undergroundvalue.blogspot.com/2008/02/notes-from-buffett-meeting-
Emory:
in that discussion, but was curious if you could tell us more about
Buffett:
put money in your 20th choice rather than your 1st choice. “LeBron
James” analogy. If you have LeBron James on your team, don’t take
him out of the game just to make room for some else.
running 50, 100, 200 million, I would have 80% in 5 positions, with
heavier into, up to 40%. I told investors they could pull their money
out. None did. The position was American Express after the Salad
Oil Scandal. In 1951 I put the bulk of my net worth into GEICO.
6/25/2010 4
Treasury bonds, I would have been willing to put 75% of my
portfolio into it. There were various times I would have gone up to
75%, even in the past few years. If it’s your game and you really
Bill Ziemba’s new book, Scenarios for Risk Management and Global
Investment Strategies, that Buffett thinks like a Kelly investor when choosing
length.
investments is one of the most common oversights I’ve seen in the use of the
(2) Risk tolerance. As discussed at length in Thorp (2006, op. cit.), “full
Kelly” is too risky for the tastes of many, perhaps most, investors and using
much more to their liking. Full Kelly is characterized by drawdowns which are
(3) The “true” scenario is worse than the supposedly conservative lower
discussed in Thorp (2006, op. cit.), we get more risk and less return, a
against this.
(4) Black swans. As fellow Wilmott columnist Nassim Nicholas Taleb has
pointed out so eloquently in his new bestseller The Black Swan, humans tend
6/25/2010 5
not to appreciate the effect of relatively infrequent unexpected high impact
events. Failing to allow for these “black swans,” scenarios often don’t
(5) The “long run.” The Kelly Criterion’s superior properties are
instance:
known quants so I’ll take time here to explore some of its subtleties.
from Kelly on at least one of the first N trials but copies it thereafter.
6/25/2010 6
hence ahead for all n ≥ N . For example consider the sequence of the
more than Kelly for each n ≤ N where g n ≠ f ∗, hence exceeds Kelly for
all n ≥ N .
years ago in the newsletter Blackjack Forum when a well known anti
which differed from Kelly at every trial but would (with probability as
Kelly and stay ahead forever. When I read the challenge I immediately
( )
n
n, such that P Vn* < Vn for all n ≥ N > 1 − ε where V = Π ( 1 + f Χ ) and
n i i
i =1
( )
n
Vn* = Π 1 + f * Χi . Furthermore there is a b > 1 such that
i =1
( ) ( )
P Vn / Vn* ≥ b, n ≥ N > 1 − ε and P Vn − Vn* → ∞ > 1 − ε . That is, for some
6/25/2010 7
N there is a non Kelly sequence that beats Kelly “infinitely badly” with
PROOF. The proof has two parts. First we want to establish the
ahead.
some a > 0, b > 1. For instance VN − VN ≥ c > 0 since there are only a
*
the first N trials such that VN > VN . Setting c / 2 = a > 0 and d + 1 = b > 1
*
partition our capital into two parts: a and bVN . For n > N we bet
*
6/25/2010 8
total which is generally unequal to f * of our capital. If by chance for
to a / 3n for that n. The portion bVN will become bVN for n > N and the
* *
portion a will never be exhausted so we have Vn > bVn for all n > N .
*
*
( ) ( *
)
Hence, since P Vn → ∞ = 1, we have P Vn / Vn ≥ b = 1 from which it
(
follows that P Vn − Vn → ∞ = 1.
*
)
To prove the first part, we show how to get ahead of Kelly with
betting less than Kelly by a very small amount. If the first outcome is a
loss, then we have more than Kelly and use the strategy from the proof
of the second part to stay ahead. If the first outcome is a win, we’re
behind Kelly and now underbet on the second trial by enough so that a
loss on the second trial will put us ahead of Kelly. We continue this
strategy until either there is a loss and we are ahead of Kelly or until
even betting 0 is not enough to surpass Kelly after a loss. Given any
p N , 12 < p < 1. Hence, given ε > 0, we can choose N such that p N < ε
probability 1 − p N > 1 − ε .
6/25/2010 9
More precisely: suppose the first n trials are wins and we have
Vn
=
( ) (
1 + f * − a1 K 1 + f * − an )
Vn* ( ) (
1+ f * K 1+ f * )
a an
= 1 − 1 * K 1 − *
> ( 1 − a1 ) K ( 1 − an ) > 1 − ( a1 + K + an )
1+ f 1+ f
=
(
Vn +1 Vn 1 − f + b
*
)
> ( 1 − a ) 1 +
b
> ( 1 − a ) ( 1 + b ) ≥ 1 or b ≥
a
*
*
Vn +1 Vn 1 − f
* *
( ) 1− f 1− a
(
f ( x ) − a1 x = 2 xf ( x ) 1 + x + x 2 + K )
= 2 xf ( x ) / ( 1 − x )
6/25/2010 10
∞
whose solution is f ( x ) = a1 x + 2
∑3
n=2
n −2
x n from which an = 2a1 3n −2 if
procedure.
So far we’ve seen that all sequences which differ from Kelly for
infinitely often (even always), are not essentially different. How can
Going to a more general setting than coin tossing, assume now for
applications one commonly assumes that the f i are constants that are
6/25/2010 11
For a more general case, including the Leib example and many of the
( )
n n
for all i. As before, Vn = Π ( 1 + f i X i ) and Vn* = Π 1 + f i * X i from which
i =1 i =1
n n
( )
ln Vn = ∑ ln ( 1 + f i X i ) and ln Vn* = ∑ ln 1 + fi * X i . Note from the definition
i =1 i =1
( )
f * that E ln 1 + fi X i ≥ E ln ( 1 + fi X i ) , where E denotes the expected
*
{ ( }
n n
( ) )
E ln Vn* / Vn = E ∑ ln 1 + f i * X i − ln ( 1 + f i X i ) = ∑ ai
i =1 i =1
∑a
i =1
i tends to infinity as n increases. Otherwise, { fi } is not essentially
different from { f }.
i
*
The basic idea here can be applied to more
general settings.
6/25/2010 12
(6) Given a large fixed goal, e.g. to multiply your capital by 100, or 1000,
the expected time for the Kelly investor to get there tends to be least.
from 1956 to mid 2007, Warren Buffett has increased his wealth to about
had about $2.5x107 in 1969 so his multiple over the last 38 years is about
2x103. Even my own efforts, as a late starter on a much smaller scale, have
multiplied capital by more than 2x104 over the 41 years from 1967 to early
2007. I know many investors and hedge fund managers who have achieved
such multiples.
The caveat here is that an investor or bettor many not choose to make,
or be able to make, enough Kelly bets for the probability to be “high enough”
explore for which investors Kelly or fractional Kelly may be a more or less
6/25/2010 13