Stochastic Processes For Finance
Stochastic Processes For Finance
Contents
1 Pricing Derivatives
1.1 Note on Continuous Compounding . . . . . . . . . . . . . . . . .
1.2 Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.3 Hedging a Forward . . . . . . . . . . . . . . . . . . . . . . . . . .
2 Binomial Model
2.1 One Period Model
2.2 Two Period Model
2.3 Portfolios . . . . .
2.4 N Period Model .
1
1
2
3
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
6
6
8
9
10
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
13
13
14
15
16
16
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
ii
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
24
24
24
25
26
26
26
28
28
29
29
30
31
32
34
6 Black-Scholes Model
6.1 Portfolios . . . . . . . . . . . . . . .
6.2 The Fair Price of a Derivative . . . .
6.3 European Options . . . . . . . . . .
6.4 The Black-Scholes PDE and Hedging
6.5 The Greeks . . . . . . . . . . . . . .
6.6 General Claims . . . . . . . . . . . .
6.7 Exchange Rate Derivatives . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
38
38
39
41
42
44
44
45
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
46
47
47
48
49
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
51
51
52
56
58
60
9 Risk Measurement
9.1 Value-At-Risk . . . . . . . . . . . .
9.2 Normal Returns . . . . . . . . . . .
9.3 Equity Portfolios . . . . . . . . . .
9.4 Portfolios with Stock Options . . .
9.5 Bond Portfolios . . . . . . . . . . .
9.6 Portfolios of Bonds and Swaptions
9.7 Diversified Portfolios . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
62
62
63
65
66
67
69
70
iii
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Preface
These are lecture notes for the course Stochastic Processes for Finance. Parts
marked by * are either hard or regarded to be of secondary importance.
Knowledge of measure theory is not assumed, but some basic measure theoretic notions are required and therefore provided in the notes. The notes are
relatively succinct, a significant part of theory (e.g., proofs of certain assertions) is in fact contained in the exercises. Below is some relevant literature. It
is recommended that you read these books, next to the lecture notes.
1. Baxter, M. and Rennie, A., (1996). Financial calculus. Cambridge University Press.
2. Bj
ork, T., (2009). Arbitrage Theory in Continuous Time. Oxford Finance
Series.
3. Etheridge, A., (2002). A Course in Financial Calculus. Cambridge University Press.
4. Shreve, S.E., (2004). Stochastic Calculus for Finance I. The Binomial
Asset Pricing model. Springer.
5. Shreve, S.E., (2004). Stochastic Calculus for Finance II. Continuous-Time
Models. Springer.
iv
Chapter 1
Pricing Derivatives
A derivative, or contingent claim is a financial contract that promises a certain
payment to the owner, or delivery, depending on the value of some underlying
asset. Examples of the latter include stocks, bonds, interest rates, exchange
rates and commodities. Many different types of derivatives exists and they are
widely traded, for a variety of reasons. Some types are traded on exchanges
(basic call and put options on stocks for instance, see below), others are more
specialized (e.g. derivatives on interest rates) and are traded directly between
the parties involves, typically large financial institutions. Apart from giving
investment opportunities, derivatives are extremely useful for risk management:
financial vulnerability can be reduced by fixing a price for a future transaction.
In this chapter we introduce some basic concepts through examples; formal
definitions and theory follow in later chapters.
1.1
Our model for a (fixed interest) savings account is that a capital of size R0 placed
in such an account at some time t increases to the amount R0 erT at time t+t.
The capital in the account remains ours without restriction: we can withdraw it
at no cost at any time. The constant r is the continuously compounded interest
rate and is not quite an ordinary rate for a savings account, which is more
often a yearly or monthly rate. If interest is added to the account at the end
of a time period of one unit, then this would increase the capital from R0 to
R1 = (1 + r1 )R0 , the interest being r1 R0 , and r1 being the rate per time unit.
If instead we would obtain the interest in two installments, the first after half
a time unit, and the second after one time unit, then the initial capital would
first increase to R1/2 = (1 + r2 )R0 and next to R1 = (1 + r2 )R1/2 = (1 + r2 )2 R0 .
The second time we receive interest on interest. The rate r2 would be the
rate per half time unit, and hence 2r2 should be compared to r1 . However,
the comparison would not be exact, because (1 + r1 /2)2 > 1 + r1 , i.e., a more
frequent compounding rule used in this way yields a greater return. It would be
logical that r1 and r2 relate through the equation (1 + r2 )2 = 1 + r1 , apart from
possibly a correction for the benefit of early payment in the second scheme. The
interest on interest is making the difference.
We could continue this thought experiment and break the time unit in n
equal parts. The reasonable rate rn per (1/n)th time unit would satisfy (1 +
rn )n = 1 + r1 , or
nrn n
1+
= 1 + r1 .
n
Here nrn is the rate per time unit. Taking the limit as n , and assuming
that nrn tends to a limit r we obtain the equation er = 1 + r1 . This r is the
rate of interest per time unit earned in a savings account in which the interest
is added continuously in time. Using the continuous rate r is convenient, as
exponentials multiply.
From now on, if we say that there is a constant continuously compounded
interest rate r, this means that 1 euro in the bank at time t grows to ert euros
at time t. In the discrete setting, the discrete model of a savings account (where
a capital R0 at time zero grows to the capital R1 = (1 + r1 )R0 at time 1 for
some chosen fixed time unit and some interest rate r1 ) can be reduced to the
model with a constant continuously compounded interest rate r by relating r1
and r via the equation er = 1 + r1 .
1.2
Derivatives
exchange hand. In other words, the payoff of the call is given by (ST K)+ ,
where x+ = x if x 0 and zero otherwise. The same reasoning shows that the
payoff of a put with strike K and maturity T is given by (K ST )+ .
Calls and puts can be used for risk management purposes. The owner of a
certain asset can for instance also buy an appropriate put option on the asset
the insure himself against a too large decrease of the price in a certain period
of time. However, calls and puts are also widely used by speculators, who use
them to gamble on the increase or decrease of asset prices.
The derivatives considered so far are all of European type, with a payoff
of the form C(ST ), a fixed function of the asset value ST at maturity. There
also exist different types of derivatives. First of all, there exist for instance call
and put options that can not only be exercised at maturity, but which give the
right to buy or sell and asset for a fixed price, at some time before maturity, to
be determined by the buyer. Such contracts are called American call and put
options.
Yet another type of options are options whose payoff depends on the whole
history of the asset price up to the time of maturity. Such derivatives are called
path-dependent, because their payoff depends on the whole path (St : t [0, T ]).
The payoff of an Asian option depends on the average of the asset price over a
certain time interval. For instance, an Asian call option on an asset with value
St at time t, with strike price K and maturity T , is a derivative which pays
RT
( T1 0 St dt K)+ to the holder at the time of maturity T .
There exists all kinds of other path dependent options, for instance barrier
options, whose payoff at maturity T depends on whether or not the asset price
has reached certain fixed levels before time T , or lookback options, whose payoff
depends on the minimum or maximum asset price before maturity.
1.3
Hedging a Forward
The owner of a derivative receives some uncertain payoff in the future, depending
on the future development of the value of the underlying asset. Since this might
result in financial gain for one of the parties in the contract, basic economic
reasoning says that this must come at a price. One of the key questions is: how
to determine the fair price of a derivative?
In fact, it is not at all immediately clear what we mean by the fair price
of a derivative. Consider for instance a forward on a stock with strike K and
maturity T , with payoff ST K for the buyer. What should the buyer of the
forward be willing to pay for entering into this contract?
Consider a market that consists of only one risky asset (the stock) and one
riskless asset (the savings account, or bond ). Suppose that we have, besides
buying the contract, two other options to invest our money:
(i) We can put our money in a savings account against a fixed, predetermined
interest rate r. One unit of money placed in the account grows to ert units
during a time interval [0, t] and is freely available. A negative balance in
our account is permitted, thus allowing us to borrow money at the same
interest rate r. If we borrow one unit at time 0, than we owe ert units at
time t, which is equivalent to having a capital of ert units.
(ii) We can invest in the stock. The stock price St at time t is a stochastic
variable, dependent on t. It may be assumed that we know the probability
3
provided of course that the integral is finite. If the interest rate is zero, then
the buyer should perhaps be willing to pay this amount of money at time 0 for
the forward contract. First, this does not really solve the pricing problem yet,
because in practice we would need to make some concrete choice for the density
p, perhaps on the basis of statistical analysis of stock price data. Much more
importantly however is that, as we argue below, pricing a forward in this way
leads to economic unreasonable and hence undesirable prices.
The key observation is that, by following a very simple strategy, it is possible
to replicate the payoff of the forward. Such strategies are called hedging. We
suppose that the interest rate is 0 for simplicity. At time 0 we borrow K euros
from the bank. We throw in another S0 K euros of our own money (with S0
the value of the stock at time 0) and use the resulting S0 euros to buy one stock.
Then we simply wait until time T . At that time, we pay back the K euros to
the bank and we sell the stock. The net effect of this strategy is easy to see: at
time 0 it costs us S0 K euros to be able to carry out the strategy. At time T
we end up with ST K euros in our hand.
In other words, the payoff that we get by following this simple trading strategy is exactly equal to the payoff of the forward. Basic economic reasoning then
implies that the cost of entering the forward at time 0 should also exactly equal
the cost of carrying out the trading strategy, i.e., the buyer of the forward should
pay S0 K euros to the seller (still under the assumption that the interest rate
is 0). Indeed, suppose the seller accepts a price of P < S0 K euros. Then the
buyer can apply the following strategy:
(i) At time 0, borrow P euros from the bank and enter into a the forward
contract. Also, borrow a stock from someone and sell it for S0 euros.
(ii) At time T , borrow K more euros from the bank and use the money to
buy one stock, as required by the forward contract. Return the stock to
the person it was borrowed from. Of the S0 euros that were obtained at
time 0, use K + P euros to settle the debt with the bank.
Observe that this strategy does not cost any money at time 0. At maturity T ,
it leads to a net payoff of S0 K P , which, by assumption, is strictly positive
in this case. If on the other hand the buyer accepts to pay a price P > S0 K
for the forward, the seller of the contract can follow a similar strategy in order
to earn some riskless money; see Exercise 1.1 below.
A trading strategy of this type, which costs no money at time 0, does not involve any injections or withdrawals of money between times 0 and T , and leads
to a nonnegative payoff at time T which is strictly positive with positive probability (in this case with probability 1 in fact), is called an arbitrage opportunity.
It is economically reasonable to assume the absence of arbitrage opportunities
4
in financial markets. The idea is that if they would exist, then immediately
there would be a huge demand for following the strategy, the price would then
immediately rise, and as a consequence the opportunity would disappear. We
simply have to set the value of the forward at time 0 to F = S0 K (for r = 0),
since any other value would introduce arbitrage opportunities in the market.
The forward example is somewhat special in the sense that the correct forward price F = S0 K does not depend on the distribution of ST . In fact,
it depends on the asset price only through its (observable) value at time 0. Is
there no role for probability theory in evaluating financial contracts?
There is. First we note that the expected gain of owning a contract is equal
to E(ST K), which does depend on the distribution of the asset price. For
2
example, E(ST K) = S0 eT + T /2 K if ST /S0 is log-normally distributed
with parameters T and 2 T , i.e., log(ST /S0 ) N (T, 2 T ).
Second, it turns out that the correct solution can be found from computing
an expectation, with respect to a special probability measure called a martingale
measure. To evaluate the price of a forward, this route would be overdone, as the
preceding hedging strategy is explicit and simple. However, the prices of other
contracts may not be so easy to evaluate, and it will be necessary to assume
some reasonable probabilistic models on the asset price.
The pricing and hedging of derivatives are important tasks in the financial
world. In the subsequent chapters we will treat two classical models that provide
a framework for carrying out these tasks. The first one is the binomial model,
which is a discrete-time tree model for asset prices. The second one is the
famous Black-Scholes model, which is a continuous-time model.
EXERCISE 1.1. Suppose the interest rate is zero. Show that if the buyer
accepts a price P > S0 K for the forward, there exists an arbitrage opportunity
for the seller. Give the explicit construction of the strategy.
EXERCISE 1.2. Suppose that there is a constant continuously compounded
interest rate r. Determine the fair price of the forward in this case.
EXERCISE 1.3. Show that if the price of a forward contract (with maturity
T ) is equal to 0, the fair strike price must be K = S0 erT .
EXERCISE 1.4. Consider European call and put options on a stock, with the
same maturity T and the same strike K. Let C0 , P0 and S0 be, respectively, the
price of the call, put and stock at time 0. Give an arbitrage argument for the
so-called put-call parity C0 P0 = S0 KerT .
2
EXERCISE 1.5. If log(ST /S0 ) N (T, 2 T ), show that EST = S0 eT + T /2 .
2
Hint: first establish EeZt = et /2 .
Chapter 2
Binomial Model
A main objective of financial engineering is to find a fair price of a derivative,
where by fair we mean a price acceptable both for the buyer and the seller.
Following the work by Black and Scholes in the 1970s the prices of derivatives are
found through the principle of no arbitrage introduced in the previous chapter.
In this chapter we introduce the notion of portfolio and briefly discuss the
pricing of derivatives in the so called Binomial Model, a finite security market
where trading takes place at discrete time instants t {0, 1, . . . , N }. Traded are:
one risky asset, a stock with price St at time t; and one riskless asset, a savings
account (or bond) with interest rate r 0 (theory works for r > 1). We
assume that an amount of one unit of money deposited in the savings account
at time 0 grows to a guaranteed amount of ert units at time t.
The goal is to determine the fair price of a composite contingent claim that
pays C(S0 , . . . , SN ) at maturity N . Sometimes we use terms claim and contract
for the payoff function C = C(S0 , . . . , SN ).
2.1
P(S1 = us0 ) = p,
P(S1 = ds0 ) = 1 p.
Here u (for up) and d (for down) are two known constants, with u > d, and
p is a number in (0, 1) that may be unknown; cases p = 0, p = 1 are trivial.
The initial date is t = 0 and the terminal date t = 1, with trading possible
only at these two dates. We want to find the fair price at time 0 of a contract
that pays the amount C = C(S1 ) (called the payoff ) at time 1.
Example 2.2. A European call option corresponds to C = (S1 K)+ , for a
given strike price K. The payment on a forward contract is equal to C = S1 K.
Suppose that at time 0 we buy assets and put an amount of money
units in the savings account, , R. A negative value of corresponds to
borrowing money from the bank and a negative value of 0 to short selling of
the stock. Then we have a portfolio (, ) whose value at time 0 is given by
V0 = s0 + 1.
(2.3)
C(us0 ) C(ds0 )
,
s0 (u d)
uC(ds0 ) dC(us0 )
.
er (u d)
V0 = er qC(us0 ) + (1 q)C(ds0 ) ,
where
q=
er d
.
ud
This is the fair price of the contract at time 0. Interestingly, the original probability p in (2.1) turns out to be unimportant for the fair price determination.
An essential feature of a financial market is that it is free of arbitrage, meaning that there is no riskless way of making money. Formally, an arbitrage opportunity is a trading strategy such that P(V0 = 0) = 1, P(V1 0) = 1 and
P(V1 > 0) > 0. A market is arbitrage free if no arbitrage opportunities exist.
Proposition 2.7. The one-period binomial market is free of arbitrage if and
only if d < er < u.
EXERCISE 2.8. Prove Proposition 2.7.
For the rest of the chapter, assume d < er < u. Then the number q from
(2.6) is contained in the interval (0, 1) and can be considered as an alternative probability for the upward move of the stock price S1 in (2.1). This new
probability distribution Q of the stock price (given by (2.1) with p replaced by
7
where the subscript q in Eq means that the expectation is computed under the
measure Q. It can be seen that q is the unique probability such that
Eq (er S1 ) = s0 .
We can write the above equation also in the form Eq (er S1 |S0 ) = S0 (with
S0 the random variable that is equal to the constant s0 with probability one).
In more sophisticated terms, we say that the discounted asset price process
S0 , er S1 is a martingale under Q.
Example 2.10 (Forward). The value at time 0 of a forward is Eq er (S1 K) =
Eq er S1 er K = S0 er K; cf. Exercise 1.2. The strike price that makes
this value equal to zero is K = er S0 ; cf. Exercise 1.3.
In all the exercise below consider the one period model as described above.
EXERCISE 2.11. Let r = 0, s0 = 100, u = 1.2 and d = 0.8. Determine the
price of a European call option which has strike price K = 90. Compute the
corresponding replicating portfolio.
EXERCISE 2.12. Let r = 0, s0 = 100, u = 1.2 and d = 0.8. Determine
the price of a European put option which has strike price 100. Compute the
corresponding replicating portfolio.
EXERCISE 2.13. Prove the put-call parity stated in Exercise 1.4 by using the
martingale measure.
2.2
Suppose that at time 0 we have the same possibilities for investing as in the
preceding section, but we now consider a full trading horizon of three times:
0, 1, 2. Suppose we have a contract (contingent claim) that pays the amount
C = C(S2 ) at time 2, and we wish to find its fair price at time 0. General case
C = C(S0 , S1 , S2 ) can be considered as well (Exercise 2.15).
The prices of the stock at the three time instants are S0 , S1 , S2 , where we
assume that S0 = s0 is fixed, S1 is equal to either dS0 or uS0 , and S2 is equal
to either dS1 or uS1 . Thus the stock prices follow a path in a binary tree. We
assume that at each node of the tree the decision to move up or down is made
with probabilities p > 0 and 1 p, independently for the different nodes.
Again we may deposit money in a savings account (also a negative amount,
indicating that we borrow money) at a fixed interest rate r. One unit in the
savings account grows to er units at time 1, and to e2r units at time 2.
We can evaluate the claim recursively, backwards in time. Let Vn denote
the value of the contract at time n = 0, 1, 2. Clearly, at time n = 2 the claim
is worth V2 = V2 (S2 ) = C(S2 ). At n = 1, given S1 = s1 , S2 can be either us1
or ds1 , leading to two possible payoffs: C(us1 ) or C(ds1 ). Thus, we are in the
one period model, and by using (2.6) we can evaluate the value of the claim at
time n = 1 as a function of the stock price S1 at time n = 1:
(
er qC(u2 s0 ) + (1 q)C(dus0 ) , if S1 = us0 ,
V1 = V1 (S1 ) =
er qC(uds0 ) + (1 q)C(d2 s0 ) , if S1 = ds0 .
We should see V1 as the value of the contract at time 1 (or the fair price of the
contract at time 1). Now we treat the two possible values of V1 as the payoff on
our contract at time 1 in the one period model. Hence, by using (2.6) and the
above expression for V1 , the value of the contract at time 0 is
V0 = er qV1 (us0 ) + (1 q)V1 (ds0 ) = e2r q qC(u2 s0 ) + (1 q)C(dus0 )
+ (1 q) qC(uds0 ) + (1 q)C(d2 s0 )
= e2r q 2 C(u2 s0 ) + 2q(1 q)C(uds0 ) + (1 q)2 C(d2 s0 )
= Eq e2r C(S2 ) .
(2.14)
In this way, the fair price of the contract once again turns out to be the expectation of the discounted payoff e2r C(S2 ) under the martingale measure Q.
By using the one period model, we obtain that Eq (er S1 |S0 ) = S0 and
Eq (er S2 |S1 , S0 ) = S1 (or Eq (e2r S2 |S1 , S0 ) = er S1 ). This means that the
process (ern Sn , n = 0, 1, 2) is a martingale. We shall elaborate on this later.
EXERCISE 2.15. Generalize the above approach to a contract of the form
C = C(S0 , S1 , S2 ) and apply this to determine the price of an Asian call option
+
which has payoff (S0 + S1 + S2 )/3 100 , in a a two period model with r = 0,
s0 = 100, u = 1.2, d = 0.8. Determine also the hedging portfolio.
To summarize, in the two period model we treated three one period submodels by backward recursion in time. First we determined the values of the
contract at time 1, V1 (us0 ) and V1 (ds0 ) (depending on the realized value of S1 ),
then the value of the contract V0 at time 0. In each one period submodel, we
used the formula (2.6), but there are also the three underlying hedging portfolios defined by (2.5). Denote them by (2 (us0 ), 2 (us0 )), (2 (ds0 ), 2 (ds0 )) and
(1 , 1 ), respectively. Notice that the hedging portfolio in the two period model
consists of two couples (n , n ), n = 1, 2, where (2 , 2 ) is actually a function
of the stock price S1 at time 1. The portfolio is thus dynamic in time and at
each time moment it is based on the available (by that time) information such
as the stock prices.
EXERCISE 2.16. Show that V1 = 1 S1 + 1 er = 2 S1 + 2 er .
2.3
Portfolios
Vn = n Sn + n enr ,
n = 1, . . . , N.
The idea is to use the results for the one period model recursively to solve
the pricing and replication problem for the N period model (as we did this for
the 2 period model). As we demonstrated in the 2 period model, to be able to
realize this idea, any reallocation (after an initial investment) of the portfolio in
time should be made without infusion or withdrawal of money, i.e., in a budget
neutral way. Precisely, a portfolio is called self-financing if
(2.17)
n = 1, . . . , N 1.
2.4
N Period Model
11
N =
N
To get the value VN 1 of the hedging portfolio at time N 1, we use the selffinancing property: VN 1 = N 1 SN 1 +N 1 e(N 1)r = N SN 1 +N e(N 1)r .
r
d
For q = eud
, inserting the expressions for N and N results in (check this!)
VN 1 = er qVN (S0 , . . . , SN 1 , SN 1 u) + (1 q)VN (S0 , . . . , SN 1 , SN 1 d) .
Iterating these arguments, for n = N 1, . . . , 0, backwards in time, we obtain
Vn = er qVn+1 (S0 , . . . , Sn , Sn u) + (1 q)Vn+1 (S0 , . . . , Sn , Sn d) ,
(2.20)
N
Y
Zm .
m=n+1
In particular, V0 = eN r Eq C(S0 , . . . , SN ).
Notice the only difference in formulas for the portfolio in multi-period and
one period cases: factor er in (2.5) becomes enr in (2.20). One can formally
have the same formulas if we change the definition on the portfolio and the
associated value process: V0 = 1 S0 + 1 , Vn = n Sn + n er n = 1, . . . N . The
property of self-financing is then Vn = n+1 Sn + n+1 , n = 0, . . . , N 1. The
new quantity n = n e(n1)r has then the interpretation of the total amount
of money at a savings account just before time n.
12
Chapter 3
This (X) is called the -field generated by X. Notice that (X) F. For a
set A, the indicator function is 1A (x) = 1 if x A and 1A (x) = 0 if x 6 A.
3.1
Stochastic Processes
13
is given by a path in the binomial tree, and the probability of a sample path
is the product of the probabilities of the edges along the path. This gives an
intuitively clear description of the process, but for later use it is instructive to
define the stochastic process also formally as a map on a given sample space .
Take to be the set of all N -tuples = (1 , . . . , N ), where each i {0, 1}.
The stochastic process can be formally defined by setting S0 = s0 and
Sn (1 , . . . , N ) = s0 u
Pn
i=1
i n
Pn
i=1
n = 1, . . . , N.
3.2
Conditional Expectation
For a discrete random variable (or vector) X and a discrete random vector Y ,
the conditional expectation of X given the event Y = y is given by
X
E(X|Y = y) =
xP(X = x|Y = y).
x
14
3.3
Filtration
n = 0, 1, . . . .
In a way, Fn contains all the information available by observing the process till
time n.
We say that a process (Xn )n0 is adapted to the filtration (Fn )n0 if (Xn )
Fn for every n. Thus the events connected to an adapted process up to time n
are known at time n. The natural filtration corresponding to a process is the
smallest filtration to which it is adapted. If the process (Yn )n0 is adapted to the
natural filtration of a stochastic process (Xn )n0 , then Yn = fn (X0 , X1 , . . . , Xn )
for some function fn , for each n.
We say that a process (Yn )n0 is predictable relative to the filtration (Fn )n0
if (Yn ) Fn1 for each n. Thus the events connected to a predictable process
are known one time instant before they happen. If Fn = (X0 , . . . , Xn ), then
this is equivalent to Yn being a function Yn = fn (X0 , . . . , Xn1 ) of the history
of the process (Xn )n0 up to time n 1, for some fn .
If FY is the -field generated by Y , then we define E(X|FY ) = E(X|Y ). In a
way, E(X|F) can be interpreted as the expected value of X given the information
F. The trivial -field {, } is the -field containing no information.
Lemma 3.4.
2 A rigorous mathematical definition of E(X|Y ) is as follows: first define E(X|A) for a
-field A F as an A-measurable function that satisfies E[E(X|A)1A ] = E[X1A ] for each
A A; next define E(X|Y ) = E(X|(Y )), where (Y ) is the -field generated by Y .
15
3.4
Martingales
i=1 i
2
EXERCISE 3.10. In a branching process we start with N0 = 1 individuals
at time 0, and at each time n each individual has a random number (chosen
from a fixed distribution) of offspring independent of the other individuals. The
new generation consists of the offspring only. Thus, given there are Nn = k
individuals at time n, the number of individuals Nn+1 in the (n+1)th generation
Pk
(n)
(n)
(n)
is distributed as i=1 Xi for iid random variables X1 , . . . , Xk . Show that
(n)
(Nn )n0 is a martingale if and only if EXi = 1.
3.5
Change of Measure
16
q()
p()
1 q N PN
q PN
i=1 i
i=1 i
p
1p
EXERCISE 3.12. In Example 3.11, let p (0, 1), but q = 0 (or q = 1). Show
that the resulting measures P and Q are not equivalent.
EXERCISE 3.13. Show that P and Q are equivalent if they agree which events
have probability one.
17
Chapter 4
4.1
d
. But p (0, 1), hence d < er < u must hold. Finally, we conclude
p = q = eud
r
that if d < e < u, then (Sn )n0 is a martingale for p = q.
n N.
4.2
In Example 4.2 we actually established that, provided d < er < u, the discounted stock price process (Sn )n0 forms a martingale under the measure Q,
r
d
the one with the up-move probability q = eud
. This implies the following.
Proposition 4.4. In the binomial model with d < er < u, let (n , n ) be a
self-financing portfolio with the associated value process Vn , n 0. Then the
discounted value process (Vn )n0 = (enr Vn )n0 is a martingale under Q.
That is way Q is called the risk-neutral (and martingale) measure.
EXERCISE 4.5. Prove Proposition 4.4.
The condition d < er < u from Proposition 4.4 has already occurred in the
one period model to prevent arbitrage. The same reason applies here. Indeed,
19
if er d, then the returns on the asset are with certainty bigger than the return
on the savings account, whereas if er u, then the returns are with certainty
smaller. Then the riskless savings account is never or always preferable over the
risky asset, respectively, and a reasonable portfolio will consist of only one type
of investment. We make this precise below.
Recall that an arbitrage opportunity is a self-financing trading strategy with
associated value process (Vn )N
n=0 , such that P(V0 = 0) = 1, P(VN 0) = 1 and
P(VN > 0) > 0. A market is arbitrage free if no arbitrage opportunities exist.
Proposition 4.6. The multi-period binomial market is free of arbitrage under
any probability measure P that is equivalent to Q iff d < er < u.
EXERCISE 4.7. Prove Proposition 4.6 (hint: use Propositions 2.7 and 4.4).
EXERCISE 4.8. Show that any P with an up move probability p (0, 1) is
equivalent to Q with an up move probability q (0, 1).
In view of Proposition 4.6 and Exercise 4.8, we require that d < er < u.
Now we are ready to solve the problem of pricing a claim C = C(S0 , . . . , SN ),
with the expiry time N , by using the martingale technique.
EXERCISE 4.9. Derive the solution (2.18) of the pricing problem. More generally, show that the fair price Vn of the contract C(S0 , . . . SN ) at time n is
(4.10)
Vn = e(N n)r Eq C(S0 , . . . , SN )|Fn , n = 0, 1, . . . , N.
Hint: use Proposition 4.4.
Clearly, Vn in (4.10) is a function of S0 , . . . Sn . It can be computed as
follows. Introduce the iid random variables Zn = Sn /Sn1 , each taking only
the values:
Qnd and u, Q(Zn = u) = q, P(Zn = u) = p, n = 1, . . . , N . Clearly,
Sn = S0 k=1 Zk . Given the (observed) values S0 = s0 , . . . , Sn = sn , the fair
price Vn = Vn (s0 , . . . , sn ) of the contract C at time n {0, 1, . . . , N } is
(4.11)
N
Y
Vn (s0 , . . . , sn ) = e(N n)r Eq C s0 , . . . , sn , sn Zn+1 , . . . , sn
Zm ,
m=n+1
n = 0, 1, . . . N.
Vn = n Sn ,
n = 1, . . . , N.
n = Vn1 n Sn1 ,
n = 1, . . . , N.
21
The portfolio management can be implemented in practice. If we have sufficient funds to form the portfolio at time 0, then we never run into debt when
carrying out the hedging strategy. We interpret the value of the portfolio at
time 0, the amount of money needed to create the portfolio (1 , 1 ), as the fair
price of the claim at time 0. Since V is a (Doob) martingale under Q, this is
V0 = V0 = Eq (erN C|F0 ) = Eq (erN C).
Note that the formula expresses the price in the claim C without intervention
of the portfolio processes. Notice that we in passing solved the pricing problem
(2.18) (and also (4.10)) once again.
Example 4.16 (Forward). The claim of a forward with strike price K is C =
SN K. The value of the forward at time 0 is equal to V0 = Eq erN (SN K) =
Eq erN SN erN K. Since the process (ern Sn )n0 is a martingale under the
measure Q, V0 = Eq S0 erN K = S0 erN K. In Section 1.3 we obtained the
same result by describing an explicit hedging strategy.
Example 4.17 (European call option). The claim of a European call option
with strike price K is C = (SN K)+ . The fair price of the option at time 0
is V0 = Eq erN (SN K)+ . The variable SN is distributed as S0 uXN dN XN ,
where XN is the number of upward moves in the tree. Because XN Bin(N, q)
under the martingale measure Q, it follows that
V0 = erN
N
X
(S0 ux dN x K)+
x=0
N x
q (1 q)N x .
x
4.3
n = 0, . . . , N.
In the real world asset prices change almost continuously in time. The binomial
tree model can approximate this if the number of steps N is large. Mathematically we can even compute limits as N , in the hope that this gives a
realistic model.
A limit exists only if we make special choices for the relative up and down
moves u and d. Unless u and d tend to 1 as the number of moves N increases,
the asset price will explode and our model does not tend to a limit. We shall
think of the N moves in the binomial tree taking place in a fixed interval [0, T ],
at the times N , 2N , . . . , N N for N = T /N . Then it is reasonable to redefine
the interest rate in one time instant as rN , giving a total interest of r over the
interval [0, T ]. We also assume that, for given constants R and > 0,
dN = eN
uN = eN +
,
22
Then both d and u approach 1 as the length of a time interval tends to zero.
These choices are somewhat special, but can be motivated by the fact that the
resulting model tends to continuous time model considered later on.
Denoting by XN the number of times the stock price moves up in the time
span 1, . . . , N , the stock price SN at the terminal time N can be written as
(XN N/2)
N N XN
p
.
SN = S0 uX
= S0 exp T + T
N dN
N/4
To price a European claim C(SN ), we need to determine the distribution of
SN under the martingale measure Q. Under the measure Q, XN Bin(N, qN ),
Using a Taylor expansion ex = 1 + x + x2 /2 + o(x2 ) as x 0, we evaluate
qN
1 1 p + 2 r
erN eN N
=
=
N
+ O(N ),
2 2
eN + N eN N
as N 0. Recalling that N =
log
T
N,
we obtain qN (1 qN )
1
4
as N and
2
1
XN N qN r
SN
2
p
T
+O
= T + T
S0
N
N/4
(XN N qN )
2
2
p
+ T (r 2 )
N (r 2 )T, 2 T , N
= T
N/4
2
V0 = erT EqN C(SN ) erT E[C(ST )] = erT E C(S0 e(r 2 )T + T Z ) .
Example 4.19 (European call option). The (limiting) fair price of a European
call option with expriry time T and strike price K is V0 = erT E(ST K)+ ,
2
where log(ST /S0 ) N (r 2 )T, 2 T . This can be computed to be
log(S /K) + (r +
0
V0 = S0
T
2
2 )T
rT
Ke
log(S /K) + (r
0
2
2 )T
where (z) = P(Z z) for Z N (0, 1). This is the famous Black-Scholes
formula found in 1973. We shall recover it later in a continuous time setup.
EXERCISE 4.20. Suppose that Z N (0, 1), S, K > 0, R, 6= 0, are some
constants. Show that, for z0 = 1 (log(K/S) ),
E(Se+ Z K)+ = Se+
2
2
( z0 ) K(z0 ).
Using this, derive the (asymptotic) Black-Scholes formula from Example 4.19.
23
Chapter 5
Stochastic Processes
5.2
Brownian Motion
Brownian motion (often called also Wiener process) is a special stochastic process, which is of much interest by itself, but will also be used as a building block
to construct other processes. It can be thought of as the standard normal
process. A Brownian motion is often denoted by the letter W , after Wiener,
who was among the first to study Brownian motion in a mathematically rigorous
way. The distribution of Brownian motion is known as the Wiener measure.
A stochastic process W is a Brownian motion if
(i) the increment Wt Ws N (0, t s), for any 0 s < t;
(ii) the increment Wt Ws is independent of (Wu : u s), for any 0 s < t;
(iii) W0 = 0;
(iv) any sample path t 7 Wt () is a continuous function.
It is certainly not clear from the definition that Brownian motion exists, in
the sense that there exists a probability space with random variables Wt defined
on it that satisfy the requirements (i)(iv). However, it is a mathematical
24
theorem that Brownian motion exists, and there are several constructive ways
of exhibiting one. We shall take the existence for granted.
Properties (i) and (ii) can be understood in the sense that, given the sample
path (Wu : u s) up to some point s, Brownian motion continues from its
present value Ws by adding independent (normal) variables. In fact it can
be shown that given (Wu : u s) the process t 7 Ws+t Ws is again a
Brownian motion. Thus at every time instant Brownian motion starts anew
from its present location, independently of its past.
The properties of Brownian motion can be motivated by viewing Brownian
motion as the limit of the process in a binomial tree model, where starting from
S0 = 0 a process (Sn )n0 is constructed
Pnby moving up or down 1 in every step,
each with probability 1/2, i.e., Sn = k=1 Xk , for an i.i.d. sequence (Xk )kN
with P(Xk = 1) = P(Xk = 1) = 1/2. For a given N , we could set the values
of the process S0 , S1 , S2 , . . . at the time points 0, 1/N, 2/N, . . . and rescale the
vertical axis so that the resulting process remains stable. This leads to the
process W (N ) given by
1 X
(N )
Xi .
Wt =
N i:itN
By the Central Limit Theorem, with
(N )
Wt
1
Ws(N ) =
N
Xk
N (0, t s),
as N .
k:sN <ktN
( tW1 )t0 , (ii) (Wt )t0 , (iii) (W2t Wt )t0 , (iv) (c1/2 Wct )t0 , (v) (Wt +
t )t0 . Motivate your answers.
(1 2 )1/2 W
5.3
Filtrations
25
5.4
Martingales
5.5
5.6
Variation
Brownian motion has strange sample paths. They are continuous by assumption,
but they are not differentiable. We can see this by studying the variation of the
sample paths.
Let 0 = tn0 < tn1 < < tnkn = t be a sequence of partitions of a given interval
[0, t] such that the mesh width maxi (tni tni1 ) tends to zero as n . Then
for a continuously differentiable function f : [0, t] R we have, as n ,
Z t
kn
kn
X
n
X
0 n n
f (ti ) f (tni1 )
f (ti1 )ti tni1
|f 0 (s)| ds.
i=1
i=1
The left side of this equation is called the variation of f over the given partition.
The approximation can be shown to be correct in the sense that the variation
indeed converges to the integral on the right as the mesh width of the partition
tends to zero. We conclude that the variation of a continuously differentiable
function is bounded if the mesh width of the partition decreases to zero. As a
consequence the quadratic variation
kn
kn
X
n
X
n
f (ti ) f (tni1 )2 max f (tni ) f (tni1 )
f (ti ) f (tni1 )
i=1
i=1
26
as n .
Because the second moment of a random variable is the sum of its variance and
the square of its expectation, convergence in L2 is equivalent to EXn EX
and var(Xn X) 0 as n .
Lemma 5.3. For any sequence of partitions 0 = tn0 < . . . < tnkn = t of [0, t]
2 L 2
Pkn
t as n .
Wtni Wtni1
with maxi (tni tni1 ) 0, we have i=1
Proof. The increments Wtni Wtni1 of Brownian motion over the partition are
independent random variables with N (0, tni tni1 )-distributions. Therefore,
E
var
kn
X
|Wtni Wtni1 |2 =
kn
X
(tni tni1 ) = t,
i=1
i=1
kn
X
kn
X
|Wtni Wtni1 |2 =
i=1
2
i=1
=2
kn
X
(tni tni1 )2 2t max |tni tni1 | 0,
i
i=1
5.7
Stochastic Integrals
5.8
t 0,
5.9
dSt = t dt + t dWt .
In the case t = 0, (5.6) reduces to the ordinary differential equation dSt = t dt.
Adding the term t dWt introduces a random perturbation of this differential
equation. The infinitesimal change dSt in St is equal to t dt plus a noise term.
Because the increments of Brownian motion are independent, we interpret the
elements dWt asRindependent noise variables.
t
The integral 0 s dWs in (5.5) must of course be interpreted as a stochastic
Rt
integral in the sense of Section 5.7, whereas the integral 0 s ds is an ordinary
integral, as in calculus. The stochastic differential equation (5.6), or SDE, is
merely another way of writing (5.5), the latter integral equation being its only
mathematical interpretation. The understanding of dWt as a random noise
variable is helpful for
R t intuition, but does not make mathematical sense.
For the integral 0 s dWs to be well defined, the process must be adapted.
In many examples the process t and t are defined in terms of the process
S. For instance, a diffusion equation takes the form
(5.7)
5.10
Markov Processes
A Markov Process X is a stochastic process with the property that for every
s < t the conditional distribution of Xt given (Xu : u s) is the same as the
conditional distribution of Xt given Xs . In other words, given the present Xs
the past (Xu : u s) gives no additional information about the future Xt .
Example 5.8 (Brownian motion). Because Wt = Wt Ws +Ws and Wt Ws is
normal N (0, t s) distributed and independent of (Wu : u s), the conditional
distribution of Wt given (Wu : u s) is normal N (Ws , t s) and hence depends
on Ws only. Therefore, Brownian motion is a Markov process.
Example 5.9 (Diffusions). A diffusion process S, as given by the SDE (5.7),
does not possess independent increments as Brownian motion. However, in an
29
infinitesimal sense the increments dSt depend only on St and the infinitesimal
increment dWt , which is independent of the past. This intuitive understanding
of the evolution suggests that a diffusion process may be Markovian. This is
indeed the case, under some technical conditions.
5.11
In Section 5.6 we have seen that the quadratic variation of Brownian motion
converges to a limit as the mesh width of the partitions tends to zero. This is
true for general solutions to SDEs, except that in general the convergence must
be interpreted in probability. We say that a sequence of random variables Xn
converges in probability to a random variable X if, as n ,
P |Xn X| > 0, for every > 0.
P
This is denoted by Xn
X. Denote also Yn = op (1) if Yn
0.
Lemma 5.10. Let a stochastic process S satisfy the SDE (5.6) for adapted
processes and . Then for any sequence of partitions 0 = t0,n < . . . < tkn ,n = t
of the interval [0, t] with maxi (ti,n ti1,n ) 0 as n , we have that
Z t
kn
X
2 P
Sti,n Sti1,n
s2 ds as n .
Vn =
0
i=1
kn
X
var t2i1 (Wti Wti1 )2 (ti ti1 )
i=1
kn
X
kn
X
2
Et4i1 E (Wti Wti1 )2 (ti ti1 ) = 2
Et4i1 (ti ti1 )2 0.
i=1
i=1
Here we used the relevant properties of Brownian motion. Together these equaP
tions suggest that E(An an )2 0, which in turn implies An an
0.
Combined with the convergence
an =
kn
X
i=1
s2 ds,
P Rt
this would give the result: Vn = An + op (1) = an + An an + op (1)
0 s2 ds.
This proof can be made precise without much difficulty if the process is
bounded and left-continuous. For a complete proof we need to use a truncation
argument involving stopping times.
30
2
Pkn
Sti,n Sti1,n is called the quadratic
The limit of the sums of squares i=1
variation of the process S. It is denoted by [S]t , and also known as the square
Rt
bracket process. For a solution S to the SDE (5.6) we have [S]t = 0 s2 ds.
Besides the quadratic variation of a single process, there is also a cross
quadratic variation of a pair of processes R and S, defined as the limit (in
probability)
[R, S]t = lim
kn
X
Rti,n Rti1,n
Sti,n Sti1,n .
i=1
EXERCISE 5.11. Suppose that the processes R and S both satisfy an SDE
(5.6), but with different functions and . Guess [R, S] if
(i) R and S depend in (5.6) on the same Brownian motion.
(ii) the SDEs for R and S are driven by independent Brownian motions.
5.12
It
o Formula
The geometric Brownian motion is actually a special case of the SDE approach.
By a celebrated formula of It
o it can be shown that geometric Brownian motion
satisfies a SDE.
It
os formula is a chain rule for stochastic processes, but due to the special
nature of stochastic integrals it takes a surprising form. The version of Itos
formula we present here says that a transformation f (St ) of a process that
satisfies an SDE by a smooth function f again satisfies an SDE, and gives an
explicit expression for it.
Recall that for a stochastic process S as in (5.6), the quadratic variation is
the process [S] such that d[S]t = t2 dt.
Theorem 5.12 (It
os formula). If the stochastic process S satisfies the SDE
(5.6) and f : R R is twice continuously differentiable, then
df (St ) = f 0 (St ) dSt + 12 f 00 (St ) d[S]t .
Sketch of the proof. For a sequence of sufficiently fine partitions 0 = tn0 < tn1 <
< tnkn = t of the interval [0, t] with maxi (tni tni1 ) 0 we have
f (St ) f (S0 ) =
kn
X
f (Stni ) f (Stni1 )
i=1
kn
X
i=1
1
2
kn
X
i=1
Rt
The first term tends to the stochastic integral 0 f 0 (Ss ) dSs . By the same arguRt
ments as used in Section 5.11, the second sum tends to 21 0 f 00 (Ss )s2 ds.
We have written It
os formula in differential form, but as usually it should
be mathematically interpreted as a statement about integrals.
The striking aspect of It
os formula is the second term 12 f 00 (St ) d[S]t , which
would not appear if the sample path t 7 St were a differentiable function. As
the proof shows it does appear, because the variation of the sample paths of S
is not finite, whereas the quadratic variation tends to a nontrivial limit.
31
Example 5.13. Brownian motion W itself certainly satisfies a stochastic differential equation: the trivial one dWt = dWt .
Applied with the function f (x) = x2 Itos formula gives dWt2 = 2Wt dWt +
Rt
1
conclude that Wt2 = 2 0 Ws dWs +t. Compare this
2 2 dt, because [W ]t = t. We
R
t
to the formula f 2 (t) = 2 0 f (s) df (s) for a continuously differentiable function
f with f (0) = 0.
Example 5.14 (Geometric Brownian motion). As a consequence of Itos formula, the geometric Brownian motion St = exp( + t + Wt ) = S0 exp(t +
Wt ), with S0 = e , satisfies the SDE
dSt = ( + 21 2 )St dt + St dWt .
To see this, apply It
os formula with the process Xt = t+Wt and the function
f (x) = S0 exp(x).
It
os theorem is also valid for functions of more than one process. For instance, consider a process f (t, St ) for a function f : [0, ) R R of two arguments. Write ft , fs and fss for the partial derivatives /tf (t, s), /sf (t, s)
and 2 /s2 f (t, s), respectively.
Theorem 5.15 (It
os formula). If the stochastic process S satisfies the SDE
(5.6) and f : [0, ) R R is twice continuouly differentiable, then
df (t, St ) = ft (t, St ) dt + fs (t, St ) dSt + 12 fss (t, St ) d[S]t .
As a second example consider a process f (Rt , St ) of two stochastic processes
R and S. If an index r or s denotes partial differentiation with respect to r or
s, then we obtain the following formula.
Theorem 5.16 (It
os formula). If the stochastic processes R and S satisfy the
SDE (5.6) and f : R2 R is twice continuously differentiable, then
df (Rt , St ) = fr (Rt , St ) dRt + fs (Rt , St ) dSt + 21 frr (Rt , St ) d[R]t
+ 12 fss (t, St ) d[S]t + frs (Rt , St ) d[R, S]t .
5.13
Girsanovs Theorem
Rt
The stochastic integral 0 Xs dWs of an adapted process relative to Brownian
motion is a (local) martingale. Thus the solution S to the SDE (5.6) is the sum
Rt
Rt
of a local martingale 0 s dWs and the process At = 0 s ds. The sample paths
of the process A are the primitive functions, in the sense of ordinary calculus,
of the sample paths of the process , and are therefore differentiable. They
are referred to as drift functions. The presence of a drift function destroys the
martingale property: a solution of an SDE can be a martingale only if the drift
is zero.
Rt
Rt
Lemma 5.17. The process S = (St )t0 defined by St = 0 s ds + 0 s dWs
is a local martingale if and only if = 0.
32
5.14
Brownian Representation
Let (Ft )t0 be the natural filtration of a given Brownian motion W . Stochastic
processes defined on the same outcome space that are martingales relative to
this Brownian filtration are referred to as Brownian martingales. Brownian
motion itself is an example, and so are all stochastic integrals X B for adapted
processes X.
The following theorem shows that these are the only Brownian martingales.
Theorem 5.21. Let {Ft } be the (completion of the) natural filtration of a
Brownian motion process W . If M is a (cadlag) local martingale relative to
Rt
{Ft }, then there exists a predictable process X with 0 Xs2 ds < almost
Rt
surely for every t 0 such that Mt = M0 + 0 Xs dWs .
This Brownian representation theorem remains true if the filtration is generated by multiple, independent Brownian motion processes W (1) , W (2) , . . . , W (d) .
Then an arbitrary (cadlag) local martingale can be written as Mt = M0 +
Pd R t (i)
(i)
i=1 0 Xs dWs .
Because the representation of X in terms of the intervals (ui , vi ] and Aui is not
unique (we could for instance split up the intervals further), it must be verified
that this definition is consistent, but we omit this part of the proof.
We next verify property (ii) for simple adapted processes X
Pand t = . 2If,
as we assume, the intervals (ui , vi ] are disjoint, then Xt2 =
i 1(ui ,vi ] (t)Aui .
Therefore, the right side of (ii) with t = is equal to
Z
Z X
X
E Xs2 ds = E
1(ui ,vi ] (t)A2ui ds =
EA2ui (vi ui ).
i
34
Because the intervals (ui , vi ] are disjoint, we have that E(Wvi Wui )(Wvj
Wuj ) = 0 for i 6= j, by the independence and the zero means of the increments
of Brownian motion. It follows that the diagonal terms in the double sum vanish,
whence the preceding display is equal to
X
X
X
E
A2ui (Wvi Wui )2 =
EA2ui E(Wvi Wui )2 =
A2ui (vi ui ),
i
where in the second step we use the independence of the increment Wvi Wui of
Fui and hence of Aui . Thus we have verified that for simple adapted processes
X
Z
Z
2
E
Xs dWs = E Xs2 ds.
In words we have shown that the integral is a linear isometry. A linear
isometry between normed spaces X and Y is a linear map I : X Y such that
kI(x)kY = kxkX for every x X. This isometry is the basis for the extension of
the integral to general adapted processes, by way of the following result from
analysis.
Any linear isometry I : X0 X Y from a linear subspace X0
of a normed space X into a complete normed space Y possesses
a
0 = X
unique extension to an isometry defined on
the
closure
X
X : {Xn } X0 such that kXn Xk 0 of X0 in X.
In our situation
we take the space X equal to all adapted processes X with
Rt
kXk2X = E 0 Xs2 ds < , and X0 equal to the Rcollection of all simple adapted
processes. We have seen that the map I : X 7 Xs dWs is an isometry into the
set Y of random variables with finite second moments, with kY k2Y = EY 2 . Thus
the integral can be extended to the closure of the set of simple Radapted processes.
t
That this closure is the set of all adapted processes with E 0 Xs2 ds < can
be shown by approximation by step functions. We omit the details of this part
of the proof.
Thus the integral is defined. The verification of its properties (i)(iii) proceeds by first verifying that these assertions hold on the set X0 of simple processes and next showing that these properties are preserved under taking limits.
For a simple adapted process of the form Xt = 1(u,v] (t)A with (A) Fu
and s < t we have
(
Z t
Z s
A(Wtv Wsu ), if t v > s u,
Xr dWr
Xr dWr =
0,
otherwise.
0
0
Because A is known at time u s u and Brownian motion is a martingale we
have E(A(Wtv Wsu )|Fsu ) = AE(Wtv Wsu |Fsu ) = 0. Therefore, with
the help of the tower property of conditional expectation, it follows that
Z s
Z t
E
Xr dWr
Xr dWr |Fs = 0.
0
Rt
Thus the stochastic integral 0 Xs dWs is a martingale for X of this form. Because the sum of two martingales is again a martingale, this conclusion extends
to all simple adapted processes. Because the martingale property is preserved
under taking L2 -limits, it next extends to the stochastic integral in general.
35
*Stopping
Stopping times are intuitively meaningful objects that have interest on their
own, and are also essential for extension of the definition of stochastic integrals,
as given in the next section. However, we shall not need the material in this
and the following section in later chapters.
A stopping time relative to a filtration (Ft )t0 is a random variable T with
values in [0, ] such that {T t} Ft for every t 0. A stopping time
formalizes a strategy to play (or invest) on the market until a given time, which
need not be predetermined, but may be based on observing the market. The
requirement that the event {T t} is known at time t says that the decision to
stop trading must be made based on information collected in past and present.
If the filtration is generated by a process X, then this requirement implies that
the decision to stop at time t must be based on the sample path of X until t.
Example 5.23 (Hitting time). If X is an adapted process with continuous
sample paths, then T = inf{t 0 : Xt B} is a stopping time for every (Borel)
set B. This is known as the hitting time of B.
Stopping times are important tools in the theory of stochastic processes,
but are also crucial to evaluate American options. These are contracts that give
the holder the right to collect a certain payment at a time t in a given interval
[0, T ] of his own choosing. The amount of the payment depends on the history
of an asset price up to the time of payment. For instance, an American call
option on an asset with price process S gives the right to buy the asset at a
predetermined price K at any time t in an interval [0, T ]. This corresponds
to a payment of (St K)+ at the chosen time t. The financial problem is to
determine an optimal stopping time for the payment, and to evaluate the value
of the resulting contract.
Given a stopping time T and a stochastic process the stopped process X T is
defined as the stochastic process such that
(X T )t = XT t .
The sample paths of the stopped process are identical to the sample paths X
up to time T and take the constant value XT for t T .
Theorem 5.24. If X is a martingale, then so is X T .
More explicitly, the theorem says that, if X is a martingale, then
E(XT t |Fs ) = XT s ,
s < t.
36
Rt
The finiteness (5.27) of the (nondecreasing) process 0 Xs2 ds implies that Tn
Rt
as n . From the definition of Tn it follows immediately that 0 Xs2 ds n
Rt
R tT
if t Tn . Consequently E 0 (Xs 1sTn )2 ds = E 0 n Xs2 ds En = n < .
We can therefore define, for every n and t,
Z t
(Xs 1sTn ) dWs .
0
Rt
We define 0 Xs dWs as the limit of these variables, in the almost sure sense, as
n . It can be shown that this limit indeed exists.
Each of the Rprocesses in the preceding display is a martingale. The stochastic
t
integral Yt = 0 Xs Ws is the limit of these martingales, but need not be a
martingale itself. (The limit is only in an almost sure sense, and this is not strong
enough to preserve the martingale property.) However, the stopped process Y Tn
Rt
is exactly the the integral 0 (Xs 1sTn ) dWs and hence is a martingale. This has
Rt
gained the stochastic integral 0 Xs dWs the name of being a local martingale.
37
Chapter 6
Black-Scholes Model
In this chapter we assume that we can trade continuously in a (riskless) bond
and some risky asset, for instance a stock. We assume that the bond price B
evolves as
Bt = ert ,
where r is the riskless interest rate. The price process S of the risky asset is
assumed to be a geometric Brownian motion, i.e.
St = S0 et+Wt ,
t 0.
Here W = (Wt )t0 is a Brownian motion, R is called the drift of the process,
and the volatility. We denote by (Ft )t0 the filtration generated by the price
process S. Observe that (Ft )t0 is also the natural filtration of the Brownian
motion W , since both processes generate the same flow of information.
For some fixed T > 0, let C be a FT -measurable random variable, i.e., its
value is determined by the information up till time T . We think of C as the
pay-off at time T of some contingent claim. For technical reasons, we assume
that EC 2 < . We want to answer the same question as in the discrete-time
setup: What is the fair price of the claim C at time zero?
To answer this question we follow the same route as in Chapter 4. We first
use Girsanovs theorem to change the underlying probability measure in such
a way that the discounted asset price St = ert St becomes a martingale under
the new measure Q. Then we consider the Q-martingale Vt = EQ (erT C|Ft )
and use the representation theorem to construct a self-financing trading strategy
that replicates the pay-off C. By an arbitrage argument, the value of this trading
portfolio at time zero must be the fair price of the claim. As in the binomial
model, the fair price of the claim at time t = 0 turns out to be EQ erT C, the
expectation under the martingale measure of the discounted pay-off.
6.1
Portfolios
Before we can carry out the programme outlined in the preceding section we
have to give a mathematically precise definition of a self-financing portfolio in
the present continuous-time setting. A portfolio is just a pair of predictable
processes (t , t ). We interpret t as the number of risky assets held at time t,
38
With the portfolio (, ) we associate the value process V = (Vt )t0 defined by
Vt = t St + t Bt .
For hedging strategies we need the notion of a self-financing portfolio. Such
a portfolio is created using some starting capital at time zero, and after time
zero the portfolio is only changed by rebalancing, i.e., by replacing bonds by
the risky asset or vice versa. No additional injections or withdrawals of money
are allowed. Loosely speaking, such a portfolio has the property that in an
infinitesimally small time interval [t, t + dt], the changes in the portfolio value
are only caused by changes in the price processes S and B, and not by changes
in t and t which are due to injections or withdrawals of money. Therefore,
we call a portfolio (, ) self-financing if its price process V satisfies the SDE
dVt = t dSt + t dBt .
A replicating, or hedging portfolio for the claim C is a self-financing portfolio
(, ) with a value process V which satisfies VT = C. If such a portfolio exists,
then an arbitrage argument shows that the fair price of the claim at time
t [0, T ] equals the value Vt of the portfolio.
Of course, the arbitrage argument is an economic one, and not a mathematical argument. When we use the phrase fair price in mathematical theorems
below, the fair price or value will always be understood to be defined as
the value process of a replicating portfolio. (We shall be a bit careless about
the still open trap that there may be more than one replicating portfolios, with
different value processes.)
6.2
Let us now derive the pricing formula for the derivative C announced in the
previous section.
The discounted asset price is St = Bt1 St = ert St = S0 exp{(r)t+Wt },
whence it is a geometric Brownian motion with drift r and volatility . By
Example 5.14, it satisfies the SDE
dSt = ( r + 12 2 )St dt + St dWt .
t = Wt + t( r + 1 2 )/, this simplifies to
If we define W
2
(6.1)
t.
dSt = St dW
Now consider the process Vt = EQ (erT C|Ft ), t [0, T ]. By the tower property of conditional expectations, this is a Q-martingale relative to the filtration
(Ft )0tT . It is obvious that the natural filtration (Ft ) of W is also the natural
, the processes generate the same flow of information.
filtration of the process W
Hence, by the Brownian representation theorem, Theorem 5.21, there exists a
t . Defining t = t / St , we have
predictable process such that dVt = t dW
dVt = t dSt .
40
EXERCISE 6.5. Show that a portfolio process (, ) with the value process V
is self-financing iff (6.2) holds for the discounted value process Vt = ert Vt .
Since S is a martingale under Q, a self-financing portfolio makes the discounted value process V a martingale under Q, by Exercise 6.5. This establishes
again Theorem 6.4.
EXERCISE 6.6. Let V be the value process of a self-financing portfolio that
replicates a simple claim C = C(ST ). Show that there exists a process g such
t.
that V satisfies dVt = rVt dt + gt dW
Let us turn to arbitrage portfolios. By definition these are self-financing and
such that the corresponding value process V satisfies P(V0 = 0) = 1, P(VT
0) = 1 and P(VT > 0) > 0. Clearly, we can equivalently rephrase the latter
three conditions in terms of the discounted valued process V , P(V0 = 0) = 1,
P(VT 0) = 1 and P(VT > 0) > 0. As before, a market is arbitrage free, if no
arbitrage portfolios exist.
EXERCISE 6.7. Show that Black-Scholes market is arbitrage free (you may
use the fact that the martingale measure Q and the measure P are equivalent).
6.3
European Options
If the claim C is European, meaning that it is of the form C = f (ST ) for some
function f , then we can derive a more explicit formula for its fair price.
Recall that
2
St = S0 et+Wt = S0 e(r
2 )t+ Wt ,
2
2 )(T t)+ T tZ ,
t [0, T ],
2
= er(T t) EQ f St e(r 2 )(T t)+ T tZ Ft = F (t, St ),
where
(6.8)
er(T t)
F (t, x) =
2
z2
2
f xe(r 2 )(T t)+z T t e 2 dz.
41
log(x/K) + (r + 2 /2)
d2 (, x) = d1 (, x) .
6.4
The pricing function F (t, x) from Theorem 6.9 can also be obtained as the
solution of the so-called Black-Scholes partial differential equation (PDE). This
provides a second method for finding the price of a European claim. Usually
this PDE cannot be solved analytically and one has to use numerical methods.
Theorem 6.11. The value of a European claim C = f (ST ) at time t [0, T ]
is given by Vt = F (t, St ), where F is the solution of the partial differential
equation
2
Ft (t, x) + rxFx (t, x) + 2 x2 Fxx (t, x) rF (t, x) = 0,
subject to the boundary condition F (T, x) = f (x).
Proof. The function F is smooth in both arguments. Therefore, we can apply
It
os formula to the value process Vt = F (t, St ) to see that this satisfies the SDE
dVt = Ft (t, St ) dt + Fx (t, St ) dSt + 21 Fxx (t, St ) d[S]t .
t under Q, St = S0 e(r
In terms of a Browinian motion W
t,
dSt = rSt dt + St dW
2 )t+ Wt ,
so that
2 2
2 St Fxx (t, St )
t.
dt + Fx (t, St )St dW
t , so that
On the other hand, (6.1) and (6.2) imply that dVt = t St dW
Vt = ert Vt satisfies
t = rF (t, St ) dt + t St dW
t.
dVt = rert Vt dt + ert dVt = rVt dt + t ert St dW
Comparison of the dt-terms of the last two equations for dVt yields the PDE for
the function F , with the boundary condition F (T, ST ) = VT = C = f (ST ).
42
It should be noted that the PDE for the value process is the same for every
type of European option. The type of option is only important for the boundary
condition.
In the proof of the preceding theorem we only compared the dt-terms of
the two SDEs that we obtained for the value process V of the claim. By
t -terms, we obtain the following explicit formulas for the
comparing the dW
hedging portfolio of the claim.
Theorem 6.12. A European claim C = f (ST ) with value process Vt = F (t, St )
can be hedged by a self-financing portfolio consisting at time t of t risky assets
and t bonds, where
t = Fx (t, St ),
t = ert F (t, St ) Fx (t, St )St .
t -terms of the
Proof. The formula for t follows from the comparison of the dW
two SDEs for V that we obtained in the proof of the preceding theorem. Recall
that t = Vt t St . Substituting Vt = F (t, St ) and t = Fx (t, St ) yields the
formula for t .
The hedging strategy exhibited in the preceding theorem is called the delta
hedge for the claim. Note that in general the numbers of stocks and bonds in the
hedging portfolio change continuously. In practice it is of course not possible to
trade continuously. Moreover, very frequent trading will not always be sensible
in view of transaction costs. However, the delta hedge can be derived to indicate
what a hedging portfolio should look like.
In all exercises below we consider the standard Black-Scholes model.
EXERCISE 6.13. For a < b, consider the claim C(ST ) = K1(a,b) (ST ) (called
a binary spread ). Determine the fair price of this claim at any time t [0, T ].
EXERCISE 6.14. Consider a straddle claim with pay-off at maturity T equal
to C = |ST K|. Determine the fair price of the straddle at any time t [0, T ].
This claim can also be hedged with a constant portfolio that not only consists
of shares and bonds, but contains as a third component European call options
as well. Find this portfolio.
EXERCISE 6.15. Consider a bull spread claim with the following pay-off C =
min{max{ST , A}, B}, where B > A > 0. This claim can be hedged with a
constant portfolio consisting of stocks, bonds and European call options. Find
this portfolio and the price process.
EXERCISE 6.16 (Feynman-Kac theorem). Consider the following SDE: dXt =
(t, Xt )dt + (t, Xt )dWt , t [0, T ], for a fixed T > 0, X0 is independent of W .
Let h(x) be a Borel measurable function such that E|h(XT )| < . Then there
exists a function g(t, x) : R2 R, such that g(t, Xt ) = E(h(XT )|Ft ), t [0, T ],
Ft = (Ws , s t). Assume that g is twice continuously differentiable. Show
that g(t, x) satisfies the PDE gt (t, x)+(t, x)gx (t, x)+ 21 2 (t, x)gxx (t, x) = 0 with
the boundary condition g(T, x) = h(x). Formulate a version of Feynman-Kac
theorem also for the discounted function f (t, Xt ) = E(er(T t) h(XT )|Ft ).
EXERCISE 6.17. For a fixed T > 0, consider the PDE ft (t, x) + 2tfx (t, x) +
t4
2
2 fxx (t, x) = 0, with the boundary condition f (T, x) = x . Find an explicit
solution of this problem by using the Feynman-Kac theorem (see Exercise 6.16)
and verify this solution by straightforward computations.
43
6.5
The Greeks
Parties which are trading a claim with associated value function Vt = F (t, St )
are often interested in the sensitivity of the price of the claim with respect to
the price of the underlying asses, and also with respect to time t, interest rate r
and volatility . Reasonable measures for these sensitivities are the derivatives
of the function F (t, x). These derivatives have special names. The quantities
= Fx ,
= Fxx ,
= Fr ,
= Ft ,
V = F
are called the delta, gamma, rho, theta and vega of the claim, respectively.
Together they are called the Greeks.
For instance, the delta of a claim measures the first order dependence of the
price of the claim relative to the price of the underlying asset. A very high delta
means that small changes in the asset price cause relatively large changes in the
value of the claim. Observe that the delta is precisely the number of stocks in
the hedging portfolio of Theorem 6.12.
EXERCISE 6.20. Calculate the Greeks for the European call option and give
the delta hedging strategy for the claim.
6.6
General Claims
2 )t+ Wt ,
44
convergence in distribution n(
cn EQ C)/sn
N (0, 1) as n . Hence, for
large n, we have the approximation
c E C
n
Q
Q n
> 1.96 0.05.
sn
It follows that
rT
e
(
cn 1.96sn / n), erT (
cn + 1.96sn / n)
is an approximate 95%-confidence interval for the price of the claim C. The
length of the interval tends to zero as n , which means that our simulation
scheme can achieve arbitrary accuracy if we simulate long enough. In practice
we shall be limited by computation time.
6.7
Bt = ert ,
respectively, where r is the European interest rate and q is the US interest rate.
The exchange rate Et , the euro value of one dollar, is modelled as a geometric
Brownian motion for certain parameters , and a Brownian motion W :
Et = E0 et+Wt .
From the Dutch perspective, we can now trade in two assets: the riskless
euro bond B and the risky US bond S, which have (euro) price processes
Bt = ert ,
St = Et Dt = S0 e(q+)t+Wt ,
Chapter 7
Extended Black-Scholes
Models
The classical Black-Scholes model that we considered in the preceding chapter
can be extended in several directions. So far we only considered markets in
which a single bond and one risky asset are traded. We can also study the more
complex situation when there are several risky assets with price processes that
do not evolve independently. This allows the pricing of derivatives which depend
on the behaviour of several assets. The assumption of a constant drift and
volatility can also be relaxed. They can be replaced by arbitrary, predictable
stochastic processes.
In general we can consider a market in which a bond is traded with price
process B and n risky assets with price processes
S 1 , . . . , S n . We assume that
Rt
the bond price is of the form Bt = exp( 0 rs ds) for rt the interest rate at
time t, so that it satisfies the ordinary diffential equation
dBt = Bt rt dt,
B0 = 1.
The interest rate r may be an arbitrary predictable process and hence depend on
all information before time t. We assume that the asset price processes satisfy
the system of stochastic differential equations
(7.1)
d
X
tij dWtj ,
i = 1, . . . , n,
j=1
7.1
The key structural condition needed to push through the theory is the existence
of a predictable, vector-valued process = (1 , . . . , d ), called the market price
of risk, such that
(7.2)
d
X
tij tj = rt it ,
i = 1, 2, . . . , n.
j=1
7.2
Fair Prices
dVt =
n
X
it dSti + t dBt .
i=1
j=1
j=1 0 s dWs .
St = Bt St = S0 e
The second equality follows from the definition of B, the SDE (7.1) for the asset
prices and It
os formula, applied as in Example 5.14. By Itos formula,
dSti = Sti (it rt ) dt + Sti
d
X
j=1
47
tij dWtj .
If there exists a market price of risk process , this can be rewritten in the form
d
X
dSti = Sti
tj ,
tij dW
j=1
Rt
If the matrices t are square and invertible, then this representation can be easily
obtained from the vector-valued version of the Brownian representation theorem,
Theorem 5.21, by the same arguments as in Section 6.2. More generally, the
desired representation is typically possible if the filtration (Ft ) is generated by
the asset processes Sti . In the following theorem we refer to this assumption by
assuming that the stock price processes possess the representation property.
Theorem 7.4. Assume that there exists a predictable process satisfying (7.2),
and that the stock price processes possess the representation property. Further
RT
RT
more, assume that E exp 21 0 ks k2 ds < and E exp 21 0 ks k2 ds <
. Then the value of the claim C FT at time t [0, T ] is given by
Bt EQ (BT1 C|Ft ), where Q is the measure under which the discounted price
processes Bt1 Sti are local martingales.
7.3
Arbitrage
i=1
n Z t
X
i=1
is dSsi =
Vs rs ds +
0
i=1
n Z t
X
i=1
The last display and the fact that dBt = rt Bt dt imply that
d(Bt1 Vt ) = Vt Bt2 dBt + Bt1 dVt = Bt1
n
X
i=1
48
Hence, in view of (7.1), the discounted value process takes the form
(7.5)
Vt = Bt1 Vt = V0 +
n Z
X
i=1
Bs1 Ssi is
d
X
sij dWsj (rs is ) ds .
j=1
This formula does not make explicit reference to the amount invested in
the bond, which has been eliminated. A partial strategy defines a value
process P
through (7.5), and given we can define from the equation Vt =
n
t Bt + i=1 it Sti . By retracing the calculations, the resulting strategy (, )
can be seen to be self-financing and to possess value process Vt . Thus, to see
which value processes are possible, it suffices to construct the stock portfolio .
Nonexistence of a market price of risk process implies that the vector rt 1t
is not contained in the range of t , for a set of times t. Then there exists a vector
t such that the vector (St1 1t , . . . , Stn nt ) is orthogonal to this range and its inner
product with the vector rt 1 t is strictly negative for a set of times t:
n
X
Sti it tij = 0,
j = 1, . . . , d,
i=1
n
X
i=1
We can arrange it so that the latter inner product is never positive and hence,
by (7.5), the corresponding discounted gain process will be zero at time 0 and
strictly positive at time T . This is an example of arbitrage.
On the other hand, if the market price of risk process exists, then the
discounted gains process in (7.5) can be written as a stochastic integral relative
R
, for W
t = Wt t s ds. Under the martingale measure Q
to the process W
0
is a Brownian motion, and hence the discounted gains process
the process W
will be a local martingale. Under the integrability assumptions of Theorem 7.4
it is a Q-martingale, and hence cannot become strictly positive as its mean must
remain zero. Thus existence of the market price of risk is intimately connected
to the nonexistence of arbitrage.
7.4
PDEs
= (W
1, . . . , W
d ) defined
Under the conditions of Theorem 7.4, the process W
Rt
t = Wt s ds is a Brownian motion under the martingale measure Q.
by W
0
Because option prices can be written as expectations under Q, it is useful to
. If
rewrite the stochastic differential equation (7.1) in terms of the process W
we also assume that the processes r and take the forms rt = r(t, Bt , St ) and
t = (t, Bt , St ), then the equations describing the asset prices become
dBt = Bt r(t, Bt , St ) dt,
(7.6)
d
X
tj ,
ij (t, Bt , St ) dW
i = 1, . . . , n.
j=1
are the same.
and hence the filtrations Ft and FtW generated by (B, S) and W
The process (B, S) is then Markovian relative to its own filtration Ft . In that
case a conditional expectation of the type EQ (X|Ft ) of a random variable X
that is a function of (Bs , Ss )st can be written as F (t, Bt , St ) for a function F .
This observation can be used to characterize the value processes of certain
options through a partial differential equation. The value process of a claim
that is a function C = g(ST ) of the final value ST takes the form
Vt = Bt EQ
g(S )
RT
T
Ft = EQ e t r(s,Bs ,Ss ) ds g(ST )|Ft .
BT
50
Chapter 8
8.1.1
Discount Bonds
Pure discount bonds are simple financial contracts that capture the time value
of money. A discount bond which matures at time T > 0, also called a T -bond,
is a contract which guarantees a pay-off of 1 euro at time T . The price of a
T -bond at time t T is denoted by P (t, T ). It is the amount we are willing to
pay at time t to receive 1 euro at time T . The collection {P (0, T ) : T > 0} of
all bond prices at time t = 0 completely determines the time-value of money at
time 0. It is called the term structure of interest rates.
For fixed t, the function T 7 P (t, T ) is typically smooth, since, for instance,
the price of a bond that matures 9 years from now will be close to the price of
a bond that matures 10 years from now. For a fixed maturity T > 0 however,
the function t 7 P (t, T ) will appear to fluctuate randomly. By construction it
holds that P (T, T ) = 1.
8.1.2
Yields
If we have 1 euro at time t, we can use it to buy 1/P (t, T ) T -bonds. At time
T we then receive 1/P (t, T ) euros. Hence, a euro at time t grows to 1/P (t, T )
euros at time T . If the interest rate over the interval [t, T ] had been constant,
say r, a euro at time t would have grown to er(T t) at time T . If we compare
51
these (i.e., 1/P (t, T ) = er(T t) ), we see that buying the T -bonds at time t leads
to a constant interest rate over the time interval [t, T ] of
Y (t, T ) =
(8.1)
log P (t, T )
.
T t
We call this the yield over [t, T ]. The collection of all yields of course contains
exactly the same information as the collection of all bond prices. However, the
yields have a somewhat easier interpretation in terms of interest rates.
8.1.3
Short Rate
Although the interest rate does not exist, we can construct an object that can
be interpreted in this way. We just saw that the yield Y (t, T ) can be interpreted
as the constant interest rate valid in the time interval [t, T ]. The number
log P (t, T )
rt = lim Y (t, T ) =
T t
T
T =t
can therefore be viewed as the interest rate at time t (or in the infinitesimal
interval [t, t + dt]). We call rt the short rate at time t. From its definition it is
clear that in general, the short rate does not contain all information about the
time value of money.
8.1.4
Forward Rates
Let t < S < T and consider the following strategy. At time t, we sell one S-bond,
giving us P (t, S) euros. We immediately use this money to buy P (t, S)/P (t, T )
T -bonds. At time S the S-bond matures, and we have to pay one euro to its
holder. At time T the T -bond matures, and we receive P (t, S)/P (t, T ) euros.
If we follow this strategy, the net effect is that one euro at time S grows
to P (t, S)/P (t, T ) euros at time T . If the interest rate were a constant r over
the time interval [S, T ], one euro at time S would grow to er(T S) at time T .
Hence, the constant interest rate over [S, T ] determined at time t is
This quantity is called the forward rate for [S, T ], contracted at time t. If we
let S T , we get
log P (t, T ),
T
which is the forward rate at time T , contracted at time t. Note that the short
rate is a particularRforward rate, namely f (t, t) = rt . Moreover, it is easy to see
T
that P (t, T ) = e t f (t,s) ds , so the collection of all forward rates contains all
information about the term structure of interest rates.
(8.2)
8.2
f (t, T ) =
So let us suppose that under the real world probability measure P, the
short rate satisfies the SDE
(8.3)
If we combine this with the SDE (8.3) for the short rate rt we obtain
(8.5)
T =
T
FtT + FrT + 12 2 Frr
,
FT
53
T =
FrT
.
FT
dP (t, T )
dP (t, S)
+ uSt Vt
,
P (t, T )
P (t, S)
Tt P (t, T )
,
Vt
uSt =
St P (t, S)
.
Vt
equivalently
tS rt
T rt
= t T .
S
t
t
54
Theorem 8.8. Let (t, rt ) denote the market price of risk. Then for every
T > 0 the function F T satisfies the PDE
T
FtT + ( )FrT + 21 2 Frr
rF T = 0,
Rt
0
(s, rs )ds).
Note that for every T > 0 the discounted price P (t, T ) = Bt1 P (t, T ) of a
T -bound satisfies
Bt1 P (t, T ) = EQ BT1 |Ft ,
so that for every T > 0 the process (P (t, T ))tT is a martingale under Q.
Therefore the measure Q appearing in the statement of the theorem is called
the martingale measure of the model. Observe that the formula
P (0, T ) = EQ BT1
for the current price of a T -bond is a statement of the usual form price of a
claim is the expectation of the discounted payoff under the martingale measure,
since a T -bond can be viewed as a claim which pays off 1 euro at time T .
Theorem 8.10 gives us a second method for the construction of a model for
an arbitrage free bond market:
55
1) Specify an SDE for the short rate rt under the martingale measure Q and
let (Ft ) be the natural filtration of the process r.
RT
2) Define the price P (t, T ) of a T -bond by P (t, T ) = EQ e t rs ds |Ft .
This second procedure for the construction of short rate models is known
as martingale modeling and has the obvious advantage that we do not have to
specify the market price of risk (and function ) explicitly. Instead, we only
need to specify the functions and , the volatility and the drift term of
the short rate SDE under the martingale measure Q.
8.3
The Hull-White model for the term structure of interest rates assumes that
under the martingale measure Q the short rate rt satisfies the SDE
(8.11)
(8.13)
+ B(t, T )eat
Z
= B(t, T )rt +
eau dWu +
0
T
B(u, T ) dWu
t
Z
B(u, T )(u) du +
B(u, T ) dWu .
t
Clearly, the first two terms on the right-hand side are Ft -measurable. The
third one is independent of Ft and is Gaussian with zero mean and variance
RT
2 t B 2 (u, T ) du. This completes the proof.
56
We can now derive the bond price formula for the Hull-White model.
Theorem 8.15. In the Hull-White model the price of a T -bond is given by
P (t, T ) = eA(t,T )B(t,T )rt ,
(8.16)
RT
t
2 2
2 B (u, T )
(u)B(u, T ) du.
RT
Proof. We have to calculate the expectation of exp( t rs ds) given Ft . By
the preceding lemma this boils down to computing the expectation of the exponential of a Gaussian random variable. If Z N (m, s2 ), it holds that
E exp(Z) = exp(m + s2 /2). Together with the lemma this yields the statement
of the theorem.
Short rate models in which the bond price is of the form (8.16) are called
affine models. The reason for this name is that the yields and forward rates are
affine in rt in that case. Indeed, by (8.16) and (8.1)(8.2), the yield Y (t, T ) and
the forward rate f (t, T ) in the Hull-White model are as follows:
Y (t, T ) =
A(t, T )
B(t, T )
rt
,
T t
T t
Now consider a specific bond market in which bonds of all maturities are
traded. Then at time zero, we can observe the bond prices and forward rates
with all maturities. We denote the observed prices and rates in the market by
P (0, T ) and f (0, T ), respectively. On the other hand, the Hull-White model
gives the formula for the forward rates:
f (0, T ) = BT (0, T )r0 AT (0, T ).
(8.17)
Obviously, we would like to match the theoretical rates f (0, T ) with the observed
rates f (0, T ). We will now show that this is possible by choosing an appropriate
function in (8.11). This procedure is called fitting the model to the term
structure of interest rates.
Theorem 8.18. Let the parameters a, in (8.11) be given. Then with the
choice
(T ) = af (0, T ) + fT (0, T ) + 2 B(0, T ) eaT + a2 B(0, T )
(8.19)
the theoretical Hull-White forward rates coincide with the observed rates. The
corresponding price of a T -bond is then given by
P (t, T ) =
2 2
P (0, T )
exp B(t, T )f (0, t)
B (t, T )(1 e2at ) B(t, T )rt .
P (0, t)
4a
Proof. If we insert the expressions for A and B into (8.17) we see that we have
to solve the equation
f (0, T ) = e
aT
Z
r0 +
ea(T u) (u) du
57
2
(1 eaT )2 = g(T ) h(T ),
2a2
8.4
The result of Theorem 8.10 can be viewed as a pricing formula for the simplest
possible claim which has a payoff of one euro at time T . Using the same arguments as above it can be extended to a general claim which pays some random
amount C FT at time T .
Theorem 8.20.
Let C FT be a claim. Its value at time t T is given by
RT
Vt = EQ e t rs ds C|Ft , where Q is the martingale measure.
Note that for C 1 we indeed recover the formula for the bond price P (t, T ).
Many interest rate derivatives do not only have a payment at the time T of
maturity, but also at certain fixed intermediate times. Holding such a product
(called coupon) is equivalent to holding a portfolio of derivatives with different
maturities. Hence, Theorem 8.20 implies the following.
Theorem 8.21. Let 0 < T1 < < Tn = T and Ci FTi for i = 1, . . . , n.
Consider a derivative with a payment of Ci at time Ti for i = 1, . . . , n. Its value
at time 0 is given by
n
R Ti
X
V0 = EQ
e 0 rs ds Ci ,
i=1
)
.
j
n
j=1
58
The usual approach to simulating realizations of the short rate is to discretize the SDE for rt under Q. Suppose that under Q the short rate satisfies
(8.3), where and are given functions. Then for small > 0 we have the
approximation
r(k+1) rk (k, rk ) + (k, rk )(W(k+1) Wk ),
k = 0, 1, 2, . . . .
The expression for t in the statement of the theorem now follows after inserting
(8.19) and some straightforward calculations. To prove the recurrence formula
for the process y, observe that the random variables
Z (k+1)
ea(k+1) y(k+1) eak yk =
eau dWu , k = 0, 1, . . . ,
k
R (k+1) 2au
are independent, Gaussian, with zero means and variances 2 k
e
du =
2 2ak 2a
e
(e 1)/(2a). Hence, with i.i.d. standard Gaussian Z1 , Z2 , . . .,
r
2 (e2a 1)
d ak
a(k+1)
ak
e
y(k+1) e yk = e
Zk+1 , k = 0, 1, . . . .
2a
The proof is completed by dividing this by ea(k+1) .
59
8.5
Below we discuss some common interest rate products and their valuation.
8.5.1
In practice pure discount bonds are not often traded. Instead, bonds typically
do not only have a payoff at maturity, the so called principal value, but also make
smaller regular payments before maturity. Such a bond is called a coupon bond.
A 10-year, 5% coupon bond with a principal value of 100 euros for instance,
pays 5 euros every year until maturity and 100 euros at maturity (after ten
year).
More generally, suppose that a bond makes a payment of k euros at times
T1 < . . . < Tn = T , and pays off its principal value of 1 euro at time T . Then
holding this coupon bond is equivalent to holding k pure discount bonds with
maturity Ti for i = 1, . . . , n, and one T -bond. Hence, the value of the coupon
bond is
n
X
P (0, Ti ).
P (0, T ) + k
i=1
We remark that conversely, the prices of pure discount bonds may be expressed in terms of the prices of coupon bonds. In practice this is the usual way
in which the prices of discount bonds are inferred from market data.
8.5.2
There also exist bonds with intermediate payments that depend on the interest
rates at the time of the payment. The LIBOR rate for the time interval [S, T ],
set at time S is defined as
1
1
P (S, T ) 1
=
1 .
L(S, T ) =
(T S)P (S, T )
T S P (S, T )
This is simply the return per time unit on an investment at time S in a T -bond.
A floating rate bond is a bond with additional payments at times T1 < . . . <
Tn = T . The payment Ci at time Ti is
(Ti Ti1 )L(Ti1 , Ti ) =
1
P (Ti1 , Ti )
1.
This is precisely the gain we would have had at time Ti if we had bought one
euro worth of Ti -bonds at time Ti1 . The principal value of one euro is payed
at time T .
By Theorem 8.21, the price of this asset at time 0 is given by
X
P (0, T ) +
EQ BT1
(P (Ti1 , Ti )1 1).
i
i
By the tower property of conditional expectation and Theorem 8.10 the ith term
in the sum equals
EQ EQ BT1
(P (Ti1 , Ti )1 1)|FTi1 = EQ (P (Ti1 , Ti )1 1)BT1
P (Ti1 , Ti )
i
i1
= EQ BT1
EQ BT1
P (Ti1 , Ti ) = P (0, Ti1 ) P (0, Ti ).
i1
i1
60
n
X
P (0, Ti1 ) P (0, Ti ) = P (0, 0) = 1
i=1
8.5.3
Swaps
n
X
P (0, Ti ) 1.
i=1
8.5.4
Swaptions
A swaption is a contract giving the holder the right to enter into a swap at a
future date. Suppose for instance that the contract gives the right to enter at
time T0 > 0 into a swap described in the preceding section. Then the payoff at
time T0 of the option is
P (T0 , Tn ) + k
n
X
+
P (T0 , Ti ) 1 .
i=1
R T0
0
rs ds
P (T0 , Tn ) + k
n
X
+
P (T0 , Ti ) 1 .
i=1
In general this expectation can not be evaluated analytically and one has to
resort to numerical methods. Note by the way that the latter formula shows
that a swaption can also be viewed as a call option on a coupon bond.
61
Chapter 9
Risk Measurement
Financial institutions deal in risk of various types. Market risk is the exposure to
the changing prices of assets on the market, and can be limited by using appropriate portfolios that include instruments such as options and swaps. Managing
risk is important for:
- Internal management, e.g. optimizing profit subject to restrictions on risk.
- To fulfill the requirements of regulatory authorities, such as national banks.
- Credit ratings.
The management of risk requires that risk be measured. In this chapter we
discuss the most popular measure of risk: Value-at-Risk, abbreviated VaR.
9.1
Value-At-Risk
62
big. Related to this is that VaR is in general not subconvex under combination
of portfolios. If two subunits of a financial institution both control their VaR,
then there is no guarantee that the VaR of the financial institution as a whole
is also under control. This is illustrated in Example 9.2. Other measures of
risk, which are not open to these criticisms are the conditional variance (or
volatility) and the expected excess loss, given by
var(Vt Vt+t |Ft ),
and
Here c is some given threshold. The volatility is a classical measure, dating back
to Markowitz, who first proposed to optimize profit under the side condition of
bounded volatility. It is somewhat unstable for heavy-tailed distributions and
perhaps can be criticized for being symmetric.
Example 9.2 (Nonconvexity). Suppose that the portfolio consists of two parts,
with values Xt and Yt at time t, so that the total value is Vt = Xt + Yt . Then
Vt Vt+t = Xt + Yt for Xt and Yt the losses on the two subportfolios in
the interval [t, t + t], and the relative contributions of the two subportfolios in
the total are wX = Xt /Vt and wY = Yt /Vt . If VaR(V ), VaR(X), VaR(Y ) are
the Value-at-Risks of the three portfolios, then it may happen that the VaR(V )
is bigger than the convex combination wX VaR(X) + wY VaR(Y ).
For example, choose some fixed and let the vector (Xt , Yt ) be distributed (conditionally on Ft ) on the set of points {(0, 0), (c, 0), (0, c)} according
to the probabilities 12, , , for a given c > 0. Then VaR(X) = VaR(Y ) = 0,
but VaR(V ) = c. We can make the discrepancy c between the total Value-atRisk and the convex combination arbitrarily large.
To determine VaR we need a model for the conditional distribution of Vt
Vt+t given Ft . There are many possibilities, such as the ARMA and GARCH
models from time series, or the Black-Scholes or Hull-White models from derivative pricing. It is important to note that we need the distribution of the value
process under the real-world measure, not the martingale measure. Thus given
a model the parameters are estimated from time series giving the actual prices
of the assets over time, so-called historical analysis. Because some of the parameters may be common to the real-world measure and martingale measure,
some parameters could also be calibrated using observed derivative prices.
In rare cases, depending on the model, it is possible to determine an analytic
expression for the conditional distribution of Vt+t Vt given Ft . More often
the VaR must be calculated numerically, for instance by stochastic simulation.
If we can generate a large number N (e.g. at least N = 10000 if = 1%) of
realizations from the conditional distribution of Vt Vt+t given Ft , then VaR
is approximately the (1 )N largest value among the realizations. Remember
that, unlike when using simulation to determine a derivative price, this time we
must simulate from the real-world measure.
9.2
Normal Returns
63
grid, but that is even more unrealistic and unimportant for the following.) If the
conditional mean and variance are t and t2 , then the conditional distribution
of Vt Vt+t is normal with mean Vt t and standard deviation Vt t , and the
Value-at-Risk is given by
(9.3)
VaR = Vt t 1 (1 ) t .
Note that it is proportional to the current capital Vt and linearly increasing in
the volatility t . A positive drift t decreases VaR.
If a portfolio consists of n assets or subportfolios, with value processes
1
V 1 , . . . , V n , then it is often assumed that the vector of returns (Vt+t
/Vt1
n
n
1, . . . , Vt+t /Vt 1) is conditionally multivariate-normally
distributed given Ft .
Pn
The value of the whole portfolio Vt = i=1 Vti can be written as
Vt+t Vt = Vt
n
X
wti
V i
i=1
wti
t+t
Vti
1 ,
Vti /Vt
where
=
is the relative contribution of asset i to the whole portfolio.
The sum is a linear combination of a Gaussian vector and hence is normally
distributed. If the return vector possesses conditional mean vector (1t , . . . , nt )
and covariance matrix (ti,j ), then the conditional distribution of Vt Vt+t
given Ft is normal with mean Vt t and standard deviation Vt t , for
(9.4)
t =
n
X
wti it ,
i=1
(9.5)
t2 =
n X
n
X
i=1 j=1
The Value-at-Risk again takes the same form (9.3), but with the new values of
t and t substituted.
The Cauchy-Schwarz inequality says that the covariances satisfy |ti,j |
i j
t t , where ti = ti,i are the variances of the components. This shows that
t
n
X
wti ti .
i=1
Because the VaR is linear in the standard deviation, this shows that the combined portfolio has a smaller VaR than a similar single portfolio of the same
volatility. This expresses the well-known guideline that diversification of a portfolio helps to control the risk.
As Example 9.2 shows, diversification is not always useful to control VaR,
but the preceding shows that for portfolios with normal returns it is.
Empirical studies have now well established that economic time series are
not Gaussian random walks, an assumption of the past that still lives on in
many VaR-methods. Returns are not i.i.d. and their marginal distributions
deviate from normal distributions in that they are typically heavier tailed and
sometimes skewed. Conditional normality of the returns given the past, as is
assumed in this section, is also debatable, but not always rejected by statistical
tests on empirical data. In particular, GARCH models are often combined
with normal conditional distribution, which automatically leads to heavier-tailed
unconditional distributions.
64
9.3
Equity Portfolios
St+t
= t + (Wt+t Wt ).
St
1
(9.6)
VaR = Vt 1 e t ()+t .
This has similar features as the equation (9.3): the risk is proportional to the
current value Vt , increasing in the volatility and decreasing in the drift .
Because log x x 1 for x 1, the log return Rt is close to the ordinary
return St+t /St 1, if is small. If we make this approximation and still assume
the normal model forthe return, then we are in the situation of Section 9.2, with
t = t and t = t. The resulting formula (9.3) is identical to the formula
obtained by replacing the exponential in (9.6) by its linear approximation.
The value process of a combined portfolio consisting of it assets of price Sti
(i = 1, 2) is given by Vt = 1t St1 + 2t St2 . If we assume that the numbers ti do
not change in the interval [t, t + t], then the gain in this interval is given by
1
2
Vt+t Vt = 1t (St+t
St1 ) + 2t (St+t
St2 ).
To determine VaR we need a model for the conditional distribution of the vector
2
1
St2 ). There are many possibilities.
(St+t
St1 , St+t
A natural generalization of the Black-Scholes model would be to assume that
both asset price processes follow Black-Scholes models Sti = S0i exp(i t + i Wti ).
Here W 1 and W 2 are Brownian motions, of which it would be natural to assume
that they are also jointly Gaussian with some correlation. Then the joint returns
i
/Sti , will be bivariate Gaussian, and we can compute
(Rt1 , Rt2 ), for Rti = log St+t
the VaR in terms of the parameters i , i and the correlation of Rt1 and Rt2 , at
least by computer simulation.
If, as before, we simplify by assuming that the returns, and not the log
returns, are bivariate Gaussian, then we shall be in the situation of Section 9.2.
The VaR is then given by (9.3) with t and t given by (9.4), where wti =
it Sti /Vt .
Alternatively, we may apply more realistic, but more complicated models.
The conditional distribution of the loss will then typically be non-Gaussian, and
not analytically tractable, but the Value-at-Risk can often be obtained easily
by simulation.
Deriving VaR of portfolios of more than two stocks does not cause conceptual difficulties. However, making realistic models for the joint distribution of
many equities is difficult. Gaussian models are a possibility, but unrealistic.
Other standard models may include many parameters, that may be difficult to
estimate.
65
9.4
In the Black-Scholes model a European call option with strike K and expiry
time T has value
log(S /K) + (r + 2 /2)(T t)
t
Ct = St
T t
log(S /K) + (r 2 /2)(T t)
t
.
Ker(T t)
T t
The distribution of this random variable, or, more appropriately, the conditional
distribution of Ct+t given Ft , is not easily obtainable by analytic methods, but
it is easy to simulate, being an explicit function of the stock price St (or St+t )
and a number of nonrandom quantities. Thus the VaR of a portfolio consisting
of a European call option can be easily obtained numerically.
The value of some other European options on a given stock with price process St can also be written as an explicit function Ct = F (t, St ) of the stock
price. The Value-at-Risk can then also be obtained by one round of computer simulation. Given Ft the stock price St is known and hence the gain
Ct+t Ct = F (t + t, St+t ) F (t, St ) is stochastic only through St+t . We can
simulate the conditional distribution of the gain by simulating a sample from
the conditional distribution of St+t given Ft . In the Black-Scholes model we
have that
St+t = St et+(Wt+t Wt ) ,
and hence an appropriate simulation scheme
is to simulate a standard normal
t+ tZ
.
variable Z and next compute St e
Even though this seems easy enough, in practice one often uses approximations of the form
Ct+t Ct = F (t + t, St+t ) F (t, St )
Ft (t, St )t + Fs (t, St )(St+t St ) + 12 Fss (t, St )(St+t St )2 .
The three partial derivatives on the right side are exactly the Greeks , ,
and , already encountered in
Section 6.5. For small t, the increment St+t St
is typically of the order O( t) in probability, and hence the middle term on
the right dominates. If we neglect the other two terms, then we arrive in the
pleasant situation that the gain Ct+t Ct is a linear transformation St Rt of
the return Rt = St+t /St 1. If we also assume that this return is conditionally
normally distributed, then we are back in the situation of Section 9.2, and VaR
takes a familiar form.
Options for which there is no explicit pricing formula are more difficult to
incorporate. According to the general pricing theory, the value of an option
with payment C at time T in the Black-Scholes model is equal to
Ct = EQ er(T t) C|Ft .
For complicated claims C this could be determined numerically by simulation,
this time under the martingale measure Q. Combined with the preceding this
leads to a double (nested) simulation scheme, sometimes referred to as the full
monte. We shall consider the most complicated case, that of a claim that
depends on the full path (St : 0 t T ) of the stock price.
66
s W
t)
/2)(st)+(W
9.5
Bond Portfolios
Thus we can simulate the bond price at time t by simulating the short rate rt .
Under the martingale measure Q the short rate is the sum of a deterministic
function and an Ornstein-Uhlenbeck process. Unfortunately, to compute VaR
we need to simulate under the true-world measure P, which may add a random
drift to the Ornstein-Uhlenbeck process. This may destroy the Gaussianity
and other nice properties, which the short rate process possesses under the
martingale measure.
More specifically, in the Hull-White model the short rate satisfies formula
(8.14), which can be written in the form
at
rt = (t) + e
eas dWs ,
Rt
for the deterministic function (t) = eat r0 + eat 0 (s)eas ds. The function
in this expression can be found by calibration on option prices observed in
the market, and so can the parameters a and . However, the process W in
the preceding display is a Brownian motion under the martingale measure Q,
and not under P. In agreement with GirsanovsR theorem, under P the process
t
W is a Brownian motion with drift, and Wt 0 (s, rs ) ds for the market
price of risk is a P-Brownian motion. There is no way the market price of
risk can be calibrated from derivative prices alone, but it can be determined
by historical analysis. If (t, rt ) does not depend on rt , then the change of
drift only changes the deterministic function in the preceding display, and
we can use the simulation scheme for Ornstein-Uhlenbeck processes discussed
in Section 8.4 and Theorem 8.22 to generate rt , the bond prices, and hence the
VaR. If (t, rt ) is random, then the drift of the short rate process under P is
random, and we must fall back on the more complicated simulation schemes for
diffusion processes, such as the Euler scheme discussed in Section 8.4. Rather
than calibrate a and from observed derivative prices on the market, we may
then also choose to fit the diffusion model
drt = (t, rt ) dt + dWt ,
with W a P-Brownian motion, directly to historical data. Note that the volatility parameter is common to both the P and Q models and hence can both be
calibrated and estimated.
This discussion of the Hull-White model extends to any model where the
bond prices are simple functions Pt,T = F T (t, rt ) of the short rate, and more
generally to multi-factor models in which the bond prices Pt,T = F T (t, Xt ) can
be written as a simple function of multivariate diffusion process X.
In practice one often uses simpler approaches based on approximations and
the assumption that theP
yields are multivariate normal. Given a bond portfolio
n
with value process Vt = i=1 it Pt,Ti , where it is assumed constant in [t, t + t],
we first approximate
Vt+t Vt =
n
X
i=1
n
X
it (Pt+t,Ti Pt,Ti ) =
n
X
it e(Ti tt)Yt+t,Ti e(Ti t)Yt,Ti
i=1
h
it Pt,Ti (Ti t)(Yt+t,Ti Yt,Ti ) + tYt+t,Ti +
i=1
68
(Ti t)2
(Yt+t,Ti
2
i
Yt,Ti )2 .
VaR based on using both duration and convexity in the approximation can be
determined by simulation.
9.6
A swaption relative to the swap times T1 < T2 < < Tn pays the amount
+
Pn
PT0 ,Tn +K i=1 PT0 ,Ti 1 at time T0 < T1 . Depending on the term structure
model used, there may or may not be an explicit formula for the value of a
swaption at time t < t0 . If there is, then determining the VaR does not present
great difficulties.
In general, the value at time t < T0 can be evaluated as
n
R T0
X
+
Ct = EQ e t rs ds PT0 ,Tn + K
PT0 ,Ti 1 Ft .
i=1
In short rate models, such as the Hull-White model, the variable inside the
expectation can be written as a function of the short rate process (rs : 0
t Tn ). Given Ft the initial path (rs : 0 s t) is known and hence we
can evaluate the price Ct by computing an expectation under Q of a function
ht (rs : t s T ) of the future sample path, given its present state rt . For
instance, in the Hull-White model Ct is given by
n
h R T0
X
+ i
EQ e t rs ds eA(T0 ,Tn )B(T0 ,Tn )rT0 + K
eA(T0 ,Ti )B(T0 ,Ti )rT0 1 r0 .
i=1
{
SIMULATE (rs : 0 s t + t) ACCORDING TO P
FOR (j in 1:MANY)
{
GIVEN rti SIMULATE (rsj : t < s T0 ) ACCORDING TO Q
i
GIVEN rt+t
SIMULATE (rsj : t + t < s Tn ) ACCORDING TO Q
}
COMPUTE Cti AS AVERAGE ht (rsj : t < s T0 ) OVER j
i
COMPUTE Ct+t
AS AVERAGE ht+t (rsj : t + t < s T0 ) OVER j
}
i
VaR IS 1 LARGEST OF THE VALUES Cti Ct+t
.
Rather than using this double simulation scheme we may here also apply
approximations. However, in general it is not easy to compute partial derivatives
of the value process relative to e.g. the yield (duration and convexity), and
we may need to use numerical (i.e. discretized) derivatives instead.
9.7
Diversified Portfolios
70