Vasicek Model
Vasicek Model
DERVİŞ BAYAZIT
JUNE 2004
YIELD CURVE ESTIMATION AND PREDICTION
WITH VASIČEK MODEL
A THESIS SUBMITTED TO
THE GRADUATE SCHOOL OF APPLIED MATHEMATICS
OF
THE MIDDLE EAST TECHNICAL UNIVERSITY
BY
DERVİŞ BAYAZIT
JUNE 2004
Approval of the Graduate School of Applied Mathematics
I certify that this thesis satisfies all the requirements as a thesis for the degree of
Master of Science.
This is to certify that we have read this thesis and that in our opinion it is fully
adequate, in scope and quality, as a thesis for the degree of Master of Science.
Bayazıt, Derviş
M.Sc., Department of Financial Mathematics
Supervisor: Assoc. Prof. Dr. Azize Hayfavi
The scope of this study is to estimate the zero-coupon bond yield curve of
tomorrow by using Vasiček yield curve model with the zero-coupon bond yield
data of today. The raw data of this study is the yearly simple spot rates of the
Turkish zero-coupon bonds with different maturities of each day from July 1,
1999 to March 17, 2004. We completed the missing data by using Nelson-Siegel
yield curve model and we estimated tomorrow yield curve with the discretized
Vasiček yield curve model.
Keywords: One factor short rate models, Vasiček yield curve, Nelson-Siegel yield
curve, Monte-Carlo method
iii
Öz
Bayazıt, Derviş
Yüksek Lisans, Finansal Matematik Bölümü
Tez Yöneticisi: Doç. Dr. Azize Hayfavi
Anahtar Kelimeler: Tek faktörlü kısa dönem faiz haddi modelleri, Vasiček verim
eğrisi, Nelson-Siegel verim eğrisi, Monte-Carlo metodu
iv
To my family
v
Acknowledgments
I would like to take this opportunity to thank Assoc. Prof. Dr. Azize Hayfavi
for patiently guiding, encouraging and motivating me throughout this study. I
would also like to thank the members of Risk Group, Hayri Körezlioğlu, Azize
Hayfavi, Kasırga Yıldırak, and Yeliz Yolcu, who helped me in every step of this
study.
I also would like to express my special thanks to Kasırga Yıldırak for his ad-
vices and comments on the application part of this study.
I also would like to thank Burak Akan and Özkan Erdal who provided the
data used in this study.
I also would like to thank my university, Atılım University, for giving me en-
couragement, patience and support whenever I needed. I am also grateful to my
friend, Uygar Pekerten, for welcoming me to his house and giving me feedback
when I needed.
Last, but by no means least, to Nermin, who was always ready to offer advise,
encouragement and emotional support whenever it is needed.
vi
Table of Contents
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Öz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi
List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . x
Chapter
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
vii
2.11 Geometric Brownian Motion (GBM) Model . . . . . . . . . . . . 21
2.12 Marsh-Rosenfeld Model(1983) . . . . . . . . . . . . . . . . . . . . 22
4 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
4.1 Data Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
4.2 Yield Curve Fitting with Nelson-Siegel Model . . . . . . . . . . . 37
4.3 Yield Curve Estimation with Vasiček Model . . . . . . . . . . . . 44
5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
viii
List of Tables
4.1 MNO: The maximum number of observations and its day for the
given data set. MINO: The minimum number of observations and
its day for the given data set. ∗ : An example from several days. . 36
4.2 Estimated values of parameters for constraint set a. . . . . . . . . 47
4.3 Estimated values of parameters constraint set b. . . . . . . . . . . 51
4.4 Sum of Squared Errors-SSEs of Vasiček Yield Curve Estimations:
given for each initial i point tuple and folds. Nelson-Siegel SSEs is
5.01802205688122. . . . . . . . . . . . . . . . . . . . . . . . . . . 52
4.5 Sum of Squared Errors-SSEs of Vasiček Yield Curve Estimations:
given for each initial ii point tuple and folds. Nelson-Siegel SSEs
is 5.01802205688122. . . . . . . . . . . . . . . . . . . . . . . . . . 52
4.6 Sum of Squared Errors-SSEs of Vasiček Yield Curve Estimations:
given for each initial iii point tuple and folds. Nelson-Siegel SSEs
is 5.01802205688122. . . . . . . . . . . . . . . . . . . . . . . . . . 52
ix
List of Figures
x
4.19 Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple b-ii and discretization fold-2 . . . . . . . . . . 67
4.20 Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-iii and discretization fold-1 . . . . . . . . . 68
4.21 Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-iii and discretization fold-2 . . . . . . . . . 69
4.22 Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple b-iii and discretization fold-2 . . . . . . . . . 70
4.23 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple a-i
and discretization fold-1 . . . . . . . . . . . . . . . . . . . . . . . 71
4.24 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple a-i
and discretization fold-2 . . . . . . . . . . . . . . . . . . . . . . . 72
4.25 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple b-i
and discretization fold-2 . . . . . . . . . . . . . . . . . . . . . . . 73
4.26 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple a-ii
and discretization fold-1 . . . . . . . . . . . . . . . . . . . . . . . 74
4.27 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple a-ii
and discretization fold-2 . . . . . . . . . . . . . . . . . . . . . . . 75
4.28 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple b-ii
and discretization fold-2 . . . . . . . . . . . . . . . . . . . . . . . 76
4.29 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple a-
iii and discretization fold-1 . . . . . . . . . . . . . . . . . . . . . 77
4.30 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple a-
iii and discretization fold-2 . . . . . . . . . . . . . . . . . . . . . 78
xi
4.31 Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation
with the Vasiček Model by using constraint-initial point tuple b-
iii and discretization fold-2 . . . . . . . . . . . . . . . . . . . . . 79
xii
Chapter 1
Introduction
In our century the interest rate has an important place in all transactions
that include simply lending and borrowing. However, its importance has risen
with developing and slight stationary economic conditions of the world. The
desire to have foresight for the level of future interest rate has become crucial in
the sense that to know the lending and borrowing rate and not to missprice the
interest rate instruments. As a result, the modelling of interest rate has risen as
a problem. As solution to this problem there have been many models proposed.
Their common feature is that they all model a stochastic problem, i.e. they deal
with uncertainty. The first model in 1973 was proposed by Merton. However, the
pioneering one was suggested by Vasiček in 1977 and in the following years many
other models that are much more analytically untractable has come out. These
stochastic models are mainly classified with respect to the number of factors
which are assumed to have a stochastic evolution in the model.
In this study we will use Vasiček short-rate model among some of the most
fundamental one factor interest-rate models to predict the yield curve of tomorrow
by using today’s observed yield data. The models taken into account are presented
in Table 1.1. In the second chapter of this study we will present these one factor
short rate models with their solutions. In the following chapter, we will discuss
two different yield curve models: Nelson-Siegel and Vasiček models. The explicit
solution of the Vasiěk model will be presented in Section 3.2. In the last chapter,
the raw data of this study which is the yearly simple spot rates of the Turkish
zero-coupon bonds with different maturities of each day from July 1, 1999 to
1
Merton(1973) drt = α dt + σ dWt
Vasiěk(1977) drt = α(β − rt ) dt + σ dWt
Dothan(1978) drt = σ rt dWt0
Brennan-Schwartz(1980) drt = (β + α rt ) dt + σ rt dWt
√
Cox-Ingersoll-Ross(CIR)(1985) drt = α(β − rt )dt + σ rt dWt
Ho-Lee (1986) drt = θt dt + σ dWt
Exponential Vasiček(EV) drt = rt [ηt − a log rt ] dt + σ rt dWt
0
σ
Black-Derman-Toy(1990) d(log rt ) = [θt + σtt log rt ]dt + σt dWt
Hull-White- Extended Vasiček (1990) drt = (βt − αt rt ) dt + σt dWt
√
Hull-White Extended CIR(1990) drt = [βt − αt rt ]dt + σt rt dWt
Black-Karazinsky(1991) d(log rt ) = φt [log µt − log rt ]dt + σt dWt
Geometric Brownian Motion(GBM) drt = β rt dt + σ rt dWt
−(1−γ) γ/2
Marsh-Rosenfeld (1983) drt = [β rt + α rt ] dt + σ rt dWt
March 17, 2004 will be analyzed. Then we will fit a Nelson-Siegel curve to each
of the day in the data set we choose. At the end of the last chapter we will predict
the yield curve of March 17, 2004 by using Vasiček yield curve model in Monte
Carlo method. Before working on these models it is necessary to introduce main
mathematical notions used in these parts of this study.
We will begin with the following assumption.
Assumption 1.1. All the random variables and the stochastic processes are de-
fined on a given complete probability space (Ω, A, P).
2
(2) Stationary increments: ∀ s ≤ t, Wt − Ws and Wt−s − W0 have the same
probability law.
• W0 = 0,
• E(Wt ) = 0,
• E[Wt2 ] = t.
From now on,we will surly consider standard Brownian motion without especially
mentioning the word ”standard”.
• ∀ t ≥ 0, Wt is Ft -measurable.
3
The following notion is crucial for the characterization of arbitrage-free mar-
ket, option pricing and hedging.
Definition 1.9. A stochastic process (Mt )t≥ 0 adapted to (Ft )t≥ 0 is martingale if
for any t it is integrable, i.e. E (|Mt |) < +∞ and for any s ≤ t E (Mt |Fs ) = Ms
a.s.
Definition 1.11. A zero coupon bond(T -bond) with maturity date T is a con-
tract which guarantees the holder 1 unit of money to be paid on the date T . The
price at time t of a bond with maturity T is denoted by P (t, T ).
• For each fixed t, the bond price P (t, T ) is differentiable with respect to
maturity time T .
Note that written payment value, equal to 1 unit, on the contract is known
as the principal value or face value.
Suppose we are standing at time t, and let us fix two other points, S and T ,
with t < S < T . Now, let us write a contract at time t which will allow us to have
a deterministic rate of return on the interval [S, T ] determined at the contract
time t. This is achieved as follows [2].
1. At time t we sell one S-bond. This will give us P (t, S) unit of money.
P (t,S)
2. We buy with this income P (t,T )
T -bonds resulting a net investment zero at
time t.
4
3. At time S the S-bond matures, therefore we pay out one unit of money.
4. At time T the T -bonds mature at one unit of money a piece, thus we receive
P (t,S)
P (t,T )
units of money.
Definition 1.12. The following definitions are the implications of above con-
struction.
P (t, T ) − P (t, S)
L(t; S, T ) = − .
(T − S)P (t, T )
P (S, T ) − 1
L(S, T ) = − .
(T − S)P (S, T )
5
Remark 1.13. The spot rates are forward rates where the time of contracting
coincides with the start of the interval over which the interest rate is effective,i.e.
t = S.
The first hypothesis of this study is that all the work being done is in a filtered
probability space (Ω, A, P, (Ft )0≤ t≤ T ), where (Ft )0≤ t≤ T is the natural filtration
of a standard Brownian motion (Wt )0≤ t≤ T and that FT = A.
Rt
rs ds
Definition 1.14. e 0 is called as discount factor, where rt is an adapted
RT
process of short rate satisfying 0 |rt | dt < ∞, almost surely. rt , itself as being
short rate, provides a return equal to rt dt on the period (t, t + dt].
Rt
rs ds
P̃ (t, u) = e− 0 P (t, u)
is a martingale.
dP* = LT dP,
RT Rt
with density LT = exp( 0 qs dWs − 12 0 qs2 ds) a.s., where qs is an adapted pro-
RT
cess such that 0 qs2 ds < ∞ a.s..[1]
and when the discounting is eliminated, the following basic equation is obtained,
Ru
P (t, u) = E ∗ e− t rs ds |Ft , where E∗ denotes expectation under P*.
6
Chapter 2
In this chapter we will present some of the most famous one factor short-
rate models. The stochastic evolution of short-rate models is identified with the
following general stochastic differential equation;
drt = α dt + σ dWt ,
7
Z t Z t
rt = r 0 + α ds + σ dWs
0 0
= r0 + αt + σWt
=⇒ rt = ru + α(t − u) + σ(Wt − Wu )
where β, α and σ are non-negative constants and rt is the current level of interest
rate [1]. The parameter β is the long run normal interest rate. The model exhibits
mean reversion, which means that if the interest rate is above the long run mean
(r > β), then the drift becomes negative so that the rate will be pushed to be
closer to the level β on average. Likewise, if the rate is less than the long run
mean, (r < β), then the drift remain positive so that the rate will be pushed to
the level β. The coefficient α > 0 determines the speed of pushing the interest
rate towards its long run normal level. Such mean reversion assumption agrees
with the economic phenomenon that interest rates appear over time to be pulled
back to some long run average value.That is, when the interest rates increase, the
economy slows down, and there is less demand for loans and a natural tendency
for rates to fall([19], [14]). The opposite case can be argued in a similar way.
To obtain an explicit formulae for rt let us define a new process Xt where
Xt = rt − β. Therefore, Xt is the solution of the following stochastic differential
equation
dXt = −α Xt dt + σ dWt (2.2.3)
8
which implies that Xt is an Ornstein-Uhlenbeck process. To solve this process let
Yt = Xt eα t . By integration-by-parts formulae
Z t Z t
αs
Yt = x 0 + Xs d(e ) + eα s dXs + < X, eα. >t
0 0
αt αt
= a Xt e dt + e [−α Xt dt + σ dWt ]
= σ eα t dWt
Therefore, Z t
−α t −α t −α t
rt = r 0 e + β(1 − e )+σe eα s dWs
0
For u ≤ t,
Z t Z t
−α t −α(t−s)
rt = r 0 e + βα e ds + σ e−α(t−s) dWs
Z0 t 0
Z u
−α t −α(t−s)
= r0 e + βαds + βα e e−α(t−s) ds
u 0
Z t Z u
+σ e−α(t−s) dWs + σ e−a(t−s) dWs (2.2.4)
u 0
9
Also,
Z u Z u
−α u −α(u−s)
ru = r 0 e + βα e ds + σ e−α(u−s) dWs
0 0
That is,
Z u Z u
βα e −α(t−s)
ds + σ e−α(t−s) dWs = eα(u−t) (ru − r0 e−α u ) (2.2.5)
0 0
By using equation (2.2.6) we get the t-u expression of rt from the equation (2.2.4).
Z t Z t
−α(t−u) −α(t−s)
rt = r u e + αβ e ds + σ e−α(t−s) dWs (2.2.6)
u u
Vasiček model besides its advantages such as being analytically tractable, it has
several shortcomings. Since the short rate is normally distributed, for every t
there is a positive probability that r is negative and this is unreasonable from an
economic point of view. Because the nominal interest rate can not fall below zero
as long as people can hold cash; it can become stuck at zero for long periods,
however as when prices fall persistently and substantially.
Another drawback of the Vasiček model is that it assumes γ = 0. This
assumption implies the conditional volatility of changes in the interest rate to be
constant, independent on the level of r.
where r0 and σ are positive constants [9]. Then the following solutions are ob-
tained for r.
Z t
rt = r 0 + σ rs dWs .
0
10
1
and f (x) = − x12 . By Ito Lemma,
00
Let f (x) = log x. Then f 0 (x) = x
Z t Z
1 1 t 1 2 2
logrt = logr0 + rs σ dWs − r σ ds
0 rs 2 0 rs2 s
Z t Z
1 t 2
= logr0 + σ dWs − σ ds
0 2 0
1
= logr0 + σ Wt − σ 2 t
2
1 2
⇒ rt = r0 exp{− σ t + σWt }
2
For u ≤ t,
Z u Z
1 u 2
logru = logr0 + σ dWs − σ ds
0 2 0
Z t Z t Z Z
1 t 2 1 u 2
logrt = log r0 + σ dWs + σ dWs − σ ds − σ ds
u 0 2 u 2 0
Z t Z
1 t 2
= logru + σ dWs − σ ds
u 2 u
1
= logru + σ(Wt − Wu ) − σ 2 (t − u)
2
1 2
⇒ rt = ru exp{− σ (t − u) + σ(Wt − Wu )}
2
where α,β and σ are positive constants. Equation 2.4.8 is a linear nonhomoge-
neous stochastic differential equation in the form of
11
drt = [βt + αt rt ] dt + [γt + δt rt ] dWt
drt = α rt dt + σ rt dWt
Z t Z t
⇒ rt = r0 + α rs ds + σ rs dWs
0 0
Z t Z t Z t
1
log rt = log r0 + α ds + σ dWs − σ 2 ds
0 0 2 0
1 2
= log r0 + α t + σ Wt − σ t
2
1 2
rt = r0 |e(α− 2 σ{z) t+σ W}t .
φt
1
Now, let ηt = φt
and define ζt = ηt rt . Let f (x) = ex and define
1
Xt = −(α − σ 2 ) t − σ Wt
2
⇒ dXt = −(α − 12 σ 2 ) dt − σ dWt
By Ito Lemma,
12
1
dηt = ηt dXt + ηt σt2 dt
2
1 1
= ηt [−(α − σ 2 ) dt − σ dWt ] + ηt σ 2 dt
2 2
= ηt [(−α + σ 2 ) ds] − σηt dWt (2.4.9)
By integration-by-parts,
where ζ0 = r0 .Therefore, rt is
Z t
rt = φ t r0 + φ t β ηs ds
0
Z t
(α− 12 σ 2 ) t+σ Wt (α− 12 σ 2 ) t+σ Wt
= e r0 + e β ηs ds
0
Z t
(α− 12 σ 2 ) t+σ Wt 1 2
= e r0 + e(α− 2 σ ) (t−s)+σ (Wt −Ws ) β ds (2.4.11)
0
For u ≤ t,
Z t
(α− 21 σ 2 ) (t−u)+σ (Wt −Wu ) 1 2 ) (t−s)+σ (W
rt = e ru + e(α− 2 σ t −Ws )
β ds
u
13
2.5 Cox-Ingersoll-Ross (CIR) Model(1985)
An intriguing case of a non-linear stochastic differential equation can be de-
fined as
p
dXt = (θt Xt + γt )dt + υt Xt dWt , X0 = x0 . (2.5.12)
√
drt = α(β − rt )dt + σ rt dWt , r(0) = r0 (2.5.13)
where α, β, σ and r0 are positive constants. The condition 2αβ ≥ σ 2 provides the
positivity of r. It can be shown that equation (2.5.13) admits a unique solution
that is positive, but we do not have an explicit form for it [22].
Z t Z t
rt = r 0 + θs ds + σ dWs
0 0
Z t
= r0 + θs ds + σ Wt (2.6.15)
0
Z t
rt = r u + θs ds + σ[Wt − Wu ] (2.6.16)
u
14
2.7 Exponential Vasiček Model(EV)
EV is a lognormal model in which interest rate r satisfies the following stochas-
tic differential equation
where Θ, a and σ are positive constants and y0 is a real number [18]. Therefore,
to solve the 2.7.17 , it is convenient to obtain an Ornstein-Uhlenbeck process at
first hand. So, applying the Ito Lemma for f (x) = log x and putting yt = log rt ,
we get
t Z
1
log rt = log r0 + rs [η − a log rs ] ds + σ dWs
0 rs
Z
1 t −1 2 2
+ ( )σ rs ds
2 0 rs
Z t Z t
1 2
= log r0 + [(η − σ ) − a log rs ] ds + σ dWs
0 2 0
that is
Z t Z t
1
yt = y 0 + [(η − σ 2 ) − ays ] ds + σ dWs
0 2 0
we get,
1
dyt = [(η − σ 2 ) − ayt ] dt + σ dWt (2.7.18)
2
15
where Θ = η − 12 σ 2 . If we put β = Θ
a
, a = α and Xt = yt − β we get
dXt = −α Xt dt + σ dWt .
That is,
Z t
−α t −α t
yt − β = (y0 − β)e eα s dWs +σe
0
Z t
−α t −α t −α t
yt = y0 e + (1 − e )β + σ e eα s dWs
0
For u ≤ t,
Z t Z t
−α t −α(t−s)
yt = y 0 e + βα e ds + σ e−α(t−s) dWs (2.7.20)
0 0
Z t Z t
−α(t−u) −α(t−s)
yt = y u e + αβ e ds + σ e−α(t−s) dWs (2.7.21)
u u
That is,
Z t Z t
−a(t−u) −a(t−s)
rt = exp{log ru e +Θ e ds + σ e−a(t−s) dWs } (2.7.22)
u u
16
2.8 Black-Derman-Toy Model (1990)
In their article, Black, Derman and Toy proposed a discrete time approach of
interest rate modelling [3]. The continuous time equivalent of their model was
expressed as a stochastic differential equation which can be shown to be [10]
0
σ
d(log rt ) = [θt + t log rt ]dt + σt dWt . (2.8.23)
σt
In this model log rt is mean reverting. The function σt is chosen to make the
model consistent with the term structure of spot rate volatilities and may not
give reasonable values for the future short rate volatility. The model has the
disadvantage that bond prices can not be determined analytically.
For u ≤ t Z t
rt = ru exp{ θs ds + σ(Wt − Wu )}
u
17
where α, β, σ and γ are positive constants and W is the Wiener Process. In these
models, the interest rate r, is pulled toward to a long term interest rate level β
with rate α.
The condition of γ = 0 is considered by Vasiček. As we explained in section
2.2, besides being analytically tractable, this model has a fundamental drawback
that the short term interest rate, r, can become negative. On the other hand, CIR
considered an alternative where γ = 1/2 which results nonnegative r. However,
CIR case is not analytically tractable.
It is reasonable to anticipate that in some situations the market’s expectations
about future interest rates involve time dependent parameters. In other words,
the drift and diffusion terms can be defined as the functions of time as well as
being functions of r. The time dependence can arise from the cyclical nature
of the economy, expectations concerning the future impact of monetary policies,
and expected trends in other macroeconomic variables. In their article Hull and
White extend the model in (2.9.25) to reflect this time dependence. They add a
time dependent drift, θt to the process for r, and allow both the reversion rate,
α, and the volatility factor, σ, to be functions of time. This leads to the following
model for r:
dr = [θt + αt (β − rt )]dt + σt rtγ dWt (2.9.26)
18
where βt , αt and σt are time dependent deterministic functions [11]. For the
Rt
solution of this stochastic differential equation we define Kt = 0 αu du. That is,
Kt0 = αt . Then, we multiply both sides of the equation with eKt and take the Ito
differential of both sides with respect to t.
That is,
that is,
Z t Z t
−(Kt −Ks )
rt = e −Kt
r0 + e βs ds + e−(Kt −Ks ) σs dWs
0 0
To generalize this result for any u ≤ t we can make the following computations:
Z u Z u
−(Ku −Ks )
ru = e −Ku
r0 + e βs ds + e−(Ku −Ks ) σs dWs
Z 0t Z 0u
rt = e−Kt r0 + e−(Kt −Ks ) βs ds + e−(Kt −Ks ) βs ds
0 0
Z t Z u
+ e−(Ku −Ks ) σs dWs + e−(Kt −Ks ) σs dWs
u 0
Z t Z t
−Kt +Ks
rt = −Kt
e r0 + e βs ds + e−Kt +Ks σs dWs + e−Kt +Ku (ru − e−Ku r0 )
u u
Z t Z t
−(Kt −Ku ) −(Kt −Ks )
= e ru + e βs ds + e−(Kt −Ks ) σs dWs (2.9.30)
u u
19
2.9.2 Hull-White Extended CIR Model
For γ = 0.5 and βt = θt + βαt from (2.9.26), Hull and White in the same
article proposed [11]the extension of the CIR model based on the same idea of
considering time dependent coefficients of their Vasiček extension. The short rate
dynamics are then given by the following stochastic differential equation
√
drt = [βt − αt rt ]dt + σt rt dWt , (2.9.31)
where µt is the target rate, φt is the mean reversion and σt is the local volatility
in the expression for the local change in log rt [4]. They assumed these time
dependent deterministic functions to be inputs while they looked for the yield
curve, as the output of their model. While we are solving equation (2.10.32), for
the computational and traditional purposes, we will assume that αt = φt and
βt = φt log µt .
dlogrt = (βt − αt logrt ) dt + σt dWt (2.10.33)
20
and,
Kt0 = αt .
Let ζt = eKt Yt .
Z t Z t
Kt Ks
e Yt = Y 0 + e βs ds + eKs σt dWs
0 0
Z t Z t
−(Kt −Ks )
⇒ Yt −Kt
= e Y0 + e βs ds + e−(Kt −Ks ) σt dWs
0 0
For u ≤ t,
Z t Z t
−(Kt −Ku ) −(Kt −Ks )
Yt = e Yu + e βs ds + e−(Kt −Ks ) σt dWs
u u
21
Z t Z t
rt = r 0 + β rs ds + σ rs dWs (2.11.37)
0 0
The explicit solution satisfying above SDE can easily be derived by applying Ito
Lemma. Let f (x) = logx.
Z t Z
1 1 t 1
logrt = logr0 + drs − d < r, r >s
0 rs 2 0 rs2
Z t Z t Z
1 1 1 t 1 2 2
= logr0 + β rs ds + σ rs dWs − r σ ds
0 rs 0 rs 2 0 rs2 s
Z t Z t Z
1 t 2
= logr0 + β ds + σ dWs − σ ds
0 0 2 0
1
= logr0 + (β − σ 2 )t + σWt
2
1 2 )t+σW
⇒ rt = r0 e(β− 2 σ t
For u ≤ t,
1
logrt = logru + (β − σ 2 )(t − u) + σ(Wt − Wu )
2
1 2 )(t−u)+σ(W
⇒ rt = ru e(β− 2 σ t −Wu )
(2.11.38)
22
diffusion process as
−(1−γ) γ/2
drt = [β rt + α rt ] dt + σ rt dWt , (2.12.39)
β
drt = [ + α rt ] dt + σ dWt .
rt
where
y = r2−γ
0
a = 0.5σ(2 − γ)2
c = (2 − γ)(β + 0.5σ(1 − γ))
b = α(2 − γ) .
Marsh and Rosenfeld estimate the model on a time series of T-bill data using
maximum likelihood. They are unable to reach strong conclusions, but notice
that the likelihood is higher when γ = 2 than when γ takes lower values [20].
23
Chapter 3
There are many methods which have been used to model the zero-coupon
yield curve. We can put them into three categories; spline based models, function
based models, and lastly stochastic models. The idea of spline based models is
interpolating a spline function from the pre-known points. These points can
be given as a pair (time to maturity, yield of zero-coupon bond at that time to
maturity). The most famous models of this type are McCulloch and FNZ(Fischer,
Nychka and Zervos) [16].
The most popular function based models are Nelson-Siegel [23], and Svens-
son models [15]. In fact, Svensson model is an extension of Nelson-Siegel model.
In Nelson-Siegel model a relatively simple function is postulated for the instan-
taneous forward curve. Svensson extended this work by altering the functional
form of the instantaneous forward curve suggested by Nelson-Siegel. In this study,
however, we will be concentrated in Nelson-Siegel model, which is less complex
in the sense of number of parameters.
The stochastic modelling of zero-coupon yield curve depends on interest rate
modelling, especially on short-rate modelling. The idea is obtaining an explicit
pricing formula for the zero-coupon bond and then extracting the yield from that
formula by using the following fundamental equation:
where R(t, T ) is the yield of zero-coupon bond on the given period [t, T ]. In
the following sections of the chapter we will obtain the explicit zero-coupon bond
24
pricing and yield curve formulas for Vasiček short-rate model.
where θ = T − t. Since yield on the period [t, T ] is the average of sum of the rates
which are active on the period [t, t + dt], by this simple intuition, yield R(θ) can
be defined as the integral of 3.1.1. That is,
Z T
1
R(t, T ) = f (s, T )ds (3.1.2)
T −t t
Z T
1 −
(T −s) (T − s) − (T −s)
= β0 + β 1 e τ + β2 e τ ds
T −t t τ
T −t
[1 − e− τ ] T −t
= β0 + (β1 + β2 ) T −t
− β2 e− τ
θ
[1 − e− τ ] θ
= β0 + (β1 + β2 ) θ
− β2 e− τ
τ
Z θ hs i
1 s s
= β0 + β 1 e − τ + β 2 e− τ ds
θ 0 τ
Z θ
1
= f (s)ds
θ 0
= R(θ) (3.1.3)
Although it is easy to see the last equality, we showed it to be consistent for the
future notations of yield.
Now let us analyze the structure of the yield and forward curves. The limiting
25
value of R(θ) as θ approaches to infinity is β0 and as θ gets small values it is
(β0 + β1 ), which are necessarily the same as for the forward rate function since
R(θ) is just an averaging of f (θ). Now, let us separate the forward rate function
into three components: long-term, medium-term and lastly short-term. The
long-term component is identified by the asymptotic value, β0 , of the function.
θ θ
The medium-term is identified by the functional component τ
e− τ and for the
θ
designation of the short-term e− τ is used.
The long term component is a constant larger than zero. Thus, it can not take
zero value in the limit. The medium-term takes zero value at the starting point
zero. This indicates that it is not short-term and since it decays to zero in the
limit, therefore it is not long-term. As it is obvious that short-term curve has the
largest negative slope resulting the fastest decay among all.It takes monotonically
and asymptotically zero value.
In the model the contributions of these three components are given by β0 for
long-term, β1 for short-term and β2 for medium-term. Here, there are also some
other features of the parameters that we should consider. If β1 is negative the
forward curve will have a positive slope and vice versa. Also, if β2 , as being the
identifier of the magnitude and the direction of the hump, is positive, a hump
will occur at τ whereas, if it is negative, a U-shaped value will occur at τ . Thus,
we can conclude that parameter τ which is positive, specifies the position of
the hump or U-shape on the entire curve. As a result, Nelson and Siegel have
proposed that with appropriate choices of weights for these three components, it
is possible to generate a variety of yield curves based on forward rate curves with
monotonic and humped shapes [23].
26
Without repeating the solution for 3.2.4 that we followed in section 2.2 we will just
change the probability measure to risk neutral one because of quite crucial reason.
Unless we study with the risk neutral probability, it is not possible to make ex-
tinct the arbitrage possibilities in bond pricing and we know that the discounted
bond prices have martingale property only under the risk neutral probability,
which satisfies the non-arbitrage condition. With this aim, to change the mea-
sure we will use Girsanov Theorem. We assume a constant process q(t) = −λ,
Rt
with λ ∈ IR. Therefore, W̃t = Wt + 0 λ ds = Wt + λ t is a standard Brownian
motion under P*. Thus, dWt = dW̃t − λ dt and if we rewrite the equation 3.2.4
with respect to the new probability measure P ∗ we get the following result:
h i
dr(t) = α (β − r(t)) dt + σ dW̃t − λ dt
= (α β − α r(t) − λ σ) dt + σ dW̃t
λσ
= α β− − r(t) dt + σ dW̃t
α
λσ
where β ∗ = β − α
.
The main goal of this section is to obtain the Vasiček bond price formulae, there-
fore the yield curve. We will start with the fundamental equation of bond price,
27
with the short rate r given by Equation 3.2.5.
RT
rs ds
P (t, T ) = E ∗ e− t |Ft
RT ∗ ∗
= E ∗ e− t (rs −β +β ) ds |Ft
RT RT
β ∗ ds ∗
= e− t E ∗ e− t (rs −β ) ds |Ft
RT
−β ∗ (T −t) ∗ − Xs∗ ds
= e E e t |Ft (3.2.6)
by using the homogeneity of 3.2.7 and the Markov property of the solution Xtx =
Rt
x e−α t + σ e−α t 0 eα s dW̃s we can write
RT
Xs∗ ds
E ∗ e− t |Ft = F (T − t, Xt∗ )
= F (T − t, rt − β ∗ ) (3.2.8)
Rθ
x
where F is the function defined by F (θ, x) = E ∗ e− 0 Xs ds . Since (Xtx ) is
Gaussian with continuous paths, we can calculate F (θ, x), explicitly [1].
For the complete calculation of F (θ, x), consider the random variable Y =
Rθ
0
Xsx ds ∼ N (µ, σ 2 ) with the mean µ, and the variance σ 2 . The Laplace trans-
form [17] of Y can be defined therefore,
Z
−tY
LY (t) := E e = e−tY dFY (y)
R
28
Z Z
−tY 1 −(y−µ)2
e dFY (y) = √ e−ty e 2σ 2 dy
R 2πσ 2 R
Z
1 −2tyσ 2 −y 2 +2µy−µ2
= √ e 2σ 2 dy
2πσ 2 R
Z [
− y 2 −(2µ−2σ 2 t)y+µ2 ]
1
= √ e 2σ 2 dy
2πσ 2 R
Z [
− y 2 −(2µ−2σ 2 t)y+(µ−σ 2 t)2 −(µ−σ 2 t)2 +µ2 ]
1
= √ e 2σ 2 dy
2πσ 2 R
Z
1 −(y−(µ−σ 2 t))2 2 2 t + σ 4 t2 −µ2
+ µ −2µσ 2σ
= √ e 2σ2 2 dy
2πσ 2 R
Z
2 1 −(y−(µ−σ 2 t))2
−µt+ σ2 t2
= e √ e 2σ2
dy
R 2πσ 2
σ2 2
t
= e−µt+ 2 . (3.2.9)
−(y−(µ−σ 2 t))2
where √ 1 e 2σ2
is the density function of the random variable ∼ N (µ −
2πσ 2
σ 2 t, σ ). We need the case of t = 1.
2
Thus,
Rθ x σ2 ∗ 1
E ∗ e− 0 Xs ds = E ∗ e−Y = e−µ+ 2 = e−E (Y )+ 2 V ar(Y )
Rθ Rθ
= e−E ( Xsx ds)+ 21 V ar ( Xsx ds)
∗
0 0 (3.2.10)
29
We have
Z θ Z θ
E ∗
Xsx ds = E ∗ (Xsx ) ds
0 0
Z θ
= x e−αs ds
0
x
= (1 − e−α θ ) (3.2.11)
α
Therefore, when we take the expectation of Xtx we get zero for the stochastic
Rt
part,since 0 eα s dW̃s is a continuous Ft -martingale.
Z θ Z θ Z θ
V ar Xsx ds = Cov Xsx ds Xsx ds
0 0 0
For u = t = s Z Z Z Z
θ θ θ θ
∗ ∗ ∗
= E Xt dt Xu du − E Xt dt E Xu du
0 0 0 0
Z θ Z θ Z θ Z θ
= E ∗
(Xtx Xux ) du dt − E ∗
(Xtx ) dt E ∗ (Xux ) du
0 0 0 0
Z θ Z θ
= [E ∗ (Xtx Xux ) − E ∗ (Xtx ) E ∗ (Xux )] du dt
0 0
Z θ Z θ
= Cov (Xtx Xux ) du dt (3.2.12)
0 0
30
Cov (Xtx Xux ) = E ∗ [(Xtx − E (Xtx )) (Xux − E (Xux ))]
Z t Z u
∗ 2 −α(t+u) αs αs
= E σ e e dW̃s e dW̃s
0 0
Z t Z u
2 −α(t+u) ∗ αs αs
= σ e E e dW̃s e dW̃s
0 0
Z t∧ u
2 −α(t+u)
= σ e e2α s ds
0
2 −α(t+u) e2α(t∧ u) − 1
= σ e (3.2.13)
2α
R
θ
For the complete calculation of V ar 0 Xsx ds let us insert the result we found in
3.2.13 in 3.2.12. Let us define g(t, u) = Cov(Xtx , Xux ). We see that g is symmetric
with respect to line u = t on the region where the integral in 3.2.12 is taken.
Therefore it is enough to evaluate the above integral only for one of the regions
31
R
θ
to calculate V ar 0
Xsx ds . Let us chose the region {(t , u) : 0 < u < t}.
Z θ Z θ Z θ
V ar Xsx ds = Cov (Xtx Xux ) du dt
0 0 0
Z θ Z t
2 −α(t+u) e2α u − 1
= 2 σ e du dt
0 0 2α
Z θ Z t
σ 2 e−α t eα u − σ 2 e−α t e−α u
= 2 du dt
0 0 2α
Z θ t
σ 2 e−α t eα u σ 2 e−α t e−α u
= 2 + dt
0 2α2 2α2 0
Z θ
σ2 σ 2 −2α t σ2
= 2 2
+ 2e − 2 2e−α t dt
0 2α 2α 2α
Z θ
σ2
= 1 + e−2α t − 2e−α t dt
α2 0
θ
σ2 e−2α t 2e−α t
= t− +
α2 2α α 0
σ2 e−2α t 2e−α t 3
= θ− + −
α2 2α α 2α
θσ 2 σ 2 −2αθ −αθ
= − e − 4 e + 3
α2 2α3
θσ 2 σ2 h
−αθ 2 −αθ
i
= − 1 − e + 2 1 − e
α2 2α3
θσ 2 σ 2 −αθ
σ2 2
= 2
− 3
1 − e − 3
1 − e−αθ (3.2.14)
α α 2α
By replacing 3.2.11 and 3.2.14 into 3.2.10, we get F (θ, x) explicitly. Thus,
32
Rθ x
F (θ, x) = E ∗ e− 0 Xs ds
−α θ )+ θσ 2 − σ 2 2 2
x
= e α (1−e α2 α3
(1−e−αθ )− 2α
σ
3 (1−e
−αθ
) (3.2.15)
−α θ )+ θσ 2 − σ 2 2 2
P (θ) = e−β θ e α (1−e
∗ x
α2 α3
(1−e−αθ )− 2α
σ
3 (1−e
−αθ
)
∗ θ+ x (1−e−α θ )+ θσ 2 − σ 2 2 2
= e−β α α2 α3
(1−e−αθ )− 2α
σ
3 (1−e
−αθ
)
where R(θ, r) can be seen as the average interest rate for the period [t , T ], i.e.
R(θ, r) is the yield of the bond on the given period, and it can be calculated by
inverting the price formula.
33
θσ 2 σ2
σ2
2
−β ∗ θ + αx (1 − e−α θ ) + α2
− α3
1 − e−αθ − 2α3
1 − e−αθ
R(θ , r) = −
θ
σ2 θ
h σ2 σ2
i
−β ∗ θ + 2α2
+ 1 − e−α θ − αx − 2α3
− 4α3
1 − e−α θ
= −
θ
σ2 ∗1 −αθ
σ2 σ2 −αθ
= − −β + 2 − 1−e x+ 2 + 2 1−e
2α αθ 2α 4α
σ2 ∗1 −αθ
∗ σ2 σ2 −αθ
= − −β + 2 − 1−e (r − β ) + 2 + 2 1 − e
2α αθ 2α 4α
∗ σ2 1 ∗ σ2 −αθ
σ2 2
= β − 2− β − 2 −r 1−e − 2 1 − e−αθ
2α αθ 2α 4α
1 −αθ
σ2
−αθ 2
= R∞ − (R∞ − r) 1 − e − 2 1−e , (3.2.17)
αθ 4α
where
σ2
R∞ = lim R(θ, r) = β ∗ − .
θ→∞ 2α2
34
Chapter 4
Applications
In the first section of this chapter, we are going to present and explain the
data,representing the Turkish zero-coupon bond simple spot rate, that we use
in the following sections. In the second section, we will use Nelson-Siegel yield
curve model to fit the given data. To calibrate the Nelson-Siegel model we are
going to construct a sum of squared errors. Then, we are going to minimize this
function with appropriate constraints and an initial value. After the calibration,
the performance of Nelson-Siegel yield curve will be measured by the value of the
sum of squared errors. In the last section, we are going to use the Vasiček yield
curve to predict the March 17, 2004 yield curve by using the March 16, 2004 data.
35
MNO MINO MEAN
July 1, 1999-March 17, 2004 March 17, 2004(21) March 9, 2001(2) 10.9933
July 1, 1999-April 30, 2001 July 9, 1999∗ (12) March 9, 2001(2) 7.9542
May 1, 2001-March 17, 2004 March 17, 2004(21) Jul 1, 1999 ∗ (7) 12.8784
Table 4.1: MNO: The maximum number of observations and its day for the given
data set. MINO: The minimum number of observations and its day for the given
data set. ∗ : An example from several days.
· Including the most number of days satisfying the above three conditions.
Since the data is a raw data by means of the interest rate information that it
carries out, we worked on it to obtain an appropriate rate for the model based
usage. To express the data for our purpose we used interest rate definitions given
in definition set 1.12. Let us show our calculations, explicitly. First, we used
simple spot rate definition and then the definition of continuously compounded
36
spot rate.
P (S, T ) − 1
L(S, T ) = −
(T − S)P (S, T )
⇐⇒
1
P (S, T ) =
1 + L(S, T )(T − S)
⇐⇒
log P (S, T )
R(S, T ) = −
T −S
Since we are going to fit a yield curve, the data must be in the form of continuously
compounded spot rate. For a numerical example, let us calculate the price of the
zero coupon bond in May 1, 2001 having a yearly simple rate of return 0.7580 with
time to maturity 113 days. Thus, T − S = 113/365 and L(S, T ) = 0.7580. That
is, P (S, T ) = 0.8010 TL. This price coincide with the assumption P (t, t) = 1.
However, the exact price will be 80100 TL. We multiplied the result with 100000
which is the face value of the Turkish zero-coupon bond used in this study. To
evaluate R(S, T ) we use 0.8010. Therefore, R(S, T ) = 0.0018655. This gives a
continuously compounded rate of return of 186.55 TL over 100000 TL.
37
If we replace forward rate(in fact, instantaneous forward rate) with its definition
given in 1.12 we get
Z T
1 ∂ log P (t, s)
R(t, T ) = − ds
T −t t ∂s
1
= − (log P (t, T ) − log P (t, t))
T −t
log P (t, T )
= − (4.2.1)
T −t
Fitting implies calibration of the parameters of the given model. We will minimize
the sum of squared errors(SSEs) to calibrate the parameters of the Nelson-Siegel
model for a given day.
The Nelson-Siegel yield function obtained in Section 3.1 is as follows.
θ
[1 − e− τ ] θ
R(θ) = β0 + (β1 + β2 ) θ
− β2 e− τ
τ
38
β0 + β1 = Rshort ,
Note that in the initial sets first two initial values are default. However, the other
two initials are perturbed on the axis (0.9, 49.9). First, we perturbed 0.1 points in
all directions and we reached to (1, 50) and then we perturbed the axis with 0.01
points in all directions, but we remained on the same axis [24]. By using above
constraint and initial point sets we calibrated the Nelson-Siegel yield function
for all the days in our data set. That is we did 740x5 calibrations: For each of
the day in our data set we repeated the minimization procedure for each of the
constraint and initial point sets.
As a result, for each constraint-initial point set we estimated different param-
eters for a given day and we have chosen the parameters that minimize SSEs for
that day. In the Figures 4.1, 4.2, 4.3 and 4.4 we presented fits of yield curves for
some days. In most of the figures we resulted with a very fine fit. However, this
does not mean that all the yield curve fits for our data set is fine. For example,
in Figure 4.3 the days April 25 and April 26, 2002 fitted yield curves that are
S-shaped and humped, respectively, are not fine.1 Especially, the fit of April 25
is deficient which is mainly resulted from the difficulty of the data. The data
1
We are describing being fine empirically since we are going to take into account all of the
Nelson-Siegel fitted curves obtained from the above algorithm.
39
DAY: 01−May−2001 DAY: 02−May−2001
200 200
195 195
190 190
185 185
Yield
Yield
180 180
175 175
170 170
165 165
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
195 190
190 185
Yield
Yield
185 180
180 175
175 170
170 165
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
40
DAY: 12−Oct−2001 DAY: 15−Oct−2001
180 180
160 170
160
140
150
Yield
Yield
120
140
100
130
80 120
60 110
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
180 180
170 170
160 160
Yield
Yield
150 150
140 140
130 130
120 120
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
41
DAY: 24−Apr−2002 DAY: 25−Apr−2002
130 122
120 120
118
110
Yield
Yield
116
100
114
90 112
80 110
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
120 115
110
115
105
Yield
Yield
110
100
105
95
100 90
95 85
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
42
DAY: 14−Feb−2003 DAY: 17−Feb−2003
140 140
130 130
120 120
Yield
Yield
110 110
100 100
90 90
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
125 125
120 120
115 115
Yield
Yield
110 110
105 105
100 100
95 95
0 200 400 600 800 0 200 400 600 800
Time to Maturity Time to Maturity
43
with this structure, i.e. data having a U-shape and a hump, can be modelled by
using Svenson model which has two place holder parameters τ1 and τ2 , and two
shape identifier parameters β2 and β3 .
The solution of this SDE is going to be the adapted one to the solution of SDE
2.2.2 given under the objective probability in section 2.2 as:
Z t
−α t −α t −α t
rt = r 0 e ∗
+ β (1 − e )+σe eα s dW̃s ,
0
From the last solution we see that given ru , the possible values of rt are be-
ing normally distributed. The reason is that the distribution of dW̃s is normal
with mean zero and variance one. Therefore, the integral itself is being nor-
mally distributed and since the remaining part of the equation other than the
stochastic part is deterministic the current distribution of the stochastic part
does not change except its mean and variance. As a result, rt is normally
−α(t−u) ∗ −α(t−u)
distributed with mean µ r (u, t) = r u e + β 1 − e and variance
R 2 R
t t σ2
σr2 (u, t) = E ∗ σ u e−α(t−s) dW̃s = σ 2 u e−2 α (t−s) ds = 2α 1 − e−2 α (t−s) .
44
The discretization of rt can now be easily done [25]. That is,
where 0 = t0 < t1 < ... < tn are the time points and Zi ’s, i = 1...n are the
independent random variables with distribution N (0, 1). Equation 4.3.5 is the
exact discretization of the Vasiček short-rate process. However, a slightly simpler
Euler scheme of the short-rate process can be given as
p
r(ti+1 ) = r(ti ) + α (β ∗ − r(ti ))(ti+1 − ti ) + σ ti+1 − ti Zi+1 (4.3.6)
which entails some discretization error. In the following sections of the chapter
we will measure the performance of both equations while we are simulating the
Vasiček yield function.
In Section 3.2 we obtained the Vasiček yield function. We can rewrite it as
follows.
1 −αθ
σ2
−αθ 2
R(θ , rt ) = R∞ − (R∞ − rt ) 1 − e − 2 1−e ,
αθ 4α
σ2
where R∞ = limθ→∞ R(θ, r) = β ∗ − 2α2
. As R(θ , rt ) is a smooth function of
θ, it is a stochastic process of t and its stochastic evolution is determined by
Vasiček short-rate process. Therefore, it is possible to use R(θ , rt ) for the future
estimations of yield curve.
The discretization of R(θ , rt ) begins with the following stochastic differential
equation.
1 − e−αθ
dR(θ , rt ) = drt (4.3.7)
αθ
and we write
1 − e−αθ
R(θ , rti+1 ) − R(θ , rti ) = rti+1 − rti (4.3.8)
αθ
45
The following part of the discretization procedure has two folds:
p
α (β ∗ − rti )(ti+1 − ti ) + σ ti+1 − ti Zi+1
Assume we are in March 16, 2004. We will predict the yield curve of March
17, 2004 with the Vasiček yield curve model by using the information of and
up to March 16, 2004. To make the prediction it is needed to determine the
parameters of R(θ , rt ) of March 16, 2004. For this purpose we will fit R(θ , rt ) to
the yield data of that day by using the same manner as we followed in Section
4.2 with a slight difference. In Section 4.2 we exactly fit the observed data, but
now we will fit the data that is generated by Nelson-Siegel model for that day.
Why we use this data has the reason that we will have a finer fit since the data
is increased from 20 to 800 and to decrease the optimization difficulty of Vasiček
model for small number of data. For the minimization procedure we proposed a
set of parameters.
This set is formed by P = [α, R∞ , σ, r]. For this parameter set we will use
two sets of constraints and for each sets of constraints we will use three differ-
ent sets of initial points. After obtaining the estimated values of parameters
P̂ = [α̂, R̂∞ , σ̂, r̂] for each of the constraint-initial point tuple, we will use this
estimated parameters to predict the yield curve of March 17, 2004 by using the
Vasiček yield curve process.
Before all, let us introduce some notations used to express these constraint-
initial point tuples.
46
α R∞
a-i 3121926696.79853 8.89897355073613e-017
a-ii 3074582904.78327 57.5663641192539
a-iii 3122613471.94644 57.5663640805259
σ r
a-i 1.02702656015236e-015 -8.67535580071379e-026
a-ii 1.72842316498964e-014 1.66798115571358e-007
a-iii 1.75930947185959e-014 0
• r short; the time series of yield of bonds with time to maturity with 1 day.
We gathered this series in two steps:
1) We fitted the Nelson-Siegel model to each daily data of our data set.
2) We collected the yield of bond with time to maturity 1 day for each day
of data set.
• yield ns(#); the yield of bond with time to maturity # days and obtained
from the Nelson-Siegel model for the day March 16, 2004.
• µ(yield ns); the mean of the bond yield data obtained from the Nelson-
Siegel model for the day March 16, 2004.
• yield observed long; observed yield value of the bond with the longest time
to maturity in March 16, 2004.
• yield observed short; observed yield value of the bond with the shortest
time to maturity in March 16, 2004.
47
DAY: 16−Mar−2004
70
60
40
Yield
30
20
0
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.5: Yield Curve Fitting with the Vasiček and Nelson-Siegel Models by
using constraint-initial point tuple set a-i.
(a) 0 < α ; natural condition coming from the hypothesis of Vasiček short rate
model,
0 < R∞ − r ;We expect long-run bond yield to be larger than short rate.
0 < σ ;Positivity of σ comes from the assumptions of Vasiček model.
0 < r < yield ns(1) ; it is logical to put this constraint in the sense that r
is a short rate.
(i) Pinitial = [0.2, µ(yield ns), σ(r short), yield ns(1)]. The result for March
16, 2004 is given in Figure 4.5.
(ii) Pinitial = [0.2, β0 , σ(yield ns), β0 + β1 ]. The result for March 16, 2004
is given in Figure 4.6.
(iii) Pinitial = [0.2, yield observed long, σ(r short), yield observed short].
The result for March 16, 2004 is given in Figure 4.7
48
DAY: 16−Mar−2004
62
60
58
56
Vasicek Yield Curve
Yield
54
52
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.6: Yield Curve Fitting with the Vasiček Model by using constraint-initial
point tuple set a-ii
49
DAY: 16−Mar−2004
62
60
58
56
54
52
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.7: Yield Curve Fitting with the Vasiček Model by using constraint-initial
point tuple set a-iii
50
α R∞
b-i 0.12656452867452 58.0289089496322
b-ii 0.126565260211965 58.0289077616854
b-iii 0.126565348814999 58.0289100190384
σ r
b-i 3.9547173954881e-007 48.9046368960016
b-ii 2.91328054828062e-006 48.904580334028
b-iii -1.34651426739607e-005 48.9045822601353
Vasiček Yield Curve is constant in the Figures 4.5, 4.6, 4.7. The reason
of this situation is that for each of the tuples a-i, a-ii, a-iii the estimated
values of σ and r considerably small, and the estimated values of σ is
considerably large. The Table 4.2 summarizes the estimated parameter
values for constraint-initial point tuples, a-i, a-ii, a-iii.
(b) 0.01 < α < 1 ; We give an upper boundary 1 to α since α is the mean
reversion rate.
0.01 < R∞ − r ;
0.01 < σ ;
0.01 < r < yield ns(1).
For this constraint set we also take the below boundary of the conditions
0.01 units perturbed, which provides a different initial direction vector for
the minimization procedure.
(i) Pinitial = [0.2, µ(yield ns), σ(r short), yield ns(1)]. The result for March
16, 2004 is given in Figure 4.8.
(ii) Pinitial = [0.2, β0 , σ(yield ns), β0 + β1 ]. The result for March 16, 2004
is given in Figure 4.9.
(iii) Pinitial = [0.2, yield observed long, σ(r short), yield observed short].
The result for March 16, 2004 is given in Figure 4.10.
51
a-i b-i
fold1 14.385093067632 17.6622167220191
fold2 14.385093067632 17.4400854139305
Table 4.4: Sum of Squared Errors-SSEs of Vasiček Yield Curve Estimations: given
for each initial i point tuple and folds. Nelson-Siegel SSEs is 5.01802205688122.
a-ii b-ii
fold1 50.7273031357703 17.6622419124489
fold2 14.3850930738333 17.4401076514605
Table 4.5: Sum of Squared Errors-SSEs of Vasiček Yield Curve Estimations: given
for each initial ii point tuple and folds. Nelson-Siegel SSEs is 5.01802205688122.
1 − e−αθ h p i
R(θ , rti+1 ) = R(θ , rti ) + α (β ∗ − rti )(ti+1 − ti ) + σ ti+1 − ti Zi+1
αθ
(4.3.9)
a-iii b-iii
fold1 50.7273032551583 17.662240917152
fold2 14.3850930737449 17.4401091872071
Table 4.6: Sum of Squared Errors-SSEs of Vasiček Yield Curve Estimations: given
for each initial iii point tuple and folds. Nelson-Siegel SSEs is 5.01802205688122.
52
DAY: 16−Mar−2004
62
58
56
Yield
52
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.8: Yield Curve Fitting with the Vasiček Model by using constraint-initial
point tuple b-i
53
DAY: 16−Mar−2004
62
58
56
Nelso−Siegel Yield Curve
Yield
54
52
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.9: Yield Curve Fitting with the Vasiček Model by using constraint-initial
point tuple b-ii
54
DAY: 16−Mar−2004
62
60
58
56
Vasicek Yield Curve
Yield
54
Nelson−Siegel Yield Curve
52
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.10: Yield Curve Fitting with the Vasiček Model by using constraint-
initial point tuple b-iii
55
1 − e−αθ
R(θ , rti+1 ) = R(θ , rti ) + ∗
" αθ #
r
−α(ti+1 −ti )
∗ 1
1−e (β − rti ) + σ (1 − e−2α(ti+1 −ti ) )Zi+1
2α
(4.3.10)
When we compare the Figures 4.5 and 4.8, we see a non-zero Vasiček yield
curve fitted to the data of March 16, 2004. The reason of this improve is better
estimated parameter values. While we are estimating the parameter values with
b-i, the constraint 0.01 < α < 1 gives a better estimation(Table 4.3). As the
estimated value of α decreases the effect of exponential terms increase and the
1
effect of αθ
decreases in amount in R(θ, r). As the effect of exponential term
increases we obtain a hump in the figure. Also, we obtained the difference R∞ − r
getting faraway from 0 in positive way with b-i. The Figures 4.9 and 4.10 can
be explained in a way similar to 4.8.
Moreover, when we compare the Figures 4.6 and 4.5 obtained from a-ii and
a-i, respectively, we see that the Vasiček yield curve in Figure 4.6 is a better
fit for March 16, 2004 in the sense that it is not zero. But, again, it is linear
with an approximately zero slope. When we look at the Table 4.2 we see that
this is resulted from the fact that the large value of α decreases the effect of
exponential form in R(θ, r). In the graph, we observe that initial value is al-
most equal to its asymptotic value R∞ . Although this is theoretically not true
(since limθ→ 0 R(θ, r) = r), it is possible in machine size calculation: That value
on the graph is calculated by taking θ = 1. When this equality holds, the asymp-
totic value of R(θ, r) is R∞ and this result is consistent with the zero slope value.
A similar result to a-ii can be obtained for the Vasiček yield curve fit of a-iii.
The Figures 4.14 and 4.15 represent the Vasiček yield curve estimations for a-i
fold1 and a-i fold2. Although both of the graphs are obtained from the same
constraint- initial point a-i, the folds, .i.e. discretization methods of R(θ, r t ),
are different. However, the sums of squared errors for each fold are same as it is
presented in Table 4.4. This result is possible for very high values of α (see Table
4.2). When we examine Figure 4.16 we see that it is a quite similar one to the
56
graph presented in Figure 4.14 and 4.15.2 However, the sum of squared errors
of the estimation with b-i fold2 is 17.4400854139305 which is 3.0549923462985
larger than b-i of both folds.
The Figures 4.17 and 4.18 represents the yield curve estimation of March
17, 2004 with a-ii fold1 and a-ii fold2, respectively. Figure 4.17 represents an
estimated yield curve which has a better fit for θ nearly larger than 108. However,
its performance is not good for the short period. It overestimates the yield of
bonds having short-term maturity. This results from the estimated parameter
set of a-ii and discretization method fold1. For the small values of θ ≥ 1, a-ii
fold1 gives a result such that
This result is not surprising since α is quite large(see Table 4.2). On the other
hand in Figure 4.18, i.e. for a-ii fold2 this approximation can not be seen
1−eα θ
because of the term αθ
. This term decreases the value for small θ ≥ 1 with the
help of the negative draws. The performance of the estimation with a-ii fold1 is
quite below the performance of the estimation with a-ii fold2 by means of SSEs.
This huge difference is represented in Table 4.5. Also, for not to be confused by
the Figures 4.17 and 4.18, we measured the performance of the estimation of both
a-ii fold1 and a-ii fold2 for θ ≥ 112 by means of SSEs. We observed that the
estimation with a-ii fold2 is again better than the estimation with a-ii fold1 in
the long-run. The measured value for fold2 is 1.87909334435726 and for fold1,
it is 2.84815794190374.
The collected information of performance of all the tuples is given in Tables
4.4, 4.5 and 4.6. We can conclude that constraint set b is much more consistent
with the all three initial points and it gives an estimated yield curve similar to
Nelson-Siegel fitted yield curve. However, when tomorrow comes out we see that
Nelson-Sigel gives a better fit of the day with 5.01802205688122.
The most important problem here is the distributions of data occurred in both
March 16, 2004 and March 17, 2004. If the both distributions are Gaussian, we
2
The figures of constraint set b with fold1 are not given since they are very similar to the
figures obtained from the constraint set b with fold2.This similarity can also be seen in Tables
4.4, 4.5, and 4.6.
57
expect a very well fit. In our case both of our data are not Gaussian distributed.
In Figures 4.11 and 4.12 the quantile-quantile plots of March 16 and March 17
are given. As it can be seen from these plots that the distributions of each of
the days are not Gaussian. However, for each of the days there is are parts of
the data drawn from a distribution closed to a Gaussian one. If the estimation
of yield curve for March 16 is done with this part we obtain a better fit in the
prediction of the yield curve of March 17. We will show this result for March 16 3 .
Let us exclude the last three yield data of bonds with the longest maturities from
the whole data set of March 16, 2004 yield data. That is we exclude the yields
of bonds with maturities 372, 407 and 526 days since these data points are the
outliers of the data. When we apply Kolmogorov-Simirnov test to the remaining
part of the data we can not reject that the data is Gaussian distributed. In the
next step, we repeated all the procedures that we have followed before(estimation
and prediction procedures). After obtaining the estimated parameters of Vasiček
model we predict the yield curve of March 17 again with the Vasiček yield curve
model. As a result, we improved our prediction by means of fitting the observed
data of March 17, i.e. we decreased the SSEs approximately from 17.4401 to
16.3376(see also Figure 4.13).
3
We repeated this procedure just for constraint-initial point tuple b-ii and for fold2 as an
example of above argument.
58
4
1
16−Mar−2004 Quantiles
−1
−2
−3
−4
−5
−3 −2 −1 0 1 2 3
X Quantiles
Figure 4.11: Quantile-quantile Plot of X and March 16, 2004 yield data.
59
4
2
17−Mar−2004 Quantiles
−1
−2
−3
−4
−3 −2 −1 0 1 2 3
X Quantiles
Figure 4.12: Quantile-quantile Plot of X and March 17, 2004 yield data.
60
62
60
58
56
54
52
50
0 100 200 300 400 500 600 700 800
Figure 4.13: Constraint initial point tuple b-ii and discretization fold2.
61
DAY: 17−Mar−2004
62
58
Yield
56
54
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.14: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-i and discretization fold-1
62
DAY: 17−Mar−2004
62
58
Yield
56
54
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.15: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-i and discretization fold-2
63
DAY: 17−Mar−2004
62
60
58
Yield
56
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.16: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple b-i and discretization fold-2
64
DAY: 17−Mar−2004
110
100
90
Yield
80
70
Vasicek Yield Curve
60
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.17: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-ii and discretization fold-1
65
DAY: 17−Mar−2004
62
60
58
Yield
56
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.18: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-ii and discretization fold-2
66
DAY: 17−Mar−2004
62
60
58
Yield
56
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.19: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple b-ii and discretization fold-2
67
DAY: 17−Mar−2004
110
100
90
Yield
80
60
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.20: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-iii and discretization fold-1
68
DAY: 17−Mar−2004
62
60
58
Yield
56
54
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.21: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple a-iii and discretization fold-2
69
DAY: 17−Mar−2004
62
60
58
Yield
56
52
50
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.22: Yield Curve Estimation with the Vasiček Model by using constraint-
initial point tuple b-iii and discretization fold-2
70
DAY: 17−Mar−2004
62
58
56
Yield
54
Vasicek Yield Curve
52
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.23: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple a-i and discretization
fold-1
71
DAY: 17−Mar−2004
62
58
56
Yield
54
52
Vasicek Yield Curve
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.24: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple a-i and discretization
fold-2
72
DAY: 17−Mar−2004
62
58
56
Yield
54
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.25: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple b-i and discretization
fold-2
73
DAY: 17−Mar−2004
110
100
90
80
Yield
60
50
Vasicek Yield Curve
40
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.26: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple a-ii and discretization
fold-1
74
DAY: 17−Mar−2004
62
60
Nelson−Siegel Yield Curve
58
56
Yield
54
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.27: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple a-ii and discretization
fold-2
75
DAY: 17−Mar−2004
62
58
56
Yield
54
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.28: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple b-ii and discretization
fold-2
76
DAY: 17−Mar−2004
110
100
90
80
Yield
60
50
40
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.29: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple a-iii and discretization
fold-1
77
DAY: 17−Mar−2004
62
58
56
Yield
54
Vasicek Yiel Curve
52
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.30: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple a-iii and discretization
fold-2
78
DAY: 17−Mar−2004
62
58
56
Yield
54
50
48
46
0 100 200 300 400 500 600 700 800
Time to Maturity
Figure 4.31: Nelson-Siegel Yield Curve Fitting, and Yield Curve Estimation with
the Vasiček Model by using constraint-initial point tuple b-iii and discretization
fold-2
79
Chapter 5
Conclusion
In this study, we presented some of the most fundamental one factor interest
rate models with their solutions. Among them, Vasiček model is used to reach the
main scope of this study: prediction of the yield curve of tomorrow. Nelson-Siegel
model which is a function base yield curve model is used to complete the missing
data which is Turkish zero coupon bond simple spot rate gathered for several
maturities from May 1, 2001 to March 17, 2004. We discretized Vasiček short rate
model, that is used in Monte Carlo method to predict the yield curve of tomorrow.
We used two different discretization methods: discretization of 4.3.3(fold1) and
discretization of 4.3.4(fold2). It is seen that discretization method fold2 results
with a better prediction of yield curve of March 17 by means of smaller SSEs. In
the last section, it is also shown that if the distribution of the data(today’s data)
used to predict the yield curve of tomorrow is Gaussian, then we obtain a better
prediction.
Although this exploratory method followed in this study can be extended to
more elaborated methods, we conclude that by taking the Nelson-Siegel model
as a benchmark we measured the performance of Vasiček model as a predictor
and we found a considerable difference. The performance might be improved by
using different constraint initial point tuples and by developing other methods
in the optimization of SSEs. We also conclude that all the methods followed in
this study are applicable to affine models in which the solution of bond price is
explicitly obtained.
80
References
[4] F. Black, P. Karasinski Bond and Option Pricing when Short Rates are Log-
normal., Financial Analysist Journal 47, 52-59 (1991).
[6] J.C. Cox, Notes on Option Pricing I:Constant Elasticity of Variance Diffu-
sion, Working paper Standford Uni., (1975)
[7] J.C. Cox, S.Ross, The Valuation of Options for Alternative stochastic Pro-
cesses., Journal of Financial Economics 3, 145-146, (1976).
[8] J.C. Cox,J.E. Ingersoll, S.Ross, A Theory of the Term Structure of Interest
Rates. , Econometria 53, 385-407, (1985).
[9] L.U. Dothan, On the Term Structure of Interest Rates. Journal Of Financial
Economics 6, 59-69, (1978).
[10] J. Hull, A. White, Pricing Interest Rate Derivative Securities, The Rewiev
of Financial Studies 3, 573-592, (1990).
[11] J. Hull, A. White, One-Factor Interest Rate Models and the Valuation of
Interest Rate Derivative Securities., Journal of Financial and Quantative
Analysis 28, 235-254, (1993)
81
[12] F. Jamishidian, An Exact Bond Option Pricing Formula., The Journal of
Finance, 44, 205-209, (1989).
[13] T.A. Marsh, E.R. Rosenfeld Stochastic Proceses for Interest Rates and Equi-
librium Bond Prices, The Journal Of Finace 38, no.2, 635-646, (1983).
[14] D.M. Chance, Teaching Note 97-03:The Vasiček Term Structure Model,
c:/Classes/Teaching Notes/TN97-03.doc, (2003)
[15] D. Bolder, D. Streliski, Yield Curve Modelling at the Bank of Canada, Tech-
nical Report 84, (1999).
[18] D. Brigo, F. Mercurio, Interest Rate Models Theory and Practice, Springer,
(2001).
[23] C.R. Nelson, A.F. Siegel, Parsimonious Modelling of Yield Curves., Journal
of Business 60, (1987).
[24] J.H. Mathews, K.D. Fink, Numerical Methods Using Matlab.,Prentice Hall,
Upper Sadle River, (1999).
82