Nonlife Actuarial Models: Applications of Monte Carlo Methods
Nonlife Actuarial Models: Applications of Monte Carlo Methods
Chapter 15
Applications of Monte Carlo Methods
Learning Objectives
1. Monte Carlo estimation of critical values and p-values
2. Bootstrap estimation of p-values
3. Bootstrap estimation of bias and mean squared error.
4. Simulation of lognormally distributed asset prices
5. Simulation of asset prices with discrete jumps
1. Generate a random sample of n (call this the estimation sample size) standard normal variates x1 , , xn . Calculate the sample
mean x and sample variance s2 , and use these values to compute
the estimated distribution function F (xi ), where F () is the df of
N (
x, s2 ). Then use equation (13.4) to compute D.
2. Repeat Step 1 m times (call this the Monte Carlo sample size)
to obtain m values of Dj , for j = 1, , m.
3. At the level of significance , the critical value of the KolmogorovSmirnov D statistic is computed as the (1)-quantile of the sample
of m values of D, estimated using the method in equations (11.9)
and (11.10).
The following critical values are proposed by Lilliefors (1967) for
testing normal distributions with unknown mean and variance
5
Level of significance
0.10
0.05
0.01
Critical value
0.805
0.886
1.031
.
n
3. At the level of significance , the critical value of the KolmogorovSmirnov D statistic is computed as the (1)-quantile of the sample
of m values of D, estimated using the method in equations (11.9)
and (11.10).
The following critical values are proposed by Lilliefors (1969) for
testing exponential distributions with unknown mean
Level of significance
0.10
0.05
0.01
Critical value
0.96
1.06
1.25
.
n
This result holds asymptotically for any null distribution. Yet Monte
Carlo simulation can be used to investigate the performance of the
test and improve the estimates of the critical values in small samples
if required.
Example 15.3: Estimate the critical values of the chi-square goodnessof-fit statistic X 2 using Monte Carlo simulation when the null hypothesis
is that the observations are distributed as E(), where is unknown.
Compute the X 2 statistics based on the MLE using individual observations
as well as the MMLE using grouped data.
Solution:
We group the data into intervals (ci1 , ci ], and use the following 4 intervals: (0, 0.4], (0.4, 1], (1, 1.5] and (1.5, ). The MLE of
using the complete individual data is 1/
x. Let n = {n1 , , n4 }, where
ni is the number of observations in the ith interval. Using grouped data,
8
4
X
i=1
0.10
0.05
0.01
n = 50
MLE
MMLE
4.95
4.70
6.31
6.07
9.45
9.25
n = 100
MLE
MMLE
4.93
4.70
6.30
6.07
9.48
9.39
n = 200
MLE
MMLE
4.91
4.61
6.38
6.05
9.60
9.37
n = 300
MLE
MMLE
4.91
4.66
6.30
6.04
9.41
9.14
22,1
4.61
5.99
9.21
The asymptotic critical values 22,1 are shown in the last column. Two
points can be observed from the Monte Carlo results. First, the asymptotic
9
results are very reliable even for samples of size 50, if the correct MMLE
is used to compute X 2 . Second, if MLE is used to compute X 2 , the use
of 22,1 as the critical value will over-reject the null hypothesis.
2
10
(15.1)
2. Generate a sample of observations from the distributional assumption of H0 (), call this x . Compute the test statistic using data x
and call this t .
3. Repeat Step 2 m times, which is the bootstrap sample size, to obtain
m values of the test statistic tj , for j = 1, , m.
4. The estimated p-value of t is computed as
1 + number of {tj t}
.
m+1
(15.3)
= P20
,
i=1 log(xi + 5) 20 log(5)
14
and we obtain
= 2.7447. The computed D statistic is 0.1424.
To estimate the p-value, we generate 10,000 bootstrap samples of size 20
each from P(2.7447, 5), estimate and compute the D statistic for each
sample. The proportion of the D values larger than 0.1424 calculated
using equation (15.3) is 0.5775, which is the estimated p-value. Thus,
the P(, 5) assumption cannot be rejected at any conventional level of
significance.
For (b), the MLE of is
20
= P20
= 15.8233.
log(x
+
40)
20
log(40)
i
i=1
significance of 10%, the null hypothesis P(, 40) is rejected, but not at
the level of significance of 5%.
For (c), the MLE of is
1
= = 0.3665,
x
and the computed D value is 0.2307. We generate 10,000 samples of size
20 each from the E(0.3665) distribution using the inversion method. The
estimated p-value of the D statistic is 0.0603. Thus, the assumption of
E() is rejected at the 10% level, but not at the 5% level.
To conclude, the Kolmogorov-Smirnov test supports the P(, 5) distribution assumption for the loss data, but not the P(, 40) and E() distributions.
2
16
we may use the empirical distribution define by x as the assumed distribution. We generate a sample of n observations x = (x1 , , xn )
by re-sampling from x with replacement, and compute the estimate
(or g( )) based on x .
19
by
m
1 X
(
xj E )
m j=1
and
m
1 X
(
xj E )2 .
m j=1
(15.4)
4. The bias and the mean squared error of s2 are estimated, respectively, by
m
1 X
2
(s2
E)
m j=1 j
and
m
1 X
2 2
(s2
E) .
m j=1 j
(15.5)
j
E , so that the bootstrap estimate of the biases should
converge to zero when m is large.
20
(15.6)
21
(X; F ) = (X) (F ),
(15.8)
(15.9)
(15.10)
For another application, let T (X) be a test statistic for a hypothesis H0 and its value computed based on a specific sample x =
(x1 , , xn ) be t = T (x). We now define
(X; F ) = T (X) t.
(15.11)
(15.12)
In the above cases, we are interested in the expectation or the population proportion of a suitably defined function (X; F ). This set-up
includes the evaluation of bias and mean squared error of an estimator and the p-value of a test, as well as many other applications.
As F is unknown in practice, the quantities in equations (15.9),
(15.10) and (15.12) cannot be evaluated.
However, we may replace F by a known df F and consider instead
the quantities
EF [(X; F )] = EF [(X) (F )],
(15.13)
(15.14)
(15.15)
and
24
26
Wt = " t,
(15.16)
A continuous-time stochastic process satisfying the above two properties is called a Wiener process or standard Brownian motion.
From the first of these two properties, we can conclude that
E(Wt ) = 0,
(15.17)
Var(Wt ) = t.
(15.18)
and
For the change over a finite interval [0, T ], we can partition the
interval into N nonoverlapping small segments of length t each,
such that
T = N(t)
(15.19)
and
WT W0 =
N1
X
Wi(t) =
i=0
28
X
i
i=1
t,
(15.20)
N
X
E( i ) t = W0 ,
(15.21)
i=1
and
Var(WT ) =
N
X
Var( i )t =
i=1
N
X
t = T.
(15.22)
i=1
Hence, WT is the sum of W0 and N iid normal variates, which implies, given W0 ,
WT N (W0 , T ).
(15.23)
The Wiener process can be extended to allow for a drift in the
process and a constant volatility parameter.
29
Xt = a t + b Wt ,
(15.24)
where a is the drift rate and b is the volatility rate (a and b are
constants), and Wt is a Wiener process.
It can be verified that, given X0 , we have
XT N (X0 + aT, b2 T ),
for any finite T .
30
(15.25)
(15.27)
The terms a(Xt , t) and b(Xt , t) are called the drift rate and the
diusion coecient, respectively.
Xt is in general no longer normally distributed.
We consider a specific member of diusion processes, called the geometric Brownian motion.
Let St be the price of an asset at time t. St is said to follow a
geometric Brownian motion if
dSt = St dt + St dWt ,
(15.28)
32
(15.29)
d log St =
2
dt + dWt ,
(15.30)
so that log St follows a generalized Wiener process and hence is normally distributed. Thus, following equation (15.25), we conclude
log St N log S0 +
t, 2 t ,
(15.31)
t
t+
= S0 exp(t).
2
(15.32)
St
log St log S0 = log
S0
so that
St
log
S0
=
2
t, 2 t ,
t + tZ,
(15.33)
(15.34)
St
1
R log
(15.36)
t
S0
is the continuously compounded rate of return over the interval
[0, t].
1
St
E log
t
S0
2
= ,
2
(15.37)
(15.38)
Hence, expressed in terms of the total return and the dividend yield,
35
1
St
E log
t
S0
2
2
=
= .
2
2
(15.39)
We now consider the simulation of asset prices that follow the geometric Brownian motion given in equation (15.28), in which the
parameter captures the return due to asset-price appreciation.
From equation (15.34), we obtain
St = S0 exp
"
t + tZ ,
(15.40)
(15.41)
R =
xR
h
and
sR
= .
h
(15.42)
for i = 0, 1, 2, ,
(15.43)
There are in total 250 index values and we compute 249 daily returns,
which are the logarithmic price dierences.
The price index graph and the return graph are plotted in Figure
15.1.
We estimate the parameters of the price process and obtain xR =
0.0068% and sR = 1.0476%.
These values are in percent per day. If we take h = 1/250, the
2
xR s2R
=
R +
=
+
= 3.0718% per annum.
2
h
2h
(15.44)
S&P500 index
S&P500 return
1600
4
2
Return in %
Index value
1550
1500
1450
1400
1350
51
101
151
201
2007/01/04 2007/12/28
51
101
151
201
2007/01/04 2007/12/28
Normal probability plot
100
Probability
Frequency
80
60
40
20
0
S&P500 return
Data
1700
2
Return in %
1600
1500
0
2
1400
1
51
101
151
201
2007/01/04 2007/12/28
51
101
151
201
2007/01/04 2007/12/28
Normal probability plot
60
50
40
Probability
Frequency
Simulated value
30
20
10
0
Return of simulated series
Data
d log St =
2
(15.45)
d log St = J
2
dt + dWt + Jt dNt .
(15.46)
If J > 0, the jump component induces price appreciation on average, and the diusion part of the price will have a drift term adjusted
downwards. On the other hand, if J < 0, investors will be compensated by a higher drift rate to produce the same expected return.
To simulate the jump-diusion process defined in equation (15.46)
we first consider the jump component.
41
m
X
Zi .
(15.47)
i=1
J
2
h + hZ,
(15.48)
h + hZ exp mJ + J
Zi .
St+(i+1)h = St+ih exp J
2
i=1
(15.49)
42
For illustration we simulate a jump-diusion process using the following parameters: = 3.0718%, = 16.5640%, = 3, J = 2%
and J = 3% (the first 3 quantities are per annum).
Thus, the jumps occur on average 3 times per year, and each jump is
normally distributed with mean jump size of 2% down and standard
deviation of 3%. To observe more jumps in the simulated process,
we simulate 500 daily observations (about 2 years) and an example
is presented in Figure 15.3.
43
1700
2
Return in %
Simulated value
1600
1500
1400
1300
0
2
4
6
101
201
301
Time
401
101
201
301
Time
200
Probability
Frequency
150
100
50
0
Return of simulated series
401
Data