CH 2
CH 2
Survival models
2.1 Summary
In this chapter we represent the future lifetime of an individual as a random
variable, and show how probabilities of death or survival can be calculated
under this framework. We then define an important quantity known as the force
of mortality, introduce some actuarial notation, and discuss some properties of
the distribution of future lifetime. We introduce the curtate future lifetime ran-
dom variable. This is a function of the future lifetime random variable which
represents the number of complete years of future life. We explain why this
function is useful and derive its probability function.
17
18 Survival models
Thus, Sx (t) represents the probability that (x) survives for at least t years, and
Sx is known as the survival function.
Given our interpretation of the collection of random variables {Tx} x::: o as
the future lifetimes of individuals, we need a connection between any pair of
them. To see this, consider To and Tx for an individual who is now aged x. The
random variable To represented the future lifetime at birth for this individual,
so that, at birth, the individual's age at death would have been represented by
To. This individual could have died before reaching age x - the probability of
this was Pr[ To < x] - but has survived. Now that the individual has survived
to age x, so that we know that To > x, her future lifetime is represented by Tx
and her age at death is now x + Tx. If she dies within t years from now, then
Tx:::; t and To:::; x + t. Loosely speaking, we require the events [Tx:::; t] and
[To:::; x + t] to be equivalent, given that the individual survives to age x. We
achieve this by making the following assumption for all x :::: 0 and for all t > 0
that is,
Fo(x + t) - Fo(x)
(2.2)
Fx(t) = .
So(x)
Also, using Sx(t) = 1- Fx(t),
This is a very important result. It shows that we can interpret the probability
of survival from birth to age x + t as the product of
So(x + t + u)
Sx(t + u) = So(x)
So(x + t) So(x + t + u)
::::} Sx(t + u) = --------
So(x) So(x+t)
::::} Sx(t + u) = Sx(t)Sx+1(u). (2.5)
We have already seen that if we know survival probabilities from birth, then,
using formula (2.4), we also know survival probabilities for our individual from
any future age x. Formula (2.5) takes this a stage further. It shows that if we
know survival probabilities from any age x ( ::=: 0), then we also know survival
probabilities from any future age x + t ( ::=: x).
Any survival function for a lifetime distribution must satisfy the following
conditions to be valid.
Condition 1 Sx(O) = 1; that is, the probability that a life currently aged x
survives 0 years is 1.
These conditions are both necessary and sufficient, so that any function Sx
which satisfies these three conditions as a function of t ( ::=: 0), for a fixed
20 Survival models
x ( :=::: 0), defines a lifetime distribution from age x, and, using formula (2.5),
for all ages greater than x.
For all the distributions used in this book, we make three additional assump-
tions:
Assumption 2.1 Sx(t) is differentiable for all t > 0. Note that together with
fr
Condition 3 above, this means that Sx (t) ::::; 0 for all t > 0.
We remark that in the above example, So(120) = 0, which means that under
this model, survival beyond age 120 is not possible. In this case we refer to 120
as the limiting age of the model. In general, if there is a limiting age, we use
the Greek letter w to denote it. In models where there is no limiting age, it is
often practical to introduce a limiting age in calculations, as we will see later
in this chapter.
2.3 The force of mortality 21
Thus, for very small dx, we can interpret /J.,x dx as the probability that a life
who has attained age x dies before attaining age x + dx. For example, sup-
pose we have a life aged exactly 50, and that the force of mortality at age
50 is 0.0044 per year. A small value of dx might be a single day, or 0.00274
years. Then the approximate probability that the life dies on his 50th birthday
is 0.0044 X 0.00274 = 1.2 X 10- 5 •
We can relate the force of mortality to the survival function from birth,
So. As
S (dx) = So(x + dx)
x So(x) '
(2.9)
22 Survival models
fo(x)
f.Lx = So(x) ·
We can also relate the force of mortality function at any age x + t, t > 0,
to the lifetime distribution of Tx. Assume x is fixed and t is variable. Then
d(x + t) =dt and so
1 d
f.Lx+t = - So(x + t) d(x + t) So(x + t)
1 d
=- -So(x+t)
So(x + t) dt
1 d
= - So(x + t) dt (So(x)Sx(t))
So(x) d
= - So(x + t) dt Sx(t)
-1 d
= ---Sx(t).
Sx (t) dt
Hence
fx(t)
f.Lx+t = Sx (t)' (2.10)
This relationship gives a way of finding f.Lx+t given Sx(t). We can also use
equation (2.9) to develop a formula for Sx (t) in terms of the force of mortality
function. We use the fact that for a function h whose derivative exists,
d 1 d
-logh(x) = - - h ( x )
dx h(x) dx '
so from equation (2.9) we have
d
f.Lx = - - log So(x),
dx
and integrating this identity over (0, y) yields
and so
-1 d 1 1 1
/,lx = ---So(x) = 720 (1-x/120)- = - - -
So(x) dx 720 - 6x
As an alternative, we could use the relationship
Example 2.3 Let /,lx =Bex, x > 0, where B and c are constants such that
0 < B < 1andc>1. This model is called Gompertz' law of mortality. Derive
an expression for Sx (t).
B lx+t
= - - exp{r log c}
loge x
B
= __ (cx+t _ex),
loge
giving
The force of mortality under Gompertz' law increases exponentially with age.
At first sight this seems reasonable, but as we will see in the next chapter, the
force of mortality for most populations is not an increasing function of age
over the entire age range. Nevertheless, the Gompertz model does provide a
fairly good fit to mortality data over some age ranges, particularly from middle
age to early old age.
Example 2.4 Calculate the survival function and probability density function
for Tx using Gompertz' law of mortality, with B = 0.0003 and c = 1.07, for
x = 20, x = 50 and x = 80. Plot the results and comment on the features of the
graphs.
Solution 2.4 For x = 20, the force of mortality is µ 20+1 = Bc20+t and the
survival function is
ho(t)
f-l20+1 = - - =} ho(t) = f-l20+1 S20(t) = Be 20+1 exp { --B
- c 20 (c t - 1) } .
~~) ~c
Figure 2.1 shows the survival functions for ages 20, 50 and 80, and Figure 2.2
shows the corresponding probability density functions. These figures illustrate
some general points about lifetime distributions.
First, we see an effective limiting age, even though, in principle, there is no
age to which the survival probability is exactly zero. Looking at Figure 2.1,
we see that although Sx(t) > 0 for all combinations of x and t, survival beyond
age 120 is very unlikely.
-
2.3 The force of mortality 25
0.9
0.8
0.7
~
.g 0.6
'8
k
\'.>< 0.5
';rj
.::: 0.4
~
r/J
0.3
0.2
0.1
0
0 10 20 30 40 50 60 70 80 90 100
Time, t
O.D7
0.06
0.05
0.04
~
~
O.D3
0.02
0.01
0
0 10 20 30 40 50 60 70 80 90 100
Time, t
Second, we note that the survival functions are ordered according to age,
with the probability of survival for any given value of t being highest for age
20 and lowest for age 80. For survival functions that give a more realistic rep-
resentation of human mortality, this ordering can be violated, but it usually
26 Survival models
(2.13)
That is
We may drop the subscript t if its value is 1, so that Px represents the proba-
bility that (x) survives to at least age x + 1. Similarly, qx is the probability that
(x) dies before age x + 1. In actuarial terminology qx is called the mortality
rate at age x. We call ultqx a deferred mortality probability, because it is
2.4 Actuarial notation 27
the probability that death occurs in the interval oft years, following a deferred
period of u years.
The relationships below follow immediately from the definitions above and
the previous results in this chapter:
tPx +tqx = 1,
ultqx = uPx - u+t Px•
t+uPx = t Px uPx+t from (2.5), (2.15)
1 d
/Lx = - - -d xPO from (2.9). (2.16)
xPO X
Similarly,
1 d d
/Lx+t = -t Px
- -dt tPx::::} -d tPx = -tPx /Lx+t• (2.17)
t
fx(t) (2.18)
/Lx+t = Sx (t) ::::} fx (t) = t Px /Lx+t from (2.10),
(2.20)
Age
I
x I
x+s
I
x+s+ds I
x+t
Event (x) survives s years (x)
\_
~
Probability sPx µx +sds
~ fo ~ µx+l/2,
1
qx µx+sds
where the second relationship follows by the mid-point rule for numerical inte-
gration.
for 0 :": x :": 120. Calculate both qx and µx+l/2 for x = 20 and for x = 110,
and comment on these values.
So(x + 1) ( 1 ) l/ 6
1
Px = So(x) = - 120 - x '
giving q20 = 0.00167 and quo = 0.01741, and from the solution to Exam-
ple 2.2, µ 20 1=0.00168 and µ 110 1 = 0.01754. We see that µx+l/2 is a good
approximatibn to qx when the mohality rate is small, but is not such a good
approximation, at least in absolute terms, when the mortality rate is not
close to 0. 0
0
evaluate ex, we note from formulae (2.17) and (2.18) that
d (2.21)
fx(t) = tPx /J.,x+t = - dt tPx·
From the definition of an expected value, we have
~x = LXJ t fx(t)dt
= loot t Px /J.,x+1dt.
We can now use (2.21) to evaluate this integral using integration by parts as
-(t1Px\~ -
00
= l tPxdt).
(2.22)
= -(t 2
tPx\: - loo tPx 2t dt)
(2.23)
= 2 loo t1Px dt.
So we have integral expressions for E[Tx] and E[T}J. For some lifetime distri-
butions we are able to integrate directly. In other cases we have to use numer-
ical integration techniques to evaluate the integrals in (2.22) and (2.23). The
variance of Tx can then be calculated as
for 0 :S x :S 120. Calculate ~x and V[Tx] for (a) x = 30 and (b) x = 80.
30 Survival models
Now recall that this formula is valid for 0 :S t :S (120 - x), because, under this
model, survival beyond age 120 is impossible. Technically, we have
tPx --I 0
(1 - 12 6-x )
116
for x
for x
+t
+t
::::: 120,
> 120.
So the upper limit of integration in equation (2.22) is 120 - x, and
0
1120-x ( t ) 1/6
ex = 1- dt.
0 120 - x
We make the substitution y = 1 - t/(120 - x), so that t = (120 - x)(l - y),
giving
1
~x = (120 - x) fo y 116dy
= ~(120 - x).
0 0
Then e3o = 77 .143 and eso = 34.286.
Under this model the expectation of life at any age x is 6/7 of the time to
age 120.
For the variance we require E[T}]. Using equation (2.23) we have
{120-x
E [r;] = 2 lo ftPxdt
= 2 {120-x t
lo
(1- t
120 - x
)l/6 dt.
= 2(120 - x ) (
2
~- 6
13) .
Then
Since we know under this model that all lives will die before age 120, it
makes sense that the uncertainty in the future lifetime should be greater for
younger lives than for older lives. D
A feature of the model used in Example 2.6 is that we can obtain formulae for
quantities of interest such as ~x. but for many models this is not possible. For
example, when we model mortality using Gompertz' law, there is no explicit
formula for ~x and we must use numerical integration to calculate moments of
Tx. In Appendix B we describe in detail how to do this.
Table 2.1 shows values of ~x and the standard deviation of Tx (denoted
SD[TxD for a range of values of x using Gompertz' law, f-Lx = BcX, where
B = 0.0003 and c = 1.07. For this survival model, l30PO = 1.9 x 10- 13 , so that
using 130 as the maximum attainable age in our numerical integration is accu-
rate enough for practical purposes.
We see that ~x is a decreasing function of x, as it was in Example 2.6. In
that example ~x was a linear function of x, but we see that this is not true in
Table 2.1.
We are sometimes interested in the future lifetime random variable subject
to a cap of n years, which is represented by the random variable min(Tx, n).
For example, suppose that (x) is entitled to a benefit payable continuously
for a maximum of n years, conditional on survival. Then min(Tx, n) would
represent the payment period for the benefit. We derive the mean and variance
of this random variable, using a similar approach to the derivation of the mean
----------------..................
32 Survival models
and variance of Tx. The expected value of min(Tx, n) is denoted ~x:ii1' and is
called the term expectation of life.
n tPx µx+t dt
=font (-:ttPx)dt+nnPx
We can think of the curtate future lifetime as the number of whole years lived
in the future by an individual. As an illustration of the importance of curtate
future lifetime, consider the situation where a life aged x at time 0 is entitled
to payments of 1 at times 1, 2, 3, ... provided that (x) is alive at these times.
Then the number of payments made equals the number of complete years lived
after time 0 by (x). This is the curtate future lifetime.
We can find the probability function of Kx by noting that fork= 0, 1, 2, ... ,
K x = k if and only if (x) dies between the ages of x + k and x + k + 1. Thus
fork= 0, 1, 2, ...
The expected value of Kx is denoted by ex, so that ex= E[Kx], and is referred
to as the curtate expectation of life (even though it represents the expected
curtate lifetime). So
E[Kx] =ex
00
= L)Pr[Kx = k]
k=O
00
= L kPx· (2.24)
k=l
00
= 2 L k kPx - L kPx
k=l k=l
00
= 2 Lk kPx - ex.
k=l
As with the complete expectation of life, there are a few lifetime distributions
that allow E[Kx] and E[K~] to be calculated analytically. For more realistic
models, such as Gompertz', we can calculate the values easily using Excel or
other suitable software. Although in principle we have to evaluate an infinite
sum, at some age the survival probability will be sufficiently small that we can
treat it as an effective limiting age.
Analogous to the random variable min(Tx, n) we have the random variable
min(Kx, n ). For example, if a life aged x is entitled to payments of 1 at times
1, 2, 3, ... , n, where n is an integer, then min(Kx, n) represents the number of
payments made. An important difference between these two random variables
is that min(Tx, n) is a mixed random variable (with a density over (0, n) and
a mass of probability at n), whereas min(Kx, n) is a discrete random variable
since Kx is a discrete random variable. The expected value of min(Kx, n) is
34 Survival models
ex:lil = L kPx ·
k=I
~x = f
lo
00
tPx dt = f
l=O jl
1
~i+ tPx dt.
If we approximate each integral using the trapezium rule for numerical inte-
gration (see Appendix B), we obtain
{}+! I
jl tPx dt;::::; z (Jpx + J+lPx),
and hence
00 00
(2.25)
In Chapter 5 we will meet a refined version of this approximation. Table 2.2
shows values of ~x and ex for a range of values of x when the survival model
is Gompertz' law, with B = 0.0003 and c = 1.07. Values of ex were calculated
by applying formula (2.24) with an upper limit of summation of 130 - x, and
values of ~ x are as in Table 2.1. This table illustrates that formula (2.25) is a
very good approximation in this particular case for younger ages, but is less
accurate at very old ages. This observation is true for most realistic survival
models.
(2.26)
This is very similar to Gompertz' law, but adds a fixed term that is not age
related, that allows better for accidental deaths. The extra term tends to improve
the fit of the model to mortality data at younger ages. See Exercise 2.11.
In recent times, the Gompertz-Makeham approach has been generalized fur-
ther to give the GM(r, s) (Gompertz-Makeham) formula,
2.8 Exercises
Exercise 2.1 Let Fo(t) = 1 - (1 - t /105) 115 for 0 .::: t .::: 105. Calculate
(a) the probability that a newborn life dies before age 60,
(b) the probability that a life aged 30 survives to at least age 70,
(c) the probability that a life aged 20 dies between ages 90 and 100,
(d) the force of mortality at age 50,
(e) the median future lifetime at age 50,
(t) the complete expectation of life at age 50,
(g) the curtate expectation of life at age 50.
Exercise 2.3 Calculate the probability that a life aged 0 will die between ages
19 and 36, given the survival function
1
So(x) = .J100 - x, 0.::: x .::: 100 (= w).
10
Exercise 2.4 Let
(a) ShowthatSx(t)=e-AI.
(b) Show that f.1,x =A.
1
(c) Show that ex= (e" - 1)- .
(d) What conclusions do you draw about using this lifetime distribution to
model human mortality?
Exercise 2.6 Given Px =0.99, Px+l =0.985, 3Px+l =0.95 and
qx+3 = 0.02, calculate
(a) Px+3,
(b)2Px•
(c) 2Px+l,
(d) 3Px,
(e) 1i2qx.
Exercise 2.7 Given
1
Fo(x) = 1 - - - for x 2: 0,
l+x
find expressions for (a), (b), (c) below, simplifying as far as possible,
(a) So(x),
(b) fo(x),
(c) Sx(t),
and calculate:
Exercise 2.10 Suppose that Gompertz' law applies with µ,30 = 0.000130 and
f..lso = 0.000344. Calculate 10P40·
for x :=::: 0.
(2.27)
Exercise 2.12 (a) Construct a table of Px for Mak:eham's law with parame-
ters A= 0.0001, B = 0.00035 and c = 1.075, for integer x from age 0 to
age 130, using Excel or other appropriate computer software. You should
set the parameters so that they can be easily changed, and you should
keep the table, as many exercises and examples in future chapters will use
Mak:eham's law.
(b) Use the table to determine the age last birthday at which a life currently
aged 70 is most likely to die.
(c) Use the table to calculate e10.
(d) Using a numerical approach, calculate ~70·
Exercise 2.13 A life insurer assumes that the force of mortality of smokers at
all ages is twice the force of mortality of non-smokers.
(a) Show that, if* represents smokers' mortality, and the 'unstarred' function
represents non-smokers' mortality, then
(b) Calculate the difference between the life expectancy of smokers and non-
smokers aged 50, assuming that non-smokers mortality follows Gompertz'
law, with B = 0.0005 and c = 1.07.
(c) Calculate the variance of the future lifetime for a non-smoker aged 50 and
for a smoker aged 50 under Gompertz' law.
Hint: You will need to use numerical integration for parts (b) and (c).
2.8 Exercises 39
~x = -
1
-1
So(x) x
00
So(t)dt,
ex:iil = L kPx ·
k=l