Epidemicmodellinganintroduction 150418094321 Conversion Gate02 PDF
Epidemicmodellinganintroduction 150418094321 Conversion Gate02 PDF
IN MATHEMATICAL BIOLOGY: 15
Editors
C. CANNINGS
University of Sheffield, UK
F. C. HOPPENSTEADT
Arizona State University, Tempe, USA
L. A. SEGEL
Weizmann Institute of Science, Rehovot, Israel
E P I D E M I C MODELLING: AN I N T R O D U C T I O N
Epidemic Modelling:
An Introduction
CAMBRIDGE UNIVERSITY PRESS
Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, Sao Paulo
Published in the United States of America by Cambridge University Press, New York
www.cambridge.org
Information on this title: www.cambridge.org/9780521640794
© Cambridge University Press 1999
A catalogue record for this publication is available from the British Library
... It follows that epidemic theory should certainly continue to search for new
insights into the mechanisms of the population dynamics of infectious dis-
eases, especially those of high priority in the world today, but that increased
attention should be paid to formulating applied models that are sufficiently
realistic to contribute directly to broad programs of intervention and control.
Norman T. J. Bailey,
The Mathematical Theory of Infectious Diseases (1975, p. 27)
Preface xi
1 Some History 1
1.1 An empirical approach 1
1.2 A deterministic model 2
1.2.1 The Law of Mass Action 6
1.3 Prom curve-fitting to homogeneous mixing models 7
1.4 Stochastic modelling 11
1.5 Model fitting and prediction 13
1.6 Some general observations and summary 15
1.6.1 Methods and models 16
1.6.2 Some terminology 16
1.7 Exercises and Complements to Chapter 1 17
2 Deterministic Models 20
2.1 The simple epidemic in continuous time 20
2.2 The simple epidemic in interacting groups 23
2.3 The general epidemic in a homogeneous population 27
2.4 The general epidemic in a stratified population 35
2.5 Generation-wise evolution of epidemics 38
2.6 Carrier models 45
2.7 Endemicity of a vector-borne disease 46
2.8 Discrete time deterministic models 48
2.9 Exercises and Complements to Chapter 2 53
vn
viii Contents
subject are Bailey (1975) and Anderson and May (1991); both are often
referred to in the text. The former considers most of the classical epidemic
models, both deterministic and stochastic, while the latter concentrates
essentially on deterministic results. We believe that both types of models
have a role to play in describing the spread of infections in large and small
populations, and have attempted to give due weight to each.
Current epidemic modelling relies on a great variety of mathematical
methods; we have endeavoured to emphasize the intrinsic interest of such
methods, as well as demonstrate their practical usefulness. Exercises and
complements have been provided at the end of each Chapter, with the dual
intentions of extending the text and of providing opportunities for readers
to practise their skills in modelling and the analysis of models. The exercises
are not of uniform difficulty.
For those who wish to reach the forefront of current research, the vol-
umes of papers edited by Mollison (1995) and Isham and Medley (1996),
both arising from a six-month research programme on Epidemic Models at
the Newton Institute in Cambridge in 1993, provide further illustrations of
epidemic modelling and many challenging problems in the field.
Finally we thank all of our colleagues who have collaborated with us
over many years in both this area of applied probability and others; their
contributions are too many to mention individually, save that DJD pays
tribute to David Kendall who first introduced him to the topic of this book,
and JMG expresses his gratitude to Norman Bailey and Maurice Bart let t,
pioneers of stochastic epidemic modelling.
We also thank David Tranah, and the Copy Editor and others at Cam-
bridge University Press for their cooperation in producing this book without
invoking LaTJ^X.
Daryl Daley, Joe Gani
Canberra, October 1998
Some History
Table 1.1. Numbers of deaths due to eight causes, and related risks
Causes Deaths Risk
1. Thrush, Convulsion, Rickets, Teeth and Worms; Abortives,
Chrysomes, Infants, Liver-grown and overlaid 71,124 0.310
2. Chronical Diseases: Consumptions, Ague and Fever 68,271 0.298
3. Acute Diseases, and Miscellaneous 49,505 0.216
4. Plague 16,384 0.071
5. Small-pox, Swine-pox, Measles and Worms without
Convulsions 12,210 0.053
6. Notorious Diseases: Apoplex, Gowt, Leprosy, Palsy,
Stone and Strangury, Sodainly, etc. 5,547 0.024
7. Cancers, Fistulae, Sores, Ulcers, Impostume, Itch, King's Evil,
Scal'd-head, Wens 3,320 0.014
8. Casualties: Drowned, Killed by Accidents, Murthured 2,889 0.013
Source: Graunt (1662). The figures for Groups 1, 2, 3, 6 and 8 are quoted directly in
Graunt's text, while those for Groups 4, 5 and 7 are obtained from his complete table
of casualties appended after his comments in 'The Conclusion'.
In the 20 years 1629-36 and 1647-58, there were 229 250 deaths recorded
from 81 different causes. Table 1.1 consolidates these data into eight main
groups. The relative risks of death from each of the eight causes are indi-
cated in the column furthest to the right in Table 1.1.
The main killers were Groups 1, 2 and 3; Graunt was led to observe that
whereas many persons live in great fear, and apprehension of some
of the more formidable, and notorious diseases following [Group 6]; I
shall onely [sic] set down how many died of each: that the respective
numbers, being compared with the Total of 229,250, those persons may
the better understand the hazards they are in.
The notorious diseases were further broken up by Graunt into the sub-
categories of Table 1.2. Among these, apoplexy appears to have been the
largest killer. Graunt's analysis of the various causes of death provided the
first systematic method for estimating the comparative risks of dying from
the plague, as against the chronical or other diseases, for example.
These observations may well be considered to be the first approach to
the theory of competing risks, a theory that is now well established among
modern epidemiologists.
so
= £(0) exp ( - ft n(u) du) = £(0)e~M«) = C(t), (1.2.2)
where M(i) is the cumulative hazard. We shall use £(•) below.
Consider another cohort subject to both the general per capita death
rate /x(t) as above and a further hazard (infection) like smallpox with a
constant infection rate (3 per individual per unit time. Individuals succumb
1. Some History
only once, the result of such infection being either death in a fraction a of
cases or immunity for the remainder of life in the complementary fraction
I — a. Denote the number of individuals still susceptible to the disease
at age t by x(t), and the total number of the surviving cohort of age t,
whether immune or not, by £p(t) as shown in Figure 1.1. To simplify the
mathematical model, the infectious state is assumed to be instantaneous,
so that as soon as an infection occurs, the infective individual either dies or
recovers immediately. Then for the x(t) susceptibles and z(t) = £@{t) —x(t)
immunes in this cohort,
x(t) =-(n(t) + 0)x(t) (1.2.3a)
and
z(t) = -v{t)z{t) + (1 - a)/3x(t). (1.2.3b)
1.2. A deterministic model
1200-
1000-
800-
Cohort
600-
size
400-
200-
These equations are solved via integrating factors. Using M(t) and x(0)
£ ( 0 ) = £(0) = £0 a s before, we have
whence
x(t) = (1.2.4)
and
- e""')
(1.2.5)
using (1.2.2); observe that £(£) = £0(£) when the infection rate (3 = 0.
Equation (1.2.5) relates the sizes of the surviving cohorts of age t in pop-
ulations with ({3 > 0) and without (/? = 0) smallpox, respectively. Bernoulli
used it in the form
(1.2.6)
1 - a + ae-#
to estimate the size of a surviving cohort £(£) in a 'state without smallpox'
on the basis of Halley's (1693) data from Breslau, now Wroclaw. This esti-
mation required parameters a and /?; on reviewing what evidence he could
1. Some History
so that
In A(fr(f)/C(O) = "(<» + 0 0 , (1-2-7)
where a = — ln[a(l — e~^)]. This is the simplest way of expressing the
result (1.2.5) for the purpose of estimating f3 and a conditional on such
extended data being available. All that Bernoulli could do was to present
the advantage of variolation (i.e. absence of deaths due to smallpox) on the
basis of his model-based calculations. Note too that the population risk
of death from smallpox (cf. Tables 1.1-2) as implied by Table 1.1 is about
100/1300 « 7.7%, higher than in Table 1.1 because the London population
from which Graunt drew his data, included more immigrants than Breslau.
In Halley's day Breslau had rather few immigrants, and hence, propor-
tionately more infant and childhood deaths, smallpox being more prevalent
amongst children than adults.
Now the outbreak will take much longer to decline to extinction than it
took to rise, for those especially exposed have in large part been already
attacked and the disease must spread, in the main, among persons
whose manner of life brings them comparatively little into contact with
their fellows.
1.3. From curve-fitting to homogeneous mixing models
where the constant (3 is such that (3xtyt < xt for all t, i.e. (3 < l/(maxi>i yi).
These new infectives are a proportion (3 of the number xtyt of contacts be-
tween susceptibles and infectives, where /? is known as the infection param-
eter. Because of the constraint on /?, it follows that in a closed population
in which yt — N — xu we have (3xt(N — xt) < xt or f3(N - xt) < 1; this is
certainly satisfied if /3 < 1/N.
Continuous time versions of epidemic equations were used by Ross (1916)
and Ross and Hudson (1917) in their studies of populations subject to in-
fection. But the form of equations most commonly used to characterize the
typical general epidemic with susceptibles x(t), infectives y(t) and immunes
z(t) (such as a measles epidemic) is due to Kermack and McKendrick (1927).
They assumed a fixed population size N = x(t) H- y(t) 4- z(t), and using the
homogeneous mixing principle for continuous time t > 0 derived the (now)
classical equations
d^c
-77 = -0xy, (1.3.2a)
d£
^ = 0xy - 7i/, (1.3.2b)
^ = 72/, (l-3.2c)
\w=N-z
0 2oo
so that
= N-z- xoe~z/p.
Hence
shows that if the epidemic is ever to grow, then we must have dy/dt\t_0 > 0,
or xo > p, i.e. the initial number of susceptibles must exceed a threshold
value p. The second, deduced from (1.3.3), states that as t —> oo, z(i) —>
Zoo < N. Now in the limit t —• oo, N — z^ — x0e~ZoG^p = 0 (see the graphs
in Figure 1.2).
Suppose now that x0 is close to N; then z^ will be the approximate
solution of
0 = N - Zoo - N -
-f+p> <»*•>
1.4. Stochastic modelling 11
xt
5
4
3
2
: = X 3 = 1, ^ 3 = 0
1
JT3 = X4 = 0, =0
0 1 2 3 4
Figure 1.3. Two sample paths of a Reed-Frost epidemic, ending at
t =3( ) and £ = 4 ( ) respectively.
x2
(1.4.2)
1.5. Model fitting and prediction 13
(1.5.1)
Abbey obtained estimates of the probabilities qi of no contact in each sep-
arate interval i = 1,2,3,4 by the Maximum Likelihood method as
The fit of this model to the data turns out to be far from perfect: on a
goodness-of-fit test Abbey reported %2 = 53.1 on 4 degrees of freedom, and
noted a significantly improved fit from estimating (rather than counting)
the number of susceptibles. Abbey (1952) found the same true of other
measles, cHickenpox and German measles data, and investigated variation
of the contact rate either with time or between individuals, or both, as
other possible reasons for the inadequate fit of the model to the data (see
Chapter 4).
Individual's
disease state: Susceptible Latent < Infectious > Immune/Removed
1 1 1 1 1 > time
Epoch: tA tB tc t£> tE
<— Incubation period —>
< Serial interval •
tj\'. Infection occurs £#: Latency to infectious transition
tc- Symptoms appear t^: First transmission to another susceptible
tE- Individual no longer infectious to susceptibles (recovery or removal)
Note: tp is constrained to lie in the interval (iB»^i?)» s o t& > tc (as shown) and
tD < tc are both possible.
the incubation period. Often the serial interval and latent period are used
interchangeably although their meanings are different. The fundamental
quantity in the process of infection is the serial interval, but the latent period
is often used in the literature because it is assumed that a second infection
will occur as soon as the first infective becomes infectious. Anderson and
May (1991, §3.2.4) use the term generation time of the disease agent; in
terms of the notation in Figure 1.4 it is the expectation of (£# — t^) +
\(tE — £#)> i-e. the mean latent period plus half the mean infectious period.
Source: From bar charts in En'ko (transl. 1989). See Exercise 1.3 for more explanation.
initial case on day 0, to generation 0, 1, ... respectively. The table gives for
each epidemic the first day of observing a case for each generation, and the
number of cases reported on the various days within the generation, for as
long as there were any such cases.
(a) Investigate whether the spread of the dates of recording cases within an
epidemic shows any systematic trend from one generation to another (if not,
then periodicity is strong, and identifying a case is variable).
(b) Repeat the analysis of (a) on the combined data: what sort of additional
variability does such pooling of data introduce?
(c) The construction of analogues of qt and q as in section 1.5 entails the
estimation of the population size as well.
(d) If a model is fitted as in (c), then a \2 goodness of fit test can be
performed.
1.7. Exercises and Complements to Chapter 1 19
1.4 Wilson and Burke (1943) list the monthly numbers of measles cases in Prov-
idence RI for the years 1917-1940 as in Table 1.7. Plot out the course of the
nine epidemics for which the epidemic curves peak around May 1918, Mar.
1921, Mar. 1923, Jan. 1926, Apr. 1928, Jan. 1932, May 1935, Mar. 1937,
Mar. 1940. Observe that there is a marked seasonality effect if the dataset
is treated as a whole.
(a) Assuming a mean serial interval of 0.5 months and a closed population
for the course of the epidemic, investigate how estimates of any or all of JV,
/o, /3, 7 might be constructed.
(b) Repeat this analysis for the record as a whole assuming instead an
immigration of 2130 new susceptibles each September (thus, Aryear varies as
each year changes).
(c) Repeat the analysis of (b) but assuming now a steady immigration of
178 new susceptibles per month.
20
2.1. The simple epidemic in continuous time 21
where, as throughout this chapter, x(t) and y(t) denote the numbers of
susceptibles and infectives at time t, with initial conditions (x(Q),y(0)) =
(%o-> Vo) with 7/o > 1. Then assuming that the individuals of the population
mix homogeneously, we can write
y(N-y) \y N-yJ N
l n ^ l n ^
N - y(t) N-y0
Hence
As t —> oo, equation (2.1.2) shows that y(t) —> N, so that according to the
model all individuals in the population eventually become infected, thus
causing the end of the epidemic (in the mathematical sense).
In this model we have both x(t) > 0 and y(t) > 0 for all finite positive
£, so the question arises as to when we may consider the epidemic to have
terminated in practical terms. Realistically, we could define the 'end' of the
epidemic to occur at Ti = inf{t : y(t) > N - 1}, i.e. when the number of
infectives is within 1 of its final value. Since the function y(-) has a positive
independent rediscovery of the logistic curve by Pearl and Reed in 1920', and that to his
knowledge 'the only reference to the work of Verhulst in modern times prior to [1920] is
[a paper in 1918 by Du Pasquier]'. Bailey (1975) gives no account of its emergence in
epidemic theory; Bailey (1955) attributes the stochastic version of the model to Bartlett's
1946 lecture notes (see Bartlett, 1947).
22 2. Deterministic Models
SO
At this time we have x(t) = y(t) = |JV, and (dy/dt) = /3(|JV)2. The dashed
curve in Figure 2.1 illustrates equation (2.1.4), i.e. the epidemic curve for
the deterministic model of a simple epidemic (cf. also Figure 2.12 below).
2.2. The simple epidemic in interacting groups 23
y(t)
dy/dt
For ij = l , . . . , m :
Let Xj(t), yj(t) denote the numbers of susceptibles and infectives in each
of the groups j = 1 , . . . ,m respectively. Then we readily see that when
infection is transmitted both within and between groups, equation (2.1.1)
can be generalized to the set of equations
+ (j = 1 , . . . , ra), (2.2.1)
dt
subject to the initial conditions Vj(O) = yj0 and Xj(0) = Nj — yjO, as follows
from Xj = Nj — yj.
24 2. Deterministic Models
(2.2.2)
dt
(j = l , . . . , m ) . (2.2.3)
If all the initial values t/jo — Vo are the same, this set of equations basically
reduces to the single equation
- y)y[l + (m - l)/c] =
VoN
— (j = l , . . . , m ) . (2.2.4)
or in terms of m-vectors U and InU and the mxm matrix B = (6^) defined
by
f lnC/i 1 ' 1
I/2 lnC/2
lnU =
Un(7mJ
2.2. The simple epidemic in interacting groups 25
dv
i.e.
m
d
2=1
Hence on integration,
where X,(0) = nili^f l (0) = UZo (*Jo/Nf\ Now from (2.2.7), for all
j = l,...,m,
26 2. Deterministic Models
,K/(,C-1)
K - 1 dv '
whence
v=
n. K. — 1 /o,
so that the time is given parametrically in terms of G. We can now find the
solutions for Uj(v) and hence for the original yj — N — Ne~aNPtUj.
These computations simplify as follows in the special case where yio — 1
and yjo — 0 for j = 2,..., ra, so the y3,(t) for j = 2 , . . . , m are identical for
all t > 0, and equations (2.2.4) reduce to the two equations
= xi + (m — 1)«£2 — Na,
(2 2 n)
- -
[1 + (m - 2)«]a;2 - iVa.
so that
Xt(0) = u[1+{m-2)K]/K(0)U^{m~1)/K(0) = (1 - # - i
K/K /K
X2(0) = Ui (0)Ul {0) = (1 - AT- )""7^.
1
2.3. The general epidemic in a homogeneous population 27
/
f
h
>']'k \\ \
\\
/ \\
/1 w
J
/ w
o to
(a) K = 0.1 (b) « =
Figure 2.3. Numbers of infectives y\ and 2/2 ( ), and infection rates y\
and y2 ( ), for a Rushton-Mautner simple epidemic spreading in two
communities, with #i(0) = £2(0) = N, yi(0) = 1 and 2/2(0) = 0, and K as
shown.
Hence
Xi(v) = ((1
X2(v) - ((1
where
v = — 1
K l
'd5. (2.2.13)
~ Jo
Hence, for any time v, G(v) is known by (2.2.13), and thus also X\{v) and
X2(v). Prom these Ux{v) and U2(v) are obtained as Ui = ^X^ 7 7 1 " 1 ^ and
C/2 = X ^ 1 " ^ ^ " 2 ^ , and thus z, and Vj = N - Xj (j = 1,2).
Figure 2.3 depicts the spread of infection in a population of two equally
sized strata. For larger K as in (a), the outbreaks largely overlap and re-
inforce each other, whereas in (b) the epidemics occur more slowly and
approximately in sequence.
Section 2.1, but infectives may die, be isolated, or recover and become im-
mune. Individuals in the population are counted according to their disease
status, numbering x(t) susceptibles, y(t) infectives and z(t) removals (dead,
isolated or immune), so that x(t) is non-increasing, z(t) non-decreasing and
the sum x(t) + y(t) -b z(t) = JV, for all t > 0. The differential equation
governing x(t) is
d^c
— = -(3xy, (2.3.1)
where /? > 0 is the pairwise rate of infection as before, and (x,y, z)(0) =
(#o>2/o?2o) with yo > I, zo = 0. The number of infectives simultaneously
increases at the same rate as the number of susceptibles decreases, and
decreases through removal (by death, isolation or immunity) at a per capita
rate 7 > 0, so that
^ L i y . (2.3.2)
Finally the number of removals increases at exactly the same rate as the
loss of infectives, so that
ft=iv. (2.3.3)
Observe that (d/dt)(x(t) + y(t) + z(t)) = 0, as is consistent with the total
population size remaining fixed at TV.
In their first paper entitled 'A contribution to the mathematical theory of
epidemics', Kermack and McKendrick (1927) proposed these equations as
a simple model describing the course of an epidemic. We can write (2.3.1)
and (2.3.3) as
l . d * = _ 0 d z = _ l dz
K }
x dt 7 dt p dt'
where p = 7//? is the relative removal rate. Integrating this differential
equation directly, and using the initial values xo and z0 — 0 as above, we
obtain
x(t) = z o e - 2 ( t ) / p . (2.3.5)
A second integral is also readily obtained: equations (2.3.1-2) imply that
x(t) and y(t) satisfy
ax x
so
x(t) + y(t) - plnx(t) =xo + yo- plnx0. (2.3.6)
Within the region where x, y and z are non-negative, equation (2.3.2)
yields the inequality y > -7?/, which in turn implies that y(t) > yoe~l1 > 0
2.3. The general epidemic in a homogeneous population 29
(all 0 < t < oo). Similarly, x > -px(xo+yo) so that x(t) >
0 (all 0 < t < oo). However, from (2.3.1), x(t) is strictly decreasing for all
such t. Consequently, x(t) and z(t), and hence y(t) as well, converge to
finite limits XQO, Z^ and ^ as t -> oo, with y^ — 0 as we would have
l i m ^ o o i > 0 otherwise. Further, from (2.3.5), XQO = x0e~Zoo/p. Because
z<x> < #o + 2/o < oo, XQO > 0, and equation (2.3.2) then implies that y is
ultimately monotonic decreasing; it is in fact monotonic decreasing for all
t > 0 if and only if x 0 < 7//? = p.
Kermack and McKendrick's results constitute a benchmark for a range of
epidemic models, so we state them formally for later reference.
Theorem 2.1 (Kermack-McKendrick). A general epidemic evolves ac-
cording to the differential equations (2.3.1-3) from initial values (xo,2/o?O),
where x 0 + Vo = N.
(i) (Survival and Total Size). When infection ultimately ceases spreading, a
positive number XOQ of susceptibles remain uninfected, and the total number
Zoo of individuals ultimately infected and removed equals xo + yo — x<x> and
is the unique root of the equation
where yo < Zoo < #o + 2/o? P — i/P being the relative removal rate.
(ii) (Threshold Theorem). A major outbreak occurs if and only if y(0) > 0;
this happens only if the initial number of susceptibles Xo > p.
(iii) (Second Threshold Theorem). If XQ exceeds p by a small quantity u,
and if the initial number of infectives yo is small relative to v, then the final
number of susceptibles left in the population is approximately p — v, and
Zoo « 2V.
The major significance of these statements at the time of their first publi-
cation was a mathematical demonstration that even with a major outbreak
of a disease satisfying the simple model, not all susceptibles would neces-
sarily be infected. Conditions were given for a major outbreak to occur,
namely that the number of susceptibles at the start of the epidemic should
be sufficiently high; this would happen, for example, in a city with a large
population. These conclusions were consistent with observation, such as
had been noted by Hamer in his 1906 lectures.
It remains to demonstrate part (iii) of the theorem. Kermack and McK-
endrick did so by first finding an approximation to x(t) as an explicit func-
tion of t. To this end, observe that substituting from (2.3.5) into (2.3.3)
30 2. Deterministic Models
dz
i ^ . ~\vj •"u^ /• yZ.o.o)
This differential equation does not have an explicit solution for z in terms of
t. However, using the expansion e~u = 1 — u + \v? + O(u3) and neglecting
the last term, yields the approximate relation
\2x° ifx° A 2
fx° 2
Now setting
a=\JQ(N-xo)+(—--l\ (2.3.10)
this reduces to
dz p2
Substitute
atanh^^f.-p
p2 L x \p o
d^ P2J , 2 2, U2 \ P2 u2 d v
- r « -— va - a tanh v)f = —asech v—.
dt 2x0 x0 dt
Hence
dv T ,
— « f 7a, so v « ^ 7 ^ -h
and
() i (£2 ) ^ ! tanh (I7erf - ^) (2.3.12)
XQ
1
with </? = tanh"
" 1 [(l/a)((x o /p) - l ) ] .
2.3. The general epidemic in a homogeneous population 31
~
Equivalently, XQQ ~p-\-v — 2v = p — V. Note that this result is obtained
from the approximation (2.3.9) to the differential equation (2.3.8).
Another route to part (iii) of Theorem 2.1 is to analyse equation (2.3.5)
more directly. Observe that the function f(z) — x^~zlp is convex mono-
tonic non-increasing for z > 0, so it intersects the line g(z) = N — z =
#o 4- 2/o — z at most twice. In fact, since #(0) > /(0), there is exactly one
point of intersection z^ say, in z > 0 unless t/o = 0, in which case z — 0
is also a point of intersection. Now if /'(0) = —xo/p ^ —1? the point of
intersection in z > 0 is necessarily close to the origin; conversely, z^ is much
larger than zero if /'(0) < —1. This effectively substantiates (ii). For (iii),
again use an expansion of the exponential function, this time in (2.3.7), so
that for ZQO > 0 and yo ~ 0,
0 9
Figure 2.4. Kendall's modified infection parameter f3(z) ( ) and /3 ( ).
The solution of equation (2.3.4) with (3{z) in place of /? is
= XQ -f 2/0 - z
Z-ooO
Figure 2.5. Zoo and z-oo as points of intersection of
w = xo -f 2/0 — z and w = xoe~z^p.
(4>Vo,zo) = - p - - p- pln(xo/p),0),
except that we consider them as being defined for all — oo < t < oo, and
satisfying (xf+ y' + z')(u) = N — zp (all u). The limiting values at u = —oo,
oo are (7V',0,-Iz^l) and (Nf - \ZLQOI - ^ , 0 , ^ ) respectively, where
zLoo = z-oo - zp, z'oo = Zoo- zp. Note that I^L^I + z ^ = |z_oo| + z^oo.
These quantities are illustrated in Figure 2.6.
This information can be interpreted conveniently in terms of the intensity
of the epidemic, defined by
\Z-c
i = (2.3.18)
N1
34 2. Deterministic Models
w = N -zp-z'
using x' = (xrz'/p, Nf = pe^~^/p, and TV' - K J -z'oo = pe~ *'<*>'p. Hence
TV'
or
(2.3.19)
Further, since N' = pe|2:-<~l/p, or \ZLQOI = pln(N'/p), then
W-c ln(N'/p)
(2.3.20)
—
N'i
Table 2.2 lists various indicators of the characteristics of an epidemic in
terms of the intensity. Note that an epidemic of zero intensity represents
the limiting case where i | 0.
We can see from Table 2.2 that for all i in 0 < i < 1, or equivalently,
for 0 < p < Nf < oo, there are more removals after u = 0 than before.
For example, if N' = 1000 and p = 896, so N'/p = 1.116 and i = 0.2,
then about 20% of the population become infected and thus about 800
susceptibles remain at the end of the epidemic. On the other hand, with
the same N' but now p = 390 so that N'/p = 2.564 and i = 0.9, about
90% of the population are affected by the disease and only 100 remain
susceptible at the end of the epidemic. Broadly speaking, while a small
major outbreak occurs when the population size parameter N' is in the
2.4. The general epidemic in a stratified population 35
region of the critical threshold size p, most of the population is affected (i.e.
a large major outbreak occurs) as soon as N' is 3 or more times p.
Table 2.2 can also be used to relate the measures (#o,2/o) m t n e original
time scale t to the 'standardized' measures of the table. For, supposing
that (a?o, Vo)i P a n d N are given, then we can solve equation (2.3.5) to find
the value zp for which xo — p, namely z p = p\n(xo/p)y and hence obtain
z' = z — zp. Then z-co and z^ are the two roots of
N - z - xoe~z/p = 0,
and finally N' = N 4- |^-oo|- All the quantities of Table 2.2 can now be
found.
For example, (a?0, yo) = (800,100), N = 900 and p = 390 gives zp = 280.2,
z-oo = -78.6, z^ = 796.1, so AT' = 978.6 and z = (78.6 4- 796.1)/978.6
= 0.8938, i.e. close to 90% of the population are infected by the epidemic.
Note that the epidemic is skewed about the central value zp\ in the ex-
ample just given, z^ = z^ - zp = 796.1 - 280.2 = 515.9, while -z^^ =
\z-oo ~ zp\ = 78.6 4- 280.2 = 358.8, which is about two-thirds of 515.9. The
corresponding values of the susceptibles x' are x'^ — 103.9 and x!_oo —
978.6. The value v discussed after (2.3.14) could be estimated as either
800-390 = 410 or 390-103.4 = 286.6, again differing appreciably. In other
words, in terms of Xo = p+v, the rough approximation x^ « p—v — x$ — 2v
holds at best for a small range of intensities i
(2.4.1)
Vj = + • •+• (2.4.2)
(2.4.3)
2/1 7i
X2 2/2 Z2 72
x = y = 7 =
We indulge in the same sort of abuse of notation for lnx as with lnU in
(2.2.6), and extend it to the use of diag7 - 1 = diag(7f 1 ,7^" 1 ,... ,7^*) for
the diagonal matrix whose elements are the reciprocals of the elements of
a vector like 7. In this vector notation, with B = (A?)? the differential
equations (d.e.s) (2.4.1-3) are expressible as
Writing B 7 = B'diag(7~ 1 ), the first and third of these give (cf. (2.3.4))
dlnx . , . 1x
—r— = -B / diag(7" 1 )z = - B 7 z .
j - 1, •..,m), (2.4.5)
i=\
2A. The general epidemic in a stratified population 37
provided that this solution curve or trajectory lies in the region X defined
by
Xj, Vjy Zj > 0, Xj + yj + zj = Xjo + yjo (j = 1, • •
•, m). (2.4.6)
It is not difficult to check that the trajectory does indeed lie in X: it fol-
lows from (2.4.1) that trajectories in X are monotonic in each Xj, and
yj(t) > 0 for 0 < t < co assuming the matrix B is primitive (i.e. for
sufficiently large n, all components of B n are strictly positive). Then
using the boundedness as well, it follows that the component vectors of
lini£_>Oo(x,y,z) = ( x 0 0 ^ 0 0 ^ 0 0 ) exist and satisfy
y~=0, x^xo-fvo-z~,
x? = xj0 exp ( - [B7zoo]i) (j = 1,..., m).
Thus, the d.e. system (2.4.1-3) yields a unique limit point ( x 0 0 ^ 0 0 ^ 0 0 ) .
Further analysis completes the proof of part (i) of the following statement,
which is an analogue of the Kermack-McKendrick Theorem 2.1.
Theorem 2.2. A general epidemic evolves in a stratified population ac-
cording to the differential equations (2.4.1-3) in the region X, starting from
initial values (xo,yo,0), where the transmission matrix B is primitive and
the removal rate vector 7 has all components positive.
(i) (Survival and Total Size). When infection ultimately ceases spread-
ing, positive numbers of susceptiblcs Xj° remain in each of the strata j =
1 , . . . , m. The numbers of removals Zj° constitute the unique solution in X
of the equations (2.4.7). If Xo is replaced by any XQ that is componentwise
larger than XQ, i.e. XQ >: xo, then z°° is replaced by z /oc >z z°°.
(ii) (Threshold Theorem). A major outbreak occurs if and only if the eigen-
value A max of the non-negative matrix diag(xo)B 7 with largest modulus,
lies strictly outside the unit circle, where B 7 = B / diag(7~ 1 ).
(iii) (Second Threshold Theorem). If x 0 = f -h Ax, where the components
of Ax are non-negative and sufficiently small and A max (diag(£)B 7 ) = 1 <
Amax(diag(x o )B 7 ), then
(2A8)
2=1
1
i.e. z°° w 2[l'diag(v~
~ ) )(x 0 - £)]v, where v is the right eigenvector of the
1
- expf-CB^W),]), (2.4.9)
starting from any convenient z^0) e X such as z^0^ = yo. Thus, the vector
z°° is a fixed point for T, i.e. Tz°° = z°°, and under the stated conditions
it is the only fixed point in X.
The criticality statement (ii) is motivated in part by recalling that for a
general epidemic in a homogeneous population, the initial point (xo,yo) is
close to the singular point (#o, 0) for the pair of d.e.s (2.3.1-2) describing the
evolution of the process. The linear d.e. system approximating (2.3.1-2) at
that singular point either diverges when XQ > p or converges when x 0 < p. A
linear approximation to (2.4.1-3) in the neighbourhood of (xo,yo) ~ (xo, 0)
leads to a similar divergent/convergent dichotomy, determined here by the
dominant eigenvalue of the matrix analogous to the product xo/^T"1 of the
Kermack-McKendrick Threshold Theorem. We shall meet Amax again in
Section 3.5 in the context of the Basic Reproduction Ratio.
The approximation in (2.4.8) is derived by a power series expansion in
the solution equations (2.4.7), much along the lines of the argument below
(2.3.14) based on (2.3.5).
The description we have given for an epidemic in a stratified popula-
tion is not the only possibility. For example, Ball and Clancy (1993) and
Clancy (1994) allow individuals to move between strata, hence offering a
more general cross-infection mechanism than our formulation with the ma-
trix B allows. Exercise 2.4 sketches details of a special case of Theorem 2.2,
and Exercise 3.9 evaluates Amax for a special case of B.
Exercise 2.5 discusses briefly the question of stratified population ana-
logues of the 'other' root Z-OQ (see Figure 2.5) and the standardized in-
tensity measure i at (2.3.18). Certainly, B " 1 is well-defined when B 7 is
primitive.
(2.5.3)
so
/ / ^ ^ d u j (j = 0 , 1 , . . . ) ,
Joo Jo
40 2. Deterministic Models
150
100
1 2 3 4 0 5
(a) (b)
Figure 2.7. Growth of 'generations' 0, 1, 2, 4, 6, 8 of infectives in a
simple epidemic with (AT, /) = (1000,1), 0 = 0.0001, on 0 < 0N't < 10;
part (a) is an enlargement of the start of (b). The number at the right-
hand end of the curve shows the generation number; generation 6 is the
modal generation for large t.
yo(uo)W(uo)W(u1)...W(uj)duodu1...dur
0<uo<ui<-"<u3<t
Here, yo(u) = / (all u > 0), and the (j + l)-fold integral is symmetric in its
arguments, so with
we have
(2.5.4)
Equation (2.5.4) shows that the sequence {y3\t)} is proportional to the terms
of a Poisson distribution with increasing mean In U(t) (cf. Daley, 1967a).
For small t (meaning, 0N't = o(l)), lnU(t) « (1 - I/N')0Nrt. Figure 2.7
illustrates the growth of the 'generation sizes' in the case (JV, /) = (1000,1),
0 = 0.0001 and /?AT't < 10. Part (a) of the figure shows the behaviour for
smaller t.
2.5. Generation-wise evolution of epidemics 41
where y = YlJLo Vj as before, and y = (f3x — ^)y. Thus, (3x = 7 -I- y/y, and
the second of the equations at (2.5.5) becomes
or equivalent ly,
, j ( t ) _
J'
Thus, as for (2.5.4), the relative frequencies of the generation types among
the infectives at time t are proportional to the terms of a Poisson distribu-
tion. However, since from (2.5.5),
1 t t l a |
.,(.)-7 / * * ( . ) * . - 7 / / ' . - ' • ' ;| l ) /
"'' < ., (2.5.7)
^0 Jo J'
we see that for the removals at time t, the numbers in the various generations
are proportional to the terms of a mixed Poisson distribution. The mixing
42 2. Deterministic Models
10 15 20
(a) (b)
Figure 2.8. Average generation number among removals in a general
epidemic ( ), deaths in a linear birth-and-death process (—•—•—)
and 'aged' infectives in a simple epidemic ( ) with (AT,/) =
(1000,1), (3N't < 20 and (a) 7 = 0.01 (hence, p = 100, Ro = N/p = 10),
and (b) 7 = 0.05 (hence, p = 500, Ro = 2). See Section 3.5 for discussion
concerning the Basic Reproduction Ratio RQ.
0
*) du. (2.5.9)
ln[N/ x(t)} Jo
This function is conveniently studied numerically by means of the three d.e.s
rh = m (2.5.10)
The function m{t) is plotted in Figure 2.8 together with the corresponding
functions from the simple epidemic and linear birth-and-death processes
(see Exercises 2.6 and 3.5 respectively).
We see that the ultimate 'average' generation number of those infected
in a general epidemic is larger than the corresponding 'average' in a simple
epidemic. This occurs because in the former the number of infectives is
reduced by removal, thereby slowing down the rate of spread of infection
2.5. Generation-wise evolution of epidemics 43
(so, the pool of susceptibles remains larger). Thus the difference in the
ultimate averages reflects the later occurrence of infection in the general
epidemic as against the simple epidemic. These interpretations are further
borne out by the comparison of the two sets of averages for the same initial
conditions and pairwise infection rate, but increased removal rate of (b)
over (a) in Figure 2.8.
It follows from (2.5.7) that Zj° = lim^oo Zj(t) equals the limit as t —•oo
of the given integral. Recall from (2.3.6) that x(t) and y(t) are related by
dx x_
N
fN ^__ fN
i\~L 0xy"Jx
Writing v{x) = /Qr W(u) du, the limit of the integral (2.5.7) after a change
of variable from t to x gives
Dunstan (1982) used Daley's (1967b) result (2.5.7) to show that for fixed
/ and j and large iV, with N ^$> p,
(2.5.15)
irrespective of the infection and removal rates /3 and 7. This right-hand side
is the same as limt_>oo Vj(t) in the simple epidemic (see (2.5.4)), and hence
is a limit property of {z?°} when p I 0. To explain this coincidence, observe
from the d.e.s at (2.5.1-2) and (2.5.5) that the simple and general epidemic
models with the same initial conditions and pairwise infection rate /?, have
44 2. Deterministic Models
100
30 30
Figure 2.9 shows how the odd-numbered Zj(t) evolve in general epidemics
for the same models and initial conditions as in Figure 2.8.
Exercise 2.7 sketches another model for the generation-wise evolution of
an epidemic in which successive generations are non-overlapping.
2.6. Carrier models 45
w(t) = 0 \
x(t) = x(t0) exp ( - (f3w(to)h')(l - e - ^ - 1 0 ) ) ) .
Carrier models have a much simpler structure than the general epidemic
model, due to the fact that the process w(t) in the model is independent of
x(t); x(t), however, depends on the evolution of w(t). See also Exercise 2.8.
46 2. Deterministic Models
where j3 denotes the feeding rate per mosquito and 7^ the infection trans-
mission rate from an infective of type i to a susceptible of type j (i, j = 1,2).
In terms of proportions xk = Xk/Nk = 1 — yk (k = 1,2),
jV2
V\ = 0l2ij^V2Xi - /Ziyi = fii(a1y2x1 - 2/1),
2/2 = /? ( )
2.7. Endemicity of a vector-borne disease 47
0.5 0 0.5 1
(a) threshold just exceeded (b) threshold well exceeded
Figure 2.10. Level curves 2/1 = 0 and 2/2 = 0 in the 2/1-2/2 phase-plane.
Thus
2/2 = (2.7.2)
determine large changes in the equilibrium level of the other. Under these
circumstances the disease, while endemic, is still subject to large changes
in infectivity resulting from relatively small changes in the infestation level
of the mosquito population. The second extreme corresponds to the case
where a significant change in the level of either y\ or 1/2? determines a sig-
nificant change in the other; this corresponds to an endemicity whose level
is sustained quite steadily.
N
(a) PN < 1 (b) 0N > 1
Figure 2.11. Discrete time logistic epidemic.
When (3N > 1, it follows from (2.8.2) (cf. Figure 2.11(b)) and the con-
cavity of / that if (3~l < y* < N, then f(y*) > N, while if 0 < y* < /3'1
then (f3N)y* < f(y*). Consequently, for t such that maxo<s<t{y*} < (3~l,
< 2/t+1, and therefore for some t sufficiently large, t = T say, we
have > (3~l > ?/f, and thus >
N. This means that for such
Vs = (2.8.3)
Thus, for /?iV > 1, the entire population contracts the disease in a finite
time T-hi.
This type of behaviour is expressed formally in the theorem below: there
is no such analogous behaviour of continuous time deterministic simple epi-
demics. The result is a consequence of the discretization of time.
Theorem 2.3 (Threshold Theorem for Discrete Time Logistic Equation).
For an epidemic described by the logistic growth process in discrete time as
in equation (2.8.1) with an initial number of infectives yo in (0, N), either
(a) (3 < N~l, and at any time t = 0,1,... after the initial infection, a
positive number of susceptibles remains in the population (i.e. yt < N); or
(b) j3 > N~x and no susceptibles remain after some finite time T -f-1.
Spicer (1979) used a discrete time deterministic model for predicting the
course of influenza epidemics in England and Wales. It is an analogue of the
continuous time general epidemic model of Section 2.3. Let xt, yt denote
the numbers of susceptibles and new infectives on day £, and (3 the pairwise
infection rate as defined at (2.8.1). Removal of infectives occurs daily, with
the proportion ^ of the new infectives on any given day t remaining in the
population j days later. Assume that 0 < ipj < 1 (all j = 0,1,...).
50 2. Deterministic Models
30-
20-
10-
0-
0 8 10 12
Figure 2.12. Weekly deaths from influenza and influenzal pneumo-
nia in Greater London, 1971-72. (Data from Spicer, 1979.)
=Yt- {Yt+l -
i=o
Spicer used the estimated proportions ^ o , . . . ,^5 — 1-0, 0.9, 0.55, 0.3, 0.15
and 0.05, with xjjj = 0 for j = 6,7,..., to predict the progress of influenza
epidemics in Greater London in 1958-73, and hence to estimate the number
of deaths (as a proportion of the number of infectives) from influenza and
influenzal pneumonia in that period. The important factor was to obtain
an estimate of /?#o; this was found to range between 4.5 and 7.7 in the
period 1958-73. Figure 2.12 illustrates the quality of the fit of the model to
the data (see also Section 6.1 and Exercise 6.1 below). If we assume that
deaths are proportional to the number of new infections in any week, then
the plotted points and fitted curve of Figure 2.12 are proportional to the
epidemic curve defined around (2.1.4).
Saunders (1980) used a slightly different approach to study the spread
of myxomatosis among rabbits in Australia. After infection, a rabbit goes
through a latent period of about 7 days, followed by an infectious period of
about 9 days during which symptoms can be observed. Nearly all infectives
2.8. Discrete time deterministic models 51
die, so that removals in this case are due to death. The model postulates
that a rabbit infected on day t enters and remains in the latent period until
day t + t—1 (£ = 7). It then becomes infectious on day t + £ and remains
so until day t+£+k-l (fc = 9), after which it dies.
Let ?/i (t) denote the number of rabbits first observed to be infectious on
day t; these were infected on day t — £, and remain infectious and in the
population until day t + k — 1. Hence on day £, the total number of infectives
is
fc-i
y(t)= Y,yi(t-j), (2.8.5)
i=o
since any rabbits becoming infectious before day t — k -f 1 have died by day
t. Starting with XQ susceptibles at time t = 0, the number of remaining
susceptibles at time t is
t+t
iO'), (2.8.6)
£) = (3y(t)x(t) (2.8.8)
i.e. (3 has been replaced by a/p(x(t)), on the assumption that each infected
rabbit makes contact with a constant number a of rabbits each day. An
estimate of a gave a = 0.34, and this change with p(n) = y/n, resulted
in better agreement with data, as can be seen from Figure 2.13(b). Saun-
ders also used a goodness-of-fit criterion to discriminate between possible
density-dependence functions p(n) = c, y/n and n, where c is a constant.
52 2. Deterministic Models
120-
100-
80-
\ \
60-
40-
20-
U 1 1 1 1 1
0 10 20 30 40 50 0 10 50
(a) (b)
Figure 2.13. Rabbit population affected by myxomatosis: observed
data ( ) and fitted models ( ): (a) homogeneous mixing, and
(b) population dependent model. (Data from Saunders, 1980.)
with Xt + yt -f- zt — TV, and /?, 7 are the infection and removal parameters
which may differ from those in the continuous time model. More correctly,
to avoid negative values of xu one should rewrite the first two of these
relations as
and restrict the value of 7 to (0,1). This model was studied in detail by
de Hoog, Gani and Gates (1979) who derived an analogue of the classi-
cal Kermack-McKendrick theorem, but it is somewhat more complex than
Theorem 2.3; we refer the reader to the 1979 paper for details.
Enough has been said to indicate that while it is mathematically more
convenient and elegant to deal with deterministic models in continuous time,
their discrete time analogues are used fairly often when it comes to analysing
data collected daily, or at other serial intervals. The examples sketched
above are intended to indicate both the usefulness and the complexity of
discrete time deterministic models.
2.9. Exercises and Complements to Chapter 2 53
where Z = X/"Li ZT ls
^ n e u n iQ u e positive root of
where ln[] here denotes a vector with components ln[l + (yj0 - z^ n ) )/x j 0 ],
much as below (2.4.4), as a possible analogue of the equation for ^ ~ ' n - 1 \
When zj~ oo) = lim n-oo z{rn) exists and differs from ^ (oo) = limw.->oo^n),
define analogues of i at (2.5.18) by ij = [^oo) + l^'^O/ATj where N'j =
Xjoexp(—[B 7z^~oo^]J). As an analogue of zp consider the vector B " 1 ln[p]
where ln[p] is the vector with components \n[yj/0jj].
2.6 For a simple epidemic model, the analogues of the j th generation removals
Zj(t) in a general epidemic model (cf. (2.5.7)), are the jth generation in-
fectives zs3(t) who have been in this state for an exponentially distributed
time with mean I/7, i.e. 'aged infectives'. These zs3(t) satisfy the d.e. Zj =
7(yj ~ zj) where quantities with superscript s refer to the simple epidemic.
Show that the analogue ms of the function m at (2.5.8) equals Ms/zs, where
Ms = ^2 •3zj> illustrated in Figure 2.8, satisfies the d.e.
= 7M + l ln
n y. y
2.7 The following is a prescription for using continuous time methods to track
the generation-wise evolution of an epidemic process when the latency period
is large relative to the infectious period, and successive generations of infec-
tives are non-overlapping. Introduce functions {X n (), Y n () : n = 0,1,...}
satisfying the d.e.s
(a) Show that Xc» = limn_>oo Xn(tn) exists and satisfies equation (2.3.7).
(b) Prove that, if / is allowed to vary, then limjio[Yi(0)/>o(0)] = 0N/-y = RO
as in Figure 2.8 and in Section 3.5.
(c) Compare Fo(0)H \-Yn(0) numerically with Z0(oo)H \-Zn(oo), using
the solution of the generation-wise evolution results in Section 2.5, for some
suitable 0/*y, N and /.
2.9. Exercises and Complements to Chapter 2 55
(d) Interpret the sequence {Zn(oo)} in the context of a discrete time general
epidemic model (cf. discussion at end of Section 2.8 and Chapter 4).
2.8 In the carrier model described by the d.e.s at (2.6.1), suppose that a (small)
proportion a of the susceptibles infected become carriers, so that the second
of the equations is now
dw „ 7 d In x dx
[Observe that with this modification, the carrier model is similar to the gen-
eral epidemic model.]
2.9 Suppose that outbreaks of an epidemic take place in a given region (or village)
each year. Consider how to combine data from several years so as to display
features of the evolution of the disease; how could these data be used to
estimate parameters of a suitable model? As a possible model, assume the
same population size for different years and the same parameters j3 and 7 in
the case of the general epidemic of Section 2.3 or its discrete analogue (cf.
En'ko's work in Dietz (1988), En'ko (transl. 1989), and Exercise 1.3 above).
Stochastic Models in Continuous Time
In this chapter and the next, perhaps the most important of this book,
we describe the evolution of an epidemic as a stochastic process. Such a
stochastic approach is essential when the number of members of the popula-
tion subject to the epidemic is relatively small. While deterministic methods
may be adequate to characterize the spread of an epidemic in a large popu-
lation, they are not satisfactory for smaller populations, in particular those
of family size.
We shall use the tools of non-negative integer-valued Markov processes in
continuous or discrete time, retaining much of the approach of the previous
chapter in the dissection of the population. However, we no longer use
continuous functions to approximate the evolution of the epidemic; rather,
we consider a closed population of individuals, each of whom is classified
as either a susceptible, an infective or (except for the 'simple' epidemic of
Section 3.1) a removal. This S-I-R classification1 is possible because we
regard the epochs where individuals change their classification as randomly
determined points in continuous time, or asfixedpoints in discrete time. For
mathematical convenience but with sufficient simplistic realism, we assume
that the processes involved are Markovian. The reader is presumed to have
some familiarity with the elements of Markov processes on countable state
space, as for example in Bailey (1964), Cox & Miller (1965), Karlin & Taylor
1
Waltman (see Waltman and Hoppensteadt (1970, 1971)) introduced this notation to
designate both the possible disease states of an individual, and the possible progression
of disease in an individual; here, it denotes susceptible —> infected —> removed. A simple
epidemic as in Chapter 2 or the first two sections below is an S-I epidemic. For an S-I-S
disease like malaria, a simple model envisages that an individual is either susceptible,
infectious or (after recovery) susceptible again. Superinfection is not allowed in this
simplest version in which individuals are either infected to the same extent, or susceptible;
a variant of this model would allow superinfection. Hethcote (1994) surveys deterministic
models more extensively.
56
3.1. The simple stochastic epidemic in continuous time 57
Xt
0 tx tjv-l t^-2 ts £2 ^1
Figure 3.1. The death process {X(t) : t > 0}.
where X(t) and Y(t) denote the numbers of susceptibles and infectives at
time t, with initial conditions (X, Y)(0) = (N, I) where / > 1. We assume
that {(X, Y)(t) : t > 0} is a homogeneous Markov chain in continuous
time, with state space the non-negative integers in Z 2 satisfying (3.1.1)
and N > X(t) > 0. The only non-zero transition probabilities occur for
0<i < N, j = N + I -i and are
former being a death process and the latter a birth process on the integers
concerned. This enables us to write down certain properties of the model
fairly easily. For example, concentrating on {X(t) : t > 0} as a pure death
process and referring to Figure 3.1, it is clear that X(-) evolves by unit
decrements at the epochs ts, £/v-i, • • • ,t\ say, where
N
T
i (1<J<JV), (3.1-3)
N N N j
1 1 1 [ 1 ~
v
1=3
/ N(N 4- I -I)
Here 7/v,/j = 7iv — 7 / - i + 1N+I-J — lj-i with the sequence {77} defined
by
3 1
7o = 0, 7j=53T-lnj (j-1,2,...), (3.1.5)
z
t=i
with X(t) = 0 being the absorbing state. Then the forward Kolmogorov
equations for the state probabilities pi(t) = Pr{X(t) = i \ X(0) — AT},
(0<i< iV), are
pN = -/3NIpN(t),
Pi = -0i(N + / -
(o.l.o)
- 1 ) Oj Po(t)
so that
P = -0AP(t). (3.1.7)
Here the square matrix A of order N + 1 is the difference of two matrices,
one diagonal and the other subdiagonal:
A = diag(JV7, (N - 1)(7 + 1),..., 1(7 + N - 1), 0)
- subdiag(AT7, (N - 1)(7 4-1),..., 1(7 + N - 1)).
Note that both the diagonal and subdiagonal matrices may have repeated
elements, as in Example 3.1.1 worked out shortly.
The solution of this matrix equation (3.1.7) is known to be
P(t) = e - ^ P ( O ) , P(0) = (1,0, • •
•,0)' = eN+1, (3.1.7')
but explicit results may not be easy to derive by this method. A much
simpler approach is through the Laplace transforms pi(0) = /o°° e~etpi(t) dt,
Re(0) > 0, 0 < i < N which, applied to equations (3.1.7), yield
f
I
Jo
- p dt = 0pi(O) = -0i(N + 7 - i)pi(0)
PN(0) =
0NV
We can solve for Pi{6) recursively, starting with PN(6) to obtain the result
where some factors Q-\-j3j{N+I—j) appear twice for pi(6) with z < ^(
Example 3.1.1. To illustrate the procedure, suppose that (N,I) = (4,1),
13=1. Then
iy v ; Fzy
4' " (0 + 4)(0 -f 6) ' > ~ (6 + 4)(0 -f 6)2 '
Note the double factors in pj{6) for j = 0,1,2. Inverting these Laplace
transforms shows that for all 0 < t < oo,
6 ( e e 2 t e ) ,
= 36( - e"4< + te~4t + e" 6i + te"6'), ( j
For details of similar calculations see Bailey (1963; 1975, Chapter 5).
3.1. The simple stochastic epidemic in continuous time 61
(6 + 2c)(4 + e) ( )
J'
(6 + 3c)(6 4- 2c)(4 + c)
Pi W ^-7-:
(2 + 2e)3e (2-f2c)(2-c)
e-(6+2c)t _ e -
(3.1.13)
Recovery of the solutions at (3.1.11) for e = 0 is a straightforward matter
of taking limits. By inspection, as e —• 0, p\(t) —•> e~4* and p|(£) —>
2(e~ 4i — e~ 6t ). In the case of p|W> taking limits for the last term yields
lim e~6t(e~3€t — e~2et)/e = —te~ 6*
so that
- e )- te~6t).
Similarly we can check that p\(t) —> pi(t) as at (3.1.11), and then appeal to
pe0(t) as at (3.1.13) to complete this derivation of the full solution (3.1.11).
We use this simplifying device later when considering the probability
generating function method in Section 3.2. The solution (3.1.13) for small
6 can be considered as a close approximation to the exact solution (3.1.11).
62 3. Stochastic Models in Continuous Time
jff .-AM* =
This p.d.f. is related to the state probabilities pj(-) at (3.1.7) by
Kendall (1957) used the transform at (3.1.15) to obtain the limit distri-
bution of ^i in the case / ? = 1, / = 1, N —> oo. Using the transformation
ft (l + ") "1 2
[T(l + 0)}} 2 (N - oo); (3.1.20)
<p(z,t) = E(zxM) = Y^ Pf
{ * ( 0 = 3 I X{fy}zj (M < 1), (3.2.1)
64 3. Stochastic Models in Continuous Time
equations (3.1.6) are replaced by the same relations with (3 = 1 and t'
in place of t. For convenience below, we write t = tf. Now multiply the
equation for dpj/dt by zj and sum over j . Writing
this gives
j=o
(3.2.3)
Recall from (2.1.1) that for the deterministic model of a simple epidemic,
with the change of time scale used here, the function x(t) denoting the
number of susceptibles at time t satisfies
x = —x{N + I — x).
3.2. Probability generating function methods for Markov chains 65
where
= dZ(£) „= d2Z(z) . = dT(t)
dz ' dz 2 ' dt
and A is a constant. The simpler relation at (3.2.5) gives T = —AT, so
that T = e~ At . However we know from Section 3.1.2 that this result will be
correct only if the initial number of susceptibles N is replaced by Ne = N+e
(0 < e < 1) in the eigenvalues of the matrix A of (3.1.7). Assume this is
done: then from the rest of (3.2.5),
Ziz) -
[Z)
^ r(a)r(b)T(c+k)k\
The eigenvalues Xj must be such that F(a, 6; c; 2) is a polynomial in z of
degree not greater than iV, which means that a or 6 must be a negative
integer such that a + b = —(N f + /) is satisfied. Thus the Aj must be of the
form
A, = j(Ne + / - j) (0 < i < TV), (3.2.7)
with a = —j say and b = — (iVf + J — j ) . It follows that
TV
y>0M) = ^ a i e ~ J ' ( N e + 7 " J ) ^ ( - ^ i - AT, - /; 1 - iVe - / ; z), (3.2.8)
66 3. Stochastic Models in Continuous Time
£ (x, r)(o) =
0<i<N
-0cov(X(t),Y(t)),
Pcov(X(t),Y(t)),
respectively. Neglecting the covariance terms and putting (x(t),y(t)) =
(EX(t), EY(t)) yields the d.e.s. of the deterministic model of Section 2.3.
Equation (3.3.5) is more simply studied in terms of its Laplace transform
/•OO
, Pn(0)= e-etPij(t)dt.
0<p(z,w;O) - 2 W=
p)w~p}^,
dw
, (3.3.8)
3.3. The general stochastic epidemic 69
A — + 0F = u/e7v+i, (3.3.9)
where F = (f^ //v-i • •/o)' • and e^+i is the (AT -f l)-row vector with 1 in
its first row and zeros elsewhere, as at (3.1.9). With IJV+I denoting the unit
matrix of order N + 1, the square matrix A = A(w) of order N -f 1 equals
N+I j
F(w;0) = Y^ — FiJ\O;0), (3.3.11)
or
F'(O;0) = (0/P)F(O;0). (3.3.12)
Differentiating (3.3.9) with respect to w yields
P\
(3.3.17)
These equations yield a first-order recurrence relation when expressed in
the matrix form
9
f Fk +1 )(0;6>n
{ F<»>(O;0) J
A'(O) + «Af+i (J2)A"(0)^j f FO)( 0 ; e) ^j I\6jt (eN+1)
plN+i 0 J lpW-D(0;^)J ( / - j ) ! I 0 J'
(3.3.18)
0)
this relation is valid for j = 0 , 1 , . . . , N + I when we define F< (0; 9) =
F(O;0), F < - 1 ) ( 0 ; « ) = 0 .
We see that for j = 0 , . . . , / - 1,
=
where for k < j we define the product Jl^fc ^* EjEj-i • ••Bfe+iBfc. For
j = / , however, we have
r F C + ^ O ^ ) 1 _ r r B . f F(o;*) ^| /! r e w + 1 )
Bl
I F(')(0;») J ~ l = l I 0 J 7 1 0 J' (3-3-19)
and for j = 1 + 1 , . . . , / + AT,
F(O;0)^| /! A few+1)
Bi (3 320)
i—0 0 J ~ 7 11 [ 0 J -
3.3. The general stochastic epidemic 71
AT+7+l)( 0; e)
( F F ^N+/)(0;6>) )
N+I
)) \ /' (
0
i=0
Hence,
[
iV+7 -i y, r iV+7 -I
Bi Bi
where [*]JV+I
n j 1
F(O;0)=
7 [ n j
denotes the truncated (N + l)-square matrix consisting of the
first TV + 1 rows and columns of its argument. Now from (3.3.18), because
jA'(Q) -f 0IJV+I is a diagonal matrix and A"(0) is a subdiagonal matrix,
both with non-zero elements, any product n i = o ^ ls a l° w e r triangular
matrix with non-zero eigenvalues for Re(0) > 0. Hence its inverse exists,
and thus
(3-3-21)
w,9) ) _N¥f1w*
F(w,9) ( FW(O;0) \ (3 3 22)
«; 9)dvj- j ^ J ( PW-D(O; 9) J -"
WJ 1 F(0; e) WJn 1 ejv
Z. In*- * 1
f P J[ o J .f ) T ' [n^ B 1 f +!
J),
72 3. Stochastic Models in Continuous Time
The full solution to the 2-person epidemic may now be obtained from
(3.3.22) as the upper left 2 x 2 matrix in
so F(w; 9) equals
8(1 +p + 6)
2
0
)
(3.3.24)
3.3. The general stochastic epidemic 73
1 - pt(z - 1)
Note that the result for N = p can be readily derived from that forN^p
by setting N = p + e in this case and letting e —> 0. We are interested
in the probability of extinction of Y(i), i.e. in lim^oo Pr{Y(t) = 0} =
lim^oo </?(0, t). Prom the first of equations (3.3.25) we see that the be-
haviour differs between N > p and N < p, namely
r=f-i N
< * < " > ")>
]imip(0,t)= ' " *~ ~ '
74 3. Stochastic Models in Continuous Time
with Pn the probability that the final size of the epidemic is n, as given
below (3.3.3). We can bound the component Y{t) in the epidemic process
{(X, Y){t) : t > 0} by F(f) in two bivariate processes
and
Now the forward Kolmogorov equations for the birth-and-death process with
general birth parameter A are
where for 0 < w < 1, rji > 772 > 0. The general solution is thus of the form
<p(z,w;t) =
where {Pn(X)} is the probability distribution for the total number of initial
susceptibles ultimately infected, for a specific birth parameter A.
Now we know from (3.3.31) that for N —> oo,
( P < A)
lim < ,1; t) = V Pn(A) = i ( A ) ' (3.3.33)
U (p>A),
so that
/
f ; [ ( ^ ) ] (3.3.34)
n=0
or equivalently
(3.3.35)
Pn=limGPN_nj0(t) (3.4.1)
where pi(6) = /0°° e~etpi(t)dt, yields this probability TT,. In terms of the
generating function fi(w,8) of transforms defined at (3.3.7), this means
that, since for all i, \imt-+ooPij(t) = 0 for j =£ 0. then
Pn = lim 0pN-n,o(0) = lim 6fN-n(0,0).
B—*0 0—>0
Thus
f IN (0,9)
Pi
= lim 6 = lim0F(O,6>), (3.4.2)
{PN) I fo(O,0) )
where F(0,0) is given by (3.3.21). From that equation, recalling that
Bo=
we see that
n
iV+I
BA 6F(0,0) =I\\
/+N
T\ Bi
AT+I
1 / N+l
lim 0F(0,0)
(3.4.3)
n 0
liV+l
the products appearing in (3.4.3) on the right- and left-hand sides are,
respectively,
-111 81 0.0833
p = 6 -38 16 0 0.1042
0 2 -7 1, > 0> w 0.5625 >
Observe the presence here of both positive and negative terms. In larger
matrices this is a source of numerical instability, assuming that there is the
capacity to handle the larger size (e.g. square matrices that are O(100)).
the natural process to consider is the one embedded at the jump epochs
of the process. Except for certain pathological cases which do not arise in
the models we have considered, the continuous time and embedded jump
processes both have the same long-term behaviour, such as the development
of major and minor epidemics, and the ultimate size of the epidemic. This
section reviews briefly some results of this kind. The particular usefulness
of embedded jump processes is that they are well-suited to numerical work.
To make these ideas explicit, we use {X(t) : t > 0} in this subsection to
denote a Markov process in continuous time on the countable state space
X = {i,j,...} having time-homogeneous transition probabilities with rates
(Qij)- We assume that X is determined by its initial state and the rates q^
(jf ^ i) and qi = Yli^j Qij ~ ~Q™ (^J ^ ^0- One dichotomous classification
of states is that any state i is either absorbing or non-absorbing, depending
on whether qi — 0 or qi > 0.
Given a sample path for X for which X(0) = i, the time 7$ = inf{£ > 0 :
X(t) ^ i} is well-defined, finite or infinite, with T* < oo a.s. if and only if
qi > 0, i.e. i is non-absorbing. Then Ti is an exponentially distributed r.v.
with mean l/q{. Indeed, supposing without loss of generality4 that the sam-
ple paths are right-continuous with left-limits, these paths can be described
by a sequence of intervals of lengths T^n\ with epochs tn determined by the
partial sums
(n = 0,l,...). (3.4.4)
For these,
X(t) = in (tn<t< tn+1, n = 0,1,...), (3.4.5)
for some sequence of states i 0 , t'i,..., it being understood that if any such
in is an absorbing state the associated interval length T^ is infinite and
tr = oo for all r > n + 1. We can now define the embedded jump chain by
Also, for all sample paths for which Xn = in, it follows from the strong
Markov property that T^n\ when finite, is exponentially distributed with
mean l/qin.
4
To be more precise mathematically, suppose that the process is separable, etc.
3.4. The ultimate size of the general stochastic epidemic 81
Markov chains with well-ordered sample paths are easily studied using
this formalism. Define TT^ = Pr{X(t) = j for some t | X(0) = i). Then
the forward Chapman-Kolmogorov equations for the jump chain give the
relations
Kik =Pik+ Yl KijPjk' (3.4.8)
In this equation, all quantities on the right-hand side are positive; this is the
reason the equation is the basis of a relatively stable numerical procedure
for the computation of the probability
tik = inf{t > 0 : X(t) =fe;X{u) = k for some u > 0 | X(0) = i}. (3.4.10)
tik=Uj+tjk. (3.4.11)
Define
More complex relations for higher moments can be developed similarly (see
Exercise 3.8).
3.4.3 The total size distribution using the embedded jump chain
It follows from the development of the general stochastic epidemic model in
Section 3.3.1 that an embedded Markov chain which is conveniently denoted
82 3. Stochastic Models in Continuous Time
Pn PT
0.015- 0.015 -
0.01- 0.01-
0.005 - 0.005 -
0 0
I I I I I I
0 20 40 60 80 0 50 100 150
(a) (b)
Figure 3.2. Final size distributions of epidemics with p = 150 and
(a) (TV, /) = (100,1), (b) (iV, /) = (200,1).
by {(X n , Yn) : n = 0,1,...} takes place on the finite region XNj of the two-
dimensional integer lattice defined by
f3ij + 7 j (3.4.14)
13
'J ; 0ij + yj i+P
Consequently, writing p{iJ) = Pr{(X n , Yn) = (i,j) for some n \ (X0,YQ) =
(-/V,/)}, it follows from (3.4.8) that
T
N-n,0
0.08- 0.08-
0.06- ..••**
0.06-
0.04- 0.04-
0.02- 0.02-
0- 1 1 1 1 ° I I
() 20 40 60 80 50 100 150
(a) (b)
Figure 3.3. Conditional mean durations of epidemics with p = 150 and
(a) (N,I) = (100,1), (b) (N,I) = (200,1). Abscissae give the number of
susceptibles infected. The means were not computed for large n for which
Pn = pjy-n.o < 10~9. Cf. Exercise 3.13 and Daley et al. (1999).
We have also computed the conditional mean durations of these epidemics
with /? = 1 for those final sizes for which the probabilities exceed 10~ 9 . In
the notation of (3.4.12), we have found ri0 = T(N,i),(i,o) for those i for which
P(i,o) > 10~ 9 . These conditional means satisfy the recurrence relations, for
{i, j) € XNI, starting from r^v/ = 0,
U>
and
where fiij = l/[j(p + i)] = 1/q^j) for the state (i,j) in the notation of
(3.4.13). They are illustrated in Figure 3.3 for the same two epidemics as
in the previous figure.
Threshold Theorem 3.1. Both the duration and final size of the epidemic
process are roughly the same as for the approximating birth-and-death pro-
cess.
The super-critical case is more involved. Here it is possible that only a
minor epidemic occurs, with probability approximately (p/N)1. The size of
such a minor outbreak is similar to that of a super-critical birth-and-death
process conditional on extinction. Prom the work of Waugh (1958), this
is exactly that of a sub-critical birth-and-death process whose linear birth
(death) rates are the linear death (birth) rates of the original super-critical
process respectively. Otherwise, the early behaviour of the epidemic pro-
cess is similar to that of a super-critical birth-and-death process conditional
on non-extinction, until the number of infectives represents an apprecia-
ble proportion of the initial number of susceptibles. If the deterministic
model of the epidemic is any guide, the evolution of the epidemic process
through a number of generations of infectives is still similar to the birth-and-
death process (see Figures 2.7 and 2.8 and Exercise 3.5), until the number
of susceptibles approaches the borderline value p between sub- and super-
criticality. At this stage, in terms of numbers of susceptibles and infectives,
the process follows approximately the solution of the deterministic epidemic
model (with E X(t) « x(t) etc., much as for the simple epidemic; cf. Exercise
3.1). The 'imprecision' of the approximation, which in model terms incor-
porates both the stochastic lag and variability about the mean behaviour
(~ deterministic solution), is approximately normally distributed if N and
p < N are both sufficiently large. This leads (Barbour, 1972, 1974) to the
ultimate number of susceptibles being approximately normally distributed
about the mean EX(oo) « NO, where NO is the number of susceptibles
surviving the deterministic epidemic, with a variance that can be computed
using Kendall's 'Principle of Diffusion of Arbitrary Constants' (cf. Daley
and Kendall, 1965, and Section 5.4 below).
If it happens that Ro = N/p is somewhat larger than 1 (e.g. 3 or more),
then, conditional on the occurrence of a major outbreak, the number of
susceptibles surviving the epidemic is approximately Poisson distributed,
still with mean NO where 0 is now much closer to 0 (cf. Daniels, 1967).
This number can be regarded as representing those individuals having a
'small' chance of avoiding infection. If NO is itself not close to 0, then the
closeness of a Poisson distribution to normality ties together this result with
the previous paragraph. Sellke (1983) proved Daniels's result by considering
a sequence of general epidemic models with parameters (Nk, Ik, pk). These
parameters are such that as k —> oo, all three diverge to oo in such a way that
3.5. The general stochastic epidemic in a stratified population 85
and the initial condition is that (X, Y, Z)(0) = (N, li, 0), where the m-row
vector li has 1 as its i th component and all other elements zero.
The process {(X,Y, Z)(£) : t > 0} takes place on a finite state space,
so it is completely specified as a continuous time Markov process when its
initial state and infinitesimal transition probabilities are given. We know
(X,Y, Z)(0) already, and the constraint (3.5.1) means that it is enough to
describe (X, Y)(t). We assume that for the strata i = 1,..., ra, with x and
y denoting m-vectors of non-negative integer-valued components,
(3.5.2)
where B = (Aj)- We can now write down the forward Kolmogorov equa-
tions, but they are too unwieldy to give an explicit solution, whether in
the time or transform domain. We have described this model here because
(a) it is easily studied via simulation techniques, (b) it is general enough to
capture inhomogeneity, and (c) it provides an adequate setting to discuss
the basic threshold phenomenon.
We first consider a method of approximating the behaviour of the process,
entirely consistent with Whittle's approach in Section 3.2. This approach
effectively keeps the size of the susceptible population fixed during the early
stage of the epidemic, so that the only changes occurring in the population
are in the numbers of infectives Y say. In the m-strata context we there-
fore approximate the process by a multi-type linear birth-and-death process
3.5. The general stochastic epidemic in a stratified population 87
), in which at time t there are Yj(t) (j = I,..., ra) j-type individuals, for
whom the birth and death rates are respectively YALI 0ijYi(t)Nj anc * lj%\
provided no Nj is small these birth rates differ little from YllLi Aj^i W^O W
for t small enough (cf. (3.5.2)). This process Y(t) is an irreducible multi-
type Markovian branching process in continuous time. For such processes
it is known (e.g. Athreya and Ney, 1972, §V.7.5) that the vector q whose
i th element
qi = Pr{ lim ^ ( t ) = 0 | Y(0) = 1*} (3.5.3)
Now qi = 1 (all i) is always a solution, but need not be the least positive
one. Also, the process is irreducible if and only if the matrix B is primitive,
meaning that for some integer n, which is at most equal to ra, B n has its
(z,j)th element positive. Because of irreducibility, either <ft < 1 (all i) or
else qi = 1 (all i), and the former holds if and only if
has a real positive eigenvalue; this is equivalent (see Exercise 3.9) to R > 1
where R is the largest eigenvalue of the non-negative matrix
* ^ ,3.5.7)
P
where i^o is as below (3.5.6), and the approximation is valid when p ^> 1,
i.e. (3 <C 7. While this condition is stated in terms of p rather than TV, the
case of super-criticality (i.e. RQ > 1) then implies that N ^$> I. The analogy
of the criticality condition, that Ro < or > 1, with that for Galton-Watson
branching processes, is immediate.
To apply this idea to the m-strata context, we refer to the initial infective,
in stratum i say, as a zero generation z-stratum infective. Let m^ denote
the expected numbers of first generation j-stratum infectives (j = 1,..., m)
to which it gives rise. We ask whether these various first generation in-
fectives in different strata, if each is placed in the same environment of N
susceptibles as the initial infective, would produce on average a larger num-
ber of second generation infectives, and so on. Now the expected number
of /c-stratum second generation infectives produced by an initial i-stratum
infective, on the assumption that each first generation infective is placed in
the same initial population of N susceptibles, equals Y^JjLi rriijmjk, i.e. the
(i,/c)th component of the matrix M 2 where M = (rriij). Continuing this
process, the expected numbers of j-stratum infectives after r generations,
for r not too large, equals the (i,j)th element of the matrix M r . Since
M is a non-negative matrix (indeed, it is primitive because B is primi-
tive) , Perron-Frobenius theory implies that each column of M r is like RQV
for large r, where Ro is the largest eigenvalue of M and v is the right-
eigenvector corresponding to Ro. It follows that the expected number of
infectives after r generations either dies away to 0 when RQ < 1, or else
grows exponentially for a while when RQ > 1, reflecting the possibility of a
major outbreak.
To identify M, note that an i-stratum infective is infectious for an expo-
nentially distributed time Ti with mean 1/7^ If the N susceptibles were not
otherwise subject to infection during this period T^, the number of j-stratum
3.5. The general stochastic epidemic in a stratified population 89
the approximation again being valid for 0ij <C 7*. The matrix M is
f}HN2h-\
/(A) = det
(3HCN2-\ 0
= det 0 0
X 0 0 ...
with Nj = and /3Hc = PH — Pc- Expanding the determinant,
+ • • • + PcNm(f3HCNi - A) F J (J3HCNi - A)
P
+
i - A)) \
and non-zero for A > max; PncNi, SO any zero of the second factor in that
region is a candidate for Ro. Now this second factor is strictly increasing in
A for max* 0nc^% < A < OO, ranging from —oo at the left-hand end of this
semi-infinite interval to -f 1 at -foo, so
y ( )
fe i-X
has exactly one zero there. Therefore Ro equals this zero, i.e.
mfoAT/jr
K
-RfaN/j' '
where N = Y1T=\ N»/ra; equality holds if and only if either 0H = Pc or
Ni = Nx (all i). Hence, when 7 > (3H > (3c,
(3516)
(see Exercise 3.11). For any other household where there are initially i
susceptibles, the probability that j susceptibles remain after the random
time T is Pij((3c)- Regard the household of the initial infective as being
additional to the initial population of N susceptibles as described; then the
expected number rrtij of j-type infectives produced by this initial i-type
infective is given by
(3.5.17)
r=l
say. Define M = (m^). Then using the same argument as above (3.5.8),
the expected number of second generation fc-type infectives (i.e. infectives
in households with k susceptibles) equals the (i,/c)th element of M 2 . We
3.5. The general stochastic epidemic in a stratified population 93
conclude that, with n' = maxi<r<m(7Vr — 1), the Basic Reproduction Ratio
.Ro for the process is the largest eigenvalue of the (p! + 1) x (n' + 1) matrix
m0 m\ ... m n' \
^H
mi
M = i +mi VTL2
^/o ( ^i ^2
(3.5.18)
Explicit computation of this eigenvalue does not appear possible in general.
We can compute the limiting value of Ro when /3H ^> 7 because then
p ^ ^ fijo a n d i?o is the largest root of /C(A) = 0, where
mo — A mi vri2
1 + 7Tio i —A rri2
/c(A) = det 2 + mo mi m2 — A
in' - A
r
mo — A m i rri2 - •. rnn>
1+ A -A 0 ... 0
= det 2 +A 0 -A ... 0
n' 0 0 ... -A J
say. (3.5.19)
(3.5.20)
A= (3.5.21)
' + 0c '
because
A=
where terms in the inner summation are zero for j > ATj, and the inner
sum equals the expected number of infectives in a given household of iV»
94 3. Stochastic Models in Continuous Time
in which the inner sum is expressible in terms of the same indicator r.v.s as
so that
and
E[W(t) | W(0) = b) = fcT7* - tu(t)
where iu(£) is as in the deterministic carrier model of Section 2.6.
To analyse the process {X(t) : t > 0}, assume that {(X, W)(t) : t > 0} is
a bivariate Markov chain with transition probabilities
The forward Kolmogorov equations for the process are given for 0 < i < n,
0 < fc < 6, by
= P(i + l)fcpi+i,fc - ()
dc
and pn+i,fc = Pi,b+i = 0 when either i or fc lies outside its respective range.
Use the same transformation tf = (3t as at (3.2.2), and set p = j/(3. Then
for 0 < i < n, 0 < fc < 6, the forward Kolmogorov equations become
with the initial condition pnb(fy = 1- For convenience, we use t below for
this new t1.
We discuss two methods for solving this equation: the first is the p.g.f.
method discussed earlier, while the second is based on a method of Puri
(1975) for a Markov process developing under the influence of another pro-
cess. Such a process could be described as a doubly stochastic Markov
process by analogy with the previously named doubly stochastic Poisson
process.
96 3. Stochastic Models in Continuous Time
the p.d.e.
t Z1 V V
Z _ P(l -1;) A V _
11 Z)
~Z~ v W'~~h ( j
&___Z3_ V _ A
V
-(-r
Now we know that </?(z, v, f) must be a polynomial in z and v, of degree
n in z and 6 in v. Hence j must be an integer in the range 0 < j < n, and
so too must 0 < X/(j + p) < b. Corresponding to a particular value of j ,
the eigenvalue A^ will be
It follows that
n b
Y^'^/C', t / , LJ — 7 7 L-jf^Kj yL — Zl I C— ~ I , I O.D.U)
3.6. The carrier-borne epidemic 97
where we now need to find the values of the coefficients Cjk- We do this by
setting t = 0, when
<p{z,v,0) = znvb =
j=0 fe=
j=0 fc=0
whence Cj^, being the coefficient of £J7/fc in the expansion of the left-hand
side, can be identified as
and
(3.6.9)
It follows that
= i | X(0) = n} =
C::)[%]
98 3. Stochastic Models in Continuous Time
and
For the deterministic model of Section 2.6 the analogous function is x(t),
given at (2.5.4). Rewrite it, after changing the time-parameter to make it
consistent with the notation used in this section, as
x(t) = n exp ( —
Then
lltTI T * i / l TDf* I r ^
11x11 JU\V I — ILKZ \
t—• OO
where the inequality, a trivial consequence of e ly/p > 1 + 1/p, shows that
the expected ultimate size of the stochastic epidemic is smaller than the
ultimate size of the deterministic epidemic. More precisely, we can show
that lim^oo (E[X(£)] - x(t)]) « 6/2p 2 , which is o(l) for small to moderate
integers b and somewhat larger integers p. See Exercise 3.12 for more detail.
As in the case of earlier models, we are interested in the number of initial
susceptibles that are ultimately infected. For this, we can readily obtain
the probabilities P^ = \imt-^oo Pn-k,•(£) of the size k of the epidemic from
(3.6.10), namely
= (I) B- 1 )^
Let T be the duration of the epidemic and F its d.f. Then F(t) =
Pr{no further infection after t} is given by
F(t)=Po.(t)+p.o(t)-Poo(t),
where the terms on the right-hand side can be determined algebraically from
(3.6.10), (3.6.1) and (3.6.8), respectively. Hence,
(3.6.13)
3.6. The carrier-borne epidemic 99
In the particular case 6 = 1 this expression simplifies and yields for example
P Jo P Jo
(Bailey, 1975, p. 196). For integers b > 2, the complete beta integral here
is replaced by one or more incomplete beta integrals that can no longer be
simplified.
The following, alternative approach, due to Puri (1975), relies on the
observation that, conditional on W(-)> ^ W 1S itself a pure death process
with the random, time-varying death-rate parameter j3W{-). This implies
(e.g. Kendall, 1948) that
where W(t) = /0* W(u)du, the initial condition (X,W)(0) = (n,6) being
understood. Expanding (3.6.14) and taking expectations yields the p.g.f.
(3.6.15)
dt dv d0 dA
T ~ {0 + p)v-p ~T ~ 0 '
whence
In ((0 + v)p - p) = (0 + p)t -f A, ft = B,
where A and i? are constants. It follows that
This gives
p{1
f ( ^ -;-{e+P)t))b (3.6.19)
o+P
Setting 0 = j for j = 0 , . . . , n and combining the resulting expressions as in
(3.6.15), we recover (3.6.8).
Other examples exploiting this method can be found in Puri (1975).
where JN,I,J is defined below (3.1.4). This quantity is negative, and is some-
times called the stochastic lag. An alternative derivation entails the use of
the approximation ]T^_ i~l ~ / _ i ^ u~l ^u-
3.2 Show that in the special case of a simple stochastic epidemic starting with
one infective introduced into a population of N susceptibles, the first two
moments Eii and varti of the duration t\ of the process in Section 3.1 are
related by
1 An 1 i»_ 2/?E«x + i g 2 + O ( J V - 1 )
1
/3 (N + 1)2 ^ [ i
2
N+ l \
OO /»OO
exp(-a: -
/ y ts 1M
e- y~ dy = / e
J -OO
is the characteristic function of the extreme value distribution with p.d.f.
exp(—x — e~ x ). Hence deduce that the asymptotic distribution of W at
(3.1.18), having for its Laplace-Stieltjes transform the product of two gamma
functions, is the convolution of two extreme value density functions:
/•oo
3.4 (See also Example 3.3.1.) For the general stochastic epidemic model of Sec-
tion 3.3 starting from (N, I) = (1,1), and assuming that 0^% 27, use direct
d.e. solution methods to solve the equations p n = -(/3 + 7)pn, pio = 7Pn>
p02 = pPll - 27/?o2, Poi = 27P02 - 7Poi, Poo = JPoi- For example, show that
7(1 - e ^ ^ ) p ,F 2 / * T
3.5 (Cf. Section 3.3.2 and Figure 2.4.) In a linear birth-and-death process {Y(t) :
t > 0} with birth and death rates A and fi per individual, where X > fi
and Y(0) — / , suppose that at time t there are Y3(t) jth generation live
descendants of the original / individuals and Zj (t) j th generation descen-
dants no longer living (cf. Section 2.5). Show that the expected numbers
yj(t) = EYj(t) and Zj(t) = EZj(t) satisfy the system of d.e.s
J- o
so that m(t), the 'average' generation number of the {z3(t)}, is given by (cf.
Figure 2.4)
Xt X
u\
jZ,(u) | H I ) - 0 (I - oo» =
Deduce that Pr{Y(t) —•0} = (^/A)7, so that for the non-extinction set
{Y(t) -+ ex)}, setting r = /i/A,
E[^.jZ3(u)\Y(t) -^ 00 (t ^ 00)}
3.6 For the general stochastic epidemic, show that the first passage time distri-
bution into any state can be constructed as a weighted sum of convolutions of
distributions of exponential r.v.s, where the components of the convolutions
correspond to the sojourn times in the states visited along the paths and the
weights are the respective probabilities of those paths.
[Remark. The same result holds true for any continuous time Markov pro-
cess on countable state space with well-ordered sample paths and bounded
transition rates. This algorithmic principle explains why the p.g.f. method
is applicable to the simple and general stochastic epidemic models.]
3.7 Show that if j is a non-absorbing state in a Markov process with well-ordered
sample paths, then -K%3 = q3 /0°° Pr{X(u) = j \ X(0) = i}du (cf. (3.4.8)). In
particular, for the general stochastic epidemic as in Section 3.4.3,
3.8 Using the same notation as at (3.4.10), show that the quantities
3.9 Let the m x m non-negative matrix A be primitive, and let the diagonal
matrix D have all elements positive, much as in Section 2.4. Use Perron-
Frobenius theory (e.g. Seneta, 1981, Chapter 1) as at Lemma 2 of Daley and
Gani (1994) to show that
(i) the eigenvalue Amax of A — D that has largest real part, is real;
(ii) the eigenvalue /imax of AD'1 that has largest modulus, is real and posi-
tive;
(iii) Amax <, = or > 0 according as /imax <, = or > 1, respectively.
Suppose that the pairwise infectious rates fiio of Sections 2.4 and 3.5 have the
product form 0i3 = frcty so that B = (0%3) = 0a\ where the vectors 0 and
a have components /3t denoting the infectivity rate of infectives in stratum i
and relative susceptibility of susceptibles in stratum j . Show that when Mi
at (3.5.6) is primitive, its largest eigenvalue is R = ^2iNlat/3t/ji. When is
it true that R w RQ, the Basic Reproduction Ratio for such an epidemic?
104 3. Stochastic Models in Continuous Time
3.10 In the context of Example 3.5.1, with the 0tJ as given there, show that the
dominant eigenvalue i f o o f t h e m x m matrix M of (3.5.8), is as at (3.5.12)
if we replace Nt by Nx, 0Hc by 0H/(0H + 7) - 0c/(0c + 7) a n d 0c by
0c/(0c + 7). Deduce that when N3 = Ni (j = 2,... ,ra) (cf. (3.5.13)),
_ {[(m - l)/
Check that this is smaller than the case of equality in (3.5.15), which ap-
proximates this expression if 7 ^> max(fc,fe).
3.11 Derive (3.5.16) using the embedded jump technique of Section 3.4.2. Show
that when 0 < 7, PtJ(0) « (0h)%~3(i\/j\)> Use (3.5.20) to conclude that in
the setting of Example 3.5.2, RQ > 1 when N% > l/(ra/fc/7); this condition
yields N\ > 4 when m = 100 and 0C = O.OOI7, and Ni > 10 when m = 100
and /3c = O.OOOI7 (cf. the text preceding Example 3.5.2).
3.12 Use the generating function (3.6.8) to show that for the stochastic carrier
epidemic,
r 1 - U
E[X(t)W(t)} = E[X(t)} E[W(t)]e-* [l
1+p
and deduce that cov(X(t),W(t)) < 0 for 0 < t < oo. Use (3.6.11) to
conclude that (d/dt)E[X(t)} = -E[X(t) W(t)] > -E[X(t)]E[W(t)]. Hence,
since E|W(£)] = w(t) (see below (3.6.1)), conclude that for the deterministic
and stochastic carrier models, the inequality
105
106 4. Stochastic Models in Discrete Time
say; after contact with the susceptibles, the Yt infectives are identified and
so removed. Note that Xt + Yt = Xt-i-
In the simplified Greenwood model (Greenwood, 1931), it is assumed
that the cause of infection is not related to the number of infectives so
that a can be regarded simply as the probability of non-infection. In this
case the number of infectives yt at time t is determined by x t _i and xt as
yt = xt-i - xu and
{X,Y)t = (x,y)t]
x
W( ) (4.1.3)
t+l/
It follows that a realization (x, y)o,(x,y)i,..., (x, y)r of the epidemic, where
yT = 0 < 2/ r _!, will have a cchain binomial' probability of products of
the form (4.1.3). Here the yt infectives behave independently, resulting in
a probability ayt of non-infection and xt = xt+i H- yt+i- The yt+\ new
infectives will now mix with the remaining xt-\.\ susceptibles at time t + l
4.1. The Greenwood and Reed-Frost models 107
xt
5 r
4—
2
Y2 ; = 0
1
= XA = 0, = 0
- • *
(f
before being removed, to produce the next generation yt+2 of new infectives
at t + 2. Clearly
We have already noted that when 0 < a < 1, Xt = 0 for sufficiently large
t. However, from an epidemiological viewpoint we are interested in the first
time T when there are no infectives, and in the number W of susceptibles
who have been infected by then. Finding the joint distribution of (W, T)
numerically is not difficult. Starting from p® = Pr{Xo = i} = 8Xoi, we have
ptj=Pr{Xt=j,Yt>0} =
0J
4.1. The Greenwood and Reed-Frost models 109
= A1 {I - 6P)-l0QE (4.1.9)
t=i
'1
1
= (0,0,0,1)
1 . 0
. 1 3a2(l-a) 0
0 .
3 3 2
-a) 6a (l-a) 0 0
1
(4.1.10)
110 4. Stochastic Models in Discrete Time
Hence
p r {T = 1} = a 3 , Pr{T = 2} = (1 - a) 3 + 3a 2 (l - a) 2 + 3a 4 (l - a),
Pr{T = 3} = 3a(l + a)(l - a) 3 + 6a 4 (l - a)2,
0
0
2a(l - a)
(4.1.11)
where the n t h subdiagonal is the set of the equivalent probabilities in
P multiplied by <^n, for n = l , . . . , x o - Similarly, P2(ip) has elements
Pr{X*+ 2 — j I Xt — i} multiplied by (pl~i to reflect a total of i — j in-
fectives at times t - f 1 and t + 2. By using the same argument as underlies
the relation (4.1.9), it now follows that the joint p.g.f. of (W,T), where W
denotes the total size of the epidemic, is given by
Example 4.1.2. For the same Greenwood model for a family of three
susceptibles as Example 4.1.1, the joint p.g.f. ^W,T{V->8) equals
1 . . . 0
. 1 . . 0
(0,0,0,1) . . 1 .
+0
(1 - a)V2 2a(l -a)(p 0
. . . 1 ( l - a ) V 3a(l-a)V 3a2(l-a)ip 0
0
0 0 . .
2a(l-a)V 0 0 .
a)(l-a)V 6a3(l-a)V 0 0
0 0 . .
(4.1.13)
0 0 0 .
l6a3(l-a)V3 0 00
4.2. Further properties of the Reed-Frost model 111
= (0,0,0, l)(J-P(</>)) - 1
a2
(a) (b)
Figure 4.2. (a) Susceptibles and (b) infectives in successive generations of
deterministic analogues of the Reed-Frost or non-overlapping Kermack-
McKendrick epidemic, with (JV, /) = (100,1), a = e~1/p = 0.98.
Poo 0 0
Pio Pn
P = (4.2.2)
r
x00
Here each submatrix Pij is square, of order x 0 + 1> with P{j = (p(k,i),(t,j))
where we have from (4.1.3), for k, I = 0 , . . . , x 0 , and j + £ = k,
(X,Y)t - (M)} =
0 0 0 ... o
0 0 0 ... o
- ay 0 0 ... o
o (j-h l)a*(l -ay ••• 0 ... o
: : '•• 0 ••• 0
0 0 •••(^^a^o-rtil-ay ••• 0
Setting A! = (0 • • 0•10 • • 0) • where the 1 occurs in the place corre-
sponding to (x,y)o, it follows as in the previous section that the duration
time T of the epidemic has
Similarly, the joint p.g.f. of (W, T), where W is the total size of an epidemic
and T its duration, is
where P((p) equals the matrix P modified by having the elements in the
j t h subdiagonal of its component matrix P^ in the partition at (4.2.2),
multiplied by ^ .
It should be clear from this discussion that the principles for the use of
the Reed-Frost and Greenwood models are identical, though the Reed-Frost
model requires more computation.
Example 4.2.1 (Measles epidemic data from En'ko (1889)). For the
epidemics listed in Exercise 1.3 we can extract the following data for the
successive numbers of infectives in the several generations of the epidemics.
En'ko's paper gives sufficiently detailed data to make this possible.
114 4. Stochastic Models in Discrete Time
t=i
(N - zt)(l - a*-*)
T-l
=£
t=i
-a*-)
where T == inf{£ : yt = 0} as before, with respect to both N and a, subject
to the constraint that N > J2t=i Vt = Nmin. The minimizing values N and
a satisfy the first of the equations
4.3. Chains with infection probability varying between households 115
and the second equation as well if TV > Nmin. We found (N, a) = (54,0.794).
For Epidemic D we found (iV,a) = (29.2,0.835), where now iVmin = 29.
but assume now that a varies from one household to another: model a as
a random variable with a density function f(a). Then for any particular
realization of a chain in a household, denoted (X, V)[O,T] — {(X,Y)t : t =
0 , 1 , . . . , T}, with the epidemic ending at time T, the probability of the chain
can be denoted by
P(a) = P((X,Y)[0,n\a). (4.3.1)
Since a varies between households, we need to average P(a) over all possible
values of a on (0,1), yielding
/ P(a)/(a)da. (4.3.2)
Below, we assume that the density function /(•) is ofthe beta function form
(1 - a)a~lab-l/B(a, b) for a, b > 0.
We illustrate the procedure for both the Greenwood and Reed-Frost mod-
els with transition probabilities as given at (4.1.2) and (4.1.3). In the former
case these probabilities are of binomial form
Integrating over all values of a when a has the beta density above, we
deduce that for integers i, j and general positive a, 6,
j\B{a + i-j,
(4
) B(a,b)
i\ (a + z - j - l)"-a(b-\-j-1)---6
x\J V XT /
(4.3.4)
3M (l-a)Ja v
^ ^ da
3MB(a + 3, b + 1) _ 3M(a + 2)(a + I)a6
B(a, b) (a + b + 3)(a + b + 2) (a + b+ l)(a + b)
for the Greenwood model, and for the Reed-Frost model,
If similar integrations for the other possible realizations are carried out, then
the parameter values a, b can be estimated by moment fitting techniques or
maximum likelihood methods using the data. See Bailey (1975) or Gani
and Mansouri (1987) for details.
For a Reed-Frost-type model, we assume that the Yt' infectives mix with
the X[ susceptibles and produce a further Bin(Xf;, 1 — aYt) infectives, so
that
;a y 0- (4A3)
4.4. Chain binomial models with replacement 119
q
l
If X[ < TV, infection now follows as in the Greenwood model. The transition
probability matrix R2 = ( ^ | ) say from X[ to Xt+i = N — Y{ — Zt is lower
triangular and of the form (4.2.1), namely
r (2)
jk = (4.4.5)
^ SNk j=
.0 otherwise,
so that, with 7 = 1 — a as the probability of infection,
1 . . . . .
7 a . . . .
0 0 ... 0 U
It now follows that transitions from Xt < N to Xt+i < N have distributions
given by the elements s^ of the matrix S = Rify, where the N x N matrices
.Ri and R2 are derived from Ri and R2 by deleting the last row and column.
We obtain
j=max(i,fe)
(N-i)\pN/a^ "-1
120 4. Stochastic Models in Discrete Time
where i, k lie in {0,1,..., N - 1}. Note also from (4.4.4) and (4.4.5) that
The process {Xt} ultimately ends in the absorbing state AT, but the time
T to absorption (i.e. the duration of the epidemic) need not be small: to get
an idea of the expected behaviour of the process we consider E(Xt+\ \ Xt).
From (4.4.2) we have
YD = 0} + NI{Y[ = 0}
where /{•} denotes the indicator r.v. Prom (4.4.1), Pr{F/ = 0 | Xt} =
qN~x* and E(F/ | Xt) = (N - Xt)p. Thus
or equivalently,
it has a maximum at a point interior to (0, N). Table 4.3 shows the threshold
values NPtOt = (1 — pa)/[(I — a)| ln(l — p)\ ] which are such that when the
group size N > Np,a there exists a positive value for Yt such that E(Yt+i \
Yt) — Yt, i.e. {Yt} remains stationary (in the mean) and endemicity of
infection is likely.
Table 4.3. Threshold values Np,a of group size for endemicity
Let £ be the least positive zero of the right-hand side of equation (4.4.8)
rewritten in terms of Xt = N — Yt, i.e. £ is the smallest positive root of
some values are shown in Table 4.4 when N — 200. Then £ < TV if and only
if N > Npja, and the process {Xt} has mean drift away from its absorbing
state when £ < Xt < N. Typically, we should have p small, in which case
the right-hand side of (4.4.9) « Np(l - a) and 7Vp,a « l/[p(l - a)].
t
1 - (nni\
EXt = [N-(N- EXt-i)p]a = Na(l -p) ^ } + (pa)*EJf0. (4.4.12)
1 — poc
Such a process has a stationary version with generic marginal r.v. X for
which
EX=^-^«ATa, (4.4.13)
1 — pa
the approximation holding when p is small. See Exercises 4.1 and 4.2.
The process {Xt} satisfies equation (4.4.7), so we might expect EX w £
because Xt has mean drift always towards £, i.e. it has zero mean drift only
when Xt = £. Comparison of (4.4.10) and (4.4.13) shows that, for p small,
£ « Na only if qN~t « 0, which occurs only if a is not too large. See
Exercise 4.3.
122 4. Stochastic Models in Discrete Time
j j j
(4.4.15)
where VJ is estimated by Pr{T = n}/Pr{T = n — 1} provided these terms
are not excessively small, and n is the number of transitions computed. If
the terms are very small, then it is of no consequence, because the last term
in (4.4.15) is negligible. Table 4.5 lists some values of ET starting from
XQ = 1 for N = 200 and the same p and a as in Table 4.4.
It is evident from the structure of the state space for the Markov chain
Xt that the conditional distribution
where the r.v.s {v(Xt) : t = 0,1,...} are independent with the mixed bino-
mial distribution (4.5.1), while if Yt = 0, Yt+\ = 0 also.
We can easily evaluate the distribution of Xt. Set
N-t
Modifications for the case where the initial number of infectives is larger
than 1 are easily made (see Exercise 4.6).
4.5. Final size of an epidemic with arbitrary infectious period 125
Define
T = inf{t:Xt + t>N}. (4.5.5)
Inspection of the construction underlying (4.5.2) shows that T is the total
size of the epidemic (including the initial infectives), i.e. T = AT-f 1 — XT —
ZT- It follows from (4.5.4-5) that the distribution of T is given by
In the argument above, we see that we can replace the exponential distri-
bution of S by an arbitrary distribution for the infectious period, with d.f.
F(-) say, leading to a different mixed binomial distribution on k = 0 , . . . , X,
namely
= k}= f°
Zn (otherwise),
Exercise 4.6 indicates how to use the approach of this section to compute
the total size distribution {Pk}- See e.g. Ball (1986) for another approach.
Exercise 4.7 suggests a relation between the mean total size starting with
N + 1 susceptibles to the distribution starting with N,
The model described in this section is based on part of Daley and Gani
(1999). It attempts to evaluate the importance of the usual homogeneous
mixing assumption based on the Law of Mass Action. It also investigates
the effect of pairwise interaction.
For the simplest construction, we assume a closed population of size
N = 2M individuals, of whom initially one is an infective and all others
susceptibles. At discrete epochs in time £ = 1,2,..., these individuals form
M pairs, with each individual in contact with exactly one other individual
at each time epoch. As a result of this pair formation, we assume that any
susceptible in a susceptible-infective pair becomes an infective. Assuming
that individuals, once infective, remain so, we can ask in the usual fashion
how long it will take for all individuals to become infectives.
Assume that pair formation occurs at random. Denote the number of
susceptibles after epoch t by Xt, with XQ = N — 1. Then we can use
indicator random variables and simple combinatorics to deduce that
Next, the number of mixed pairs is odd or even according to whether the
number of infectives Yt = N - Xt is odd or even, so irrespective of Yo, Yt is
even for t = 1,2,... . Define zM{i) = min(i, 2M - i) and set Zt = zM(Xt).
Then the number of ways of forming exactly j mixed pairs from the Xt
4.6. A pairs-at-parties model 127
otherwise. It now follows that the one-step transition matrix for the Markov
chain {Xt : t = 1,2,...} has elements Pi^-j = rj(i,M) where, with Zt =
above,
We can now use this Markov chain to find numerically the distribution of
the time T until all 2M individuals are infective.
The assumption that transmission of infection occurs in every infective-
susceptible pair is unrealistic. If infection occurs with probability /?' say,
then by using indicator variables we should have in place of (4.6.1) the
relation
Y\ P'Xt(N - Xt) (4.6.5)
X) =
The transition probabilities V^i-j sav> a r e now binomial mixtures of the
terms r^(z, M), because from the random number Kt of mixed pairs formed
by the Xt susceptibles and N — Xt infectives, the number of new infectives
is a binomially distributed r.v. Bn^i^,/?'). We have
min(i,N-i) .
( (4.6.6)
=3
128 4. Stochastic Models in Discrete Time
1
1 —
/
/ /'
0.8- /
1 /
1 /
0.6- 1 /
< 1
0.4- 1
1
1 1
1
1:,
1
0.2-
o 0
1
10
i
20
1
30
1
40 12
(a) (b)
Figure 4.3. Duration time distributions for the pairs-at-parties model with
0' = 1 ( ), /?' = 0.5 ( ), and ff = 0.3 ( ), for N = 20.
(a) Actual time Pr{Xt = 0}, (b) rescaled time Pr{Xt/(3f = 0}.
Figure 4.3 plots the distrubution of the duration time of an epidemic in this
model for three values of /?', with values rescaled in part (b) of the figure
indicating convergence as f¥ | 0 to a limit in which the effect of exactly M
pairs being formed in each time unit largely disappears. These distributions
can also be illustrated by plotting the mean numbers of new infectives in
a time unit (see Figure 4.4). Rescaling again emphasizes similarities and
suggests convergence (see Exercise 4.8).
We now attempt to understand better the effect of the pair-formation
aspect of the model relative to 'classical' homogeneously mixing models,
subject of course to appropriate choice of such a classical-style model. For a
start, we observe that in the pairs-at-parties model every individual in the
closed population is either susceptible or infective, and that once infected, an
individual remains so forever. In this respect the model resembles the simple
stochastic epidemic X(t) of Section 3.1 starting from N — l susceptibles and
1 infective, with transmission rate /? which we must determine appropriately.
We deduce from Section 3.1 that
d_
(4.6.7)
we can immediately see the similarity of functional form with the right-hand
side of (4.6.5). This derivative best approximates the discrete-time mean
decrement at (4.6.5) (which includes the case /?' = 1 in (4.6.1)) by setting
P = (3'/(N — 1). The deterministic analogue of this stochastic model gives
the logistic growth curve as in Section 2.1.
4.6. A pairs-at-parties model 129
(4.6.8)
if j = i,
otherwise.
We recognize that the term Pi,i-i equals the product of the probabilities
that a randomly chosen pair is a mixed pair and that there is infection
transmission. Since M pairs are formed in each time unit of the pairs-at-
parties process {Xt} it is appropriate that it be compared with the process
{Xt} =
5-
2-
4-
/
A
\
/ , /~v 1.5-
o
//
\
/ /
2—
/ / \\
il
if
it \ ij
1- / 1 0.5-
_
0-
1 i 1 1 1 "I12 u i I 1 1
0 2 4 6 8 10 )
( 5 10 15 20 25
(a) (b)
Figure 4.4. Mean numbers of new infectives in pairs-at-parties model
( ), approximating discrete simple epidemic model ( ), and
approximating Reed-Frost-style simple epidemic ( ), for iV = 20
and (a) ff = 1, (b) ff = 0.5.
Figure 4.4 compares the two processes {Xt} and {Xt} for N = 20 and
the two cases /?' = 1 and /? = 0.5, plotting for each time unit the mean
decrements E(Xt — Xt-\), i.e. the mean numbers of new infectives. The
figure also shows the corresponding data for the other discrete-time pro-
cess {Xt} that is a simple epidemic analogue of the Reed-Frost model of
Sections 4.1-2. Conditional on Xti Xt+i is a binomially distributed r.v.
130 4. Stochastic Models in Discrete Time
0.6-1
0.15-
/
0.4-
0.1-
I 1;
0.2- 0.05- //
1 \ \
o- 1
J I 1
\
1
^.
1
() 5 10 15 20 25
(a) (b)
Figure 4.5. Increments Vi{Xt = 0} — ¥x{Xt~\ = 0} in the duration time
distribution of pairs-at-parties model ( ), approximating discrete simple
epidemic model ( ), and approximating Reed-Frost-style simple epi-
demic ( ), for N = 20 and (a) ff = 1, (b) ff = 0.5.
4.1 (a) Use a sketch graph to check that when (4.4.9) holds, the root £ of equation
(4.4.10) lies between N and £0 = Nct(l - p)/(l - pa).
(b) Expand the term qN~* when (N - £0)| ln(l - p)\ is small (i.e. (N - £0)p
is small) and deduce that
4.6 (a) Show that the total size distribution {Pk} of the stochastic carrier epi-
demic model of Section 3.6 can be computed from Pk = Pn-k,o where, start-
ing from Pnb — 1, the quantities PtJ = Pr{for some t there are i susceptibles
and j carriers} are defined recursively by
Pib = P t + i P < + i , 6 (i = n - l , . . . , 0 ) ,
Pnj =qnPn,j + l (j = 6 - 1, . . . , 0),
Ptj = Pt+iPt+i,j + QiPt,j+i (i = n - 1,. • . ,1; j = b - 1 , . . . , 1),
ti /
7
TV + 1
compare with the argument in Ball (1986). Cf. also (2.3.7) and Exercise 5.6.
4.8 Investigate appropriately rescaled limits of the pairs-at-parties model and
of the two simple epidemic analogues of Section 4.6, for (a) TV —> oo, and
(b) ff - 0.
Rumours: Modelling Spread and its Cessation
133
134 5. Rumours: Modelling Spread and its Cessation
where Jr(Kr) denotes the number of contacts with ignorants amongst the
138 5. Rumours: Modelling Spread and its Cessation
(5L6)
K) -11 * = o - 1 A G) (*:;) /CO •
K—J
and similarly,
^ =-2+^, (5.2.2)
dx x
whose solution consistent with the initial conditions is
2x(t) + y{t) + NIn - ^ = A(a(t), j/(t)) = 2JV + 1 (all t > 0). (5.2.3)
x{t)
It follows from (5.2.3) that the limit lim^oo x(t)/N = ON say, analogous to
the proportion / at (5.1.3) of ignorants who never hear the rumour, satisfies
2 ( 1 - 0 ) +In 0 = 0. (5.2.5)
This is the same as the equation for the deterministic model of a general
epidemic considered in Chapter 2.3, having relative removal rate ^N (cf.
equations (2.3.6) and (2.3.7)). Unlike the epidemic model, the rumour model
no longer has a family of processes with a parameter determining threshold
behaviour. Instead, the analogy is with a single process; it is not difficult
to check that the dependence on N of the root ON of (5.2.4) is O(N~l) (see
Exercise 5.1).
In a similar fashion, the general (a,p)-probability variant of the basic
[DK] model leads to the pair of differential equations
x = (~l)pxy,
y = (+l)pxy + (-l)[p(2 - p)2a(l - a)\y(y - 1) +payz]
+ (-2)p{2-p)a2\y{y-l)
^ = _{1 + a)+a(N-l+P)+a(l-P)y
dx x x
140 5. Rumours: Modelling Spread and its Cessation
0.8-
0.6-
0.4-
0.2-
This is still a first order linear differential equation, with the general solution
(1 + a)x N
V N pX
~~ l-a(l-p) 1-p ^
for some constant XN,Q,P of integration which is determined by the initial
conditions, yielding
(l4-cy)r N
11 T ulX IV
The proportion 0N of ignorants who never hear the rumour behaves much
as the root of equation (5.2.4), but with limit 0 = 6(a,p) — linijv^oo ON
now satisfying
(2 - - p)0 = 1 - a ( l - p). (5.2.8)
The contour curves in Figure 5.1 depict the non-unit root 0(a,p) as a func-
tion of a and p; always, 0 < 0(a,p) < 0 m a x , where 0 m a x « 0.284668 is the
smaller positive root of 0(1 - 2 In 0) = 1 arising from the limit a -> 1 and
p —> 0 (see also Exercise 5.2).
The deterministic analysis of Maki and Thompson's model is sketched in
Exercise 5.4.
5.3. Embedded random walks for rumour models 141
{
(—1, +1) with probability i/fij,
(0, -1) with probability (N+l-i- j)/fih
(0, -2) with probability \(j - l ) / / y ,
(5.3.1)
where fij = N — ^(j — 1). Note that the first case is excluded if i — 0,
when the probability equals zero, or j = 0, while the last case is excluded if
j < 1. This embedded jump process is a random walk on that part of the
two-dimensional integer lattice for which 0<i<N,0<j < 7V+1 — i, and
the states {(z, 0) : i = 0,..., N - 1} are absorbing. Prom equations (5.3.1)
we can derive equations for the probabilities P^ on the path of a rumour,
defined by
Pit = Pr{(X, Y)(t) = (ij) for some t}
( j
= Pr{(X, Y)n = (i,j) for some n}. ' '
where A denotes one of the three jumps (-1,1), (0,-1) and (0,-2) at
equation (5.3.1). Specifically, starting from PNi = PN-I,2 = 1, we have for
142 5. Rumours: Modelling Spread and its Cessation
N-i + D, (5.3.4a)
(J = N- *), (5.3.4b)
JV$(j + l)PfJ+a
(2 < j < N - 1 - i), (5.3.4c)
From these relations, in the same way as the distribution of the size of
the general epidemic process was derived in Section 3.4.3, we can find the
distribution
= 0.310681 « « » „ « „ . ,5.3.7,
For the [DK] model the general form of equation (5.3.4) involves three
terms on the right-hand side. For the Maki-Thompson model, however,
while the same deterministic equation (5.2.2) applies, the analogous equa-
tion has only two terms, corresponding to the simpler possible transitions
5.3. Embedded random walks for rumour models 143
100
0.15 0.15
0.05 0.05
10 20 30 40 50 0 10 20 30 40
(a) [DK] model (b) Maki-Thompson model
Figure 5.2. Frequency polygons of the distribution of the numbers of
ignorants ultimately hearing a rumour in two rumour models, for N = 50
( , scales as shown) and N = 100 ( , double the abscissa scale, halve
the ordinate scale, as indicated at top right-hand corners).
1), (5.3.8a)
N
(5.3.8b)
j = 0,l). (5.3.8c)
These results differ from the relations at (5.3.6) for the [DK] model except
for the commonality of 0 = 0.203188 in E£N. The values for N in the table
are used to facilitate direct comparison with [DKj's Table 1. Note that the
moments given are of X(oo) conditioned to be less than N — y/N: condi-
tioning is necessary because otherwise the term PN-i,o = I/AT2 contributes
144 5. Rumours: Modelling Spread and its Cessation
significantly to var£/v- The result would then belie the description of the
distribution as approximately normal, and would be reminiscent of the ma-
jor/minor humps of the final size distribution of the general epidemic (cf.
Figure 3.2). Pittel (1990) illustrates ways around the complications that
result from this fact when attempting to use a moment method for proving
asymptotics of the final size distribution. Watson (1988) remarks that the
fact that the asymptotic value of NVSX£N for the Maki-Thompson model
is smaller than for the [DK] model 'accords with the change to smaller more
frequent jumps in the removal process'.
Figure 5.2(b) complements part (a) by presenting the distribution for the
Maki-Thompson model. The probability of very few ignorants ever hearing
the rumour, which is approximately ^N~l for the [DK] model and N~2 for
the Maki-Thompson model, shows as a small peak in (a) and not at all in
(b), at the scale of the diagram and for the values of N used.
Both the general epidemic and Maki-Thompson models have embedded
random walks that are of a simpler structure than that in the [DK] model.
Specifically, because the process is strictly evolutionary and the jumps are
to 'adjacent' states, much as in a birth-and-death process, the 'time' argu-
ment n of the embedded jump process can be used to reduce the dimension
of the state space, from a two-dimensional to a one-dimensional lattice. To
study such processes we replace the embedded jump process {{X,Y)n} by
{(X',Y')n] which is defined as starting from the same initial state and as
having one-step transitions to (-1,1) and (0,-1) with the same probabili-
ties as for {{X,Y)n}. But now {{X\Y')n} is no longer confined to a first
passage into the states (0,j) (j = 0 , . . . , N); rather,
x'n) = [ ( T )
LV7Viy J (5.3.II)
N
on
implying that {Vn} is a martingale with respect to the cr-fields {.Fn}
which M is a stopping time.
Similarly, E ( [ X ; + 1 - X^] 2 | ^ n ) = X'JN so that
and
l (5-4.1)
\{X(t),Y(t)) = 2X(t) + Y(t) ^
where A() is clearly a random variable. Recall from (5.2.3) that X(t) =
\(x(t),y(t)) = 2x(t)+y(t)-Nln[x(t)/N] = 27V+1 (alH > 0) for (x(t),y(t))
denoting the deterministic path. We use this relation later in the form
y(t) -1 = 2[N- x{t)} + JV In ^ . (5.4.2)
Now the process A(t) is a martingale with respect to the cr-fields {^i-} =
{a({(X, Y)(s) : s < £})}, as is verified by evaluating
E(dA(t) | Tt-)
-Y(t)Z(t) + 2Y(t)[Y(t)-l]\dt
= Y(t)^N-2N+-~+Y(t)~l]dt. (5.4.4)
5.4. A diffusion approximation 147
fN
var A(oo) « /
Jx=N0
N2 x
- I - TV + — + 2(N - x) + ATln ± I dx
f1 l
Ar
= N / dtx
7^
varA(oc)« ^ L _ i , (5.4.6)
-i=A"(oo) + l
Let X(oo) = EX(oo) + W « N0N + W for some random deviation VT. Then
JVJ-1-7V ^ T «Wf'(EX(oo)),
148 5. Rumours: Modelling Spread and its Cessation
(5.4.7)
for this model (cf. equation (2.3.6)). The jumps of (X, Y)(-) give
at rate pX(t)Y(t)dt,
w
dA(t) \Ft- = { Mt) ' - ' (5.4.9)
-1 at rate jY(t)dt.
NO X
NO
v , , N0(l-0)(0
V K + K)
}
varX(oo) « ^J 0)2 -, (5.4.12)
5.5. P.g.f. solutions of rumour models 149
mary of the information needed to derive equations for the moments of the
process concerned.
Consider the [DK] model by way of example. As a continuous time
Markov chain model it has the infinitesimal transition rates at (5.1.2).
Defining Pij(t) = Pt{(X,Y)(t) = (ij) | (X,Y)(0) = (iV,l)} we readily
deduce from the forward Kolmogorov equations that
j=0
Forming the Laplace transforms <&i(w, 6) = f£° e~0t fi(w, i) dt and using the
initial conditions $i(w,0) = SNiw gives
5.1 Show that the root 0N of equation (5.2.4) satisfies ON = 0 + C/N + O(N~2),
where 0 = 0.203 188... and C = 0/(1 - 20) « 0.34.
5.2 Consider the non-unit root 0(a,p) of equation (5.2.8). Rearrange the equa-
tion and take limits so as to deduce that 0(a) = limp— i 0{a,p) satisfies the
equation (l + a~ 1 )(l — 0) = — ln0 (this equation coincides with (5.2.5) when
a = 1). Conclude also that 0(a,p) < 0 max on 0 < p < 1, 0 < a < 0, where
#max is the smaller positive root of 0(1 — 2 In 0) = 1, attained when a —> 1
and p —• 0.
5.3 Formulate a deterministic version of the A;-fold stifling model for which the
function yi(t) (i = 1 , . . . , k) denotes the number of spreaders that have had
contact with a spreader or stifler on i — 1 occasions. Setting y = yi~\ \-yk,
the differential equations are
x = -xy,
yi =xy-yi(y-l + z),
yi = (jfc-i ~ Vi)(y - 1 + z) (i = 2 , . . . , fc),
x = (-l)xy = -xy,
y = (+l)xy + (-l)y(N - x) = y(2x - N).
Conclude that dy/dx satisfies equation (5.2.2) so the same solution (5.2.3)
and other consequences follow. Investigate the time tu = inf t{y(t) < 1}.
[Maki and Thompson's original formulation is as a discrete time model in
which exactly one pairwise contact occurs at each discrete epoch in time.
This corresponds to scaling time in our formulation by A^(A^ -f 1).]
152 5. Rumours: Modelling Spread and its Cessation
5.5 ([DK] model with forget fulness). Modify the [DK] model as follows. Assume
that pairwise meetings occur at rate (3 per unit time, and that each spreader
may also become a stifler by means of 'forgetfulness' or 'loss of interest' at a
rate 7 per unit time, independently for each individual and of any interaction
with others in the population. To reflect this assumption, replace the last
two equations at (5.1.2) by
"j — ^ • •
ax x
Hence conclude that, unless the individual rate of forget fulness 7 < 0N, the
rumour spreads to only a small fraction of the population, and that no matter
how small 7 is, about 80% at most of the population would hear the rumour.
5.6 (cf. end of Section 5.3, around equation (5.3.10)). Extend the state space
XNI to {(ij) : i = 0,...,7V, j = I + N - i,I + n - i - 1,... ,0, - 1 , . . . } on
which a random walk {(X1 ,Y')n} is defined so as to coincide on XNI with
the embedded random walk jump process defined on the general epidemic
model of Chapter 3 (see the start of Section 3.4.3). Show that the processes
{Vn} and {Wn} defined by
and
'r+dX'r+l-l)
V+X'r+p-2)
r=o * + '•
5.8 (Variance of final number with k-fold stifling). Consider the /c-fold stifling
variant of the [DK] model and use standardized variables x(t) — X(t)/N,
yx(t) = Yt(t)/N. Construct differential equations analogous to those of Ex-
ercise 5.3 in conjunction with the procedure of Section 5.4 to conclude that
X(oo), the number of ignorants never hearing the rumour, has approximate
5.10 (Differential equations for first moments). Show that the first moments for
both the [DK] and Maki-Thompson models satisfy the same pair of equations
(cf. equations (3.3.6) for the general epidemic model), namely
= 2EX(t)EY(t) - EY(t)
6
154
6.1. Influenza epidemics: a discrete time deterministic model 155
Such change has been particularly rapid during the twentieth century, with
faster transportation methods and different life styles that affect both in-
fection and removal rates.
= (3xtYt = 0xt
= x t-
156 6. Fitting Epidemic Data
0 V>02/0 — YQ XO 0
1 ^02/1 •+• ^12/0 = Y\ x\ — xo — 2/1 2/1 ==: / t o o ^ o
where the (fu are based on empirical observations, and K is taken as 2 x 10~4,
a rough estimate of the influenza death rate. Note that Qt here refers to a
subset of all removals whereas z(t) as in (2.3.3) refers to all removals up to
time t.
Spicer (1979) reports that, for his analysis, given that notification of
deaths occurs weekly in England and Wales, the time unit was changed
from one day to one week. He states that this 'does not seriously affect the
model owing to the short infectious period of influenza, and it introduces
a useful smoothing effect'. In order to use the data on the %pj we must
calculate the evolution of the epidemic via (6.1.2) and (6.1.3) on a daily
basis, and then accumulate them over a week to obtain
7T+6 7T+6
yt anc =
VT = X / * & y i Ct
t-lT t—lT
(cf. Exercise 6.1 for a continuous time analogue). The fitting procedure was
the standard one of finding a pair of parameters a = f3x0 (it is impossible to
estimate /3 and XQ separately) and yo which minimized the sum of squares
W of the differences between the observed and calculated weekly deaths
over a period of V weeks, namely
r
W
=Y1^T-CT)2, (6.1.4)
T=0
where <^ and ^T denote respectively the observed and fitted numbers of
deaths in week T. A weighted least squares method was also used.
6.1. Influenza epidemics: a discrete time deterministic model 157
500-
400-
300-
200-
100-
0 1 1
0 10 15
Figure 6.1. Weekly deaths from influenza and influenzal pneumonia
in England and Wales, 1965-66. (Data from Spicer, 1979.)
Spicer fitted his theoretical results to data 1 covering all influenza epi-
demics in England and Wales for 1958-73 except those that were bimodal.
His paper illustrates his results graphically for nine unimodal data sets on
deaths in this period, and for seven data sets on deaths in Greater London
for 1958-72. To the naked eye, the fits appear reasonably good but with
several of the 16 fitted curves being somewhat too high at the beginning
and too low at the end of the epidemic. The graph for 1965-66 for England
and Wales is reproduced roughly in Figure 6.1; see also the graph of Greater
London 1971-72 in Figure 2.12.
One of the two quantities which needs to be estimated is the value a =
/?£o, and this is given, for example, by Spicer for the nine sets of data on
deaths in England and Wales mentioned previously, a s 3 . 9 < a < 7 . 7 (see
Table 6.2), and the seven sets of data for Greater London as 4.5 < a < 7.7.
The position of the optimum fit was found to be determined more by /3xo
than by K and JJQ.
Table 6.2. Estimated values of a = /3x0 for England and Wales, 1958-73
Year 1958-59 60-61 61-62 62-63 65-66 67-68 69-70 71-72 72-73
a 5.2 5.5 6.5 4.7 4.5 5.9 7.7 3.9 5.7
Source: Spicer (1979). Note that these data exclude bimodal cases.
If we take the average value a = 5.51, this implies that an infective trans-
mits infection to about five susceptibles, which may be an overestimate. The
1
Spicer's calculations were 'based on observations reported by Stuart-Harris et al.
(1950)'; the data are not otherwise identified; the only data from the 1950 reference
matching this description give estimates of Y^=i V5d+j (d = 0,1,2) and *52
equal to 0.31, 0.41, 0.26 and 0.02 respectively.
158 6. Fitting Epidemic Data
fits are, however, sufficiently good for the model to be used for predictions
of influenza deaths.
Wilson (1989) fitted several curves to these quarterly data including the
three empirical curves listed below, namely the log linear (T), log quadratic
(Q) and log log (LT) models with Poisson errors, using the statistical pack-
age GLIM. These are
T: In A = <* + /?*,
Q: In A = a + /ft + 7*2,
LT: In A = a + pint,
where the LT curve was originally proposed by Whyte et al. (1987). Using
t = 1,..., 25 to denote the 25 quarters of the data in Table 6.3, Wilson
6.2. Extrapolation forecasting for AIDS: a continuous time model 159
150- 150-
100- • 100-
«
50- 50-
#
1 1 1 1 1
() 10 15 20 25
5
(a) (b)
Figure 6.2. 1982-1988 Australian AIDS data •) with fitted curves:
(a) LT ( ) ) and T ((----), and (b) Q ( -). (Based on data in
text from Wilson (1989).)
The fit of T gives 159 for the last quarter (t = 25) and this was considered
unsatisfactory. The fits of both Q and LT curves were considered adequate
as indicated in Figure 6.2.
The fit was further refined to take account of two events towards the end
of 1987:
(a) the extension of the World Health Organization's (WHO's) definition
of AIDS from its 1985 revision; this may have led to an increase in
AIDS diagnoses; and
(b) the availability of zidovudine (AZT) at about that time in Australia,
with the consequent effect of delaying the onset of AIDS.
Figure 6.3 shows the fitted values taking account of event (b); the disconti-
nuity at t = 20 in (6.2.1) reflects (b) also.
Predictions were then made for the next 10 quarters on the basis of these
(and other) models, in the cases where the effect of AZT was considered
or neglected, and for the cases where all the 1982-1988 data were used, or
only the data from 1985-1988. Figure 6.4 gives the predictions based on the
1982-1988 data, with and without the AZT effect. The two LTjZ models
(j = 1,2) considered, in which the effect of AZT was taken into account,
160 6. Fitting Epidemic Data
150-
(a) (b)
Figure 6.3. 1982-1988 Australian AIDS quarterly data (•) with fitted
curves allowing for possible delayed onset of AIDS: (a) LT,Z ( )
and T,Z ( ), and (b) Q,Z ( ). (From Wilson's data, by personal
communication.)
though they varied only slightly, yielded very different predictions within
two years.
400- 400-
300- 300-
200- 200-
100- 100-
1^ I I I 1 1 1 1 1 1
26 28 30 32 34 36 26 28 30 32 34 36
(a) (b)
Figure 6.4. Predicted numbers of new cases of AIDS diagnosed in Australia
from the first quarter of 1989, using all the data: (a) T ( — — — ),
LT( ), Q (- — );(*>) T,Z( ), LT,Z( ), LT,Z(2)
( ), Q,Z ( ). (From Wilson's data, by personal communication.)
The results in Figure 6.5 were obtained using only the last four years
of data 1985-1988. These were deemed to be more satisfactory than the
models using all the data, and not quite so divergent in their predictions.
6.2. Extrapolation forecasting for AIDS: a continuous time model 161
400- 400-
300- 300-
200- 200-
n
1 1 1 1 1 1 U I 1 i 1 I 1
26 28 30 32 34 36 26 28 30 32 34 36
(a) (b)
Figure 6.5. Predicted numbers of new cases of AIDS diagnosed in Australia
from the first quarter of 1989, using 1985-1988 data only: (a) T ( ),
LT( ), Q ( — - ) ; ( b ) T , Z ( - - - ), LT.Z ( ) and
Q,Z ( ). (From Wilson's data, by personal communication.)
While all models predict the same numbers for 1989, they range widely
for 1993, the last year of the predictions. The actual numbers are now
known and are shown in the last line of Table 6.4: on this basis the QZ
model came closest. These methods of extrapolation forecasting have now
been displaced by back-calculation models, described in Section 6.5.
162 6. Fitting Epidemic Data
from which we can write down lnL^ and maximize it in a to obtain the
equation for the maximum likelihood estimator (MLE) d
Similarly, for the Reed-Frost model, with the different likelihood function
LRF say, the MLE d satisfies
Wilson et ah (1939) contains data from Providence RI for the total num-
bers of infections in 100 households of three infected by one infective (see
Table 6.6).
6.3. Measles epidemics in households: chain binomial models 163
Table 6.6. Fit of the Greenwood and Reed-Frost models to Providence RI data
Fitted values
Total no. cases Data Greenwood model Reed-Frost model
0 4 2.5 4.2
1 3 1.5 2.8
2 9 14.9 9.0
3 84 81.1 84.0
Total no. households 100 100 100
Using the data values (a, 6, c, d) = (4,3,9,84), the values & for the proba-
bilities of non-infection in these two models satisfying, respectively, (6.3.1)
and (6.3.2) are
6LG = 0.291, aRF = 0.347. (6.3.3)
The fitted values given by the second and third columns of Table 6.5 with
a = &G, OLRF as at (6.3.3) yield the expected numbers shown in the last two
columns of Table 6.6. The value for the chi-square goodness-of-fit statistic
X% for the Greenwood model, combining the categories with 0 and 1 cases
because their expectations are both small, equals 3.02/4.0 + 5.92/14.9 4-
2.92/81.1 = 4.69; this is significant at the 5% level but not at the 2.5% level
for x 2 on 1 d.f. For the Reed-Frost model, again combining categories with
0 and 1 cases, the x 2 statistic on 1 d.f. equals zero, reflecting the perfect fit.
Table 6.7. Fit of the Greenwood and Reed-Frost models for specific chain types
Providence RI Fitted values
Type of chain Generic data Greenwood model Reed-Frost model
Xt = 33 a 4 0.9 1.2
3 22 b 3 0.4 0.7
32 11 c 1 0.7 1.0
3 11 d 8 8.2 2.2
3 2 100 e 4 2.7 3.4
3 200 f 3 6.5 7.3
3 100 9 10 31.0 38.7
300 h 67 49.6 45.5
Total no. households n 100 100 100
For the Greenwood model the log likelihood lnLcr, neglecting a constant
term, equals
164 6. Fitting Epidemic Data
3a -f 46 + 4c + 2d + 3e + 2 / + .
-f 6e + 5/
Similarly, for the Reed-Frost model, In LRF is a linear function of In a,
ln(l — a) and ln(l 4- a ) , and has derivative
dLRF 3a + 46 + 4c + 3d + 3e 4- 2 / + g g
da OLJIF 1 ~f~
6 + 2c + 2a1 + 3e + 3 / + 3g + 3ft tn n r ,
(6.3.5)
1 -OflF
Thus, ay = a for the Greenwood model and ay for the Reed-Frost model.
Becker (1981) was concerned with the data collected by Heasman and
Reid (1961) on outbreaks of the common cold in families of 5 with 1 initial
infective (equivalent to households of 4 infected by 1 external infective). He
fitted both the general model, where the ay were estimated directly from the
6.4. Variable infectivity in chain binomial models 165
Table 6.8. Reed-Frost chain binomial probabilities for households of size four
Type of chain Probabilities* Expected values of probabilities*
44 a4 zq(3)/z(3)
43 3 4a6(l-a) 4zq(b)zp(0)/z(6)
43 2 2 12a 7 (l-a) 2 12zq(6)zp(l)/z(S)
4 3 2 11 24a 7 (l - a ) 3 24zq(6)zp(2)/z{9)
4 3 2 10 0 24a 6 (l - a) 4 24zq(5)zp(3)/z(9)
43 2 00 12a 5 (l-a) 4 \2zq(4)zp(3)/z(S)
4 3 11 12a 6 (l - a)3 12zq(5)zp(2)/z(S)
4 3 10 0 12a 4 (l - a) 3 (l - a 2 ) I2zq(3)zp(3)(l + q + 12z)/z(S)
4 300 4a 3 (l - a) 4 4zq(2)zp(3)/z(6)
4 22 6a 6 (l - a ) 2 6zq(b)zp(l)/z{7)
4 2 11 12a 5 (l - a) 2 (l - a 2 ) I2zq(4)zp(2)(l + q+ 13z)/z(S)
4 2 10 0 12a 4 (l - a) 3 (l - a 2 ) 12*g(3)zp(3)(l + q + I2z)/z(8)
4 200 6a 2 (l - a) 2 (l - a 2 ) 2 Qzq(l)zp(3)[76z2 + (17 + 19q)z + (1 + q)2)/z(7)
411 4a 4 (l - a ) 3 4zq(3)zp(2)/z(6)
4 10 0 4a(l - a) 3 (l - a 3 ) 4zq{0)zp(3)[3Sz2 + (12 + 9q)z + (1 + q)2]/z(7)
400 (I-a)4 zp(3)/z(3)
*See equation (6.4.5) for definition of p, q, z and zp(-).
data, and the Reed-Frost model where ay = ay, to the data. The Reed-
Frost probabilities are shown in column 2 of Table 6.8. Both fits proved
adequate but not ideal, with that of the Reed-Frost model the better.
Greenwood (1949) had considered possible variations in a, while Bailey
(1975) showed that if a is a random variable with a beta distribution, re-
flecting the variable infectivity of households, then chain binomial models
could provide an improvedfitfor the Providence RI measles data (see Bailey,
1975, Chapter 14, pp. 254-260).
Dietz and Schenzle (1985) reviewed the history of household epidemic
statistics and considered the data of Heasman and Reid (1961) who had
obtained a = 0.886 for the probability of no contact in the Reed-Frost
model. Becker (1981) found a = 0.884, and Schenzle (1982), after pooling
some of the data (see Table 6.9, column 3), obtained a = 0.893. Schenzle
(1982), following Bailey (1975), also considered the case of a Reed-Frost
model in which a is a beta random variable, but did not provide any details;
we reproduce his figures in Table 6.9, without having full knowledge of his
method of fit.
We now sketch the treatment of Gani and Mansouri (1987), from which
paper further details can be obtained. Following Bailey (1975) and Section
4.3 above, we have for the Reed-Frost model
Suppose that a (0 < a < 1) varies between households and follows the beta
distribution with density function
(a,6>0).
(a 6> (6.4.3)
B(a,b) '
Then, integrating Pij(ot) over all values of a gives
da
>3
(6.4.4)
(6.4.5)
thereby obtaining the formulae in column 3 of Table 6.8.
Table 6.9. Common cold data for households of size four with four model fits
Observed Reed-Frost models Becker's Schenzle's
Chain data variable a fixed a general model model*
Xt = 44 423 420.77 405.2 403.9 420.9
433 131 133.52 147.1 147.3 133.4
43 2 2 36 40.17 45.3 45.6 40.1
43211 14 10.43 10.5 10.7 10.4
43 2 100 4 1.82 1.4 1.4 1.8
43 200 2 1.09 0.8 0.8 1.1
4 3 11 8 6.12 6.0 6.2 6.1
4 3 100 2 2.41 1.7 1.6 2.4
4300 2 0.53 0.3 0.3 0.5
422 24 23.15 25.6 26.9 23.1
4 2 11 11 13.34 12.7 12.2 13.4
4 2 100 3 2.41 1.7 1.6 2.4
4 200 1 3.21 2.0 1.8 3.2
4 11 3 2.84 2.5 3.7 2.8
4 100 0 2.18 1.1 0.0 2.0
400 0 0.24 0.1 0.1 0.2
Total 664 664.23 664.0 664.1 663.8
X 2 2.74 9.1 9.5 5.8
* A Bailey-type variant of the Reed-Frost model; see text for detail.
6.4. Variable infectivity in chain binomial models 167
for some constant C. Gani and Mansouri (1987) used the ZXMWD subrou-
tine of IMSL to minimize — In L, and found for the MLEs of q and s
T (31747.4 -4727.7 \ ( .
1 (6A8)
= (-4727.7 8535.7 J '
and variance-covariance matrix
_ / 0.3433 x 10~4 0.1901 x 10"4 \ , .
~ \ 0.1901 x 10~4 1.277 x 10"4 ) ' [ }
a = 39.91, 6 = 4.999.
with 5 d.f. (P-value = 0.091). The Reed-Frost model with variable a gives
a closer fit: \2 = 2.74 with 5 d.f. (P-value = 0.728). Schenzle's variant is
likewise closer than the fixed a and Becker's model.
These results lend support to the hypothesis, originally made by Green-
wood (1949), that there is variation in the susceptibility to infection between
households. Bailey (1975) had already shown in his studies of the Provi-
dence RI measles data that both Greenwood and Reed-Frost models with
variable a provided improved fits. The present application to common cold
data arrives at a similar conclusion.
a(t)= / I(s)f(t-s)ds
Jo
where I(s) ds is the number of HIV cases reported earlier in (s, s + ds). The
integrated form of this equation expresses the cumulative number A(t) of
AIDS cases to time t in terms of the distribution function F(t) = JQ f(u) du
of the AIDS incubation period, namely
these authors estimated the parameters A, r and 6 from the data, and
concluded, on the basis of 83 adult individuals undergoing transfusions from
1978 to 1983, that the mean incubation period was 54 months = 4.5 yr, with
90% confidence bounds of 2.6 and 14.2 yr. The estimated Weibull density
function for this case is shown in Figure 6.6 below. We used the parameters
A = 0.000044 5, f = 2.51 and 6 = 2.5 months for this density function, and
our time axis is in months. The modal value of the density turns out to be
larger than in Lui, Lawrence et al. (1986).
0.02-
0.015-
0.01-
0.005 -
I I I I I
20 40 60 80 100 120
Figure 6.6. Estimated Weibull density function for the transfusion-
associated AIDS incubation period in months based on 83 adult patients.
(Based on Figure 2 of Lui, Lawrence et al. (1986).)
170 6. Fitting Epidemic Data
Other authors have used different models for the distribution of the in-
cubation period, such as the gamma density function
Among these are Anderson and Medley (1988), who estimated values of
a = 2.7, A = O.lOyr"1, hence a mean of 2.7/0.19 = 14.3 yr, based on
fitting a gamma distribution to data from England and Wales up to April
1988. They also fitted the 2-parameter Weibull distribution F(t\ p, /?) =
1 - e " ( ^ , obtaining {3 = 2.33, p = 0.12 yr" 1 and a mean of 7.4yr (cf. the
parameters used for Figure 6.6, for the d.f. 1—exp (—[p(t — O')]P), which has
/? = f = 2.51,0' = 0.2yr, p = 12(0.000044 5) 1 / 2 5 1 = 0.221 yr" 1 ). Lawless
and Sun (1992) considered in addition a log logistic distribution function
(6
">
assuming (6.5.2) holds, the likelihood of the observed data can be written
as
L
=
where ^ = 1 if the critical transfusion date for i was before January 1982,
Si = 0 otherwise. Thus, when 8i — 1,
and when 8i — 0,
ainL32 **
32
32 32
E (^ -«) - E A«< - °r ^ -
ln
v
— OiAylji — uj LllyL/i — uJQ ' -\- Ayiii — u) in^XL^ — I
0.025 -
0.02-
0.015 -
0.01-
0.005 -
0- I I
0 20 40 60 80 100
Figure 6.7. Estimated Weibull density function for the AIDS incubation
period in months based on 32 paediatric patients. (Based on data in text
from Lui, Peterman et al. (1988).)
Quarter
Year
1982 1
1983 0 1 2 3
1984 1 6 16 23
1985 26 37 29 30
1986 42 51 63 70
1987 86 96 87 102
1988 112 112 143 155
1989 142 121 153 152
1990 150 131 167 143
(6.5.7)
6.5. Incubation period of AIDS and the back-calculation method 173
J+1
A(tj+i) - A(tj) = f I(s)F(t -s)ds (j = 0,..., 33).
Here the increments in A(-) are known from Table 6.10, F(t) is given by
(6.5.7) with (r, A) = (2.55,0.002078), and I(t) has one of the two para-
metric forms mentioned; it is, of course, possible to reconstruct I(t) non-
parametrically.
Given these conditions, assuming the epidemic to have begun in 1981,
Solomon et ai. (1991) obtained the following estimates of Jy+ I(s)ds for
the years (y,y + 1) in the period 1981-1990.
Table 6.11. Estimated annual HIV infection incidence in Australia
Year 1981 '82 '83 '84 '85 '86 '87 '88 '89 '90
Model
(a) Quadratic (i) 34 912 4046 4380 1160 73 1
exponential (ii) 34 912 4046 4380 1374 1000 1000 1000 1000 1000
(b) Linear 0) 62 678 4032 5522
logistic (ii) 62 678 4032 5690 1000 1000 1000 1000 1000 1000
The values at i) for both the quadratic exponential and linear logistic curves
are calculated without adjustment, while at ii) they include various adjust-
ments, such as an allocation of 1000 new HIV cases annually from 1986 in
A and 1985 in B, or the effects of various treatments, in order to achieve
greater realism. From these scenarios, HIV infection curves can be recon-
structed under different assumptions.
These HIV infection curves can then be used to make short-term projec-
tions of new AIDS diagnoses from (6.5.1). Table 6.12 sets out these for the
years 1991-1995 for Australia as given in Solomon et ai. (1991).
Table 6.12. Estimated annual HIV infection incidence in Australia
Year 1991 '92 '93 '94 '95
Quadratic exponential I(t) 765 830 863 863 831
Linear logistic I{t) 756 816 844 840 804
Note: No actual numbers are available for comparison
174 6. Fitting Epidemic Data
6.1 As a setting for a continuous time analogue of (6.1.2) and (6.1.3), let X(t),
Y(t), Z(t) denote the number of susceptibles, the (analogue of the) number
of infectives at time t, and the number of deaths by time t. Develop the
following relations:
rJo
= -0X(t)Y(t), i.e. ±
v(u)du + J <p(t~u)\dX(u)\\,
where X/J(U) denotes the probability that an infected individual is still in-
fectious (and mixing in the population) time u later, and Kip(u)du is the
probability that an infected individual dies in the interval (u,u + du) after
infection. Observe that, from an algorithmic viewpoint, the discrete time
model is easily implemented, but not so the continuous time model.
6.2 Table 1.6 presents data of En'ko (1889) recording the daily numbers of
measles cases for several years from records at the Smolnyi Institutions. Find
estimates of the variance of the dates of identification of cases within each
generation, (a) for each year as tabulated, (b) for the pooled data. Is there
any evidence for a model like s^U) = am + 0mj, for m = a, b according
as data are from (a) or (b) and j = generation number. How would you
interpret a& = 2a a , 0b = Pa ?
6.3 Consider a chain binomial model in which the probability pt of pairwise in-
fection is no longer constant but is monotonically non-increasing in t. This
can reflect the propensity of individuals who are more gregarious or more
infectious or more susceptible, to be infected ahead of others. Investigate the
estimation of {pt} and the fit of such models to data like Aycock's measles
epidemic (Table 1.5) or En'ko's data (Tables 1.6 and 4.1).
Hem ark: This discrete time model attempts to reproduce the effects of het-
erogeneous mixing or infectivity and susceptibility as done for continuous
time models in Cane and McNamee (1982) and Daley, Gani and Yakowitz
(2000).
7
175
176 7. The Control of Epidemics
may, for example, need to know the likely effects of spending funds on two
alternative policies, or the optimal method of immunizing a population.
Here, exact models may not be easy to formulate, though one usually tries to
make all modelling as realistic as possible. Accurate data may be impossible
to obtain, but one should always be in a position to minimize the cost of
a policy or to compare the effects of policy A against those of policy B,
however approximately. Examination of the control methods discussed in
this chapter shows that they use and extend the simple models discussed in
previous chapters.
When a policy depends only on a single variable, it is relatively easy to
minimize the cost. If two policies are to be compared, one can examine
their respective costs and choose the cheaper policy. Alternatively, if the
criterion is not cost, one can rank the policies with respect to the criterion
selected.
The control methods we describe in this chapter can be understood in
terms of the general epidemic model of Chapters 2 and 3, with pairwise
transmission rate (3 and removal rate 7, or of more complex models that
include this as the simplest case. We present strategies aimed at one or
more of the following results:
(a) depressing the number of susceptibles in the population, and, where
possible, to below the threshold level p = 7//? described earlier in the
Kermack-McKendrick criticality theorem (Theorem 2.3.1 above);
(b) accelerating the rate of removal of infectives to reduce their mixing
with the population of susceptibles (i.e. increasing 7, hence p also);
(c) lowering the pairwise rate of infectious contact between infectives and
susceptibles (i.e. decreasing /?, thereby increasing p).
For example, immunizing some or all of the population reduces the initial
number XQ of susceptibles; operating a screening program or raising public
awareness of higher disease prevalence may raise 7 or lower (5 (or both);
discouraging the assembly of large crowds reduces /?.
This assumes that condoms provide total protection from HIV infection.
Table 7.1 lists the number of new infectives expected in a year for different
values of the proportions m of regular and p of occasional condom users,
the latter using them a proportion r of the time. The values of m and p in
the table are assumed to be similar to those reported in Switzerland. The
infection rate is set at 0.05(0.05)0.15, and the number of sexual partners
178 7. The Control of Epidemics
where k > 0 is some constant, and the maximum proportions of regular and
occasional condom users, whatever sum y is spent on education, are | and
| respectively. Then the expression for C at (7.1.2) becomes
1500-
1000-
500-
0
Figure 7.1. Graph of y as a function of In k.
While there is no claim that these are realistic figures, the calculations
we have given indicate the type of decision that policy makers can attempt
to make in the allocation of funds. In this example, spending on education
a sum that is approximately 4% of the cost of medical treatment, results in
a minimal cost C.
A recent paper of Kaplan (1995) on the merits of needle exchange for
intravenous drug users provides an alternative example of HIV control by
education. O'Neill (1995) has also studied epidemic models in which be-
havioural change plays a role.
Then, a result dating back at least to Gordon Smith (1964, Figure 3) states
that v*, the minimum fraction of the population that should be immunized
so as to prevent a major outbreak, is given by
v* = 1 - l/ifo. (7.2.2)
mixing population, also holds for stratified populations with the general defi-
nition of Ft®. To see this, recall that Ro is defined as the dominant eigenvalue
of a linear mapping between strata determined by the mean number ra^
of j-stratum infectives produced by an z-stratum infective during its entire
period of infectivity. Now a random uniform immunization programme at
level v reduces the expected number of susceptibles uniformly by the con-
stant factor (1 — v). The m^ are therefore reduced similarly, and thus all
eigenvalues are reduced by the same factor. In particular the Reproduction
Ratio is (1 — v)Ro, and when (7.2.2) holds, this equals the critical level of
unity (see (7.2.1)).
That this relation fails to provide the optimum immunization strategy in a
genuinely heterogeneous population, where pairwise infection transmission
rates (3 within groups are typically higher than between groups, is intu-
itively understandable (Anderson and May (1991), p. 307). Immunizing the
same fraction in all strata results in relatively too many immunes in small
groups and too few in large groups; the deficit in the latter is not made up
by the excess in the former because the overall rate of infectious contacts
is quadratic in the group size. Thus a uniform immunization rate, to be
effective in the population as a whole, must produce more immunes than
would be needed for uniform immunity within the separate strata. Using a
deterministic model, Anderson and May showed that, if the pairwise infec-
tion rates (3ij for an z-stratum infective to transmit the disease to a specified
j-stratum susceptible are fiij = (3H OT /3C depending only on whether i = j
or i 7^ j respectively, then the optimal policy (i.e. requiring the minimum
total amount of immunization) is such that the non-immunized suscepti-
ble numbers in each stratum are identical. Again, this is understandable,
for such an immunization program reduces the susceptible populations to a
collection of identical sub-population units.
In more detail and in a stochastic setting, consider a population of m
strata with Ni susceptibles, i = 1,... ,ra, as in Example 3.5.1, in which
the pairwise infectious contact rates are (3H within each stratum and (3c
between strata. We saw at (3.5.14) (recall also Exercise 3.10) that when no
Ni is small,
Ro > P" + ( " y 1 ) f r l * , (7.2.3)
where N = Y^iLi Ni/m; equality holds, when (3H ¥" Pc, if and only if all Ni
are identical. Suppose that as a result of an immunization programme the
Ni susceptibles are reduced to TV*, say. Then the resulting Reproduction
Ratio RQ, say, would satisfy (7.2.3) with N replaced by ~N*, and the equality
182 7. The Control of Epidemics
would now hold if and only if all N* are equal. Consequently, among those
immunization programmes that reduce RQ to RQ = 1, the programme re-
quiring the smallest number of people to be immunized (i.e. the programme
for which m(N - ~N*) is least), is the one for which each Ni is reduced to
l
B* = ^=* * v l
Y,i=iNi{Ni-l) ' (7.2.6)
Table 7.3. Values of the reproduction ratio and immunization fractions for
different strategies in the case /3H ^> 7
k 1 2 3 4 5 6
™k 300 400 200 75 20 5
Quasi-optimal strategy
R* 0.4998 1.3626 1.7439 1.8933 1.9379 1.9485
^opt 0.5305 0.2019 0.0610 0.0141 0.0023 0.0000
*>unif 0.7435 0.3007 0.1050 0.0283 0.0054 0.0000
All-or-none in households > k
K 0.1499 0.9640 1.5106 1.8027 1.9114 1.9485
v
all/none 0.8592 0.4836 0.2019 0.0610 0.0141 0.0000
other policies. For example, if instead one were to immunize the largest
households completely, how much more immunization would be needed?
We discuss such questions in Example 7.2.1 and Exercises 7.1 and 7.2.
Example 7.2.1. Consider a population of m = 1000 households consisting
of rik households with k susceptibles (k = 1,... ,6) as shown in Table 7.3;
let 7 = 1, (5c — 0.0005. Suppose first that /?# » 7 so that all the analysis of
Example 3.5.2 is applicable. From equation (3.5.20) the Basic Reproduction
Ratio .Ro of the population equals 1.9485 as shown in the last column.
The rest of the entries in the table show the effects of three immunization
strategies.
The effect of the Quasi-optimal strategy, when immunization is carried
out to reduce all households to a maximum of j susceptibles, is shown
via the reproduction ratio RQ that would then be realized. This entails
immunizing a fraction vopt = Ylk>j(^ ~ J)nk/ Ylk>i ^nk °^ ^ ne population;
of course, vopt < fUnif = 1 — RQ/RO, the fraction needing immunization
under a random strategy to reduce the reproduction ratio by the same
amount.
Consider next the effect of an all-or-none strategy in which all members
in every household of more than k members are immunized (strategy (ill)).
Then the reproduction ratio R'o and fraction immunized ^aii/none a r e a s
shown. The population numbers in the households were chosen so as to
make some of the vaii/none fractions coincide with fractions vopt under the
quasi-optimal strategy but for smaller k. Observe that the ratios under
the quasi-optimal strategy are smaller than the ratios under the all-or-none
strategy when ^li/none = vopt.
We can draw at least two conclusions from this table, for this particular
distribution of households. First, to reduce the value of the reproduction
ratio to below 1, requires the immunization of about 50% more individ-
184 7. The Control of Epidemics
Table 7.4. Values of the reproduction ratio and immunization fractions for
different strategies in the case /3H = 0.27
k 1 2 3 4 5 6
nk 300 400 200 75 20 5
Quasi-optimal strategy
R* 0.4998 0.9697 1.1933 1.2802 1.3065 1.3131
^opt 0.5305 0.2019 0.0610 0.0141 0.0023 0.0000
Vunif 0.6194 0.2615 0.0912 0.0250 0.0050 0.0000
All-or-none in households > k
R'o 0.1499 0.6518 1.0235 1.2184 1.2892 1.3131
^all/none 0.8592 0.4836 0.2019 0.0610 0.0141 0.0000
uals under a random strategy than under the optimal one. Second, the
all-or-none strategy is marginally more efficient than the random strategy.
Strategies (ii) and (iv) are considered in Exercises 7.1 and 7.2.
Suppose now that /3# ^> 7 does not hold. The analysis we have given
must then be reworked starting from the matrix M at (3.5.18) and finding
its eigenvalue of largest modulus, much as in the concluding paragraph of
Section 3.5. We illustrate this approach in the context of Example 7.2.1 ex-
cept that now f3H = O.27 while the other parameter values are unchanged.
We computed the dominant eigenvalue by reduction of M to upper Hessen-
berg form via Gaussian elimination and use of the QR algorithm as in Press
et al. (1987, Chapter 11). This leads to Table 7.4 in place of Table 7.3.
Qualitatively, what is an optimal immunization strategy for an epidemic
that may spread in a community of households, for which the conditions
justifying the approximations do not hold? Intuitively, since the optimal
strategy described earlier is a function of the graph-theoretic structure of
the infectious path of the disease, then, assuming the same homogeneity
within households and the same infectivity relation between households,
we would expect the same strategy to remain optimal. In other words, the
quantitative details may change, but the qualitative principle would remain.
Survey period Number of cases Per cent increase from preceding survey
November 1985 766
October 1986 1232 61%
October 1987 1964 59%
October 1988 3136 60%
October 1989 5411 72%
October 1990 6985 29%
November 1992 11565 66% [= 29% p.a.]
Source: Thomas and Moerings (1994), p. 139.
is not compulsory. The fact that HIV is more prevalent in prisons than
in the community at large, suggests that HIV may spread more rapidly in
confinement than in the free community (see Exercise 7.2).
HIV is spread in prisons by sexual contacts and by needle sharing among
intravenous drug users (Brewer et al. 1988). The US Department of Justice
(1993) reports that there were 11 500 known cases of AIDS in Federal prisons
for the five months from November 1992 to May 1993, with an average
growth of over 50% annually during the past several years (see Table 7.5).
This was considerably greater than the rate of increase in the total number
of prisoners (roughly, 75% between 1985 and 1992), and represented an
incidence rate of over 1% (cf. less than 0.1% in the wider population then).
Thomas and Moerings's book (1994) has documented worldwide concern
about the spread of HIV/AIDS in prisons.
Blumberg and Langston (1991) raised the question of mandatory HIV
testing of prisoners on entry into jail, but such a procedure is not currently
acceptable. However, voluntary reporting or testing for HIV is possible
(Stevens, 1993; see also Hsieh, 1991), and both medical and educational
help then becomes available to HIV 4- prisoners; Siegal et al. (1993) state
that HIV-f prisoners who do not report their status do not receive medical
help. Hsieh (1991) considered a model of HIV screening that incorporates
the quarantine of infectives.
We consider in turn models for a single isolated prison, and for a prison
interacting with the outside world. Then in Section 7.3.3, we discuss a
screening and quarantine procedure for which the overall medical costs over
a fixed period of time T, including costs of treating undiagnosed HIV-h
prisoners, are minimized.
While only deterministic models are used here, their stochastic equiva-
lents can also be analysed (see Yakowitz, Gani and Blount (1996), Gani,
Yakowitz and Blount (1997) for details, and Section 4.4 above for a related
model).
186 7. The Control of Epidemics
infectives, and amongst the inflow of n new prisoners we can expect a pro-
portion fit to be infectives, where 0 < nt < 1, so at time t -f 1-f there
are
(1 - r)yt[l + 0{N - yt)] = /(y t ,Mt) (7.3.3)
infectives amongst the N prisoners.
Are there stable scenarios for this model? We could ask whether a sta-
ble regime is reached starting from t/o infectives at time 0-f under the as-
sumption that ^t — M ( a ^ *)• T° this end we examine the transformation
V *-»•fiy,^) f° r an Y fixed points lying in [0, AT]. Observe that the function
g(y) = f(y, /i) — y is concave, with #(0) = rtfi > 0 and g(N) = n(/x — 1) < 0
under our assumptions on fi. Thus, a unique stable scenario yt = ys (all t)
exists, namely
Table 7.6. Stable infective sizes in the single prison model for different n, fi, j3N
/3N
n 0.0 0.05 0.075 0.1 0.125 0.15 0.2 0.25 0.5 5
25 0.01 5.0 39.8 160.1 242.3 293.1 327.2 370.2 396.1 448.0 494.8
0.005 2.5 25.4 154.8 239.6 291.3 325.9 369.3 395.4 447.7 494.8
50 0.01 5.0 9.0 14.5 31.8 82.5 142.6 228.3 281.7 390.3 489.0
0.005 2.5 4.5 7.5 18.7 71.2 136.4 225.3 279.8 389.6 489.0
(1 -f /3N)(1 — r) < 1. In other words, (7.3.4) always gives the required root:
when fji = 0 it simplifies to
0 H/3N <r/(l-r),
(7.3.5)
N — r/[/3(l — r)] otherwise.
4)9(1-
xt + yt+qt = N. (7.3.14)
w t+ i = n/x t+ i (7.3.16)
infectives are added. At the same time, n prisoners leave, among them
(7.3.18a)
and
tt-ttft+i = (l-r)gt. (7.3.18b)
Let us now screen a proportion a (with 0 < a < 1) of the non-quarantined
prison population, and quarantine those who test HIV-h Then at £ + 1+ the
numbers of non-quarantined and quarantined infectives, and susceptibles,
are respectively
Ct. (7.3.23)
t=\
7.3. Control by screening and quarantine 191
80-
60-
40-
20-
0- I I T
0 0.1 0.2 0.3 0.4
Figure 7.2. Cost of an epidemic in prison as a function of the
screening rate cr, for finite ( ) and infinite ( ) time horizons.
Example 7.3.3 (cf. Gani, Yakowitz and Blount (1997)). Assume that
b > c > a > 0, so we can expect the optimal screening rate o1 < 1. We
confirm this expectation for the parameter values TV, /i, n and {3 much
as in Example 7.3.1, under the two scenarios of stability, and of a finite
time horizon T = 50. In general the optimal screening rate differs for these
conditions. It is arguable that the screening rate should be chosen according
to the worse scenario which, in the case of a low initial incidence rate, is
always the stationary case.
Figure 7.2 shows the average cost J(<r,T)/T and its limit for T —> oo as
functions of the screening rate a in the case /i = 0.005, n = 25, (3N = 0.15
with yo = 0. It shows that the optimal screening rate is about 10% or 15%
depending on whether a finite (T = 50) or infinite time horizon is used.
Table 7.8 shows the optimal screening rates for the two scenarios for the
range of parameter values considered.
Table 7.8. Optimal prison population screening rates for isolated prison model
(3N
n A* 0.05 0.075 0.1 0.125 0.15 0.2 0.25 0.5
25 0.01 0.0 0.027 0.065 0.100 0.129 0.182 0.228 0.396
0.038 0.082 0.119 0.151 0.180 0.231 0.276 0.440
0.005 0.0 0.0 0.032 0.065 0.094 0.146 0.191 0.357
0.008 0.051 0.086 0.116 0.144 0.194 0.237 0.396
50 0.01 0.0 0.0 0.026 0.063 0.095 0.151 0.200 0.378
0.0 0.036 0.075 0.108 0.139 0.193 0.240 0.412
0.005 0.0 0.0 0.0 0.026 0.057 0.112 0.159 0.334
0.0 0.003 0.040 0.073 0.102 0.154 0.199 0.367
For each (n,/x), the upper (lower) rows correspond to finite (infinite) time horizon.
192 7. The Control of Epidemics
In terms of our model with constant prison population size TV, the effect
of screening is two-fold: it periodically reduces the number of infectives who
may infect others, and it also reduces the size of the population where in-
fectious contact can occur. Another variable that is of concern in managing
a screening procedure is its frequency, but this is not investigated here (see
Exercise 7.6).
7.1 (Community ofm households with /3H ^> j)- In Example 7.2.1, the assump-
tion PH » 7 implies that either none or all of the members of any given
household become infected in the course of an epidemic. Regard such a com-
munity of m households as consisting of n' strata, where stratum k consists
of the nfc households with j susceptibles initially and n = max{fc : nk > 0}.
The probability that a household with j infectives would infect a specific k th
stratum household of susceptibles (assuming no other households have any
effect in the meantime) equals
T7 V
11
^_A 27 + ik/3c \ 7 +fc/fc/ J 7 Pc '
where pc = j/Pc and the approximation is valid for Pc <C 7. Then ra^.,
the mean number of k th stratum households infected by a household of j
infectives, is given by
pc
- A) «(-xr'-'f y pc
(cf. (3.5.19)), so that
RH ~ V ^ fe2yifc
PC
~ k^i
7.3 For the prison model of Section 7.3.1 investigate numerically the rate of
attainment of equilibrium values with N = 500, different /3 and yo.
7.4 In the context of the model of Section 7.3.2, show that when M » AT, the
stationary solution (Ys,ys) cannot have Ys/M <C ys/N if (5Q — /3.
7.5 In Example 7.3.2 investigate the rate of convergence to equilibrium (cf. Ex-
ercise 7.3).
7.6 In Example 7.3.3, investigate the effect on the optimal screening rate o of
screening only every d time units.
References and Author Index
194
References and Author Index 195
Becker, Niels G. (1989). Analysis of Infectious Disease Data. Chapman and Hall,
London. [107, 115, 154]
Becker, N. G. and Dietz, K. (1995). The effect of the household distribution
on transmission and control of highly infectious diseases. Math. Biosci. 127,
207-219. [180]
Becker, N. G. and Dietz, K. (1996). Reproduction numbers and critical immunity
levels for epidemics in a community of households. In Athens Conference on
Applied Probability and Time Series (Springer Lecture Notes in Statistics,
Vol. 114), C. C. Heyde, Yu. V. Prohorov, R. Pyke and S. T. Rachev (Eds),
Springer-Verlag, New York, 267-276. [180, 192]
Bennett, C. See Wilson et ah
Bernoulli, Daniel (1760). Essai d'une nouvelle analyse de la mortalite causee
par la petite verole et des avantages de l'inoculation pour la prevenir. Mem.
Math. Phys. Acad. Roy. Sci., Paris, 1-45. In Histoire dc VAcademie Royalc des
Sciences (1766). [2 -4, 6, 154]
Blount, M. See Gani, Yakowitz and Blount; Yakowitz, Gani and Blount.
Blumberg, M. and Langston, D. (1991). Mandatory HIV testing in criminal justice
settings. Crime and Delinquency 37, 5-18. [185]
Bregman, D. J. See Lui, Lawrence et al.
Brewer, T. F., Vlahov, D., Taylor, E., Hall, D., Munoz, A. and Polk, F. (1988).
Transmission of HIV1 within a statewide prison system. AIDS 2, 363-367. [185]
Brookmeyer, R. (1991). Reconstruction and future trends of the AIDS epidemic
in the United States. Science 253, 37-42. [170]
Brookmeyer, R. and Gail, M. H. (1988). A method of obtaining short-term pro-
jections and lower bounds on the size of the AIDS epidemic. J. Amer. Statist.
Assoc. 83, 301-308. [170]
Brownlee, J. (1906). Statistical studies in immunity: the theory of an epidemic.
Proc. Roy. Soc. Edinburgh 26, 484 521. [8, 175]
Brownlee, J. (1915). Historical note on Farr's theory of the epidemic. Brit. Mcd.
J. 1915(2), 250-252. [8]
Burke, M. H. See Wilson and Burke.
Burnet, M. and White, D. O. (1972). The Natural History of Infectious Disease
(4th Edn). Cambridge University Press, Cambridge. [1]
Cane, V. and McNamee, R. (1982). The spread of infection in a heterogeneous
population. J. Appl Probab. 19A, 173-184. [174]
Carlson, R. G. See Siegal et al.
Centers for Disease Control. (1990). HIV prevalence estimates and AIDS case
projections for the United States: Report based on Workshop. Morbidity and
Mortality, Weekly Report 39 (RR-16) 1-31. [170]
Clancy, D. (1994). Some comparison results for multitype epidemic models. J.
Appl. Probab. 31, 9-26. [38]
Clancy, D. See also Ball and Clancy.
Cooper, D. A. See Whyte et al.
Cox, D. R. and Miller, H. T. (1965). The Theory of Stochastic Processes. Methuen,
London. [56]
References and Author Index 197
Hsieh, Y.-H. (1991). Modelling the effects of screening in HIV transmission dy-
namics. In Differential Equations Models in Biology, Epidemiology and Ecol-
ogy, S. Busenberg & M. Martelli (Eds), Lecture Notes in Biomathematics 92,
Springer-Verlag, Berlin, 99-120. [185]
Hudson, H. P. See Ross and Hudson.
Isham, V. and Medley, G. (Eds) (1996). Models for Infectious Human Diseases:
Their Structure and Relation to Data. Cambridge University Press: Cam-
bridge, [xii, 16]
Ivannikov, Yu. G. See Baroyan, Rvachev and Ivannikov.
Jazek, Z. See Fenner et ah
Jerwood, D. See Gani and Jerwood.
Jewell, N. P., Dietz, K. and Farewell, V. T. (Eds.) (1992). AIDS Epidemiology:
Methodological Issues. Birkhaiiser, Boston. [170]
Kaplan, E. H. (1995). Probability models of needle exchange. Oper. Res. 43,
558-564. [179]
Karlin, S. and Taylor, H. M. (1975). A First Course in Stochastic Processes (2nd
Edn). Academic Press, New York. [56]
Karlin, S. and Taylor, H. M. (1984). An Introduction to Stochastic Models. Aca-
demic Press, New York. [56]
Kendall, D. G. (1948). On the generalized "birth-and-death" process. Ann. Math.
Statist. 19, 1-15. [99]
Kendall, D. G. (1956). Deterministic and stochastic epidemics in closed popula-
tions. Proc. Third Berkeley Symp. Math. Statist. Probab. 4, 149-165. Univer-
sity of California Press, Berkeley. [31-32]
Kendall, D. G. (1957). La propagation d'une epidemie ou d'un bruit dans une
population limitee. Publ. Inst Statist. Univ. Paris 6, 307-311. [62, 101, 133]
Kendall, D. G. See also Daley and Kendall.
Kermack, W. O. and McKendrick, A. G. (1927). A contribution to the mathemat-
ical theory of epidemics. Proc. Roy. Soc. Lond. A 115, 700-721. [9-11,
13, 28-29, 37-38, 52, 73, 105, 112, 123-124, 155, 175-176]
Knox, E. G. (1980). Strategy for rubella vaccination. Inter. J. Epidemiology 9,
13-23. [179]
Ladnyi, I. See Fenner et ah
Langston, D. See Blumberg and Langston.
Lawless, J. and Sun, J. (1992). A comprehensive back-calculation framework for
the estimation and prediction of AIDS cases. In Jewell, Dietz &; Farewell
(1992), 81-104. [170]
Lawrence, D. N. See Lui, Lawrence et ah; Lui, Peterrnan, et ah
Lehmann, P. See Hausser et ah
Lefevre, C. and Picard, P. (1994). Distribution of the final extent of a rumour
process. J. Apph Probab. 31, 244-249. [145]
Lui, K-J., Lawrence, D. N., Morgan, W. M., Peterman, T. A., Haverkos, H. W. and
Bregman, D. J. (1986). A model-based approach for estimating the mean incu-
bation period of transfusion-associated acquired immunodeficiency syndrome.
Proc. Nat. Acad. Sci. USA 83, 3051-3055. [169]
References and Author Index 201
205
206 Subject Index