Discrete Difference Equations: Reference Readings & Homework Exercises For
Discrete Difference Equations: Reference Readings & Homework Exercises For
Written by
Prof. Erin N. Bodine
Prepared for
Math 214: Discrete Math Modeling (Fall 2014)
Rhodes College
Discrete Difference Equations 1
The field of mathematics provides many different means for modeling the world around us. Some mathematical tools
are more appropriate for modeling certain phenomenon and less appropriate for other phenomenon. In this course we
focus on how to use mathematics to simulate biological situations in which it is reasonable to assume the underlying
variable of the model is discrete, that is, it can be written as a sequence of values. In contrast, the real number line
is continuous and for any two values a and b (no matter how close a and b are on the number line), we can always
find another value that is between a and b. We may have to increase our precision to achieve this, but it can always
be done. A sequence of discrete values is a set of points along the real number line. Thus, with a discrete model, we
are assuming that we can ignore the space between the points.
As one example, suppose we grow a cell culture in petri dish and measure the cell density (cells per unit area)
every hour for 24 hours. This would create a sequence of 25 data points (if we include the initial cell density). We
know that the cell culture is growing during the time between when we take our measurements, but measuring the
population each hour over the course of a day will give us a reasonable representation of how the culture grows over
the course of one day. Correspondingly, when we build a mathematical model to represent the growth of the cell
culture, it is reasonable for the model to only predict how the culture grows from hour to hour. Thus, the underlying
variable, time, is represented in the mathematical model as increasing in discrete, one hour increments.
For another example, consider a population which breeds seasonally, that is, only once a year. If we wanted to
determine how the population size is changing over the course of many years, we could collect data to estimate the
size of the population at the same time each year (say shortly after the breeding season ends). We know that between
the times at which we estimate the population size some individuals will die and during the breeding season many
new individuals will be born, however, we ignore the changes in population size from day to day, or week to week, and
look only at how the population size changes from year to year. Thus, when we build a mathematical model of this
population, it is reasonable for the model to only predict the population size for each year shortly after the breeding
season. Here, the underlying variable, time, is represented in the mathematical model as increasing in discrete, one
year increments.
Let us consider an actual data set for the cell growth example described above. The data in Table 1 give the optical
density of an Escherichia coli (commonly referred to as E. coli ) cell culture measured every 30 minutes for 3 hours.
The culture was grown in a nutrient solution at 37◦ C.
Cell Density (OD600 ) 0.055 0.120 0.231 0.360 0.516 0.821 1.300
Discrete Difference Equations 2
Escherichia coli : Abbreviated as E. coli, this bacteria lives in the lower intestines of endotherms (warm blooded
organisms), including humans. There are variety of strains of E. coli many of which are part of the natural microbiome
of an endotherm’s intestinal tract. However, there are some strains of E. coli which can cause severe and adverse
reactions in their hosts including vomiting, fever, aches, and diarrhea (in humans).
where the underlying variable, time, is represented as increasing in discrete 30 minute increments. Table 2 shows
both the difference and ratio of consecutive terms in the sequence representing the cell density data from Table 1.
The difference between consecutive terms in the sequence are increasing as n increases. However, with the exception
of the first two ratios, the ratios of consecutive terms appears to remain relatively constant as n increases. Suppose
the ratios of consecutive terms were constant with a value c as n increased, then
xn+1
= c ⇒ xn+1 = cxn ,
xn
Discrete Difference Equations 3
and thus we have written xn+1 as a function of xn (where f (xn ) = cxn ). Given that the ratios of consecutive terms
show little fluctuation, we might assume that the fluctuations we see are due to measurement error (either by the
spectrophotometer used to measure the optical density or by the human taking the measurements). If we assume
that the ratios of consecutive terms would be constant if not for measurement error, then what would be the value
of c? We could use the average (mean value) of all of the ratios: 1.7. We could also use the average of the last four
ratios: 1.5. Thus, we have two options for our model:
Figure 2 show a plot of the sequence of the original data (gray) against the sequences generated by model 1 (blue)
and model 2 (red). Which model appears to approximate the data better? It would appear that model 1, though
not perfect, does a better job approximating the data.
Optical Density: How does one “count” the number of cells in a cell culture? The number of cells in a culture could
be quite large (on the order of millions or billions). It would indeed be a tedious task to sit down and count the total
number of cells in such a culture. In fact, if the culture is growing (the number of cells are increasing), by the time
you counted the first 200 cells, the size of culture would likely have changed. Instead of counting the number of cells
in a culture of volume V , we sample the culture, that is, we take a representative portion of the culture and count
the number of cells in the sample. Suppose the sample has Ns cells, and a volume of Vs . If we assume that the cell
density (cells per unit volume) are the same in the whole culture as in the sample, then N/V = Ns /Vs , where N is the
total number of cells in the sample. Thus, N = V (Ns /Vs ). The ratio Ns /Vs is proportional to the optical density (i.e.
Ns /Vs = a · OD where a is a constant and OD is the optical density) and is measured using a spectrophotometer. The
sample is placed in a cuvette (essentially a rectangular test tube) and the spectrophotometer shines a beam of light (of
a specified wavelength) through the cuvette. The denser the cell culture, the less light will shine through the cuvette.
By measuring the amount of light that does not make it through the cuvette (light that scatters due to the cells in
the cuvette), the spectrophotometer measures the optical density of the cell culture sample. The intensity of the light
scattering, and thus the measure of optical density, depends on the wavelength of light being used. The wavelength
of light used, measured in nanometers (nm), is denoted by a subscript of OD. Thus, an optical density measure taken
using a light with wavelength 600 nm is indicated by OD600 .
Discrete Difference Equations 4
The equation xn+1 = axn is known as the discrete exponential growth model , and sequences generated by
functions of the form xn+1 = axn are known as geometric sequences. Given a value of x0 , then
x1 = ax0
x2 = ax1 = a(ax0 ) = a2 x0
x3 = ax2 = a(a2 x0 ) = a3 x0
..
.
xn = axn−1 = an x0 .
The equation
xn = an x0 (5)
is known as the general solution or closed form solution of the equation xn+1 = axn . Note, for Equation 5, if
you know the initial value of the sequence, x0 , you are able to determine the value of the nth term of the sequence
without iterating through all n terms (which could be tedious for large n).
Suppose a > 1, then xn+1 = axn > xn . This means that when a > 1 the (n + 1)th term in the sequence is always
larger than the term that came before it, the nth term. We describe such a sequence as an increasing sequence.
In the example where xn represents the cell density of the E. coli population at time step n, a > 1 implies that the
cell culture is growing; specifically, it is becoming more dense. By examining the closed form solution xn+1 = an x0 ,
we can see that as n becomes increasingly large (denoted by n → ∞), the value of an → ∞ if a > 1. Thus, as we
allow n to increase towards infinity, the values of the sequence always increase towards infinity. This makes intuitive
sense since this discrete difference equation models exponential growth.
If a < 1, then xn+1 = axn < xn which means the (n + 1)th term in the sequence is always smaller than the term
that came before it, the nth term, and thus we have a decreasing sequence. By examining the close form solution
xn+1 = an x0 , we can see that as n → ∞, an → 0 when a < 1. This also makes intuitive sense since the amount of x
is being reduced by some proportion with each increment of n. Note, in the special case when a = 1, xn+1 = xn for
all n, and thus every value in the sequence is the same.
The red term, xn , represents the amount of x at time step n, and the blue term, rxn , represents a proportion of x
at time step n. Thus, the amount of x at time step n + 1 is the amount of x at time step n plus a proportion of x
at time step n. Notice if r < 0 (which corresponds to a < 1), then we will be subtracting a proportion of x at time
step n.
If, for example, we knew a population was increasing 3% each year, we could represent the population size as
where xn represents the population size at year n. Note that this difference equation produces an increasing sequence
Discrete Difference Equations 5
and thus, xn → ∞ as n → ∞. If, on the other hand, the population was decreasing by 3% each year, then we would
represent the population size as
xn+1 = xn − 0.03xn = 0.97xn ,
where xn represents the population size at year n. This difference equation produces a decreasing sequence and thus,
xn → 0 as n → ∞.
Table 3: Difference quotients of consec- Note that Equation (6) can be rewritten as
utive terms of Sequence (2).
xn+1 − xn = rxn , (7)
n (xn+1 − xn ) /xn
0 1.182 which denotes the difference in the amount of x from time n to time n + 1.
1 0.925 The equation in this form reveals why we refer to these types of equations as
2 0.558 difference equations. Note, the change in x over one time step is a proportion
3 0.433 (r) of the amount at the previous time step (xn ). Thus, we say “the change is
4 0.591 x is proportional to the amount of x.” Furthermore, if we rewrite Equation (7)
5 0.583 as
xn+1 − xn
= r, (8)
xn
then we have a direct way to estimate the value of r for a model. Table 3 shows the values of the difference quotients
in Equation (8) for n = 0, 1, . . . , 5. If we average the values in the second column of Table 3, we obtain 0.619 ≈ 0.7
as an estimate of r. Note, this concurs with the value of a = 1.7 since a = 1 + r.
Example 1.
We determined that a reasonable model for the density of E. coli cells given the data presented in Table 1 is
xn+1 = 1.7xn .
Example 2.
The Forest and Wildlife Research Center at Mississippi State University measured the survival and mortality rates of bobcats (Lynx
rufus) in and near the Tallahala Wildlife Management Area through their Carnivore Ecology Research Project [8]. They studied 68
bobcats (28 males and 40 females) and found that the mean annual survival was 75% for males and 84% for females.
(a) Use a discrete difference equation model to express how many male bobcats (of the original 28) are still alive at year n. Assume that
there are 28 male bobcats at year n = 0. Express the model in closed form solution.
(b) Use a discrete difference equation model to express how many female bobcats (of the original 40) are still alive at year n. Assume
that there are 40 female bobcats are year n = 0. Express the model in closed form solution.
(c) Both models generate decreasing sequences. Which is decreasing more quickly?
(d) What is the first year in which there will be no male bobcats left (i.e. less than 1 male bobcat) of the original 28? What is the first
year in which there will be no female bobcats left of the original 40?
(a) Suppose the number of adults in the population increases by 10% each year. Use a discrete difference equation model to express how
the number of adult elk in the population at year n (express your model in the closed form solution). Does your model generate an
increasing or decreasing sequence? Determine how many adult elk are in the population after 20 years.
(b) Suppose that the probability that an adult elk survives in a given year is 90%. Use a discrete difference equation model to express
how many of the originally introduced adult still alive at year n (express your model in the close form solution). Does your model
generate an increasing or decreasing sequence? Determine how many of the originally introduced elk are still alive after 20 years.
Elk live, on average, 15 years in the wild. Does your answer make biological sense?
Time (days) 0 1 2 3 4 5 6 7 8
Digoxin (mg) 0.500 0.345 0.238 0.164 0.113 0.078 0.054 0.037 0.026
(a) Suppose the amount of digoxin in the bloodstream of the patient can be modeled using xn+1 = axn , where xn is the amount of
digoxin (in mg) in the bloodstream on day n. Use the ratio of consecutive terms in the sequence to determine the value of a.
(b) Express your model in its closed form solution.
(c) In this patient the digoxin is effective if it remains above 0.01 mg, however digoxin toxicity occurs when the amount of digoxin in
the bloodstream exceeds 0.6 mg. When would you recommend this patient take his or her next dose of digoxin.
Note, we will revisit this example and the idea of timing between doses once we have explored some more sophisticated difference
equation models.
Discrete Difference Equations 7
In the discrete exponential growth model, the change in x (represented as xn+1 − xn ) was proportional to x (that is
xn ), thus we had the equation
xn+1 − xn = rxn ,
where r is the proportionality constant which represents the rate of change. However, what if the change is not
proportional, but constant, that is what if
xn+1 − xn = b, (9)
xn+1 = xn + b, (10)
and we will refer to this model as the constant growth model . The sequence generated by Equation (10) is known
as an arithmetic sequence. As with the discrete exponential growth model, we can find a closed form solution of
the constant growth model. Given a value for x0 , then
x1 = x0 + b
x2 = x1 + b = (x0 + b) + b = x0 + 2b
x3 = x2 + b = (x0 + 2b) + b = x0 + 3b
..
.
xn = x0 + nb.
Note, if b > 0 then the values of xn increase as n → ∞, but if b < 0 then the value of xn decrease as n → ∞. In
particular, the values of xn can become negative if b < 0 and n is large enough, even if x0 > 0. This may or may not
be biologically reasonable depending on what is being modeled.
In general, we will not use the constant growth model on its own, but in combination with the exponential growth
model. In this section we will examine a variety of models (known as first order linear difference equations) of
the form
xn+1 = axn + b . (11)
|{z} |{z}
exponential constant
growth growth
Before we look at specific biological phenomenon which can be modeled using Equation (11), we shall derive the
closed form solution (see below) and discuss the notion of the model begin at equilibrium and how to determine the
long term behavior of a model (see the next section). Given x0 , then the closed form solution is derived as
x1 = ax0 + b
x2 = ax1 + b = a [ax0 + b] + b = a2 x0 + (a + 1)b
= ax2 + b = a a2 x0 + (a + 1)b = a3 x0 + a2 + a + 1 b
x3
..
.
= an x0 + an−1 + · · · + a2 + a + 1 b.
xn
Discrete Difference Equations 8
This is a way to express the closed form solution, but it seems rather unwieldy. It would be nice if that (n − 1)th
order polynomial in terms of a weren’t mucking up the equation. With some good old fashioned polynomial long
division (think back to your fondest memories of Algebra II), it can be shown that
1 − an
1 − an = (1 − a) 1 + a + a2 + · · · + an−1 = 1 + a + a2 + · · · + an−1 ,
⇒
1−a
1 − an
xn = an x0 + b,
1−a
which feels much more manageable as an equation, and will be easier to work with as we progress forward. Notice
if a = 1, then Equation (11) reduces to the constant growth model represented by Equation (10).
Equilibria
The term equilibrium refers to being in a state of balance and remaining as you are. In the case of discrete difference
equation models, a number x∗ is called an equilbrium point or fixed point of the difference equation xn+1 = f (xn )
if x∗ = f (x∗ ). Let us examine this concept for the few discrete difference equation models we have encountered so
far.
Exponential Growth Model · Recall the exponential growth model has the form xn+1 = axn , where a is a
constant. The model will be in equilibrium when x∗ = ax∗ which can only occur if a = 1. Thus, when a 6= 1, there
is no fixed point for the model which makes sense because the values of xn are either always increasing (a > 1) or
always decreasing (a < 1). When a = 1, xn = x0 for all n.
Constant Growth Model · Recall the constant growth model has the form xn+1 = xn +b, where b is a constant.
The model will be in equilibrium when x∗ = x∗ + b which can only occur if b = 0. Thus, when b 6= 0, there is no
fixed point for the model which make sense because the values of xn are either always increasing (b > 0) or always
decreasing (b < 0). When b = 0, xn = x0 for all n.
Exponential & Constant Growth Model · Recall the difference equation model with exponential and
constant growth has the form xn+1 = axn + b, where a and b are constants. The model will be in equilibrium
when
b
x∗ = ax∗ + b ⇒ (1 − a)x∗ = b ⇒ x∗ = .
1−a
This is different from the previous two examples because this does not require that a and b take on certain values
for the model to have a fixed point, and the fixed point is not necessarily the initial condition x0 . When we examine
this model as it pertains to different biological phenomenon, we will see the different ways in which we can interpret
the meaning of the equilibrium value x∗ .
Discrete Difference Equations 9
Long-Term Behavior
When you are examining the features of a model xn+1 = f (xn ), it if often useful to know what happens to the model
as n → ∞. This is known as the long-term behavior of the model. We already examined the long term behavior
of the exponential growth model and constant growth model under various parameter scenarios (a > 1, a = 1, a < 1
for the exponential growth model, and b > 0, b = 0, b < 0 for the constant growth model).
It turns out there is another, more visual way to examine the long-term behavior of these models called cobweb
diagram analysis. On the xn , xn+1 plane (where xn is the horizontal axis and xn+1 is the vertical axis), we plot
xn+1 = f (xn ) and xn+1 = xn . On the xn (horizontal) axis, locate x0 . Draw a vertical line from x0 on the horizontal
axis to the xn+1 = f (xn ) curve. Your height on the vertical axis now represents x1 . To locate x1 on the horizontal
axis, draw a horizontal line from (x0 , x1 ) to the xn+1 = xn line; you are now at (x1 , x1 ). Then, (starting with
n = 1)
1. Draw a vertical line from (xn , xn ) to the curve xn+1 = f (xn ); you are now at (xn , xn+1 ).
2. Draw a horizontal line from (xn , xn+1 ) to the line xn+1 = xn ; you are now at (xn+1 , xn+1 ).
3. Repeat steps 1 & 2 with increasing values of n as many times as needed to see a pattern.
Figure 3 shows what this looks like for the discrete exponential growth model when (a) a > 1 and (b) a < 1. In
each graph the blue curve represents xn+1 = axn and the black line represents xn+1 = xn . The dotted red line show
the “cobwebbing”. In Figure 3a (where a > 1) we can see that the cobwebbing (dotted red line) steps up and away
from x0 as n increases. This is because the values of xn (and thus xn+1 ) are increasing as n increases. In Figure 3b
(where 0 < a < 1) we can see that the cobwebbing (dotted red line) steps down towards (0, 0) as n increases.
This method may seem like overkill for these simple difference equations but it will be very handy for analyzing the
long-term behavior of xn+1 = axn + b.
Figure 3: Cobweb diagrams for the discrete exponential growth model for (a) a > 1 and (b) a < 1.
Note that for the difference equation model with exponential and constant growth, xn+1 = axn + b, it is a little
harder to determine the long term behavior because their are two parameters, a and b, and the relative size and sign
of each parameter will have a bearing on what happens to the values of xn as n → ∞. Suppose x0 > 0. If a > 1 and
b > 0 we would expect the population to grow, and if a < 1 and b < 0 we would expect the population to decline.
Discrete Difference Equations 10
And indeed, in both cases we are correct as shown in the cobweb diagrams depicted in Figures 4a and 4b. Notice
that in the case where a < 1 and b < 0, the values of xn quickly become negative despite the fact that x0 > 0.
Now, suppose a < 1 and b > 0. What do we expect to happen in this case. The value of a will cause a decrease
while the value of b will cause an increase. If we examine the cobweb
diagram in Figure 4c we see that the lines
b b b
xn+1 = xn and xn+1 = axn + b cross in the first quadrant at the point 1−a , 1−a . Notice if we choose x0 = 1−a ,
then xn = b
1−a for all n, and thus x∗ = b
1−a is an equilibrium point. If we choose x0 such that 0 < x0 < b
1−a , then
b b b
the values of xn increase towards 1−a . If we choose x0 > 1−a , then the values of xn decrease towards 1−a . Thus, no
b
matter what value of x0 we choose, as n → ∞ the values of xn approach 1−a . Since the sequence of values approach
the equilibrium point, we classify the fixed point as a stable equilibrium.
Lastly, suppose a > 1 and b < 0. If we examine the cobweb diagram in Figure
4d we see again that the lines
b b b
xn+1 = xn and xn+1 = axn + b cross in the first quadrant at the point 1−a , 1−a , and that if we choose x0 = 1−a ,
then xn = b
1−a for all n, and thus x∗ = b
1−a is an equilibrium point. However, if we choose x0 such that 0 < x0 < b
1−a ,
b
then the values of xn decrease towards −∞. If we choose x0 > 1−a , then the values of xn increase towards ∞. In
b
this case, not matter what value of x0 we choose, with the exception of x0 = 1−a , as n → ∞ the values of xn move
b
away from 1−a , and we classify this type of fix point as an unstable equilibrium.
Discrete Difference Equations 11
Example 3.
Consider a lake fish population which increases on average by 20% annually. Each year, fishing on the lake is allowed until 1200 fish are
caught. Thereafter, fishing is banned. Currently, there are 12,230 fish in the lake.
(a) Write a difference equation for the lake fish population and find the general solution.
(b) How many fish are in the lake after 5 years?
(c) If the resource managers of the lake wanted the population to remain constant each year, what level of harvesting should they allow?
Note, this value is referred to as the maximum sustainable yield (MYS) .
(d) How is the maximum sustainable yield related to the equilibrium value of this system?
Example 4.
Consider a lake fish population whose yearly birth rate is 1.5 and whose yearly death rate is 0.7. Each year, fishing on the lake is allowed
until 1200 fish are caught, thereafter, fishing is banned. Currently, there are 12,230 fish in the lake. If the resource managers of the lake
wish to keep the fish population constant from year to year, how many fish are needed to stock the lake each year?
Exercise 4 – Buffalo
A population of buffalo increases in size by about 10% each year. Let xn be the population count after n years. Suppose that hunting
allows h buffalo to be removed from the herd each year.
Exercise 5 – Trout
Suppose that in a trout population increases its own numbers by 10% each year. After the births occur each year, 100 young trout are
added to the population in an effort to build up the population. Let xn denote the size of the population after n years, and assume
x0 = 1000.
Pharmacokinetics is a branch of pharmacology which quantifies how the concentration of a substance administered
to a living organism declines over time (from the moment it is introduced to the body/organism to the point at which
it is entirely eliminated from the body). You have already seen one example of pharmacokinetics in Exercise 3 where
you expressed the decay of digoxin (a drug used in treated heart disease) as a discrete exponential decay model. In
that example, only once dose was given. In this section we examine what occurs to the concentration of a drug in
the body when we give successive doses of the same drug over some period of time.
Discrete Difference Equations 12
Notice, as the value of k increases, the value of t∗ decreases, i.e., the time at which half of the drug is eliminated
from the body occurs sooner.
Now suppose an additional dose of size b is given every τ units of time. After τ units of time, the amount of the
original dose left in the body is
x(τ ) = be−kτ ,
as given by our assumption in Equation (12). If we add another dose, then at time τ we will have be−kτ + b. If we
let x0 represent the concentration o the drug in the body at the time the initial dose is given, and x1 represent the
concentration of the drug in the body at the time the second dose is given then
x0 = b and x1 = x0 e−kτ + b.
x2 = x1 e−kτ + b.
We know that the equilibrium for the linear difference equation model is given by b
1−a , and that a = e−kτ , thus the
equilibrium is
b
x∗ = .
1 − e−kτ
In this drug dosing model, the equilibrium represents the maximum amount of drug that could possibly accumulate
in the body when a dose of size b of a drug with decay rate k is being given every τ units of time. This is useful
information when you want to make sure the amount of drug in the body does not exceed a certain toxicity level.
Notice that as the value of τ is increased the value of x∗ decreases. This makes intuitive sense, since we would expect
that as we allow more time to pass between doses, a patient should not be able to accumulate as much drug in their
body.
Example 5.
Recall in Exercise 3 you constructed the difference equation xn+1 = 0.69xn where xn represented the amount (in mg) of digoxin (a drug
used in treating heart disease) in a patient after n days, and x0 = 0.5 mg.
(a) In a particular patient taking 0.5 mg doses, digoxin is effective if it remains above 0.01 mg, however digoxin toxicity occurs when the
amount of digoxin in the bloodstream exceeds 0.6 mg. What would be a suitable dosing strategy for this patient?
(b) Given the dosing schedule you proposed in (a), what is the maximum amount of drug that could possibly accumulate in the patient’s
body?
(c) Suppose it is easier for this patient to take digoxin daily. What drug dose size should be patient take each day in order for the digoxin
to remain effective and non-toxic?
Exercise 6 – Cipro
Cipro is an antibiotic taken to combat many infections, including anthrax. Cipro is filtered from the blood by the kidneys at a rate of
about 33% per 24-hours (that is, after 24 hours only one third of the Cipro initially in the blood remains). Suppose a patient is given a
500 mg dose each day for many days.
(a) If you wish to express the drug dosing scheme using Equation (13), what are the values of k, τ , and b? Assume that xn is measured
in mg.
(b) What is the maximum amount of drug that could possibly accumulate in the patient’s body?
(c) Suppose the patient taking Cipro will experience detrimental side effects if the amount of Cirpo in her bloodstream exceeds 700 mg
(this can be calculated based on the patient’s body mass index). Will the dosing scheme need to be changed. If so, propose a new
dosing scheme.
x(t) = x0 e−0.05t ,
where t is measured in milliseconds. Thus, if x0 is the present charge on the cell, the charge remaining after 2 milliseconds is
(a) Write a difference equation representing xn+1 in terms of xn , and find the general solution assuming x0 = 0.
(b) The neuron will fire as soon as the total charge on the cell exceeds 4 millivolts. How frequently will the neuron fire?
Discrete Difference Equations 14
7 Population Genetics
Population genetics is the study of how the genetic composition of a population changes over time. If you are
unfamiliar with basic genetic theory please read through the Genetic Terminology box.
Genetic Terminology
Gene · Genes are genetic material on a chromosome that code for a trait. Often a trait is determined by multiple
genes or even material on multiple chromosomes. For example, human eye color is determined by genes on two different
chromosomes. A gene may be determined by the genetic information at a single location, it’s locus, or by genetic
material at several locations, or loci. Diploids are organisms with two copies copies of each chromosome (one from each
parent), except for the sex chromosome, and thus two copies of each gene (one copy at each locus on each chromosome
in a pair). Haploids are cells or organisms with one copy of each chromosome, and thus one copy of each gene.
Allele · An allele is a form of a gene at a single locus. For example, in humans there is an eye color gene on
chromosome 15 with two possible alleles: B (brown) and b (blue). The gene for common blood type in humans has
three alleles: A, B, and O. Some alleles are dominant over others, known as dominant alleles, while other alleles are not
dominant, known as recessive alleles. For genes with only two alleles we typically use capital letters to denote dominant
alleles and lower case letters to denote recessive alleles. For example, the dominant allele in the eye color gene is the
allele for brown eyes, B, while the allele for blue eyes, b, is recessive. A gene typically contain pairs of alleles where one
allele is inherited from the father and the other is inherited from the mother.
Genotype · A genotype is the set of alleles an organism carries. For diploid organisms like humans, each gene
contains one or more pairs of alleles and may involve alleles at multiple loci. Genotype is denoted as a pair (or pairs) of
letters that represent the pair (or pairs) of alleles for that particular gene. For example, the eye color gene at one locus
can have alleles B (brown) and b (blue), resulting in four possible genotypes: BB, Bb, bB, and bb. The common blood
type gene has three possible alleles, A, B, and O, resulting in nine possible genotypes: AA, BB, OO, AB, BA, AO,
OA, BO, and OB. When it does not matter which of the alleles came from which parent, genotypes like Bb and bB are
considered the same genotype. Genes with two dominant alleles are known as homozygous dominant, genes with two
recessive alleles are called homozygous recessive, and genes with different alleles at a locus are known as heterozygous.
Phenotype · A phenotype is the physical expression of a trait, as determined by the interaction between genetics
and developmental or environmental influences. For traits determined by a gene at a single locus, the gene will express
the dominant allele if its genotype is homozygous dominant or heterozygous, and the recessive allele if its genotype is
homozygous recessive. Under the simplest conception of the of human eye color phe-
Table 4: Blood type genotypes
notypes, the genotypes BB and Bb have the phenotype brown eyes, while genotype bb and corresponding phenotypes.
has the phenotype blue eyes. In the heterozygous case, the dominant allele (B) masks
the recessive allele (b).1 For the common blood type gene, the allele O is recessive Phenotype Genotypes
while A and B are dominant. This arises because alleles A and B each result in the
AA
production of their own antigens while allele O is inactive. Thus, the phenotypes are A
AO, OA
determined by the antigens produced (see Table 4). In the case where the genotype
is AB or BA, neither allele dominates the other, both antigens are produced, and an BB
B
BO, OB
additional blood type is formed.
AB AB, BA
Punnett Square · A Punnett square (named after Reginald C. Punnett) is
a diagram used to show the potential genotypes resulting from a mating where the O OO
genotype of each of the parents is known. For example, if a woman with brown eyes ♀
(genotype Bb) and a man with blue eye (genotype bb) mated, the Punnett square in B b
Figure 7 shows the possible genotypes of their offspring. ♂ b bB bb
Carriers of Recessive Alleles · Many genetic diseases, like sickle-cell ane- b bB bb
mia and albinism, are only expressed in the phenotype if an individual has the ho-
mozygous recessive genotype. In these cases, the individual is referred to as a “carrier” Figure 7: Example of a Punnett
square with Bb(♀) × bb(♂).
of the genetic disease if their genotype is heterozygous. For example, the hemoglobin
gene has dominant allele S and recessive allele s. A person is a carrier for sickle-cell anemia if they have genotype Ss.
1 This is a simplified view of human eye color control, since it is actually controlled by multiple genes and there are more eye colors
Suppose we want to keep track of the proportion of genotypes of one particular gene within a population over several
generations, and suppose that gene has two possible alleles, A (dominant) and a (recessive). Let P be the frequency
of the homozygous dominant genotype (AA), Q be the frequency of the heterozygous genotype (Aa), and R be the
frequency of the homozygous recessive genotype (aa). Thus, if 20% of the individuals in the population have the
homozygous dominant genotype, then P = 0.20. Since P , Q, and R are proportions, and they represent all possible
genotypes (for this particular gene) within the population, P + Q + R = 1.
If there are N individuals in the population, then there are 2N alleles (for this particular gene). This means
that
Let p represent the proportion of alleles (for this particular gene) which are A, and let q represent the proportion of
alleles that are a. Then,
2P N + QN 1 QN + 2RN 1
p= = P + Q, and q= = Q + R.
2N 2 2N 2
Since p and q represent proportions and A and a are the only two possible alleles (for this particular gene), the values
of p and q should sum to 1. And indeed,
1 1
p+q = P+ Q + Q + R = P + Q + R = 1.
2 2
A classic method for modeling the allele frequencies of a population from one generation to the next is known as
the Hardy-Weinberg model. The model was developed independently by G. H. Hardy (mathematician) and Wilhelm
Weinberg (physician) in 1908. The Hardy-Weinberg model has eight assumptions [2]:
1. The organism is diploid, sexual, and has discrete generations. Discrete generations refer to a life history like that
of an annual plant, in which the parental generation has died by the time the offspring generation reproduces.
3. Mendelian segregation occurs, which means that individuals with the heterozygous genotype produce equal
numbers of gametes (haploid reproductive cells, e.g., eggs and sperm) containing each allele. For example, an
Aa individual produces equal numbers of A and a gametes. There are a few genes that violate this assumption;
this condition is known as meiotic drive or segregation disorder. When meiotic drive occurs, once allele in
heterozygous individuals is overrepresented in the gametes.
4. Random mating occurs, meaning that mating is random with respect to the genotypes under consideration (it
may be non-random with respect to genotypes at other loci).
5. There are no mutations (permanent change to the DNA molecule), or at least the mutation rate is negligible,
Discrete Difference Equations 16
6. There is no migration (movement of individuals between populations). This assumption is also referred to as
the population having no gene flow.
7. There is no random genetic drift which refers to fluctuations in allele frequencies that occur by chance, particu-
larly in small subpopulations, as a result of random sampling error in the choice of gametes that form the next
generation. For large populations with random mating, it is reasonable to assume there is no genetic drift.
8. There is no natural selection. Natural selection refers to a consistent (over multiple generations) relationship
between fitness and phenotype, or differences in fitness among genotypes.
Let pt be the allele frequency of A in generation t, then qt = 1 − pt is the allele frequency of a in generation t.
Since the allele frequencies are the same in both sexes (Assumption #2), we can assume that in both males and
females that allele frequencies of A and a are pt and qt , respectively, and the genotype frequencies of AA, Aa, and
aa are Pt , Qt , and Rt , respectively. If we assume random matting occurs (Assumption #4), then we can assume the
matings occur in proportion to the genotypic frequencies in the population. For example, the proportion of the male
population in generation t that are genotype AA is Pt , and the proportion of the female population in generation t
that are genotype AA is also Pt , thus probability of a AA × AA mating is (Pt )(Pt ) = (Pt )2 . Similarly, the probability
of a AA × Aa matings is the sum of the probability of a AA(♂) × Aa(♀) mating and a AA(♀) × Aa(♂) mating, i.e.
(Pt )(Qt ) + (Pt )(Qt ) = 2Pt Qt . The probability of each of the six types of mating occurring (denoted as the mating
frequency) are given in Table 5.
AA × AA (Pt )2 1 0 0
AA × Aa 2Pt Qt 1
⁄2 1
⁄2 0
AA × aa 2Pt Rt 0 1 0
Aa × Aa (Qt )2 1
⁄4 1
⁄2 1
⁄4
Aa × aa 2Qt Rt 0 1
⁄2 1
⁄2
aa × aa (Rt )2 0 0 1
Using a Punnett square, the frequency of each genotype in the offspring can be determined for each mating type (see
Table 5). The genotype frequency of AA in generation t + 1 is
2
1 1 1 1
Pt+1 = (1)(Pt )2 + (2Pt Qt ) + (Qt )2 = (Pt )2 + Pt Qt + (Qt )2 = Pt + Qt = (pt )2 . (14)
2 4 4 2
Notice that
2 2 2 2
Pt+1 + Qt+1 + Rt+1 = (pt ) + 2pt qt + (qt ) = (pt + qt ) = (pt + (1 − pt )) = 1,
as it should since Pt+1 , Qt+1 , and Rt+1 are the frequencies of the only three possible genotypes for this particular
gene. Using the fact that pt+1 = Pt+1 + 12 Qt+1 and qt = 1 − pt , we find that
1
pt+1 = Pt+1 + Qt+1 = (pt )2 + pt qt = (pt )2 + pt (1 − pt ) = (pt )2 + pt − (pt )2 = pt .
2
Thus, we have just shown that the allele frequency of A in generation t + 1 is equal to the allele frequency of A in
generation t, i.e. the allele frequency of A (under the eight Hardy-Weinberg assumptions) does not change over time.
Furthermore, since the allele frequency of A does not change over time, the allele frequency of a also does not change
over time,
qt+1 = 1 − pt+1 = 1 − pt = qt .
The Hardy-Weinberg Principle encapsulates what we have just shown mathematically and states that the allele
and genotype frequencies in a population will remain constant over time given the eight assumptions above (i.e. we
are leaving out all the interesting and complicating factors). Note, the Hardy-Weinberg principle is sometimes also
referred to as the Hardy-Weinberg equilibrium.
The Hardy-Weinberg model is as simple as it get when it comes to modeling population genetics. To add some
realism to this model, we can “relax” one of the eight assumptions and modify the model accordingly. We will start
by relaxing Assumption #6 and assume that some migration is occurring. To keep things relatively simple, we will
focus on one-way migration.
Suppose an island population receives migrants from a large source (continent) population. This migration causes
gene flow in the direction of the island population (i.e. the influx of migrants to the island each generation changes
the natural Hardy-Weinberg equilibrium that would occur in the absence of the migrants). Gene flow from the island
population to the continent population may exist, but we will assume it to have a negligible effect on the continent
population.
Let m be the proportion of the total number of island inhabitants in each generation which are migrants, and thus
(1 − m) represents the proportion of native inhabitants. Let q̂ be the frequency of allele a in the continent population
(we will assume this to be constant over time), and qt to be the frequency of allele a in the island population in
generation t. Then in generation t + 1,
qt+1 = (1 − m)qt + mq̂. (17)
Equation (17) is referred to as the Continent-Island model and was originally developed by Sewall Wright in 1931.
Note this is different from the Hardy-Weinberg model which assumes qt+1 = qt . Now, only a fraction of qt+1 come
from the native island inhabitants, (1 − m)qt , with the rest coming from the migrant island inhabitants. Note,
this model assumes that once migrants are on the island they will assimilate into the island population and mate
Discrete Difference Equations 18
Notice that Equation (17) can be rewritten as qt+1 = aqt + b where a = (1 − m) and b = mq̂. Thus, the continent-
island model is really just a first order linear difference equation (exponential and constant growth) in disguise. Thus,
the closed form solution is
1 − (1 − m)t
qt = (1 − m)t q0 + mq̂ = (1 − m)t q0 + q̂ − (1 − m)t q̂ = (1 − m)t (q0 − q̂) + q̂.
1 − (1 − m)
What happens to the allele frequency of qt over many generations? Note that 1 − m < 1, and thus (1 − m)t → 0
as t → ∞. Thus, qt → q̂ as t → ∞. This means the allele frequencies of A and a on the island approach the allele
frequencies of A and a on the continent as successive generations of island inhabitants mate with migrants from the
continent.
Example 6.
Red wolves (Canis rufus) historically occurred throughout southeastern North America, but by the 1960s, they were confined to a small
population in Louisiana and Texas where there was hybridization with the much more abundant coyote (Canis latrans). Wolves from this
population were captured to start a captive population in 1987, this captive population was used to establish a wild population in eastern
North Carolina. However, over the next decade, coyotes colonized this area, and in the late 1990s, it was estimated that approximately
15% of the litters in the newly established population were hybrid.
To examine the impact of coyote introgression on red wolf ancestry, assume that the “island” red wolf population is receiving gene
flow from the “continent” coyote population. Assume that in the initial generation the “island” red wolf population has 100% red wolf
ancestry, and the “continent” coyote population has 0% red wolf ancestry. Additionally, in hybrid litters, half the ancestry (genes) are
from the red wolves, and half are from the coyotes. Assume the generation length of the red wolves is approximately 5 years, and that
t = 0 corresponds to 1990.
(a) To form the continent-island model, what are the values of m, q0 , and q̂?
(b) Find the closed form solution of the continent-island model using the parameter values found in (a).
(c) What proportion of red wolf ancestry remains in the red wolf population after 50 years (i.e. in 2040)?
1. Suppose there is a 50% reduction in hybrid litters. What is the new value of m? What proportion of red wolf ancestry remains in
the red wolf population in 2040 if the management actions are put into place in (i) 1990, (ii) 2000, and (iii) 2010?
2. Suppose there is a 90% reduction in hybrid litters. What is the new value of m? What proportion of red wolf ancestry remains in
the red wolf population in 2040 if the management actions are put into place in (i) 1990, (ii) 2000, and (iii) 2010?
The last several sections have focused on biological examples which can be modeled using the linear first order
difference equation xn+1 = axn + b, where a and b are constants. However, there are many biological systems whose
dynamics cannot be captured by this linear model. In this section, we will explore some biological dynamics which
are best modeled using nonlinear first order difference equations.
Discrete Difference Equations 19
Let’s return to our first motivating example of the growth of a culture of E. coli cells. Table 1 shows the optical
density of an E. coli cell culture measured every 30 minutes for 3 hours. However, Table 1 gives an incomplete data
set. Data was actually collected every 30 minutes for 9 hours. Table 6 shows the full set of data and Figure 8 shows
a graph of the full set of data (in gray, filled circles).
Time (hours) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5
Cell Density (OD600 ) 0.055 0.120 0.231 0.360 0.516 0.821 1.300 1.752 2.102 2.683
Time (hours) 5.0 5.5 6.0 6.5 7.0 7.5 8.0 8.5 9.0
Cell Density (OD600 ) 3.163 3.532 3.725 3.882 3.993 4.076 4.002 4.205 4.131
xn → K. Thus, as the values of xn approach K, the difference between xn+1 and xn will go to zero, i.e., the growth
rate is shrinking to 0. For the data given in Table 6 it can be estimated that r = 0.64 and k = 4. Figure 8 shows a
graph of both the data and the model (in red, empty circles).
We can reduce the number of parameters in the discrete logistic growth model by “normalizing the model” with
respect to the carrying capacity. Let un = xn /K, then xn = Kun , and Equation (18) becomes
Not we refer to replacing xn with Kun as either a substitution or change of variables. What is the equilibrium of
the normalized discrete logistic model? Recall, the equilibrium occurs when un+1 = un . Let u∗ be such a value.
Then,
u∗ = u∗ + ru∗ (1 − u∗ ) ⇒ 0 = ru∗ (1 − u∗ ),
which will hold true when u∗ = 0 or u∗ = 1. Thus, the normalized discrete logistic model has two equilibria: (1)
when the population size is 0 (i.e., extinct), and (2) when the population size has reached carrying capacity.
Example 7.
Use cobweb diagrams and sequence plots to explore the dynamics of the normalized discrete logistic model as the parameter value r is
increased from 0.2 to 3.1.
The solutions to this example will be worked in class.
(a) Show that the substitution ut = 1/xt transforms Equation (20) into a linear difference
equation (i.e., it has the form ut+1 = aut + b, where a and b are in terms of r and K).
(b) Given the linear difference equation you generated in part (a), find the closed form solution
to the Beverton-Holt equation.
(c) Find the equilibrium for the transformed Beverton-Holt equation. Interpret this equilib- Figure 9: Plot of sequence produced by
rium in terms of the original Beverton-Holt equation. the Beverton-Holt difference equation.
(d) The original Beverton-Holt has an additional equilibrium. What is it?
(e) To determine the stability of each equilibria, consider two cases: (i) r > 1, and (2) r < 1.
Use the closed form solution of the Beverton-Holt equation to determine the stability of
each equilibria in each case.
(f) Use Matlab to plot the sequences generated by the Beverton-Holt difference equation and
the logistic difference equation for x0 = 0.2, K = 1, and (i) r = 0.25, (ii) r = 0.75, (iii)
r = 1.25, (iv) r = 1.90, (v) r = 2.20, (vi) r = 2.50, and (vii) r = 3.00. Describe the ways
in which the Beverton-Holt and logistic difference equations are similar and the ways in
which they are different.
Natural Selection
Recall in Section 7 we examined the Hardy-Weinberg model. The model had eight assumptions, and when we
relaxed the “no migration” assumption (Assumption #6) we developed the continent-island model. In this section,
we will develop a model for when we relax the “no natural selection” assumption (Assumption #8). Recall that
natural selection refers to a consistent (i.e., over multiple generations) relationship between fitness and phenotype,
or differences in fitness among genotypes.
Let A and a be the two possible alleles for a particular gene of interest where A is the dominant allele and a is the
recessive allele. Let pt be the allele frequency of A in generation t, and qt = 1 − pt be the allele frequency of a in
Discrete Difference Equations 21
Table 7: Genotype frequencies in generation t + 1 with and without natural selection, where w̄ = w1 p2t + 2w2 pt qt + w3 qt2 .
AA Aa aa
generation t. In the absences of natural selection, according to the Hardy-Weinberg model, the frequency of each
possible genotype (AA, Aa, and aa) in generation t + 1 are given in Equations (14) - (16), and shown again in Table
7. In the presence of natural selection, each genotype may confer a different relative fitness (relative with respect
to the other genotypes). Let w1 be the relative fitness of genotype AA, w2 the relative fitness of genotype Aa, and
w3 the relative fitness of genotype w3 . We define the mean fitness of the population, w̄, as the sum of the relative
contributions of each genotype, that is,
w̄ = w1 p2t + 2w2 pt qt + w3 qt2 . (21)
Then, the frequency of each genotype in the presence of natural selection is the ratio of the relative contribution of
each genotype to the mean fitness (see Table 7). Thus, in the presence of natural selection we have that
Notice that, as with the Hardy-Weinberg model (without selection), Pt+1 + Qt+1 + Rt+1 = 1.
As with the continent-island model, let us form construct a difference equation to model the frequency of the
recessive allele, a, over several generations. This time we will assume there is no gene flow, but that there is natural
selection.
1 w2 pt qt + w3 qt2
qt+1 = Qt+1 + Rt+1 = (23)
2 w1 pt + 2w2 pt qt + w3 qt2
2
Equation (23) expresses qt+1 as a function of pt and qt . To express qt+1 solely as a function of qt we can use the fact
that pt = 1 − qt , and thus,
w2 (1 − qt ) qt + w3 qt2
qt+1 = 2 . (24)
w1 (1 − qt ) + 2w2 (1 − qt ) qt + w3 qt2
We will refer to Equation (24) as the natural selection difference equation model .
We can change of the relative fitness values of w1 , w2 , Table 8: Relative fitness values for different fitness relationships.
and w3 (mathematically described as weight values or
weight constants) to reflect different “fitness relation- Genotype
ships” between genotypes. Recall that the values of w1 , AA Aa aa
w2 , and w3 reflect the relative fitness of each genotypes.
General relative fitness w1 w2 w3
Thus, if two genotypes have the same relative fitness,
Lethal recessive 1 1 0
their weight values should be the same. If one genotype
Detrimental alleles (recessive) 1 1 1−s
has a greater fitness than another, than the relative fit-
Detrimental alleles (additive) 1 1 − s/2 1−s
ness of the former genotype should be larger than that
of the latter genotype. The relative fitnesses of each Dominance (purifying selection) 1 1 − hs 1−s
Dominance (positive selection) 1+s 1 + hs 1
Heterozygote advantage 1 − s1 1 1 − s2
Heterozygote disadvantage 1 + s1 1 1 + s2
Discrete Difference Equations 22
Example 8.
Lethal Recessive Submodel · There are some genes for which the homozy-
gous recessive genotype has the phenotype of a lethal disease which causes pre-
reproductive death, resulting in the fitness of the homozygous recessive genotype
being 0, i.e., w3 = 0. Some examples of such diseases in humans are Tays-Sachs
disease and cystic fibrosis.
In Drosophila melanogaster (also known as the common fruit fly or vinegar fly),
the homozygous recessive genotype of the gene glued causes pre-reproductive
death, and the heterozygous genotype reduces eye size and affects eye appear-
ance. Two replicate populations of Drosophila melanogaster were initiated with
all heterozygotes so that q0 = 0.5. The frequency of the recessive allele was
recorded for 7 successive generations for each replicate population (data shown
Figure 10: The frequency of the recessive (lethal)
glued allele for two D. melanogaster population repli- in Figure 10). We will compare two different lethal recessive submodels of the
cates each starting with all heterozygotes (q0 = 0.5). natural selection difference equation model and determine which more accu-
rately represents the data shown.
(a) For the first model, we will assume that the recessive allele only causes a reduction in fitness in the homozygous recessive genotype,
that is, the homozygous dominant and heterozygous genotypes have the same relative fitness: w1 = 1, w2 = 1, and w3 = 0. Given
the values of w1 , w2 , and w3 , simplify Equation (24), and determine a closed form solution. What happens to the values of qt as
t → ∞?
(b) For the second model, we will assume that in additional to the lethality of the homozygous recessive genotype the defects in the
heterozygotes (reduced eye size and altered eye appearance) cause a reduction in fitness in the heterozygotes such that w1 = 1,
w2 = 2/3, and w3 = 0. Note, this would be a case of “dominance (purifying selection)” with s = 1 (lethal recessive) and h = 2/3.
Given the values of w1 , w2 , and w3 , use a cobweb diagram to determine what happens to the values of qt as t → ∞.
(c) Which model best describes the data shown in Figure 10?
(d) For each model calculate the number of generations needed for the recessive allele frequency to fall below 0.01. Which fitness
relationship removes the recessive allele frequency from the population more quickly? Does this make intuitive sense?
Example 9.
Detrimental Recessive Submodel · In many instances, there is no complete selection against the homozygous recessive genotype,
and thus, the relative fitness of the homozygote is only partially reduced (when compared with other genotypes). For many human
genetic diseases, such as albinism and sickle cell anemia, homozygous recessive individuals can survive and produce progeny, although
the probability of this occurring is reduced compared with that of other individuals. In Drosophila, mice, corn, and other organisms that
have been investigated in detail genetically, there are many examples of recessive morphologic mutants that reduce fitness of homozygotes
but do not cause lethality. In the case of a detrimental (but non-lethal) recessive allele, w1 = 1, w2 = 1, w3 = 1 − s, where 0 < s < 1.
(a) Given the values of w1 , w2 , and w3 , simplify Equation (24) to form the detrimental recessive submodel.
(b) What are the equilibria of the detrimental recessive submodel?
(c) Use a cobweb diagram to determine what happens to the values of qt as t → ∞ for different values of q0 and s. What affect does
increasing the value of s have on the model?
Discrete Difference Equations 23
Incompatible combinations can only occur when the mother is Rh negative and the father is Rh positive, that is matings RR(♂) × rr(♀)
and Rr(♂) × rr(♀). In the matings RR(♂) × rr(♀), all offspring are incompatible with the mother (since all progeny have genotype Rr
which is Rh positive). In the matings Rr(♂) × rr(♀), half of all offspring are Rr while the other half are rr. Thus, only half of the progeny
are Rh positive and incompatible with the mother. Since a mother who is Rh negative will never produce progeny with genotype RR,
selection is only occurring against heterozygous progeny.
(a) Assuming relative fitnesses of w1 = 1 + s1 , w2 = 1, and w3 = 1 + s2 , simplify Equation (24) to form the heterozygous disadvantage
model.
(b) Find all equilibria for the model you formed in (a). Hint: There are 3 equilibria, and you will need to use the quadratic formula to
find two of them.
(c) Assume that s1 = 1 and s2 = 0.5 use a cobweb diagram to determine the stability of each of the equilibria.
(d) Most human populations have high frequencies of the R allele, but Basques, who live in the Pyrenees Mountains between Spain
and France, have a high frequency (q = 0.65) of the r allele. Assuming that the allele frequency of r in Pyrenees Moutain Basque
population is at equilibrium determine an expression for the relationship between s1 and s2 using the non-trivial equilibrium found
in part (b). Is the advantage greater for the homozygous dominant or homozygous recessive genotype?
Discrete Difference Equations 24
Biological processes rarely happen in isolation. Interactions such as those between predators and prey, competitors
for limited resources, and many biochemical processes at the cellular level are major drivers of biological complexity.
These interactions are also rarely linear; that is, we typically cannot derive how one process affects another by
assuming that one is proportional to the other. In this section we will explore how we can use discrete difference
equations to model a biological system consisting of multiple species, classes, or organism groups.
Predator-Prey Systems
Examine the data in Figure 12a. What do you notice? You should
observe that both populations cycle over roughly a 10 year period.
Additionally, the cycles in the lynx pelts harvested (on average) lags
behind the cycles in the hare pelts harvested by 1 to 3 years.
where r > 1 is the growth rate of the prey in the absence of the predator, 0 < s < 1 is the survival rate of the
predator in the absence of its prey source, α > 0 is the consumption rate of the predators (where αxn is the average
number of prey eaten per predator in time step n), and β > 0 is the growth rate of the predator population due to
the consumption of prey. Notice the amount of prey consumed by the predator at each time step depends on not
only on the numbers of predators present, but also the numbers of prey present. When the prey density is higher, it
Discrete Difference Equations 25
is easier for the predators to find prey and thus more prey will be consumed. Likewise, the growth rate the predators
depends not only on the number of predators present (more predators should produce more offspring overall), but
also on the number of prey present (a large food source often correlates with higher reproductive rates and/or larger
litter or clutch sizes).
Any equilibrium point of the system must satisfy both equation (x∗ = rx∗ − αx∗ y ∗ and y ∗ = sy ∗ + βx∗ y ∗ ) simulta-
neously. The algebra above shows us we should consider four different cases: (1) x∗ = 0 and y ∗ = 0, (2) x∗ = 0 and
y ∗ 6= 0 (i.e. 1 − s − βx∗ = 0), (3) x∗ 6= 0 (i.e. 1 − r + αy ∗ = 0) and y ∗ = 0, and (4) x∗ 6= 0 and y ∗ 6= 0.
Case 1: If x∗ = 0 and y ∗ = 0, then both equations are satisfied simultaneously, and (0, 0) is an equilibrium.
Case 2: Suppose x∗ = 0 and y ∗ 6= 0. If y ∗ 6= 0, then x∗ = (1 − s)/β which cannot be true since x∗ = 0. Thus, we
cannot have an equilibrium where x∗ = 0 and y ∗ 6= 0.
Case 3: Suppose x∗ 6= 0 and y ∗ = 0. If x∗ 6= 0, then y ∗ = (r − 1)/α which cannot be true since y ∗ = 0. Thus, we
cannot have an equilibrium where x∗ 6= 0 and y ∗ = 0.
Case 4: If x∗ 6= 0 and y ∗ 6= 0, then x∗ = (1 − s)/β and y ∗ = (r − 1)/α, and both equations are satisfied
simultaneously. Thus, ((1 − s)/β, (r − 1)/α) is an equilibrium.
Discrete Difference Equations 26
Therefore, the two equilibria of the discrete Lotka-Volterra model are (0, 0), which we will refer to at the trivial
equilibrium, and ((1 − s)/β, (r − 1)/α).
Example 10.
Consider a discrete-time predator-prey model in which the prey grows logistically in the absence of the predator, that is
where r > 1, 0 < s < 1, α > 0, and β > 0 are defined the same as in the discrete Lotka-Volterra.
(a) Assume there is only one population, i.e. yt = 0 for all t. Simplify the equation for the x population. What is the equilibrium of
this population? Use cobweb diagramming to determine the stability of each equilibria (Hint: Each equilibria will be stable under
certain conditions). Based on your examination of the model so far what does the remaining parameter in the equation represent?
(b) What happens to the x population if the parameter value c1 is increased? What happens to the y population if the parameter value
c2 is increased? If b1 = b2 and c1 > c2 what do you think will happen to the two competing species?
(c) Find all equilibria of the competition model.
(d) Create a plot of yt vs. xt and additionally plot each of the equilibria points on this graph. For each of the following parameter sets,
indicate which equilibria are in the first quadrant (i.e., biologically reasonable), and which equilibria is approached as t → ∞. You
should examine several different initial conditions for each parameter set (always choose x0 > 0 and y0 > 0).
(i) b1 = b2 = 2, c1 = 0.4, and c2 = 0.6
(ii) b1 = b2 = 2, c1 = 1.4, and c2 = 0.6
(iii) b1 = b2 = 2, c1 = 0.4, and c2 = 1.6
(iv) b1 = b2 = 2, c1 = 1.4, and c2 = 1.6
Discrete Difference Equations 27
In dioecious organisms (those species with two sexes for which one
member of each sex is required for reproduction), sex is determined
by a difference in the chromosomes of each sex. The form that
these difference take vary among different species. In many animals
(humans included), sex is determined by a single chromosome which
we will refer to as the sex-chromosome, the remaining chromosomes
are referred to as autosomes.
copy of each autosome plus one X chromosome, and the other which only contains one copy of each autosome. When
a sperm containing an X chromosome fertilizes an egg, the egg develops into a female grasshopper. When a sperm
which does not contain an X chromosome fertilizes an egg, a male grasshopper develops. We will refer to the female
grasshoppers as being XX, and the male grasshoppers as being XO.
In mammals (including humans), females have two X chromosomes and males have one X chromosome and one Y
chromosome. Both males and females have two copies of each autosome. Like the grasshoppers, mammal males
produce two types of sperm, but in this case one which contains one copy of each autosome plus one X chromosome,
and the other which contains one copy of each autosome plus one Y chromosome. Here, we refer to female mammals
as being XX, and male mammals as being XY.
In birds, butterflies, and moths, the females are XY and the males are XX. To avoid confusion, we use a different
set of letters for these species. We will refer to females as ZW, and males as ZZ.
For the following discussion we will assume that females are XX and males are XO or XY. When the locus of a gene
occurs on one of the sex chromosomes, we must modify the Hardy-Weinberg model. Recall that Assumption #4 of
the Hardy-Weinberg model was, “Allele frequencies are the same in both sexes.” If the locus of a gene occurs on
the X chromosome, then the females will have two alleles for that gene and the males will only have one allele for
that gene. This will necessarily alter the allele frequencies between the sexes and we must account for this in the
model.
Let Ptf , Htf , and Qft represent the diploid genotype frequencies AA, Aa, and aa in the females of the population in
generation t, and Ptm and Qm
t represent the haploid genotype frequencies A and a in the males of the population in
generation t. Then, the allele frequencies in each of the sexes in generation t are
1
(♀, A) pft = Ptf + Htf and (♂, A) pm m
t = Pt ,
2
f f 1 f
(♀, a) qt = Qt + Ht and (♂, a) qtm = Qm
t .
2
If we assume that there are an equal number of males and females in each generation, then two-thirds of the alleles
are in the females and one-third of the alleles are in the males, and thus the frequencies of alleles A and a in the
population in generation t are
2 f 1 m 2 f 1 m
pt = p + pt and qt = q + qt ,
3 t 3 3 t 3
Discrete Difference Equations 28
respectively. To determine the allele frequencies in the next generation (t + 1), we start by examining the genotype
frequencies in generation t + 1. Table 9 shows the genotype frequencies in males and females of generation t + 1 sort
by mating type. For example, given a AA(♀) × a(♂) mating (row 2 in Table 9), the only possible genotype of the
offspring is Aa in the females and A in the males (remember, the males get their X chromosome from their mother
and their Y chromosome from their father). Since the genotype frequency of AA in females in generation t is Ptf ,
f m
and the genotype frequency of a in males in generation t is Qm
t , the probability of a AA(♀) × a(♂) mating is Pt Qt .
The frequency of female (and male) offspring from this type of mating is the probability of this mating type, i.e.
Ptf Qm
t . Since there is only one female genotype resulting from this type of mating (Aa), the frequency of the Aa
genotype in female offspring resulting from a AA(♀) × a(♂) mating is Ptf Qm
t . Likewise, since there is only one male
genotype resulting from this type of mating (A), the frequency of the A genotype in male offspring resulting from a
AA(♀) × a(♂) mating is also Ptf Qm
t .
In contrast to this, consider a Aa(♀) × A(♂) mating (row 3 in Table 9). Since the genotype frequency of Aa in
females in generation t is Htf , and the genotype frequency of A in males in generation t is Ptm , the probability of a
Aa(♀) × A(♂) mating is Htf Ptm . From this type of mating (on average) half of the female offspring will have genotype
AA and half will have genotype Aa. Thus, the frequencies of the AA and Aa genotypes in female offspring are both
1 f m
2 H t Pt . Likewise, from this type of mating (on average) half of the male offspring will have genotype A and half
will have genotype a. Thus, the frequencies of each genotype in male offspring is 12 Htf Ptm .
Table 9: Genotype frequencies of offspring for a gene on an X chromosome assuming random mating with ♀ XX and ♂ XY or XO.
♀ ♂ Frequency AA Aa aa A a
AA × a Ptf Qm
t — Ptf Qm
t — Ptf Qm
t —
Aa × A Htf Ptm 1
Hf P m
2 t t
1
Hf P m
2 t t
— 1
Hf P m
2 t t
1
Hf P m
2 t t
Aa × a Htf Qm
t — 1
H f Qm
2 t t
1
H f Qm
2 t t
1
H f Qm
2 t t
1
H f Qm
2 t t
aa × a Qft Qm
t — — Qft Qm
t — Qft Qm
t
Total 1 pft pm
t pft qtm + qtf pm
t qtf qtm pft qtf
In the final row of Table 9 we calculate the frequency of each genotype in each of the sexes. For example, the
genotype frequency of AA in female offspring (generation t + 1) is found by summing all the frequencies in the AA(♀)
column.
f 1 1 f
Pt+1 = Ptf Ptm + Htf Ptm = Pt + Ht Ptm = pft pm
f
t .
2 2
Similarly, the genotype frequency of A in male offspring is
1 f m 1 f m 1
m
Pt+1 = Ptf Ptm + Ptf Qm
t + Ht Pt + Ht Qt = Ptf + Htf (Ptm + Qm f
t ) = pt ,
2 2 2
f f
since Ptm + Qm m m
t = 1. Using the fact that pt + qt = 1 and pt + qt = 1, we can now calculate the allele frequencies
Discrete Difference Equations 29
f 1 f 1 1 1 f
qt+1 = Qft+1 + Ht+1 = qtf qtm + 1 − qtf qtm + qtf (1 − qtm ) = qt + qtm ,
2 2 2 2
m f
qt+1 = Qm
t = q t .
As with the predator-prey system we now have a system of difference equations. The value of q f in the next generation
depends on the values of q f and q m in the current generation, and the value of q m in the next generation depends
on the value of q f in the current generation.
Example 11.
Show that the system of difference equations
f 1 f
qt+1 = qt + qtm and m
qt+1 = qtf
2
has infinitely many points of equilibrium. Then, test many different initial conditions, (q0f , q0m ), and devise a rule of thumb for the value
of the equilibrium point given the initial conditions. The solutions to this example will be worked in class.
Example 12.
An interesting example of a gene residing on the X chromosome is the orange coat-color locus in cats. This gene has two alleles, O
which codes for an orange coat, and b which codes for a black coat. Females with the heterozygous genotype Ob have tortoise-shell or
calico coats. In a population of feral London cats, the sex and genotype of cats sent to a clinic for “destruction” were noted by a clinic
worker. PSA: If hearing about cats being sent to clinics for destruction (and yes, that means what you think it means) makes you sad,
you should advocate for spaying and neutering cats. Most medium to large cities in the United States have free/low cost spaying and
neutering programs to help reduce feral cat population sizes.
(a) Determine the genotype frequencies of each genotype for both male and females.
(b) Let qtf represent the frequency of the O allele in the females in generation t, and qtm the frequency of the allele O in the males in
generation t. If the data presented above were collected over a short period of time, we can assume it represents one generation (we
will call it generation 0). Calculate the values of q0f and q0m .
(c) Given the values of q0f and q0m , and the system of difference equations
f 1 f
qt+1 = q + qtm and m
qt+1 = qtf
2 t
estimate the equilibrium point.
(d) At the equilibrium point, what proportion of females have tortoise-shell coats.
where w11 , w12 , w22 , w1 , and w2 are the relative fitnesses of genotypes AA(♀), Aa(♀), aa(♀), A(♂), and A(♂), respectively. Let us
consider a case where selection occurs against the homozygous recessive genotype in females and against the recessive genotype in males,
i.e.,
w11 = 1, w12 = 1, w22 = 1 − sf , w1 = 1, and w2 = 1 − sm .
(a) Given these relative fitnesses, simplify the natural selection model for an X-linked gene.
(b) Now, suppose the recessive allele is lethal, in that individuals with genotypes aa and a
do not survive to reproduce. Then sf = sm = 1. Further simplify the model assuming a
lethal recessive allele.
(c) Find thr equilibria of the system assuming a lethal recessive allele.
(d) An example of X-linked selection occurs in Drosophila melanogaster (commonly known
as fruit flies) on the white gene. The white gene controls eye color. The dominant allele A Figure 15: Drosophila melanogaster
codes for red eyes, but the recessive allele a codes for white eyes. The diploid genotypes with red eyes (left) and white eye (right).
AA and Aa yield red-eyed females, while the aa genotype yields white-eyed females. The
haploid genotype A yields red-eyed males, while the genotype a yields white-eyed males. The selection acts primarily against the
white-eyed males by reducing their mating ability (sm = 0.9), however there is also some reduction in the fertility of white-eyed
females (sf = 0.5). Use Matlab to generate a plot of qtf , qtm , and qt for t = 0, 1, . . . , 7 given (i) q0f = q0m = 0.833, and (ii) q0f = 1,
q0m = 0.5.
Discrete Difference Equations 31
References
[1] E.N. Bodine, S. Lenhart, and L.J. Gross. Mathematics for the Life Sciences. Princeton University Press, 2014.
[2] J.K. Conner and D.L. Hartl. A Primer of Ecological Genetics. Sinauer Associates, Inc., 2004.
[3] J.M. Cushing, S. Levarge, N. Chitnis, and S.M. Henson. Some discrete competition models and the competitive
exclusion principle. Journal of Difference Equations and Applications, 10(13–15): 1139–1151, 2004.
[4] F.R. Giordano, W.P. Fox, S.B. Horton, and M.D. Weir. A First Course in Mathematical Modeling, 4th Edition.
Brooks/Cole, 2009.
[5] P.W. Hedrick. Simulatoin of X-linked selection in Drosophila. Genetics, 83: 551-571, 1976.
[6] P.W. Hedrick. Genetics of Populations, 4th Edition. Jones & Bartlett, 2011.
[8] http://www.fwrc.msstate.edu/predator/bobSurvive.asp
Discrete Difference Equations 32
(d) xn → ∞ as n → ∞ which is not biologically reasonable. it is reasonable to assume that though growth may be
exponential for a time, at some point the hare population will begin to exceed the capacity of its environment
in terms of space or other resources (e.g., food), and thus limit reproduction or increase the death rate within
the population.
(b) xn = (0.9)n (50); decreasing sequence; x20 ≈ 6 elk; If the average lifespan of a wild elk is 15 years, there certainly
must be elk that live longer than 15 years. However, to truly test if the assumption in this case makes sense, we
would need to know the ages of the 50 elk that were originally introduced into the wildlife management area,
and an upper bound on the age of an elk in the wild. If most of the elk were already between over the age of 5,
then this model would predict that 6 of the original elk lived to be at least 25 years old. This is 10 years older
than the average and may or may not be reasonable.
(a) a = 0.69
Exercise 4 – Buffalo
(b) No
Exercise 5 – Trout
(a) n = 5
Exercise 6 – Cipro
(b) ≈ 746.3 mg
(c) Yes, the dosing scheme will need to be change. It can be changed by either lowering the dose size or increasing
the time between doses. If lowing the dose size, b would need to be lowered to below 469 mg. A safe dose would
be 450 mg. If increasing the time between doses, τ would need to be increased above 1.14 days (≈ 1 day, 3
hours, 22 minutes). This might be a difficult regime for the patient to follow. It may be reasonable to increase
the time between doses to 2 days, but it is unknown if this would allow the amount of Cipro in the bloodstream
to fall below its level of effectiveness.
(a) m = 0.0375
(i) qt = (1 − m)t q0 with q0 = 1; in 2040 (t = 10) 68% red wolf ancestry remains
(ii) qt = (1 − m)t q0 with q0 = (1 − 0.075)2 ; in 2040 (t = 8) 63% red wolf ancestry remains
(iii) qt = (1 − m)t q0 with q0 = (1 − 0.075)4 ; in 2040 (t = 6) 58% red wolf ancestry remains
(b) m = 0.0075
(i) qt = (1 − m)t q0 with q0 = 1; in 2040 (t = 10) 93% red wolf ancestry remains
(ii) qt = (1 − m)t q0 with q0 = (1 − 0.075)2 ; in 2040 (t = 8) 81% red wolf ancestry remains
(iii) qt = (1 − m)t q0 with q0 = (1 − 0.075)4 ; in 2040 (t = 6) 70% red wolf ancestry remains
(a) Let ut = 1/xt , then xt = 1/ut . Substitute xt = 1/ut into Equation (20) and simplify to obtain ut+1 = 1r ut + r−1
K .
1 1 t 1 K
(b) ut = u0 − K r +K ⇒ xt = t
1+(u0 K−1)( r1 )
(c) u∗ = 1
K (transformed Beverton-Holt), thus x∗ = K (original Beverton-Holt). This means that the equilibrium
of the Beverton-Holt model is the carrying capacity of the model.
(f)
(d) Expect to see q ∗ = 55/101 in the population since it is the stable equilibrium. Genotype frequencies at equilibrium:
P ∗ = 0.12454 (DD), Q∗ = 0.66181 (Dd), R∗ = 0.21365 (dd)
(d) s2 = 7s1 /13, the advantage is greater for the homozygous dominant genotype
b1 xt
(a) xt+1 = 1+xt ; the equilibria are x∗ = 0, b1 − 1;
x∗ = 0 is stable when 0 < b1 < 1, x∗ = b1 − 1 is stable when b1 > 1, and when b = 1 the only equilibrium is
x∗ = 0 which is stable in this case; The parameter b1 acts like a growth rate, and when b1 > 1 the populations
is growing and when 0 < b1 ≤ 1 the population is declining to extinction.
(b) As the value of c1 is increased the value of xt+1 decreases. The parameter c1 is a measure of the severity with
which x’s competition with y reduces the growth of x. Likewise, as the value of c2 is increased, the value of yt+1
decreases, and the parameter c2 is a measure of the severity with which y’s competition with x reduces the growth
of y. If the growth rates of the two populations are the same (b1 = b2 ), then the fate of the populations comes
down to who has a competitive advantage, which is population y when c1 > c2 . We would expect population y
to grow more quickly than population x, and possibly even drive population x to extinction.
b1 − b2 c1 + c1 − 1 b2 − b1 c2 + c2 − 1
(c) (x∗ , y ∗ ) = (0, 0) , (0, b2 − 1), (b1 − 1, 0), , ; Notice that if b1 < 1, then the
1 − c1 c2 1 − c1 c2
equilibrium (b1 − 1, 0) is not biologically reasonable as it implies x∗ < 0. Likewise, if b2 < 1, then the equilibrium
(0, b2 − 1) is not biologically reasonable. Notice that under certain parameter sets, the fourth equilibrium does
not reside in the first quadrant and is thus biologically unreasonable. If you want an extra exercise, show that
when b1 = b2 , the fourth equilibrium will be in the first quadrant when c1 < 1 < 1/c2 .
b1 − b2 c1 + c1 − 1 b2 − b1 c2 + c2 − 1
(d) EQ1 = (0, 0), EQ2 = (0, b2 − 1), EQ3 = (b1 − 1, 0), and EQ4 = ,
1 − c1 c2 1 − c1 c2
(i) All four equilibria are in the first quadrant. All initial conditions lead to EQ4.
(ii) EQ4 is no longer in the first quadrant. All initial conditions lead to EQ2.
(iii) EQ4 is no longer in the first quadrant. All initial conditions lead to EQ3.
(iv) All four equilibria are in the first quadrant. The equilibria that is approaches as t → ∞ is determined by
b2 −b1 c2 +c2 −1
the relative position of (x0 , y0 ) with respect to the line y0 = b1 −b2 c1 +c1 −1 x0 = 1.5x0 . When y0 < 1.5x0 ,
then solutions approach EQ3; When y0 = 1.5x0 , then solutions approach EQ4; When y0 > 1.5x0 , then
solutions approach EQ2.