Analytical and Computational Methods of Advanced Engineering Mathematics - Recomended
Analytical and Computational Methods of Advanced Engineering Mathematics - Recomended
Editors
].E. Mar5den
L. Sirovich
M. Golubit5ky
WJăger
Advisor
G. 10055
" Springer
Grant B. Gustafson
Calvin H. Wilcox
Department of Mathematics
University of Utah
Salt Lake City; UT 84112
USA
Series Editors
].E. Marsden L Sirovich
Control and Dynamical Sytems, 107-81 Division of Applied Mathematics
California Institute of Technology Brown University
Pasadena, CA 91125 Providence, RI 02912
USA USA
M. Golubitsky W]ăger
Department of Mathematics Department of Applied Mathematics
University of Houston Universităt Heidelberg
Houston, IX 77204-3476 69120 Heidelberg
USA Germany
Cover photo © Lightscape.
All rights reserved. This work may not be translated or copied in whole or in part without the written permission of
the publisher (Springer-Science+ Business Media, LLC, except for brief excerpts in connection with reviews or
scholarly anaIysis. Use in connection with any form of information storage and retrieval, electronic adaptation,
computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden.
The use of general descriptive names, trade names, trademarks, etc., in this publication, even if the former are not
especially identified, is not to be taken as a sign that such names, as understood by the Trade Marks and
Merchandise Marks Act, may accordingly be used freely by anyone.
98 76 S4 3 2 1
SPIN 10557758
Preface
and complexity of problems that students, and their teachers, can solve. The capability of
checking calculations and answers with electronic assistance is appreciated by instructors
and students alike.
Textbook vs. Handbook. Traditionally, engineers have liked to get double duty from
their AEM texts, using them first as textbooks during their student years, and later as
handbooks of mathematical methods during their professional lives. The second use has
led to very large books with many chapters and topics. Unfortunately, these two uses are
in conflict. A textbook should be written in a discursive, conversational style that allows
the student to read linearly and learn the material without constant help. By contrast, a
handbook should be terse and tightly organized to make it a convenient reference.
Time Allotted to AEM in Engineering Curricula. Most engineering curricula are very
full, crowding essentially five years of courses into four years. As a consequence, engineer-
ing faculties allow strictly limited time for mathematics and other service courses. In par-
ticular, this leads to crowded syllabi for the AEM courses and the omission of many topics.
Description of the New AEM Text. The Table of Contents provides a detailed topical
outline. This will be used to describe and comment on the manuscript.
Chapter 1. Numerical Analysis. The placing of this material at the beginning of the
book is new. Traditionally, a differential equation is considered solved if an implicit
equation for the solution is found. But a modern engineer or applied scientist does not
consider a problem to be solved until he or she can program a computer to generate
the values of the function on demand and to generate tables and graphs of the function.
Chapter 1 provides elementary methods to do this. The topics treated in Chapter 1
are direct applications of first-year calculus. They provide a seamless transition from
elementary calculus to ordinary differential equations.
Chapter 2. Ordinary Differential Equations of the First Order. Most AEM texts begin
with this topic because of its widespread use in applications. Here it is emphasized that a
first-order ordinary differential equation plus an initial condition is one of the best ways
to define and generate a function. The numerical algorithms presented in Chapter 2 are
natural extensions of the numerical integration methods of Chapter 1.
Chapter 3. Ordinary Differential Equations of Higher Order and Systems. Here
the emphasis is on second-order linear ordinary equations, a topic that pervades
mathematical physics (Chapters 4, 8, and 9). The structure theorem for solutions of
such equations provides concrete examples of the abstract vector spaces that will be
encountered in Chapter 5.
Chapter 4. Laplace Transform. This standard topic is still required by electrical
engineers. All AEM texts present it. It provides a flexible method to solve systems of
PREFACE vii
Additional Remarks on Differences Between the New Text and Existing Texts. The
focus of the new book is narrow. It does not try to be a treatise or handbook on engineering
mathematics.
This book has evolved over fifteen years into a text that is independent of any partic-
ular computing environment. This plan emerged in response to changes in computing
equipment and software systems, which tend to be subject to rapid flux, quickly dating
any text that is based upon specifics of a system or language. Instead, we adopted the
viewpoint that the mathematics is the fundamental element to be learned, and the soft-
ware system is certain to change, and soon! Accordingly, no computer-specific elements
appear in the main text or in the exercise statements.
Linear algebra is a strong point of the book. Unlike other texts that follow an approach
used in pure algebra texts, we use a practical approach based upon solving equations. All
ideas are introduced from this basic viewpoint, which allows engineers in their second
year to understand concepts that would otherwise be impossibly abstract.
Partial differential equations are introduced in the context of physical problems,
rather than in an abstract setting (classification into hyperbolic, parabolic and elliptic,
etc.), which is so common in other texts. Emphasis is on the solution of a variety of
models that arise in physics and engineering. Competing books have few applications
that are treated with so much completeness and depth.
Numerical analysis is treated as an extension of the calculus, and it is available
from the first days of the course. The text uses the numerical material as a bridge from
the mathematics to the applications. We understand that engineering involves graphs,
numbers, and experiments. The numerical methods provide a platform for interpretation
and evaluation of abstract results.
The chapters are as independent as possible. To achieve this it is sometimes necessary
to repeat or summarize material that was developed earlier. An example occurs in the
Sturm-Liouville theory of Chapter 8, which uses ideas and methods from the theory of
second-order linear ordinary differential equations. Here the text reviews the necessary
theory from Chapter 3.
Short One-Year Course. The shortest course over major topics is organized as shown
below. A quarter system one-year course does a short course in three quarters, Chapters
1-4, Chapters 5-6, Chapters 7-9. A semester system one-year course might do a short
course in two semesters, Chapters 1-5, Chapters 6-9. The pace in a short course is slow,
allOwing for diverse backgrounds and only a calculus prerequisite.
Chapter 1. Treat linear interpolation only. Skip most of 1.2 and all of 1.3 through 1. 7.
Introduce topics from Chapter 1 when demand dictates. Start Chapter 2 as
quickly as possible.
PREFACE ix
Chapter 2. Skip exact equations 2.4 and numerical methods 2.7. Skip most applications
in 2.5.
Chapter 3. Skip power series solutions 3.4, numerical methods 3.5, and higher-order
linear equations 3.6.
Chapter 4. Skip applications to differential equations 4.4 and 4.5.
Chapter 5. Skip most of the fundamental theorem of linear algebra 5.5, treat just
dimensions less than 5 for determinants 5.6, and skip quadratic forms in 5.7.
Chapter 6. Already this chapter assumes a calculus course over the same topics. The
topics in 6.5 are used in Chapter 7. Skip unrelated topics in 6.1-6.4, gUided
by student background. Generally, three semesters of calculus will cover the
majority of topics in 6.1-6.4. Skip 6.6 for a short course.
Chapter 7. The chapter can be skipped and inserted into future lectures on topics from
Chapter 8 and Chapter 9. Chapter 7 is designed to be reference material for
reading rather than lecturing. The design originated with short courses where
there is not adequate class time for partial differential equations.
Chapter 8. Short courses do 8.1-8.5, 8.11-8.13 only. This leaves out the more difficult
topics and especially the Fourier integral.
Chapter 9. Short courses skip all topics that use Legendre and Bessel functions, multiple
integrals, Laplace and Fourier transforms. Only selected topics from 9.3 are
done in the lecture.
9.1: Problems 9.1.1, 9.1.2, 9.1.3, 9.1.4.
9.2: Problems 9.2.1, 9.2.2, 9.2.3.
9.3: Problems 9.3.1, 9.3.2, 9.3.7, 9.3.8, 9.3.9.
Normal One-Year Course. This course is defined as the short course above plus topics
selected to fit the curriculum. The pace through the text is faster, and much of the
workload is passed on to the student. It is suitable for well-prepared calculus students who
can dedicate time to independent learning. In designing a course, we suggest beginning
with the selection of topics from Chapter 9. For example, selection of a problem that
uses the Fourier transform requires deeper coverage of Chapter 8 and possibly Chapter
6. One critical issue for course design is whether or not to study power series methods
in the first term. This topic generally consumes five or more lectures. Some instructors
prefer to do the topics in the same term as the Bessel and Legendre applications.
Intensive One-Year Course. This course covers all the short course topics for the term
in 70% of the allotted time, e.g., six weeks for the quarter system and nine weeks for the
semester system and nine weeks for the semester system. Satisfactory completion of the
short course topics is scored as a passing grade in the course. To earn an honor grade,
the remaining 30% of the course is devoted to new topics from the text.
X PREFACE
Preface v
Introduction xv
1 Numerical Analysis 1
1.1 The Nature of Numerical Analysis. 1
1.2 Polynomial Interpolation 2
1.3 Numerical Integration and Differentiation . 22
1.4 Solution of Equations 41
1.5 Inverse Functions . . .. 55
1.6 Implicit Functions . 64
1.7 Numerical Summation of Infinite Series. 70
References 709
Index 711
Introduction
This book presents an introduction to the methods by which mathematical analysis and
the computing power of modern digital computers are combined to solve problems of
engineering science.
Most physical phenomena obey mathematical laws that can be formulated as
differential equations. This astonishing fact was first discovered by Isaac Newton in the
years 1665-1666 when he formulated the laws of mechanics as differential equations
and used them to predict the orbits of the planets. During the three centuries that
have followed Newton's discoveries many differential equations that govern physical
phenomena have been found. They include Euler's equations for the dynamics of rigid
bodies, the Navier-Stokes equations for the motion of viscous fluids, the Cauchy-Green
equations for the static and dynamic behavior of elastic bodies, Fourier's equations for the
diffusion of heat in solids, Kirchhoff's equations for electrical circuits, Maxwell's equations
for the electromagnetic field and Schrbdinger's equation for the dynamics of atoms and
molecules, to mention only the best-known examples. An examination of the recent
engineering and scientific literature reveals an almost unlimited number of differential
equations that have been introduced to model physical, chemical and biological systems
and processes. The goal of this book is to provide an introduction to the differential
equations that arise in engineering science and to the analytical and computational
methods that are used to solve them.
Mathematical Analysis of a Physical Problem. The mathematical analysis of a
physical problem involves the following four distinct steps.
xvi INTRODUCTION
At this point some readers will ask why we don't simply "put the problem on the
computer" at the outset, i.e., eliminate steps 1, 2, and 4. This question reflects a lack of
appreciation of the complementary roles of theory and computation in applied science.
In order to compute anything there must be equations to be solved. If the solutions of
these equations describe the states of a physical system then the equations themselves
are a mathematical model of the system. They can come only from considerations of
the kind indicated in steps 1 and 2. Moreover, the results of a computation can be
related to the states of a physical system only by means of some theory or model. Of
course, in some cases step 3 may be emphasized over the other steps. Nevertheless, the
successful application of mathematical analysis and computation to any physical problem
necessarily involves all four steps.
The method outlined above will now be illustrated by means of a simple example
for which all four steps can be carried out easily The example will also show the origin
and method of solution of a simple differential equation.
INTRODUCTION xvii
A Flask Cooling Problem. A flask of water (see Figure 1) is heated to the boiling
point (100 degrees Celsius). It is then removed from the heat and stirred constantly while
it cools in a room with air temperature 20°e. After 10 minutes the water temperature is
60°e. Ihe problem is to predict when the water temperature will be 40°C and when it
will have cooled to room temperature (20°C).
Step 1. Mathematical Formulation. Ihe wording of the problem suggests the
follOwing hypotheses.
When removed from the heat the water is uniformly at 100°e.
Because of stirring, the water temperature at any instant is uniform throughout
the flask.
Ihese hypotheses suggest that the water temperature I, expressed in degrees Celsius,
is a definite function of the time. If
(1) T = u(t), t ~ 0,
Moreover, experience tells us that u(t) is a steadily decreasing function of t that satisfies
20 ~ u(t) ~ 100. Ihe problem is to determine the unknown function u(t).
A physical hypotheSiS is needed to determine the rate at which the flask of water
cools. A good approximation is Newton's law of cooling. In the present context it states
that the rate of decrease of I (i.e., -dI/dt) is proportional to the difference between
the instantaneous flask temperature and the ambient air temperature (i.e., T - 20). Ihis
leads to the differential equation
dT
(3) - = -h(T - 20)
dt '
where h is a positive constant, as yet unknown, that presumably depends on the size and
shape of the flask. The minus sign in (3) is needed because dTldt < 0 and T - 20 > O.
Step 2. Solution of the Mathematical Problem. Equation (3) is understood to
mean that the unknown function u(t) satisfies
duet)
(4) - - = -h{u(t) - 20} for each t ~ O.
dt
Now, the chain rule of calculus implies that
d 1 duCt)
dt log{u(t) - 20} = {u(t) - 20} dt .
d
(5) -log{u(t) - 20} = -h for each t ~ O.
dt
(8)
Equations (2) and (8) give u(lO) = 60 = 20 + 80e- 10h , whence C 10h 112, or
h = (0.1) log 2, and
(9)
Hence, combining equations (7), (8) and (9), the solution of the mathematical problem is
T
100
80
60
40
20 t
0 20 40 60 80 100 FIGURE 2. Temperature versus time
equation (3) is satisfied, because by (0), U'(t) = 0 + 80( -h)e- ht and -h(u(t) - 20) =
( -h)(80e- ht ).
Step 3. Numerical Computation. This step is easy in the present case. The solution
may be written in the form
From equation (0) it is clear that u(t) = 40 when t = 20 minutes. More generally, the
temperature difference u(t) - u(oo) = u(t) - 20 is cut in half every 10 minutes. The
answer to the question "when does the water reach room temperature" is "never,"
because u(t) > 20 for all t ~ O. If temperature differences ofless than 0.1 °C cannot be
measured, then the water is effectively at room temperature when u(t) = 20.1°C, or
or
2t/IO = 800.
in the cooling model. Assume that the stirrer adds energy to the water at the constant rate
calories
A--.
minute
Let the specific heat of the water be
calories
c
(gram) (oC)
M grams.
Then, in the absence of cooling, the stirrer would raise the temperature of the water at
the rate
cal 1 DC A °C
A-x--=-=Q-
min eM cal eM min
When this effect is added to the cooling by Newton's law, one gets the model equation
dT
dt = -h(T - 20) + Q.
It will be convenient to write this as
dT
(12) dt = -h(T - 20 - To),
u(t) = 20 + To + Ke- ht .
Conditions (2) give first K = 80 - To and then
40 - To -lOh
(14) ---=e
80 - To
On combining this with (13) it is found that the value of h is determined implicitly by
the equation
40h - Q -lOh
(15) ----e
80h - Q - .
INTRODUCTION xxi
calories
A=lO---
minutes'
calories
c = 1 ____________---,--
(gram)(oC) ,
M = 1000 gram,
h= ~ 10 (80 - To)
10 g 40 - To '
and hence the first equation of (18) may be written
40 - To)U10
(18) u(t) = 20 + To + (80 - To) (
80 - To
From this form it is clear that if To is small, as in the numerical example, then u(O is
nearly the same as the solution (10) of the first model problem, and the effect of the
stirring can be ignored. It is also clear that the time t at which T = u(t) = 40° will be
greater than for the first model. This t can be found from (18) by putting u(t) = 40 and
solving for t. Numerical methods for doing this are presented in Chapter 1. Note that
the temperature function u(t) defined by (18) is again a strictly decreasing function of t.
However, in the present case,
and
Hence in this more refined model the flask never appears to reach room temperature
unless To is less than the smallest measurable temperature difference.
Each of the analytical and computational methods presented in this book is illustrated
by application to one or more physical problems. In many cases all four steps of the
analysis, as outlined above, will be presented. In other cases the main emphasis will be
on steps 2 and 3, which usually contain most of the mathematical work. In such cases
the construction of the model (step 1) and the interpretation of the results (step 4) may
be discussed briefly, or references to specialized textbooks may be given.
Analytical and Computational Methods of
Advanced Engineering Mathematics
CHAPTER 1
Numerical Analysis
in the 1940s, stimulated vigorous research in numerical analysis. The search for new,
more effective algorithms continues at a high level and may be expected to do so in the
foreseeable future.
The enormous power of modem computers has led many laymen and some scientists
to believe that any problem can be solved by "putting it on the computer." This is certainly
not true. First of all, there are problems for which no satisfactory mathematical model has
been found. It should be clear from the discussion in the Introduction that computers
cannot be applied to such problems. Secondly, it has been found in recent years that
some problems possess chaotic solutions that are essentially unpredictable. This class
probably includes some problems of turbulent fluid flow and weather prediction. Finally,
some applications lead to mathematical problems whose computer solution would take
so long as to be beyond the capabilities of any existing or foreseeable computers.
In recent years many prepackaged computer programs that implement some of
the algorithms presented in this book have become available. For this reason students
of engineering and science sometimes ask why they should take the trouble to learn
the principles underlying such programs. The Simplest reply to this question is that
all algorithms, and all programs, have their limitations. Users must understand these
limitations in order to apply the programs successfully Moreover, engineering science
leads to an ever-increasing number of mathematical models, most of which do not fit
preexisting programs. Hence it is necessary for engineers to understand the principles
that underlie the analysis of engineering problems. In fact, many engineers become
involved in designing large programs to simulate complex physical systems. For the
successful completion of such projects it is essential that they understand the principles
of numerical analysis.
Problem 1. Filling of a Reservoir. The flow of water into a reservoir is measured during
a 24-hour period by means of a flow meter whose output is recorded on magnetic
tape. The tape then contains a complete analog record of a function q = q(t), where
1.2 POLYNOMIAL INTERPOLATION 3
t is the elapsed time in hours from the beginning of the measurements and q(t) is the
instantaneous rate of flow at time t, in cubic feet per hour (abbreviated Cfh).l
Several problems connected with such flow records are solved in this chapter and
the next chapter. The first is to compute the total volume V of water in the reservoir as
a function of t. If Vo is the volume present at t = 0 then V is given by
(2) ]o(x) = -1
7(
l
0
rr
cos(x sin t) dt,
is called the Bessel function of order zero. 2 It occurs in the solution of many problems of
engineering and science, and for this reason it has been studied extensively Two standard
reference works are [McL] and [A-SJ.
The properties]o(O) = 1, lJo(x) I ::: 1 andio( -x) =.]o(x) follow immediately from
equation (2). Table 2 below is a short table of ]0 (x) for 0 ::: x ::: 6.0 and is accurate
to four decimal places. It is shown below how polynomial interpolation can be used to
compute values of]o(x) to table accuracy for any x on the interval 0.0 ::: x ::: 5.9.
The Interpolation Problem. In problems 1 and 2 the values of a function j(x)
are known at a finite set of x-values, say Xo < Xl < X2 < ... < x n. The notation
jo = j(XO),fl = j(xl),h = j(X2) , .. . ,fn = j(xn) will be used for these values. The
numbers XO,Xl,X2, ... ,Xn and jO,fl ,h, ... ,fn are called the nodes and nodal values of
j(x), respectively They are to be stored in computer memory A numerical algorithm is
then to be used to compute j(x) for other values of x.
'The conventional unit is cubic feet per second (abbreviated cfs). Here it will be more convenient to use cfh.
2The function is named for Friedrich Wilhelm Bessel (1784-1846). German astronomer and mathematiCian, who
applied it to the solution of an astronomical problem in a paper published in 1824.
4 NUMERICAL ANALYSIS
k = 0,1, . .. , n. To find an algorithm for computing PI (x), note that we may write
(3)
(4) ao = fk,
a] = (jk+l - aO)!(xk+l - Xk).
Hence PI (x) may be computed for any given number x such that Xo :s x :s Xn by the
following alogrithm.
For the data of Table 1 the graph of the interpolant PI ex) is shown in Figure 1 below.
Uniformly Spaced Nodes. If
PI
232000.0
131000.0
o 12 24
so that all pairs of consecutive nodes have the same separation h, then the last two steps
of the algorithm for Pi (x) are usually replaced by
Note that r is a fraction of the distance from Xk to Xk+i, so that 0 .:::: r .:::: 1.
The Error in Linear Interpolation. The function value Pi (x) will differ from j(x) for
most x-values other than the nodes. The error is given by the error function
(see Figure 2). The error may be expected to be small when Xk+i-Xk is small. Quantitative
estimates of Ei (x) may be obtained from a theorem (see [Ch-K]) that states that iff" (x) , the
second derivative of j(X) , is continuous, then for each number x such that Xk < x < Xk+l
there is a second number ~ such that Xk < ~ < Xk+i and
1 N
(8) Ei (x) = l(x - Xk)(X - Xk+l)j (~).
Equation (8) cannot be used to compute Ei (x) exactly because the value of ~ is
not known. However, useful estimates may be found by maximizing the factors on the
right-hand side of (8). It is a simple maximum problem to show that
(9)
(10)
1 2
(12) IE] (x) I S Sh M2 for Xo S x S x n ·
As a simple application of (12) consider the function J(x) = cosx. In this case rex) =
- cos x and we may take M2 = 1. Thus the error in the linear interpolation of the cosine
function never exceeds h2/S. For example, to have lEI (x) I S 10-4 it is enough to require
that h2/8 S 10- 4 , i.e., h2 S 8 X 10- 4 or h S (8 X 10- 4)1/2 = 0.0283.
Quadratic Interpolation. Higher accuracy can be obtained if J(x) is approximated by
a quadratic polynomial in x on consecutive pairs of intervals, i.e., on
where it is assumed that n = 2m is even. Graphically, this means that on each interval
(X2k,X2k+2) the graph ofJ(x) is approximated by the unique parabola that passes through
the points (X2k,f2k), (X2k+l,f2k+l) and (X2k+2,f2k+2); see Figure 3.
This process produces a piecewise quadratic function P2 (x) whose graph is a parabola
on each interval (X2k, X2k+2) and that satisfies P2 (Xk) = Jk for k = 0, 1, ... ,2m = n. It
will be convenient to write P2 (x) in the form
(13)
___ f
t
12k P2(X) hk+l /2k+2
~ i ~ ~ ;0 X
X2k X X2k+l X2k+2 FIGURE 3. Quadratic interpolation
8 NUMERICAL ANALYSIS
where the constants ao, al and a2 are to be computed. Equation (13) clearly defines
a quadratic polynomial on (X2k,X2k+2). Moreover, the interpolation conditions at the
consecutive nodes X2h, X2k+l, X2k+2 give the equations
(15) ao = hk'
hk+l - ao
al = ,
X2k+l - X2k
hk+2 - ao - al (X2k+2 - X2k)
a2 = .
(X2k+2 - X2k) (X2k+2 - X2k+l)
Hence P2(X) may be computed for any given number such that Xo .::s x .::s X2m by the
following algorithm.
Quadratic Interpolation Algorithm.
(16) If x = Xc, then P2(X) = fa.
If Xo < x .::s X2m, find k such that X2k < x .::s X2k+2
(k is the largest integer such that x - X2k > 0).
Compute ao, al, a2 by equations (15).
Compute P2(X) by (13).
For the flow meter data of Table 1, the quadratic interpolant P2 (x) is illustrated by
Figure 4 below.
P2
232000.0
131000.0
o 24
Uniformly Spaced Nodes. If the node separations are all equal, Xk+l - Xk = h for
k = 0,1, ... , 2m - 1, then P2(X) is often written as
1 2
(17) P2(X) = hk + rt..hk + -r(r - l)t.. ilk for X2k:::: x:::: X2k+2,
2
where
and
X -X2k
(20) r = ---h
for X2k:::: x :::: X2k+2.
The last two steps of the algorithm for P2 (x) may then be replaced by
Note that °: :
r :::: 2 when X2k :::: x :::: X2k+2. Moreover, r = 0 when x = X2k, r = 1
when x = X2k+ 1, and r = 2 when x = X2k+2. It follows from these facts that (17) defines
the same quadratic polynomial as (13).
The Error in Quadratic Interpolation. This error is given by
(22)
It is shown in [Ch-KJ that if the third derivativej"'(x) is continuous, then for each x
such that X2k < x < X2k+2 there is a number ~ such that X2k < ~ < X2k+2 and
(23) E2(X) 1 -
= 6(x X2k) (x - X2k+l) (x - X2h+2)j '" (~).
If the nodes are uniformly spaced with Xk+l - Xk = h, then it is a simple maximum
problem to show that for X2. :::: x :::: X2k+2
(24)
In the example of the function f(x) = cos x we have 1'"(x) = sin x, and hence M3 = 1
is a suitable value. Thus the error in quadratic interpolation of the cosine function never
exceeds h3 /(9J3). In particular, to have IE21 S 10-4 it is sufficient to require that
h3 S 9J3 X 10-4 . On solving the equation
by means of logarithms, we get h = 0.1159. Note that this is more than four times as
large as the maximum permissible h for the same accuracy with linear interpolation.
Error Control for Experimental Data. The error estimate (12) for linear interpolation
implies that for functions f(x) with continuous second derivative 1"(x) , the error can
be made less than any prescribed tolerance by chOOSing the sampling step size h
small enough. The estimate (25) implies that a similar statement holds for quadratic
interpolation whenever 1'" (x) is continuous. For experimental data, such as the flow
meter data of Problem 1, information about the derivatives off(x) is usually not available.
Hence no a priori determination of h can be based on the error estimates (12) and (25).
In such cases a suitable value of h must be found by trial and error. To do this, a first value
h = hI is selected, and PI (x) (or P2(X)) is computed for a representative set of x-values,
such as the midpoints of the intervals (Xk, Xk+I). These values are then compared with the
experimental values. If the errors are too great, then h is reduced, typically to h = ~ hI, and
the computation is repeated. This process is continued until a suitable value of h is found.
If at the first trial the errors are acceptable then a larger value such as h = 2hI may be tried.
Error Control for Mathematical Data. The trial-and-error method may also be applied
to mathematical functions f(x). In fact, this may be the only feasible method in cases
where 1" (x) and 1'" (x), or their maxima M2 and M3 , are difficult to calculate. However, if
good estimates for M2 and M3 are available, then the error estimates (12) and (25) can be
used to determine the optimum step sizes h that will yield a desired degree of accuracy:
This will be illustrated for the Bessel function of Problem 2.
The Bessel function Jo(x) was defined by the integral formula (2) above. The
derivatives of Jo(x) can be calculated by differentiation inside the integral sign. Thus,
one has
]~' (x) = -1
7r
l0
rr
sin (x sin t) sin3 t dt.
The validity of this procedure is demonstrated in advanced calculus texts; see, for
example, [T-MJ. Notice that the integrands of these integrals are never larger than 1 in
magnitude. It follows that lJ~n)(x)1 ::: 1 for all x and for n = 0,1,2, .... Thus we may
take M2 = 1 and M3 = 1 in the error estimates for linear and quadratic interpolation.
In particular, (12) implies that If 1 (X) I ::: h2/8 ::: 10-4, provided that h ::: 0.0238. This
suggests, correctly, that linear interpolation in Table 2 for ]0 (x) does not maintain the
accuracy of the table. However, (25) implies that If2(x) I ::: h3/(9.J)) ::: 10- 4 provided
that h ::: 0.11592.
Higher-Order Polynomial Interpolation. Linear or quadratic interpolation is
adequate for most engineering computations. However, occasionally it is desirable to
have the greater accuracy provided by cubic or higher-order polynomial interpolation.
This will be the case if the nodal values Jk are difficult or expensive to obtain, or if
the computer memory space allocated to them is limited. In such cases, higher-order
interpolation can be used to reduce the number of nodal values needed to obtain an
accurate representation of J(x).
The general polynomial interpolation problem requires us to compute a polynomial
Pl(X) of degree.e that interpolates the nodal values at.e + 1 consecutive nodes Xo < Xl <
X2 < ... < Xl; i.e.,
(28)
where ao, aI, ... , al are to be computed. Note that the number of these coefficients is
.e + 1, the same as the number of conditions (28).
It is important to note that there is exactly one polynomial Pi (X) that interpolates the
data at.e+ 1 distinct points. To see this, assume that qe(x) is a second polynomial of degree
.e that interpolates the data at Xo, Xl, ... , Xi such that qe (Xk) = Jk for k = 0, 1, ... , .e.
Then the difference ri(x) = Pi (X) - qi(X) is a polynomial of degree.e that is zero at the
.e + 1 points xo, Xl, ... , Xi. But a nonzero polynomial of degree.e can have at mostl real
zeros. It follows that re(x) = 0 for all x, so that Pe(X) = qi(X). This shows that there is
at most one interpolating polynomial. The existence of Pi (X) will be shown by deriving
an algOrithm for its construction.
12 NUMERICAL ANALYSIS
The assumed form (29) of Pe(x) and the interpolation conditions (28) imply that the
following equations hold.
(30) fo = ao,
fl = ao + al (Xl - Xo),
f2 = ao + al (X2 - Xo) + a2(X2 - XO)(X2 - Xl),
This system of equations can be solved for ao, aI, ... , ae in succession because the
differences Xh - Xe are not equal to zero when k 'I- e. It is important to note that each
coefficient ah depends only on the nodes Xo, Xl, ... , Xh and nodal values fo, fl, ... , fh.
This property, which is evident from (30), implies that Ph (X) can be computed as soon
as ao, aI, ... , ak have been found. This property and equation (29) imply that
(31)
These properties can be used to formulate an algorithm for computing pe(x). Note that
(30) can be rewritten as
(33) fo = ao,
fl = PO(XI) + alql(xl),
h= PI (X2) + a2q2 (X2),
(34) ao = fo,
aj = (h - po (Xj»)!ql (Xl),
a2 = (h - PI (x2»/q2 (X2),
1.2 POLYNOMIAL INTERPOLATION 13
When ao, al, ... , ac have been computed then pc(x) is computed from (29). The
algorithm is easy to program. First ao is set equal to fo. Then PO(Xl) = fo, q(Xl) and al
are computed. With ao and al determined, Pl(X2), q2(X2), and a2 are computed, etc.
Uniformly Spaced Nodes. If Xk+l - Xk = h = constant for k = 0,1, ... ,.e - 1, then
another algorithm for pc (x) , based on the forward differences t:.fk, t:. 2fk, t:. 3fk, etc., is
often used. The forward differences are defined inductively by
(37) r = (x - xo)Jh.
The correctness of (36) can be verified by noticing that r = 0 when x = Xo, r = 1 when
x = Xl, r = 2 when x = X2, ... , r = .e when x = Xf. It follows that the right -hand side
of equation (36) is a polynomial inx of degree.e that satisfies the interpolation conditions
(28). Hence it coincides with Pe(x) by the uniqueness of the interpolating polynomial.
The algorithm (36) is called the Gregory-Newton forward difference formula 3 It
may be given a compact form by introducing the binomial coefficients
(38)
\
G) = 1,
r) = r(r - l)(r - 2) ... (r - k + 1)
(k k! (k > 0).
(39)
For the programming of (39), the computation may be broken into the following steps.
3The fonnula was first stated by James Gregory 0638-1675) in 1670. The first proof was given by Isaac Newton.
14 NUMERICAL ANALYSIS
The Error in Higher-Order Interpolation. The error is given by the error function
It is shown in [Ch-KJ that for each x such that Xo < x < Xe there is a point ~ such that
Xo < ~ < Xe and
(42)
Quantitative error estimates may be obtained from this equation. In particular, for
uniformly spaced nodes with Xh+1 - Xh = hand
(43)
(44)
This result shows the increased accuracy provided by higher-order interpolation. Note
that for .e = 2 the estimate (44) is not quite as good as the estimate (25).
Spline Interpolation. Linear interpolation produces functions that are continuous
and piecewise linear. Their derivatives are piecewise constant functions that in general
are discontinuous at the nodes. Similarly, quadratic interpolation produces continuous
functions whose derivatives are discontinuous at the alternate nodes where two quadratic
polynomials are matched. Corresponding remarks apply to higher-order polynomial
interpolation.
For some applications it is highly desirable to have an interpolating function whose
first derivative and (possibly) higher-order derivatives are continuous throughout their
domains of definition. The process of constructing such functions is called spline
interpolation. The name is derived from the early draftsmans spline, which is a flexible
elastic rod used to draw a smooth curve through a prescribed set of points.
The simplest spline functions are piecewise quadratic functions q(x) that interpolate
the nodal points (Xh,fh) and whose derivative q'(x) is required to be continuous. An
algorithm for computing such quadratic splines is given below, followed by an example
1.2 POLYNOMIAL INTERPOLATION 15
of their application. The most widely used splines are the cubic splines, which interpolate
the data and have continuous first and second derivatives.
Quadratic Splines. The quadratic spline function q(x) for the nodes Xo < Xl < ... <
I
Xn and nodal values fo,fl, ... ,fn is a function of the form
Note that the numbers Zk are as yet unknown and must be computed. The continuity of
q' (x) for Xo :s x :s Xn will hold if the derivatives q~ (x) match at the nodes. This means that
Each quadratic polynomial qk(X) has three coefficients, as yet unknown, for a total of 3n
coefficients. The interpolation equations (46) provide 2n equations, while the matching
of q~ (Xk+ I) with q~+ I (Xk+ I) at the internal nodes Xl, X2, ... ,Xn-l gives an additional n - 1
equations, for a total of 3n - 1, i.e., one less than the number of unknown coefficients.
It will be shown that Zo can be prescribed arbitrarily and that the 3n coefficients of the
qk(X), as well as the derivative values Zl, Z2, ... , Zn, are then uniquely determined.
To derive an algorithm for q(x) it will be convenient to write qk(X) and q~(x) in the
form
1 2
(49) qk(X) = "2ak(X - Xk) + Zk(X - Xk) +fk'
q~(x) = ak(X - Xk) + Zk,
for k = 0,1, ... , n - 1, where the constants ao, aI, ... , an-l are to be computed. Note
that both the first of equations (46) and the first of equations (48) are satisfied by (49).
16 NUMERICAL ANALYSIS
(50)
or
(51)
1 2
(52) -ak(Xk+l - Xh) + Zh(Xk+l - Xh) + Ih = Ik+l.
2
Solving this for ak and equating the result to the quotient in (51) gives the equation
Finally, writing
If a value for Zo is selected then (55) and the data (Xk,Jk) can be used to compute Zl,
Z2, ... , Zn. Then (51) gives the values of ao, al, ... , an-l, and (49) can be used to evaluate
q(x). The algorithm may be outlined as follows. The input data are xo, Xl, ... , x n, 10,
11, ... ,In, and Zo, as well as the value of x.
(56) Compute mo, ml, ... , mn-l by means of (54).
Compute Zl, Z2, ... , Zn by means of (55).
Compute ao, a], ... , an-l by means of (51).
If X = xo, put q(x) = 10.
If Xo < X ::: X n , compute the integer k
such that Xk < x ::: Xk+l.
Compute q(x) = qk (x) by means of (49).
If desired, compute q'(x) = q~(x) by means of (49).
ammeter
voltmeter
~
./?'
battery R
It is assumed that the internal resistance of the ammeter and voltmeter have been
chosen to be close to zero and infinity, respectively, so that they do not affect the current in
the circuit. By adjusting the variable resistance Rv, the current i in the loop, as measured
by the ammeter, can be made to have any value from i = 0 to a maximum value i = imax
amperes. As i is varied, the voltage v across R, as measured by the voltmeter, also changes
in a definite reproducible way. Thus to each value of i on the interval 0 ::: i ::: i max there
corresponds a unique voltage v; i.e., v = F(i), where F is a function defined on the interval
o ::: i ::: i max . It will be assumed that F(i) has been recorded in analog form, for example
on magnetic tape, so that the experimental values of F(i) can be reproduced on demand.
It will be convenient to work with the function f(i) = F(i)/i rather than F(i). Note
that f(i) is the resistance of the resistor R, in ohms, at the current level i. In particular, if
f(i) = Ro = constant for 0 ::: i ::: i max then the resistor is a conventional linear one, and
vii = Ro is Ohm's law.
If f(i) is not constant then the resistor R is said to be nonlinear. In this case its
performance in any circuit is determined by the function f(i). An example is given in
Section 1.3. To make a computational analysis of such a circuit, the function f(i) must
be represented in the computer, as discussed at the beginning of this section. It will be
assumed that the values of f(i) are sampled at uniformly spaced values of i, say
Many methods of analysis require a representation offCO that has a continuous derivative.
Quadratic spline interpolation provides a suitable method. Application of the quadratic
spline algorithm to the data ofTable 3 gives Table 4. Note that the value Zo = 0 has been
selected. This is appropriate because f(O is nearly constant near i = 0 and hence has
slope near zero there.
A graph of the quadratic spline interpolation is shown in Figure 6. For comparison,
the graphs of the linear and quadratic polynomial interpolations are also shown.
Exercises 1.2
Linear Interpolation. Compute the linear interpolation segment equations and plot the interpolant.
1. 3.
x 0.5 0.9 1.2 x 0.5 1.0 1.5
2. 4.
x 0.5 0.8 1.2 x 0.5 1.0 1.5
Quadratic Interpolation. Compute the quadratic interpolant equation in the form ao + al (x - xo) +
a2 (x - xo)(x - Xl) and plot.
5. 7.
x 0.5 0.9 1.2 x 0.5 1.0 1.5
6. 8.
x 0.5 0.8 1.2 x 0.5 1.0 1.5
9. (Flow Meter) Compute the first three segment equations for the flow meter problem, using linear
interpolation.
10. (Flow Meter) Compute the first three parabolic equations for the flow meter problem, using quadratic
interpolation.
11. (Flow Meter) The first three parabolic equations for the flow meter problem, using natural quadratic
spline interpolation, Zo = 0, are
13. (Cubic Interpolation) Forfourequidistant nodesxk = xo+hh, h = 0, 1,2,3, verify that I(x-xo)(x-
Xl)(X - X2)(X - x3)1 ::: h4 for Xo ::: x::: X3.
14. (Cubic Interpolation) Use the previous exercise and equation (42) with I = 3 to derive the error
estimate
20 NUMERICAL ANALYSIS
15. (Linear Interpolation) Write a computer program to implement the linear interpolation algorithm
with data items XO, Xl, ... , Xn and Yo, Yl, ... , Yn . The program should accept input x in the domain
Xo ::: x ::: Xn and produce output Y = PI (x), the linear interpolant of the data points at x.
16. (Quadratic Interpolation) Write a computer program to implement the quadratic interpolation
algorithm with data items Xo, Xl, ... , X2m and Yo, YI, ... , Y2m. Testthe program on Table 1, producing
a data file of 49 pairs, suitable for use in plotting Figure 4.
17. (Cubic Interpolation) Repeat the previous exercise, using cubic interpolation instead of quadratic
interpolation, replacing 2m by 3m.
18. (Quadratic Spline Interpolation) Write a computer program to implement the quadratic spline
interpolation algorithm. Test the program on the nonlinear resistor data and produce a figure for f(O
onO::: i::: 5.
19. (Quadratic Spline Interpolation) Apply quadratic spline interpolation to the flow meter data of Table
1 in order to produce a plot very different from the ones in Figures 1 and 4.
20. (Bessel Function}o(x» Write a computer program that accepts a table of values oflo(x) and a number
x in 0 ::: x ::: 6.0. The output is a quadratic interpolation value oflo(x), correct to 4 significant figures
in the sense that IE2 (x) I < 10-+ Verify the accuracy theoretically using error estimates. Part of the
problem is to determine the table size.
21. (Arctangent Identities) The principal value of the Arctangent function is given by the integral
tan -1 (x) = (X ~. Prove these identities:
10 1 +u
(a) limHoo tan-I(t) =~. (b) tan-I (-x) = -tan-I(x). (c) tan-I (x) + tan-I (l/x) =~.
22. (Arctangent Table) A table of arctangent values is to be computed such that linear interpolation with
uniform spacing gives lEI (x) I < 10- 4 on the interval 0 ::: x ::: 1. Use estimate (12) and report the
smallest number of table entries that will suffice.
23. (Arctangent Evaluation) Use the results of the two preceding problems in order to develop a computer
program that evaluates tan -I (x), accurate to 4 significant digits, for 0 ::: x ::: 1, in the sense that
lEI (x) I < 10-+
is used to produce an evenly spaced table with n + 1 entries on 0 ::: x ::: 1. Find the least n that produces
h (x) I < 10- 4 with quadratic interpolation.
25. (Cubic Interpolation) Develop the theory of cubic interpolation for uniformly spaced nodes. To be
givenarexo and h and the values Yo, Yl, ... , Y n , wheren = 3m. Inparticuiar,ifyo = Y3r,YI = Y3r+],
Y2 = Y3r+2, Y3 = Y3r+3, then derive these formulas:
ao = YO,
al = (YI - ao)/h,
a2 = (Y2 - ao - 2alh)/(2h2),
a3 = (Y3 - ao - 3alh - 6a2h)/(6h3).
1.2 POLYNOMIAL INTERPOLATION 21
R
7.5
26. (Runge's Example) I,et j(x) = 11(1 + x 2 ) on -5 < x < 5. For any integer n, there is a uniformly
spaced table of x and y values with n + 1 rows that is reproduced by f. Plot pn (x) for this table, n = 8
and n = 12. Runge showed that max IPn(x) - j(x) I --* 00 as n --* 00.
27. (Nonlinear Resistor) Plot the interpolation polynomial of degree 10 for the nonlinear resistor data. Is
the plot reasonable?
(a) Jo(O) = 1.
(b) lJo (x) I ~ 1.
=
(c) Jo( -x) Jo(x)
31. (Quadratic Interpolation) Compute by quadratic interpolationj(1.0) from the data items f(0.7) =
0.566,f(0.9) = 0.612,f(1.5) = 0.857.
32. (Linear and Quadratic Interpolation) Compute v'55.25 and v'54.75 with a 10-digit calculator and
also by linear interpolation and quadratic interpolation, using data values .J53, ../54, y'55, .J56.
Compare your answers in a brief table.
R
7.5
5.0
o 5 FIGURE 7. Linear interpolation of the data of Table 3.
22 NUMERICAL ANALYSIS
R
7.5
5.0
o 5 FIGURE 8. Ouadratic interpolation of the data of Table 3.
33. (Linear Interpolation) Suppose that cos(0.15728) is to be evaluated by linear interpolation, using
adjacent table values of cos(0.15) and cos(0.16). How many digits of the linear interpolation answer
agree with the exact answer?
34. (Comparison of Linear, Quadratic and Quadratic Spline Interpolation) Apply the linear interpo-
lation, quadratic interpolation and quadratic spline interpolation programs to the data of Table 1 to
produce data files suitable for producing the graphs of Figures 6, 7 and 8.
35. (Linear Interpolation) Complete the derivation of the error bound (12) by verifying the inequality (9).
Hint: Introduce the new variable ~ = x - ~ (Xk + Xk+ 1)'
36. (Quadratic Interpolation) Complete the derivation of error bound (25) by verifying inequality (24).
2 t
(1) erf(x) = J1i 10
2
e- t dt.
It occurs frequently in heat transfer problems and in probability theory. If the function to
be integrated or differentiated is known only numerically, either from experimental data
or as a result of computation, then only numerical methods are available to compute
its integral and derivative. Examples involving both kinds of functions are given below.
Numerical integration is treated first.
1.3 NUMERICAL INTEGRATION ANO DIFFERENTIATION 23
The computation is to be based on a set of nodes Xo, Xl, ... , Xn such that a = Xo < Xl <
... < Xn = b and the corresponding nodal values fh = f(Xh), as in the interpolation
problem of Section 1.2. Only the case of uniformly spaced nodes will be treated, so that
Moreover,
(6) l Xh +1
Xk
dx = h, lXk
XH1
rdx = -h
2 '
1
whence
(7)
and
(10)
(11) 11 ~
xk 1
+ El(X)dx I :S-M2h3.
I
12
Finally, (9) and (11) imply that
1 3 1 2
(ll) IE1(h)1 < -nM2h = -(b - a)M 2h ,
-ll II
if M2 :::: max If''(x) I on a :s x :s b, because n = (b - a)lh. The error estimate (12)
gives us control of the error in the trapezoidal rule (8) whenever M2 can be computed.
This will be illustrated by a computation of the function value erf(O.!) defined by (1).
Equation (8) will be applied with a = 0, b = 0.1 and f(x) = (21 ",fii)C X2 , so that
It is not difficult to show that the maximum of If'' (x) I for 0 :s x :s 0.1 occurs when
x = 0, so that we may take
(14)
= 0.11242
(16)
Recall that in quadratic interpolation on the interval a ::: x ::: b one takes n = 2m =
(b - a)lh and
(17)
where r = (x - X2k)lh for X2k ::: x ::: X2k+2. It is easy to verify that
X2k x2k 2
l
+2
dx = lx2k+2 r dx = 3 l +r(r - 1) dx = 2h.
X2k X2h X2k
1 m-l
= '3 h L(f2k + 4hk+l + hk+2).
k=O
(21)
4 Thomas Simpson (1710-1761), self-taught English mathematician. Authored several popular texts. HIS rule
appears in computations of Gregory in 1668 and Newton in 1676.
26 NUMERICAL ANALYSIS
for X2k~ X ~ X2k+2, where M3 2: max If'''(x) I· If this estimate is integrated over
X2k ~ x ~ X2k+2 and then summed over k = 0,1, ... , m -1, as in the derivation of (12),
the estimate
is obtained. However, it is a remarkable fact that a much better error estimate is available.
It is shown in [Ch-KJ that in fact,
(22) 11 X2
X2k
h+2 I
f2(X) dx ~ -M4 h5 ,
1
90
where M4 2: max If(4)(x)l. Combining (20) and (22) gives the estimate
1 5 1 4
(23) IE 2 (h)1 < -mM4h = - ( b - a)M4h ,
- 90 180
where M4 2: max If(4) (x) 1 for a ~ x ~ b, because m = (b - a)/(2h).
It is clear from (21) that both quadratic interpolation and Simpson's rule are exact
when f(x) is a quadratic polynomial. However, (22) implies that Simpson's rule is also
exact when f(x) is a cubic polynomial, since f(4) (x) = 0 in this case. This property is
easy to verify directly from (18).
Other Methods. Numerical integration algorithms are available that are based on
higher-order interpolation or unequally spaced nodes (see [Ch-KJ and [He]). The latter
include the very accurate Gauss quadrature formulas (see [Ch-K]). However, the simple
methods presented above are satisfactory for most engineering computations.
Error Control. The errors produced by the numerical integration of experimental data
must be assessed by the method of trial and error, just as in the interpolation problem.
The theoretical estimates (12) and (23) tell us how these errors should behave when
h is varied. For the integration of mathematical functions, the error estimates provide
rigorous control of the errors whenever realistic values for M2 or M4 can be found. This
is illustrated in the next example.
EXAMPLE 1. Computation of erf(x). The function erf(x) is defined for all x by the
equation (1). It is an odd function, erf( -x) = - erf(x) , and it is a monotone increasing
function of x with derivative
(24)
This means that erf(x) = 1.0000, with four-place accuracy, when x ::: 3. Hence, to this
accuracy only the values of erf(x) with 0 :::: x :::: 3 need to be computed. This will be
done by applying Simpson's rule to evaluate the integral in (1). The error estimate (23)
will be used to determine a suitable step size h.
The integrand in (1) and its first two derivatives are given by (13). Two additional
differentiations give
Hence the error in using Simpson's rule to evaluate erf(x) with 0 :::: x < 3 may be
estimated by (23) with a = 0, b = 3 and M4 given by (28), so that
3
(29) IE 2 (h)1 < -(l3.5406)h4 = (0.2257)h 4.
- 180
Thus four-place accuracy will be achieved if (0.2257)h 4 :::: 5 x lO-5, or h4 :::: 2.2153 x
lO-4. On taking the square root twice, it is found that the condition h :::: 0.1220 will
guarantee four-place accuracy. Table 5 was computed with h = 0.1. Note that
h
;::;;; erf(x2k) + 3(hk + 4hk+l +hk+2),
where fk = f(Xk) = J"e-(Xh)2. The table was computed by means of (30) with Xh = kh =
(O.l)k and k = 0,1, ... ,14.
It is easy to verify that neither linear nor quadratic interpolation in Table 5 will
maintain table accuracy. Hence, if values of erf(x) are needed with x =1= (0.2)k, it is best
to compute them directly using Simpson's rule with a suitable value of h :::: (0.1).
EXAMPLE 2. Water Stored in a Reservoir from Depth Data. The reservoir problem in
Problem 1 provides an opportunity to apply numerical integration to experimental data.
28 NUMERICAL ANALYSIS
The volume in cubic feet of the water in the reservoir at time t is the function defined by
(1), Section 1.2; i.e.,
This function is computed below for the data of Table 1, Section 1.2, and an assumed
value of Vo = 1.13 X 10 5 cubic foot (cD.
A more interesting problem arises if during the filling of the reservoir the depth x
of the water at the deepest point in the reservoir is recorded as a function of t. Assume
that an analog record of this function x = X(t) is available. Clearly, X(t) is a monotone
increasing function and has an inverse t = X-I (x). The function
(33)
(34)
1.3 NUMERICAL INTEGRATION AND DIFFERENTIATION 29
determine VI, V2 , ... , V 24 . The results for the data of Table 1, Section 1.2, are shown in
Table 6.
Application of Simpson's Rule. Equation (31) also implies the relation
(36)
the original flow meter data could be sampled at a shorter interval such as h = 0.5 or
h = 0.25, and the computation of Vk could then be repeated.
Next assume that the depth record x = X(t) for the reservoir, when sampled at the
same one-hour intervals as in Table 1, Section 1.2, produces the data shown in Table
7. The corresponding quantities of water stored, obtained from Table 1, Section 1.2, by
Simpson's rule, are also shown.
Our goal is to find an algorithm for computing values of the function V = F(x) ,
°
giving volume versus depth, for arbitrary values of x on the interval ::s x ::s 133.3 feet.
The most straightforward approach is to recognize that in Table 7,
This suggests using quadratic interpolation in the table to obtain intermediate values.
The algorithm described in Section 1.2 (following equation (11), Section 1.2) will be
applied with n = 2m = 24, and Xk as in Table 7 for k = 0,1, ... ,24. The results are
shown in Table 8, where for brevity, only the values for x = 100, 105, ... , 130 ft. are
shown. For other x-values, the method gives the values of V = F(x) with equal ease.
Numerical Differentiation. The solution of many engineering problems requires
an estimate of the derivative of a function that is known only numerically or from
experimental data. A first estimate may be based on the definition of the derivative:
where h is a small positive number. The number of(a, h) is called the forward difference
approximation to l' (a).
Loss of Accuracy. These numbers look satisfactory. However, let us try some smaller
values of h. The follOwing values were obtained with a calculator that maintains lO-place
accuracy
The reason for these inaccurate results is not difficult to discover. For these values ofh,
fO +h) andf(l) agree to 10 digits and hence the calculator delivers fO +h) -j(l) = 0, to
this accuracy In fact, it is clear that for any continuous function, and any digital computer,
f(a + h) and f(a) will be equal, to machine accuracy; when h is sufficiently small. This
means that the approximation (41) will fail on any computer if h is taken too small.
An Error Estimate. The example shows that the approximation (41) must be used with
care. If h is chosen too large then the approximation is poor. If h is chosen too small then
32 NUMERICAL ANALYSIS
(44)
where h is a small positive number. The number Dj(a, h) is called the central difference
approximation to l' (a). We shall find that for a given value of h, the approximation (47)
1.3 NUMERICAL INTEGRATION AND DIFFERENTIATION 33
is usually more accurate than (41). To see this, note that Taylor's theorem implies that if
1"'(x) is continuous then
(48)
(49)
where a - h < ;_ < a. Subtracting (49) from (48) and rearranging terms gives
(53)
~M3h2 ~ 5 X 10- 7 . Now, forf(x) = X1l2 we have 1"'(;) = ~;-5/2 and the maximum of
rw for I; - 11 ~ h occurs when; = 1 - h. If we restrict h by 0 < h ~ 10- 2 then
M3 = ~(l - 102 )-5/2 = 0.3845 is a suitable value. Thus
(54)
ifh ~ 10- 2 and ~(0.3845)h2 ~ 3 X 10- 7 , or h2 ~ 7.8023 X 10-6 , or h ~ 2.79 X 10-3 <
3 X 10- 3 . This is more than 750 times as large as the h provided by the estimate (44).
Richardson Extrapolation. If both Df(a, h) and Df(a, hJ2) have been computed then a
r
still more accurate estimate of (a) can be obtained. To see how this may be done, note
that by Taylor'S theorem,
j(a - h) = L f(n)(
00
__a) (_h)."
"=0 n!
When these equations are subtracted, alternate terms cancel, and division by 2h gives
It follows that
(59)
I I
f (a) = Df(a, hl2) + -~h
1 4
+ ....
4
This equation implies that an error estimate of the form
is valid, where C depends on f. The h4 term in (60) implies that the error in the
approximation l' Ca) ~ D'fCa, h) decreases very rapidly with h. The process of computing
D'fCa, hl2) from DfCa, h) and DfCa, hl2) is called Richardson extrapolation. The process
can be repeated one or more times to get still greater accuracy
EXAMPLE 4. CExample 3 continued). The accuracies of the forward and central difference
formulas, and of Richardson extrapolation are illustrated in Table 9 for the case of
fCx) =Xl/2 and a = 1.
Notice that for h = 2-13 the forward difference method gives only 4-place accuracy,
while for h = 2- 9 the central difference method gives 6-place accuracy Richardson
extrapolation gives 5-place accuracy with h = 2- 4 = 0.0625 and 6-place accuracy with
h = 2- 5 = 0.03125.
EXAMPLE 5. f(x) = Jo(x), a = l. The Bessel function]oCx) was defined by (2), Section
1.2, and was tabulated in Table 2, Section 1.2 to 4-place accuracy for 0.0 to 6.0 in steps
of 0.1. Application of the central difference formula and Richardson extrapolation to
1.3 NUMERICAL INTEGRATION AND DIFFERENTIATION 35
0.485281 0.500983
0.492422 0.500245 0.499998
0.496154 0.500061 0.500000
0.498062 0.500015
0.499027 0.500004
0.499513 0.500001
0.499756 0500000
0.499878
0.499939
0.499970
0.499982
the functionjo(x) and a = I gives Table 10. Details of the computation are left for the
Exercises.
As a check, note thatjb(x) = -j1(X) is tabulated to IS-place accuracy in [A-5].
These tables, when rounded to four places, givejbO) = -0.440l.
EXAMPLE 6. Surface Area of a Reservoir from Depth Data. The rate of evaporation of
water in a reservoir depends on the area of its exposed surface. Thus in order to compute
estimates of the effect of evaporation, it is essential to know the surface area A(x) as a
function of the water depthx. We shall compute A (x) from the volume function V = F(x)
by numerical differentiation. The influence of A(x) on reservoir dynamics is analyzed in
the next chapter.
The volume V = F(x) of water in the reservoir of Figure 9, when filled to depth x,
f:
is related to A(x) by the equation
0.40 -0.43l4
0.20 -0.4379 -0.4400
0.10 -0.4395 -0.4400
0.05 -0.4399 -0.4401
36 NUMERICAL ANALYSIS
From the data of Table 7 we may take Xo = 8.1 ft and Vo = 0.113 X 106 cf, then
A(x) can be estimated from the (Xk, Vk) data of Table 7 by interpolation and numerical
differentiation. However, it will be more convenient to use the depth record x = X(t) of
the table. We have
where q(r) is the flow meter output tabulated in Table 1, Section 1.2. Differentiation of
(62) with respect to t gives the relation
The values of qk are given in Table 1, Section 1.2. The values of x;' may be obtained
from Table 7 by numerical differentiation. The results are shown in Table 11 with the
corresponding values of Ak.
The values of x~ and x~4 were obtained by the forward difference formula and the
remaining values by the central difference method.
Table 11 gives Ak = A(Xk) at the unequally spaced depths Xk of the table. Values of
A(x) for other x-values may be obtained by quadratic interpolation of the tabular points
(xk,Ak).
The availability of A(x) offers an opportunity to check the consistency of our
computations by recomputing V = F(x) through numerical evaluation of the integral in
(61). The results of such a check are shown in Table 12. The computation was restricted
to 35 S x S 135 to avoid the comparatively inaccurate value Ao = A(8.1) produced
by the forward difference computation of x~. This term affects the interpolated values
1.3 NUMERICAL INTEGRATION AND DIFFERENTIATION 37
TABLE 11. Flow rate, depth, depth rate of change, and surface area vs. t
tk qk x 10- 5 Xk X'
k
Ak x 1O-4(sq ft)
of A(x) for Xo = 8.1 S x S 30.4 = X2. The table interval for x is taken to be 5 ft, so
that Xk = 35.0 + (5.0)k, k = 0,1,2, ... ,20. Column 3 gives the values of Ak = A(Xk)
obtained from Table 11 by quadratic interpolation. Column 4 gives the approximate
values of Vk = F(Xh) obtained from (61) by applying Simpson's rule
VI = Vo + ~(Ao + AI). Note that the agreement of columns 4 and 5 is quite good in
view of the large step size for x of h = 5 ft.
Exercises 1.3
Trapezoidal Rule. Compute f~ j(x)dx using the trapezoidal rule with 10 divisions and report the error
compared to the exact value.
Simpson's Rule. Compute f~ j(x)dx using Simpson's rule with 10 divisions and report the error compared
to the exact value.
1.3 NUMERICAL INTEGRATION AND DIFFERENTIATION 39
13. (Trapezoidal Rule) Apply the trapezoidal rule to compute a table for the error function erf(x) on
x = 0.1 to x = 3 in steps of 0.1. How many subdivisions are needed in the trapezoidal rule to insure
error IEl (h)1 < 10- 5 ,
14. (Simpson's Rule) Write a computer program to implement Simpson's rule. The program accepts as
inputs the values of a, band m = nl2. It generates internally the values h = (b - a)ln and nodes
Xk = a+kh. The function values j(x) may be generated by a function subroutine. Test it on J(;12 sin(x)dx
with 20 divisions.
15. (Error Function) Write a computer program based upon Simpson's rule that will accept any floating
point number x in 0:::: x:::: 1.0 and output erf(x) , accurate to four places.
16. (Bessel Function Jo(x)) Use the integral representation
17. (Reservoir Problem: Depth Versus Volume) Use Table 7 and quadratic interpolation to develop a
computer program that evaluates V as a function of x for 8.1 :::: x :::: 133.3. Apply the program to verify
that V = 2.8 x 10 6 when x = 102 (approximately).
IS. (Reservoir Problem: Consistency of Surface Area and Flow Data) Compare numerically the two
integrals in equation (62) by using Simpson's rule on the integral and quadratic interpolation on the
integrand. Produce a table of values for the two integrals for t = 0 to t = 24 in steps of 2.0 and compare
the relative errors.
40 NUMERICAL ANALYSIS
19. (Error Function) Letf(x) = Jrre- x2 Compute for 0::: x::: 1 the derivative!"(x) and show that
max If''(x) I = If''(O) I = Jrr = 2.2568.
Apply the trapezoidal error estimate to verify that erf(0.2) = 0.22270 with absolute error less than
0.00001.
20. (Error Function) Find the first four derivatives of f(x) = Jrr exp( _x2). Verify that
max If(4) (x) I = If(4\0)I = 2!. = 13.5406.
0::;x::;3 v'rr
21. (Reservoir Surface Area) Verify Table 11 using a computer program that includes internal tables for q
and x. The program should compute x' and A, then print the columns of the table.
22. (Numerical Differentiation) Let f(x) = y'x. Compute the forward, central difference and Richardson
extrapolation approximations at x = 1 for step size h = 2- 9
23. (Numerical Differentiation ofJo(x)) Design a calculator program that reproduces Table 13 below. Use
the values
The answers can be checked using the identity 1b(x) = -11 (x) and Bessel function tables, such as in
[A-Sl, which show JbO) = -0.4401 to four places.
24. (Central Difference Approximation tof") Use Taylor's theorem with fourth-order remainder to show
that iff(4) (x) is continuous, then
040 -04314
0.20 -04379 -0.4400
0.10 -04395 -04400
0.05 -04399 -04401
1.4 SOLUTION OF EQUATIONS 41
Then Taylor's theorem implies that VfCa,h) = fl/Ca) + b2h2 + b4 h4 + .... Show by Richardson's
extrapolation method that
, 4 1
V fCa, h) == - VfCa, h) - - VfCa, 2h)
3 3
(1) j(x) = 0,
where j(x) is a known function that may represent either experimental data or a
mathematical function. Examples of both types are given below. The problem may also
arise in different forms. For example, we may wish to solve the equation g(x) = c where
the constant c and the function g(x) are known. This problem takes the form (1) if we
define j(x) = g(x) - c. Similarly, we may wish to solve the equation g(x) = hex) where
the functions g(x) and hex) are known. This problem also takes the form 0) if we define
f(x) = g(x) - hex).
The solution set of an equation (1) is the set of all numbers x that satisfy the equation.
A second equation g(x) = 0 is said to be equivalent to 0) if it has the same solution
set. For example, the equations {f(X)}2 = 0 and exp {f(x)} - 1 = 0 are both equivalent
to 0). This freedom to change from 0) to an eqUivalent equation is frequently useful in
solving equations.
The solution set of (1) is the set of points where the graph of y = j(x) crosses the x-
axis. Hence, the number of solutions of (1) may be determined by graphing the equation
y = j(x) and determining how many times it crosses the axis. Even a rough graph may
suffice for this purpose.
The ideas mentioned above are illustrated and clarified in this section by means of
two examples. Many additional examples occur in the remainder of the text.
EXAMPLE 1. A Heat Transfer Problem. In the Introduction the cooling of a stirred flask
of water subject to Newtons law of cooling was analyzed. When the heat due to the
stirring was included it was found that the flask temperature was given by equation (18)
42 NUMERICAL ANALYSIS
a
y=2+--
--
x-a
there is a solution in the interval a < x < c, while ifjCc)jCb) < 0 there is a solution
in the interval C < x < b. This process of bisection can be repeated until the desired
accuracy is obtained.
The method of bisection, as outlined above, may be formulated as an algorithm
that is easy to program for a computer. To begin, an interval Cao, bo) is found such that
jCao) jCbo) < O. Then a sequence of intervals Cal, bl ), (a2, b2), ... is computed as follows:
Thus for 5-place accuracy it will suffice to terminate when n satisfies Cbo - ao)l2 n =
_ 2 X 10- 5 ,or
2 - n - l <!
log 105 5
n > --- = = 16.6.
- log 2 0.30103
Table 14 shows the result of the computation, terminating with n = 17. The solution,
correct to 5 decimal places, is x = 0.69495.
Advantages and Disadvantages of the Method. The primary disadvantage of the
method of bisection is the slowness with which it converges. In fact, the error estimate
(6) implies that more than three steps are needed to gain one additional decimal place
of accuracy. Thus in Example 1 fifteen steps were required for 5-place accuracy. An
advantage of the method is its generality. It works for any continuous function whose
values can be computed. Also, each step is simple, requiring only one function evaluation
at x = Cn, since Jean) is available from the preceding step.
44 NUMERICAL ANALYSIS
(7) x = g(x).
Graphically, the fixed points of y = g(x) are the ordinates of the points in which the
graph of y = g(x) intersects the straight line y = x. Thus in Figure 11 the points Xl, X2,
X3 are fixed points of g(x).
y
y=x
y = g(x)
~----~------~----~--?>x
This process is called iteration of the function g(x). If the sequence {xo, Xl, X2, ... }
converges to a limit C and if g(x) is continuous, then
Table 15 shows the results obtained by iteration of the function (9) with various choices
of the initial point Xo.
The table reveals the surprising fact that for any Xo > a the fourth iterate X4 =
0.69495 is accurate to five decimal places. Examination of Figure 12, the graph of
the function g(x), reveals the reason for this behavior. If Xo > 0.69495 then log 2 =
0.6932 < Xl < 0.69495, while if a < Xo < 0.69495 then Xl > 0.69495 and the
remaining steps are as in the first case.
Simple conditions that guarantee the convergence of the method of fixed points can
be given. Note that to be sure that all the iterates (8) are defined, we must choose Xo from
an interval I of the real axis that is carried into itself by g, so that g(x) is in I whenever X
is in I. (The interval I defined by a < X < 00 is suitable in the case of Figure 12.) If this
0.69495
log2
~--------------------~x FIGURE 12. Graph of g(x) =
0.69495 log(2 + x~,,). ct = 2.5 X 10- 3
are sufficient to guarantee that I contains a unique fixed point £ = g(£) and that for every
choice of Xo in I the sequence {XO,XI,X2, ... } defined by (8) converges to £. Condition
(10) is called a Lipschitz condition, and L is called the Lipschitz constant 5
To see that the Lipschitz condition with 0 :s L < 1 ensures that £ is unique, suppose
that £1 and £2 are two fixed points in I, so that £1 = geed and £2 = g(£2). Then by (10)
applied to x = £1 and y = £2 we have
This is possible with 0 :s L < 1 only if 1£1 - £21 = 0; i.e., £1 = £2, and the two points
are the same. The existence of a fixed point, which also follows from (10) and (11), will
not be proved here.
To verify the convergence of xo, Xl, X2, ... to £, for any choice of Xo in I, note that by
(8) and (10), we have
The right-hand side of this inequality tends to zero when n -+ 00 if 0 :s L < 1, and
hence limxn = £.
It is not always easy to verify the Lipschitz condition (10). A simpler condition is
available if g(x) has a derivative g'(x) such that
'Named after Rudolph Lipschitz 0832-1903), German mathematician, who applied the condition to the theory of
differential equations; see Chapter 2.
1.4 SOLUTION OF EQUATIONS 47
i.e., the maximum of Ig'(x)I is less than 1. In this case the mean value theorem,
together with (13) gives (10). Notice that conditions (10) and (11) are sufficient to
ensure convergence but are not necessary The function of Figure 12 does not satisfy
these conditions on the interval a < x < 00, but the iterates xn+l = g(xn ) converge to
1= 0.69495 for every choice of Xo > a.
EXAMPLE 3. Currents in a Nonlinear Circuit. As a second example of the method of
fixed points we shall analyze the currents in the nonlinear circuit of Figure 5, Section 1.2.
The voltage across the nonlinear resistor R will be assumed to be given by VI = F(i) =
if(i), where f(i) is defined by the data of Table 3, Section 1.2, extended by quadratic
spline interpolation as in Table 4, Section 1.2. The second resistor is assumed to operate
in the linear range and to provide an adjustable resistance of Rv ohms, so that the voltage
across it is V2 = Rvi. The battery delivers a constant voltage E. As before, the ammeter and
voltmeter are assumed to have internal resistances near zero and infinity, respectively, and
thus do not affect the current i. Kirchhoff's voltage law then implies that E = VI + V2, or
v
40
. (5,37.4)
v = E - iRv
30
20
10
o
2 3 4 5
FIGURE 13. Graphical solution of
-10 E = 40, Rv = 10 E = iRv + if(i)
48 NUMERICAL ANALYSIS
gives a current i ~ 5 satisfies 40 = 5Rv + 37.4 (see Table 3, Section 1.2), so Rv = (40-
37.4)15 = 0.52 ohms. Thus our problem is to solve (15) for i when E = 40 volts and Rv
has a prescribed value such that 0.52 ~ Rv < 00. Note that a carefully drawn diagram will
give a first numerical approximation to the current i corresponding to a given value of Rv.
Equation (15) with E = 40 is
Hence the solution of (17) satisfies 2.5 < i < 3.0 by the intermediate value theorem.
On this interval, by Table 4, Section 1.2, and (49), Section 1.2, we have
1
f(O = qs(O = "2as(i - 2.5)2 + Zs(i - 2.5) + fs,
which can be written
with f(O defined by (18), and three different starting values io, gives Table 16.
Clearly, the iteration converges rapidly in this case.
Case 2. Rv = 5. In this case, (16) with Rv = 5 is equivalent to
(20) C(O == 40 - 5i - if(i) = O.
1.4 SOLUTION OF EOUATIONS 49
Hence the solution satisfies 3.5 < i < 4.0. On this interval, by Table 4, Section 1.2, we
have
40
(22) g(i) = 5 + f(i) , 3.5:S i :s 4.0,
Hence the solution satisfies 4.5 < i < 5.0. On this interval, by Table 4, Section 1.2, we
have
g'(.)
1 =
-401'(0
,
(l +1(0)2
and by Table 4, Section 1.2, we have
, . -401'(0
g (1) = (10 + 10))2' 2.5 :s i :s 3.0.
The maximum value of If'(Ol for 2.5 :s i :s 3.0 is IZsl = 0.2 (see Table 4, Section 1.2)
and the minimum value of1(0 is easily found to be 1(0.28) = 4.94. Hence (13), Section
1.4 SOLUTION OF EQUATIONS 51
Applying the error estimate (12) with n = 2 to the first column of Table 16, with
f = 2.6763, and using (27) gives
I . -401'(0
g (I) = (5 + j(0)2' 3.5:::: i :::: 4.0.
In this case
40(1.68)
L = max Ig (01 = Ig (3.5)1 = (5 + 5.42 )2 = 0.619.
I I
(28)
Applying the error estimate (12) with n = 9 to the second column of Table 17, with
f = 3.7218, and using (28) gives the estimate
This guarantees convergence to at least 3 decimal places after 9 iterations. The estimate
is slightly pessimistic because i g is actually correct to 4 places.
Advantages and Disadvantages of the Fixed-Point Method. The principal advantage
of the method is that it is easy to program and requires only one function evaluation
per step. Moreover, if an interval I containing the solution can be found for which
the Lipschitz constant L is small compared with 1, then the method converges rapidly
Equation (13) shows that a key parameter is the value of Ig' (f) I. If it is small compared
with 1, then an interval containing f can be found in which L is small. The main
disadvantages of the method are that it may converge slowly and that it may be difficult
to put a given equationj(x) = 0 into the fixed point form x = g(x) in such a way that
g'(f) is small.
Newton's Method. One of the oldest iterative methods for solving j(x) = 0 was
invented by Newton. The geometric idea behind his method is shown in Figure 14.
Beginning with a first approximation Xo, obtained graphically or by another method,
the tangent line to y = j(x) at the point (xo,f(xo» is computed. The point Xl where the
tangent line crosses the x-axis is taken as a second approximation. The process is then
repeated until a sufficiently good approximation is found.
52 NUMERICAL ANALYSIS
y
tangent lines
I I
I (xo,f(xo))
The tangent line to y = j(x) at (xo,J(xo)) has the equationy = j(xo) +1'(xo)(x-xo).
Hence Xl is defined by the equationj(xo)+1'(XO)(XI-XO) = 0, orXI = xo-(f(xo)/1'(xo)).
Repetition leads to the sequence XO,XI,X2, ... defined by
This is just the iterative method for finding a fixed point of g(x) = x - (f(x)I1' (x)). It
is clear that if lim Xn = 1'- exists and l' (£) =j:. 0 then 1'- satisfies j(£) = O. Of course the
procedure will fail if one of the numbers 1'(xo) , 1'(XI), 1'(X2) , ... is zero. Hence, the
method is usually applied within an interval in which1'(x) =j:. 0.
EXAMPLE 4. (Solved by Newton's Method). The method of fixed points was found to
fail when applied to the nonlinear circuit equation (16) with Rv = 1. The equivalent
equation (23) will now be solved by Newton's method. Note that
Newton's method applied to G(i) gives the sequence io, iI, i2, ... defined by
. . GCin)
In+! = In - G'Cin) , n = 0,1,2, ....
Applying this with three different starting values io gives Table 18.
1.4 SOLUTION OF EQUATIONS 53
Error Estimates. Newton's method for solvingJ(x) = 0 was seen to coincide with the
method of fixed points for the function
J(X)
(32) g(x) = x - 1'(x)'
Moreover,
, J(x)j" (x)
(33) g (x) = [1'(x)]2'
and J(I) = O. Thus ifl' (I) t 0 then g' (I) = 0 and there will be intervals I containing I in
whichL = maXI Ig'(x) I < 1. Hence, the error estimate (12) will be applicable withL < 1.
An improved error estimate that explains the rapid convergence of Newton's method
can be obtained by applying Taylor's theorem with second derivative remainder to
g(x). If xo, Xl, X2, ... are defined by Newton's method (29), i.e., x n+! = g(xn ) and if
lim n --+ oo Xn = £ = ge£) then Taylor's theorem permits us to write
1 2
(35) IXn+! - £1 ~ 2:M2lxn - £1 for n = 0,1,2, ... ,
where M2 = maxI Ig"(x) I and I is any interval that contains £ and Xo, Xl. X2, .... Note
that if M2 ~ 2 then
necessity of computing l' (x) at each iteration. If l' (x) is difficult or costly to compute
then it may be preferable to use a slower method.
Graphical Solution. Find the given solution by graphical methods. Use a domain small enough to ensure
that the solution given is accurate to one decimal place.
1. 3x = X2 + 1, x = 2.62. 3. x 3 = 1 - x, x = 0.68.
Bisection Method. Find the solution to 4 digits by the bisection method. Select the initial interval [a, b]
by graphical methods and record the number of iterations required for 4-digit accuracy
5. x 2 - 3x + 1 = 0, x = 0.3819660113. 7. x 3 + x + 1 = 0, x = -0.6823278038.
Fixed-Point Method. Find the solution to 4 digits by the method of fixed points, given the initial
approximation Xo. Choose the interval [a, b] graphically. Record the number ofiterations required for 4-digit
accuracy.
Newton's Method. Find the given solution to 4 digits by Newton's method, given the initial approximation
xo. Choose the interval [a, b] graphically Record the number of iterations required for 4-digit accuracy
Interpolation Methods. Let g(x) be the quadratic spline interpolant made from a data table of 11 elements,
x = 0 to 1 in steps of 0.1, using the functiony = 1 - sinx and gl(O) = -1.
17. Solve the equation x = g(x) graphically to obtain x = 0.51.
18. Find the quadratic polynomial q(x) = ao + al (x - xo) + a2 (x - xo)(x - Xl) that satisfies q(x) = g(x)
near x = 0.51.
19. Solve x = g(x) near x = 0.51 by the bisection method for the 4-place answer x = 0.51097.
20. Solve near x = 0.51 the quadratic equation x = g(x) by Newton's method for the 5-place answer
x = 0.510966.
21. (Heat Transfer Problem) Show that the fixed point equation
x = log (2 + _ a )
x-a
a
is equivalent to ~ = 2 + - - on the interval x> a.
x-a
1.5 INVERSE FUNCTIONS 55
22. (Increasing Functions) A function j(x) is defined to be increasing on an interval a :::: x :::: b if j(XI) <
j(X2) for a :::: Xl < x2 :::: b. Show that iff' (x) > a on a :::: x :::: b, then j(x) is increasing on a :::: x :::: b.
23. (Decreasing Functions) A function j(x) is defined to be decreasing on an interval a :::: x :::: b if
j(XI) > j(X2) for a:::: Xl < X2 :::: b. Show thatj(x) is decreasing on a:::: x :::: b if and only if -j(x) is
increasing on a:::: x :::: b.
24. (Monotone Functions) A function j(x) is said to be monotone on an interval I if it is either increasing
or decreasing on I. Show that if j(x) is monotone on I, then the equation j(x) = c can have at most one
solution x in I.
25. (Flask Cooling) Show thatj(x) = ex - 2 - -a- is increasing for x> a = 0.0025. Explain why this
x-a
shows the uniqueness of the solution x = 0.69495 to the equation j(x) = O.
26. (Graphical Solution) Write a computer program that generates a data file suitable for producing the
graphs of y = eX and y = 2 + a/(x - a) in the region a < x :::: 1,0 :::: y :::: 7, for parameter value
a = 0.0025.
27. (Method of Bisection) Write a computer program to implement the method of bisection and apply it
to duplicate the results of Table 14.
28. (Method of Fixed Points) Write a computer program to implement the method of fixed points. The
computation should continue for N steps or until IX n - xn-ll :::: IXn IE. Program inputs include the
function g(x), the endpoints a and b of the interval where a fixed point is sought, the initial guess Xo
for a root, the value of E, and the maximum number N of steps.
29. (Method of Fixed Points) Make a table [or the first four iterates xi (i = 1,2,3,4), accurate to five
decimal places, in the method of fixed points for the function g(x) = log(2 + a/(x - a)), a = 0.0025,
using the interval [0,10 10 ] and initial guesses Xo = 1, xo = 100, Xo = 109 , Xo = a + 10- 11 . This
duplicates Table 15.
30. (Newton's Method) Write a computer program to implement Newton's method. The program should
fail if anyone of the follOWing conditions holds:
It should succeed if accuracy E is achieved. The inputs are the functionj(x), the derivative j'(x) ,
the interval la, b] in which j is defined, the initial approximation xc, the accuracy E and the maximum
number N of steps.
31. (Existence of Fixed Points) This exercise assumes a strong advanced calculus background. Let a function
g(x) map an interval [a, b] into itself and satisfy a Lipschitz condition with constant l. < 1. Show that
for any starting point xo in [a, b], the sequence of points xc, Xl = g(xo), X2 = g(Xj), ... is a Cauchy
sequence and hence converges in [a, b] to a fixed point of g.
This function is the inverse of F, usually denoted by i = F- I (v). This last relation is of
direct physical interest because it tells us the current i that will flow through R when a
voltage v is placed across the terminals.
The need to construct a computational algorithm for the inverse of a known function
Y = j(x) arises frequently in engineering analysis. The function j(x) may represent
either experimental or mathematical data. Three examples are presented in this section.
Additional examples occur in subsequent sections and chapters.
A continuous function y = j(x) , defined on an interval 1: a :s x :s b, will have a
single-valued inverse x = j-I (Y) if and only if j(x) is either an increasing function of x
or a decreasing function of x. Such functions are called monotone functions (see the
exercises in Section 1.4). The function j will map I onto the interval]: j(a) :s y :s j(b)
if j is increasing and onto the interval]: j(b) :s y :s j(a) if j is decreasing. A sufficient
condition forj(x) to be monotone increasing (resp., decreasing) is thatJ'(x) > 0 (resp.,
rex) < 0) for every x in 1.
Use of Inverse Interpolation. If a table of values (xo,Yo), (XI,YI), ... , (xn,Yn), with
Yk = j(Xk) , is known for a monotone function Y = j(x) then a simple method to construct
x = j-I(y) is to interchangexk = j-l(Yk) andYk = j(Xk) in the table and then interpolate
the data (YO,xo), (YI,XI), ... , (Yn,xn). This procedure is called inverse interpolation.
interchanged. Of course, any of the interpolation methods of Section 1.2 may be applied.
Here the quadratic spline algorithm will be used. Application of the algorithm to the
pairs (Vk, ik), k = 0,1, ... ,10, gives Table 19. A graph of i = F- 1 (v) computed by this
algorithm is given in Figure 15. Note that the value Zo = 0.20 has been selected. This
was motivated by the observation that F(i) = if(i) , where f(i) is the nonlinear resistance
function of Table 3, Section 1.2. It follows that F/(i) = f(i) + i1'(i), and in particular,
F/(O) = f(O) + 01'(0) = 5.00. Hence Zo = (F- 1 ),(0) = [F /(0)]-1 = 0.20.
The Method of Solution of Equations. If a value of y is given then the computation of
the value x = f- 1 (y) is eqUivalent to solving for x the equation y = j(x) or the equivalent
equation f(x) - y = O. This last equation, with y fixed, may be solved for x by one of
the methods of Section 1.4. Two examples will be given.
5.0
2.5
:rr12
o +------~---~y
(3) E - e sin E = M
is called Kepler's equation. 6 Its solutions determine the positions of the planets in their
orbits around the sun, as well as the positions of other satellites such as the planetary
moons, comets, and artificial Earth satellites. Solving Kepler's equation for E as a function
of M, e being fixed, is equivalent to computing the inverse of the function fCE) = E -
e sin E. Algorithms for doing this are given below for values of e that are of astronomical
interest. First, the role of Keplers equation in orbital calculations will be explained.
The orbit of each planet is an ellipse with the sun at one focus.
II The line joining the sun to a planet sweeps out equal areas in equal times.
We shall show how these two principles make it possible to compute the position of
a planet, or other satellite, by solving Kepler's equation.
6 After Johannes Kepler (1571-1630), German mathematician and astronomer who discovered the laws of planetary
motion. Kepler's first two laws were published in 1609. His third law was published in 1618.
1.5 INVERSE FUNCTIONS 59
a
b
c
I----~--------~--------~--~A
o
(6)
is well known to follow from (5). Moreover, from the figure and (6) we have
FR=rcosv, x=OR=acosE,
y = PR = r sin v = b sin E,
y
... Q(x,y')
o R F
so that
(8) r = P
1 + fCOS v'
while eliminating v gives
Finally, eliminating r between (8) and (9) gives the following relations between v and E:
cos E - f cos v + f
(10) cos V = , cosE= - - - -
1- fCOS E 1+ fCOS V
Mathematical Formulation of Kepler's Second law. In Figure 19, the area A of the
sector FPoP of the ellipse is given by the equation
(11)
(12) ACt) = -1
2
I
Vo
V
(t)
r2 dv
1
= -hCt
2
- to),
where h and to are constants and Vo = vCto). In particular, if T is the period of the planet,
i.e., the time needed to traverse the ellipse once, then ACT + to) = nab = !hT, so that
2nab
(13) T = -h- .
y
Po
P /v
········1.
It can be shown that h is just the angular momentum of the planet in its motion around
the sun.
Derivation of Kepler's Equation. The function vet) that describes the planetary motion
is defined implicitly by (12). However, to evaluate the integral it is more convenient
to change the integration variable to the angle E of Figure 18, defined by (10). On
differentiating (10) and using (8) and (9), it is found after a short calculation that with r
defined by (9),
dv b
(14)
dE r
Thus the change of variable gives
(15) 1 V (t)
r2 dv = b
lE(t)
rdE
l
Vo Eo
E(t)
= ab (l - e cos E) dE
Eo
The notation v, E, M used in this example is due to Gauss. 7 Astronomers call v the
true anomaly, E the eccentric anomaly and M the mean anomaly of the planet.
Solvability of Kepler's Equation. For satellites moving in elliptic orbits, the eccentricity
e satisfies 0 :s e < 1. The left-hand side of Kepler's equation (3) is the function
7Karl Friedrich Gauss (1777-1855). famous German mathematician. His book Theory of Motion of the Heavenly
Bodies (1809) gave the first complete account of methods for determining the orbits of celestial bodies.
62 NUMERICAL ANALYSIS
feE) == E - e sin E with derivative l' (E) = 1 - e cos E. Clearly, l' (E) ~ 1 - e > 0 and
hence feE) is a monotone increasing function for 0 ::: E ::: 2n. It maps this interval onto
itself, sincef(O) = 0 andf(2n) = 2n.
Solution by the Method of Fixed Points. Kepler's equation can be rewritten as
(18) E = M +esin E,
which has the fixed-point form E = geE) with geE) = M +esin E. Here g'eE) = ecos E
and Ig'(E)1 ::: e. Hence, the method of fixed-points can be expected to work well if e is
small compared with 1. Now, the known planets have the eccentricities shown in Table
21. Note that they lie in the range 0 ::: e ::: 0.25. It follows that the method of fixed points
will work well for them. In fact, it is the method that was used by Gauss. The method
was used to compute Table 22, which is applicable to the planet Mercury (e = 0.2056).
Solution by Newton's Method. Most of the known comets move in elliptic orbits
that have eccentricities greater than 0.9. For example, Halley's comet has eccentricity
e = 0.967. For such orbits the convergence of the method of fixed points for Kepler's
equation is very slow. For example, ifthe method is applied with e = 0.967 and M = 3.0,
then 50 iterations, beginning with Eo = 3.0000, produces Eso = 3.0583, while the
correct value to four decimal places is E = 3.0696. On the other hand, Newtons method
works well for this value of e. To apply the method, define
En - e sin En - M
(20) En+1 = En - 1- ecos En
, n = 0,1,2, ....
x
6.615
4.410
2.205
x = 0.967 sin(x) +y y
0.000 FIGURE 20. Graph of the solution of
0.0 2.1 4.2 6.3 Kepler's equation for Halley's comet (e = 0.967)
Table 23 shows the solution of Kepler's equation for the case of Halley's comet (e =
0.967). It was computed by means of (20) with Eo = M. The maximum number of
iterations required was 5. Figure 20 shows a graph of the solution.
Inverse Table. A table of x and y values has an inverse table, formed by swapping the x and y columns
and sorting the first column to nondecreasing order. Produce a table from y = f(x) with II rows, x = 0 to
1 in steps of 0.1. Then determine the inverse table.
l.y=1+x 2 4. Y = sin(nx).
2. Y =x3 5. Y =..;x.
1
3. y= - - . 6. Y = cos(.nxl2).
l+x
7. (Inverse Functions) Produce a table fromy = x 2 - x with 11 rows, x = 0 to 2 in steps of 0.2. Does the
inverse table represent a functional relationship x = g(y)?
8. (Computation of sin- 1 (y» Compute an evenly spaced inverse table for the function y = sinx, using
Newton's method, y = -1 to y = 1 in steps of 0.25. Check the inverse table with the inverse function
equation x = sin -1 (y).
9. (Nonlinear Resistor) Given the nonlinear resistor data (Vk, ik) (h = 0, ... , lO)in the firstthree columns
of Table 19, apply quadratic spline interpolation with Zo = 0.2 to verify the columns for mh, Zh, ah.
10. (Computation of cos- 1 (y» Apply Newton's method toy = cos x at initial guessesxo = 0.5, Xo = 1.0,
xo = 1.5, Xo = 2.0 in order to produce a table of approximations to n12. In the process, verify Table 20.
11. (Kepler's Equation for Mercury) Apply the method of fixed points to Kepler's equation for Mercury,
x = y + 0.2056 sinx, for y = 0 to y = 2n in increments of n/4, in order to produce an inverse table
accurate to five digits. The table of nine y and x values duplicates Table 22.
64 NUMERICAL ANALYSIS
12. (Kepler's Equation for Halley's Comet) Apply Newtons Method to Kepler's equation x =
0.967 sin (x) + y for y = 0 to y = 2:n: in increments of :n:/4 to produce an inverse table. The table of
nine y and x values duplicates Table 23 (see also Figure 20).
13. (Kepler's Equation for Pluto) Apply the method of fixed points to x = y + 0.2486 sinx, Keplers
equation for the planet Pluto, to create an evenly spaced inverse table with y = 0 to y = 2:n: in
increments of :n:/4. The result is similar to Table 2.2..
14. (Ellipse Parameters) Verify p = b2 /a in equation (4), using the focal distance given by equation (5)
and Figure 17.
15. (Elliptical Coordinates) Verify equations (8), (9) and (0).
dv b
16. (Kepler's Equation Derivation) Verify -
dE
= -;
r
see equation (4).
40h - Q -lOh Q
(1) -,--------'. = e
SOh - Q ' To = h'
where h was the coefficient in Newton's law of cooling. On eliminating h between these
equations we find that
40 -
- - -
To =e-lOQIT0
(2)
80 - To
Clearly, To is a function of Q that is defined implicitly by equation (2). An algorithm for
computing To = To(Q), is given below.
Equation (2) may be reformulated as the equation F(Q, To) = 0, where
The problem of finding the solution set of an equation F(x,y) = 0 occurs frequently
in engineering analysis. For example, if T = f(x, y) is the temperature at a point in a
body with coordinates (x, y) then the equation f(x, y) = c defines a curve of constant
temperature, or isotherm. Note that this equation has the form F(x,y) = 0 with
F(x,y) = fex,y) - c. Similarly, if p = gex,y) describes a pressure distribution then the
equation g(x,y) = c defines a curve of constant pressure, or isobar. A third example
is provided by the stream function 1jI(x,y) of fluid dynamics. In this case the curves
defined by 1jI(x,y) = c are the stream lines of the fluid. Physically, these are the curves
1.6 IMPLICIT FUNCTIONS 65
that would be followed by a small test particle that drifts with the fluid. Mathematically,
they are curves that at each point have the direction of the fluid velocity Many additional
examples of physical problems that lead to an equation F(x,y) = 0 will be found in
Chapter 2 and subsequent chapters.
The Implicit Function Theorem. In the examples given above it is intuitively clear that
the equation
(4) F(x,y) = 0
should define y as a function y = f(x) that satisfies F(x,f(x)) = 0 for every value of x.
However, this is not always the case. For example, the function F(x,y) = x 2 +y2 + 1 is
never zero, and hence in this case equation (4) has no solutions. Another case for which
(4) cannot be solved for y occurs when F(x,y) is independent of y. These examples
show the need to find properties of the function F(x, y) that guarantee the solvability of
equation (4). The following three properties are known to be sufficient.
(A) F(x,y), of(x,y)lox, and of(x,y)lOy are defined and continuous in a region D
in the (x, y)-plane.
(B) There is a point (xo,yo) interior to D such that F(xo,yo) = O. (A point is said
to be interior to D if it is the center of a disk that is contained in D.)
(C) of(xo,yo)!oy i= O.
If conditions (A), (B) and (C) hold then there is a rectangle al < x < a2, bl < Y < b2
contained in D with the property that for each x in the interval al < x < a2 the equation
(4) has a unique solution y = f(x) such that bl < Y < h The function f(x) satisfies the
equations Yo = f(xo) and F(x,f(x)) = 0 for al < x < a2· Moreover,f(x) is continuous
and has a continuous derivative.
The preceding results are one form of the implicit function theorem of advanced
calculus [T-MJ. The situation described by the theorem is illustrated in Figure 21.
y
y = y(x)
domainD
®~~'k
YO +b
Yo
YO - b
Note that the uniqueness of the solution y = j(x) of F(x, y) = 0 may fail if condition
(C) is not satisfied. A simple example is provided by the function F(x,y) = x 2 - i at
the point (0,0). For this case aF(O, O)!ay = 0, so that (C) fails and F(x,y) = 0 has the
two distinct solutions y = x and y = -x that pass through the point (0,0).
To compute a value Yl = j(Xl) of the implicit function defined by F(x,y) = 0 we
need only substitute x = Xl in the equation and then use one of the numerical methods
of Section 1.4 to solve the equation F(Xl,y) = 0 for y = Yl. This process is illustrated
here by two examples. Many additional examples will be found in subsequent chapters.
EXAMPLE 1. A Heat Transfer Problem. Recall that in the heat transfer problem of the
Introduction, To = To(Q) was the equilibrium temperature of the stirred flask, in degrees
Celsius, when the stirring produced a warming of Q degrees Celsius per minute. To(Q)
is defined implicitly by equation (2). However, it will be more convenient here to solve
the first of equations (1) for h = h(Q) and then obtain To(Q) from the second equation.
Proceeding as in Example 1, Section 1.4, we can show that the first equation of (1)
is equivalent to the equation
This equation may be solved easily by the method of fixed points. The case Q = 0.01
was treated as Example 1, Section 1.4. Application of the same method to (5) gives x =
x(Q), h(Q) = (O.l)x(Q) and To(Q) = Qlh(Q) = lOQlx(Q). This process gives Table 24.
1.6 IMPLICIT FUNCTIONS 67
(6) 1/I(x,y) = V (r - ar 2
) sin (),
where rand () are polar coordinates (so that x = r cos (), y = r sin ()) and a is the radius
of the cylinder.
The flow near the cylinder may be visualized by means of the stream lines 1/I(x, y) =
c. Figure 22 shows the stream lines for the parameter values V = 1, a = 1 and
c = 0,0.2,0.4, ... ,2.0. Note that the dividing streamline 1/I(x,y) = 0 consists of the
portions of the x-axis where Ixl > 1 (so that () = 0 and () = n, r > 1) plus the
circumference ofthe cylinder x 2 + y2 = 1 (so that r = 1,0 ::: () ::: 2n). Streamlines with
positive (respectively, negative) values of c pass above (respectively, below) the cylinder
in Figure 22.
2.0
1.6
1.2
0.8
0.4
0.2
t
values
of c
(8) ( 1-~)Y=C.
x +y
On subtracting e from each side and multiplying by the denominator, we find the
equivalent equation
(9)
(10)
For fixed values of e and x, the equation F(x,y) = 0 is a cubic equation for y that is
difficult to solve algebraically However, it is easy to solve by Newton's method
F(x,Yn)
(11)
Yn+! = Yn - 8F(x,Yn)18y'
Note that
On substituting this into (11) one obtains, after simplification, the iteration
2y~ - ey~ + ex 2
(13)
Yn+l = 3y~ - 2eYn + (x 2 - 1)
4. (Multiplicity) Find an example of an implicit equation F(x,y) = 0 that is satisfied by three explicit
equations of the form y = j(x), all of which pass through the point (l, 1).
6. (Heat Transfer) Use the method of fixed points on the equation x = log(2 + yl(4x - y» for y = 0 to
Y = 0.6 in increments of 0.1 to produce an inverse table of y and x values. Apply the inverse table to the
heat transfer problem with To = 10ylx to verify the entries in Table 24.
7. (Streamlines) Use Newton's iteration method Yn+ 1 = Yn - F(x,Yn)/Fy(x,Yn) on the streamline equation
F(x,y) = y3 - c/ + (xl - l)y - cx l to generate graph data for the streamline at c = 0.2. Use x = I
to x = 0 in steps of -0.25, solving for the unknown y in each case. This verifies one of the curves in
Figure 22 and also portions of Table (25).
8. (Implicit Function Theorem) Explain why the equation yl + 2y + 2 + x 2 = 0 cannot be solved for y
in terms of x.
9. (Fluid Flow) Convert the polar streamline equation (r - r- 1) sine = c into rectangular coordinates to
obtain (l - (x 2 + /)-1)y = c.
10. (Streamlines) The streamline 0- (xl + /)-1)y = c gives rise to a differential equation for the curve
Verify this equation and explain why all streamlines have the same differential equation, regardless of
the value of c.
n. (Implicit Function Theorem) Suppose curve y = y(x) satisfies the equation F(x,y) = 0, a :::: x :::: b,
and assume that F and yare continuously differentiable. Find a differential equation for y in terms of
the partial derivatives JFlJx and JFlfJy.
Implicit Function Theorem. Verify that the equation F(x,y) = 0 satisfies F(O,O) = 0 and compute
FyeO,O)
16. (Implicit Function Theorem) Apply the implicit function theorem to the equation xl + yl = 1 to
solve for y in terms of x at any point ex,y) on the curve. Which points (x,y) give trouble with the
hypotheses of the theorem 7 In what sense must a given solution agree with one of the two solutions
obtained by college algebra (namely, -~ and )1 - x 2 )? In your answer, pay close attention to
the domain of definition of a [unction.
70 NUMERICAL ANALYSIS
Explain, by means of the size of ICy (x,y) I, why the method of fixed points fails.
18. (Implicit Function Theorem) Let F be continuously differentiable. Assume that two explicit solutions
Y1 (x) and Y2 (x) satisfy F(x,y) = 0 and pass through (XO,YO)· Given that Yl and Y2 differ in every
interval about x = Xo, show that Fy(xO,Yo) = O.
19. (Implicit Function Theorem) Read a proof of the implicit function theorem and then explain it in your
own words. If you can construct a better proof, then outline the ideas of the proof and defend your
position.
40h-- -
20. (Heat Transfer Problem) Write - Q= e- lOh
in the fixed point form x = log ( 2 + -Q)
- ,
SOh - Q 4x - Q
where x = lOh.
lOQ
21. (Heat Transfer Problem) Derive the temperature formula To = - from To = QIh and x = lOh.
x
L
00
(1) 5 = ak = al + a2 + a3 + ....
k=l
Many such problems will be encountered in Chapter 3, Chapter 8 and Chapter 9. Let us
introduce the nth partial sum of the series,
n
(2) 5n = L ak = a1 + a2 + a3 + ... + an,
k=l
L
00
so that
Then the convergence of an infinite series in (1) means that to every number E > 0 (no
matter how small) there corresponds a positive integer N = N(E) such that
Note that if (5) holds for a particular integer N(E) then it will also hold for larger integers.
Forthe theory, the exact value of N(E) is unimportant. For practical computations we shall
wish to choose N(E) as small as possible. This point will be illustrated in several examples.
The Geometric Series. This series is defined by (1) with ah = X k- I , where x is a
real parameter:
(7)
Clearly (6) implies that 5(1) does not exist. For x ::f 1 a well-known algebraic identity
implies that
1
(8) --=l+x+x + .. ·+x
l-x
2 n-I
+--,
Xn
l-x
so that
xn
(9) Rn =- -
l-x'
n = 1,2,3, ....
It follows that Rn (x) --+ 0 as n --+ 00 if and only if -1 < x < 1 and hence
We shall find the allowable integers n by solving this inequality. Taking the logarithm
of (11) and recalling that logx is a monotone increasing function gives the equivalent
inequality
or
2
(12) n > -- = 43.67.
0.0458
72 NUMERICAL ANALYSIS
Thus 44 terms are needed to reduce the error to less than 0.1.
00
We have seen with the geometric series that N(x, E) can vary greatly with x.
Uniform Convergence. If we can find an integer N = N(E), independent of x, such that
for all n :::: N(x, E) and all a ::::: x ::::: b then we say that the series (15) converges uniformly
for a ::::: x ::::: b. For numerical summation of a series (15), uniform convergence on an
interval a ::::: x ::::: b is one of the most useful properties a series can have. Uniform
convergence is illustrated by the geometric series (lO). Using the remainder Rn(x) =
xnl(l - x), it can be shown that it converges uniformly on any interval a ::::: x ::::: b such
that -1 < a < b < 1.
It is known that the sum Sex) equals ~ (~ - Ix l) for -n ::::: x ::::: n; see Section 1.4,
Example 2. However, this fact will not be used here. Instead, we shall show how to
evaluate Sex) numerically to within a prescribed error E.
To estimate Rn(x), let ¢(x) = 1/(2x + 1)2 and note that because ¢(x) is monotone
decreasing for x :::: 0 we have
[h+l
(21) ¢(k + 1) S Jh ¢(x) dx;
y
~</J(X)
</J(k) </J(k + 1)
::::
1n-1
00
</lex) dx = 2(2n - 1)
1
for all x. Thus IRn (x) I :::: E if 0.5/(2n - 1) :::: E. For example,
1
IRn(x) I :::: 2(2n _ 1) < 0.01 if n::: 25.5.
Geometric Series Estimates. Estimate the minimum number of terms N(E) for 2-digit accuracy of the
truncated geometric series Sn (x), using geometric series formulas.
1
1. The geometric series whose sum is - - . Use x = -0.9 and E = 0.01.
l+x
2. The series whose sum is InO + x), obtained by termwise integration of the series for 1/0 + x). Use
X = -0.9 and E = 0.01.
1
3. The series whose sum is --2' Use x = 0.5 and E = 0.01.
l+x
4. The series whose sum is tan -1 (x), obtained by termwise integration of the series for 1/0 + x 2 ). Use
x = 0.5 and E = 0.01.
1.7 NUMERICAL SUMMATION OF INFINITE SERIES 75
Trigonometric Series Estimates. Estimate the minimum number of terms N(E) for 2-digit accuracy of
the truncated series Sn (x), using the integral test method of the text. The estimate should be unifonn in x
with E = 0.01.
~(_ )k sin(2h + l)x
5. ~ 1 2 .
k=O (2h + 1)
~ sin(2h + l)x
6. ~ 3 .
k=O (2h + 1)
C HAP T E R 2
Ordinary Differential
Equations of First Order
This chapter takes up a major theme of this text: differential equations and their
applications in science and engineering. Differential equations were discovered by Isaac
Newton some time before 1666. Newton was so convinced of the importance of his
discovery that he revealed it only in an anagram, one of the scientific puzzles that scholars
of his time set for one another. The anagram, when resolved into Latin and then translated
into modern English, states, "It is useful to solve differential equations." The soundness of
Newton's judgment is confirmed by the many textbooks and journals of modern science
and engineering that are filled with examples of differential equations and their solutions.
where q(t) is the rate of flow into the reservoir, as recorded by a flow meter. Equation (1)
is one of the simplest examples of a differential equation. The volume V = Vet) in the
reservoir is to be determined from its rate of change q(t) and its initial value Vo = Veto).
The fundamental theorem of calculus implies that the equation
(2) V = Vo + t q( r) dr
llO
78 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
defines the unique solution of (1) that passes through the point (to, Vo) in the (t, V)-plane.
The heat transfer problem of the introduction led to the differential equation
dT
(3) dt = -h(T - 20).
The solution T = T(t) that passes through a prescribed initial point (to, To) may be
found by the method of the Introduction to be
It is easy to verify directly that this function satisfies (3) and T(to) = To. Again there is
a unique solution through each point (to, To) of the (t, T)-plane.
As a third example, consider the simple L, R electrical circuit of Figure 1. Here Land
R are prescribed positive constants, the circuit inductance and resistance, respectively,
and E(t) is the prescribed output of a generator or other voltage source.
Kirchhoffs circuit laws imply that the electric current i(t) satisfies the differential
equation
For simplicity, only the special case L = 1, R = 1 and E(t) = sin t (alternating current)
will be analyzed here, so that
It is not immediately obvious how to solve this equation. However, if the differentiation
formula
(8)
R
FIGURE 1. L. R electrical circuit with voltage
source E(t)
2.1 THE NATURE OF DIFFERENTIAL EQUATIONS 79
(10) C = Cto (.
10
1 1.)
+ "2 cos to - "2 SIn to .
(ll)
2
1
i(t) = - (sin t - cos t) + c-(t-to) (1
+-
io
2
1)
cos to - - sin to
2
,
as may be verified directly. There is exactly one solution curve i = i(t) through each
point in the (t, i)-plane.
Each of the three examples above involves a differential equation of the form
dx
(12) dt = F(t,x),
,7////,7/,7// //
jjjjjjjj,//jjj
Illlllll 1111
~~~~~~~!~~~~j
1llllll'!lllll
I11111 I11111
///// ///////
IExercises 2.1 I
f ;,'"' ~~'=~""~_~'''''''' ___ '' ~ _,oj
f:
1. (Reservoir Volume) In each of the cases below, given the flow rate q(t), the initial volume Vo and the
time interval [a, bl, verify the given volume V using the formula V = Va + q(t)dt.
[a,b] q(t) Vo V
[0,24] 1 + 2t 601
[0,3] 1 + 2t + t 2 22
[0,1] 2e- t 1.13 3.13 - 21e
[0, rr] sin 2 (t) 1 1 + rr/2
[0,2]
[0, rr/4]
sinh(t)
tan(t) ° cosh(2) - 1
1 + log(2)/2
2. (Reservoir Volume) Let the flow rate q(t) be given as the linear interpolant of the table below. Using the
volume formula V = 1.5 + f5 4 q(t)dt, determine V. The units for q are 2.3 x 105 acre-feet per hour and
t is in hours.
3. (LR Circuit) Solve the equation i'(t) + i(t) = sint with initial values iCO) = ° = 5. Plot both
°
and i(O)
solutions together, using ~ t ~ 6rr and -1 ~ i ~ 5.
4. (LR Circuit) Consider the LR circuit of Figure 1 with parameter values L = 1, R = 1 and voltage source
E(t) = sin t. Specialize the solution i(t) obtained in (11) to the initial condition i(O) = iO to obtain
5. (Flask Cooling) Apply the cooling equation u' (t) = -h(u(t) - 20) and initial data u(O) = 100 to verify
the second and third columns of the table below.
log SO
u(lO) = 60 h = 0.1In(2) t = 1 0 - - = 63.22
log2
10gSO
uclO) = 70 h = 0.lln(S/5) t = 1 0 - - - = 93.23
log(S/5)
10gSO
uclO) = so h = 0.lln(4/3) t = 10--- = 152.32
log(413)
log SO
u(20) = 50 h = (lI20) In(S/3) t = 2 0 - - - = S9.35
log(S/3)
log SO
u(30) = 35 h = (1130) In(16/3) t = 30 = 7S.53
logCl6/3)
8. (Direction Field) Determine a direction field for the equation x' (t) = sine4t)x(t) on -1 ~ t ~ 1,
-1 ~ x ~ 1, using 25 line segments.
9. (Direction Field) Plot the direction field for the equation x' (I) = x(t)(2 - xU)) in order to determine
all possible values oflimHoo x(l).
Direction Fields. In the folloWing exercises, sketch a direction field and plot several approximate solution
curves.
Method of Isoclines. An isocline is a curve of constant inclination. In a direction field, line segments
with the same slope are called isoclines. A direction field can be created by plotting the curves F(t, x) = c
for certain values of c, then adding line segments to these curves. This is called the method of isoclines.
16. (Identifying Isoclines) In Figure 2 above, identify four sets of isoclines. Suggestion: Trace the figure,
and then color the isoclines (red, green, blue, yellow).
17. (Plotting Isoclines) Plot some isoclines for the equation x' (I) = (l - x(t)) (2 - x(t)), using a square
of side 10 and center (0,0). Add line segments to make a direction field plot.
2.2 SEPARABLE EQUATIONS 83
=
18. (Isoclines and Exact Solutions) The equation Xl (I) (l-x(t»(2 -x(l) has constant solutions 1 and
2. The exact solution for x(O) =f. 1 and x(O) =f. 2 is log Ix(t) -II-log Ix(t) - 21 + t = C. Superimpose
the solution curves for x(O) = 1, x(O) = 2 and x(O) = 3 onto the direction field.
We shall assume that the functions F(t) and G(x) are defined in intervals al < t < a2
and b1 < x < b2 , respectively. The rectangle indicated in Figure 3 will be called the
domain of definition of equation (1). The graph of each solution x = ¢(t) of equation
(1) must lie in this rectangle, as illustrated in the figure, since F(t) or G(x) , or both, are
undefined at points (t,x) outside the rectangle.
We shall see that exactly one solution curve of (1) passes through each point of the
rectangle.
If G(b) = 0, where b1 < b < b2 , then the horizontal line x = b = constant is a
db
solution of (1), since dt = F(t)G(b) = 0. The line x = b then divides the rectangle of
Figure 3 into two rectangles, whose integral curves are distinct and which may be studied
separately. For this reason we shall restrict our attention to rectangles in which G(x) #- 0.
Let x = ¢(t) define the solution of (1) that passes through a point (to, xo), as
indicated in Figure 3. Then rjJCt) will be defined in some interval tl < t < t 2 , where
al :s tl < to < t2 :s a2. To calculate the value ¢(t) at any point of this interval, note that
(2) ¢(to) = Xo
and
(3) ¢/(r) = F(r)G(¢(r))
..... ~
x(t)
Xo
•
. .
since it is assumed that G(</J( r)) 1= O. Now, integration of (4) over the interval to :::: r :::: t
(or t :::: r :::: to if t < to) gives
(5) 1t </J1(r)
( ( )) dr =
1t F(r)dr.
to G</Jr to
In the integral on the left, the variable of integration may be changed from r to ~ = </J(r).
The rules for changing the variable of integration in a definite integral give
t-,:-.....,--,-dr
</J1(r) 14>(t) -d~-
(6) 1to G(</J( r))
-
- Xo G(~)'
(7) l -~() =
x
xoG~
1t to
F(r)dr.
This equation defines the solution x = </J(t). In fact, the implicit function theorem of
Section 1.6 implies that (7) can be solved for x = 4>(t). Moreover, it may be verified by
differentiating (7) that the function </J(t) so defined is a solution of (1).
The solution x = </J(t) of (7) may be constructed explicitly by means of the functions
and
(9)
J(x) =
l Xo
X
d~
G(~)' b1 < X < b2 .
Notice thatl'(x) = I/G(x) 1= 0 and hence y = J(x) is a monotonic function and has an
inverse x = r 1 (y). Applyingr 1 to the two sides of (10) gives the explicit solution
(ll)
If the integrals let) andJ(x) are elementary functions then (11) gives </J(t) as an elementary
function. In the more typical case that I and/or J are not elementary functions, they may
2.2 SEPARABLE EQUATIONS 85
r
be computed by numerical integration, and I may be computed by the methods of
Section 1.5. Both possibilities are illustrated in the examples and exercises that follow.
The Equation dxldt = F(t). This simple differential equation, obtained by taking
G(x) = 1 in (1), is integrated in most elementary calculus texts. The solution
(12) x = Xo + t F(r)dr,
1o
t
which follows from the fundamental theorem of calculus, can also be obtained from (7)
by taking G(~) = 1. Two applications of this equation will be presented.
dV
(13) dt = q(t), t::: 0,
where q(t) is the fill rate. Thus
where V = Vo when t = 0. The key feature of this example is that q(t) is known only at
a table of values (see Table 1 in Section l.2). In numerical work q(t) is represented by
an interpolation function.
o
x
x+dx
steel 30 x 106
du F
(15)
ax EA(x) ,
0:::: x:::: e,
which gives
F t ax
(16)
u= E10 A(x)'
(1S) u= ( 36
30n x 10
6) F = (3.S2 x 1O-7)F inches,
(19)
In this case only the functionr 1 needs to be computed. Two applications ofthis type of
equation will be given.
EXAMPLE 3. Heat Transfer. The heat transfer problem of the Introduction led to the
differential equation dT/dt = -h(T - 20). This has the form (1) with F(t) = 1 and
G(T) = -h(T - 20). The initial values were to = 0 and To = 100. Application of (19)
gives T = r I (t), or t = J(T), where by (9),
(21) v= /fiX,
where g is the acceleration of gravity. This principle is known as Torricelli's law. 1 If the
valve channels the water into a pipe of cross-sectional area a, then the rate at which the
water drains is avo This is the rate of decrease of the volume V of water in the reservoir,
so that
dV
(22) at = -avo
dV
Now, dx = A(x) is the surface area of the reservoir when filled to depth x, and hence
(23) dV = dV dx = A(x) dx .
dt dx dt dt
Combining (21), (22) and (23) gives a differential equation for x that may be written as
dx _CX ll2
(24)
A(x) ,
c= aJ2g.
dt
dx
This has the form -:it = G(x) and hence the solution that satisfies x = Xo when t = 0 is
given by (10) with let) = t. Thus,
(25) t = -1
c Jxo
r Am~ = ~ l
~lI2 C x
xo A(~)d~ .
~1/2
For example, if R = 10 ft, r = 0.25 ft, and Xo = 20 ft, then the time required to drain
the reservoir completely (x = 0) is
1
t = _(40)2(20)112 = 1789 sec = 29.8 min.
4
Natural Reservoir. In Example 6 of Section 2.3, the area function A(x) for a natural
reservoir was computed numerically from measured volume data. In this case the draining
time must be computed by numerical integration. As an example, the data of Table 12
in Section 1.3 will be used. A circular valve of radius r = 0.5 ft is assumed, so that
a = nr2 = 0.25n, and t is assumed to be measured in hours, so that
The initial depth, denoted by Xo above, will be taken to be X20 = l35 ft, so that
(28)
_ ~ f135 A(~)~
t(x) - 1/2
C x ~
is the time required to drain [rom a depth of l35 ft down to x ft (d. (25»). The notations
tk = t(Xk) and
(29)
(30)
and
(31)
Note that (30) gives t20 = 0, since X20 = l35. Moreover, the trapezoidal rule (8) of
Section 1.3 applied to (31) with k = 20 gives the approximation
(32)
where h = X20 - XI9 = 5 feet and Fk = F(Xk)· For k .::: 18, equation (31) gives
and application of Simpson's rule (19) in Section 1.3 gives the approximation
(33)
where again h = Xk+1 - Xk = 5 ft. Table 2 shows the results obtained from (32), (33),
where Fk = Akl(C,jXh) and Ak is obtained from the data of Table 12 in Section 1.3.
90 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
Separable Equations. Each of the exercises below is an initial value problem for a separable differential
equation. Solve each problem by the method of the text to obtain the solution in the implicit Jorm](x) = r(t).
Solve analytically for the explicit form x(t) = ]-1 (l(t)) when possible.
Initial Value Problem. Each of the exercises below is an initial value problem for a separable differential
equation of theforrnx' (t) = J(t). Graph the solution on [0, IJ. In some cases a numericalintegration method
is required to produce the graph.
12. x' = sin nt, x(O) = O. 17. x' = 10g(1 + t), x(O) = l.
21. (Reservoir Volume) Using the flow-rate data for q in Table 2, Section 2.1, produce a graph of Von
a :<S t :<S 0.6. Use the trapezoidal rule applied to theformula V = Vo + fat q(v)dv with Vo = 1.13 x 105
cubic feet.
22. (Conical Column) Consider a conical aluminum column with radii rO = 1.5 and q = 2.8 at the ends
and length 1: = 140, all units inches. Find the total compression for a uniform top load of 2000 lb.
23. (Cylindrical Column) Consider a cylindrical steel column of radius r = 1.2 and length 1: = 60, all
units inches. Find the total compression for a uniform top load of 2000 lb.
24. (Flask Cooling) A flask of water is heated to 95 degrees Celsius and then removed from the heat and
stirred constantly while it cools in a room with air temperature 22°e. Find the differential equation for
the temperature u(t) at time t and predict the cooling time to 23°C, given that the flask cools to 90°C
in 5 minutes.
25. (Thermometer) An outdoorthermometer brought indoors at 7:00 in the morning reads 12°F. Afterthree
minutes, it reads 17°F. Assuming that the room is at 70°F, at what time will the thermometer read 69°F?
26. (Torricelli's Law) Derive from Newton's laws the formula v = y'2gh for the velocity v of a body
undergoing free fall from rest, through height h, neglecting air resistance. Assuming that the falling
body is a small cell of fluid, give an argument for the plausibility of Torricelli's law.
27. (Torricelli's Law) Let T1 be the time it takes to empty half of a cylindrical tank. Let T2 be the time it
takes to completely empty the tank. Explain phYSically why T2 = 2 T1 is false. Determine both T1 and
T2 for a cylindrical tank of radius 10 feet, filled to a depth of 20 feet, that drains from a circular valve
at the bottom of radius 0.25 ft.
28. (Natural Reservoir Surface Area) Let A(x) be the surface area at depth measurement x(t) and time t,
q(t) the flow rate and Vet) the reservoir volume. Show by the chain rule that q(t) = V' (t) = A(x(1) )x' (I)
and derive the formula A(x(t» = q(t)/x' (t).
29. (Draining a Natural Reservoir) Verify the drain time of 19.43 hours in Table 2, Section 2.1, using
Simpsons rule (12) in Section 1.3 and the formula
1 [135 A(u)du
t = 7200n 135 --;J72'
30. (Draining a Spheroidal Tank) A water storage tank has the shape of an oblate spheroid with major
axis a feet and minor axis b feet (so a < b < a). An equation of the tank surface is
r2 (x - b)2
a2 + -b-2- = 1,
where r and x are cylindrical coordinates with origin at the bottom of the tank; see Figure 5. Establish
these results:
(a) The radius of the circular cross-section at height x is
r=a/l-(x-b)2/b2,0:<sx:<s2b.
(0,2b)
(r,x)
(0, b)
a
(a) Loss due to evaporation alone satisfies dV/dt == -JLA(x), where JL > 0 is a constant.
(b) Combine the differential equation of part (a) with the equation dV/dt == A(x)dxldt to solve for
x(t) == x(O) - JLt.
(c) Determine the evaporation time for a cylindrical reservoir of radius 10 feet, filled to a depth of
20 feet, assuming JL == 0/300,000) ftlsee.
32. (Reservoir with Draining and Evaporation) Consider a reservoir in which both draining and
evaporation occur. Establish these results:
dV
(a) The rate equation for V is dt == -av - JLA(x), for some constant JL > o.
dx cx l/2
(b) The water depth x satisfies - == - - - ) - JL, where c == afig.
dt A(x
(c) Determine the time to empty a cylindrical reservoir of radius 10 feet, filled to a depth of 20 feet,
that drains from a circular valve at the bottom of radius 0.25 ft, assuming an evaporation constant of
JL == (1/30,000) ftlsee.
33. (Special Relativity) Let c be the speed of light and rna the rest mass of a particle traveling at velocity
v in a field of constant gravitation g. Einstein's special theory of relativity predicts a variable mass rn
according to the formula
ma
m == -v't=l==-==(=v/==c)=:;'2
d(rnv)
----;It == mg, v(O) == 0,
dx
(2) - = A(t)x.
dt
It is called the homogeneous linear equation associated with equation (1). It is separable
and the method of Section 2.2 provides the solution (11) of Section 2.2. Moreover,
comparison of (2) and (1) in Section 2.2 gives F(t) = A(t), G(x) = x and (9) of Section
2.2 becomes
(4) let) = /t
to
A( r)dr.
The general linear equation (1), with B(t) "I 0, is called a nonhomogeneous linear
equation. We shall now show how a particular solution of (1) can always be found by
Lagrange's method of variation of parameters. The idea of the method is to seek a
x(1)
Xo
where let) is defined as in (4) and the function c(t) is to be determined. On differentiating
(5) and noting that dI/dt = A, we find that
dx dc
(6) - = _cI(t) + c(t)AcI(t)
dt dt '
while
(7) Ax + B = B + cAd(t) .
dc = c-l(t)B(t).
-
(8)
dt
Hence a particular solution of (1) is defined by
Clearly, (10) defines a solution of (1) that passes through the point (to,xo). The preceding
derivation shows that equation (1) has a unique solution that passes through a given
point (to,xo) of its domain of definition. Moreover, from (8) we see that the solution is
defined on the entire interval al < t < a2 in which A(t) and B(t) are defined.
EXAMPLE 1. An LR Circuit. Consider the simple loop circuit of Figure 7. The inductance
L and the resistance R are assumed to be positive constants, while the applied voltage
E(t) may be any function of t.
If i = ¢(t) is the electric current in the loop at time t, then Kirchhoff's laws give the
linear first order equation
di
(11) L dt + Ri = E(t).
2.3 LINEAR FIRST-ORDER EQUATIONS 95
0v
E(t)
R
FIGURE 7. An LR circuit
Note that this may be written in the form (1) with x replaced by i, A(t) = -R/L =
constant and B(t) = E(t)!L. Hence if we choose to = 0 then (4) and (10) give
(12) l(t)
r (-R)
= 10 L dr =L -Rt
and
(13)
where the amplitude Eo and the frequency ware positive constants. Then a simple
lit
integration gives
E ~tfL
(15) - 2 0 22(Rsinwt-wLcoswt)
eRT1LE(r)dr=
L o R +w L
EowL
+ R2 + w 2 L2
Substituting this expression into (13) gives the solution
(16) ._
1-
-RtfL (.
e 10
+ 2
EowL )
R +w 2L2
Eo
+ 2 2 2 (Rsinwt - WLC05Wt).
R +w L
Let ¢ denote the unique angle, measured in radians, such that 0 < ¢ < nl2 and
(17)
96 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
Then (16) and the identity sin(a- b) = sin a cos b - cosa sin b imply that the response
of the L, R circuit to the AC voltage (14) is given by
Initial Value Problems and Separation of Variables. In the exercises below, solve the given initial value
problems using separation of variables.
In the exercises below, solve the given initial value problem by using basic methods or variation of
parameters.
17. (Atmospheric Pressure) Assume the model pi = Ap for the pressure p(x) at elevation x feet above sea
=
level. Given p(l7500) p(0)/2, find a formula for p(x).
2.3 LINEAR FIRST-ORDER EQUATIONS 97
18. (Yeast Culture) The population xCt) in a yeast culture satisfies Xl = Ax and doubles every 16 hours.
In how many hours will the population be 10 times its original size?
19. (Half-Life) A radioactive substance obeys the decay law Xl = Ax with A = -0.0004332. Find the time
t in years when 2x(t) = x(O), called the half-life of the radioactive substance.
20. (Radiocarbon Dating) The idea of radiocarbon dating originates with the work of Nobel laureate
W Libby (Chemistry, 1960). It is based upon absorption of carbon-14, 6C 14, which stops when an animal
dies. The amount x of carbon-14 obeys the decay law x' = Ax and it is known that 2x(5730) = x(O)
(half-life is 5730 years). What percentage of carbon-14 is present in a fossil that is supposed to be 2500
years old?
21. (Loss of Moisture) The drying of a beach towel satisfies the decay law x' = Ax, where x(t) is the amount
of moisture present. If it is half dry in 30 minutes, then when will it be within 1 %of being completely dry?
22. (RC Series Circuit) Use Kirchhoff's laws to derive the linear first -order equation
dq q
(20) R- + - = vet)
dt C
for the RC series circuit of Figure 8, where q is the charge on the capacitor C and vet) is an applied voltage.
23. (RC Series Circuit) Solve the initial value problem for (20) with R = 1, C = 11l0, vet) = 0 and
q(O) = 5.
24. (RC Series Circuit) Solve the initial value problem for (20) with R = 1, C = 11l0, vet) = 20 and
q(O) = o.
25. (RC Parallel Circuit) Solve the initial value problem for (21) with C = 1, R = 11l0, iCt) = 20 sin t
and v(O) = o.
26. (RC Parallel Circuit) Use Kirch hoff's laws to derive the linear first -order equation
dv v
(21) C- +- = i(t)
dt R
for the RC parallel circuit of Figure 9, where v is the voltage drop from junction P to junction Q and
i(t) is a prescribed current generated by a current source.
27. (Continuous Interest) The investment x(t) after t years compounded n times per year at interest rate r
per annum is given by x(t) = x(O)(l +rln)nt. Continuous interest satisfies the growth equation x' = Ax,
where A = r. If x(O) is one hundred dollars and the rate is 8 percent, then for some n both methods of
interest result in identical amounts, to the nearest cent, for the first 50 years. Is n < 365 or is n > 365?
vet)
~C
R
i(t)
for which the solutions are precisely the curves (1). Then the construction is reversed to
construct the solutions of (2) as a family (1).
Recall that if u, U x and uy are continuous and if uy 1= 0, then the implicit function
theorem implies that for each fixed constant c (in the range of u), equation (1) has a
solution
(3) y = j(x),
which is defined and has a continuous derivativej'(x) on a suitable interval a < x < b.
Of course, j(x) also depends on c, but this will be suppressed in the notation. The
definition of j(x) implies that
d
(5) dx u(x,J(x)) = 0 for a < x < b.
2.4 EXACT EQUATIONS 99
But this equation means that y = jex) satisfies the differential equation
dy
(7) uxex,y) + uyex,y) dx = 0 for a < x < b.
EXAMPLE 1. If u(x, y) = y/x then (1) defines the family of straight lines through the origin
(9) ~- c
x
In this case Ux = _ylx 2 , uy = l/x and hence the family (9) satisfies
-y 1 dy
(10) -+--=0
x2 X dx '
or
dy y
(ll)
dx x
The procedure described above can often be reversed to obtain the solutions of a
differential equation (2) as a family (1) For if a function u(x,y) can be found such that
(8) holds, then (2) can be rewritten as (7), which clearly implies (1).
To discover how a function u(x,y) that satisfies (8) may be found when F(x,y) is
given, it is convenient to rewrite (2) as
dy
(12) - F(x,y) + dx = O.
Note that an exact equation can be solved immediately: Its solutions are defined implicitly
by u(x,y) = c. Numerical values of the solutions can be constructed by the methods of
Chapter 1, as illustrated in Section 1.6.
EXAMPLE 2. Equation (10) is exact with u(x,y) = y/x, while the equivalent equation
dy
(15) -y+x- =0
dx
is not exact. Still another equivalent equation is
(16) - -1 + --
1 dy
= O.
x ydx
It is exact and has the integral u(x,y) = log(y/x).
(17) Uxy = u yx .
(18)
Exactness Theorem. Let M(x,y) and N(x,y) have continuous first partial derivatives.
Then if (13) is exact then (18) holds. Conversely, if (18) holds in a simply connected
region D then (13) is exact in D.
(19)
has
(21)
2.4 EXACT EQUATIONS 101
Solution Procedure for Exact Equations. Example 3 will be used to illustrate the
procedure. We must construct u(x,y) such that
(24)
where h(y) is an arbitrary function of y. This result must be reconciled with (23). Equation
(24) implies that
(27)
and the solutions of the differential equation (19) are defined implicitly by the equation
(28)
If (13) is not exact then it may be possible to choose p,(x, y) such that (29) is exact. Such
a function p,(x,y) is called an integrating factor.
102 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
EXAMPLE 4. Equation (15) is not exact, but multiplication by /-L(x,y) = (xy)-l yields
the exact equation (16). Note that another integrating factor for (15) is /-L (x, y) = l/x2,
which yields the exact equation (10).
(30)
or
(33)
Clearly, the criterion for this case is that (My - Nx)/N should be a function of x alone.
(34)
Integrating Factors /-L(y). A similar argument shows that there is an integrating factor
/-L = /-L (y) if and only if
(35)
In this case
is an integrating factor.
2.4 EXACT EQUATIONS 103
(37)
(39)
is exact. The solution procedure for exact equations then yields the implicit solution
(40)
(41)
where <p(x, y) is a scalar field called the velocity potential. Moreover, the incompress-
ibility condition implies that
(42)
The streamlines of a velocity field (41) are the curves in the (x,y)-plane that would
be followed by a particle that moved with the flow velocity V. Thus they satisfy
dx dy
(43) dt = <Px(x,y), dt = <p/x,y),
where t denotes time. Elimination of t gives the differential equation dy/dx = <Pyi<Px, or
dy
(44) - <Py + <Px dx = O.
The integral curves of this equation are precisely the streamlines of the flow (41). Note
that this has the standard form (13) with
Flow Past a Circular Cylinder. The planar flow past a circular cylinder has velocity
potential
(48)
and
Moreover, (50) is exact. Hence the streamlines are the level curves
(52) 1fr(x,y) = c,
where the stream function 1fr(x,y) satisfies 1frx = M,1fry = N. Integration of the first
equation gives
(53) 1fr(x,y) = V J (x 2
2a2xy
+ y2)2 dx + hey)
a2y
= - V-2- -2 + hey).
x +y
2.4 EXACT EQUATIONS 105
Level Set Equations. Verify that the equation is exact and find the level set equation u(x, y) = c.
5 --=--y
y x, 4/ - 2x2 S/ - x 2 ,
. x2 + y2 x2 + y2 . 10. 2
4xy-x
3 + 4y-xy
3 2 Y = O.
Integrating Factors. Find an integrating factor of the indicated form, determine the new exact equation
and find the level set equation u(x, y) = c.
22. (Separable Equations) Show that x'/g(x) = J(t), a completely separated equation, is also exact.
23. (Linear Equations) Show that a linear differential equation x' = a(t)x + bet) can be multiplied by a
suitable function JL(t) to make it exact.
24. (Integrating Factors) Assume that JLMdx + JLNdy = 0 is exact and assume that an integrating factor
JL(x,y) is a product of powers xnym Find conditions on nand m involving the functions M and N.
22/25, ux (2, 1) = 4/25. Explain what this has to do with solving for x in terms of y and y in terms of
x near the point (2, 1), using the implicit function theorem.
26. (Implicit Functions) Assume that u(x,y) is continuously differentiable and u(XO,yo) = c. Give two
conditions for solving for one variable in terms of the other, based upon the implicit function theorem.
(1) d2x (
dt 2 = F t,x, dt
dx)
where the unknown x is a function x = </J(t) of the independent variable t. Such
equations, together with the first-order equations (1) of Section 2.2, occur as models for
a large number of physical, chemical and biological phenomena. In fact, the theory of
such equations is a main theme of this book and many applications are presented here
and in subsequent chapters.
This section treats several special types of second-order equations that can be solved
by integrating two first -order equations. Such equations can be solved by the methods
of this chapter. More general second-order equations, not solvable by these methods, are
treated in Chapter 3.
Second-Order Equations Independent of x. If the variable x does not appear
explicitly in equation (1), so that
(2)
(3)
dx
v = dt'
dv d2x
Clearly, - = - 2 ' and hence v is a solution of the first -order equation
dt dt
dv
(4) dt = Fet, v).
2.5 ApPLICATIONS TO SOME SECOND-ORDER EQUATIONS 107
Let us assume that a solution v = 1jI(t) of (4) has been constructed by one of the methods
dx
of this chapter. Then (3) can be written as at = 1jI(t) and a second integration produces
a solution of (2) of the form
EXAMPLE 1. Free Fall Without Air Resistance - Plane Earth Model. Consider the
free fall of an object from a height h above the surface of the earth. If h is a small fraction
of the earth's radius then the gravitational field may be treated as a constant of magnitude
g = 32 ftlsec 2 (or 980 cm/sec 2 ), see Figure 10, where the x-axis is directed vertically
downward, with the origin at the earth's surface.
The gravitational force on an object of mass m is F = mg. Hence, if the resistance of
the air is neglected then Newton's laws imply that the position of the object at time t is
a function x = ¢(O that satisfies md2 x1dt 2 = mg, or
d2x
(6) dt 2 = g = constant.
This has the form (2) and v = dxldt is the speed of the object. It satisfies
dv
(7) dt = g,
and hence
(8) v = Vo + gt,
1 2
X = Xo + vot + 2. gt ,
where Vo and Xo are the initial speed and position, respectively. Equations (8) are Galileo's
law of free fall in a vacuum.
x = Xo =-h
F=mg
----1---- x = 0
earth
x FIGURE 10. Free fall of an object- plane Earth model
108 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
EXAMPLE 2. Free Fall with Air Resistance - Plane Earth Model. An object that falls
freely in a gas will experience a force R that opposes the motion, producing a net force
F = mg -.R. We expect that R = R(v) will be a function of the speed of fall v. Thus
Newtons laws give md 2x1dP = mg - R(dxldt), or
d2x
(9) dt 2 = g - <I> (dxldt) ,
where <I>(v) = R(v)/m. The precise form of the function <I>(v) will depend on the shape
of the falling object and on the gas through which it falls. It may be expected to be a
continuous function of v with the follOwing additional properties.
(11)
Notice that (9) has the special form (2), and equation (4) for the speed is
dv
(13) dt = g - <I>(v).
For this reason VM is called the terminal speed. If the object has this speed at time t = to
then it will continue in uniform motion
(15)
Equation (3) is separable. Application of equations (8)-00), Section 2.2, with F(t) = 1,
G(v) = g - <I>(v), and initial values x = Xo, v = Vo = 0 when t = 0 gives
and so
(17)
and
(18)
2.5 ApPLICATIONS TO SOME SECOND-ORDER EQUATIONS 109
Notice that g - <1>( v) > 0 for 0 :s v < VM by (1) and (12), and hence r 1 exists, since
j' > O. These results will be applied to three special forms of the resistance function <I> (v).
Case 1. (<I>(v) = kv, k > 0). This function clearly has properties (0)-(12), and the
defining equation for the terminal velocity is kVM = g, or VM = glk. Moreover,
(19)
f~ g - kTJ
= -~f~
k TJ - VM
= -~logITJ-VMI.
k
Hence (16) becomes, for 0 :s v < VM,
(20) j(V) =
10r ~
g - kv
= -~ log
k
(1 - ~) VM
= t.
(21)
(23) j(V) =
10
r g -dTJkTJ2 =_l_IOg(vM+v)=t
2y'gk VM - V '
for 0 :s v < VM. On solving this equation for v, we get (VM + v)I( VM - v) = e2t .Jiik and
hence, after some algebraiC manipulation,
v e2t .Jiik - 1
e2t.Jiik + 1
or
will. Table 3 shows values of Vk and ct>k = ct>(Vk) that might have been obtained in this way.
The units are ftlsec for v and ftlsec 2
for ct>(v). Note that VM =
150 ftlsec, approximately.
When ct>(v) is obtained from experimental data then the integralj(v) of (16) must
be computed by numerical integration. The final column in Table 3 gives the values of
j(Vk) = tk that were obtained by computingj(v) by means of Simpson's rule with step
size h = 5. The value t30 = 16.40 is probably quite inaccurate because the integrand
(32 - ct>(V))-l varies rapidly in the interval 145 ::: v ::: 150. More accurate values of
j(Vk) may be obtained by reducing the step size h. Of course, this will necessitate the use
of the experimental values ct>(v) at additional values of v.
r
The velocity v is given as a function of t by equation (17): v = 1 (t). This function
can be computed from Table 3 by inverse interpolation.
Quadratic interpolation, using the algorithm of Section 1.2, was used to compute
the values of Table 4 below. For brevity, t is restricted to the values 0.0 to 3.0 in steps of
0.1. The positions Xk = X(tk) can be obtained from these data by numerical integration.
Second-Order Autonomous Equations. A physical system is said to be au-
tonomous if its governing differential equation does not contain the independent variable
dv I dx dv
(26) dt = 1/1 (x) dt = v dx·
On combining (25) and (26) and using d2 x1dt 2 = dvldt, we find that
dv
(27) v dx = F(x, v).
Thus v may be determined as a function of x by integrating the first -order equation (27).
Moreover, when v = 1/I(x) is known then the equation v = dxldt becomes
dx
(28) dt = 1/I(x).
112 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
(29)
by Newton's inverse square law, where m is the mass of the probe, g is the gravitational
acceleration at the earth's surface and R is the radius of the earth. 2 Approximate numerical
values are g = 32 frlsec 2 and R = 4000 mi. The minus sign is needed because F is directed
toward the center of the earth. Equation (29) and Newton's laws of motion imply that the
position x = ¢(t) of the probe satisfies the differential equation md 2x1dt 2 = -mk/x 2 , or
k
(30)
X2 '
earth
vo
, - - - - + - + -....- X
Xo
FIGURE II. Initial conditions for launch of a space probe
2A more realistic model would include the influence of the sun and Jupiter, but these effects will be ignored here.
2.5 ApPLICATIONS TO SOME SECOND-ORDER EQUATIONS 113
for all x > R. Equation (30) has the form (25), and the associated first-order equation
(29) is vdvldx = -klx 2 , or
dv
(31)
dx
This equation is separable and the integration technique of Section 2.2 is applicable with
F(x) = l/x 2 and G(v) = -klv. We shall measure time t from the moment of rocket
burnout and assume that the initial values Xo and Vo satisfy
(33) lex) = IX ~ = ~ - ~,
I1
Xo ~ Xo x
= -- 1 (2
vo-v 2) .
V
J(v) I1 dl1=-
k Vo 2k
The solution v = 1j;(x) of (31) that satisfies 1j;(xo) = Vo is defined implicitly by the
equationJ(v) = l(x) , which may be written as
(34)
This equation gives the square of the probe's speed at each distance x from the earth. It
is clear that the escape velOCity VE is the speCial value of Vo for which v -+ 0 as x -+ 00.
From (34) we see that v~ = 2k1xo, or
(35)
Normally, Xo will differ from R only by one or two percent. Taking Xo = R in (35) gives
VE = J2gR as the escape velocity from the earth's surface. To obtain a numerical estimate,
take R = 4000 miles and g = 32 ftlsec 2 = 32 x (3600)2/5280 milhr2 These values
give VE = 25067 milhr.
Equation (34) can be written as
(36)
Clearly, if 0 < Vo < VE then v~ - V6 > 0 and v = 0 when 2klx = v~ - V6 > O. In this
case X max = 2k1( v~ - V6) is the maximum distance reached by the probe. On the other
114 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
hand, if Vo > VE then v2 > V6 - v~ > 0 for all x and v2 ---+ v~ - V6 as x ---+ 00. Thus
(37)
is the limiting value of v for large x. Note that by (36) and (37),
(38) v --~
V~ + v~ > V<Xl> for all x> R.
(40)
Thus
x] -Xo
(41) tmax = - - -
Voo
is an upper bound to the travel time to Jupiter. The minimum distance from the earth
to Jupiter is about 4.2 A.U 3 If we require that tmax = 5 yr = 43800 hr, then we must
have Voo = 4.2 x (93 x lO6)143800 = 8917 milhr = JV6 -
v~. Thus V6 = v~ + v~ =
(25067)2 + (8917)2 or Vo = 26606 milhr.
Solution of the Equation of Motion (30). The solution x = ¢(t) of (30) may be
obtained by integrating the separable first-order equation
(42) dx =
dt
.j2kx + v 2
00
(43)
where
(44)
30ne A.U. (= Astronomical Unit) equals the mean radius of the eanhs orbit around the sun. This distance is
approximately 93 x 106 miles.
2.5 ApPLICATIONS TO SOME SECOND-ORDER EOUATIONS 115
(45) f .jf+a2
~1/2d~
~+ a2
= J a2~ + e - a2log(J~ + a2 + .j€),
whence
The solution x = ¢J(t) may be obtained by solving (46) for x by one of the methods of
Chapter 1. It is interesting that the time tj obtained from (46) with Voo = 8917 milbr,
x = Xj = 4.2 x (93 x 10 6 ) = 3.906 X 108 mi, and Xo = 4000 mi is tj = 4.998 yr, in
du]
dt = -u1 + F(t)Uj + G(t)
2
(51)
116 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
(52)
(55)
(59)
V L + VR + Vc = 0.
i C dQ
(65) Y=-=--.
Vc Q dt
The Riccati Method. We will solve the linear constant-coefficient differential equa-
tions of L, R, C circuits by the Riccati method. Essential use is made of the impedance
and admittance equations.
Clearly, Z and Y satisfy Riccati equations. A short calculation, using (62), gives
dZ 1 2 R 1
(66) = - -Z - -Z - -
dt L L c'
dY 1 2 R 1
--Y - - Y - -
dt C L L
These Riccati equations have constant coefficients and may be integrated by separation of
variables. Moreover, after they are solved for Z and Y, one can solve (64) and (65) to get
2 4L
(69) D=R - - ,
C
is positive, zero, or negative.
Case 1 (D > 0). Write 2fJ = y'D/L. Then
(70) f Z2 + LdZ
RZ + UC = -731 tanh -1 ( Z
f3L
R)
+ 2fJL = -t + (1,
(73)
implies that
where
(76)
It can be shown that (75) is a solution of (63) for any values of A and B.
Case 2 (D = 0). In this case one has
(77) f L~
Z2+RZ+UC =L
f ~ ~
(Z+Rl2)2 = Z+Rl2 = -t+C1,
whence
R L
(78) Z= --+--.
2 t - C1
(80)
whence
R
(83) Z = -- - YL tan yet - (1)'
2
Integrating again gives
(85)
where (1 and (2 are arbitrary. The formula for cos(x - y) implies that
y
0.2
-0.2
y = e- O.2x _ 2e- 0 .4x
-0.6
x
-1.0
o 10 20 30 FIGURE 13. Overdamped oscillations
0.8 Y
0.0
Y = (x - l)e- O.4x
x
-1.0
0 10 20 FIGURE 14. Critically damped oscillations
y
0.8
Y = e- O.4x sin(2x)
0.0
x
-0.4
0 10 FIGURE 15. Underdamped oscillations
If R > 0 then the oscillations are damped, with amplitude that tends to zero like e- RU(2L).
The circuit is said to be underdamped in this case. Figures 13, 14 and 15 illustrate the
three cases.
Equations Independent of x. Convert the differential equations below into two equations of the form
v = x' and Vi = F(t, x') and solve for the general solution, containing two arbitrary constants.
122 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
2. x" = 1. 7. x"-x'=t.
3. x" = 1 + t. 8. x" + 2x' = et.
4. x" +x' = O. 9. x" + (l/t)x' = O.
Equations Independent of t. Convert the differential equations below into an equation of the form
v(dvldy) = F(y, v), usingy = x(t) and v = x'(t). Solve for the general solution, containing two arbitrary
constants, if possible.
23. (Rockets) The equation of motion for the velocity vet) of a rocket in a constant gravitational field g is
In this equation, M is the initial rocket mass in grams and the rocket loses gas at a grams per second at
constant velocity b centimeters per second relative to the rocket (all constants are positive). Solve the
equation for the position x and velocity v. Hint: yeO) = 0 and x(O) = Re (radius of the earth). The
answers involve the reciprocal and logarithm of M - at.
24. (Cantilever) A uniform cantilever beam of length L and of constant weight w pounds per foot has a
shape curve y(x) known to satisfy the initial value problem
25. (Riccati Equation) The Riccati equation y' = xy2 - Y + 1 - x has a particular solutionYl == 1. Use the
transformationy = Yl + l/u to develop a formula forthe solution in terms ofthe integral ofx exp(x 2 -x) .
26. (Riccati Method) The differential equation x" (t) - x' (t) = 0 can be solved by transformation to a
Riccati equation. Show that the Riccati equation is u' (t) = _u 2 (t) + u(t) with solution u/(l - u) = het .
Use the equation u(t) = x' (t)lx(t) to find the general solution x(t) = q + [2 et .
27. Verify the tk column of Table 3 by applying Simpson's rule to evaluate the integral](v) in (16), using
the data in the Vk and <Pk columns. Take g = 32 and use the trapezoidal rule for tl.
2S. Verify Table 4 by applying inverse quadratic interpolation to the Vk and tk columns of Table 3.
(2)
It has been shown above that if equation 0) is separable, or linear, or exact then the initial
value problem has a unique solution which can be constructed by direct integration.
Moreover, the integral curves fill the region D and there is exactly one of them through
each point of D. An example was shown in Figure 2 of Section 2.l.
It is natural to ask whether this unique solvability of the initial value problem also
holds for more general equations (1) that are not solvable by the special methods of
Sections 2.2 to 2.4. The purpose of this section is to present simple and general conditions
on F(t, x) that guarantee the unique solvability of the initial value problem. They are
described below.
Fundamental Theorem. Let F(t,x) and its partial derivative aF(t,x)lax be defined and
continuous in the region D. Then the initial value problem (1), (2) has a unique solution
for each initial point (to,xo) in D.
Notice that there are two parts to the fundamental theorem. First, it states that for
each point (to, xo) in D there is a solution of 0) that passes through that point. Second,
it states that that solution curve is unique.
Meaning of the Existence Statement of the Fundamental Theorem. The
existence part of the theorem means that there is a function x = xU), defined on an
124 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
interval to - a < t < to + b, such that (t, x(t)) is in D and satisfies (1) for each point of
the interval. If lim x(t) = x(to + b) exists and (to + b, x(to + b)) is in D then one can
t-Ho+b
apply the fundamental theorem with (to, xo) replaced by (to + b, x(to + b)) to extend the
interval of existence to a larger one. Thus we may assume that a and b are chosen to be as
large as possible. These values will in general depend on to and Xo, which will be indicated
by the notation a = aCto, xo), b = b(to, xo). The intervalto - aCto, xo) < t < to + b(to, xo)
will be called the maximal interval of existence of the solution through (to,xo).
(6)
Notice that if Xo is very large and positive then b(to, xo) will be very close to 0, while if Xo
is very large and negative then aCto, xo) will be very close to O. This example demonstrates
that the size of the maximal interval of existence will not be obvious from the differential
equation alone.
whose region D is the whole (t,x)-plane. Through the point (to,xo) = (0,0) there are
the distinct solutions x(t) = 0 and also x(t) = t 3. Thus solutions of the initial value
problem are not unique for this equation. It is easy to see that the fundamental theorem
does not apply to equation (7) in any region D that includes points of the t-axis, because
F(t, x) = 3x2/3 and FxCt, x) = 2x- 1I3 is not continuous for x = O. In particular, solutions
can branch from (to,xo) when aFlax fails to be continuous in D.
EXAMPLE 3. Notice that the fundamental theorem gives no indication of how the solutions
of 0) may be computed. Consider, for example, the equation
(8) dx = /1 _ t2 _ x2
dt Y .
(9) F(t,x) = J1 - t 2 - x2
(10)
aF(t, x) -x
ax
Equation (8) is neither separable nor linear nor exact and it cannot be solved by
elementary methods. Nevertheless, F and aFlax are continuous in D and hence the
fundamental theorem guarantees that there is a unique solution of (8) for each (to, xo)
in D. Numerical methods for computing the values of that solution are given in Section
2.7 below.
The fundamental theorem suggests a new point of view. We may define a function
x = x(t) by means of a differential equation (1) and an initial condition (2). In Section
2.7, accurate numerical algorithms to compute the values of such functions are presented.
The algorithms work directly from the differential equation and the initial condition.
Such methods frequently offer the most efficient way to generate and compute a function.
Differentiability of Solutions. It is shown in elementary calculus that every differen-
tiable function is continuous. Hence, solutions of a differential equation (1) are necessarily
126 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
In particular, if F(t, x) is a continuous function of t and x then (11) implies that x' (t) is
also continuous for a < t < b. Continuous functions x(t) such that x'(t) is continuous
are said to be continuously differentiable. Solutions may have higher-order derivatives.
In particular, if F(t, x) and its first partial derivatives Ft(t, x) and FxCt, x) are continuous
then (] 1) implies that x"(t) exists and satisfies
Differentiability Theorem. Let F(t, x) have continuous partial derivatives of orders less
than or equal to k. Then every solution x = xCi) of (1) has continuous derivatives of orders
less than and equal to k + 1. In particular, if F(t, x) has continuous partial derivatives of
all orders then every solution of (1) has continuous derivatives of all orders.
In Section 2. 7 these results will be used to derive algorithms for the numerical
solution of first-order differential equations.
Determine the natural domain D of the fundamental theorem. Show that this problem has a unique
solution, defined for -00 < t < 00, which is given by
t :s -n/2,
1
-1 for
x(t) = ~in t for -n/2 :s t :s n/2,
for t ~ n/2.
:s a,
xU) = 1 -1
~in(t - a - nl2)
for
for
for
-00 < t
a:st:sa+n,
t ~ a+n,
with parameter a ~ O. Explain why this example does not violate the existence-uniqueness theory
2.7 NUMERICAL METHODS FOR THE INITIAL VALUE PROBLEM 127
x' = 3x 2/ 3 , xCO) = 0,
I°
has an infinite two-parameter family of solutions, defined by
u- a)3 for t ~ a,
x(t) = for a ~ t ~ b,
(t - b)3 for t ?:. b.
Determine the separation of variables solution and use it to show that the maximal interval of existence
is -00 < t < l/xo.
5. (Differentiability of Solutions) Show that if F(t,x), FtCt,x) and Fx(t, x) are continuous in a region D of
the (t, xl-plane and if x(t) is a solution of x' = F(t,x) in D, then xU) has a continuous second derivative
that is given by
////////// //
jjjjjjjjjljjj
Illlllllllll
jjjj~jj!jj~~~
1llllll!lllll
I111I1 I11111
///// ///////
~~~~~~~~~~~~~ FIGURE 17. Direction field for the equation x' = xCl - x)
Note that a carefully prepared direction field gives an excellent qualitative description
of the solution curves of a differential equation. Computer programs that generate
direction fields are widely available.
The line elements for (1) were defined geometrically above, and it is easy to write
an equation for them. The line element through (to,xo) is a straight line segment with
slope F(to, xo). Thus it is given by
(3)
is introduced and approximate values Xj ~ x(tj ) are computed at these points only: Values
x(t) at other t-values may then be computed by the interpolation methods of Chapter 1.
It will be shown that x(t) can be computed on the interval [to, to + TJ to prescribed
accuracy by finite difference methods based on a suitably fine mesh.
Euler's Method. The simplest way to generate an approximate solution of (1) is to
follow the line element (2) from (to,xo) to (tl,XI), where Xl = Xo + F(tO,XO)(t1 - to).
The process may then be repeated Lo get (t2,X2), etc. The process is implemented by the
following algorithm:
(4)
2.7 NUMERICAL METHODS FOR THE INITIAL VALUE PROBLEM 129
The process is recursive: Each step provides the data for the next. The approximate
solution is a polygonal curve with vertices (to,xo), (tl,Xl), ... , (tN,XN) and sides that
are line elements. Algorithm (4) was proposed by L. Euler in 1768 and is usually called
Euler's method.
The case of a uniform mesh (3) is defined by tl - to = t2 - tl = ... = tN - tN-l = h,
which gives tk = to + kh, k = 1,2, ... , N. In particular, tN = to + Nh = to + T, so that
(5) Nh= T,
(8)
Application of Eulers method on the interval 0 .:s t .:s 1 with various values of N
gives the results in Table 5, where only the value XN ~ x(I) is recorded. 4 Note that
xCI) = 0.73106, to five decimal places. Hence, Euler's method gives the result to the
same accuracy if the step size h = liN is suffiCiently small.
4The third row of the table gives the error EN = x(l) - XN.
130 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
(9)
measure the error produced by Eulers method at the mesh points tk. How small must
h be to guarantee that IEk(h)1 is less than a prescribed tolerance? This question will be
answered by means of a careful estimate of IEk(h)l.
Taylors theorem with remainder implies that
(10)
(11)
(12)
from (11) and using the definition (9), we get the relation
and hence
(15)
(18)
Combining the estimates (13), (15) and (18) gives the estimate
1
(19) IEk+l(h)1 ::; (1 + hL)IEk(h)1 + "2h2(K + LM),
for k = 0,1,2, ... , N - 1. Moreover, E(h) = x(to) - Xo = 0. On applying (19) for
k = 0, 1,2, ... ,N - lone gets the following error estimate.
An Error Estimate for Euler's Method. If F(t,x) satisfies (14) and (17), then
where
(21) C= ~ (M + ~) (e LT - 1).
The details of the proof of (20) will not be given here. They can be found in most
numerical analysis texts; see, e.g., [HeJ or [Ch-KJ. Note that (20) implies that if Ch ::; E,
so that IEk(h)1 ::; E, then
(22)
Thus, at worst, halving the step size h will reduce the error to one-half its former value.
This explains the rather slow convergence shown in Table 5. Note that the four h values
in the table are hI = 1/4, h2 = 1/64, h3 = 1/256 and h4 = 1/4096 with the successive
ratios hk+I/h k = 1/16. Hence, each EN should be about 1/16 of the preceding one. The
table entries are consistent with this estimate.
The remainder of this section presents some more accurate finite difference methods
for solving the initial value problem (1). In each case an error estimate of the form
(23)
holds, where m is a positive integer. A method for which (23) holds is said to be an mth-
order method. Thus, Euler's method is a first-order method. Methods of orders 2 and 4
are described below. Note that (23) implies that
(24)
Hence methods of order m 2: 2 converge much more rapidly than Euler's method.
Taylor Series Methods. In the error analysis of Euler's method, Taylor'S theorem
with second-order remainder, equation (10), led to Euler's equations xk+l = Xk + hx~,
132 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
x~ = F(tk,Xk) when the second-order error term h2X"(~k)/2 was dropped. It is natural
to construct higher-order numerical methods by retaining additional terms in the Taylor
series. The simplest case, after Euler's method, is suggested by Taylor's theorem with
third-order remainder:
(25)
where th :::; ~k :::; tHI. Again, the differential equation (1) gives X'(tk) = F(tk,X(tk».
Moreover, differentiaLing (1) gives
d
(26) x"(t) = d/(t,x(t») = Ft(t,x(t» + FxCt,x(t»x'(t)
and hence
(27)
An error analysis similar to that for Euler's method above confirms that this algorithm is
second order, as expected. We shall refer to (28) as the Taylor2 algorithm.
The Taylor2 algorithm with these values gives the results in Table 6.
2.7 NUMERICAL METHODS FOR THE INITIAL VALUE PROBLEM 133
Note that the Taylor2 method yields x(l.O) to 5-place accuracy in only 64 steps, many
fewer than the 4096 steps needed with Euler's method. The error results in the table are
consistent with the 2nd order error estimate lEN I :::: Ch 2 , since the h values for the table
are hI = 118, h2 = 1/16, h3 = 1132 and h4 = 1164 with common ratio hk+llhk = 112.
The Taylor4 Method. As a final example of the Taylor series method we shall
consider the Taylor4 method. This is defined by keeping the terms through order 4 in
the Taylor series. Thus
where the coefficients x~£) = dixkldt£ for f = 1,2,3,4 are generated from F(t,x) and its
partial derivatives.
obtained by integrating x' (r) = F( r, x( r)) over the interval to ::: r ::: to + h. If h is small,
the integral in (32) may be estimated accurately by one step of the trapezoidal method.
Thus
h
(33) x(to + h) ~ x(to) + "2 [F(to, x(to)) + F(to + h, x(to + h))].
Of course, the value x(to + h) on the right is not yet known. However, a good
approximation to it is given by Euler's method:
(38)
where IR(h)1 S Mh 3 . Thus Heun's approximation Xl (h) differs from the Taylor2
approximation by a term of order 3. It can be shown, by a calculation similar to that for
the error analysis of Euler's method, that the overall error EN(h) for Heuns algorithm (36)
satisfies IEN(h)1 ::: Ch 2 Therefore, the method has order 2. In contrast with the Taylor2
2.7 NUMERICAL METHOOS FOR THE INITIAL VALUE PROBLEM 135
method, second-order accuracy is achieved with only two evaluations of F(t, x) per step,
and the partial derivatives Ft(t,x) and Fx(t,x) are never used.
'The original references, (in German), are C. Runge, Math. Ann. 46, 167-178 (1895); K. Heun, Z. Math. Phys. 45,
23-38 (1900); W Kutta, Z. Math. Phys. 46, 435-453 (1901); and C. Runge, Nachr. K. Ges. Wiss. G6ttingen, 252-257
(1905)
136 2 ORDINARY DIFFERENTIAL EQUATIONS OF FIRST ORDER
method requires us to calculate and evaluate F(t, x) and its partial derivatives through
order three at each step.
IExercises 2.7 ~
"~,ok~,~~.%,;n,mijJ , $W11>
Direction Fields. Apply the method of direction fields to produce a graphical solution to the given initial
value problem. Use 25 grid points.
Euler's Method. Apply Euler's method to the following initial value problems using 4 steps. Compare with
the exact solution.
13. x' = t +x, xeO) = I, h = 0.2. 15. x' = x(l - x), x(O) = 0.5, h = 0.2.
Heun's Method. Apply Heun's method (Improved Euler) to the following initial value problems using 2
steps. Compare with the exact solution.
17. x, = x, x(O) = 1, h = 0.2. 19. x' = x(l - x), x(O) = 0.5, h = 0.2.
Runge-Kutta Method. Apply the Runge-Kutta4 method to the following initial value problems using I
step. Compare with the exact solution.
21. x' = x, x(O) = 1, h = 0.4. 23. x' = x(I - x), x(O) = 0.5, h = 0.4.
25. (Euler) Write a computer program to implement Euler's method. Test it on the logistic equation and
verify Table 5.
26. (Taylor2) Write a computer program to implement the Taylor2 method. Test it on the logistic equation
and verify Table 6.
27. (Taylor4) Write a computer program to implement the Taylor4 method. Test it on the logistic equation
and verify Table 7.
28. (Heun) Write a computer program to implement the Heun method. Test it on the logistic equation and
verify Table 8.
29. (Runge-Kutta4) Write a computer program to implement the Runge-Kutta4 method. Test it on the
logistic equation and verify Table 9.
30. (Comparisons) Use the Euler, Heun, and Runge-Kutta methods to compute x(l) for the problem
x'(t) = x 2 (t) + 1, x(O) = 0, with step size h = 0.1. Compute the exact answer and compare the four
answers in a table.
31. (Quadrature) The integral f;/2 cos(t)dt can be evaluated by Simpson's rule. It can also be evaluated
by solving the initial value problem X'(t) = cos(t), x(O) = 0, for the value x(rr/2). Choose step size
h = O.lrr12. Apply the Euler, Heun, and Runge-Kutta4 methods to evaluate x(rr/2) and compare the
answers with the corresponding answer obtained by Simpson's rule using 10 divisions.
C HAP T E R 3
Ordinary Differential
Equations of Higher Order
This chapter continues the theme of Chapter 2: differential equations and their
applications in science and engineering. Chapter 2 emphasized first-order equations
dx/dt = F(t,x).
Here equations that contain second, and higher-order derivatives are analyzed. Section
3.1 shows how such equations arise naturally in applications. Sections 3.2 through "3.5
develop the theory and applications of linear second-order equations. Section 3.6 treats
linear equations of the third and higher orders.
IV
E(t)
c
R
FIGURE 1. An LRC circuit with voltage
source E(t)
In addition, the algebraic sum of the voltage drops equals zero, by Kirchhoff's law:
VL + VR + Vc - E = O.
The charge q(t) on the capacitor C and electric current i(t) in the circuit are related by
. dq
(2)
1= dt'
whence
d'
(3) Ldl + Ri + 3.. = E(t).
t C
Elimination of i(t) gives a single second-order equation for q:
(4) Ld
2
q + R dq + 3.. = E(t).
dt 2 dt C
In Section 2.5 it was shown that if E(t) = 0 then (4) can be solved by means of a Riccati
equation. This method fails if E(t) =f=. O. More general methods that are applicable in all
cases are developed in Sections 3.2 and 3.3 below, and in Chapter 4.
Mechanical Oscillators. Consider the mechanical oscillator of Figure 2.
A mass m is suspended from a spring and attached to a damper such as a
pneumatic door stopper or an automotive shock absorber. The displacement of m from
its equilibrium position under gravity is denoted by x. It will be assumed that three forces
act on m:
Tx
m
c
FIGURE 2. A mechanical oscillator with damper
where k and c are positive constants that characterize the spring (Hooke's law) and the
damper, respectively. The acceleration of the mass m is d2x1dt 2 . Thus Newton's second
law of motion implies that x satisfies the differential equation
d2x dx
(5) m dt 2 + c dt + kx = F(t).
Notice the similarity between this equation and equation (4). The equations become
identical if one makes the identifications L = m, R = c, lie = k and E(t) = F(t). Thus
if (4) has been solved then (5) may be solved by just renaming the letters. Such equations
are said to be isomorphic, meaning that they have the same form.
A Torsional Pendulum. Yet another oscillator that is modeled by an equation
isomorphic to (4) is depicted in Figure 3. It is assumed that the structure is cylindrical, is
made of an elastic material such as steel, and is fastened rigidly at the top. Such torsional
pendulums are used in some clocks.
If the pendulum is rotated about its axis, the displacement is described by the twist
angle e. If I denotes the moment of inertia of the pendulum then Newton's law for
rotating bodies states that
d2e
(6) 1 - = net torque on the pendulum.
dt 2
....... ~
.. ~.
where k and ( are positive constants characteristic of the particular pendulum. Combining
(6) and (7) gives the differential equation
(8)
which dearly is isomorphic to (4). It will be seen below that this kind of isomorphism
extends to many mechanical and electrical systems. Further examples may be found in
[DenH].
Equations with Variable Coefficients. In the LRC circuit described above the
coefficients L, R, and C are constants. However, in some applications, L, R, or C may be
caused to vary with t in a prescribed way, either by mechanical or by electronic means.
This leads to equations of the form
(9)
where the notation means that one or more of the coefficients L, Rand C may vary with t.
Isomorphic equations arise from mechanical oscillators whose parameters m, ( and k vary
with t. A variety of examples from the theory of mechanical vibrations may be found in
[DenH], Chapter 8. Here two of the problems from [DenH] will be described. Solutions
are presented in Section 3.5 below. Both problems involve an equation of the form
(10)
where bet) is positive and periodic with period p > 0, so that for all t,
where k and I'.k are constants and w > 0, so that p = (2rr)/w. Equation (10) with
periodic bet) is called a Hill's equation. 1 The special case (12) is usually called Mathieu's
equation. 2
Two examples of physical problems that lead to Hill's equation will be given. The
examples are from page 340 in [DenHJ.
Pendulum of Variable Length. The small amplitude oscillations of a pendulum are
governed by the equation
(13)
see Figure 4 and page 340 in [DenHJ. If f. is varied periodically with period p (and hence
with frequency w = (2rr)/p) by pulling and releasing the string through the point of
support, then (13) has the form (10), (1l).
This example suggests how solutions of (10), (11), and (12) may be unstable due to
resonance. The frequency of the unperturbed pendulum (I'.k = 0) is Wo = .Jk radians
per sec. If w = 2wo and if the initial conditions are chosen properly, then f. is shortened
when x = 0 and the tension in the string is a maximum and f. is lengthened when
x = ±xmax , and the tension is a minimum. Indeed, this is essentially what a child does
to "pump" a schoolyard swing. The result is that work is done on the system, twice for
each oscillation, producing an ever increasing amplitude of the motion. It will be shown
below how mathematics can be used to predict and analyze such instabilities. A second
physical example is the following:
Oscillation of a Mass on a String with Variable Tension. Consider a mass m,
vibrating at the midpoint of a taut string with length f. and tension T; see Figure 5 and
page 340 in [DenHJ.
m
T
J
FIGURE 5. Mass on string with variable tension
Newton's laws imply that for small vertical displacements x of the mass m, one has
is determined by the external and internal sources of heat. For static (time-independent)
sources F(x) and temperatures u(x) one gets the equation
(16) dx dU) -
d ( rex) dx q(x)u = -F(x),
or
d u (1 dr(X)) du q(x)
- - - - - - U = - -F(x)
2
(17) -+ -.
dx2 rex) dx dx rex) rex)
Here c2 = Tip, T is the tension in the membrane and p is its mass density. Next consider
the synchronous vibrations of the membrane, defined by the product
(20)
a2v 1 av 1 a-
-ar2 + -r -ar + -
2v
+w2 v = o.
r2 ae 2
Substituting this into (20) gives, after some rearrangement of terms, the equation
rlRI/(r) + rR'(r) GI/((})
(22) __ ----,--,----__ + (j}r l = _ --.
R(r) G(e)
This must hold for all points of the membrane; i.e., 0 < r < a and 0 :::: e::
2n. Hence
the two sides of (22) must equal one and the same constant, which we shall assume
positive and write as JL 2 . Thus
(23)
and
Equation (23) has constant coefficients and can be solved by the method of Section 2.5.
The general solution is
(27)
(28) x =wr,
y(x) = y(wr) = R(r).
(29)
or, equivalently,
-d y2 + -1 -dy + ( 1 -
2 n2)
(30) - Y = O.
dx X dx x2
3.1 EXAMPLES FROM ENGINEERING ANO PHYSICS 147
This is the well-known Bessel equation of order n, which has many applications in
physics and engineering. 3 Properties of its solutions are studied below, in Section 3.4 and
in Chapters 8 and 9. The analysis of the vibrating membrane problem will be completed
in Chapter 9.
Steady-State Temperature in a Disk. Consider the disk of Figure 7 and let T =
uCr,8) denote the steady-state Ctime-independent) temperature there. It will be shown
in Chapter 9 that u(r, 8) satisfies the Laplace equation in polar coordinates r, 8, namely,
a2 u 1 au 1 a2 u
(31) -+--+--=0
ar2 r ar r2 a8 2 .
Note that this is identical to equation (20) with w = O. Hence the product solutions of
(31),
d2 R 1 dR n 2
(33) -+----R=O.
dr2 r dr r2
This equation will be solved below. The solution of the steady temperature problem for
the disk will be completed in Chapter 9.
Steady-State Temperature in a Sphere. The steady temperature in a solid homo-
geneous sphere is a function T = u(x,y, z) of the three rectangular coordinates x, y and
z that satisfies the partial differential equation of Laplace:
a2 u a2 u a2u
(34) -+ -+-= 0
2
ax 2 ay2 az .
3The word "order" is overworked in mathematics. Here it denotes the parameter n rather than the second derivative.
148 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
()
x FIGURE 8. Spherical coordinate system
shown in Chapter 9 that the Laplace equation (34) is equivalent to the partial differential
equation
(36) -J(2JV)
Jr
r -
Jr
+ -1- -J(.sm ¢ -
sin¢ J¢
av) = O.
J¢
A family of product solutions of this equation,
will be sought, as in the preceding cases. Substitution of (37) into (36) gives, after
separation of variables,
This relation must hold for 0 < r < a and 0 < ¢ < 1T, and hence, as in the disk problem,
the two sides of (38) must equal the same constant f,A,. Thus
d2R 2 dR f,A,
(41) -+----R=O
dr2 r dr r2 '
which is similar to, but not identical with, equation (33). It will be solved below. Equation
(40) for <I> is much more complicated. It can be simplified somewhat by the change of
variable
d<l> dy dx . dy
- = - - = -(sm¢)-
d¢ dx d¢ dx
and hence
d ( (1- x 2) dy
(44) dx dx ) + J-LY = 0,
or, equivalently,
2 d2y dy
(45) (1 - x ) - - 2x- + J-LY = O.
dx 2 dx
Solutions of this equation are derived below, in Section 3.4, and are applied in Section
3.9 to construct steady-state temperature distributions in spheres.
Examples of Systems from the Theory of Oscillations. The remaining examples
illustrate the occurrence of systems of ordinary differential equations in the theory of
electrical and mechanical systems with several degrees of freedom. It will be shown that
such systems lead by elimination to higher-order linear differential equations; that is,
equations that contain derivatives of order two or more.
Multiloop Electric Circuits. Kirchhoff's laws, when applied to multiloop circuits,
lead to systems of ordinary differential equations with two or more unknown functions.
This will be illustrated here with the simple two-loop circuit of Figure 9.
The variables ql and q2 denote the charges on the capacitors CI and C2 , while iI,
i2, and i3 are the currents in the three branches, as indicated in the figure. Kirchhoff's
current laws applied to the branch junctions give
4 After A.M. Legendre 0752-1833), French mathematician, who introduced it in the theory ofthe Laplace equation.
150 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
q]
t ~i2
C] i3 q2
i]
C2
~
(46)
and
. dq]. dq2
(47) I] = dt' 12 = dt'
Moreover, Kirchhoffs voltage laws applied to the two loops give (since b =h - i2 )
(48)
and
(50)
and
(52)
3.1 EXAMPLES FROM ENGINEERING AND PHYSICS 151
If this is solved for q2 and the result is substituted for q2 in (50), then the following
equation for ql alone is found:
(55)
where AI, A 2 , and A3 are arbitrary constants. Moreover, substituting this in (52) gives
1 -t 1 -2t -3t
(56) q2 = --Ale - -A2e - A3e .
5 2
It can be shown that (55), (56) give the general solution of the system (50), (51) for the
special parameter values of R, L, Cl and C2 chosen above.
Mechanical Oscillators with Two Degrees of Freedom. The electric circuit of
Figure 9 is said to have two degrees of freedom because two functions, ql (t) and q2 (t),
are needed to specify the state of the system. Similarly, mechanical oscillators are said
to have two degrees of freedom if their states are specified by two displacements. An
example is shown in Figure lO.
The masses ml and m2 are attached to springs with spring constants hI and h2 ,
respectively The displacements of ml and m2 are both measured downward from their
eqUilibrium positions, which are separated by the fixed distance Xo. The masses are
coupled by a damper, which opposes the relative motion of m1 and m2 with a force
This is a system of two second-order differential equations for the unknown displace-
ments Xl (t) and X2 (t). In Chapter 4 the system is solved directly by means of the Laplace
transform. Here it will be shown how it may be solved by eliminating X2 to obtain a
single fourth-order equation for Xl.
It will be convenient to write
d
(59) D=-
dt
and rewrite (57), (58) as
d4xI d3XI
(63) (mlm2) dt 4 + (ml + m2)c dt 3
d2XI dx l
+ (ml k2 + m2 hl) dt 2 + (hI + h2 )CTt + (k l h2 )x1 = O.
(64)
3.1 EXAMPLES FROM ENGINEERING AND PHYSICS 153
(65)
where A I ,A2 ,A3, and A4 are arbitrary constants. Moreover, with Xl known, the function
DX2 = dx 21dt can be calculated from (60) and X2 from (58). This procedure gives
LRC Circuits. Determine the differential equation and initial conditions for the given circuit.
6. R = 100, C = 1110, L = 1, E = 5. The capacitor is charged to 0.001 farads and an ammeter reads 3
amperes at time t = O.
Mechanical Oscillators. Determine the differential equation and initial conditions for the given
mechanical system. All units are cgs.
7. No damping is present, the mass is 10 and the Hooke's spring constant is 2. No external force. At time
t = 0 the position and velocity are zero and one, respectively
8. No damping is present, the mass is 10 and the Hooke's spring constant is 2. The externalforce is 100 sin wt.
At time t = 0 the position and velocity are both zero.
Torsional Pendulum. Determine the differential equation and initial conditions for the given system. All
units are cgs.
9. The moment of inertia is 1, no damping is present and the elastic restoring force constant is 10- 2 No
external torque is present. The twist angle and velocity are initially one.
10. The moment of inertia is 5, the damping constant is 2 and the elastic restoring force constant is 1. No
external torque is present. The twist angle and velOCity are initially one.
154 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
11. The moment of inertia is 1, the damping constant is 3 and the elastic restoring force constant is 2. The
external torque is 10sinwt. The twist angle and velocity are initially zero.
12. The moment of inertia is 1, no damping is present and the elastic restoring force constant is 2. No
external torque is present. The twist angle is initially 90 degrees and the velocity is initially zero.
Pendulum of Fixed Length. Determine the differential equation and initial conditions for the given system.
13. A string oflength 10 creates a pendulum with attached mass 0.01, all in fps units. The small oscillations
of the pendulum begin by pulling the pendulum to angular position 7 degrees and releasing the mass.
14. A string of length 5 creates a pendulum with attached mass 10, all in cgs units. The small oscillations of
the pendulum begin by pulling the pendulum to angular position rrll2 radians and releaSing the mass
with velOCity 1 em/sec.
Temperature Distribution in a Nonuniform Rod. Determine the linear second-order ordinary differential
equation for the time-independent temperature u(x).
au = -a ( yix-
- au) - x 2 sin(x)u.
at ax ax
17. (Bessel Equation) In r 2R"(r) + rR'(r) + (<<}r2 - n 2 )R(r) = 0, make the change of variables X = wr,
y(x) = R(r) to obtain the Bessel differential equation x 2y" + xy' + (x 2 - n 2 )y = O.
18. (Steady-State Temperature in a Disk) Derive the differential equation r2R" (r) + rR'(r) - n2R(r) = 0
by substitution of the product solution u = R(r)EJ(iI) into the partial differential equation r 2 u rr + fUr +
ulJIJ = O. Assume that the separation constant A has the form A = n2 (see Chapter 8).
19. (Legendre Differential Equation) Explain why (0 - x 2 )y')' + f.LY = 0 is equivalent to the equation
(l - x 2 )y" - 2xy' + f.LY =
O.
20. (Euler's Differential Equation) Define variables r = e:", y(x) = R(r). Find relations for r 2d2R1dr2
and rdRldr in terms of y and x. Use them to show that the Euler equation r2 ~ + r¥r - f.LR = 0 is
equivalent to the constant coefficient equation y" - f.L y = 0
21. (Multiloop Circuit) Verify directly that q = Aj c- t + A2c-2t +A3c-3t is a solution of the loop equation
q'" + 6q" + 11q' + 6q = O.
22. (Mechanical Oscillator) Verify directly that x = Aj c- t + A2te-t + A3 cos t + A4 sin t is a solution of
the oscillator equation X'III + 2X'" + 2x" + 2x' + x = O.
23. (Parallel LRC Circuit) Consider the parallel LRC circuit of Figure 11 with current source i(t). Let ij,
i2, i3 and i be the currents in the four branches, as shown, and let u(t) = Li3(t). Use Kirchhoff's laws
to show that
(u " 1, + -u
+ -u 1 = i(t).
R L
24. (Mass-String System) For the mass-string system of Figure 5, show that for small displacements x
the vertical force on the mass m is -2Tsin8 "" -2Ttan8 and tanil = 2x1e. Then use Newton's law to
verify the differential equation (14).
3.2 LINEAR SECOND-ORDER EQUATIONS - STRUCTURE OF SOLUTIONS 155
«-- i
FIGURE 11. Parallel LRC circuit with current source
25. (Two-Loop Circuit) Consider the two-loop electrical circuit of Figure 12. Let i), i2 be the electrical
currents in the two branches, as shown, and let q), q2 be the corresponding electrical charges on the
plates of the two capacitors.
Use Kirchhoff's laws to derive a system of two differential equations for q), q2. Then show that the
system is isomorphic to the system (57), (58) for the mechanical oscillator of Figure 10.
For applications to electric circuits,f(t) may represent an input voltage that has simple
jump discontinuities such as a square wave or sawtooth wave. Similarly, in applications
to mechanics, J(t) may represent a discontinuous applied force, and aCt) and bet) may
be system parameters that vary discontinuously. Hence, in what follows aCt), bet) and
J(t) are allowed to be sectionally continuous. The solution concept for (1) may then be
formulated as follows.
Fundamental Theorem. For each pair of numbers Co and CI, equation (1) has exactly
one solution x(O such that
(3)
Clearly, this theorem guarantees that (1) has a two-parameter family of solutions on
any interval to ~ t ~ tl, where aCt), bet) andJ(t) are defined and sectionally continuous.
This fact underlies all the work that follows on second-order linear equations. The proof
of the fundamental theorem, like that for the first-order case in Section 2.6, requires
techniques of advanced calculus, see, e.g., [B-RJ.
The Homogeneous Equation. Note that the identically zero function x(t) == 0 is
always a solution of (2). It will be called the trivial solution. Next, let Xl (t) and X2{t) be
two solutions of (2) and let Al and A2 be constants. Then the function
(4)
is said to be a linear combination of XI(t) and X2(t), with coefficients Al and A 2. A
simple but fundamental property of the linear homogeneous equation (2) is that linear
combinations oj solutions are again solutions. The reader is advised to check this property.
Linear Dependence. A linear combination (4) is said to be trivial if Al =
0 and
A2 = O. Otherwise, it is said to be nontrivial. Two functions Xl (t) and X2 CO are said to
be linearly dependent if a nontrivial linear combination of them equals zero:
(5)
This may be expressed by stating that Xl (0 and X2 (t) are linearly dependent if and only
if one of them is a constant multiple of the other. For example, if (5) holds with Al =I- 0
thenxI(t) = -(A2/A I )X2(t).
3.2 LINEAR SECOND-ORDER EQUATIONS - STRUCTURE OF SOLUTIONS 157
Linear Independence. Two functions Xl (0 and X2 (t) that are not linearly dependent
are said to be linearly independent. Thus, if Xl (t) and X2 (0 are linearly independent and
AIXI (t) + A2X2 (t) = 0 then Al = 0 and A2 = O. This may be expressed by stating that
Xl (0 and X2 CO are linearly independent if and only if neither is a multiple of the other.
The Wronskian. A simple test for linear independence uses the Wronskian
functions
(6)
It follows that W(XI, X2, t) = 0 for to ::: t ::: tl if and only if W(XI, X2, to) = O. Thus
linear dependence can be checked by calculating W(XI, X2, t) at a single point.
To verify Abel's formula (8), differentiate (6) and use the assumption that Xl (t) and
X2 (t) are solutions of (2) to give
the fundamental theorem implies that (2) has a pair of solutions, Xl CO and X2 (t), with
initial values
X2(tO) = 0, x;(to) = 1.
This pair is linearly independent because W(Xl,X2, to) = 1. Of course, Xl and X2 can be
defined in many other ways. It is only necessary to choose their initial values such that
W(Xl,X2, to) 1= o.
Structure Theorem for Equation (2). It was noted above that if Xl (t) and X2(t) are
solutions of (2), then so are all of their linear combinations (4). It will now be shown
that if Xl (t) and X2(t) are any linearly independent pair of solutions of (2) and if x(t) is
any third solution, then there exist unique constants Al and A2 such that (4) holds. To
verify this, let x(t) be any solution of (2) and define Co = x(to), Cl = x'(to). Choose AI,
A2 to satisfy the equations
This is a pair of linear equations for AI, A 2, and it has a unique solution for AI, A 2.
This is true because the determinant of the system is W(Xl, X2, to), and W(Xl, X2, to) 1= 0
because Xl (t) and X2 (t) are linearly independent. Now let
(12)
where AI, A2 satisfy (11). Then yet) is a solution of (2) and yet) satisfies y(to) = co,
y' (to) = (1, by (11). But this means that the solutions x(t) and yet) have the same initial
values. Hence x(t) and yet) are identical by the fundamental theorem. Thus if Xl and X2
are linearly independent solutions of (2) then every solution of (2) has the form (4). For
this reason a pair of linearly independent solutions of (2) is called a solution basis for (2).
The Inhomogeneous Equation. Next let x(t) and xp(t) be any two solutions of (1)
with the same right-hand sidej(t):
If these equations are subtracted, then the jet) terms cancel, and yet) = x(t) - xp(t) is
seen to be a solution of the homogeneous equation. Thus yet) has the form (12):
(14)
3.2 LINEAR SECOND-ORDER EQUATIONS - STRUCTURE OF SOLUTIONS 159
Structure Theorem for Linear Second-Order Equations. Let xp(t) be any particular
solution of (1) and let Xl (t), X2 (t) be any pair of linearly independent solutions of (2).
Then for each solution x(O of (1) there exist unique constants AI, A2 such that
(15)
The constants A I, A2 play the role of the two constants of integration that are expected
in the general solution of a second-order equation. Thus the structure theorem reduces
the calculation of the general solution of (2) to two steps: CO find a pair of linearly
independent solutions of the homogeneous equation (2), and (ii) find a particular
solution of the inhomogeneous equation (1). Of course, the steps can be carried out in
either order.
The most difficult step in finding the general solution (5) of 0) is to find a single
nontrivial solution Xl (t) of the homogeneous equation. In fact, the surprising result
will be shown that if a nontrivial solution Xl (t) of (2) can be found then a second
independent solution of (2) and a particular solution xp(t) of 0) can be calculated by
simple integrations.
The Method of Reduction of Order. This method starts with a solution Xl (0 =1= 0
of (2) and seeks a second solution X2 (t) of the form
(16)
Note that if vet) is not constant then the pair Xl (0, X2 (0 will be linearly independent.
Differentiation of (16) gives
(18)
Clearly, vet) = ( = constant is a solution, but this gives X2 (t) = (Xl (t), and Xl, X2 are
linearly independent. What is needed is any nonconstant solution of (18). Now, (18) is
a linear separable first -order equation for
and, by exponentiation,
(23)
Note that constants of integration can be chosen in any convenient way, since only one
solution is needed. It is clear from (22) that v' is not zero and hence Xl and X2 are linearly
independent.
and so
Since multiples of solutions are also solutions, it is seen that Xl = el and X2 = eSt are
linearly independent solutions of (24). Hence, the general solution of (24) is
(25)
where VI (t) and V2(t) are to be calculated. Of course, if VI and V2 are constants then
(25) defines a solution of the homogeneous equation. It will be shown that nonconstant
functions VI (t), V2 (t) can be constructed such that (25) defines a particular solution of
the nonhomogeneous equation.
Differentiation of (25) gives
(26)
Before differentiating again, note that two conditions can be imposed on the functions
VI and V2. One condition is the differential equation (1). As a second condition it is
traditional to take
(27)
Other choices are possible, but this one will prove to be convenient. Combining (26)
and (27) gives
(28)
(29)
On substituting (25), (28) and (29) into (1), one finds that
(30)
The terms containing VI and V2 (undifferentiated) cancel because Xl and X2 are solutions
of the homogeneous equation (check this!). Equations (27) and (30) are a pair of
simultaneous linear equations for v~ and v;. Moreover, the determinant of the system
is the Wronskian W(XI, X2, t) and is not equal to zero because Xl and X2 are linearly
independent. Hence, (27) and (30) have a unique solution for v~ and v;, which is readily
found to be
(31)
7Joseph Louis Lagrange 0736-1813), famous European mathematician. He was born in Turin and worked there
and in Berlin and Paris.
162 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Functions VIet) and V2et) are obtained by integration of (31). Note that constants of
integration can be ignored, since they change xp only by a solution of the homogeneous
equation. If the equations of (31) are integrated and substituted into (25) then the explicit
()-1t
formula
[X2 (t)XI (r) - Xl (t)X2 (r) If( r) d
(32) Xp t - r
to W(XI,X2, r)
It was seen above that one may take Xl = et , Xl = e5t , whence W(XI, X2, t) = 4e6t . Thus,
by (31)
, 1
V ---
1- 4'
Note that since et is a solution of the homogeneous equation, a Simpler particular integral
is xp(t) = -tet/4. Thus, by the structure theorem, the general solution of (33) is
1
x(t) = Alet + A2 e5t - 4"tet ,
IExercises 3.21
~''C'",,>",'_ 'wVv,"")Llh",-_-,,,_v~"_Cvv __ -'_"
Wronskians and Linearly Independent Solutions. In each of the following exercises, (a) compute the
Wronskian determinant of the given solutions, (b) verify that the solutions satisfy the differential equation,
(c) apply the Wronskian test to establish linear independence of the solutions on the indicated interval.
1. XI(t) = et ,x2(t) = e- t , -00 < t < OO,X" -x = O.
2. Xl (I) = cosh(t), X2 (I) = sinh(l), -00 < t < 00, x" - X = O.
3. Xl (I) = cosh(t - to), X2(t) = sinh(t - to), -00 < t < 00 (to fixed), x" - x = O.
4. XI(t) = t,X2(t) = tlogt, 0 < t < 00, [2X" - tx' +x = O.
Abel's Formula. In each of the following exercises use Abel's formula to evaluate the Wronskian of the
given solutions Xl and X2.
3.2 LINEAR SECONO-OROER EQUATIONS - STRUCTURE OF SOLUTIONS 163
Method of Reduction of Order. In each of the following exercises a linear homogeneous differential
equation and one nontrivial solution Xl (t) are given. Use the method of reduction of order to construct a
second solution X2 (t) such that XI (t) and X2 (t) are a solution basis on the interval.
14. t 2x" + tx' + (t 2 - *)x = 0, XI (1) = t- 1/2 cost, 0 < t < 00.
Method of Variation of Parameters. In each of the following exercises a linear nonhomogeneous equation
is given, together with a solution basis for the homogeneous equation. Use the method of variation of
parameters to find a solution of the inhomogeneous equation.
22. (Linearity) Let Xl (1) and Xl (t) be solutions of the linear homogeneous second-order equation (2) on
an interval to ~ t ~ tl and let Al and Al be constants. Show that the linear combination x(t) =
A IX] (t) + A2Xl (t) is also a solution of (2) on the same interval.
23. (Rockets) A skyrocket is fired vertically upward from the ground. Find the time at which it returns
to the ground if the rocket has mass m = 100 gm, thrust T = 2 x 10 5 dynes, and if it burns for
164 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
to = 3 sec. Ignore air resistance. Suggestion: Let the rocket's height at time t sec be x(t) cm. Show that
if g = 980 cmlsec 2 then
Hencex(t) is the unique solution ofthis equation that satisfies the initial conditionsx(O) = 0, x' (0) = O.
where a and b are constants. Note that in Section 3.1, the LRC circuit equation (4),
with source term E(l) = 0, can be put in the form (1) by dividing by L. The analogous
statements are true for the mechanical oscillation equation (5) and the torsional oscillator
equation (8) of Section 3.1.
Pairs of solutions of (1) with
R
(2) a= -
L'
b=~
LC
were derived in Section 2.5 by the Riccati equation method. Three cases were found,
depending upon whether the parameter
a2 R2 1 R2 - 4UC
(3) - - b=- - - = ---::--
4 4U LC 4U
was positive, zero, or negative. The solutions may be written as follows.
a2
Case l. 4- b > 0.
a2
Case 3. - - b < 0.
4
(7)
where the parameter A was to be determined. It is easy to check that (7) defines a
nontrivial solution of (1) if and only if A is a root of the characteristic equation
(8) A2 +aA+b=0.
The three cases defined above correspond to whether the discriminant of (8), a2 - 4b,
is positive, zero, or negative.
In Case lone has a 2 - 4b > 0, and (8) has the two real distinct roots:
a a2
(9) A2 = -- -f3; f32 = - - b.
2 4
°
Equation (7) with these values of A gives the two solutions (4). In Case 2 one has
a2- 4b = and (8) has the double real root
a
(10) Al = A2 = --.
2
Equation (7) with this value of A gives the solution Xl (t) of (5). The second solution of
(5), X2(t) = t e- at12 , can then be found by the method of reduction of order. In Case 3
one has a2 - 4b < 0, so that the roots of (8) are complex and (7) does not immediately
give the solutions (6). However, in this case one can look for solutions of (1) of the form
(ll)
(12)
Combining this with (11) gives the solution pair (6) of Case 3. Note that formally, Case
3 is the case where (8) has the complex roots
a
(14) A2 = -2 - iy,
Thus, in every case, the roots of Euler's characteristic equation (8) permit one to calculate
a solution basis for (1).
where f(t) is a prescribed function of t. The structure theorem of Section 3.2 guarantees
that the solutions of (15) have the form
(16)
where Xl, X2 is a solution basis for (1), constructed as above, and xp(t) is any particular
solution of (15). The variation of parameters method of Section 3.2 provides the integral
formula (32), Section 3.2, for xp(t). It will be shown that it takes a particularly simple form
when the homogeneous equation has constant coefficients. The three cases described
above will be considered separately
Case 1. In this case Xl(t) = d- 1t , X2(t) = d- 2t , and W(Xl,X2, t) = 0"2 - AI) eO.j +'"2)t,
where AI, A2 are defined by (9). Thus equation (32), Section 3.2, with these choices and
to = 0 gives
(17)
or
(18)
(19)
3.3 LINEAR SECOND-ORDER EQUATIONS WITH CONSTANT COEFFICIENTS 167
sinh f3t
(20) h(t) = e- atl2 _ _ (Case 1).
f3
Case 2. Substituting the solution basis (5) into (32), Section 3.2, again gives a particular
solution xp(t) of the form (18). The function h(t) is most easily calculated from (20) by
fixing a and making b -+ a2/4 so that f3 -+ O. This gives
Case 3. Equation (32), Section 3.2, again gives a particular solution of the form (18). The
simplest way to find h(t) forthis case is to use the relation f3 = iy (see (4), (6)). This gives
sin yt
(22) h(t) = e- atl2 _ _ (Case 3).
y
Alternatively, the use of complex quantities may be avoided by using the solution basis
(6). This gives W(XI ,X2, t) = ye at and hence the trigonometric identity
gives
(25)
or
This is easily verified by making the change of variable r = t - s (t fixed) in the integral
of (24). The convolution will play an important part in the applications of the Laplace
transform in Chapter 4.
168 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Here A = -1 is a double root of the characteristic equation, and hence both e- t and
t e- t are solutions of the homogeneous equation. Hence the trial solution xp(t) = At e- t
will not succeed. The variation of parameters method yields a solution of the form
xp(t) = At 2 e- t If this is substituted into (32), it is readily found that A = 112, so
xp(t) = (l/2)t 2 e- t .
EXAMPLE 6. A third type of function for which the method of undetermined coefficients
is effective is illustrated by the equation
Since differentiation of sin t gives multiples of sin t or cos t, it is natural to try xp(t) =
A sin t + B cos t with undetermined coefficients A and B. Thus x~ = A cos t - B sin t,
x; = -A sin t- B cos t = -xp, and substituting into (33) gives 2A cos t-2B sin t = sin t
or 2A cos t - (2B + 1) sin t = O. Since cos t and sin t are linearly independent, the last
equation implies that A = 0 and 2B + 1 = 0, or B = -112. Thus xp(t) = -(1/2) cos t.
(34)
where n > 0 is an integer and p and q are real. These functions have the important
property that all their derivatives are sums of functions of the same type. Thus if a function
is substituted into the left side of (15) then a right-hand side of the form
(36)
J(t) = (pot + Plt n- l + ... + Pn) ept cosqt
+ (Qotn + Qltn- l + ... + Qn) ept sin qt
is generated. It is a remarkable fact that the converse of this statement is essentially true.
Exceptions occur only when some of the terms in (35) are solutions of the homogeneous
equation. This happens precisely when the real or complex number p + iq is a simple
or double root of the characteristic equation (8). Complete rules for finding xp(t) in all
cases may be formulated as follows.
Rules for the Undetermined Coefficients Method. Consider the nonhomoge-
neous equation (15) with right-hand side J(t) given by (36). Then (15) has a particular
solution xp(t) that is given by one of the following three cases.
Case 1. p + iq not a root oj (8). Then there is an xp(t) of the form (35).
Case 2. p+iq a simple root oj (8). Then the correct form of xp(t) is found by multiplying
the right-hand side of (35) by t.
Case 3. p+iq a double root oj (8). Then the correct form of xp(t) is found by multiplying
the right-hand side of (35) by t 2 .
Once the correct form of xp(t) is determined, the "undetermined coefficients" AD, A l ,
... , An, Bo, Bl , ... , Bn are calculated by substitutingxp(t) and (36) into the differential
equation and then equating coefficients of like terms. This leads to a system of 2n linear
equations for the unknowns AD, Al, ... ,An, Bo, Bl, ... , Bn.
An argument that the procedure described above always works can be based on the
Laplace transform.
Note that Case 3 above can occur only if q = O. This is because complex roots of
(8) occur only in complex conjugate pairs Al = P + iq, A2 = P - iq. The method may
also be applied when J(t) is a sum of several terms of the form (36), each with different
values of n, p, anellor q. This follows from the follOwing theorem.
Then the sum x(t) = Xl (t) + X2 (t) satisfies (15) with J(t) = Jl (t) +h (t).
This is verified by adding the two equations of (37).
The remainder of this section illustrates the methods developed above by applications
to physical problems. Additional examples and applications are given in the exercises at
the end of the section.
3.3 LINEAR SECOND-ORDER EQUATIONS WITH CONSTANT COEFFICIENTS 171
Applications to LRe Series Circuits. A simple LRC series circuit with voltage
source E(t) is shown in Figure 13. The charge q(t) on the capacitor C and the current
i(t) in the circuit satisfy the differential equations (2) and (4) in Section 3.1. In particular,
if E(t) is a given function of t ~ 0 and if q(O) = qo and i(O) = dq(O)!dt = io are given,
then q(t) is a solution of the initial value problem for (4), Section 3.1, namely
1
(38) Lq" + Rq' + - q = E(t) for t ~ 0,
C
q(O) = qo, q' (0) = i o·
The fundamental theorem of Section 3.2 guarantees that this problem has one, and only
one, solution. Moreover, the solution can be constructed explicitly by the methods of
this section. Here several physically distinct cases are discussed below that correspond
to different voltage sources E(t) and parameter values L, Rand C.
The general form of the solution of (38) may be obtained from (15)-(18) above with
a = RlL, b = l/LC andj(t) = E(t)IL. Thus
where ql (t) and q2 (t) are any linearly independent solutions of the homogeneous
equation and k(t) is the particular solution of the homogeneous equation that satisfies
k(O) = 0 and h'(O) = 1. Note that
E(t)
c
The constants AI, A2 in (39) are determined by the initial conditions of (38). In fact,
This is a pair of simultaneous linear equations for Al andA 2 . They have a unique solution
because the determinant of the system is
(44)
The precise forms of ql and q2 depend on the values of L, Rand C. This was discussed
in Section 2.5. Three cases were distinguished depending on whether the discriminant
D = R2 - (4UC) was positive, zero, or negative. If R > 0 and D ~ 0 then q(t) was
seen to tend exponentially to zero with at most one change of sign, while for R > 0
and D < 0 the charge q(t) was oscillatory with exponential damping. Such solutions are
called natural oscillations of the circuit. The initial state of the circuit, defined by qo
and i o, gradually decays to zero.
The Case R = O. In an idealized circuit with R = 0, called a superconducting circuit,
energy is conserved and the solution is purely oscillatory with no decay The exact form
is given by the solution basis (see (87), Section 2.5)
(48)
Thus
This differs from the case of no applied voltage (Eo = 0) only by the shifted value of q(t).
Thus q(t) oscillates about the constant charge CEo rather than about zero. The current
i(t) is unchanged.
AC Applied Voltage. If E(t) = Eo sin wt with w > ° then
,1
+ Rq + Cq =
If
(SO) Lq Eo sin wt.
(52)
If
q + Y2 q = -Eo.SIn wt.
L
Substituting (51) into (52) gives
E
(53) (y2 _ ( 2) (Ao cos wt + Bo sin wt) = ~ sin wt.
L
Equating coefficients of cos wt and sin wt gives, since y2 - w2 =1= 0,
Eo 1
(54) Ao = 0, Bo = -L 2 2'
Y -w
and
Eo sinwt
(55) qp(t) = L(y2 _ (2) .
(58)
-RwEo
(65) Ao = -Ll-'
and hence
-Rw cos wt + (_LW2 + lIC) sin wt
(66) q ( t) - E - - - - - - - - - - -
p - 0 (Lw2 _ l/C)2 + R2w2
Resonance when R > O. Notice that the point in the (x,y)-plane with coordinates
-Lw 2 + lIC
(67) y= ~
and nl2 < ¢ < 3nl2 (since cos ¢ < 0). On substituting (68) into (66) and using the
identity
one gets
(t) = Eo cos(wt - ¢)
(69)
qp .J(Lw2 - lIC)2 + R 2w 2
It is clear that resonance occurs at frequency w = W max such that x = w~ax maximizes
the amplitude
2 1
(70)
Y = LC
(71)
for all x> O. Hence A'(x) < 0 and A(x), is monotone decreasing for all x> O. There is
no resonance.
Case 2. y2 > R2/(2L)2. In this case A'(x) has exactly one positive zero at x = cv~x'
and resonance occurs at frequency
~
(72) CVmax = yy2 - W'
Figure 14 shows sample graphs of A(cv2 )/Eo versus cv for L = 1, C = 1, and several
values of R.
General Applied Voltages. Let E(t), defined for t ::: 0, be an arbitrary applied
voltage such as a signal from a transmission line. If the circuit is quiescent for t < 0, so
q(O) = 0, q'(O) = 0, then by (40) one has the circuit response
This defines the unique solution of the initial value problem for (38) with zero initial
values, for any function E(t) that is sectionally continuous for t ::: O.
Causality. Equation (73) clearly expresses the causality principle for circuits: The
state of the circuit at a time t > 0, as described by q(t) and i(t) = q'(t), is determined
by the values of the applied voltage E( T) for 0 :::: T ~ t only.
Numerical Integration. If E(t) is known only from experimental data then the
circuit response q(t) may be obtained directly from the convolution (73) by numerical
integration. Recall that k(t) is an elementary function, given by (20), (21), (22). For
o
o 0.5 1.0 1.5 2 FIGURE 14. Graphs of A(o/)IEo vs. w
3.3 LINEAR SECOND-ORDER EQUATIONS WITH CONSTANT COEFFICIENTS 177
and
(75)
Moreover, h(t) = q2 (t)/y, and hence, by the addition formula sin(a - b) = sin a cos b -
cos a sin b, one has
(77)
where
(78) 1
VI (t) = - yL 10t( ell sin yr
R, )
E(r)dr,
1
V2 (t) = yL 10t( ell cos yr
R, )
E( r)dr.
These last two functions can be computed directly from the data E(r) by numerical
integration.
Applied Voltage of Finite Duration. If E(t) == 0 for all t 2: to > 0 then VI and V2
are constants Al and A2 for t 2: to and one has
(79)
(80)
1
Al = - yL
to
10 (e~ sin yr) E(r) dr,
A2
1 to
= yL 10 (e~ cos yr) E(r) dr.
Isomorphic Physical Problems. Each result obtained above for LRC series circuits
has an analogue for each of the isomorphic physical problems derived in Section 3.l.
These are the LRC parallel circuits and the linear and torsional mechanical oscillators.
To obtain the analogous result, one need only rename the variables and parameters
178 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
q u x (J
L C m
R lIR
C IlL 11k k
according to the following table. Some applications of these isomorphisms are given in
the Exercises below.
Euler's Method. Find solution bases for the following equations by Euler's method.
L
1 c
19. Compute iCt) and plot its graph if L = 0.10 H, R = 2 n, C = 20 /-LF, and Ve = 200 V.
20. Compute iCt) and plot its graph if L = 0.10 H, R = 2 n, C = 10 /-LF, and Ve = 100 V.
Applications to Mechanical Oscillators.
Consider the mechanical oscillator of Figure 16. The damper is attached to a fixed support, while the
spring is attached at its lower end to an oscillating platform. The displacements of the mass and the platform
are denoted by x(t) and y(t), as indicated in the figure. Newton's second law gives
where F, the force due to the spring, is given by Hooke's law F = key - x + K), where K is a constant.
It will be assumed that the spring is unstretched when x = Xo and y = YO, so that Yo - Xo + K = 0, or
.. yet)
J
platform FIGURE 16. Mechanical oscillator
180 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
If we write
(83) x(t) = xo +u
and define yet) by
23. Calculate Amax = maxA(w) on w ::": O. Give separate answers for the two cases 2km - c2 > 0 and
2km - [2 ::: O. Explain the connection with resonance.
Applications of the Convolution Integral to lRC Circuits. In the exercises below, refer to the lRC
circuit of Figure 13. The input voltage E(t) represents a signal, delivered to the circuit by a transmission
line. The goal is to compute the voltage across the capacitor,
Vet) = q(t)/C,
where q(t) is the charge on the capacitor It is assumed that q(O) = 0 and q' (0) = i(O) = 0, so that q(t) is
given by (73) and hence
25. Graph the function Vet) on the interval 0 ::: t ::: 1.0, if L = 1, R = 2, and C = 0.5, while E(t) is the
linear interpolant of the 11 experimental data points given in Table 2. The data points are (x, sin(nx))
for x = 0 to x = 1 in steps of 0.1. Without computer assistance, use E(t) = sin(nt).
3.3 LINEAR SECOND-ORDER EQUATIONS WITH CONSTANT COEFFICIENTS 181
Euler's Method Verification. The following exercises complete statements made in the text about the
validity of Euler's method and linear independence of the solutions.
26. Show by direct differentiation that the solutions Xl, x2 obtained for cases 1, 2, 3 indeed satisfy the
differential equation x" (t) + ax' (t) + bx(t) = 0
27. Calculate the Wronskian of the solution pair xl (t), x2 (t) given in cases 1,2,3. Verify in each case that
the Wronskian never vanishes. Apply the theory of linear independence in Section 3.2 to verify that the
solution pairs given in cases 1, 2, 3 are linearly independent.
Overdamped Spring. A spring with attached mass and fluid dashpot has its free vibration x(t) satisfying
the model
28. Find the roots of the characteristic equation}.. 2 + 3}" + 2 = O. Use the analogy of the LRC circuit
equation to solve for x(t) = Ae- t + Be- 2 t.
The damping effects are in the constant 16, because if 16 is decreased by any amount at all, then oscillations
are produced. Therefore,
33. Plot the general solutions of x"Ct) + 16.5x'Ct) + 64x(t) = 0 and x"(t) + 17x'(t) + 64x(t) = 0 for
A = B = 1 and 0 ~ t ~ 1. Where do they cross?
Underdamped Spring. A spring with mass and fluid dashpot usually displays sustained oscillations of
decreasing magnitude. A characteristic model is
34. Solve the differential equation by analogy with the LRC circuit equation.
35. Consider the particular solution x(t) = e- t sin(t). Graph this solution for 0 ::s t ::s T, where T is the
solution of e- T = 11100 (then x(T) is 1% of the maximum amplitude 1). How many times does x(t)
cross the t-axis on 0 < t < T7
182 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
The Power Series Method. This method seeks to construct solutions x(t) of
equation (1) as the sums of convergent power series:
L [k(t -
00
Taylor's theorem implies that the series above must be the Taylor series of x(t), so that
The Power Series Theorem. Let the coefficients aCt) and bet) in (1) be analytic at to:
L ak(t - L bk(t -
00 00
for It - tol < r. Then for each choice of the constants Co = x(to) and [1 = x/(to) the
equation (1) has an analytic solution (2). The solution may be constructed by substituting
the series (2) and (4) into (1) and calculating the coefficients [2, [3, ....
Proofs of this theorem may be found in advanced textbooks such as [B-RJ. No general
proof will be given here. However, the theorem will be verified in several important
special cases.
Rules for Power Series. The application of the power series method depends on
three simple rules for calculating with series. The proofs of these rules, which are based
on Taylor's theorem, will be left for the exercises.
3.4 LINEAR SECOND-ORDER EQUATIONS WITH ANALYTIC COEFFICIENTS 183
The Differentiation Rule. The power series representation (2) can always be
differentiated term by term to get a power series representation for the derivative:
00
Moreover, the derived series converges in the same interval It - tol < T as for x(t).
The Product Rule. Let aCt) and bet) be analytic at to with power series (4) convergent
for It - to I < T. Then a(t)b(t) is analytic at to and
00
with coefficients
k
(7) Yk = Laebk- e, k=0,1,2, ....
£=0
Note that the coefficients (7) are those that result from the formal multiplication of the
series (4) for aCt) and bet)
The Vanishing Rule. If the sum of the power series in (2) is zero for It - tol < T
°
then the coefficients Ck = for k = 0, 1,2, ....
The power series method is applied below to the Bessel and Legendre equations.
First, the method will be illustrated by application to two simpler examples.
The method of Section 3.3 provides the solution basis Xl(t) = ct , X2(t) = tc t . These
solutions will be rederived here by the power series method.
For (8) the constant coefficients aCt) = -2, bet) = 1 are analytic at to for every to.
Taking to = °gives the trial solution
00
and
00 00
The second summations in (10) and (11) are obtained from the first by increasing the
summation index from k to k + 1. Verify that the two summations represent exactly
the same power series. On substituting the series for x(t), x'(t), and x"(t) into (8) and
combining the three sums, one gets
00
By the vanishing rule, this equation implies that the coefficient of each power tk must
equal 0, so that for k = 0, 1,2, ... ,
Clearly, when Co and CI are given then (14) with k = 0 gives C2. Next, (l4) with k = 1
gives C3, etc. Thus all the Ck for k 2: 2 are generated from Co and CI by (l4).
To generate the solution XI (t) = et (so x~ (t) = et ), one must take Co = XI (0) = 1
and Cl = x~ (0) = l. Then (14) with successive values h = 0,1,2, . .. gives
2Cl 1 1
C2 = -
Co
- - =-
2 2 2!'
C3 = -2C2
3
- CI
--
2·3
= -
3!'
etc.
The correctness of this equation for all k 2: 0 may be verified by mathematical induction.
Thus if (15) holds for C£ and CHI then (14) gives
2 1 1 1 1
C£+2 = £ +2 (£ + 1)! - (£ + 1)(£ + 2) £! = (£ + 2)!'
This completes the induction. Finally, substituting (15) into (9) gives the familiar
exponential series
00 tk
(16) x(t) = " - = et =Xl(t).
~ h!
k=O
3.4 LINEAR SECOND-ORDER EQUATIONS WITH ANALYTIC COEFFICIENTS 185
The second solution X2 (t) = t if (so x~ (t) = (t + I)e t ) can be obtained by the same
method from (14) and the initial values Co = X2(0) = and C1 = x~(O) = l. °
Example 1 was simple because (8) has constant coefficients. A more interesting case
is the following.
This has the form (1) with variable coefficients aCt) = 2t and bet) = 2, which are analytic
at all points to. Taking to = ° gives
oc
(18) x(t) = L Ck tk ,
k=O
oc
x'(t) = L kCktk-1,
k=O
oc
tx'(t) = L kCk tk ,
k=O
oc oc
L
00
or
Consider first the solution with Co = 1, C1 = 0. Then (21) with k = 1,3,5, ... gives
C1 = (3 = C5 = ... = 0; that is,
Next, (21) with k = 0,2,4, ... gives C2 = -1, C4 = -(212 = 112, C6 = -I/3!, etc. It is
easy to guess that
To verify this, note that (21) with k = 2l and (23) with k = £ give
where the series was summed by the familiar exponential series. It is easy to verify, by
direct differentiation, that Xl (t) = e- t2 is a solution of (17).
To find a solution basis for (17), a second solution X2 CO with initial values Co = 0,
Cl = 1 will be constructed. Note that Co = ° and (21) imply that
It is easy to show that X2 (t) is not an elementary function. In fact, the method of reduction
of order gives a second solution
(29)
whence
(31) X2 (t) = e- t2 1 t r2
e di.
This is the solution whose Taylor series is (28). In fact, from (31),
(32)
3.4 LINEAR SECOND-ORDER EQUATIONS WITH ANALYTIC COEFFICIENTS 187
Hence, both (28) and (31) define solutions of (17) that have initial values X2(0) = 0,
x;(O) = 1. It follows that they are identical, by the uniqueness part of the fundamental
theorem.
The Legendre Equation. The Legendre equation (45) of Section 3.1 can be written
"lx, J.l 0
(33)
Y - I - x 2Y +1- x 2Y = .
(35) y(x) =L Ck xk ,
k=O
so
00
2: k(k -
00 00
k(k + 1) - J.l
(42) CH2 = (k + 1)(k + 2) Ck, k = 0, 1,2, .. , ,
188 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Clearly, once Co and Cl are chosen, the remaining coefficients Ck are uniquely determined.
Two cases will be discussed.
Case 1. (co = 1, Cl = 0). In this case, (42) with odd indices gives C2k+! = 0 for k = 0,
1, 2, ... and so one solution of (40) is
L
00
where
(2k)(2k + 1) - J.l
(44) C2k+2 = (2k + 1)(2k + 2) C2k for k = 0, 1,2, . ...
This relation implies that if no C2k = 0 then
(45) · -
11m -= 1
C2k+2
k-->oo C2k
and hence by the ratio test, the radius of convergence of (43) is 1; that is, Yl (x) is defined
by (43) for Ixl < 1. The successive coefficients C2k are determined by Co = 1 and (44).
Thus
(-J.l)
C ---
2 - 1.2'
2·3 - J.l (-J.l) (2 . 3 - J.l)
C4 = 3 . 4 C2 = 4!
It is not difficult to prove by induction that
(-J.l)(2·3 - J.l)(4· 5 - J.l) ... «(2k - 2)(2k -1) - J.l)
(46) C2k =
(2k)!
Case 2. (co = 0, Cl = 1). This second choice gives C2k = 0 for k = 0, 1, 2, ... and
hence a second solution of (40) is
(47)
where
(2k - 1)(2k) - J.l
(48) C2k+l = (2k)(2k 1) C2k-l+ for k = 1,2,3, . ...
Note that W(Yl ,Y2, 0) = Yl (O)Y~ (0) - Y2 (O)Y~ (0) = 1 and hence Yl ,Y2 are a solution
basis for the Legendre equation. If no C2k+! = 0 then
(49)
· -
11m -= 1
C2k+l
k-->oo C2k-l
3.4 LINEAR SECOND-ORDER EQUATIONS WITH ANALYTIC COEFFICIENTS 189
and hence the radius of convergence of (47) is l. The successive coefficients C2h+1 are
determined from C1 = 1 and (48). Thus
1·2-11
2·3
3·4 - 11 (1 ·2 - 11)(3 . -11)
C5 = 4.5 C3 = 51
Equations with Singular Points. Equation (1) is said to have a singular point
at to if one or both of the coefficients aCt), bet) is not analytic there. Thus the Bessel
equation (30) of Section 3.1 has a singular point at Xo = 0 and the Legendre equation
(33) has Singular points at Xo = ±l. Moreover, for applications it is often important to
discover the behavior of the solution of the differential equation near such points. All of
the examples given ahove can be treated by the theory of linear second-order equations
with regular singular points. The theory is based on the following definition.
Definition. A linear second order equation (1) is said to have a regular singular point
at t = to if it can be put into the form
a(t) , bet)
+--x +
/I
(51) x x=o,
t - to (t - toF
(52) x"+ -x
£10 '+ -x=
bo 0
t t2 '
or, equivalently,
(53)
Note that in the examples from physics of Section 3.1, equations (33) and (41) are Euler
equations. Euler's method for solving (53) is to look for solutions of the form x(t) = ItI",
where the index a is to be calculated. For simplicity it will be assumed here that t > 0,
so that
Then
(55)
and similarly,
Three cases arise, depending on whether the roots of this equation are real and distinct,
real and equal, or complex conjugates. The results may be formulated as follows.
Case 1. (al and a2 real and distinct). Then (53) has the solution basis
(59)
Clearly, the ratio of the two solutions is not constant if al =1= a2, and hence they are
linearly independent.
Case 2. (al = a2 = a real). Then (53) has a solution basis
(60)
It is clear that t Oi is one nontrivial solution. The second can be obtained from the first by
the method of reduction of order.
3.4 liNEAR SECOND-ORDER EQUATIONS WITH ANALYTIC COEFFICIENTS 191
Case 3. (al = p + iq, a2 = p - iq, q > 0). Then (53) has a solution basis
EXAMPLE 3. The search for product solutions u(r, ()) = R(r)8«()) of the Laplace equation
in polar coordinates, (31), Section 3.1, led to the Euler equation (33) (Section 3.1); i.e.,
(63)
(64)
The Method of Frobenius 8 • If (51) has a regular singular point at to but is not
an Euler equation (so aCt) and bet) are not both constants), then Euler's substitution
x(t) = It - tol a does not give a solution. For this more general case Frobenius proposed
the trial solution
L Ch(t -
00
where, for simplicity, it is assumed that t > to. This clearly generalizes the power series
method (where a = 0) and the Euler method (where Ck = 0 for k = 1,2, .. .). The
idea is to substitute (66) and its derivatives into (51) and to determine, in succession,
the values of the exponent a and the coefficients co, C1, C2, .... Here the main steps of
the calculation will be written down to display the equations by which a and the Ck are
determined. For brevity, only the case to = 0, t > 0 is shown.
Moreover, the product b(t)x(t) is analytic at to = 0 with power series obtained by the
product rule (6):
(75)
Similarly,
(76)
3.4 LINEAR SECOND-ORDER EQUATIONS WITH ANALYTIC COEFFICIENTS 193
If the series in (69) is to define a solution of the differential equation (67) then the sum
of the series (73), (76), and (75) must be zero:
(77)
By the vanishing rule this implies that the coefficients are all zero:
=°
k k
(78) C~ + L ak-jC; + L bk-jCj for k = 0, 1,2, ....
j=O j=O
Since Co i= °
by assumption, the exponent a must satisfy the indicial equation
Main Frobenius Theorem. Let a be the larger of two real roots of the indicial equation
or either of two complex roots. Then for each Co i= 0, equation (67) has a nontrivial
solution (69) with coefficients generated by the recursion relation (82).
194 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
To complete the proof of this theorem one need only prove that the series (69)
generated by the recursion (82) must converge. Such proofs require techniques of
advanced calculus. They may be found in advanced texts such as [B-RJ.
Of course, if the roots of the indicial equation do not differ by an integer or
are complex, then both roots generate solutions, and a solution basis is obtained.
However, the Main Frobenius Theorem suffices for applications because a second, linearly
independent, solution can always be constructed by the method of reduction of order.
These techniques are illustrated in the remainder of this section by application to the
Bessel and Legendre equations.
Bessel Functions. Only Bessel functions of integer order n ~ 0 will be discussed.
For the application of the Frobenius method it will be convenient to write the Bessel
equation (30) of Section 3.1 in the alternative form
(83)
Here the coefficients a(x) and b(x) of (51) are a(x) == 1 and b(x) = _n2 + x 2, so in (68)
ao = 1, bo = _n 2 , and the indicial equation (80) is
(84)
with roots al = nand a2 = -no Thus the main Frobenius theorem guarantees the
existence of a solution
00 00
Now, the Bessel equation (83) implies that the sum of the last four displayed equations
is zero:
L [(k + n)(k + n -
00
Hence all the coefficients in the square brackets must vanish by the vanishing rule. These
conditions can be written as
(92)
because Cl 0, and hence C1 = O. It then follows from (91) that C2k+l °for
k = 0, 1,2, ... , and (85) takes the form
L C2kX2k+
00
(93) y(x) = n.
k=O
This determines the coefficients C2, C4, ... uniquely when Co is given. A short calculation
gives
-Co
C2 = -22-=-(-1-!)-Cn-+-1-) ,
Co
C4 = -24-C-2-!)-(n-+-1-)C-n-+-2)'
-Co
C6 = -26-C-3-!)-Cn-+-1-)(-n-+-2)-(n-+-3-r
This suggests the general fonnula
( -1) kco
(95) C2k = 22kk!Cn + 1)(n + 2) ... (n + k)' k = 1,2, ... ,
which is easy to check by induction. The standard choice of Co is Co = 1I(2 nn!), which
gives the formula
(_l)k
(96) C2k= 2 2k +n k!(k+n)!' k=0,1,2, ....
196 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
00 (_l)k (x12)2k+n
(97) In(X)=~ k!(k+n)! ' n=0,1,2, ....
It is easy to verify by the ratio test that this series converges for all finite x. Note in
particular the important speCial case
00 (_1)k(x12)2k
(98) lo(x) =~ (k!)2
Other Solutions of the Bessel Equation. The method of reduction of order applied to
the Bessel equation and the particular solution Yl (x) = In(x) yields a second linearly
independent solution (if Xo is any positive constant):
(99)
See (16) and (23) in Section 3.2. Thus Yl (x) = In (x) and Y2 (x) are a solution basis, and
every solution of the Bessel equation can be written Y = AnYl + BnY2, or
(100)
where An and Bn are arbitrary constants. The theory of these functions is developed in
standard references, e.g., [A-SJ. Here only the following fact, needed for applications in
Chapter 8 and Chapter 9, will be presented.
Finiteness Property for In. The solution (l00) of the Bessel equation has a finite limit
as x ~ 0 if and only if Bn = O. In other words, the only solutions of the Bessel equation
that are finite at x = 0 are the scalar multiples of ln (x).
It is clear that if Bn = 0, then yeO) = AJn (0) is finite. To verify the converse it will
be enough to show that
(101) . l
hmln(x)
x...,. a x
Xo
~ = +00.
- 2-
~n (~)
is a convergent alternating series. The Leibnitz theorem for alternating series implies that
for x small enough, 0 < ]o(x) :::: 1, and hence
1 1
-- > - for 0 < x :::: Xo.
x]~(x) - x
Integration gives
(102) [
xO d~
-2-::::
[xO - d~
=logxo-logx=log(-).
Xo
x ;Jom x ~ x
Since]o(O) = 1, this implies (101) with n = O. Proceeding similarly for n :::: 1 one has
and hence 0 < ]n(x) :::: (xJ2)n/n! for x small enough, whence
X2n+l
X]2 (x) < ---::--::-
n - (n!)222n
and
1 22n(n!)2 1
-->--->--
x]~(x) - X 2n + 1 x2n+l
(104) l x
Xo d~
;J~m
> 1 ( 1
- 2n x2n - x~n
1) for 0 < x :::: Xo.
[Xo d~ ( 1 xn)
(106)
]n(x) lx ;J~(~):::: Yn xn - x~n for 0< x:::: xo,
where Yn = 1/[(n!)(2n+l)(2n)] > O. Making x --+ 0 in this inequality gives (101) for
n:::: l.
Behavior of ]n(x) for x --+ +00. Another approximation that gives inSight into the
nature of the function]n(x) is defined by the equation
Of course, this may taken as a definition of the function cn(x). The equation is made
interesting by the statement that one has the estimate
(108)
where Kn is a suitable constant. The known proofs of (l08) and estimates of the constant
Kn are not elementary. They may be found in advanced texts such as [B-RJ.
The estimate (108) implies that for large x the function 1n(x) is approximated by
the elementary function on the right-hand side of (l07). Moreover, the approximation
improves with increasing x. Such approximations are called asymptotic estimates.
Estimate (l08) implies that1n(x) is an oscillatory function for large x, with infinitely
many zeros. Moreover, the large zeros of 1n ex) are spaced about rr units apart. In fact,
the zeros of10(x) are close to those of cos(x -rr/4), even for relatively small values of x.
This will be explored numerically in the exercises.
Recurrence Relations. The values of Bessel functions and their first derivatives are
closely related. The most basic relations are
(l09) d n ) n
dx [x 1n(x J = x )n-1 (x)
and
(1l0)
These equations are verified by substituting the Frobenius series for 1n (x) on the left-
hand sides and checking that the resulting series are those of the right-hand sides. From
(109), (110) one can easily derive the recurrence relations for n 2: 1:
2n
(lll) - 1n(x) = 1n-1 (x) + 1n+l (x)
x
and
(1l2)
It is not obvious that this integral defines the same function as the Frobenius series (97).
This can be verified by expanding the integral in a power series in x and comparing it
with (97), but the calculation is very long. A more efficient method can be based on
Fourier analysis.
Numerical Evaluation of ln (x). Bessel's functionln (x) is a complicated function of the
two variables x and n, and no single computational algorithm can be effective for all
values. For small x and moderate n, truncation of the alternating series (97) suffices. This
is explored below in the exercises. In Chapter 1 the integral representation was evaluated
accurately by numerical integration for moderate values of x. This also is effective for ln (x)
with moderate x and n values. For large x and moderate n the asymptotic approximation
and its refinements are accurate. The recursion relations are also useful for computing
ln (x) and 1~ (x); see [A-51 for a useful discussion. In recent years sophisticated software
for computing Bessel functions has been developed that makes these functions almost
as easy to compute as the trigonometric functions.
Legendre Functions. The power series method was used above to construct a
solution basis Y1, Y2 of the Legendre equation
These series solutions, defined by (43), (46) and (47), (50), respectively, converge on the
interval -1 < x < 1 only and have no limits when x ~ ± 1. The only exceptions occur
when f1. = nCn + 1), with n = 0,1,2, .... In this case one of the solutions reduces to
a polynomial. Here the Frobenius method will be used to show that for any value of f1.
there is a nontrivial solution of the Legendre equation that is finite at x = 1. However,
this function will have no limit when x ~ -1, except when IL = n(n + 1). Hence these
IL values are the only ones for which the Legendre equation has a solution that is finite
on the closed interval -1 :::: x :::: 1.
The Legendre equation can be written in the form
II a(x) I b(x)
(115) Y +x - I Y + (x _ I)2Y = 0,
where
2x
(116) a(x) = - - = 1
x+I
+ a1 (x - 1) + a2(x - 1)
2
+". '
b(x) = f1. (I-X)
I+x
-- = 0 + b1(x -1) + b2 (x - 1)2.
200 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Therefore,
Lk
00
and
L k(k -
00
It will be convenient to substitute these series into the Legendre equation in the form
(114). Note that
-2x = - 2(x - 1) - 2,
whence
L [k(k -
00
and
L [kCk + (k + l)ck+d (x -
00
L [(JL -
00
(125) lim
k.... oo
ICk+ll
Ck
=~.
2
Thus (17) has radius of convergence 2 and converges for - 2 < x-I < 2, or -1 <
x < 3. Note that any solution of (114) that is linearly independent of y+ will tend to 00
at x = 1. In fact, the reduction of order method produces all such solutions and (6),
(23) in Section 3.2 with aCt) = -2U(1 - t2 ) gives the solution
(126)
The technique used to prove (01) above can be used to show that
1
(128) [y+(x) [ ~ y log - - for - 1 < x::: -Xo.
x+l
This implies that the Legendre equation has a nontrivial solution that is finite at both
x = 1 and x = -1 if and only if /-L = n(n + 1) with integer n ~ O. This important fact
will be needed in Chapter 8.
Exercises 3.4
Power Series Method. Compute the coefficients co, q, ... , cy in the trial power series solution
L cn tn
00
x(t) =
n=O
for the following initial value problems. Find a general formula for Cn and sum the trial series (if possible).
A computer algebra system may be helpful.
202 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Standard Basis. Find series formulas for the standard basis Xl, X2.
Euler Differential Equation. Apply Euler's method to calculate a solution basis Xl, x2 of the given Euler
differential equation.
Frobenius Method. Use the Frobenius method to calculate a nontrivial solution Xl (t) of the given
differential equation corresponding to the larger root of the indicial equation. Find an independent solution
X2 (I) corresponding to the smaller root. Sum the series where possible.
It is assumed known that Jo(t) is the unique solution of Bessel's differential equation of order zero x" +
(l/t)x' + x = 0 satisfying limHo+ x(t) = x(O) = 1. The derivation defines
Then limt-+o+ x(t) = x(O) = 1. To prove relation (129), it suffices to show that x" + (l/t)x' + x = O.
37. Derive the formula
Then integrate by parts (u(B) = sin(t since», v(O) = - cos B) to show that
(132)
x'(t)
-- = - -
1111:
cos(t sin 0) cos 2 B dB.
t Jr 0
39. Add equations (l33), (l32) and (130) to prove x" + (l/t)x' + x = o.
Recurrence Relations for Jn (x). Verify the statements using the given suggestions.
40. Verify equation (l09). Show that
by writing the left hand side and the right hand side as series.
42. Verify equation (lll). Show that
2n
- In(x) = In-l (x)
x
+In+l (x)
by expanding the derivative terms in (l09) and (110) and equating the resulting formulas for J~ (x).
43. Verify equation (l12). Show that
by expanding the derivative terms in (l09) and (lID) and adding the resulting formulas for J~ (x).
Approximations ofJo(x). Define the nth partial sum of the power series for Jo(l) by the formula
Sn(t) = ~
n (_l)h
(kl)2
(t2' )2h
The limitJo(l) = limn-+oo Sn (t) is valid for every real t. Complete the follOwing exercises.
204 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
°
44. Plot on the same axes the graphs of]o(t), 54(t), 56(t) and 5s(t) for 0.0 =:: t =:: 8.0. Hint:]o(t) ~ 515(t)
to 5 digits on =:: t =:: 8 by the Taylor remainder formula.
45. Plot on the same axes the graphs of]o(t) and -!2/(rrt) coset - rr/4» for 0.1 =:: t =:: 10.0.
is known to have a nontrivial solution that is finite when x = ±1 if and only if fL = n(n + 1) where
n = 0, 1,2, .... That solutiony(x) = Pn(x) is a polynomial of exact degree n and is odd (resp. even) if n is
odd (resp. even). It is customary to normalize Pn(x) by the condition
With this convention the polynomials are uniquely defined. Power series methods can be used to show that
they are related by the Legendre polyomial recurrence relation
47. Apply power series methods to show that Po (x) = landPl(x) = x.Hint: Solve (134) with fL = n(n+l),
°
n = and y(l) = 1 by the Frobenius method. This proves Po(x) = y(x) = 1. Repeat with n = 1 to
=
show PI (x) x.
48. Use the Legendre polynomial recurrence relation (135) to calculate P2 (x) and P3(X). A computer algebra
system is useful to check the answers.
49. Deduce from the Legendre polynomial recurrence relation (135) and mathematical induction that
PnO) = 1 for all n = 0,1,2, ... .
50. Produce graphs of Po (x), PI (x), ... , P3(X) on -1 =:: x=:: 1.
wheie the functions aCt), bet) and jet) are sectionally continuous on the interval to <
t ::; to + T.
The Initial Value Problem. The fundamental theorem of Section 3.2 guarantees
that (1) has exactly one solution with prescribed initial values x(to) = co, x' (00 = Cl. In
particular, this theorem implies the following theorem.
Unique Continuation Theorem. Let to < tl < to + T and let Xl (t) be a solution of (1)
for to ::; t ::; tl. Then Xl (t) has a unique continuation to thelarger interval to ::; t ::; to + T.
3.5 NUMERICAL METHODS FOR SECOND-ORDER EQUATIONS 205
For the proof one notes that the sectionally defined function (3) has all the properties
of a solution of the initial value problem on the interval to .:::: t .:::: to + T; see Section 3.2.
Hence it is the solution, by the uniqueness part of the fundamental theorem.
The unique continuation theorem is useful in allowing one to build solutions of (1),
section by section, when aCt), bet) and jet) are discontinuous. This is illustrated in a
vibration problem below.
Numerical Methods for the Initial Value Problem. The fundamental theorem
guarantees the unique solvability of the initial value problem for (1) for all sectionally
continuous coefficients. However, the solutions can be represented by elementary
functions and their integrals only in very special cases: constant coefficients (a(t) = ao,
bet) = bo), Euler equations (aCt) = aolt, bet) = bolt 2 ), and coefficients that are sectionally
of these types. For more general coefficients it is natural to adopt the point of view of
Section 2.7: The differential equation (1) and the initial conditionsx(to) = Co, x'(to) = c)
define a unique function x(t). The analysis then shifts to a search for effective numerical
algorithms to compute the values of x(t). This search leads quickly to an important
discovery, namely, that the numerical methods for first-order equations of Section 2.7
can also be applied to second-order equations. To do this one must write the latter as a
pair of first-order differential equations.
First-Order Systems. The decisive step is to regard the derivative yet) = x'(t) as a
second unknown function and consider the first-order system
Before describing numerical methods for solving (4), it will be helpful to write it in
a more symmetrical form. First, note that the general second-order equation
(6) x'=y,
y' = F(t,x,y).
The symmetrical form of this is the general first -order system of two differential equations
where Fl and F2 are any two functions of the three variables t,x and y. In particular,
system (4) has the form (7) with
(8) Fl(t,X,y) = y,
Finite Difference Methods. The methods of Section 2.7 can be extended to system
(7) almost without change. Approximate solutions will be constructed to the initial
value problem for (7) on a prescribed interval [to, to + TJ with prescribed initial values
x(to) = Co, y(to) = Cl. Of course, as remarked in Section 2.7, the existence of the
solution on this interval must be known if meaningless computations are to be avoided.
Uniform Mesh. For Simplicity, only uniform meshes will be discussed. Thus
(9)
tk = to + kh, Nh = T.
The approximate values of x(t) and yet) at the mesh points will be denoted by Xk, Yk, so
that
(10)
Euler's Method. Equation (7) is similar to the Single first -order equation x' = F(t, x)
of Chapter 1. It differs only in having two unknown functions and two equations. When
written in vector notation, (7) is identical in form to the single equation. This suggests
that we may extend the methods of Section 1.7 directly to the systems (7). The simplest
numerical method is the following:
3.5 NUMERICAL METHODS FOR SECOND-ORDER EQUATIONS 207
This has the same recursive structure as the single-equation case. Thus for h = 0
one starts with the given initial data Xo = x(to), Yo = y(to) and one computes Xl, YI.
Then, with these data one computes x2, Y2, etc. For h = N - lone gets the final values
XN ~ x(to + T), YN ~ y(to + T).
Error Estimates. The analysis of Section 2.7 extends to the system (7). Thus
(12)
for a suitable constant C, provided that FI(t,x,y), F2(t,X,y) and their first partial
derivatives are continuous. For the linear equation (4) this is valid if a(t), b(t), and J(t)
have continuous first derivatives.
EXAMPLE 1. Euler's method for systems will be illustrated by application to the problem
(15)
Then compute
(17)
Then compute
(18)
Then compute
Finally, compute
1
(20) Xk+l = Xk + -6 (Xl + 2X2 + 2X3 + X 4) ,
1
Yk+l = Yk + '6 (Yl + 2Y2 + 2Y3 + Y4 ) .
EXAMPLE 2. Application of The Runge-Kutta4 method to problem (13) gives the results
in Table 4.
Note the rapid fourth-order convergence.
Other Methods. The Taylor2, Taylor4, and Heun algorithms of Section 2.7 have
versions for second -order equations. The key to such generalizations is the vector notation
of Chapter 5, which shows that each method of Section 2.7 is a special case of a method
that is valid for equations of any order.
3.5 NUMERICAL METHODS FOR SECOND-ORDER EQUATIONS 209
Exercises 3.5
Unique Continuation and Exact Solutions. Below, solve the given initial value problem to obtain the
sectionally defined solution x(t) on the given interval.
1. x" = jet), x(O) = x/CO) = O,j(t) = ton [0, 1],j(t) = 1 - t2 on [1, 3J.
2. x" = jet), x(O) = 1, x/CO) = O,j(t) = ton [0, 1],j(t) = 1 - t 2 on [1, 3J.
3. x" = j(t),x(O) =x/(O) = O,j(t) = sint on [O,rrJ,j(t) = 1- sint on [rr,2rrJ
4. x" =j(t),x(O) = O,x/(O) = 1,j(t) = sint on [O,rr],j(t) = I-sinton [rr,2rrJ.
Numerical Calculations and Unique Continuation. A controller x(t) for a process is given as two initial
value problems: x" = 1, x(O) = x/CO) = 0 on [0, IJ and x" = j(t) on [1, 2J. Findx(2) numerically, assuming
that j(t) is a table of values given as below.
5. j(1) = 1,j(l.2) = 1.5,j(1.4) = 1.3.1(1.6) = 1.0,j(1.8) = 1.2,j(2) = 1.4.
6. f(1) = 0.9,f(1.2) = 0.5,f(1.4) = 0.7.10.6) = 0.85,j(1.8) = 0.95,j(2) = 1.1.
7. J(t) is a sampling of sin(rrU2) on [1, 2J at steps of 0.2.
27. (lC Circuit) Consider the LC circuit of Figure 17 with parameters L = 1, C = 1 and a battery
that supplies Eo = 10 volts. At times t < 0 the switch is open and there is no charge or current:
q(t) = i(t) = 0 for t < O. At time t = 0 the switch is dosed to position B and the capacitor begins to
charge. At time t = 10 seconds the switch is set to position A, shorting out the battery. Calculate the
charge q(t) for all t ::: 0 using the unique continuation theorem.
28. (Paratrooper) A paratrooper falls from rest at 12000 feet for to seconds, then the parachute opens.
Estimate the time of the journey to the ground using the model below.
W
-x
II {WW 0:::. t :::. to,
g (t) = - ex '()
t t> to,
with the follOWing physical constants:
Combined weight W = 200 pounds (paratrooper and parachute),
Gravitational constant g = 32 feet per second,
Parachute open at to = 3 seconds,
Parachute drag force constant e = 1.
29. (Unsolvable Numerical Problem) Verify that the initial value problem x" = lx' 12/3 , x(O) = x' (0) =0
has infinitely many solutions. Hence no numerical method can make sense for this problem.
A
open
B Eo
L
~ C
32. (Conversion to Systems) The differential equation ulll (t) + 4u" (t) - Su' (t) = sin(t) can be converted
to a system of first-order differential equations by the substitutions x = u(t), y = u'(t), Z = u"(t).
Show that the resulting system is
x' =y,
y' = Z,
z' = Sy - 4z + sin(t).
33. (Third-Order System) The equation u'" (t) + 4u" (t) - Su' (t) = sin(t) can be converted to a system
of first-order differential equations and solved numerically by a generalization of the Runge-Kutta4
method for systems of dimension 2. Develop the general equations. Apply them to this problem with
initial conditions u(O) = 0, u' (0) = 0, u" (0) = 1 and step size h = 0.1 to compute the values of u, u',
u" at t = 0.1. Compare with the exact solution
3 1. 2 1 t
u(t) = - cos(t) - - sm(t) - - + -e + -9e-5t
26 13 5 4 260
and its derivatives.
(1)
where the coefficients al(t), ... , an(t) and the right-hand termj(t) are prescribed func-
tions. The equation is said to be homogeneous ifj(t) == 0, otherwise inhomogeneous.
It is said to have constant coefficients if aI, ... , an are all constants. Examples from
physics with n = 3 and n = 4 were given atthe end of Section 3.1, and more complicated
electrical and mechanical oscillators lead to equations of all orders.
The solution concept and fundamental theorem of Section 3.2 extend completely to
arbitrary n > 2. They may be formulated as follows.
Definition. A function x(t) (defined on an interval to :::: t :::: tl) is said to be a solution
of (l) if (i) x(t) and the derivatives dkxldt k with k :::: n - 1 are continuous, (ii) dnx(t)ldt n
is a sectionally continuous function with discontinuities only at discontinuities of
al (t), ... , an (t) and jet) (if any), and (iii) equation (1) holds at each point of continuity
of al (t), ... , an (t) and j(t).
With this definition, one has the following result.
212 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Fundamental Theorem. For each set of n real numbers Co, Cl, ... , cn-l, equation (1)
has exactly one solution x(t) such that
This theorem makes precise the sense in which the solutions of (1) form an
n-parameter family Proofs of the theorem may be found in advanced texts such as [B-RJ.
The Homogeneous Equation. If each of a set of functions Xl (t), X2 (t), ... , Xk (t) is a
solution of the homogeneous equation
(3)
(4)
is easily seen to be a solution of (3). The function x(t) defined by (4) is said to be
a linear combination of the functions Xl(t), X2(t), ... , Xk(t). The constants AI, A 2,
... , Ak are said to be the coefficients. A linear combination (4) is said to be trivial if
Al = A2 = ... = Ak = 0. Otherwise, it is said to be nontrivial.
In Section 3.2, functions Xl (t) and X2 (t) were said to be linearly dependent if one of
the functions was a constant multiple of the other. The extension of this important idea
to sets of k > 2 functions is contained in the following.
Definition. A set of functions Xl(t), X2(t), ... , Xk(t) (defined on a common interval
to :::s t :::s tl) is said to be linearly dependent if and only if there is a nontrivial linear
combination of them that equals zero:
(5)
If the set Xl (t), X2 (t), ... , Xk (t) is not linearly dependent then it is said to be linearly
independent.
If Xl, X2, ... ,Xk is a linearly dependent set, so that (5) holds with not all Ai = 0, then
some of the functions in the set are equal to linear combinations of the others. Thus, if
°
Al =j:. in (5) then Xl can be written as a linear combination of X2, X3, ... ,Xk, etc. Note
that when k = 2 the new definition of linear dependence coincides with the old. Also,
if Xl, X2, ... , Xk are linearly independent and (5) holds then it must be the case that
Al = A2 = ... Ak = 0. The basic facts concerning linearly independent solutions are
described by the following theorem.
Theorem. There are sets of n linearly independent solutions of (3). Every set of n + 1
solutions is linearly dependent.
3.6 LINEAR EQUATIONS OF ORDER n> 2 213
These statements follow from the fundamental theorem. To prove the first statement,
let Xl, X2, ... ,Xn be solutions of (3) with the initial values
(
6) Xl ( = 1,
to ) XlI (to) = 0, = 0, ... ,Xl(n -1) (to) = 0 ,
Xl/I (to)
to = 0 ,
X2 () x2I (to) = 1, 2 to = , ... ,x2
x"() 0 (n-l) (to) = 0 ,
Taking t = to in (7) and using (6) gives Al = O. Next, differentiating (7) once and setting
t = to gives A2 = 0, etc. Hence, the coefficients AI, A 2 , ... , An must all be zero, and
therefore Xl, X2, ... , Xn are linearly independent.
At this point we have not developed enough linear algebra to verify the second part
of the theorem. This will be completed in Chapter 5. Note, however, that if x(t) is any
solution of 0), characterized by its initial values (2), and if Xl, X2, ... , Xn is the special
set of solutions defined by (6), then
(8)
This follows immediately from (2) and (6). Thus the set XI, X2, ... , Xn is linearly
dependent.
Solution Bases. Equation (6) defines one set of n linearly independent solutions of 0),
Section 3.3. There are (infinitely) many other such sets, each generated by initial values.
Each such set of n solutions will be called a solution basis for 0). The reason for this
terminology is the following theorem.
Basis Theorem. If Xl (t), X2 (t), ... ,Xn (t) is any solution basis for 0), then every solution
of that equation can be written in one and only one way as
(9)
Note that the existence and uniqueness statements for (9) follow directly from the
preceding theorem. Indeed, if X, Xl, X2, ... , Xn are linearly dependent, then there must
be constants B, BI , B2 , ... , Bn , not all zero, such that
(10)
214 3 ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
Moreover, B #- 0 because otherwise, Xl, X2, ... , Xn would be linearly dependent. Thus
we may divide by B in (10) and write Ak = -BkIB to get (9). The uniqueness of the
representation (9) follows similarly.
Now consider the nonhomogeneous equation (1). In complete analogy with the case
n = 2 one has the following.
Structure Theorem for Linear nth-Order Equations. let xp(t) be any particular
solution of (1) and let Xl (t), X2(t), ... ,Xn (t) be any solution basis for (3). Then for each
solution x(t) of (1) there exist unique constants AI, A2 , ... , An such that
(ll)
This result is immediate from the basis theorem, since the difference x(t) - xp(t) is
a solution of the homogeneous equation (3).
Solution of the Nonhomogeneous Equation. The method of variation of parameters
has a straightforward extension to nth-order nonhomogeneous equations.
Equations with Constant Coefficients. Consider the homogeneous equation
(12)
with constant coefficients al, a2, ... , an. Eulers method of exponential solutions
(14)
(17)
one has
(19)
(20)
Rules for Calculating a Solution Basis. The preceding examples illustrate the fact
that if the roots of the characteristic equation (14) can be found, then a solution basis
of (12) can be written down. Two cases arise, corresponding to real and complex roots,
respectively.
Case 1. Real Roots. Let A = Al be a real root of (14), of multiplicity m ::: 1, so that
(A - Al)m is a factor of Pn(A). Then (12) has the m linearly independent solutions
(21)
The final rule states that the set of n solutions obtained in this way is a solution basis
of (12). The algebraic proofs of these rules are rather long. They will not be written down
here. However, they are readily verified by the Laplace transform method of Chapter 4
and the theory of partial fractions.
IExercises 3.61
;Lk<i$~~"'1!(hb [,u"iG!k ".,C41\,,,,,=,
Solution Basis. Find a solution basis for the given differential equation. First try to factor the characteristic
polynomial into linear and quadratic factors. If this is possible then an exact basis should be given. If not
then compute and use approximate roots.
1. x''' + 6x" + 11x' + 6x = O.
2. x'1II + 2Xlll + 2x" + 2x' + x = O.
3. XIII + 3x" + 4x' + 2x = O.
4. 9x"" - 24xlll - 29x" + 120x' - 80x = O.
5. 2Xlll + 3x" - 12x' + 5x = O.
6. 4Xlll - 3x" - 18x' + 8x = O.
7. 100xlll - 334x" + 668x' - 536x = O.
S. 100xlll - 334x" + 768x' - 670x = O.
9. 9x"" - 42x1ll + 22x" + 126x' - 147x = O.
10. 2500xlll + 9600x" - 14325x' - 5500Sx = O.
Initial Value Problems. Calculate the solution of the initial value problem with the prescribed initial values.
11. 2Xlll + 3x" - 12x' + 5x = 0, xCO) = 1, x'CO) = 0, x"CO) = O.
12. 4x''' - 3x" - 18x' + 8x = 0, xCO) = 0, x'CO) = 1, x"CO) = o.
13. XIII + 6x" + llx' + 6x = 0, xCO) = 1, x'CO) = 1, x"CO) = 1.
14. x"" + 2x1ll + 2x" + 2x' + x = 0, xCO) = 1, x'CO) = 0, x"CO) = 0, XIII CO) = 1.
Cauchy's Solution. Let the homogeneous linear differential equation (3) have constant coefficients.
Cauchy's solution is defined by the initial conditions
25. = sin t.
XIII 30. x"" - x = 1.
26. x"" = cost. 31. x"" -x = et.
27. 100x lll = t2 32. x"" - x = e-t.
General Solution. Find the general solution xCI) = xh (I) + xpCI) for the following equations.
33. XIII = I. 36. x"" = t3 .
This chapter presents an efficient and powerful method for solving the differential
equations that govern electrical networks and many other problems of engineering and
applied science. The method, which originated in work of O. Heaviside 1 dating from
the 1890s, consists in transforming the differential equations, according to certain rules,
into corresponding algebraic equations. The latter are then solved algebraically and the
solutions to the original differential equations are obtained from them by Heaviside's rules.
Heaviside did not base his transform method on the Laplace transform, although
he was aware of the connection. However, most subsequent workers have founded their
treatments of Heaviside's method squarely on the Laplace transform. It is this approach
that is followed here.
t-Domain s-Domain
Original Problem Transformed Equation
=*
y" + 5y' + 6y = 0, s2y + 5sY + 6Y = 1
y(O) = 0, y'(0) =1
.IJ.
Solution of the Solution of the
I Oliver Heaviside 0850-1925). British scientist famous for his many contributions to electromagnetic theory, based
on Maxwells equations, and for their applications to electrical engineering and technology
220 4 THE LAPLACE TRANSFORM
(1)
If F(s) is related to j(t) by this equation then F(s) is said to be the Laplace transform of
jet). When it is necessary to express the dependence of F(s) onj(t), the notation
The existence of this limit depends on the function ¢J(t). The improper integral is said to
exist if the limit exists. As an example, consider the function ¢J(t) = e;<H. Direct integration
gives
(4) io
T eat
eatdt=-
a
IT
0 a a
(a:j:. 0).
(5) io
oo 1
eat dt =--
a
4.1 THE NATURE OFTHE LAPLACE TRANSFORM 221
if and only if a < O. A simple sufficient condition for the Laplace transform of J(t) to
exist is
(6) 1 00
e-sotlf(t)I dt < 00 for some so.
Theorem. If condition (6) holds then the Laplace transform F(s) = .c{f(t)} exists for
all s ~ so.
A useful criterion for the existence of F(s) is given by the following corollary
where M and a are constants, then F(s) exists for all s > a.
The proof is based on showing that (7) implies that (6) holds whenever So > a.
Linearity of .c. IfJl (t) and h (t) have Laplace transforms and if Cl and C2 are constants,
then elementary properties of the integral imply that
(8)
Equivalently,
(9)
implies
(10)
for all numbers s for which Fl (s) and F2 (s) exist. Operators.c that satisfy (8) are said to
be linear operators. This simple property of the Laplace transform plays a fundamental
role in applications.
The Heaviside Function. The step function defined by
= (~
for -00 < t < 0,
(ll) H(t)
for 0:::: t < 00,
g
is known as the Heaviside function. A related function is
where a is a constant. If a ::: 0 then H(t - a) = 0 for t :s a, and its Laplace transform
can be calculated from the equations
(13) iT
o
e- 5t H(t - a) dt =
iT e-
a
5t dt = -=-
e- IT
5t
5 a 5 5
This result plays an important role in many applications of the Laplace transform.
Uniqueness of the Transform. For applications it is often necessary to recover the
original function J(t) when its transform F(5) is known. Thus the question arises whether
eachF(5) comes from a uniqueJ(t), orwhethertwo or moreJ(t),s can have the same F(5).
If discontinuous J(t)'s, such as the Heaviside function, are allowed, then J(t) is certainly
not unique. Indeed, in the definition (1) J(t) can be changed at a finite number of points
without affecting the values of F(5). Nevertheless, the original function is "nearly unique"
in the sense of the following theorem.
Uniqueness Theorem. Let Jl (t) and !I (t) have Laplace transforms .c{fd = Fl (5) and
.c{f2} = F2 (5). Then
(15)
if and only if
(17)
The rather complicated proof of this theorem may be found in advanced texts on
Laplace transforms, e.g., [WiJ.
The remainder of Chapter 4 is organized as follows: In Section 4.2 it is shown that
the Laplace transform (14) of the Heaviside function and the linearity and continuity
properties of the operator .c allow one to calculate the Laplace transforms of many
elementary functions without carrying out the t-integration of the definition (1). Section
4.3 presents the operational rules for the Laplace transform that are needed for the
applications to differential equations. These rules also permit one to calculate the direct
and inverse transforms of large classes of functions. Section 4.4 develops applications
4.1 THE NATURE OF THE LAPLACE TRANSFORM 223
of the Laplace transform to linear ordinary differential equations of any order. Finally,
Section 4.5 develops applications of the transform to linear systems of differential
equations of the kind that arise in electric circuit theory and mechanics.
In Section 4.2 many transforms will be calculated by applying three simple properties
of the operator C. The indicated integration in equation 0) will be actually performed
for only a limited number of examples.
Exercises 4.1
Heaviside's Function. Graph the given combination of Heaviside functions on [0, (0).
3. f(t) = HU - 1) - H(t - 2)
Calculation of .e-transforms by direct integration. Verify the given .e-transform equation by evaluating
the integral fle-stf(t)dt, using an integral table or a computer algebra system, and then letting T -+ 00.
Assume throughout that a :::: 0.
Calculation off(t) from F(s) = .e{j(t)}. Apply the uniqueness result (17) to £indf(t).
9 . .e{j(t)} = .e{e- 7t }
-2s
13 . .e{j(t)} = ~-l
e-l. IS
14 . .e{jU)} = ~.
224 4 THE LAPLACE TRANSFORM
The last property means that if jn(t) -+ jet) as n -+ 00 then £{fn(t)} -+ £{f(t)} as
Next, the linearity and continuity of £ imply that the integration on the left in (4) can
be brought inside the operator £. Thus
(5)
(8) 1 a
00
(t - b)nH(t - b) db =
ret-a)
10 rnH(r) dr
4.2 THE LAPLACE TRANSFORMS OF SOME ELEMENTARY FUNCTIONS 225
(t - a)n+l
= H(t-a).
n+l
Thus (7) and (8) with n = 1 give
2e- as
(9) C{(t - a)2H(t - a)} = - 3 - for all 5> 0 and a 2: o.
5
(12)
which is valid for all t. If the operator C is applied to the series on the right, then C and
the summation may be interchanged, by the linearity and continuity of C. If (11) is then
used, one gets
Cleat} = 00
~ _
an = _1 ~
00 (a)n
_ .
(13)
~ sn+l 5 ~ 5
n=O n=O
(14) Lxn - --
n=O
00 1
1- x
for Ixl < l.
This shows that for la/51 < 1 the right-hand side of (13) equals 11(5 - a). Thus
1
(15) Cleat} = - - for all 5 > la!-
s-a
This equation may now be differentiated with respect to a to get further C-transforms.
The differentiation may be brought inside the C operator by the linearity and continuity
properties. Thus (15) implies
I' at 1
(16) L{te } = (
s-a)2
226 4 THE LAPLACE TRANSFORM
and
(17)
(18) .t.-{t e } =
I' n at
(5 - n!a)n+ l' n = 0, 1,2, ....
+ i£{sinwt} = + i -2--2'
5 W
(2l) £{coswt} -2--2
5 +w 5 +w
Finally, equating real and imaginary parts gives the .c-transforms of cos wt and sin wt:
5
(22) £{coswt} = -2--2'
5 +w
w
(23) .c{sin wt} = -2--2'
5 +w
Products of Powers of t and Trigonometric Functions. Equation (18) with
a = iw (w real) gives
. t n!
(24) £{tneIW } = - - - -
(5 - iW)n+l
(25)
£{teiwt } = (5 + iw)2 (52 - ( 2) + i(2W5)
(52 + ( 2 )2 (52 + ( 2 )2
whence on equating real and imaginary parts one gets
52 _ w2
(26) £{tcoswt} = ( 2
5 +w2)2'
2W5
(27) £{t sin wt} = (52 + ( 2 )2
4.2 THE LAPLACE TRANSFORMS OF SOME ELEMENTARY FUNCTIONS 227
Similar, but more complicated, formulas for CW cos wt} and C{ tn sin wt} follow from
(24).
Products of Powers of t, Real Exponentials, and Trigonometric Functions.
Equation (18) with a = a + iw gives
(28) C {t nat
e e
iwt}
=
n!(s-a+iw)n+l
, n = 0, 1,2, ....
[(s - a)2 + w2]"+!
(32) F( ) = N(s)
s D(s) ,
where N(s) and D(s) are polynomials in s. It has been shown above that functions J(t)
of the form tne"t cos wt and tneat sin wt have C-transforms that are rational functions of
s with degree N < degree D. More generally, the linearity of C implies that any linear
combination of such functions has the same property. It is interesting that the converse
of this statement is true. This may be stated as follows.
Theorem. Let F(s) be any rational function with degree N < degree D. Then F(s) =
C{j(t)}, where J(t) is a finite linear combination of functions of the form tne"t cos wt and
tme"t sin wt.
The parameters n, m, a, and w in the theorem may have different values in the terms
of the linear combination.
The proof of this theorem depends on the theory of partial fractions. These are
special rational functions of the form
A
(33) Rl (s) = (
s-a)m'
228 4 THE LAPLACE TRANSFORM
where a, ex, fJ, A, B, C are real, fJ > 0, and m is a positive integer. The partial fraction
decomposition theorem states that every rational function F(s) = N(s)/D(s) with real
coefficients and degree N < degree D equals a finite sum
= L Fj(s),
k
(34) F(s)
j=l
(35) ()
Rl S
A ) = .c{ ( A ) t m-lat}
= (s-a e .
m m-1!
A similar, but more complicated, result holds for R2(S). The case m = 1 is covered by
equations (30) and (31). For m = 2 it can be shown, using (26) and (27), that
(36) [(
(s - ex)
s - ex
)2
at. }
2]2 =.c -te smfJt ,
+ fJ
{I
2fJ
[( 1
)2 = .c { -13 eat(.
smfJt - fJtcosfJt) } .
s - ex + fJ 2]2 2fJ
where a and b are constants and J(t) is a linear combination of terms of the form
tne"t cos wt and tme"t sin wt. These are precisely the functions J(O whose .c-transforms
are rational functions F(s) = N(s)lD(s) with degree N < degreeD. In fact, the partial
fraction decomposition can be used to show that the method of undetermined coefficients
always works. See, for example, [B-RJ.
4.2 THE LAPLACE TRANSFORMS OF SOME ELEMENTARY FUNCTIONS 229
Calculation of 'c-transforms by direct integration. Verify the given 'c-transform by using equation 0)
in Section 4.1. Justify improper integrals by integration over 0 :::: t :::: T and calculation ofthe limit T ~ 00.
Assume that a 2: 0, b 2: O.
1. 'c{sin(at») = aJ(5 2 + a2) for 5> O.
2. 'c{cos(at») = 5/(5 2 + a2 ) for 5> O.
3. 'c{tsin(at») = 2a5/(5 2 + a2)2 for 5> O.
4. ,c{ t cos (at) ) = (52 - a2)/(5 2 + a2 )2 for 5> 0
s- 1 I s- 1 I -B
l(s-2)I(s-3) 5-2=0 =A, (s-2)I(s-3)1 5-3=0 - .
Thefactors (s - 2) and (s - 3) are to be covered by your thumb during evaluation. This engineering technique
is call the coverup method or Heaviside's coverup method.
9. Justify the answer for A in the above example by multiplying the original problem by (s - 2) and
substituting s - 2 = O.
10. Find A and B by the coverup method:
s+1 A B
-,----,-:--..,- = - - + --.
(s+2)(s-3) s+2 s-3
11. Find A, B, and C by the coverup method:
s-1 ABC
..,....,..-.,---- = - + - - + --.
s(s-2)(s-3) s s-2 5-3
12. Argue from uniqueness (17) that
A B
'c{f(t)) = - +-
s+1 s-3
Lx
00
n- --
1
for Ixl < l.
n=O I-x
is valid for all t. Justify, based upon the three properties, the following steps by providing missing details.
by (11)
where x = a/5
by (14)
51 -x
1
for a real
5-a
26. Show that (15) is valid for complex a, la/51 < 1, by assembling the results of the above two problems
and verifying that everything works for a complex.
Reduction Formulas. Apply the theory of integration by parts f udv = uv - f vdu from calculus to
establish the given reduction formula.
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 231
27. To the integral fooo e-st(t - a)nH(t - a)dt, for a> 0, apply integration by parts to show that
28. To the integral fooo e-steattndt, for a > 0, apply integration by parts to show that
n!e- as
'c{(t-a)nH(t-a))= sn+l' n=0,1,2, ... , a>O,
nl
'c{eattn) = . , n = 0, 1,2, ... , s > a.
(s - a)n+l
(1)
If the C-transform is applied to both sides of (1), then the linearity and continuity
properties of C imply that on the right C may be brought inside the summation. It then
follows from (11) in Section 4.2 that
00 j(n)(o)
(2) C{{(t)} = F(s) = L~'
n=O
provided that the last series converges for s large enough. Equation (2) with jet) = eat
was used in Section 4.2 to calculate C{eat } = lI(s - a). It is used below to find a number
of operational rules for the C-transform.
232 4 THE LAPLACE TRANSFORM
The Derivative Rule. If (1) holds thenf'(t) is also represented by its Taylor series
for all t:
This derivative rule is fundamental for the applications of the £-transform to differential
equations. It makes it possible to transform linear differential equations for J(t) into
algebraic equations for F(s).
Of course, (5) was derived above only for those special functions J(t) with a Taylor
series representation (1). However, it holds for much more general functions. The
following theorem may be proved by using the Laplace integral (1) of Section 4.1.
Then f' (t) has an £-transform, and the derivative rule (5) holds for all s > a.
Corollary. Let (6)-(8) hold forJ(t) andf'(t). ThenJ"(t) has an £-transform, and
For the proof note that (5) is applicable with J replaced by f'. Thus
Combining this with (5) gives (9). A similar argument gives the follOwing corollary
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 233
Corollary. Let (6)-(8) hold for Jet), f'(t), ... , and j(n-OCt). Then j(nlCt) has an £-
transform and
EXAMPLE 1. Takej(t) = eat, sof'(t) = aeat andj(O) = 1. Then the derivative rule (5)
gives
(12)
This is equivalent to the result £{eat } = l/(s - a) that was obtained earlier by the power
series method.
This is equivalent to the result £{sin wt} = wl(s2 + d) that was obtained earlier.
EXAMPLE 3. Consider the simple LR circuit problem of Chapter 2 with constant applied
voltage. Then
di
(14) L dt + Ri = E for t::: 0, i(O) = io,
(18)
234 4 THE LAPLACE TRANSFORM
To deal with the second term, note the simple partial fraction expansion
(19)
s(s
1
+ so) = ~
1(1~ - +1)s So .
(20)
Thus the uniqueness theorem for the £-transform gives the solution
(2l) .
l(t)=
(.10-R:E) e -RUL
+R:'
E
It is easy to verify that this is indeed the unique solution to (14) Note that i(t) is the sum
of the steady-state current EIR and a transient current ce- RUL , as explained in Chapter 2.
EXAMPLE 4. Consider the LC circuit of Chapter 3 with a constant applied voltage. Then
if q(t) and i(t) = q'(t) are the charge and current respectively, one has
1
(22) Lq" + - q = E for t 2: O.
C
If the circuit is initially inactive, so q(O) = 0 and i(O) = q' (0) = 0, then the £-transform
of (23), Section 4.2, gives
(23)
(25)
whence
Notice the simple physical interpretation of the solution. At time t = 0 there is no current
(i(O) = 0) or charge on the capacitor (q(O) = 0). Then a constant electromotive force
E is switched into the circuit. Immediately, q(t) begins to increase, reaching a maximum
i max = 2EC when kt = :n:. Thereafter, it oscillates between q = 0 and q = 2Ee. The
mean value of q is EC, which is the steady state charge on C. There is no damping because
circuit resistance has been ignored (R = 0).
The Integral Rule. The derivative rule has a counterpart for integrals. To find it, let
f(t) be continuous for 0 :s t < 00 and satisfy
then dearly,
it it
Moreover,
since g(O) = o. By (29) and (30) this gives the integral rule:
(33) .e {l tf(r)dr} = ~C{f(t)}.
That is, the C-transform of the integral off is F(s)/s.
ExAMPLE 5. The Sine Integral. The sine integral function Si(t) is defined by
(34) Si(t) = i t
-dr.
a r
sin r
236 4 THE LAPLACE TRANSFORM
It is known that this integral cannot be expressed in terms of elementary functions. The
integrand has the Taylor series
sin t _1) nt 2n
L (2n+ 1)!'
00 (
(35)
-t- =
n=O
obtained from the Taylor series for sin t by division by t. Application of the power series
method to (35) gives
L----
00 x (_1)n 2n+l
(37) tan-Ix =
2n + 1
n=O
that
First Shifting Rule. Let J(t) have a Laplace transform F(s) for s > So and let a be
any real number. Then e"tJ(t) has a Laplace transform for s > So + a that is given by
(40)
Second Shifting Rule. LetJ(t) have a Laplace transform F(s) for s > So and let a be
a nonnegative constant. Then H(t - a)J(t - a) has a Laplace transform for s > So that is
given by
Second Shifting Rule (Alternative Form). Let get) have a Laplace transform and
let a :::: 0 be constant. Then get + a) has a Laplace transform, and
(43)
atfC )
e t = ~j(n)(o)
~--e t,
at n
n=O n!
(44)
Verification of the Second Shifting Rule. Again, the rule can be proved from either
the integral formula (1) of Section 4.1 or the power series method. For the latter, one has
--H(t -
n=O n!
a)(t - at.
Moreover, (10) in Section 4.2 states that £{H(t - a)(t - a)n} = n;:;;'. Hence the linearity
and continuity properties of .[ give
00 j(n)(o)
Note that the vanishing of j(t) and get) for t < 0 implies that k(t) = 0 for t < O.
Moreover, for fixed t :::: 0 the integrandj(r)g(t - r) of (49) vanishes for r outside the
interval 0 :::: r :::: t, and hence one may also write
It will now be shown that the convolution is very useful, both in calculating the inverse
Laplace transforms of complicated functions and in the Laplace transform method of
solving differential equations. These applications are based on the following.
The Convolution Rule. Let Jet) and get) be two functions of t :::: 0 whose Laplace
transforms F(s) = £If} and G(s) = £{g} exist for s > So. Then the Laplace transform
of j * g exists for s > So and
(53) K(s) = £ {l OO
j(r)g(t - r) dr} = 1 00
j(r)£{g(t - r)} dr,
by the continuity property (3) in Section 4.2. Next, since get) = 0 for t < 0, one has, by
the second shifting rule,
(55) K(s) = (1 00
e-STj(r) dr) G(s).
Now, the symmetry property (50) implies that j and g may be interchanged in (55)
without changing K. Hence
(56) K(s) = (1 00
e-STj(r) dr) G(s) = F(s) (1 00
e-srg(r) dr) .
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 239
G(s) = -1 =
s
1 0
00
e-stH(r) dr,
one gets
(57) F(s) = 10 00
e-STj(r) dr.
The three properties (1), (2), (3) of Section 4.2 have led full circle to the Laplace integral
(l) in Section 4.1. Finally, (56) and (57) imply that K(s) = F(s)G(s), as required.
Differentiation and Integration of Transforms. The operations of differentiating
and integrating the transform F(s) = £If} correspond to simple operations on jet). To
find them assume, at first, that jet) has the Taylor series representation (1) and transform
(2). Then formal differentiation of F(s) gives
dF(s) pn)(O)(n + 1)
I:
00
(58) ds = - sn+2 .
n=O
Recalling that 1/sn+l = £Wln!}, one finds by the continuity assumption and (1),
!I : - -I
(59)
n=O n+1.
00 pnl(o)tn+l
-£
n=O n!
= £{ (-t)j(t)}.
A careful analYSiS, based on the Laplace integral (1) in Section 4.1, gives the follOwing
extended version of (59).
Derivative Rule for Transforms. Let jet) be piecewise continuous and exponen-
tially bounded (condition (7) in Section 4.1). Then
dF(s)
(60) ds = £{ (-t)j(t)} for all s > a.
(62)
240 4 THE LAPLACE TRANSFORM
(63) fs
oo dal
a n+
= _1_
nsn
for n = 1,2, ....
Hence, if J(O) = 0 then term-by-term integration of (2) gives
(64) f s
oo
F(a)da= L--
00
nsn
n=l
jCn)(o)
= jCn)(o).c{ tn-I}
L
00
IL'----
n=l
n (n-l)!
00 jCn)(0)t n- 1 )
=.c
n=] n!
=.c V~)}.
A careful analysis based on the Laplace integral (1) of Section 4.1 yields the following
extension of (64).
Integral Rule for Transforms. Let J(t) be piecewise continuous and exponentially
bounded (condition (7) in Section 4.1). Moreover, assume that
.
11m J(O
(65) -
[-+0+ t
exists. Then
(66) 1 00
F(a) dr = .c {J~) } for all s > a.
I
EXAMPLE 6. Consider the sectionally defined function
t for 0 S t S 1,
(67) J(t) = 1 for 1 S t S n12,
sin t for t ~ nl2.
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 241
(68) F(s) = 10
1
e-stt dt + f l!:
2 e- st dt + i2
00
e- st sin t dt
and the work can be completed by calculating three integrals. However, F(s) can be
calculated without integration by use of the operational rules. To see how to do this,
recall that if a < b then
This property makes it possible to write a single formula for J(t) , namely
Each of the three terms in this sum can be calculated by the second shifting rule. Thus
£{ tH(t)} = £{ t} = 1/s 2 For the next term let get) = 1 - t in the second shifting rule,
equation (42), with a = 1. Then get + 1) = 1 - (t + 1) = - t and
(71)
Finally, take get) = sin t - 1, a = n12, so that get + nl2) = sin(t + n12) - 1 = cos t - 1
in (42) and
Y
1.0
y=l Y = sin(t)
y=t
0.0
Next consider a sectionally defined function J(t) consisting of M sections with transitions
at the points 0 = to < t 1 < ... < tM' If J(t) = In (t) for tn-l < t < tn, then generalizing
(69), one can write
Notice that if eachJn(t) is defined for all t ::: tn - l then (74) can be rearranged, as in (70),
to get
This represents J(t) as a sum of terms whose 'c-transforms can be calculated by the
second shifting rule when the transforms of Jl (t),fl (t + tl),h(t + tl)' ... are known.
Infinite Sums. Sectionally defined functions J(t) with infinitely many transition
points 0 = to < tl < ... < tn < ... are also useful. Thus we assume that J(t) = In (t)
for tn-l < t < tn and n = 1,2, .... For simplicity it will be assumed that tn -+ +00 as
n -+ 00. Then the analogue of (75) is
=L
00
where we define Jo(t) == O. This infinite series is a finite sum. Indeed, for any fixed value
of t, there is an index N such that t < tN, Thus if t < tn then H(t - t n) = 0 for n = N,
N + 1, .... However, the C-transforms of the individual terms in the series will not be
zero, in general. This is illustrated below.
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 243
Periodic Functions. A functionf(t), defined for t ~ 0, is said to have period p > °if
Graphically, this means that the graph of y = f(t) repeats itself in sections of length p;
see Figure 2.
It is natural to build periodic functions sectionally, in sections of length p, with
transition points at t = 0, p, 2p, 3p, .... To do this define on (-00, (0)
(82) Fp(s) = l P
e-stf(t) dt,
To verify the first equality of (83) note that C{fp(t - np)H(t - np)} = e-nsFp(s) by the
second shifting theorem. The second equality follows by noting that e- ns = (e- S ? and
o < e- S < 1 for s > O. Thus the series in (83) is the geometric series L~ xn with ratio
x = c S and 0 < x < 1.
In the remainder of this section the periodic function rule will be used to calculate
the C-transforms of several simple periodic functions.
EXAMPLE 7. A Square Wave. The definition of the square wave function J(t) is indicated
in Figure 3.
The square wave J(t) is given by equation (80) with P = 2a and
= lo a e- st dt - 12a (1 e- )2 as
(85) Fp(s) e- st dt =- ---
o a S
(1 - e- as )2 (1 - x)2 1 1- x
(86) F(s) = = ---
s(1 - e- 2as ) s(1 - x 2 ) S 1 + x'
o
a 2a 3a 4a Sa 6a
-1
FIGURE 3. Asquare wave y = J(t)
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 245
x = e- as This gives
1
(88) F(s) = - [1 - 2e- as + 2e- 3as _ 2e- 4as + ... J.
s
Combining this with the second shifting rule gives an alternative representation the
square wave:
EXAMPLE 8. A Triangular Wave. The definition of the triangular wave function get) is
indicated in Figure 4.
The C-transform G(s) can be calculated from the periodic function rule, as in the
preceding example. The triangular wave is precisely the integral of the square wave:
(91) G(s) =
1
S2 tanh (as)
2" .
Circuit Problems with Continuous Inputs. Solve the following differential equations using the £-
transform method and check the solution.
1. (LR Circuit) 3i'(t) + li(t) = 10, i(O) = 1.
2. (Re Circuit) Sq'(t) + O.Sq(t) = sinOOt), q(O) = O.
Circuit Problems with Discontinuous Inputs. Solve the following differential equations by the £-
transform method. Check the solution.
3. (LR Circuit) Li' (t) + Ri(t) = vet), L = 10, R = 100, vet) = 100e- t on 3 ~ t < 00, vet) = 0 otherwise.
Find the current i(t) given i(O) = o.
4. (LR Circuit) Li'(t) + Ri(t) = vet), L = 10, R = 100, vet) = 400t on 0 ~ t < 1, vet) = 0 otherwise.
Find the current i(t) given i(O) = o.
5. (RC Circuit) Rq'(t) + (lIC)q(t) = vet), R = 100, C = 0.1, vet) = 1000 on 0 ::: t < 1.1, vet) = 0
otherwise. Find the current i(t) = q' (t) given q(O) = o.
6. (RC Circuit) Rq'(t) + (l/C)q(t) = vet), R = 100, C = 0.1, vet) = 100(t - 2) on 2::: t < 00, vet) = 0
otherwise. Find the current i(t) = q' (t) given q(O) = O.
Vibrating Systems with Continuous Inputs. Solve the following differential equations by the £-transform
method.
7. (Constant Input without Damping) /'(t) + 4y(t) = 100,y(0) = /(0) = 0 Findy(t).
B. (Sinusoidal Input without Damping) yl/ (t) + 4y(t) = 10 sin(20t), yeO) = / (0) = O. Find yet).
Vibrating Systems with Discontinuous Inputs. Solve the following differential equations by the £-
transform method.
9. (Single Square Wave without Damping)yl/(t)+2y(t) =j(t),y(O) =/(0) = O,f(t) = 1£orO::: t < 1,
j(t) = 0 otherwise. Find yCt) and the steady-state oscillation.
10. (Single Square Wave with Damping) yl/(t) + 3/(t) + 2y(t) = j(t), yeO) = /(0) = O,j(t) = 1 for
o ::: t < 1,J(t) = 0 otherwise. Find yet) and the transient state for t > l.
Applications of the First Shifting Rule. Find £([(t)}, given j(t).
1 s-1
lB. (s-0.5)4 .
23. (s-1)2+3 .
2 100
19. (s-1)2+4 .
24. (s+0.3)1000·
1
20. s2+2s+2 .
25. 5s 22s+5
+lOs+6 .
s-1
21. 2A+Bs
s2+4 .
26. s2+2s+2 .
se2 - s
35. 2 cosh
s 1
s-e' 3B. s2+4s+5 .
4.3 OPERATIONAL RULES FOR THE LAPLACE TRANSFORM 247
2
40. 53+25 43. 1 5-1
. 525 2+1'
1 1 1
41. 54 -165 2' 44. 545 2+16'
y'(s) N +-I+N
+(- - -) yes) = o.
s 1- s
y'(s) N +-I+N
+(- - -) yes) = 0, YO) = 0,
s 1- s
N
by separation of variables to get yes) = eN (5~!{
5
. Conclude that the Nth Laguerre polynomial is given
by
50. sin wt, cos wt. 51. sin wt, H(t - n).
Applications of the Convolution Rule. Find h(t) by the convolution rule, given .c{h(t)}.
1 1 1 1
52. (5~l)2' 53. 52 5-4' 54, S 52+4'
Applications of the Convolution Rule to Oscillations. Find yet) by the convolution rule.
55, (Undamped Forced Oscillation) y" +y = sin 2t, yeO) = y' (0) = O.
56. (Undamped Forced Oscillation with Shifted Data) /' + y = sin t, yeO) = 1, /(0) = O.
57. (Undamped Oscillation with Discontinuous Input) y" + 16y = sin t[H(t) - H(t - n)l, yeO)
/(0) = O.
58. (Damped Oscillation with Discontinuous Input) y" + 3/ + 2y = H(t - 1) - H{t - 2), yeO) =
/(0) = O.
where j(ta+O) and jUa -0) are the right- and left-hand limits ofj(t) at ta. Why does the extra term arise?
62. (Convolution Example) Letj(t) = 1/0, get) = 1/.;rt=li. Show that J~j(x)g(t - x)dx fails to exist
at t = 1 and explain why this example is excluded by the hypotheses of the convolution theorem.
63. (Convolution Rule) The main step in the proof of the convolution rule is the identity
where F(t,x) = j(x)e-stg(t - x) (s is constant in the argument). Justify the interchange of dx and dt
and the new limits of integration by citing a reference from advanced calculus.
£-Transforms of Periodic Functions. The following problems assume a periodic input.
64. (£-transform of a Triangular Wave) Use direct integration to calculate the integral
.... t
o rrlw 3rrlw FIGURE 6. Full-wave sin wt
4.4 ApPLICATIONS TO DiffERENTIAL EQUATIONS 249
68. (Steady-State Current) Find the steady-state current in an LR-circuit with 2a-periodic square wave
input vet) = A for 0 :::: t < a, vet) = 0 for a :::: t < 2a.
Power Series Methods. The following problems use a Taylor series representation to derive formulas for
the C-transform.
69. (Derivative Rule) Assume that l' (t) is represented by its Taylor series
L -(-- tn-
00 j(n) (0)
1'(t) = 1 for all t.
n=1 n - 1)1
(92) io
t
j(r)dr= L (n+l!)
00
n=O
j(n)(O)t n+1
Derive the t-integral rule by applying the C-transform to the two sides of (92).
71. (First Shifting Rule) Assume thatj(t) is represented by its Taylor series (1). Then
Use the rule Cleatt n } = nlf(s - a)n+l to obtain the first shifting rule
4. The .c-transform of the differential equation is used to obtain an algebraic equation for
the .c-transform Xes) = .c{x(t)}. The equation is solved algebraically for Xes). Finally,
x(t) is constructed from its .c-transform X(s). This last step is based on the operational
rules of Section 4.3, supplemented by transform tables and computer programs for
calculating inverse transforms.
LR and RC Circuits. The LR circuit equation of Section 2.3,
di
(1) L- + Ri = E(t),
dt
and the corresponding RC circuit equation
The solution withx(O) = 0 has C-transform Xes), which satisfies (s + a)X(s) = W/(S2 +
or
( 2 ),
C
(9) x(t) = Ae-at + B cos wt + - sin wt
w
w ) -at asin(wt) wcos(wt)
= ( e + - .
w 2 + a2 w 2 + a2 w 2 + a2
Clearly, the first term on the right is a transient term, while
a w
(10) xss(t) = 2 2 sin(wt) - 2 2 cos(wt)
w +a w +a
is the steady-state solution. This function is periodic, with the period of the input
function, even though the homogeneous equation has no periodic solutions. Of course,
the method of undetermined coefficients can also be applied to (7) to yield the solution
(10).
Periodic Square Wave Input. Next consider (3) with a = 1 and periodic square wave
input jet) with period 2. Thus jet) is given by (81) in Section 4.3 with p = 2,
The constants C1 and C2 must be determined so that x(t) is continuous and periodic.
First, x(t) must be continuous at t = 1, so C1 and C2 must satisfy
(17)
Next, to be a periodic solution, x(t) must satisfy (14) with t = 0; that is, x(O) = x(2), or
(I8)
Now, (17) and (18) are a pair of linear equations for C1 and C2. By Cramers rule or the
method of elimination these equations have the unique solution
-2e 2e2
(I9)
C1 = e + l' C2=--·
e+I
Thus, if (13) is to have a periodic solution x(t), then for 0 ~ t ~ 2 it must be given by
2e)
1- ( e + 1 e- t for 0 ~ t ~ I,
(20) {
xp(t) = -1+ (2e
--
)
e
-t
for 1 ~ t ~ 2.
e+I
Moreover, we have
I-e
(2l) x(O) = x(2) = --.
I+e
This value determines the unique continuation of x(t) to t ~ 2. Since x(2) = x(O), the
continuation should be simply xp(t), translated to the right 2 units; i.e., xp(t - 2). To
check this, notice that by (ll), (12) and (13),
and (2l) holds. To identify the solution of this initial value problem, let
and make the change of variables (t,x) ~ (r,y) in (21), (22) to obtain
dy
(24) - +y =!p(r) for 0::: r::: 2,
dr
l-e
yeO) = - .
l+e
But the unique solution of this was shown above to be y(r) = xp(r). Thus, by (23),
as expected. Similarly, (13) on 4 ::: t ::: 6 withx( 4) = (l-e)/(l +e) givesx(t) = xp (t-4),
4 ::: t ::: 6, etc. This argument proves the following result.
Theorem. The LR circuit equation with periodic square wave input, equation (13), has
a unique periodic solution x(t) = xss(t) defined by
x
0.6
0.0
-0.6
FIGURE 7. Graph of the steady state
o 6 response of an LR circuit
254 4 THE LAPLACE TRANSFORM
(32) k(t) = 1 - e- t .
Note that for any particular value of t, the series in (34) terminates. Thus, for 2n < t <
2n + 1,
(35) x(t) = x(O)e- t + (k(t) - 2k(t - 1) + 2k(t - 2) + ... + 2k(t - 2n»
(36)
it follows that
= -1 +2 (
1 + e2n +!)
I+e
1- e 2e2n+1
=--+--.
I+e I+e
Combining (37) and (38) gives for 2n < t < 2n + 1,
and thus (27) holds for all t 2: O. In particular, (40), (41), (42) imply that x(t) = xss(t)
if and only if x(O) = (1 - e)/(l + e).
Electrical and Mechanical Oscillators. The electrical and mechanical oscillators
that were analyzed in Section 2.5 and Section 3.3 are modeled by the initial value problem
whence
(46)
(47)
The constants AI, A2 can be calculated from the initial values Xo and Yo.
Case 2. (a 2 - 4b = 0). Then S2 + as + b = (s - A)2, where A = -al2 is a real double
root. Thus
Al A2
(48) Xes) = -s--A + -(S---A)-2 '
and
(49)
Case 3. (a 2 - 4b < 0). Then 52 + as + b = (s + I)2 + y2, y = .Jb - a2/4, and Xes)
can be written
(50)
Therefore,
(51)
This reproduces the particular solution of (43) that was found in Section 3.3 by the
method of variation of parameters.
The Transfer Function. Equation (54) represents Xes) = C{x(t)}, the C-transform
of the oscillator response, as a product of a factor K(s) that depends only on the
physical properties of the oscillator and a factor F(s) = C{f(t)} that depends only on the
input function. The function K(s) is called the transfer function of the oscillator. This
simple example illustrates a concept that is widely used by engineers in the analysis of
complicated electrical networks and other linear systems. Additional examples of transfer
functions are given in Section 4.5.
The exact form of k(t) depends on the three cases discussed above. In Case 1 it is
easy to check that
(56)
and hence
~lt_~2t
(57) k(t) = .
Al - A2
This agrees with (19) in Section 3.3. The other two cases follow similarly
Damped Oscillator with Sinusoidal Input. This problem was solved in Sec-
tion 3.3, equation (50) and following. Here the following example will be solved by the
C-transform method:
1 As + B C(s + 1) + 2D
(59)
Xes) = (S2 + 1)(S2 + 2s + 5) = S2 + 1 + (s + 1)2 + 22 '
258 4 THE LAPLACE TRANSFORM
where the second equality is the appropriate partial fractions expansion. Simple algebra
gives the coefficients
1 1 1 -1
(60) A=--
10'
B=-
5'
c=-
10'
D--
- 20'
Thus one has
1 s 1 1 1 s+1
(61)
Xes) =- 10 S2 + 1 + 5" S2 + 1 + 10 (s + 1)2 + 22
1 2
20 (s + 1)2 + 22 '
where the last line is convenient for the use of C-transform tables. The latter give
1 1 1 1
(62) x(t) = - - cos t +- sin t + _e- t cos(2t) - _e- t sin(2t).
10 5 10 20
This can be written
where
1
(69) F(s) = - [1 - 2e- rrs + 2e- 2rrs _ 2e-3rrs + ... J.
5
(70) Xes) = M(s) [1 - 2e- rrs + 2e- 2rrs _ 2e-3rrs + ... J'
where
1
(71) M(s) = [{met)} = [( )2 2]·
55+1 +2
The method of partial fractions gives, after some algebra,
1 1 5 +1 1 2
M(s) = 5s - 5(s
(72)
+ 1)2 + 22 10 (s + 1)2 + 22
and hence
1 1 1
(73) met) = - - _e- t cos(2t) - _e- t sin(2t).
5 5 10
It will also be convenient to define
1
(74) let) = -(2cos(2t) + sin(2t»e- t
10
so that
1
(75) met) = - - let).
5
Combining (70) and the second shifting rule, M(s)e- mrs = H(t -rrn)} [{ met -rrn), gives
the solution
Also, since cos(2t) and sin(2t) have period rr, one has from (74) and (75),
( eIr+l
f!'-I)l_~
5
2 e-(t-2Irn)
+ - -Ir . (2cos(2t) + sin(2t»
l+e 10
1 2
= cl(t) - - + - - l e t - 2Jrn)
5 1 + eIr '
1 2
(82) x(t) = d(t) +- - - - l (t - (2n + l)Jr).
5 1 +eIr
Note that the term d(t) is a transient term, tending to zero as t -+ 00. If this term is
dropped in (81) and (82) then the remaining terms define a periodic solution xss(t), of
period 2Jr. For the interval 0 .::5 t .::5 2Jr we have from (82) with n = 0, and also (81)
with n = 1, xss(t) = xp(t), where
1 2
- - - - l e t - Jr) 0.::5 t .::5 Jr,
(83) 5 l+eIr '
1 2
-- + - - lIr( t - 2Jr) Jr .::5 t .::5 2Jr.
5 l+e '
4.4 ApPLICATIONS TO DIFFERENTIAL EQUATIONS 261
Exercises 4.4
Applications to Initial Value Problems. Use the C-transform and the initial values to calculate the C-
transform Xes) = C{x(t)}. Then find the partial fractions form of Xes). Finally, use tables or a computer
algebra system to determine x(t).
1. x' - 0.05x = 0, x(O) = 200.
2. x" - 3x' + 4x = 0, x(O) = 1, x'(O) = 2.
3. x" +x' + 1.25x = O,x(O) = 2,x'(0) =-1.
4. 9X" - 6,( +x = O,x(O) = 5,x'(0) = o.
5. x" + 20x' + 200x = 0, x(O) = 0, x' (0) = 20.
Applications to Finding Particular Solutions. Find a particular solution of the nonhomogeneous
equation by applying the C-transform method with initial values xCO) = 0, x'CO) = o.
The inputj(t) is the 271"-periodic triangular wave that on [0,271"1 agrees with the function
-
jet) = {t7I"-t °
71"
~ t < 71",
~ t < 271".
In the exercises below, the general solution is obtained, and the steady-state solution is computed.
27. Use the L:-transform and the convolution method to solve the damped oscillator problem in convolution
integral form.
28. Evaluate the convolution integral x(t) of the damped oscillator problem on the interval
Hint: On [71",271"1 the answer is
° ~ t ~ 271".
29. Plot the oscillator solution x(t) on the interval 0 :::: t :::: 2n.
30. Calculate the initial values of the steady-state solution of the damped oscillator problem and plot this
solution on [0,2n].
Hint: The steady-state solution x(t) = Xh(t) + xp(t) has period 2n and therefore it satisfies
x(O) = x(2n), x' (0) = x' (2n). In these equations, xh (t) = (q cos(2t) + c2 sin(2t))e- t and xp(t)
is the particular solution defined and plotted above, satisfyingxp(O) = x~(O) = O. Solve equations
xeO) = x(2n), x' (0) = x' (2n) for q, C2 and plot xU) on 0 :::: t :::: 2n.
(1)
(2)
264 4 THE LAPLACE TRANSFORM
ql
~ ~i2
CI i3 q2
iI
C2
t
Here only the special case L = 6, R = 10, l/C I = 6, 1/e2 = 60 will be discussed. In
this case (1), (2) become, after removal of constant factors,
(5)
It will be shown that the system has one and only one solution for each choice of ai, a2, bl .
On transforming (3), (4) and using the operational rules of Section 4.3, one gets for
the transforms Ql (s) = £{ql (O}, Q2(S) = £{q2(t)} the pair of equations
where PI(S) and P2(s) are quadratic polynomials whose coefficients involve at, a2, bl ·
The right-hand sides of (9) are the partial fractions appropriate to the denominator D(s).
Clearly, (9) implies that
(ll)
3s + 5-5s
(12) PI (s) =
-1 s+ 6
= (3s + 5)(s + 6) - 5s
= 3(S2 + 6s + 10),
5 1
(13) ql CO = 2:c-t - 2e- 2t + 2: c- 3t ,
1 _/
q2(t) = - -e + e-2t --e
1 -3t
2 2
It is easy to check that (13) satisfies the original system (3), (4) and the initial conditions
(11).
Each body experiences two Hooke's law forces, from the springs on either side of
them. Thus Newton's law gives the pair of equations
(16)
since the second derivative of sin wt is -w 2 sin wt. Substituting these into (15) gives,
after some rearrangement, the pair of equations
(18)
This has two positive solutions: w = ,,;'k and w = .J3k. Thus (17) has the solutions
w = ,,;'k, CI = C2 = 1 and also w = .J3k, CI = -C2 = 1. This gives the solution pairs
(19) Xl = sin -Jkt, X2 = sin -Jkt,
Xl = sin J3kt,X2 = - sin J3kt.
Moreover, one can replace sin wt by cos wt in (16) to get the two additional solution pairs
Thus four distinct (linearly independent!) solutions have been found. Moreover, if each
is multiplied by a constant and they are added together, new solutions are obtained,
because system (15) is linear and homogeneous. The L:-transform will be used to show
that all solutions arise in this way.
4.5 ApPLICATIONS TO SYSTEMS OF DIFFERENTIAL EQUATIONS 267
Solution via the 'c-transform. System (15) is second order in both Xl and X2, and
hence appropriate initial values are
(24)
and is never zero for real s. This implies that Xl (s) and X2 (s) are uniquely determined
by (23). In fact, a simple calculation gives
(25)
X (s) = (S2 + 2k)(sal + bl ) + k(sa2 + h)
I (52 + k)(S2 + 3k) ,
and
(26)
X (s) = (S2 + 2k)(sa2 + b2) + k(sal + bl ) .
2 (S2 + k)(s2 + 3k)
It is clear from the partial fractions expansion form of these functions that Xl (0 and
X2(t) are linear combinations of the four special solutions defined by (19) and (20). In
particular, al = a2 = 0, b[ = h = J'k gives the first solution pair of (19), while
al = a2 = 0, bl = -b 2 = J3k gives the second; al = a2 = 1, bl = b2 = 0 gives the
first of (20), and al = -a2 = 1, bl = b2 = 0 gives the second. On multiplying the four
solutions by constants AI, A 2, A 3, A4 and adding, one gets the general solution
with derivatives
To calculate AI, A 2, A3, ~ from the initial values (21), set t = 0 in these equations to
get the equations
and
(31) Al = (b l + b2 )/2../h,
A2 = (b l - b2)/2J3k,
A3 = (al + a2)12,
A4 = (al - a2)/2.
Equations (27) have been shown to define the general solution of the oscillator problem
of Figure 9.
Note that if A2 = A4 = 0, then the solution is a synchronous oscillation of frequency
.fii;(21r), while if Al = A3 = 0, it has frequency J3kt(21r). These two special solutions
define the characteristic modes of the oscillator. It will be shown in Chapter 5 that
systems of n coupled oscillators have n characteristic modes.
A Two-Body Damped Mechanical Oscillator. Consider two simple mass-spring
oscillators, coupled together by a damper as suggested in Figure 10.
The springs are assumed to obey Hooke's law, while the damper opposes relative
motion of the two masses by a force proportional to their relative speed. Newton's laws
give the equations
The solutions of this system will be determined by the initial positions and speeds of the
two bodies. If the notation (21) of the preceding problem is used, then the 'c-transforms
of equations (33) can be written
(35)
Hence the partial fraction expansion of Xl (s), obtained from (34), (35) is
AIS+A2 A3 A4
(36) X I ()
S = +- - + -:----:_::_
S2 + 1 S + 1 (s + 1)2 .
This gives
(37)
with a similar expression for X2(t). The dependence of AI, A 2, A 3, A4 on the initial data
al, a2, bl , b2 can be calculated from (34).
There is an easier way to find the general solution of (33). To discover it, note that
by adding the equations of (33) one finds that
(38)
(39)
270 4 THE LAPLACE TRANSFORM
and hence
It is not difficult to calculate the constants CI, C2, C3, C4 from the initial data ai, a2, bl ,
h Thus (41) defines the general solution of the system (33).
It is not difficult to see from the structure of the device of Figure 10 that it may have
an undamped oscillatory mode of vibration. Indeed, if the distance X2 - Xl = constant
then the damper is not activated, and undamped oscillations occur (C3 = C4 = 0 in
(41)). In general, the motion consists of a damped part, or transient, and an oscillatory
part, or steady-state oscillation. For large t the two bodies oscillate in phase with constant
distance between them.
An Electric Circuit Analog. The two-loop electric circuit of Figure 11 will be shown
to be isomorphic to the damped mechanical oscillator just discussed.
For if the electric currents ii, i 2, i3 are as in the figure and ql, q2 are the charges on
the capacitors, then Kirchhoff's current and voltage laws give
(42)
If the first three equations are used to eliminate il, i 2 , i 3 , the resulting system becomes
(44)
where f(t) is the applied force. The analogous LRC circuit was analyzed in Section 3.3
above, equations (61)-(66). It was found that if
Moreover, the general solution of (45) differs from (47) by an exponentially damped term,
so that all solutions approach xss(t) for large t. In some applications, such as synchronous
electrical motors, xss(t) may represent an undesirable vibration. For such cases it was
discovered by Frahm in 1909 (see [DenH]) that xss(t) can be completely eliminated by
J(t)
coupling m to a small undamped oscillator. To see how this can be done, consider the
coupled oscillators of Figure 13.
Newton's laws yield the system of equations
(51)
where
(52)
It can be shown that all the roots of D(s) = 0 have negative real parts; see [eh-B] for a
proof. The partial fractions expansion of (51) can be written
(53) () As + Bw
Xs= +--
pes)
52 + w 2 D(s) ,
where pes) is a polynomial of degree 3 or less. The corresponding equation for x(t) is
and Xtr(t) tends to zero exponentially as t -+ 00. Thus, in general, the compound
oscillator of Figure 13 has the same type of undamped steady-state oscillation (47) as the
4.5 ApPLICATIONS TO SYSTEMS OF DIFFERENTIAL EQUATIONS 273
simple oscillator. The only exception to this occurs when the numerator in (51) cancels
the factor S2 + ui in the denominator. This is the idea behind the vibration absorber.
Given the frequency w of the applied force, one chooses hi and ml such that
hi 2
(56) -=w.
ml
Then, from (51),
(57) X( ) = fOmlw
S D(s) ,
and it follows that x(t) = Xtr(t) is exponentially damped. Of course, one has from (50),
fowh l
(58) Xl (s) = (52 + ( 2 )D(s) ,
and hence ml will have a steady-state vibration Xl (0 of the form (47). However, this may
be more acceptable in an application than the vibration of the primary mass m. Vibration
absorbers of this and other types are discussed starting on page 87 in [DenHl.
Uncoupled Systems. Use the .c-transform method to calculate x(t) and y(t).
1. (LR Circuit) x' = 3x, y' = 4y + 5, x(O) = 1, yeO) = 2.
2. (LR Circuit) x' = -x + e- t , y' = 2y. x(O) = 0, yeO) = -2.
3. (Mass-Spring System) x" + 4x = 1, y" + y = 0, x(O) = 0, x'(O) = 0, yeO) = 0, y'(0) = 1.
4. (Mass-Spring System) x" + 2X' + 5x = 1, y" + y = I, x(O) = 0, x'(O) = 0, yeo) = 0, y'(0) = 1.
First-Order Coupled Systems. Use the .c-transform method to calculate Xes) = .c{x(t») and yes) =
.c(y(t»). Then use tables or a computer algebra system to find x(t) andy(t).
10. x" + y' - x + y = 1, y" + x' - x + y = 0, x(O) = 0, x' (0) = 0, yeO) = 0, y' (0) = o.
11. x" + y' + 3x = 1, y" - 4x' + 3y = 0, x(O) = 0, x'(O) = 0, yeo) = 0, y'(O) = O.
Applications to Electric Circuits. Below are four two-loop electrical circuits. In each case use Kirchhoff's
laws to write a system of differential equations for the unknowns. Then use the .c-transform method to find
formulas for the unknowns.
274 4 THE LAPLACE TRANSFORM
12. The parameters in the circuit of Figure 14 have the values v = 30 volts, R1 = 10 ohms, R2 = 20 ohms,
L1 = 0.02 henries and L2 = 0.04 henries. Calculate i1 and i2 if i1 (0) = 0 and i2(0) = o.
13. The parameters in the circuit of Figure 15 have the values v = 100 volts, R1 = 20 ohms, R2 = 40
ohms, L1 = 0.01 henries and L2 = 0.02 henries. The currents i1 and i2 are as indicated. Calculate i1
and i2 if i1 (0) = 0 and i2 (0) = o.
14. The parameters in the circuit of Figure 16 have the values v = 500sin(10t) volts, C = 0.01 farads,
L = 1 henries and R1 = R2 = 10 ohms. Calculate the charge q on the capacitor if q(O) =
0 and
i(O) = q'(O) = o.
15. The two loops of the circuit in Figure 17 are coupled by a mutual inductance M. Kirchhoff's laws give
the pair of equations
Calculate the currentsij andi2 ifij(O) = O,i2(O) = oand the parameters have the values Lj = L2 = 1
henry, Rj = 1 ohm, R2 = 0.25 ohm, M = 0.5 henry and v = 120 volts.
C HAP T E R 5
Linear Algebra
Systems of simultaneous linear equations occur frequently in all branches of science and
engineering. Linear algebra is the theory of such systems and the methods available
for their solution. The goal of this chapter is to present the elementary concepts and
algorithms that are needed for applications.
+
v
FIGURE 1. Wheatstone bridge
Ohms law and Kirchhoffs voltage law imply that Xl, X2, X3 satisfy the linear equations
13XI - lOxl - X3 0, =
(2) -lOxl + 16xl - 2X3 = 0,
-Xl - 2X2 + 13x3 = 15.
It will be shown below that this system has the unique solution Xl = 5/12, Xl = 5/12,
X3 = 5/4. Hence, the Wheatstone bridge with the given parameters has branch currents
il = Xl = 5/12, il = Xl = 5/12, i3 = X3 - Xl = 5/6, i4 = Xl - X3 = -5/6,
is = Xl - X2 = 0 and i6 = X3 = 514 amperes.
mass m2 the condition is -Xl + 2X2 - X3 = (glk)m2, etc. Proceeding in this way gives a
system of five linear equations for Xl, ... , xs, which may be written
lxl - X2 =gm/k,
-Xl + 2x2 - X3 =gmik,
(3) X2 + 2x3 - X4 = gm3lk ,
X3 + 2x4 - Xs = gm4 /k ,
- X4 + 2xs = gmslh.
It will be shown below that for any values of glk and masses m l, ... , ms this system
has a unique solution. For example, if glk = 1 and ml = 1, m2 = 2, m3 = 3, m4 = 4,
ms = 5 then Xl = 3516, X2 = 32/3, X3 = 27/2, X4 = 40/3 and Xs = 55/6.
by calculating the positive integer coefficients Xl, ... , X7. The principles are that the
number of atoms of each element must be conserved as well as the total electric charge
Q. This gives six equations for the seven coefficients:
eu 2x1 = Xi,
S Xl = 8X6,
H X2 = 2x7 ,
N X3 = Xs,
0 3X3 = Xs +X7,
Q X2 -X3 =~.
Xl = 24n,
X2 = 128n,
X3 = 32n,
Xi = 48n,
Xs = 32n,
X6 = 3n,
X7 = 64n,
where all, au, ... , amn and bl , b2 , ... , bm are prescribed numbers and the equations are
to be solved for the unknowns Xl, ... , X n . Examples 1,2, and 3 have m x n = 3 x 3,
5 x 5, and 6 x 7, respectively.
Section 5.2 below develops a method for computing all the solutions (if any) of any
linear system. For small values of m and n (say m ::::: 3, n ::::: 3) those solutions can
5.1 SYSTEMS OF LINEAR EQUATIONS 281
be found by hand computation. However, real applications often lead to much larger
values of m and n. For example, practical electric circuits may have 100 or more meshes,
leading to linear systems with m ::: 100, n ::: 100. Problems of nuclear reactor design
lead to systems with several thousand unknowns. Clearly, such problems require efficient
computer algorithms for their solution.
For large systems (m and/or n large) it may not be helpful, or even possible, to write
the equations down on paper. Instead, the numbers all, au, ... , a mn and bl , b2 , ... , bm
that define the system may be generated in the computer as needed. For the theoretical
study of such systems it is essential to have a concise notation that does not require each
equation to be written explicitly The notation of linear algebra was invented for this
purpose.
Vector Notation. A solution of the linear system (5) is an ordered set of real numbers
(Xl, ... ,xn )· The term ordered means that each unknown Xk has its assigned place in the
list. For theoretical calculations each solution set will be denoted by a single letter:
x == (Xl,'" ,Xn).
The set of all such n-tuples x is denoted by Rn. Similarly, b == (b l , ... , bm ) denotes
the right-hand side of system (5). Members of Rn are called vectors or, more explicitly
n-vectors.
Matrix Notation. The set of mn coefficients aij in system (5) will be denoted by the
single letter A. Thus
en a,. )
au
a2l all a2n
(6) A == .
amI a m2 a mn
Such an array of mn values will be called a matrix or, more explicitly, an m x n matrix.
The m x n system (5) can now be written as a single equation, Ax == b, where Ax denotes
the m-vector
(
a2l X I + anX2 + ... + a2nXn
(7) Ax == . .
Vector Algebra. Let n-vectors y = (YI, ... , Yn) and z = (ZI, ... , Zn) be two solutions
of the homogeneous system Ax = 0, so that Ay = 0, Az = 0, or, explicitly,
and
On adding corresponding equations of (8) and (9) one finds that the n-vector w
(WI, W2, ... , wn ) defined by
satisfiesAw = O. Thus two solutions of the homogeneous system, y andz, have generated
a third solution w. It is natural, and convenient, to write w = y + z and call w the vector
sum of the n-vectors y and z.
Similarly, if a is any real number then on multiplying each equation of (8) by a one
finds that u = (OIYI, OIY2, . .. ,00Yn) satisfies Au = O. This suggests that we write u = OIy
and call u the scalar multiple of the n-vector y by the number 01. The definition
can be expressed by saying that u = OIy if and only if Uk = OIYk for k = 1,2, ... , n.
The notation of the last two paragraphs implies that the matrix A obeys the rules
for all vectors x, y and all numbers 01. It follows easily that if {J is another number then
for vectors x, y and all numbers ex, f3. In Section 5.5 these properties will be used to find
a simple description of the solution sets of the homogeneous systems Ax = o.
Complex Numbers. The examples given above dealt with matrices and vectors with
components in the real number field R. However, exactly the same rules hold when the
components are in the field C of complex numbers. Both cases will be treated in the
remainder of this chapter.
Conversion to Matrix Form. Identify matrices A, x and b that convert the systems below to matrix form
Ax = b. Verify the definitions of A, x and b using matrix multiplication and equality of vectors.
1. 2x - Y = 3, 2. x+y-2z= 1,
x+4y = 5. 4x +z = 2,
-x+y = 5.
A Resistive Network. Consider the resistive network of Figure 3. Apply Kirchhoff's voltage law and
Kirchhoff's current law to complete the following exercises.
3. Derive a system of five linear equations for the mesh currents Xl, ... , x5 in Figure 3.
4. Represent the five mesh equations of Figure 3 in the form of a matrix system Ax = b. Fully identify the
matrix A, the vector x and the vector b.
5. Solve the system Ax = b of the previous problem for the mesh currents x, verifying the formula
1
x = 6535 (9702, 3033, 612, 54, 18).
Transition Probabilities. A vector p is called a probability vector if each entry is nonnegative and the sum
of the entries is 1. A matrix A is called a transition probability matrix if each column of A is a probability vector.
6. (Transition Probabilities) Let A be a square transition probability matrix. Show that for each probability
vector p, the vector Ap is also a probability vector. Conclude that iterates Anp of probability vectors are
again probability vectors.
7. (Weather) Let
2'I 4
3 4I)
A -- (1 g 1
4 I
4 .
I I I
4 g 2'
The entries are interpreted as weather transition probabilities for rain, sun or clouds. Precisely, if p is
the 3-vector of today's probabilities for rain, sun, and clouds, then Ap is tomorrow's probabilities. Given
that today's probabilities are 0, ! and!, predict the probability of rain 1, 3 and 5 days ahead.
(2)
-3
2 3
5 6
1
). b=(1,7,5).
To solve (1) it will suffice to omit writing the unknowns Xl, X2, X3 and write only the
augmented matrix
(3)
5.2 THE GAUSS ELIMINATION METHOD 285
Here the left three columns define A and the right column defines h. Thus each row of
the augmented matrix defines the corresponding equation ofthe system (1).
These operations are called elementary row operations. Each of them is reversible.
Hence each produces an equivalent system. This will be illustrated by transforming (3)
to a solvable equivalent form. The goal of the operations is to introduce into A as many
zeros as possible. The double arrow notation (A I h) {::::::} (A'l hi) will be used to
denote the equivalence of systems Ax = hand A'x = hi.
Multiply row 1 of (3) by -4 and add it to row 2 and row 3. This combination of
elementary row operations produces the equivalent system
2 3
(5) -3 -6
-11 -11
Next multiply row 2 by -11/3 and add it to row 3. The result is the augmented matrix
~(
1 2 3
(6) (A' Ib') 0 -3 -6 3I ) ,
0 0 11 -10
This system has a unique solution, which can be obtained by solving the equations in
reverse order. Thus X3 = -lOllI, X2 = -1 - 2x3 = -1 + (20/11) = 9/11 and
Xl = -2X2 - 3X3 + 1 = -18/11 + 30111 + 11111 = 23/11. This shows that system
(7), and hence also system (1), has the unique solution x = (23/11,9/11, -10/11). The
solution may be verified by direct substitution into (1).
286 5 LINEAR ALGEBRA
(8) (A Ib) = (
1
4
2
5
31)
6 7 .
6 9 12 9
It looks similar to Example 1 but has quite a different solution set. To solve it, add -4
times row 1 to row 2 and -6 times row 1 to row 3. This gives the equivalent system
(9)
( o~ -~ -! ~).
-3 -6 3
Next add -1 times row 2 to row 3 and also multiply row 2 by -1/3. The result is the
0
eqUivalent system
(10) (A'lb') =
2 3
1
o
2
0
-l)
This represents the system
The third equation, OXI + OX2 + OX3 = 0, is automatically satisfied and may be dropped.
System (11) has infinitely many solutions, given by X2 = -1- 2x3 ,XI = 1- 2X2 -3X3 =
3 + X3. Thus X3 = t may be any real number and then X2 = -1 - 2t, Xl = 3 + t. In
vector notation the solutions can be written
(12)
This also defines the solution set of (8), since it is equivalent to (11).
represent planes in space, which, according to the solution above, intersect along a line
in space, as in Figure 4.
5.2 THE GAUSS ELIMINATION METHOD 287
(l3) (A Ib) =(
1 2
4 5
31)
6 7 .
6 9 12 3
It differs from Example 2 only in the third component of b. Applying exactly the same
row operations as in Example 2 gives the equivalent system
The last row represents the equation OXI + OX2 + OX3 = 2, which is always false. Hence
(14) has no solution-its solution set is empty. The same is therefore true of system (l3).
Column 2. Apply the procedure described for column 1 to the (m -1) x (n -1) system
obtained from (A'l b') by deleting the first row and first column. This produces an
equivalent system (A" Ib") in which there is a 1 in position (2,2) and zeros in positions
(3,2), (4,2), ... , (m, 2). There is only one exception. If a;2 i= 0, then it may happen
that a~2 = a;2 = ... = a~2 = O. In this case one leaves column 2 as it is and proceeds
to the first column (if any) that has nonzero coefficients on or below row 2.
The Gauss method continues as above through the remaining columns, each time
deleting an additional row and one or more columns. It terminates when all columns of
A have been processed. The final result is the replacement of the original system (A I b)
by an equivalent system (B I c) with the following properties.
(15) • All rows (if any) that consist entirely of zeros are at the
bottom of the matrix.
• In any nonzero row, the first nonzero number (from the
left) is 1. It will be called a leading 1 .
• In any two consecutive nonzero rows the leading 1 in the
lower row is to the right of the leading 1 in the upper row.
Row Echelon Matrices. A matrix that has properties (15) is said to be a row echelon
matrix. Thus matrix (10) of Example 2 is a row echelon matrix. Matrix (6) of Example 1
is not. However, division of rows 2 and 3 by - 3 and 11, respectively, give an equivalent
row echelon matrix:
1 2 3
(16) ( 012 1
-1 )
001 -10/11
The procedure described above for reducing a linear system (A I b) to row echelon form
is called Gauss Elimination. The row echelon form readily yields all solutions (if any)
of the original system. Examples 1, 2 and 3 illustrate the three possibilities. An m x n
system (A I b) is inconsistent and has no solutions if (and only in its row echelon form
has a row consisting of n zeros followed by a nonzero constant, as illustrated in Example
3. System Ax = b has a unique solution if (and only in its row echelon form has n
leading 1's along the main diagonal (possible only if m ::::: n). If m > n then rows n + 1
through m must consist entirely of zeros. Finally, Ax = b has infinitely many solutions
if (and only in its row echelon form has fewer than n nonzero rows.
Solution by Back Substitution. All solutions of an m x n system Ax = b can be
calculated from an equivalent row echelon system Bx = c. If Ax = b has a unique
solution then equation n of the system Bx = cis Xn = en. Next, equation (n - 1) reads
5.2 THE GAUSS ELIMINATION METHOD 289
(~ ~ ~
-1
(17) (Alb) {=:::} 2
000 o
Recall that" {=:::} " denotes equivalence. Equation (17) exhibits two distinct row echelon
matrices that are equivalent to (A I b), but only the latter is in reduced row echelon form.
The second advantage of the reduced row echelon form is that it is immediately
solvable without back substitution. Each nonzero row contains one dependent unknown
followed by zero or more independent unknowns. For example, from the last matrix in
(7) it is seen that X3 = t is an independent unknown, while Xl and X2 are dependent.
Moreover, by inspection one gets Xl = t + 3, X2 = -1 - 2t, X3 = t, as in (2).
290 5 LINEAR ALGEBRA
The extended version of the Gaussian algorithm described above is usually called
the Gauss-Jordan algorithm. 2 It associates to each augmented matrix (A I b) a unique
rref matrix (B I c), equivalent to (A I b), from which the solutions of Ax = b (if any) can
be written down by inspection. For systems of three or four equations the rref matrix
(B I c) can be calculated by hand. For larger numbers of equations and/or unknowns the
Gauss-Jordan algorithm becomes algebra-intensive, requiring large numbers of algebraic
operations. Fortunately, the algorithm can be automated by using one of the computer
algebra systems (CAS for short) that have been developed since the early 1970s. The
use of computers greatly increases the size of the linear systems that can be solved in
practice. This section concludes with three systems that are difficult to solve by hand but
are readily solved by use of a CAS. Many additional examples are given in subsequent
examples and exercises.
(18) A= CBS 97
49
-255
291
147
85
-97
-49
-55
50
63
-705
779
469
-37 )
79
57
45 135 -45 -8 299 -93
3 -1 0 7 0
U D
0 0 1 2 0
(19)
0 0 0 0 1
0 0 0 0 0
All solutions of the system Ax = b are obtained by assigning arbitrary values to X2, X3
and Xs and then computing Xl, X4 and X6 from (20). These solutions can be written in
2After Wilhelm Jordan (1842-1899), German geodesist and mathematician who popularized the reduced row
echelon method through his Handbook of Geodesy.
5.2 THE GAUSS ELIMINATION METHOD 291
n-vector form (n = 6) as
(21) x = (1,0,0,3,0,5)
+S(-3,1,0,0,0,0)
+ t( 1,0,1,0,0, 0)
+u(-7,0,0,-2,1,0),
where S = X2, t = X3 and u = X5 are arbitrary parameters.
EXAMPLE 5. Consider the 4 x 6 system Ax = b where A is the matrix of Example 4 with
a new vector
(22) b = (-470,698,464,-352).
(23) (B I c) =
(
1 3
-1 ° 7 ° 1)
° ° °° ° ° ° 1 2 3
°° 1 5 .
°° ° °°° 1
The last row shows that the equivalent systems (B I c) and (A I b) are inconsistent and
have no solutions. Thus a simple change of the vector b changes the system Ax = b from
one with infinitely many solutions to one with no solutions. From the original forms of
the two systems it is impossible to recognize this. A simple algebraiC test to determine
the nature of solutions of Ax = b is given in Section 5.5 below.
1 0 0 0 1
0 1 0 0 3
(25) (B I c) = 0 0 1 0 5
0 0 0 1 1
0 0 0 0 0
Complex Numbers. The examples given above all have components in the real
number field. However, the Gauss elimination method works equally well with complex
coefficients. An example is given in the Exercises.
Exercises 5.2
3. A = ( _~ -12 ) .
4. A= ( -~
-1
2
-r) .
n
-1
Echelon Form. Find the reduced row echelon form of A.
-1 0 0
( -12 2 -1 0
5. A= ~ -1 2 -1
n
0 -1 2
-1 0 0 0
-1 2 -1 0 0
( 2
6. A= ~ -1 2 -1 0
0 -1 2 -1
0 0 -1 2
Gauss-Jordan Algorithm. In the problems below, solve the system Ax = b.
7. A= (~ ~2 1
1 1
~ ~)'b=(l).
-1 -1 0
5.2 THE GAUSS ELIMINATION METHOD 293
8. A = (! 2 ~). b =( -0.
9. A= ( ~ ~ ). b = ( - D.
Geometric Classification. Classify the following systems of equations geometrically into the classes below.
Class A. The planes intersect in a unique point.
Class B. The planes intersect in a line.
Class C. The planes meet pairwise in skew lines that never intersect.
Class D. All the planes are the same.
Class E. The planes are parallel and there are no intersections.
5x+ 5y+ Z =0,
10. lOx + 25y + 2z = 0,
5x + 15y + 3z = 1.
5x+ 5y+ Z= 0,
11.lOx+lOy+2z=0,
5x + 15y + 3z = 1.
Y =0,
12. z= 0,
y+z=1.
5x+ 5y+ z=O,
13. lax + lOy + 2z = 0,
15x + 15y + 3z = O.
5x+ 5y+ Z= 0,
14.lOx+l0y+2z=l,
15x + 15y + 3z = 2.
5x+5y+ z = 0,
15. 5x + 5y + 2z = 0,
5x + 5y + 3z = 2.
16. (Solution of a Complex System) Solve the complex system
x = (1, i).
17. (Inconsistent Systems) Three planes meet in a unique point that is not on a fourth plane. Explain the
connection with inconsistent systems of equations.
18. (Inconsistent System) Prove that a 3 x 3 linear system Ax = b is inconsistent and has no solution if
and only if there is a row echelon form of the augmented matrix (A I b) having a row with 3 zeros and
4th entry nonzero.
19. (Uniqueness) Prove that a 5 x 3 linear system Ax = b has a unique solution x if and only if a row
echelon form of its augmented matrix (A I b) has 3 rows with a leading 1 and rows 4 and 5 are all zeros.
294 5 liNEAR ALGEBRA
20. (Infinitely Many Solutions) Prove that a consistent n x n linear system Ax == b has infinitely many
solutions x if and only if a row echelon form of its augmented matrix (A Ib) has at least one zero row.
21. (Refinery) Two refinery sites X and Y produce total amounts K of kerosene and G of gasoline. At site
X, they process x barrels of crude oil, and y barrels are processed at site Y. Production at these sites is
slightly different, resulting in the equations
A == ( 4 -1 2 6)
-1
3
5
4
-1 -3
1 3
.
For any column vector b, determine consistency conditions for solvability of Ax == b and report the
solution x (which depends on the components of b).
23. (Tridiagonal Systems) Let A == (aij) be an n x n matrix defined by the formulas akh == 2, 1 ::: k ::: n;
ak+l,k = ak,k+l == -1,1 ::: k ::: n - 1; aij == 0 otherwise. Let b be the first column of the identity
matrix. Solve the system Ax == b for n = 7.
(1)
and
(2)
respectively. This section develops the basic concepts and notations of vector algebra,
based on these operations, that are needed in Section 5.5 to describe the structure of
solutions of linear systems Ax = h.
A vector space V in Rn is a subset of Rn that is closed under vector addition and
scalar multiplication. This means that if x and yare any vectors in V and a is any scalar
then the vectors x + y and ax are also in V.
Vector spaces occur frequently in linear algebra and many will appear in the
remainder of Chapter 5. Here the concept is illustrated with two examples that are
associated with a matrix A.
The Null Space of A. If A is an m x n matrix then the set N(A) of all n-vectors x
that satisfy Ax = 0 will be called the null space of A. Thus N(A) is the solution set
5.3 VECTOR SPACES 295
of the homogeneous system Ax = O. Here 0 denotes the zero m-vector (0,0, ... ,0) (m
terms). In Section 5.1 it was shown that for all n-vectors x and y and all scalars a, one
has A(x + y) = Ax + Ay and A(ax) = aAx. In particular, if x and yare in N(A) then
so are x + y and ax. Thus N (A) is a vector space in Rn.
The Range of A. If A is an m x n matrix, then the set R(A) of all m-vectors b such that
Ax = b has a solution x in Rn is called the range of A. Thus R(A) is the set of all right-
hand sides b for which the system Ax = b is solvable. To prove that R(A) is a vector space,
recall that if b' and b" are in R(A), then there are n-vectors x' and x" such that Ax' = b'
and Ax" = b". But then, by (12) of Section 5.1, b' + b" = Ax' + Ax" = A(x' + x"),
and also ab' = aAx' = A(ax'). Hence b' + b" and ab' are also in R(A). Thus R(A) is
closed under addition and scalar multiplication, and so R(A) is a vector space.
Subspaces of a Vector Space. The set Rn is clearly a vector space in Rn, since it
is closed under n-vector addition and scalar multiplication. All other vector spaces in
Rn are proper subsets of Rn. Such vector spaces are called vector subspaces of Rn. The
smallest subspace of Rn is the subspace Vo = {OJ whose only member is the zero n-
vector 0 = (0,0, ... ,0) E Rn. The subspace Vo is closed under vector addition and
scalar multiplication and thus is a vector subspace of Rn.
Linear Combination of Vectors. If Vi, V2, ... , Vk are vectors in Rn and ti, t2, ... ,
(3)
is said to be a linear combination of the n-vectors Vi, V2, ... , Vk. The set of all linear
combinations of Vi, V2, ... , Vk is a vector subspace V of Rn. The proof of this is left for
the Exercises. V will be called the span of Vi, V2, ... , Vk and the notation
(4)
will be used. A given subspace V may be spanned by many different sets of vectors. This
is illustrated by the following example.
(5) Vi = (1,-2,3,0),
V2 = (2,0,5,0),
V3 = (5,-2,13,0),
296 5 LINEAR ALGEBRA
(6)
(7)
To see this, recall that span {VI, V2} is the set of all vectors
(8)
while span {VI, V2, V3} is the set of all vectors
(9)
Every vector V in span {VI, V2} is in V; just take t1 = 51, t2 = 52 and t3 = a in (9).
To show that every vector in V has the form (8), and thus is in span {VI, V2}, the easily
verified equation
(10)
(ll) I w =
=
+ t2V2 + t3(Vl + 2V2)
t1 v 1
(t1 + t3)VI + (t2 + 2t3)V2,
by the rules of vector algebra. This completes the proof. Equation (10) can be used
Similarly to eliminate VI or V2, and one has
(12)
which shows both the redundancy in the definition (6) and the nonuniqueness of the
spanning sets.
(13)
5.3 VECTOR SPACES 297
Thus if (13) holds with tl # °then VI = (-t2/tj)V2 + ... + (-tk/tl)Vk and Vj is a linear
combination of V2, ... , Vk. Similarly, if t2 # 0, then V2 is a linear combination of Vj, V3,
... , Vk, etc. If a set of vectors VI, V2, ... , Vh is not linearly dependent, we shall say that
it is linearly independent. This is equivalent to the following definition.
Definition. A set of vectors Vj, V2, ... , Vk in RH is said to be linearly independent if
and only if equation (3) holds only when tj = t2 = ... = tk = 0.
Equation 00) can be rewritten as
(14)
and it follows that the set Vj, V2, V3 of Example 1 is linearly dependent. On the other
hand, any two of these vectors are linearly independent. To verify this for the pair Vj, V2
it must be shown that the equation tjVj + t2V2 = 0 can hold only if tl = t2 = 0. The
rules of vector algebra give
(16)
and
Theorem. If VI, V2, ... , Vk is a basis for a subspace V in RH then every vector V in V
has a unique representation as a linear combination
(18)
Proof. Only the uniqueness needs to be shown. Suppose that a vector V in V had two
representations:
We wish to show that these are really the same: t1 = t~, t2 = t;, ... , tk = t;,. But (19)
implies that
(20)
Moreover, V1, V2, ... , Vk are linearly independent, by assumption. Hence, (20) implies
that t~ - t1 = ... = t~ - tk = 0, or t1 = t;, t2 = t;, ... , tk = t~. This completes the proof.
A given vector space V has many different bases. Thus in Example 1, each of the
pairs {V1, V2}, {V1, V3} and {V2, V3} is a basis for V; see (12). More generally, an infinite
number of different bases for V are defined by
Theorem. Let VI, V2, ... , Vk and W1, W2, ... , We be any two bases of a vector space V
in Rn. Then k = £.
Proof (Optional). We shall show that if k < £ then the set W1, W2, ... , We is linearly
dependent and hence not a basis. The opposite case k > £ then leads to the conclusion
that VI, V2, ... , Vk is linearly dependent.
Each of the vectors Wj is in V and V1, V2, ... , Vk is a basis for V. Hence there are
constants aij such that
(22)
for j = 1, 2, ... , £. The coefficients aij define a k x £ matrix A = (aij) (where 1 ::: i ::: k,
1 ::: j ::: e). Using this matrix it will be shown that there exist constants t1, t2, ... , te not
all zero such that
(23)
and hence the vectors Wj are not linearly independent and hence are not a basis. To prove
this, substitute (22) into (23) and collect terms to get
+ ...
Since the vectors Vj, V2, ... , Vk are linearly independent, the coefficients of each Vm must
be zero. This gives k linear equations in the e unknowns tj, t2, ... , te, which can be
written in matrix form as At = 0. But A is a k x ematrix with k < e, so there are more
unknowns than equations. The Gauss elimination method of Section 5.2 implies that
such a system always has nontrivial solutions
where ej has the entry 1 in place j and zero in all others. A linear combination of the
vectors has the form
(25)
and this is the n-vector (0,0, ... ,0) if and only if tj = t2 = ... = tn = 0. Thus the set
of vectors ej, e2, ... , en is linearly independent. Moreover, each x = (Xj, X2, ... ,xn ) in
Rn can be written
(26)
and hence the set (24) spans Rn It follows that (24) defines a basis for Rn and R n
has dimension n. As a corollary, we conclude that any n + 1 vectors in Rn are linearly
dependent.
Testing Sets of Vectors for Linear Dependence. Let Vj, V2, ... , Vk be a set of k
vectors in Rn. By definition, the set is linearly dependent if and only if the vector equation
(27)
300 5 liNEAR ALGEBRA
holds for a coefficient set t = (t I, t2, ... ,tk) i- (0, 0, ... , 0). To test this, introduce the
components of the vector Vj by
(28)
If the n x k matrix A = (aij) is introduced then (29) can be written as the linear
homogeneous system
(30) At = 0.
Thus the vectors VI, V2, ... , Vk are linearly dependent if and only if (30) has a nontrivial
solution. This can be decided by using Gauss elimination to find the reduced row echelon
form of A. The matrix A is readily written down, since its columns are just the vectors
Example 1 concluded. The vectors VI, V2, V3 defined by (5) are known to be linearly
dependent. To verify this by the test described above we have the row equivalence
A= (-~ ~ -~)
3513
{=::} (~ ~ ~).
000
000 000
VI + 2V2 - V3 = 0,
as before.
EXAMPLE 2. Consider the 4 x 6 matrix A defined by (18) in Section 5.2. Then A has the
same null space as the reduced row echelon matrix B defined by (19) of Section 5.2. It
follows that x is in N(A) if and only if
(32) VI = (-3,1,0,0,0,0),
5.3 VECTOR SPACES 301
V2 = (1,0,1,0,0,0),
V3 = (-7,0,0,-2,1,0)
° ° ° ° °
2 -1
° ° °
1
° 0 -8
A=
° ° ° ° °
1 -2
°° ° ° ° 1 -1
°° ° ° 3 -1 -1
° ° ° °
1 -1 -2
2
°° ° -1
° °
° ° ° °
2
° -4
B=
°° ° 2 -2
° 0
0
°° 1 0 -16
°
°° 0 0 2
° -1
0
° ° N°
Thus X7 is the only independent unknown, and
0 -64 3
x=s (3
- 21- 3- -
1 -31 )
8' '2'4'2'64' ,
where s is an arbitrary constant. To solve the chemistry problem, the coefficients Xk must
be positive integers. This is achieved by taking s = 64n, where n is a positive integer.
The solution with the smallest coefficients is given by n = 1, so
x = (24,128,32,48,32,3,64),
302 5 LINEAR ALGEBRA
defined on a fixed interval a ::: x ::: b. Properties (I)-(X) are easily seen to hold with the
usual definitions of sums and scalar multiples. Thus V is a vector space. For this example
5.3 VECTOR SPACES 303
the notions of linear independence, basis and dimension were already encountered in
Chapter 3, where V was shown to have dimension 2. More generally, the solution set of
a linear homogeneous nth-order equation,
EXAMPLE 5. Fix a positive integer n and let Vn be the set of all polynomials of degree less
than or equal to n,
where ao, ... , an are arbitrary scalars. Then Vn is a vector space. It is not difficult to
show that the monomials 1, x, x 2 , ... , xn are linearly independent and hence Vn has
dimension n + 1.
EXAMPLE 6. Let V denote the set of all polynomials eno restriction on n). Then V is a
vector space, and it contains Vn as a subspace for each integer n :::: O. Clearly, V cannot
have finite dimension. Such spaces are said to be infinite-dimensional.
EXAMPLE 7. Let C [a, b J denote the set of all continuous functions on a fixed interval
[a,bJ. Then C[a,bJ satisfies properties (I)-eX) and hence is a vector space. It contains
the set of all polynomials as a subspace and hence is infinite-dimensional.
EXAMPLE 8. Let V be the set of all functions jex) that are defined for -00 < x< 00 and
satisfy
Then V is a vector space. In this case it is not obvious that V is closed under addition.
This will be shown later to follow from the Schwarz inequality:
Complex Vector Spaces. Properties (I)-(X) are stated for scalars from the real
number field R. These properties characterize real vector spaces. If R is replaced by the
complex field C throughout properties (I)-(X), the resulting vector space is said to be a
complex vector space. For eigenvalue problems and some other applications it will be
necessary to use complex vector spaces.
304 5 LINEAR ALGEBRA
Exercises 5.3
Null Space and Range. Determine the null space and range of the matrix A.
1. A == (~ ~). 2. A == (~ ~).
Basis for the Null Space of A. Determine a set of basis vectors that span the null space of A.
1 0
1100)
3. A == ( 0 0 1 0 . ( o 1
000 0 4. A == 1 1
2 0
Independent Sets. Determine a smallest subset that has the same span as VI, V2, v3.
7. A == (~ ~). 8. A == ( 232102)
1 1 0 .
9. (Trivial Subspace) Prove that the origin is a proper vector subspace of Rn.
10. (Subspace) Assume n > 1 and let x be a fixed vector in Rn. Prove that the set V == {ax : a real} is a
proper vector subspace of Rn.
I l. (Subspace) Let XI, ... , Xk be certain vectors in Rn. Let V be the set of all linear combinations
k
LCiXi,
i=1
where q to Ck are arbitrary real numbers. Prove that V is a vector subspace of R" (it may be all of R").
12. (Span) Show that if VI and V2 are n-vectors then
(4)
We shall write C' = aA and call C' the scalar multiple of A by a. The set Mm,n, with
operations of addition defined by (2) and scalar multiplication defined by (4), is a vector
space. The proof is a verification of properties (I)-(X) of Section 5.3. The space Mm,n is
spanned by the matrices Eij (1 :::: i :::: m, 1 :::: j :::: n), where Eij is obtained from the zero
matrix by replacing the element in row i and column j by 1. Moreover, these matrices
are readily seen to be linearly independent and hence form a basis of Mm,n. In particular,
Mm,n has dimension mn.
Some of the vector spaces Mm,n are familiar from earlier work. The space Ml,l is just
the scalar field R (or C) interpreted as a vector space of dimension 1. More generally,
MI,n is the vector space Rn (or Cn) of n-tuples of scalars, or row vectors. Similarly, Mm,l
is the space of n x 1 matrices, or column vectors, and is isomorphic to R n (or en).
Matrices as Linear Operators. Every matrix A in Mm,n determines a linear operator
from R n to Rm. Explicitly, A carries each x in R n to the vector b = Ax in Rm, where
n
(5) hi = L:>ijXj, I:::: i :::: m.
j=l
306 5 LINEAR ALGEBRA
The linearity of the mapping means that (see (13) in Section 5.1)
(7) z = A(Bx)
maps x in RP into z in Rm. Moreover, it is clearly linear and hence has the form
z= ex,
where e is in Mm,p' The composite map e is called the matrix product of A and B and is
written e = AB. To calculate the components of e, we use the component representations
of A and B. Thus
m P
Zj = LajkYk, Yk = LbktXt.
k=l £=1
Matrix Products. The order of the factors A and B in the matrix product e= AB
must be preserved. As a linear operator the product AB means first apply B then apply
A. For this to be meaningful it is necessary that the number of columns of A equal the
number of rows of B. This restriction implies that the product in the opposite order BA
may not be defined. In fact, if A is in Mm,n and B is in Mn,p then BA is defined only if p = m.
The Scalar Product Rule for Computing Matrix Products. If x = (Xl, X2, ... ,xn )
and y = (Yl,Y2,'" ,Yn) are in Rn then their scalar product is the number
L XkYk =
n
(10) (x, y) = XIYl + X2Y2 + ... + xnYn'
k=l
5.4 MATRICES AND MATRIX ALGEBRA 307
With this notation the rule (9) for forming the matrix product C = AB is
This scalar product rule makes it easy to remember how to compute matrix products.
whenever the indicated products are defined. The first rule implies that multiple matrix
products AIA2 ... An are meaningful without parentheses. Proofs of the rules are left to
the Exercises.
Square Matrices. Matrices in the set Mn,n are called square matrices of size n.
Products of matrices in Mn,n are always defined and Mn,n is closed under matrix
multiplication. Almost any example will show that matrix multiplication in Mn,n is not
commutative. For example, in M2,2 if
A=(12)
3 4'
B=(56)
7 8
then
A= ( -21 0
2)' B= (-3 2)
-2 -4 '
for which
AB = ( -~ -6 ) = BA.
-4
The notation D = diag(d l , d2 , ... , dn ) will also be used. The multiplication rule gives
the products
(13) AD=
a2l dl
anldl
.
and2
an2d2
a2n dn
anndn
)
and
c,an
)
dIan dlal n
d2 a2l d2an d2a2n
(14) DA= .
In particular, if D = In where
(15) In = diag(l, 1, ... , 1)
then AIn = InA = A. The matrix In is called the identity matrix in Mn,n. Equations (13)
and (14) imply that if D is diagonal and AD = DA for all A in Mn,n then D = dIn, where
d is a scalar and AD = DA = dA.
Singular and Nonsingular Square Matrices. A square n x n matrix A defines a
linear operator x --+ y = Ax in Rn. We know that RCA), the range of A, is a vector
subspace of Rn. The matrix A will be said to be singular if and only if RCA) i= Rn.
Otherwise A is said to be nonsingular. Thus A is nonsingular if and only if RCA) = Rn.
Now, the system Ax = y can be written in the column vector form Csee the Exercises)
(16) Xl
(
a2l
all)
.
..
+ X2 (an)
.
.
an
+ ... + Xn
.
a2n
(aln)
.
. .
=
..
Y2
. (1) .
This says that vectors y in R(A) are in the span of the column vectors of A. In particular,
A is nonsingular if and only if every y in Rn can be written in this way This implies the
follOwing theorem.
Theorem. An n x n matrix A is nonsingular if and only if its column vectors are linearly
independent and thus form a basis for Rn.
Proof. If the column vectors of A are linearly independent, then they must span Rn ,
and hence A is nonsingular. Conversely, if A is nonsingular, then R(A) = Rn , and hence
(16) implies that the column vectors of A span Rn. To do this they must be linearly
independent.
Corollary. An n x n matrix A is nonsingular if and only if the null space N(A) = {OJ.
This means that the only solution of Ax = 0 is the vector x = O.
Proof. If A is nonsingular, then the column vectors of A are linearly independent. Hence
(16) with y = 0 implies that x = O. Conversely, if N(A) = {OJ then (16) with y = 0
implies that A has linearly independent columns and hence A is nonsingular.
(17) AB = BA = In.
Note that if A is invertible then both A and B must be nonsingular, because (17) holds
if and only if
Proof. We have shown that if A is invertible then it is nonsingular. We must show that if
A is nonsingular then it is invertible. To prove this, we note that if A is nonsingular then
the map x ----+ y = Ax has range R(A) = Rn. Moreover, the map is one-to-one. For if
y = AXl = AX2 with Xl =f. X2 then the linearity of A implies that x = Xl - X2 satisfies
Ax = 0, so x is in N(A). But this implies that x = 0 by the Corollary above and so
Xl = X2, contrary to assumption. Thus A maps Rn one-to-one onto R(A) = Rn and hence
310 5 liNEAR ALGEBRA
has a unique inverse x = By. The linearity of A implies that B is also linear. Moreover,
x = BAx for all x in Rn, and y = ABy for all yin Rn, so AB = BA = In, and A is invertible.
Uniqueness of the Inverse. The inverse matrix B of (17) is unique when it exists. For
if (17) holds and also AC = CA = In then AB = In implies CAB = C and hence CA = In
implies that B = C.
Test for Invertibility of a Square Matrix. We have shown that a square matrix A
is invertible if and only if N(A) = {O}. Now, solution of Ax = 0 by Gauss elimination
generates all solutions. The matrix A is invertible if and only if x = 0 is the only solution.
This can happen only if the reduced row echelon form of A is the identity In. For if any
diagonal entry of the reduced matrix were zero then Ax = owould have a solution x =I O.
This follows readily from (14). The matrix E- 1 is found by replacing c by lie.
Adding a Row to Another Row. The corresponding matrix is obtained by applying the
same operation to In. Thus to add row q to row p we have
1 o o o
o 1 1 o
E=
o o 1 o
o o o 1
where only the first, pth, qth, and last rows and columns are shown. The remaining rows
and columns are those of In.
5.4 MATRICES AND MATRIX ALGEBRA 311
Interchanging Two Rows. The corresponding matrix is obtained by applying the same
operation to In. Thus to interchange rows p and q we have
1 0 0 0
0 0 1 0
E=
0 1 0 0
0 0 0 1
where only the first, pth, qth and last rows and columns are shown. The remaining rows
and columns are those of In.
Algorithm for A-I. It was shown above that A is invertible if and only if the reduced row
echelon form is In. Moreover, from Section 5.2, the reduced form can be computed by a
sequence of elementary matrices EI , E2 , ... ,EN of the three types defined above such that
(18)
(19)
The algorithm for generating A -1 is contained in these equations. It states that if the
sequence of elementary row operations reduces A to In, then the same operations produce
A -1 when applied to In. The work may be organized by writing the augmented matrix
(A I In) and applying E I , E2 , ... , EN to it to get (In I A-I).
(A I13 )
(
2
2 -3
3 -6
-2 1 1 0
2 0 1
4 0 0 n
( )
1 0 0 0 2 -1
{=::} 0 1 0 -2 5 -2
0 0 1 -3 6 -2
(I3IA-l).
312 5 LINEAR ALGEBRA
U !)(
As a check we have
o
~ )~h
-2 2 -1
AA-' -3 -2 5 -2
-6 -3 6 -2
Vector Space Mm,n. Let EU be the m x n matrix obtained from the zero matrix by replacing the entry in
row i and columnj by 1. Complete the following.
1. Determine all matrices EU for m = n = 2.
2. Do the matrices Eij span Mm,"?
3. Are the matrices Eij linearly independent?
4. (Linear Operators) Define the mapping nx) = y by Yl = 3Xl - 2X2, Y2 = Xl + 2X2. Compute
anu) + bnv) and nau + by). Is T a linear operator?
Linear Operators on Polynomials. Let T be the mapping from the set V of all polynomials of the form
x = e3t2 + C2t + Cj into R3 defined by nx) = (el, e2, e3).
7. (Conformable Products) Let A = (2, -1, 3) and B = (~). Does AB make sense?
8. (Commuting Matrices) Find two 3 x 3 matrices A and B that commute, both of which have some
nonzero off-diagonal elements.
A=( ; -;
-I 5 4
i).
5.4 MATRICES AND MATRIX ALGEBRA 313
16. (Column Vector Form of a System) Prove that an n x n linear system Ax = y can be written in the
column vector form
Xl (all)
~l
: +X2 : + ... +Xn (a :) = tl)
(al2)
an 1n n
~n
:.
18. (Singular Matrix) Find necessary and sufficient conditions for which a diagonal matrix of order three
is singular.
19. (Singular Matrix) Find necessary and sufficient conditions for which a diagonal matrix of order n is
Singular.
22. (Invertible Matrix) Suppose that a square matrix A is the product of invertible matrices. Explain why
A is invertible.
23. (Invertible Matrix) A given n x n matrix A has reduced row echelon form equal to the n x n identity
matrix. Explain why A is invertible.
verify that
26. (Elementary Matrices) Write out all 3 x 3 elementary row matrices E that correspond to multiplying
a row by a nonzero number.
27. (Elementary Matrices) Write out all 3 x 3 elementary row matrices E that correspond to adding a row
to another row.
28. (Permutation Matrices) All possible products of 3 x 3 elementary row matrices E that correspond to
swapping rows in the identity matrix 13 are called the permutation matrices of order 3. There are six
distinct permutation matrices of order 3. Find them.
Calculation of the Inverse Matrix. Find explicitly the inverse of the given matrix.
314 5 LINEAR ALGEBRA
-3
29. A = ( 21 -3
-1
-1 2)
1 .
5 4
34. A= ( ~
-3
6
5
-1 )
4 .
2
-1 _1)
A~ 1
-1 )
30. A= (12 21 -4 , -2 1 5
35. ( o
3 0 -1 0 1 '
-3 0 o 2
A= ( 21
(l -1 )
31. 11 31) ,
0
-1 -3 2
-2 4)
36. A~ 2 -1
1 '
32. -4 -2 , -1 1
A~ (1
A=(l
-: )
-5 3 -2 2
3 2
37. 1 '
33. A= (43 21 1)
1 . 1
231 0 1
38. (Linear Operators) Show that if x --+ A(x) is any linear map from Rn to Rm (so that (6) holds), then
Ax = b is equivalent to the linear system
n
I>ijXj = hi, 1::: i::: m,
j=l
where A(ej) = I:::l aijei (the standard basis el. " ' . em was defined in (24) of Section 5.3)
Associative and Distributive Laws. Prove the following rules, assuming that all products are defined,
42. (Scalar Product Rule) Find the entry in row 3 and column 2 of the product matrix AB, given
Scalar Product in Rn. If x and yare two vectors in Rn then the scalar product, or
inner product, of x and y, written (x , y), was defined in Section 5.4 by the relation
n
(1) (x, y) = L XhYh = XlYl + X2Y2 + ... + XnYn·
h=l
is called the norm of x. A basic property of the scalar product is the Schwarz inequality.
Schwarz Inequality. For all vectors x and y in Rn,
(4)
Taking C{ = -(x, y)/llyl12 (the minimizing value) and simplifying gives (3). Moreover,
equality holds if and only if x + ay = 0 and hence x and yare linearly dependent.
Geometrical Meaning of (x, y). If x and yare nonzero vectors, then the Schwarz
inequality can be written as
(x, y) I < 1
I IIxllllyll - .
It follows that there is a unique angle esuch that 0 ::: e ::: nand
(5) (x, y) = IIxlillyll cos e.
Thus e can be defined to be the angle between the nonzero vectors x and y. If e = n/2
then cos e = 0 and
(6) (x, y) = o.
Orthogonality. Two vectors x and yare said to be orthogonal if and only if (6) holds.
Note that the zero vector 0 is orthogonal to all vectors and is the only vector with this
316 5 LINEAR ALGEBRA
property (see the Exercises). Moreover, two nonzero vectors are orthogonal if and only
if the angle between them is nl2 radians.
Orthogonal Sets and Bases. A set of vectors in Rn
then taking the scalar product of this equation with any vector Xk and using the
orthogonality gives
then the set YI, n, ... , YP consists of orthogonal unit vectors and has the same span as
the set (7). An orthogonal set of unit vectors is called an orthonormal set. In particular,
an orthogonal basis of unit vectors is called an orthonormal basis.
Orthonormal bases are particularly simple to work with. Thus if Yl, Y2, ... , Yn is an
orthonormal basis in Rn and x is any vector in Rn then
(8)
where
This is easily verified by taking the scalar product of equations (8) and Yk.
The Gram-Schmidt Construction. This method starts with a basis
5.5 THE FUNDAMENTAL THEOREM OF liNEAR ALGEBRA 317
Step 1.
Step 2.
Step 3.
Step k.
At Step k one never gets Yk = °because if this occurred then Xk would be a linear
°
combination of x I, X2, ... , Xk-l and the original set {Xl, X2, ... ,xn } would not be a basis.
Moreover, at Step k the construction gives (Yj , Yk) = for j < k. Hence the new set
{YI, Y2, ... ,Yn} is orthogonal.
EXAMPLE 1. In R2 take
that X and yare orthogonal. More generally, let VI and V 2 be two subspaces of the same
space Rn. Then we shall say that VI and V 2 are orthogonal subspaces of Rn, written
VI .1 V 2 , if and only if every VI in VI is orthogonal to every V2 in V 2 . If this is the case
(9)
proof. Y EB y-L is a subspace of Rn. To prove (9) it is enough to show that any vector x
in Rn can be written as x = Xl + X2 with Xl in Y and X2 in y-L. To do this, let
Then
(0)
Proof. Apply the preceding theorem to the subspaces Y and y-L. This gives
where VI is in Y, V2, and WI are in y-L and W2 is in (y-L)-L. Since V2 - WI is in y-L and
W2 = Vj + (V2 - WI), we have
0= (W2, V2 - WI)
= IIV2 - w1112.
Thus V2 = WI, and therefore also VI = W2. Hence W2 is in Y and (10) follows.
Matrix Transpose and Adjoint. With each m x n matrix A we may associate the
n x m matrix AT obtained from A by interchanging rows and columns. Thus the row
vectors (column vectors) of AT are the column vectors (row vectors) of A. The matrix AT
is called the transpose of A. If A = (aij) and AT = (a&) , then the definition means that
Clearly, (AT) T = A.
A second n x m matrix A * may be associated with A by the equality
The matrix A* is called the adjoint of A. We shall show that the matrices AT and A * are
the same. More precisely, we shall prove the following theorem.
Theorem. For each m x n matrix A there is exactly one n x m matrix A* such that (12)
holds and it is
(13)
and
n
(Ax, y) = L (AX)iYi
i=1
n m
= LXjLaijYi.
j=1 i=1
320 5 LINEAR ALGEBRA
After these preliminaries, we can now formulate the central result of this section as
follows.
(14) R(A)EBN(A*) = Rm ,
(15) R(A*) EBN(A) = Rn.
This result is known as the fundamental theorem oflinear algebra because, as will
be shown below, it contains complete information on the solvability of the linear system
Ax=b.
Proof. Equation (15) follows from (14) on replacing A by A* and A* by (A*)* = A (so
m is replaced by n). Thus only (14) need be proved. To prove (14) we recall that for each
subspace Y of Rm one has Y EB y.l = Rm. Applying this to Y = R(A) gives
Fredholm Alternative Theorem. This result, which gives complete information about
the solvability of linear systems Ax = b, may be formulated as follows.
5.5 THE FUNDAMENTAL THEOREM OF LINEAR ALGEBRA 321
In the second case, Ax = b has solutions if and only ifb J.. NCA*).
Proof. The first case occurs when dim RCA) = m and hence, by (14), NCA*) = {O}.
The second case occurs when dim RCA) < m and hence dimNCA*) > O. Moreover, in
the second case RCA) = NCA*)~ and hence vectors bin Rm are in RCA) if and only if
b J.. NCA*).
Rank and Nullity of a Matrix. The rank and nullity of a matrix A are the
nonnegative integers
We have shown that the column space of A is the subspace RCA) in Rm. Similarly,
the row space of A is the span of the row vectors of A in Rn. This coincides with the
subspace R(AT) = RCA*) in Rn. It is a remarkable fact that R(A) and RCA*) always
have the same dimension, even when m =f. n. The result may be formulated as follows.
To prove (20), let aI, a2, ... , an denote the column vectors of A = (ely), so
all)
a21 (an)
a22 (aln)
a2n
al = ( . ,
..
a2 = .,
..
an = . .
..
amI am 2 amn
If we write r = rank(A) then the definition of rank(A) implies that there is a basis for
R(A) with r vectors, sayvl, V2, ... , Yr. Hence there exists an n x r matrix (Cij) such that
(22)
These can be written as a vector equation for the kth column of A*:
ak2
akl) C21
(ell) e22
(en) (Clr)
C2r
( : = Vlk : + V2k : + ... + Vrk : .
akn Cn I Cn2 enr
This says that R(A*) is spanned by r vectors. It follows that dim R(A*) :::: r == rank(A),
which is equivalent to (20). This completes the proof.
EXAMPLE 2. Considerthe 4 x 6 matrix A of (18) in Section 5.2 with reduced row echelon
form B of (19), Section 5.2. Here
(25)
is a solution basis for Ax = 0, and tl, t2, ... ,tk are arbitrary scalars. In particular, by (24),
The rank of A, k, and the vectors X p , Xl, X2, ... , Xk are all readily generated by Gauss
elimination.
Complex Field. The results of this section extend to linear systems with complex
coefficients, provided that the scalar product is redefined by
n
(x, y) = I>kYk,
k=l
where Z denotes the complex conjugate of z. The adjoint of a complex matrix A is then
-T
A* = A ; that is,
324 5 LINEAR ALGEBRA
1. (Orthogonal Subspace) Let V be a plane through the origin in R3 Describe geometrically how to
construct a basis for V-1.
2. (Orthogonal Complement) Let K be any subset of R", not necessarily a subspace, and let K-1 denote the
set of all vectors orthogonal to all of the vectors in K. Prove that K-1 is a subspace of R" and determine
(K-1 )-1
Xl = (1,1), X2 = (1,0)
to obtain the orthogonal set
YI = (1,1), Y2 = (1/2,-1/2).
6. (Gram-Schmidt) Prove that at Step k in the Gram-Schmidt construction, Yh is orthogonal to Xl, ... ,
Xk-l and also to Yl, ... , Yk-I·
7. (Fredholm Alternative) Give an example of a 3 x 3 matrix A and a vector b such that Ax = b has a
solution and also A*y = 0 has nontrivial solutions.
8. (Orthogonality) Show that the zero vector is the only vector that is orthogonal to all vectors.
9. (Rank) Let A be a 6 x 8 matrix with rank 4. How many vectors are in a basis for the column space of A?
10. (Adjoint Relation) Prove the adjoint relation A* = :;f for the complex case.
11. (Rank) Let A be a 6 x 8 matrix with rank (A T) = 5. How many vectors are in a basis for the row space of A?
12. (Nullity) Compute the nullity of a 6 x 8 matrix A that satisfies rank(A*) = 3.
Fundamental Theorem of Linear Algebra. Compute bases for the four subspaces R(A) , R(A*), N(A),
N(A*) of the fundamental theorem.
13. A = (~ ~ ~).
=( ~ ~)
(l ::n
14. A
15 A=
16. (Inverse Matrix) Let a square matrix A have orthogonal columns. Show that A-I is formed from A by
replacing each column c of A by dllcll 2 followed by taking the transpose of the resulting matrix.
5.5 THE FUNDAMENTAL THEOREM OF LINEAR ALGEBRA 325
-2 )
1 .
2
18. (Orthogonal System) Let a square matrix A have orthogonal columns aI, ... ,an. Show that the solution
of the linear system Ax = b is the solution of the vector equation
19. (Solution of an Orthogonal System) Find the solution X of the linear system Ax = b, given the matrix
A and vector b below.
Regression. A regression line is a straight line of the form Y = mx + b that best fits data points (Xl, Yl),
... , (Xn, Yn) in the sense that the sum-of-squares error
n
E = L(mxk +b- Yk)2
k=l
is minimized. The solution of the minimization problem can be succinctly expressed in terms of the data
vectors
20. Show that the intercept b and the slope m of the regression line satisfy the equations below.
21. Find the regression line Y = mx + b for the accident data below and test the equation graphically for
quality of fit.
Years driving 4 6 8 10 12 14
Accidents 10 8 4 8 6 5
22. Assume that the data have been subjected to translations such that
326 5 liNEAR ALGEBRA
Show that the solution to the regression problem is b = 0 and that m is given by the projection formula
(x, Y)
m=--.
(x, x)
b). The resulting formulas are the n-dimensional Cramer's rule. The calculation is based
on n x n determinants, which will be defined first.
The Determinant Function. The n x n determinant function assigns to each n x n
matrix A = (aij) a Single real or complex number, which will be denoted by det (A). An
alternative notation for det(A) is IAI. Here the vertical bars "I" do not denote a norm or
absolute value. The notational distinction is that A = (aij) denotes an n x n matrix, an
array of n2 numbers, while det(A) = I (aij) I denotes a single real or complex number,
the determinant of A, which is to be computed from the numbers all, al2, ... , ann. The
definition of det(A) for general n x n matrices A, and algorithms for computing it, are
given below. The main lines of the theory are suggested by the special cases n = 1, 2,
and 3, which are examined first.
1 x 1 Determinants. Here A = (all) and det(A) = all' The system Ax = b is the single
linear equation all Xl = bl , which is uniquely solvable if and only if det(A) = all =f. O.
2 x 2 Determinants. The determinant of a 2 x 2 matrix is defined by
-a 21 bl + allb2 1 all b1
X2 = =- -
det(A) b2
allan - allall all
This is the classical Cramer's rule for the case n = 2. An alternative form of (3) that is
useful for the generalization to n > 2 is given next.
Let Mjh denote thejk-minor of the matrix
(4)
Thus Mjk is the 1 x 1 matrix (or number) obtained from A by deleting row j and column
k. Thus Mll = (an), M12 = (a2l), M21 = (al2), and Mn = (all). Next let
Then the definition (1) can be written in the two equivalent forms
(7)
These are called the first and second row expansions of det(A), respectively Moreover,
(l) implies that 2 x 2 matrices satisfy detCA T ) = detCA). Combining this with (7) gives
(8)
These are called the first and second column expansions of det(A), respectively.
Next, define the adjoint matrix of A to be the transpose of cof(A):
where 1= diag(l, 1) is the 2 x 2 identity matrix. It follows that if det(A) -::f:. 0, and hence
A-1 exists, then
(12)
au al2 aD
(13) det(A) = a21 a22 aB
a31 a32 a33
In elementary textbooks it is shown after considerable effort that if det(A) -::f:. °then (14)
has the unique solution
bl al2 al3
1
(15)
Xl = det(A) b2 a22 a23 ,
b3 a32 a33
au b1 al3
1
X2 = det(A) a2l b2 a23
a31 b3 a33
au al2 bl
1
X3 = det(A) a21 a22 b2
a31 a32 b3
This is Cramers rule for the case n = 3. It will be verified below by a proof that is valid
for all n ~ 2.
5.6 DETERMINANTS AND CRAMER'S RULE 329
The cofactor expansions derived above for n = 2 will be formulated next for the
case n = 3. The proofs for this case are omitted here because they are covered by the
calculation for n ~ 2 presented below.
For the 3 x 3 matrix
(16)
the jk-minor Mjk is the 2 x 2 matrix obtained by deleting row j and column k of A. Thus
All
(19) cof(A) = A = ( A2l
A31
Note that the factor (-I)i+k in (18) produces a "checkerboard" pattern of + 1 and -1
over the components of cof (A), with + 1 in the upper left corner. The notation (18) and
the definition (13) of 3 x 3 determinant can be combined to give
(22)
adj(A) = (cof(A»T
satisfies A adj(A) = (det(A» I (now 3 x 3), and thus if det(A) f: 0, then
A -} = (det1(A») adj(A).
330 5 LINEAR ALGEBRA
(23)
Aljbl + A 2jb 2 + A 3j b3 .
Xj= det(A) ,)=1,2,3.
(24) det(A) =
of the n2 components aij of A. The function has been defined explicitly for n = 1, 2 and
3. For the extension to n > 3 it will be convenient to think of det(A) as a function of
the n column vectors of A,
(25)
and to write
(26)
Clearly, if aI, a2, ... , an are knOWTl then the n 2 components aij are also knoWTI, and
conversely
The Case n = 2. For this case
(27)
are the standard basis vectors in R2 Moreover, we shall show that D (aI, a2) is completely
determined by these three properties. This may be stated as a uniqueness theorem.
Uniqueness Theorem. D (aI, a2) = allan - al2a2l is the only function that has
properties (28), (29) and (30).
(31)
all au al3
(32) det(A) = a21 an a23 = D(al,a2,a3)
a31 a32 a33
is defined by the expression (13), and it is easy, but tedious, to show that D (aI, a2, a3)
has the following three properties.
(36) 1) (aI, a2, ... , an) is a linear function of each of the vectors
a l, a2, ... , an separately;
(37) 1) (aI, al, ... ,an) = 0 whenever two (or more) of the vectors
al, a2, ... , an are equal;
where el, e2, ... , en are the standard basis vectors in Rn (ej is the jth column of In).
The behavior of 1) (al, a2, ... , an) under permutations of the vectors el, e2, ... , en
will be needed in the proof of the theorem. The following notation and facts are needed.
A permutation of the set {l, 2, ... , n} is a one-to-one mapping
The notation a(i) will be used for the image under a of the integer i. The set of all
permutations of n objects is the symmetric group Sn and has n! distinct elements. A
transposition is a permutation that interchanges two integers and leaves the others fixed.
It is known that every a in Sn can be represented as a group product of transpositions.
A permutation a is said to be even (respectively, odd) if it is the product of an even
(respectively, odd) number of transpositions. The notation
1, a even,
sgn(a) = (
-1, a odd,
will be used.
5.6 DETERMINANTS AND CRAMER'S RULE 333
Properties (36) and (37) imply that V (a], a2, ... ,an) changes sign when two of the
variables a], a2, ... , an are transposed. The proof is the same as for the case n = 2, given
above as equation (31). More generally, we have
(40)
for every permutation a in Sn. The special case of a transposition was proved above. The
general case follows on factoring a into a product of transpositions.
Proof of the Basic Theorem (Optional). Represent the column vectors a], a2, "', an
by means of the basis e], e2, ... , en:
n n n
(41) a] = l..: aill eil' a2 = l..:
i,=1
ai, 2ei 2' an = l..:ainnein.
il=1 in=l
Then if V is any function that satisfies (36)-(38) then substituting (41) into
V (aI, a2, ... , an) and using the linearity (36) gives (nn terms)
n n
l..: L ... L
n
Moreover, the term V(ei, , ei" ... ,eiJ vanishes unless the vectors eil' ei" ... , ei n are
distinct; that is, unless Ci], i 2 , ... , in) is a permutation of (1,2, ... , n). The nonzero terms
correspond to the permutations a of (l, 2, ... , n), so
(43) V (aI, a2,···, an) = L sgn (a) aa(l) 1 aa(2) 2 ... a(J(n)n'
a
This proves the uniqueness statement of the theorem. We have to show that any function
V (a], a2, ... ,an) that satisfies (36)-(38) must be given by (43). To prove the existence
statement of the theorem we shall show that if V is defined by (43) then V does indeed
have properties (36)-(38). Condition (36) is immediate because each term in the sum (43)
is linear in the components of each of the vectors a], a2, ... , an. Equation (38) also follows
334 5 LINEAR ALGEBRA
because only the identity permutation a(l) = 1, a(2) = 2, ... , a(n) = n places n unit
components of el, e2, ... ,en into the same product. Finally, to prove (37) we have by (43)
In particular, if al = a2 then D (aI, a2, ... ,an) = O. The proof for the other pairs of
vectors is similar. This completes the proof of the theorem.
(44)
where aI, a2, ... , an are the column vectors of A and D is the unique function defined
by the basic theorem.
An explicit formula for det(A) is given by equation (43) above. For n = 1,2 and 3 it
is clear from (43) that definition (44) agrees with the definitions of elementary algebra;
see (l) and (13).
Basic Properties of Determinants. The determinant function has the following
useful properties.
Property 1. For all n x n matrices A one has
det(A T ) = det(A),
where AT is the transpose of A.
Property 2. For all n x n matrices A and B one has
n
k=l
n
Ck = CIC2·· ·C n·
k=l
a".(k)k·
Now, e = a(k) if and only if k = a-I (e) = Tce), where T is the unique inverse
permutation to a. As a runs through the n! permutations in Sn, its inverse T does the
same. Also, sgn(T) = sgn(a- l ) = sgn(a). Thus replacing a by T = a-I in (43) gives
det(A) = Lsgn(T)
r
n n
(=1
alr(l) = Lsgn(T)
r
nn
(=1
a;(t)l = det(A T ),
where hj = Bej. The last function is linear in each hj separately and vanishes if two hjs
coincide. It follows, as in the proof of the basic theorem, that
(45)
336 5 LINEAR ALGEBRA
(46)
The constant Ajk is called the cofactor of ajk in detCA). Equation C45) is called the
cofactor expansion of detCA) along row i. It is not immediately obvious that the cofactors
in (45) and (46) are the same, but this will be verified below.
Minors and Cofactors. For n = 2 and 3 we have seen how to calculate the cofactors
Aij as determinants of order n - 1. We shall now extend this to all orders n 2: 2. To begin,
let Mij be the Cn - 1) x Cn - 1) matrix obtained by deleting row i and columnj of A.
Then the following theorem holds.
Proof. The linearity of V (aI, a2, ... , an) with respect to al gives
(48)
=
k=l
In particular, we have
1 al2 aln
0 an a2n
(49) All =V(el,a2, ... ,an) =
0 an2 ann
5.6 DETERMINANTS AND CRAMER'S RULE 337
In this determinant, if the first column is multiplied by -al2 and added to the second
column, the value of the determinant is unchanged. This is because the amount added is
1 -al2 al n 1 1 aln
0 0 a2n 0 0 a2n
(50) = -al2 =0.
0 0 ann 0 0 ann
The effect on (49) of adding (50) is to replace al2 by zero (equation (49) is a linear
function of a2!)' Proceeding similarly, we may replace al3, a14, ... , al n by zero. Thus
1 0 0
0 a22 a2n
All =
0 a n2 ann
Now, this last determinant is clearly a linear function of each of the (n - I)-vectors
Moreover, if two of these vectors are equal, then All = O. Finally, if the vectors are
the standard (n - I)-dimensional basis vectors, then All = 1. Hence the uniqueness
statement of the basic theorem gives
All = = det (M ll ) ,
as was to be shown. A similar argument works for the other cofactors. For example,
0 a n2 ann 0 a n2 ann
1 0 0 al2 an aln
0 al2 al n a32 a33 a3n
= - det (Mu) .
0 a n2 ann a n2 a n3 ann
338 5 LINEAR ALGEBRA
Note that in (47) the factor (-l)i+j arises from moving the coefficient aij in det(A) from
the (i,j)-position to the (1, I)-position by i transpositions ofrows and j transpositions
of columns.
All An
All
An
(51) adj(A) = (cof(A))T = ( :
A in
(53)
To verify this, note that if i = j then (53) is just the cofactor expansion of det(A) along
the ith row. If i #- j then the left side of (53) is the cofactor expansion of a determinant
whose ith andjth rows are equal. This gives zero in accordance with (53).
Cramer's Rule. Equation (54) gives the n-dimensional generalization of Cramer's rule
for n = 2 and 3, given by (12) and (23) above. It states that if A is a nonsingular n x n
5.6 DETERMINANTS AND CRAMER'S RULE 339
det(A)
J = 1,2, ... ,n,
where Aj is the matrix formed from A by replacing the jth column of A by the column
vector b.
Evaluation of Determinants. How can we compute the value of an n x n
determinant? One answer is to use the combinatorial rule (43):
This sum contains n! terms each of which is a product of n terms, for a total of n! x n
multiplications. This count grows rapidly with n. Thus, for example,
1O! x 10=36,288,000.
This rapid growth makes the combinatorial formula impractical for all except very small
values of n. A second choice would be to use the cofactor expansions such as
= L aijAij
n
det(A) (i fixed).
j=l
However, when the cofactors are expanded into lower-order determinants, all n! x n
products are again generated.
A more efficient method is to use elementary row and column operations to transform
det(A) into a new determinant that has many zero entries. The use of the cofactor
expansion then becomes much simpler. The reduction can be based on the following
three rules.
Rule 1. If two rows (or columns) are interchanged then the determinant changes sign.
Rule 2. If a row (or column) is multiplied by a constant then the determinant is
multiplied by that constant.
Rule 3. If a scalar multiple of a row (or column) is added to another row (or column)
then the determinant is unchanged.
For row operations these rules were proved above. The corresponding column
operations follow from the row operations and the rule det(A T ) = det(A).
340 5 LINEAR ALGEBRA
EXAMPLE 1. The use of Rules 1-3 to simplify the computation of a determinant will be
illustrated by the following example:
8 0 0 4 8 0 0 4
9 1 -7 2 #1 9 1 0 -19
det(A) = -8 =
1 14 2 -8 1 0 44
0 0 1 -3 0 0 1 -3
8 0 4 8 0 4
#2 #3
9 1 -19 17 0 -63
-8 1 44 -8 1 44
8 4
~ (_1)2 ~ (-1)2«8)(-63) - (4)(7)) = -572.
17 -63
Here the first equation defines det(A). Step #1 subtracts (-7) times row 4 from row 2
and (14) times row 4 from row 3. Step #2 is the cofactor expansion along column 3.
Step #3 is to subtract row 3 from row 2. Step #4 is the cofactor expansion along column
2. Finally, Step #5 is the cofactor expansion of a 2 x 2 determinant.
Use of Gauss Reduction. A variant of the preceding method may be based on the
Gauss reduction algorithm. In matrix form the algorithm is
(56)
(see Section 5.4, equation (18)), where each Ej is a matrix that describes an elementary
row operation and U is an upper triangular matrix (all entries below the main diagonal are
zero). Taking the determinant of (56) and using the rule det(AB) = det(A) det(B) gives
Moreover, the terms det(Ej ) and det(U) can be found by inspection. Thus if Ej represents
the interchange of two rows then det(Ej ) = -1 (Rule 1); if Ej represents multiplication
of a row by a constant c then det(Ej ) = C (Rule 2); and if Ej represents addition
of a scalar multiple of a row to another row then det(Ej ) = 1 (Rule 3). Moreover,
det(U) = UllU22· .. Unn (product of diagonal elements) by cofactor expansion along the
first column. This is illustrated by application to the determinant of Example 1.
5.6 DETERMINANTS AND CRAMER'S RULE 341
0 0 1 -3 0 0 1 -3
1 1
1 0 0 2
1 0 0 2
C:26)
7 7
#4 0 1 -7 -'2 0 1 -7 -'2
17
~ (8)(21) 17
0 0 21 "2 0 0 1 42
0 0 1 -3 0 0 1 -3
1
1 0 0 '2
0 1 -7 _Z
~ (168) 17
2
= (168) (143)
#7
- 42 = -572,
0 0 1 42
143
0 0 0 42
I E~~~~i~.~~..~..~ .~
Matrices of order 2. Evaluate det(A) for the following matrices.
1.
( ~ ~)- 4. ( -~ -~ )- 7.
(~ -b)o .
2.
(~ o . -1 ) 5.
(~ ~). 8. ( -A
-1 4 ~ A ).
3.
(~ ~). 6.
(~ -: ). 9.
(~ -~ ).
Matrices of order 3. Evaluate det(A) for the following matrices.
342 5 LINEAR ALGEBRA
10.
(~ ~ ) I
5
5
13.
(r
4
2
2 r)
16.
(~ 0
5
-I)
11.
(~ ~ ) 3
5
5
14.
(~ -~ )0
I
0
17.
(~ -~ )
2
I
0
12.
(~ ~ ) I
0
S
15.
(~ -I)
0
0 18.
(I -~ )
2
Matrices of order 4. Evaluate the determinant of the given matrix and cite the determinant rules used.
(" lL)
n
0 0 0
o b
U
0 12 c 5
19. 21.
I 2 I~ . 0 b
o 0 0 0 0
n un
I
(
a 0 0 6
-I b 0 b
20. 22.
0 -I c c 4
0 0 -I 0 0
Warning. While Sarrus's rule exists for 2 x 2 and 3 x 3 determinants, there is no rule for 4 x 4 and
higher! In the exercises below, apply Sarrus's rule for n = 3 to evaluate the given determinant.
23.
(~ 0
5
-1 )- 26.
A=(~ ~)-
2
24.
(~ 3
0 -1
~)- 27.
A= (1 ~)-
(! A= (l :)
2
~)-
4
0
25. 1 28.
-1
-1 -1
Cramer's Rule. Solve for x
in the system Ax = b by Cramer's rule.
29.
(~ =~)x=(~). 31.
(~ 1
5
5
~ )x= (l~)-
30. (_~ ~)x=(~). 32.
(~ 5
~)X=(j)-
(-;
5
0 0
}=m
0
33.
1 2
0 -2
}{;)
0 0
(-!
0
34.
5
0 -5
35. (Definition of DeterminanO Using just the definition of determinant, evaluate
2-x
1 3-x =0.
n-x
37. (Determinant Identity) Show by reduction to triangular form that
222 2
233 3 = 1.
234 ... n
344 5 LINEAR ALGEBRA
38. (Number Theory) Use Cramer's rule to prove the following divisibility result: If three numbers each
with three digits are each divisible by an integer m, then the determinant formed from their digits is
divisible by m. For example, the numbers 104, 143, 195 each have three digits and are divisible by 13,
and therefore the determinant
is divisible by 13.
Applications to Geometry. Establish the follOwing results.
39. (Area of a Triangle) Let a triangle have vertices (Xl ,Yl), (X2,Y2), (X3,Y3). Prove that the area A of the
triangle is given by the determinant equation
1 ( Xl X2 X3)
A = - det Yl Y2 Y3 .
2 1 1 1
40. (Equation of a Line) Given a line passing through two distinct points (Xl,Yl), (X2,Y2), prove that the
equation of the line is
XXI X2)
det ( Y Yl Y2 = 0.
1 1 1
41. (Equation of a Plane) Given a plane passing through three points (Xl ,Yl, Zl), (X2,Y2, Z2), (X3,Y3, Z3),
prove that the equation of the plane is
XXI x2 X3)
det ( Y Yl Y2 Y3 = 0.
Z Zl Z2 Z3
1 1 1 1
42. (Volume of a Parallelepiped) The volume of the parallelepiped in space whose concurrent edges are
the vectors
is known to equal the scalar triple product X . (Y x Z). Prove that the volume V is also given by the
determinant equation
Xl x2 X3)
V = det ( Yl Y2 Y3 .
Zl Z2 Z3
m m
Hint: The answer is a certain fraction of the volume of the associated parallelepiped.
47. Volume of a parallelepiped whose concurrent edge vectors are
(1) Ax = AX and x =I o.
Any vector x that satisfies (1) is said to be an eigenvector for the eigenvalue A. The
problem of determining the eigenvalues and eigenvectors of a given square matrix A is
called the eigenvalue problem for A. The pair x, A is called an eigenpair of (1).
Eigenvalues often appear as the frequencies of vibration of mechanical oscillators.
This will be illustrated by the case of a pair of coupled harmonic oscillators.
Two bodies of mass m slide on a smooth plane and are coupled by springs with
spring constants k. The displacements Xl and Xl are measured from their equilibrium
positions (vertical dashed lines) and are counted as positive when to the right. With these
conventions, Hooke's law and Newton's laws of motion give the following differential
equations (where primes denote time derivatives).
Equations of Motion.
Normal Modes of Vibration. To find the possible modes of vibration let us look for
solutions of the special form
(3)
where Cl and C2 are constants, not both zero. Such solutions are called synchronous
vibrations or normal modes of vibration. Substituting the functions (3) in equations
(2) gives
If TCt), Cl, and C2 are not zero, the variables can be separated to give the equations
and hence
or Ac = AC, where
(A - AI)c = 0.
2 -A -1
det(A - AI) =
-1 2- A
= (A - 2)2 - 1 = (A - l)(A - 3) = 0.
Hence there are two eigenvalues,
and
k k k k
(7) f-L1 = -AI = -,
m m
f-L2 = -A2
m
= 3-.
m
Eigenvectors. The eigenvectors for eigenvalue Al = 1 are the nontrivial solutions of
-C2 )
-C2
= (0).
0
Normal Modes of Vibration. The pairs (AI, C1) and (A2, C2) give two normal modes:
348 5 LINEAR ALGEBRA
Recall that w is the frequency of the simple harmonic oscillator with mass m and Hookes
constant k. Mode 1, with the lower frequency, is called the fundamental mode. Note
that for it Xl (t) = X2 (t). Thus the middle spring is not stretched and the two masses
move like a simple oscillator with mass 2m and spring constant 2k. It can be shown that
the general solution of equations of motion (2) is just the sum of modes 1 and 2. Thus
A= (0 -1)
10'
complex). Applying this to the characteristic polynomial, we see that every n x n matrix
A has at least one eigenvalue A]. The corresponding set of eigenvectors is the eigenspace
N (A - A]In) with dimension 1 or more. This is illustrated by Example 1 in Section 5.6.
EXAMPLE 2. The 5 x 5 matrix
0 1 0 0 0
0 0 1 0 0
A= 0 0 0 1 0
0 0 0 0 1
0 0 0 0 0
If the characteristic polynomial has several distinct roots then each root will be an
eigenvalue with at least one corresponding eigenvector. More precisely, the following
theorem holds.
Theorem. Let the n x n matrix A have k distinct eigenvalues A], ... , Ak (so 1 ::s k ::s n).
Then the characteristic polynomial has the factorization
(ll)
where m], ... , mk are positive integers whose sum is n. Moreover, the eigenspaces
(12)
satisfy
The exponent mj is called the algebraic multiplicity of eigenvalue Aj, while dim (Aj)
is called its geometric multiplicity. It can be shown by simple examples that dim (Aj)
can be any integer between 1 and mj.
Linear Independence of Eigenvectors. If a matrix A has eigenpairs (A], c]) and
(A2, C2) with A] -# A2 then c] and C2 are linearly independent. For if they were linearly
350 5 LINEAR ALGEBRA
whence
and so Al = A2 (since a i- 0). The same conclusion holds if A2 =1= O. This contradiction
shows that CI and C2 are linearly independent. The result is extended to more than two
eigenvectors by the following theorem.
Theorem. Let the n x n matrix A have distinct eigenvalues AI, A2, ... ,Ak (so 1 ::: k ::: n)
and corresponding eigenvectors CI, C2, ... , Ck. Then the vectors CI, C2, ... , Ck are linearly
independent.
(14)
holds then the coefficients ai, a2, ... , ak must all be zero. The proof will be by induction
on k.
(15)
Now subtract Ak times (14) from (15). The last terms cancel and we have
(16)
But the inductive hypothesiS implies that the coefficients in (16) are all zero. Since the
eigenvalues are distinct, it follows that a] = a2 = ... = ak-I = O. Then, returning to
(14), we have akCk = 0 and hence ak = O. This completes the proof.
5.7 EIGENVALUES AND EIGENVECTORS 351
where {el, e2, ... , en} are the standard basis vectors, defined by
Note that the order of the indices k, .e is not the same as for a matrix product. Equations
(19) imply that
<Pnl <Pn2
Thus the matrix <l> may be written as
(20)
where the notation means that <l> is the matrix whose first column is <Pl, second column
<P2, etc. Clearly, <l> is nonsingular because the vectors <PI' <P2, ... , <Pn are a basis and
hence are linearly independent.
352 5 liNEAR ALGEBRA
or in matrix notation,
(21) x = <l>x'.
Similarly,
(22) b = <l>b' ,
where b' is the unique vector defined by b ' = <I>-lb. Substituting (21) and (22) into
Ax = b gives
(23)
then
The eigenpairs of A are readily seen to be (Aj, ej), where el, e2, ... , en is the standard
basis. Moreover, A is invertible if and only if the eigenvalues are all nonzero. In this case
A -I
1\ = d'lag ('11.1-I ,11.2
,-I ,-I)
, ... , n
II. .
where CI, C2, ... , C n are the columns of C, then the rules of matrix multiplication imply
that
AC = (AcI AC2'" AC n)
=CA
= (AICI A2C2'" AnCn ).
AC2 = A2 C2,
Moreover, each Cj =f:. 0 because C is nonsingular. Hence the column vectors of C are all
eigenvectors of A. Also, they are linearly independent, again because C is assumed to be
nonsingular. Conversely, if A has n linearly independent eigenvectors CI, C2, ... , Cn then
C, defined by (26), is nonsingular, and (25) holds. Hence the following theorem is valid.
Then
so
T
Cl = ( 1, -1 ) ,
T
C2 = ( 1,1 ) ,
hence
c 1 _ adj
- detC - 2
C_~ (1 -1) 1 1
and
A-( 01)
- -1 0 .
Then
Al = i, Cl = (1, if,
A2 = - i, C2 = (l,_i)T,
5.7 EIGENVALUES AND EIGENVECTORS 355
hence
and
C-'AC~(~_~).
ExAMPLE 5. (Matrix Not Diagonalizable). Let
A=(~ ~).
Then
so
Cl = (l,of·
Thus A is not diagonalizable.
Theorem. Let n be a positive integer and let A denote an n x n self-adjoint matrix, real
or complex. Then A has the following properties:
(30) A is diagonalizable.
Property (30) follows from (29) by the preceding theorem. We shall give proofs of (27)-
(29). Only (29) is difficult. It will be proved by induction on the order n, starting with
the simple case n = 1.
Proof of (27). Recall that every n x n matrix A satisfies (Ax, y) = (x, A*y) for all
vectors x and y. In particular, self-adjoint matrices A satisfy (Ax, y) = (x, Ay) for all
vectors x and y in en. In this equation take x = y, where Ax = AX for some (possibly
complex) A and some x =f. O. This gives (Ax, x) = (x, Ax), or A(X, x) = I(x, x), by
the properties of the inner product in en.
This can be written as
since (x, x) = IIx1l2. Since x =f. 0, it follows that A = I, which implies that A is real.
proof of(28). Let (A, x) and (A', y) be eigenpairs for A with distinct eigenvalues Aand A'.
Substituting Ax = Ax and Ay = A'y into (Ax, y) = (x, Ay) gives (Ax, y) = (x, A'y),
or, since A and A' are real,
(A - A') (x, y) = o.
Since A =f. A', it follows that (x, y) = 0 and x and yare orthogonal.
To begin, recall that every n x n matrix has at least one eigenpair (A I, Xl), so
Next construct an orthonormal basis {YI, Y2, ... , Yn} such that YI = Xl. This can always
be done by the Gram-Schmidt method. Then define Q to be the n x n matrix whose
column vectors are YI, Y2, ... , Yn' The notation
(31)
5.7 EIGENVALUES AND EIGENVECTORS 357
will be used. Clearly, Q*Q = ((yj , Yk)) = In because (Yj , Yk) = Ojk. The same is true for
QQ*, so
Q*Q = QQ* = In
and Q* = Q-l. Such matrices are said to be orthogonal matrices. It follows from (31) that
and hence
Here
while
o o
bl,l b1,n-l
bn-1,n-l
(32)
The vectors Zl, Z2, ... , Zn are orthonormal in Rn (or en) because the vectors iI, i 2 , ... ,
i n - l are orthonormal in Rn- l (or en-I). Finally, the vectors
Xj == QZj, j = 1, 2, ... , n,
whence
Moreover,
and the eigenvectors Xl, X2, ... , Xn are orthonormal. This completes the proof of (29).
Calculation of Eigenvalues and Eigenvectors. It is important for many appli-
cations to have methods for computing eigenvalues and eigenvectors of matrices. In
5.7 EIGENVALUES AND EIGENVECTORS 359
Exercises 5.7
1. A=(_~ ~) 8. A = ( i 0
-1 -~ )-
-1
2. A = ( ~ ~) -5
3. A = ( ~ ~) 9. A = ( ~ 4
0 ~)-
~) ~ ~)-
4. A= ( -25
10. A = ( 0
0
5. A = ( _~ -! ) 0
~
2 ~ ~)-
~)-
11. A = ( 0
6. A = ( 1 0
0 -1
0
7. A = ( ~
2
4
-1 )
-2 . 12. A = ( ~ 4
0 ~)-
6 -3
Eigenvectors. The problem of finding eigenvectors reduces to the computation of at most n reduced row
echelon forms. In each exercise below determine all the eigenvectors. A computer algebra system will be
helpful.
360 5 LINEAR ALGEBRA
)
2
17.A=(~
!)
2
23A~U
0 0
0 -1 2 0
)
0
n
2 -1
18. A = ( ~ 4
6
-2
-3
0 0 -1
A~U
0 0
-~ )
3 0
~
0
24 0 2
19. -1
A= ( 0 0
25. (Multiplicity) Let A be a 7 x 7 real matrix with eigenvalue A of multiplicity 7. Show that the geometric
multiplicity of A is 7 if and only if A = diag(A, ... , A).
26. (Characteristic Polynomial) Prove by means of cofactor expansion that the determinant of A - Un is
a polynomial of degree n with highest order term (_A)n and constant term det(A).
27. (Eigenspace Dimension) Produce five matrices A of size 5 x 5 with all eigenvalues zero and eigenspace
N(A) of dimensions 1,2,3,4 and 5. This shows that if the algebraic multiplicity of an eigenvalue is 5
then the geometric multiplicity can be any integer from 1 to 5.
28. (Similarity) Let A and B be similar matrices. Prove that det(A) = det(B).
29. (Similarity) Do there exist two matrices A and B with the same characteristic polynomial, but A and B
are not similar matrices?
30. (Similarity) Let A and B be similar matrices. Show that they have exactly the same characteristic
polynomial.
31. (Similarity) Let the n x n matrix A be similar to a diagonal matrix D, D = 4>-1 A4>. Show that the
columns of 4> are a basis for Rn It is not assumed that the eigenvalues of A are distinct; e.g., A could be
the n x n identity matrix.
32. (Similarity) Let the n x n matrix A be similar to a diagonal matrix D, D = 4>-1 A4>. Show that any
matrix B similar to A satisfies an equation D = III-I BIII for exactly the same diagonal matrix D.
33. (Eigenvecctors) Let x = (3,1, -1) and y = (1, -1, 1). Define A = xTy (matrix product). Find the
eigenvectors of A.
34. (Trace) Let x and y be two nonzero column vectors in Rn and define A = x T y (matrix product). Show
that the trace of A is an eigenvalue and that all the other eigenvalues are zero.
5.7 EIGENVALUES AND EIGENVECTORS 361
implies
36. (Normal Modes) Show directly that the normal modes of vibration are
Xl = x2 = Al cos(wt) + Bl sinew!)
and
mx~ =
h( -2Xl + X2),
mx~ = h(XI - 2X2)
where w = ,/Wm.
37. (Eigenvector) Establish the formula C2 = 0, -1/ for the second eigenvector in Example 1.
Characteristic Polynomial. Theoretical considerations show that the characteristic polynomial of an n x n
matrix A has n roots in the complex field. The practical problem of determining the polynomial coefficients
is considered in the exercises below.
38. Let A be a 2 x 2 matrix with real coefficients aij. Show that the characteristic polynomial of A can be
written in the form
Companion Matrix. Let xn = an_1Xn- 1 + ... + alx + ao be a polynomial of degree n ::: 1. The matrix
C below is called the companion matrix.
o o o
o o
o )
The five exercises below develop properties of the companion matrix.
44. Describe in words how to obtain the companion matrix from the n x n identity matrix.
45. Show that xn = an_1X n- 1 + ... + al x + aO is the characteristic equation of the companion matrix for
the special cases n = 2 and n = 3.
46. Choose the third row of the matrix A below such that its characteristic polynomial is equal to -A 3 +
3A 2 - 5A + 2.
48. Let t = A be a root of tn = an-l tn- 1 + ... + al t + ao. Show that A is an eigenvalue of the companion
matrix C with corresponding eigenvector
49. (Eigenspaces) Prove that the eigenvectors of an n x n matrix A, corresponding to a given eigenvalue
A, form a subspace of Rn
50. (Column space) Let A be a nonzero eigenvalue of the n x n matrix A. Show that each eigenvector of A
corresponding to A belongs to the column space of A.
51. (Column Space) Let the 3 x 3 matrix A have the property that its three column vectors add to (7,7,7) T
Prove that 7 is an eigenvalue of A.
52. (Eigenvalues of Matrix Products) Let A be an eigenvalue of the matrix product AB of two real n x n
matrices A and B. Prove that A is an eigenvalue of the product BA.
53. (Eigenvalue of the Inverse Matrix) Let A be a nonzero eigenvalue of the n x n nonsingular matrix A.
Show that l/A is an eigenvalue of A -1 .
55. (Eigenvectors) Let A be an eigenvalue of the n x n matrix A with corresponding eigenvector x. Show
that Akx is an eigenvector of A corresponding to A, provided that Akx # 0, for k = 1, 2, 3, ..
56. (Eigenvectors of Similar Matrices) Let A be an eigenvalue of the n x n matrix A with corresponding
eigenvector x. Assume that B is similar to A, B = <1>-1 A<I>. Prove that y = <I>-lx is an eigenvector of
B corresponding to eigenvalue A.
CHAPTER 6
Vector Analysis
Vector analysis is primarily the invention of josiah Willard Gibbs (1839-1903, American
mathematician and physicist). His work on vector analysis first became known through
his widely circulated Yale lecture notes from the years 1881-84. The first definitive
textbook, entitled Vector Analysis, was published in 1901 by Edwin Bidwell Wilson-
a follower of Gibb's Yale lectures. The notation and methods of "Gibbs-Wilson" have
found almost universal use throughout most of the twentieth century
Vector analysis is the theory of geometric vectors in Euclidean space-including
vector algebra and vector differential and integral calculus. The theory has applications
to many problems of geometry and mathematical physics. Some of these are developed
in this and subsequent chapters.
Most first-year calculus textbooks include an introduction to vector analysis. More-
over, science and engineering students are also introduced to concepts of vector analysis
in their introductory physics and engineering courses. For these reasons the present text
will review the basic concepts only briefly before moving on to the main theorems and
applications.
366 6 VECTOR ANALYSIS
Q'
C = A+B.
Multiplication of Vectors by Scalars. In vector analysis a scalar quantity; or scalar,
is a number of the real number field. The product of a vector A and a scalar a is a vector,
denoted by aA, which is defined as follows:
Case 1 (a> 0). In this case aA is the vector parallel to A whose magnitude (= length)
is the product of a and the magnitude of A.
Case 2 (a = 0). In this case aA = OA = 0, where 0 is the zero vector that corresponds
to the trivial displacement pP (no motion).
Case 3 (a < 0). In this case aA is the vector with direction opposite to that of A and
magnitude equal to lal times that of A.
The algebraic properties of the set of geometric vectors in E2 are summarized in the
following theorem.
Theorem. The set of geometric vectors in E2 (and the corresponding set of all
displacements of E2 ) is closed under the operations of vector addition and scalar
multiplication and forms a vector space in the sense of Section 5.3. This means that the
following eight properties hold.
The proofs of these simple properties, based on plane geometry in E2 , will be left as
exercises. The identification of the geometric vectors in E2 as a vector space allows us to
apply to them the concepts and results of Chapter 5. In particular, we have the following
corollary:
Proof. Only the dimension statement needs to be checked. The geometry of E2 permits
us to construct pairs of noncollinear line segments and corresponding vectors Al and
A2 . We shall show that any such pair is a basis, so that each vector A in E2 can be written
uniquely as
(9)
y p
Y2 Q
Yl p R
L-~------------~--~X
The distance between two points in E2 may be computed from their Cartesian
coordinates. Forlet P ++ (Xl ,Yl) and Q ++ (X2,Y2) be any pair of points, as in Figure 5,
and mark the point R ++ (X2,Yl).
PR
Then the distance I I equals IX2 - xII, the distance IRQI equals IY2 - Yll, and hence,
by the Pythagorean theorem,
are called the components of A in the given coordinate system. Clearly, the mapping
A ++ (aI, a2) is a bijective map of the vector space of geometric vectors onto R2. Many
textbooks identify the vector space of geometric vectors with the vector space R2 of
Chapter 5. This can lead to confusion because the pair (aI, a2) depends on the choice of
the coordinate system, while A does not.
Euclidean Space E3 . The mathematical setting for Newtonian mechanics, and for
much of the mathematical physics of the eighteenth and nineteenth centuries, is the three-
6.1 VECTOR ALGEBRA 371
dimensional Euclidean space E3 of high-school solid geometry The theory of E3 and its
vectors closely follows that of E2 . Thus the geometric vectors for E3 are the equivalence
--+
classes of directed line segments PQ, where equivalence is defined as in Figure 1. Vector
addition and scalar multiplication are defined as before, and the set of all geometric
vectors forms a real vector space under these operations. The one real difference is that
the geometric vectors in E2 form a two-dimensional vector space, while those in E3 form
a three-dimensional vector space. Thus if Ai, A2 and A3 are any noncoplanar vectors in
E3 then every vector A can be written uniquely as
for E2 , we shall denote by i, j and k the unit vectors along the positive x, y and Z axes,
respectively: With this notation every geometric vector has a unique representation
The numbers al, a2, a3 are called the components of A in the given coordinate system.
--+
By associating A with the directed line segment PQ above, we see that al = X2 - Xl,
a2 = Y2 - Yl, a3 = Z2 - Zl, and hence the vector norm of A satisfies
(12) A B-
. - 1 0 ifA=OandlorB=O,
IAIIBI cose if A and B are both not 0,
where e is the smaller of the two angles between A and B, so 0 :::: e : : 1(; see Figure 7.
The dot product is also called the scalar product or the inner product in some
texts. We shall see that it has many uses in applications of vector algebra to problems of
geometry and physics. Here we shall give several properties of the dot product that are
useful in applications. The most important is described by the following theorem.
(14)
A-B
B
A FIGURE 7. Definition of A· B
6.1 VECTOR ALGEBRA 373
we have
(16)
Additional Properties. The following useful rules follow readily from (12) and (14):
(17) A . B = B . A (symmetry).
(18) (aA + bB) . C = a(A . C) + b(B . C) (linearity).
(19) A . A = IAI2 > 0 unless A = 0 (positivity).
These properties will be used freely in vector algebra calculations.
Position Vectors in E3 • For geometrical calculations it is often convenient to
represent the points of E3 by vectors in E3 . We shall illustrate the method by applying
it to a description of the lines and planes in E3 . To begin, we introduce a Cartesian
coordinate system and write
as illustrated in Figure 8.
Lines in E3. A line L in E3 is specified by two distinct points on it or, equivalently, by
a point on it and a nonzero vector parallel to it. The second deSCription gives the formula
where rand ro are the position vectors of any two points on L and A =f. 0 is parallel to
L; see Figure 9.
374 6 VECTOR ANALYSIS
(22)
y = Yo + ta2,
Z = Zo + ta3,
where (al, a2, a3) =f. (0,0,0). A nonparametric description is obtained by eliminating t
to get
x - Xo Y - Yo Z - Zo
(23) - - = - - = - - (= t).
al a2 a3
Here it is understood that if, for example, a2 = 0 then y = yo. Thus if al = a2 = 0 in
(23) then L is the line described by x = Xo, Y = Yo, and Z = Zo + a3t. This is the line
through (xo,Yo, Zo) that is parallel to the z-axis.
Planes in £3. A plane Il in £3 is specified by three points on it or, equivalently, by
a point on it and two linearly independent vectors that are parallel to it. The second
(24) r = TO + sA + tB,
where TO is on n and A and B are linearly independent, so that neither is a multiple of
the other; see Figure 10.
To find a coordinate representation of n, let
TO = xoi + yoj + zok,
A = ali + a2.i + a3k,
B= bli + b2j + b3 k.
x = Xo + sal + th,
Y = Yo + sa2 + tb 2,
Z = Zo + sa3 + tb3·
A nonparametric representation can be found by introducing the vector N orthogonal to
A and B, so
N . A = N . B = 0 (and N f. 0).
N . (r - ro) = O.
In particular, if
N = ai + bj + ck
then an equation for n is
a(x - xo) + bey - Yo) + c(z - zo) = 0,
or
ax + by + cz = d,
A
ro
FIGURE 10. Plane n through ro and parallel to
A and B
376 6 VECTOR ANALYSIS
(25) AxB= I0
nlAllBI sin e
if A = 0 andJor B = 0,
if A and B are both not 0,
where e is the angle between A and B, chosen so that 0 ::: e ::: n, and n is a unit vector
perpendicular to A and B.
There are two unit vectors that are perpendicular to A and B, each being the negative
of the other. The desired vector n is determined by the following rule.
Right Hand Rule. The direction of n is the direction of advance of a right-threaded
screw, perpendicular to the plane of A and B, if the screw's head is rotated from A to B
through the angle e; see Figure 11.
The cross product is also called the vector product or the outer product in some
texts. Some of its applications to geometry and physics are developed in this and
subsequent chapters.
Properties of the Cross Product. The following properties can be proved directly from
the definition of the cross product:
(26) B x A = -A x B (antisymmetry).
AxB
AxB
~ ~~
.... .... .. •....•...•••....•.•...............
= a2b3 - a3b2,
(28)
= a3bl - al b3,
and where ai, a2, a3, bl , b2, b3 are the components of A and B, defined by (13). This is
readily derivable from the linearity property (27) and the elementary products
i xi =j xj = k x k = 0,
i xj =-j xi = k,
j x k = - k x j = i,
k x i =- i x k = j.
The latter follows directly from (25). Indeed,
j k
(29) A x B= al a2 a3
bl b2 b3
Of course, (29) is not a true determinant because the first row consists of vectors rather
than scalars. However, if (29) is expanded in minors along the first row then equation
(28) is recovered. Equation (29) is a useful mnemonic for remembering (28).
assigns a scalar to each ordered triple of geometric vectors. From (29) it is easy to see
that if C = cli + c2.i + C3k then
Cl C2 C3
(31) [A,B,CJ = al a2 a3
bl b2 b3
Conversely, the properties of the cross product are all easily derivable from (30) and (31).
For example, (26) follows from (31) and the properties of determinants. Derivations from
(31) of other properties are left as exercises.
Vector Identities. The following identities are frequently useful in cross product
calculations.
Verification of these identities are straightforward but tedious. More concise proofs can
be found in [D-SJ and other vector calculus texts.
Notational Conventions. Some references use A = xi + yj + zk and the triple
notation A = (x,y, z) to mean the same thing. This is valid when a fixed basis i, j, k is
used in calculations. The notation is used in computer algebra systems to apply linear
algebra methods to vector analysis problems. We will adopt this notational convenience
when the context makes it clear that a fixed basis i, j, k is being used:
1. (Parallelogram Diagonals) Given an origin 0 above the plane of parallelogram ABCD, form vectors
a = oA, b = aB, c = OC, d = 00. The diagonals of the parallelogram intersect at a point P. Find a
vector formula for oP.
2. (Parallelogram) Let A, B, C, D be the consecutive vertices of a parallelogram. Let 0 be an origin not in
-+
the plane of the parallelogram ABCD. Define a = OA, etc. Explain geometrically why b - a = c - d.
3. (Equality of Vectors) Show that two vectors a, b are equal if and only if la - bl = O.
4. (Zero Vector) Is the zero vector 0 a directed line segment? What is the magnitude of the zero vector?
5. (Parallel Vectors) Find a condition for two nonzero vectors a and b to be parallel.
6. (Magnitude) Let a, b be nonzero vectors and define c = Ibi/ial. Find the magnitude of ca.
6.1 VECTOR ALGEBRA 379
7. (Midpoint) Let a, b be vectors joining the origin 0 to points A, B respectively Show that i(a + b)
corresponds to the midpoint of segment AB.
8. (Normal to a Plane) How many distinct unit vectors are orthogonal to a given plane'
qa + c2(a + b) + c3(a ~ b) = 0
for all vectors a and b.
Projv(u) = u·v
-~,
~
Projv(u) = u·v
-~ v.
v·v
Ivl
Let u = (t, et , e2t ), v = (t + I, et - l ,e l - t ) and let t = 0.6. Compute the requested quantity and display the
result with 3-digit accuracy.
u = (t 2 ,et ,e- t ),
v= (t " e2t e- 2t ) ,
w = (I,~I,O),
and let t = 0.5. Compute the following answers and display the result with 3-digit accuracy
16. Find the area of the parallelogram determined by u and v.
17. Compute the volume of the parallelepiped determined by u, v, w.
21. (Angle in a Triangle) A triangle in R2 has vertices P = (3, ~l), Q = (4,5) and R = (~I, ~2). Find
e
the cosine of the angle at the vertex P.
22. (Scalar Triple Product) Show that if any two vectors in a scalar triple product are equal, then the value
of the scalar triple product is zero.
23. (Coplanar Points) It is known that three vectors are coplanar if and only if their scalar triple product
is zero. Use this fact to find x such that the three vectors below are coplanar.
24. (Vector Triple Product) Let a = 0,2, -3), b = (2, -1,1) and c = (1, -1, 1). Compute the vector
triple product a x (b x c).
25. (Scalar Triple Product) Given three vectors a, band c, simplify (a + b) . (b + c) x (a + c).
26. (Vector Triple Product) Prove the second of the fundamental identities
27. (Vector Line Equation) Find the vector equation of the line through 0, -1, 1) and parallel to 0, 2, -1).
28. (Vector Line Equation) Find the vector equation ofthe line through the points (1, -1, 1) and (-1,0,1).
29. (Vector Line Equation) Find the point in the plane x + 5y + 6z = -1 that is also on the line through
(0,1,1) in direction (-I, -1,2).
30. (Near Point) Find the point R nearest to the origin on the line L through P = 0, -4, 6) and Q =
(3,1,2)
31. (Equation of a Plane) Find the equation of the plane passing through the three points P = (t, t2 , t3 ),
Q = (t sin(t), t cos(t) , t2 ), R = (sin(t), cos(t) , t), where t = 0.886.
32. (Intersection of Planes) Find an algebraic description of the intersection of the two planes 3x-y+4z =
3 and -4x - 2y + 7z = 8. Plot the two planes on one set of axes and show the intersection set.
33. (Angle Between Planes) Find the angle in degrees between the two planes 3x - y + 4z = 3 and
-4x - 2y + 7z = 8.
Geometry. Prove by means of geometrical definitions the follOwing properties of geometrical vectors. In
the proofs, use only methods of plane geometry
34. A + B = B + A for all vectors A and B.
35. A + (B + C) = (A + B) + C for all vectors A, Band C.
36. A + 0 = 0 + A for all vectors A.
(36) B x A = -A x B (antisymmetry).
6.2 VECTOR CALCULUS OF CURVES IN SPACE 381
(38) A B= {0 if A = 0 and/or B = 0,
x nlAllBI sine if A and B are not O.
of real-valued functions.
Space Curves in E3 • Relative to an origin 0 in E3 , the points pet) in E3 , defined by
-:=+
(2) OP(t) = f(t), a::: t::: b,
represent a point set in £3' The space curves treated in this section will be constructed
in this way. As preparation for the construction, some simple concepts of vector calculus
are reviewed.
382 6 VECTOR ANALYSIS
extends immediately to the various products involving vectors. Thus if aCt) and f(t) are
scalar and vector functions, respectively, then
Care must be taken in (12) to preserve the order of the vector factors because the vector
product is not commutative.
Space Curves. The space curves studied below are subsets C of Euclidean space E3
that have parametric representations
where x(t), yet) and z(t) are real-valued functions of t, defined for a ::: t ::: b. We
shall assume that the correspondence t --+ r(t) is one-to-one (or bijective). This means
that each point of C corresponds to exactly one point of the interval a ::: t ::: b. The
correspondence is also assumed to be continuous, which implies that C is a connected
set of points.
Equivalent Parameterizations. Notice that the parameterization of C defined by
(13) is in no way unique. For let 1/I(t) have a positive derivative 1/1' (t) and let t = 1/1 (.)
map a' ::: • ::: b' onto a ::: t ::: b. Then r = r(1/I(.» will map a' ::: • ::: b' one-to-
one onto C and will be a second parameterization of C. We shall call it an equivalent
parameterization of c. Clearly, a given space curve has infinitely many eqUivalent
parameterizations. Each one of them determines the curve C as a set in E3 , and its
orientation rea) is the initial point of C, and reb) is its final point. A point ret) is between
r(tl) and r(t2) if and only if tl < t < t2.
Smooth Space Curves. The class of space curves defined above is too large to
permit the use of methods of vector differential calculus. We shall restrict our study
to parametric curves that have at least three continuous derivatives. This suggests the
follOwing definition.
Definition. A space curve C is said to be a smooth space curve if and only if it satisfies
dr ,
(17) dt = r (t)
is a tangent vector to C at the point ret). Thus conditions (14)-(16) guarantee the
existence of a continuously turning tangent line at each point of C.
Elementary Examples. The following three examples of smooth space curves illustrate
the concepts defined above.
EXAMPLE 1. A Straight Line. Let ro and rl be the position vectors of two distinct points
in E3 . Then
describes the straight line segment joining ro and rl. In particular, reO) = ro and rO) =
fl. Also, f' (t) = fl - fO is a nonzero constant vector, parallel to the line segment r.
EXAMPLE 2. An Ellipse. Consider an ellipse with semiaxes a and b (a > b > 0). If it lies
in the z-plane of a Cartesian coordinate system and has axes along the x and y axes then
r = xi + yj + zk is on the ellipse if and only if
Note that r(2n) = reO) = ai, and the curves closes when t = 2n.
where a > 0 is the radius and b > 0 is the pitch of the helix, which measures the spacing
of successive turns of the spiral; see Figure 12.
6.2 VECTOR CALCULUS OF CURVES IN SPACE 385
x ~>y
This is usually motivated by deriving (21) as a limit of the lengths of inscribed polygonal
curves. Also, (13) implies that
°
where rp(O maps a ~ t ~ b one- to-one onto ~ s ~ f.. Moreover, if C is a smooth
space curve then rp(O has continuous derivatives through order three. Also, by (16),
ds r--~--~~~~~~
(25) dt = rp' (t) = Jr'(t)J = V(X' (t))2 + (y'(t))2 + (Z'(t»2 > 0,
and we see that (24) is invertible and t = rp-l(S) has derivatives of order three or more
°
for ~ s ~ f.. Thus r(rp-l(s» defines an equivalent parameterization with arc length s
as parameter. A particular consequence of equation (25) is the theorem that the tangent
vector r'(O is a unit vector if and only if s = t - a. We shall see below that the arc length
parameterization is very useful in studying the structure of smooth space curves.
The Geometry of Space Curves. The information given above will now be used to
show that the shape of each smooth space curve is determined by two scalar functions
386 6 VECTOR ANALYSIS
of the arc length parameter. We begin with the arc length representation of C as
dT(s)
Case 1. ~ = 0 for 0 ::: s ::: C.
dT(s)
Case 2. d;- =f:. 0 for 0 ::: s ::: e.
dT(s)
Case 3. - - = 0 for some values of s but not all.
ds
In Case 1, integration gives T(s) = To = constant. It follows easily that C is a straight
line parallel to To. In Case 2 we have
dT(s) I >
(29) K(S) = Id;- 0 for 0::: s ::: e.
This will be called the regular case. The function K(S) is called the curvature of C at
res). In Case 3, K(S) is zero at some values of s and not zero at others. If we work between
zeros of K(S) we revert to the regular case. The study of the behavior of C near the zeros
of K(S) is too difficult to study here. It is studied in advanced geometry texts.
The Regular Case. In this case we have, by (29),
dT(s)
(30) d;- = K(s)N for 0::: s ::: C,
where N = N(s) is a unit vector orthogonal to T by (28). Thus N lies in the normal
plane to Cat res); N is called the principal normal at res). To span the normal plane
at res) we need another normal vector that is orthogonal to both T and N. An obvious
choice is the binormal at res), defined by
Since T and N are orthogonal unit vectors, we see that B is a unit vector and
In fact, for each s, the vectors T, Nand B form an orthogonal basis for all vectors in E3 .
dB
(33) - = -rN
ds
for 0<
-
S <
-
.e.
The coefficient res) is called the torsion of Cat res). (The use of the minus sign in (33)
is a convention and is not essential.) A fundamental theorem of elementary geometry
states that the two scalar functions K(S) and res) determine C completely except for its
position in space. An explanation of this fact is sketched below.
The Frenet-Serret Formulas. The derivatives dT/ds and dBlds are written in terms
of the orthogonal basis T, N, B in (30) and (33). To complete the triad we will show that
dN
(34) - = -KT+ rB.
ds
One way to do this is to differentiate the relation
(35) N=BxT.
dN dB dT
-=-xT+Bx-.
ds ds ds
Combining this with (30) and (33) gives (34). The three vector relations
dT
(36) -=KN
ds '
dN
- KT + rB,
ds
dB
-rN
ds
are called the Frenet-Serret formulas after their discoverers F Frenet (1852) and JA.
Serret (1851).
388 6 VECTOR ANALYSIS
Calculation of K(S) and res) From C. If C is given in the arc length representation (26)
then differentiation and the definition (29) give
Calculation of C From K(S) and res). It is enough to calculate T(s), since integration
then gives Cas
The mass density of the wire (mass per unit length of C) is a positive function p(x, y, z).
The usual approximation procedure gives the approximate Riemann sums
L p(x(U,y(ti), z(ti))lr'(ti)l~ti'
n
M ~
i=1
Passing to the limit as the partition norms tend to zero gives the definite integral
I. Exercises 6.2 . ~
",~ bi, 1 R" ~nw' w"","'~' ~,,"'" ,'"
1. (Vector Form of a Line) Find a vector representation ret) = ro + tf) of the line that passes through the
points (3,4,7) and (1, -1, 3)
2. (Parametric Form of a Line) Use vector methods to find a parametric representation
x = xo + t(Xl - xo),
Y = YO + t(Yl - YO),
Z = Zo + t(Zl - zo)
of the line that passes through the point (3,4,7) in the direction of the vector i - j + 2k.
3. (Symmetric Form of a Line) Use vector methods to find a symmetric representation
x - Xo Y - YO Z - Zo
Xl - Xo YI - YO Zl - Zo
390 6 VECTOR ANALYSIS
of the line that passes through the point (3, 4, 7) in the direction of the vector 2i - j + k.
4. (Equations of a Line) Use vector methods to find the vector, parametric, and symmetric representations
of the line that passes through the point (3,4,7) and is parallel to the line of intersection of the planes
defined by the equations x + 2y + llz = 0 and 5x + lly + 58z = O.
5. (Circle Parameterization) Find the parametric equations for a circle with center (1,2,5) of radius 7 in
the plane z = 5.
6. (Parameterizations) Show that ret) = cos(t)i + sin(t)j, 0 ::: t ::: 7T12, is a parameterization of a quarter
circle and so is r(u) = (0 - u 2 )i + 2uj)/(1 + u 2 ), 0 ::: u ::: l.
7. (Triangle Parameterization) The boundary of the triangle with vertices 0,0,0), 0,1,0), and (2,3, 1)
is the image of a parametric curve r(t). Find one such parametric curve r(t).
8. (Tangent Vector) Find the velocity vector r' and unit tangent vector r'llr'l to the curve ret) = cos(t)i +
sin(t)j at t = 7T12.
9. (Constant Tangent) Let ret) = ti + (2t + 10)j + 5tk, which represents a line in space. Find the unit
tangent T and verify that it is constant, i.e., dT/dt = O.
10. (No Tangent) Find the points on the curve ret) = tj + Itlj that fail to have a tangent defined.
11. (Velocity and Speed) Let ret) = cos(t)(i - j) + sin(t)(i + j) + tk describe the motion of a particle.
Find the velOcity r' and speed Ir'l of the particle.
12. (Particle Dynamics) Let ret) describe the path of a moving particle that has constant speed (Ir'(t)1 =
constant). Does it follow that rl/ (t) = O? Explain.
13. (Angular Speed) A particle moves around the circle x 2 + y2 = 4 with angular velOCity dOfdt a constant
10 radians per second. Find a vector formula ret) for the motion of the particle and compute the particle
speed Ir' (t) I·
14. (Polar Equations) A plane motion ret) is given in polar coordinates p, 0 by the equations p = 1 + sin t,
o= -1 + e- t . Find the velocity r' (t) and the acceleration rl/ (t) at t = O.
15. (Polar Motion) A particle moves in the plane by the equation ret) = (1-cosO(t»(cos O(t)i +sin O(t)j),
where 0' (t) = 4. Find the velOcity r' (t) and the acceleration rl/ (t).
16. (Continuity) Prove that the dot product f(t) . get) and cross product f(t) x get) of continuous vector
functions f(t) and get) are continuous functions of t.
17. (Dot Product) Let aCt) and bet) be two continuously differentiable vector functions. Prove that
(a(t) . b(t»' = a' (t) . bet) + a(t) . b' (t).
18. (Cross Product) Let aCt) and bet) be two continuously differentiable vector functions. Prove that
(a(t) x b(t»' = a'(t) x bet) + aCt) x b'(t).
19. (Arc Length) Find the length of ret) = cos ti + sin tj + 2tk on 0::: t ::: 27T.
20. (Arc Length) Find the length of ret) = ti + sin(27Tt)j + cos(2m)k on 0 ::: t ::: 1.
21. (Arc Length) Find the length of the plane curve r(t) = (cos 3t)i + (sin 3t)j on 0 ::: t ::: 7T.
22. (A Formula for dTlds) Given a twice continuously differentiable vector function ret), let s
f~ Ir'(r)ldr and let T = drlds be the unit tangent. Prove the decomposition formula (below' = dldt)
23. (Normal and Tangential Components) Given a twice continuously differentiable vector function ret),
let s = I~ Ir' (r)ldr. Define as usual the unit tangent T = dr/ds, the curvature K = IdTldsl, and the
standard unit normal N = K-1dTlds. Prove the decomposition formula (below' = dldt)
r' . r"
aT = y'
24. (Curvature) Given s = I~ Ir'(r)ldr, prove the curvature formula
25. (Curvature) Show that the curvature ofr(t) = (3t-t 3 )i+3t2j+(3t+t 3 )kisK(s) = 1/(3(1+t2 )2).
26. (Tangential and Normal Components) In the acceleration representation r" (I) = aT T + aN N, which
says that r" is in the plane of T and N, the coefficients aT and aN are called the tangential and normal
components of acceleration. Find aT and aN for the helix ret) = (cos t)i + (sin t)j + 2tk.
27. (Circular Helix Tangents and Normals) Sketch the circular helixf(1) = cos2ti + sin2tj + tk for
o :::: t :::: 4rr and draw on the curve aU = rr/4 the unit tangent vector T and the standard normal vector N.
28. (Binormal Magnitude) Prove directly from the definition B = T x N that the binormal B has unit length.
29. (Formulas for T, N and B) Show that the following formulas are valid for a twice-differentiable vector
function r(t).
31. (Trihedral Formulas) Verify the formulas B = T x N, N = B x T, T = N x B for the special case of
the helix ret) = cos ti + sin tj + tk.
32. (Binormal) Prove that the binormal B and its arc length derivative dBlds are orthogonal
33. (Reciprocal Basis) Let aI, a2 and a3 be given vectors that are nonzero and noncoplanar. Define
a2 x a3 al x a2
hI = _=-----0..._ b3 = -=---::..-
al . a2 x a3 al . a2 x a3
35. The curve C is the straight line segment C from (1,0, 1) to 0,3,2). The functionpis defined by
p(x,y,Z) = 3y + xz - x.
36. The curve C is the semicircle y = J1 - x 2 clockwise from ( - 1, 0) to 0, 0). The function p is defined by
p(x,y) =x 2 +x2/.
37. Compute the line integral of the tangential component of the force F = xi + x 2yj along the straight
line from (-1,0) to 0,0). Hint: The answer is the line integral fcp(x,y,z)ds, where p(x,y,z)ds =
F· dr = F(r(t» . r'(t)dt.
38. Compute the line integral of the tangential component of the force F = xi + x 2yj along the polygon
from (-1,0) to (1,1) to (1,0).
where (u, v) ranges over a domain D in the (u, v)-plane. Before we discuss the general
properties of such surfaces, we shall give three simple examples.
(2) a x b i= o.
Then
represents the plane through fO that contains the straight lines f = fO +ua and f = fO +vb
(see Section 6.1). Note that if condition (2) is violated then a and b are linearly dependent
and the plane degenerates to a line or point.
EXAMPLE 2. Cones. Consider the upper nappe 5 of the cone with vertex at the origin
and axis along the positive z-axis (see Figure 13).
6.3 VECTOR CALCULUS OF SURFACES IN SPACE 393
o
FIGURE l3. Cone with vertex angle ex
Jx2 + y2
(4) -'-----'-- = tan ex,
Z
and then the cone has the parametric form
(5) x = (tanex)vcosu,
y = (tan ex)v sin u,
z = v,
with 0 :::: u :::: 2Jr, v ::: O. In vector form,S has the representation
(7)
One parametric representation is given by the defining equations for spherical coordinates
with r = Jx 2 + y2 + Z2 = 1, so (see Figure 14)
(0)
(ll) x=u,
y =v,
Z = (cota)Ju 2 + v2 ,
(12) x=u,
y =v,
Z = J'I---u2,-------v-,-2,
where 0 :s u2 + v2 :s l. Note that this gives the upper half of the sphere. The lower half
is obtained by changing the sign of the square root in (2).
Stereographic Projection. Another parametric representation of the unit sphere is
obtained by mapping each point (x,y, z) of the sphere from the north pole (0,0,1) onto
the equatorial plane (u, v, 0); see Figure 15.
Equating the unit vectors for the displacements from (0,0,1) to (x,y, z) and from
(0,0,1) to (u, v, 0) gives the vector equation
xi + yj + (z - l)k ui + vj - lk
(13)
Equating the coefficients ofi,j, and k and solving for x,y, and Z gives, after some algebra,
2u
(14) x = u2 + v2 + l'
6.3 VECTOR CALCULUS OF SURFACES IN SPACE 395
z
(0,0,1)
(u, v, 0)
2v
Y = u2 + v2 + l'
u2 + v2 - 1
z - --::---::---
- u2 2 + v + l'
for -00< u, v < 00.
This representation is used in complex analysis. Note that the unit sphere with the
north pole (0,0,1) removed is mapped one-to-one onto the entire finite (u, v)-plane.
Smooth Surfaces in Space. Methods of vector calculus are applicable to parametric
surfaces (1) that have several continuous partial derivatives. We shall find the following
definition useful.
Definition. A surface 5 in E3 is said to be a smooth surface if it satisfies (with D a
domain in the Cu, v)-plane)
Notice that if Co is any smooth curve in D then it is lifted by the mapping (1) to a space
curve C that is embedded in s!
u-Curves in S. To begin, let (uo, va) lie in D and consider the curve in 5 defined by
We shall call it the u-curve through r(uo, va), since only u varies on it. The results of
Section 6.2 imply that the vector
(19)
is a tangent vector to the u-curve at fa = r(uo, vo). It is therefore also a tangent vector to
5 at fO.
v-Curves in S. Similarly, define the v-curve through fa by
(21)
Note also that condition (17) implies that Tu and Tv are linearly independent.
Tangent Vectors to 5 at roo We shall show that every tangent vector to 5 at ro is a
linear combination of Tu and Tv. For this purpose define a curve Co in D by
By Section 6.2, the tangent to C at fO is obtained by taking the t-derivative and setting
t = 0 (so (u, v) = (uo, vo». The t-derivative of (23) at t = 0 is given by the chain rule as
Since we can choose any values for u~ and v~, it is clear that we obtain the entire tangent
plane in this way.
The Tangent Planes to S. The preceding result shows that the tangent plane to 5
at any point ro = f(UO, vo) passes through that point and is spanned by the vectors
fu (uo, vo) and fv(UO, va). Thus, by Example 1, the tangent plane has parametric equation
(24) f = fO + Ufu(UO, vo) + Vfv(Uo, vo), -00 < u,v < 00.
6.3 VECTOR CALCULUS OF SURFACES IN SPACE 397
The Normal Lines to S. The normal vectors to 5 at a point r(uo, vo) are orthogonal to
the tangent plane at r(uo, vo). Thus they are just the scalar multiples of the normal vector
(25)
Surface Area. Two formulas for areas of surfaces are developed in most first-year
calculus courses. The simplest formula is applicable to surfaces of revolution. It states
that if x, y and z are Cartesian coordinates and if the graph of
y = J(x), a ~ x ~ b,
is revolved around the x-axis, then the area A of the resulting surface of revolution is
given by (see page 324 in [V-P])
In most texts the proofs offered for these formulas are heuristic, being based on estimates
of terms in the corresponding Riemann sums. Here we shall derive a formula for the
area of any smooth parametric surface 5 as defined by (15)-(17). The proof will again be
heuristic. Finally, the generality of the new formula will be shown by deriving (27) and
(28) from it.
The Area Formula for Parametric Surfaces. The formula states that if 5 satisfies
(15)-(17), then its area is
Heuristic Derivation of (29). First we must realize that no rigorous proof of (29)
is possible because no independent definition of A(S) has been given. Instead, our
"derivation" is a heuristic argument meant to motivate equation (29).
398 6 VECTOR ANALYSIS
b-a
a = Ul < U2 < ... < U m+1 = b, f},Ui = f},u = --,
m
d-c
c = Vl < V2 < ... < Vn+l = d, f},vi = f},v = - - .
n
Note that the particular lines U = Ui (a v-curve) and v = Vj (a u-curve) decompose S
into small surfaces Sij, defined by (l) with Ui SuS Ui+l, Vj S v S Vj+l, i = 1, ... , m,
j = 1, ... ,n; see Figure 16.
Then the additivity of area gives
LL A(Sij)'
m n
A(S) =
i=1 j=1
Moreover, and this is the main idea behind equation (29), Sij is approximated by the
parallelogram at f(Ui, Vj) with edges f},u fu(Ui' Vj) and f}, v fv(Ui, Vj); see Figures 17 and 18.
Thus
with the approximation improving as t.u -+ 0, t.v -+ 0. Combining the last two
formulas gives
This last sum is a Riemann sum for the double integral on the right in (29). Thus passing
to the limit (m, n) -+ (00,00) gives formula (29).
Application to Surfaces of Revolution. We shall show that formula (27) is a special
case of (29), thus giving a second derivation of (27). Our method is based on the
parameterization
(32)
and so
Differentiating gives
ru = i + Iuk,
rv = j + Iv k,
whence after a short calculation we get
ru x rv = -Iu i - Id + k
and
(35) g(u' , Vi) = gi (u' , v')i + g2 CUi, v')j + g3 CUi, v')k, (u' , Vi) in D'
be any two parameterizations representing S. Then
The proof, which is based on a theorem of advanced calculus for changing variables
in a double integral, is too long to reproduce here.
Notation. The invariance result (36) makes it natural to suppress the variables u, v and
write
dA = Iru x rvldudv
and
IExercises 6.31
1. (Partial Derivatives) The equations x = u2 , y = v2 , Z = uv define a surface S. Find the vector panials
ru and rv for the vector parameterization r(u, v) = xi + yj + zk.
2. (Plane) Find a vector normal to the plane r(u, v) = ui + vj + (- 3u + 2v)k.
3. (Plane) Find a vector parameterization r(u, v) for the plane 3x - 2y + 5z = 7.
4. (Plane) Find a vector equation r(u, v) for the plane parallel to the plane 3x-2y+5z = 7 and containing
(0,1, -1).
5. (Plane) LetP = (1,-1,0), Q = (1,1,-1), R = (0,1,-1). Find a vector equation r(u,v) for the plane
determined by P, Q, R.
6. (Cone) Represent the cone x 2 + Z2 = y2, Y :::: 0, as a vector parameterization r(u, v) on an explicit
(u, v)-domain.
l
7. (Cone) Represent the section of the cone x 2 + = 3z2 between the planes z = 1 and z = 3 as a vector
parameterization r(u, v) on an explicit (u, v)-domain.
8. (Hemisphere) Represent the upper hemisphere x 2 +y2 + z2 = 100, z :::: 0 as a vector parameterization
r(u, v) on a rectangle.
9. (Spherical Cap) Represent the spherical cap above the plane z = 2 on the sphere x 2 + l + Z2 = 16
by a vector parameterization rCu, v) on a rectangle.
10. (Stereographic Projection) Describe graphically the curve in the (u, v)-plane that corresponds to the
circle on the unit sphere in the plane z = 0.5 according to the stereographic projection equations.
13. (Smooth Surfaces) The equations x = e- 1/u2 , y = e- 1/v2 , z = 1 define a surface 5 with domain
o ::s u ::s 1,0 ::s v ::s 1, provided that at u = v = 0 we define x = y = O. Is this a smooth surface?
14. (Smooth Conical Surface) Show that the section of the cone x 2 + y2 = 3z 2 between the planes z = 1
and z = 3 has a smooth vector parameterization r(u, v).
15. (Normal to a Surface) Let r = ui + vj + (u 2 + v2)k. Compute the normal n = ru x rv at u = 1, v = 2.
16. (Normal to a Surface) Find a vector parameterization r(u, v) and the outer unit normal n = ±N/INI.
N = ru x rv, for the surface z = x 2 + i·
17. (Tangent Plane) Find by vector methods the equation of the tangent plane to the surface z = 3x 2 + i
at the point (1, 1,4).
18. (Regular Surface) A point on a surface is said to be regular if the partials ru and rv at that point span
a plane, or, more SUCcinctly; ru x rv 1= O. Find the nonregular points for r = vi + vj + uvk.
19. (Space Triangle Parameterization) Let a space triangle PQR be given by three vertices P, Q, R. Define
edge vectors a = PQ, b = PR. Show that a parameterization of triangle PQR is
r(u, v) = oP + ua + vb
with domain
20. (Space Triangle Area) Let a space triangle PQR be given by three vertices P, Q, R. Define edge vectors
a = PQ, b = PR. Show that the area of triangle PQR is equal to
Surface Area. For each exercise below, identify a vector parameterization r(u, v) and its domain D.
Compute the partials ru, rv and Iru x rvl 2 = Irul21rvl2 - (ru . rv)~. Finally, compute the surface area
21. The surface x = sinucosv,y = sinucosv, z= cosu, O::s u::s rr/2, O::S v::S rr/3.
22. The portion of the unit sphere at the origin that is above the plane z = 1/2.
23. The section of the surface x = u2 , Y = uv, z = v2/2 corresponding to the region bounded by the curves
u = 0, u = 1, v = 0 and v = 3.
24. The triangle with vertices 0,0,0), (0,1,0), (1, 1, 1).
25. An open box (no top) determined by the edge vectors i, 2j and 4k.
26. The closed tetrahedral surface formed from base triangle (0,0,0), (1,0,0), (0, 1,0) by connecting lines
to the fourth point (1, 1,2).
27. The cone x = vsinacosu, y = vsinasinu, z = vcosa, for 0 < a < rr/2 fixed and 0 ::s u < 2rr,
O::s v::S h.
6.4 CALCULUS OF SCALAR AND VECTOR FIELDS 403
28. The ellipsoid with semiaxes a, b, c described by x = a sin u cos v, y = b sin u sin v, Z = c cos u,
0::0 u ::0 n, 0 ::0 v ::0 2n. Give the surface area in unevaluated integral form.
29. The cylinder x = a cos u, y = a sin u, Z = v, for 0::0 u ::0 2n, hI ::0 v ::0 h2, with a > 0 fixed.
30. The paraboloid x = av cos u, y = av sin u, Z = v2 , 0 ::0 u ::0 2:7r, 0 ::0 v ::0 h, with a > 0 fixed.
31. The section of the paraboloid Z = x 2 +l within the unit square 0 ::0 x ::0 1, 0 ::0 Y ::0 1. Give the
surface area in unevaluated integral form.
32. The surface of revolution x = v cos u, y = v sin u, Z = f( v), where f is a function of one variable defined
for VI ::0 v ::0 V2 and 0 ::0 u < 2n. Give the answer in integral form.
33. (Center of Mass) Let f(x, y, Z) = x + y + z represent the mass density per unit of surface area on the
surface 5 described by the equations x = u, y = v, Z = 1 - 2u - v, 0 ::0 u ::0 1,0 ::0 v ::0 2. Find the
center of mass (x,y, z) from the formulas
y= ~f IsYf(x,y,Z)dS, m = f Isf(x,y,Z)dS.
(1) J(x,y) = c.
This notion leads to the topographic maps of geography and the isothermal maps and
isobaric maps of weather prediction. A simple example is shown in Figure 19.
404 6 VECTOR ANALYSIS
c=3
(2) j(x,y, z) = c.
dx
(4) -;it = F1(x,y),
dy
dt = F2 (x,y).
For greater accuracy we may integrate system (4) to find the flow-lines of the vector field.
The term "flow-line" is motivated by the case where F represents the velocity field in a
fluid and the integral curves are the streamlines. Physically, these are the paths followed
by a light particle that drifts with the fluid.
For vector fields F(x,y, z) in £3, the direction fields lie in £3 and are much harder to
display. Computer graphics can be used to construct perspective views of the direction
fields. Skill is needed to produce helpful diagrams. The flow-lines can also be constructed
and displayed in the same manner. The simple example
dx
(5) - = a(y -x)
dt '
dy
- =px-y-xz (a, p, f3 constants)
dt '
dz
-=-f3z+xy
dt
shows how complicated the flow-lines can become. System (5) is the well-known Lorenz
system, whose flow lines exhibit chaotic behavior (see [Sch]). A computer-generated
trajectory is shown in Figure 21, with a = lO, p = 28, and f3 = 8/3.
Directional Derivatives and Gradients. Let fer) denote a scalar field whose
first -order partial derivatives
and call (6) the directional derivative of fer) at ro in the direction u. An important
theorem of vector analysis states that (6) is a linear function of u, namely,
(7)
In fact, (7) is a consequence of the chain rule of calculus. We shall also write
Normal Vectors to a Level Surface of a Scalar Field. Let j(r) be a scalar field
and consider a point ro where
(ll)
is a normal vector to the level surface S. To do this, let res) be a parametric curve with
arc length parameter s, chosen in such a way that
where T is any unit tangent vector to S at roo Then Jeres)) = j(ro) and the chain rule gives
(14)
I
if F is the velocity field of fluid and S is a smooth oriented surface then
volume of fluid per unit time crossing}
(15) fIs F . n dA = ~;; the negative side to the positive side
408 6 VECTOR ANALYSIS
It is to be understood that if the integral (14) is negative then the net flow is from the
positive to the negative side of S.
Derivation of (15). Partition S into a union of small surface elements and consider the
flow across a typical element ~S. The volume of fluid crossing ~S in time ~t fills a
cylinder of height F . n ~t; see Figure 22.
Thus, writing ~A for the area of ~S,
Equation (15) now becomes apparent if we sum over all surface elements of the partition
and make the norm of the partition tend to zero.
Net Flow out of a Parallelepiped. Let us consider a small parallelepiped V in the
domain of the vector field F:
We shall calculate the net flow rate per unit volume out of V, in the limit where
~X2 + ~i + ~Z2 ~ O.
-1
IVI
11 5
F·ndS~-+-+-.
~Fl
~x
~F2
~y
~F}
~z
(16) . -1
hm
IVI ..... O IVI
11 5
oF=
F·ndS
l
-oF
+-+
2 of}
-
ox oy oz '
evaluated at (xo,Yo, zo)· This last expression is called the divergence of Fat (xo,Yo, zo)
and it is written
.
dI OFI OF2 of}
(17) vF=-+-+-.
ax ay az
Equation (16) implies that the net flow rate per unit volume out of parallelepiped V is
div F(xo,yo, Zo). Note that the divergence operator diy, defined by (17), carries a vector
field F into a scalar field div F. Also, from (16), div F(xo,yo, zo) > 0 means that the fluid
is expanding near (xo,Yo, zo), while div F(xo,yo, zo) < 0 means that it is compressing.
The Curl Operator and the Curl Field. let C be a closed curve lying in the domain
D of F and consider the line integral
(18) l F . dr = l F . T ds,
where s is the arc length parameter on C and T = dr/ds is the unit tangent vector.
Integral (18) is called the circulation of F around C. If F is the velocity field of a fluid
then the circulation of F measures the rate at which the fluid is rotating. Note that for
fixed magnitude IFI the circulation (18) is largest when F is parallel to T, while it is zero
when F is orthogonal to T.
410 6 VECTOR ANALYSIS
The circulation is always zero for a gradient flow F = grad f. To see this, note that
by the chain rule,
dr
F . T = gradf . ds
= afdx + afdy + afdz
ax ds ay ds az ds
df
= ds'
and hence if £ is the length of C, then
1 1
C
F·dr=
1df
-ds
0 ds
= f(x(£),y(£), z(e) - f(x(O),y(O), z(O»
=0
because C is closed.
Circulation Around a Small Rectangle. We shall estimate the circulation around
a small rectangle Cz in the (x,y)-plane with venices
(xo,Yo, zo), (xo + ~x,Yo, zo), (xo + ~x,yo + ~y, zo), (xo,Yo + ~y, Zo).
Rectangle Cz is oriented counterclockwise as seen from the positive z-axis; see Figure 24.
For ~X2 + ~y2 small we can estimate
I c,
F. dr ~ [Fl(xo + ~X,Yo,Zo) - Fl(xo + ~X,Yo + ~y,Zo)] ~x
+ [F 2 (xo + ~X,Yo + ~y, zo) - F2 (Xo,Yo + ~y, Zo)] ~y,
so if Sz = ~x~y = area enclosed by Cz , then
~ 1
Sz c,
F· dr ~ aF2
ax
_ aFl.
ay
Y
x
(Xo + t:.x, Yo + t:.y , <0) FIGURE 24. Oriented rectangle Cz
6.4 CALCULUS OF SCALAR AND VECTOR FIELDS 411
.
hm -
1 i F·dr= - - - ,
aF3 aF2
i
Sx
5x -+O Cx ay az
. -1 F . dr = -aF - -
aF.
l 3
hm
5y --+0 Sy c, az ax
These three limits determine the vector field curl F, defined by
lim ~ { F· dr = (curl F) . n,
5--+oS1c
where 5 is the area of C.
EXAMPLE 1. A Rotating Fluid. Consider a fluid that is rotating about the z-axis with
constant angular velocity w radians per second. The fluid velocity F = F (r) at each point
in space is determined by the vector w = wk; see Figure 25.
We have
Moreover, F and w x r have the same direction, by the right hand rule; see Figure 25. Thus
(21) F =W x r
gives the velocity field in a fluid that rotates as a rigid body about the z-axis with angular
velocity w. A simple calculation, using (20), gives
The factor 2 is correct. Thus the rotation vector w is one-half the curl of the velocity field.
The Del Notation. An alternative notation for the differential operators grad, div
and curl is provided by the vector differential operator
a a a
(23) V=i-+j-+k-.
ax ay az
The symbol V is called "del" or, in some texts, "nabla." In applying V to a scalar or vector
field it is understood that the differentiations are applied to all terms to the right of them,
unless they are delimited by parentheses. With this convention we have the equivalence
(24) Vj = i aj + j aj + k aj = gradf.
ax ay az
Furthermore, if
then
(25)
and
(26)
= curl F.
(27) V x (Vf) =0
and
(28) V· (V x F) = 0,
6.4 CALCULUS OF SCALAR AND VECTOR FIELDS 413
provided that I and F have continuous second derivatives. Thus, to verify (28), we have
az
ay - aF2) + aya (aFr ~
v . (V x F) = axa (aF3 aF3) az -
+ ~ (aF2 _ aFr)
az ax ay
a2F3 a2F2 a2Fr
=---+-
axay axaz ayaz
a2F3 a2F2 a2Fr
--+---
ayax azax azay
=0,
provided that the mixed second-order partials are equal. This holds if the second-order
derivatives are continuous.
Additional Operator Identities. The following identities are often useful in vector
calculations. They can be proved by straightforward calculation; see page 155 in [D-S].
Exercises 6.4
Scalar Fields in R2. For each exercise, construct from the given scalar field two figures: (1) the surface,
and (2) a contour diagram.
1. The paraboloid equation z = x 2 + 4yl, -1 ::: x ::: 1, -1 ::: y ::: 1. Plot the curves of constant altitude
z = 0.2 to 1.8 in steps of 0.4.
2. The pendulum energy equation E = 1 - cose + y 2/2, -rr/2 ::: e ::: rr/2, 0 ::: y ::: 1. The variable e is
the angle the pendulum makes with the venical, while y is the velocity of the pendulum bob. Plot the
curves of constant energy E = 0 to E = 1.4 in steps of 0.2.
3. The saddle surface z = x 2 -yl, -3::: x::: 3, -3::: y ::: 3. Plot the curves of constant altitude for
z = -4 to 4 in steps of 2.
4. The temperature function T = 100 - x 2 - y2 on the plate -10 ::: x::: 10, -10 ::: y ::: 10. The T-axis
scale is degrees Celsius. Plot the isothermal curves for T = 0 to 100 in steps of 25°.
414 6 VECTOR ANALYSIS
top
5. The traveling wave h = 4 + cos(x - tl2), 0 ::: x ::: 2n, 0 ::: t ::: 2n, where x is position and t is time.
Plot the curves of equal displacement for h = 3 to h = 5 in steps of 0.5.
Topographic Maps. The hill in Figure 26 has two walking paths to the top. A similar but different hill is
pictured in Figure 27. The following exercises discuss the topographic map and the 3-D figure of the nearby
hill.
6. Which path (A or B) in the topographic map of Figure 26 is the steepest climb? Explain your reasoning.
7. Make a rough 3-D model of the hill directly from the topographic map of Figure 26.
8. Make a rough contour map with three curves (z = 100, 150, 200) using the 3-D figure of the nearby
hill; see Figure 27. Assume that the contours were generated fromy = O.Olzsin(nx) with z fixed.
Flux of a Constant Field. For each exercise below, find the flux for the constant field v = i + j + 4k
through the given surface.
9. A disk of radius 3 in the (x,y)-plane with normal k.
10. A square of side 1 in the plane z= 1 with normal k.
11. A rectangle with vertices (6,0,0), (6,1,0), (6,1,2), (6,0,2) oriented in the positive x-direction.
12. A tilted rectangle with vertices (1,0,2), (1,1,0), (0,1,0), (0,0,2) and normal in the direction of
(j - 2k) x (-i).
13. A triangular plate of area 6 in the (y, z)-plane with normal i.
14. A triangular plate with vertices (1,0,0), (0,1,0), (0,0,1) and normal pointing away from (0,0,0).
Flow Lines of Vector Fields. In the exercises below, solve for the parametrized curve ret) satisfying
r'(t) = F(r(t».
15. F(x,y) = 2i + 3j, reO) = 2i + 4j.
16. F(x,y) = xi + yj, reO) = 2i + 4j.
17. F(x,y) = y 2 i + 2xj, reO) = i +j. Find ret) for t = 0 to t = 1 in steps of 0.2 by Euler's numerical method.
Tangent Lines and Tangent Planes. In the exercises below, find the equation of the tangent line or the
tangent plane to the given curve or surface using gradient methods for level curves and level surfaces.
18. Find the tangent line to x- i = 4 at x = 5, Y = 1.
19. Determine the equation ofthe tangent plane to the surface 2x2 +3i = liz at the point X = l,y = -1,
z = 115.
20. Determine the equation of the tangent plane at (-1,1,1) for the surface x 2 + y4 + x 2 Z2 = 3.
21. (Electric Field) The electric field produced by a charge placed at the origin is given by E = (xi + yj +
zk)/(x 2 + i+ Z2 )3/2. Find div E.
22. (Magnetic Field) A magnetic field B must have zero divergence. Is it possible that B = yi-xj+(x+y)k
is a magnetic field?
23. (Point Source) Let F = j(r 2 )(xi + yj), where r2 = x 2 + i
andj(u) = l/u. Show that F has a point
source at the origin, i.e., the divergence is zero away from (0,0), and its direction is always away from
(0,0)
31. v = x 2i + i j + z4 k
32. v = yzi + xzj + xyk
33. v = 3yzi + 5xzj + 9xyk
amount
8000
...... r---
......
-
level curves
r--- ...... ......r--
r-- ...... ...... r--" for payment P
6000
-r-- -r-- -r-- -,...
-,... r--_
r--_ p= 140
r--_ p= 120
r--_I- I-r-- - l -
P = 100
4000
I--
-I- -,... 1'--
1'--1- 1-- P=80
2000
-I-
- P = 60
3 5 7 9 11 13 15 interest
38. (Curl Identity) Apply the relation curl(w x r) 2w, valid for r = = xi + yj + zk, to solve the equation
curl v = 2i - 4j + 12k forthe vector v = v(x,y, z).
39. (Curl Identity) Let v be a vector field with continuous second partials. Show that div(curl v) = O.
40. (Loan Payments) In Figure 28, the contour graph shows the monthly payment and loan amount on a
5-year car loan. Assume a loan amount of $6000 dollars at 14% interest. If the car is refinanced at 9%
within the first 30 days, then by how much is the monthly payment reduced, approximately?
41. (Fluid Flow) The fluid velocity flow F = 2x i + y j + z k acts in the cube V described by 0 ~ x ~ 1,
o ~ Y ~ 1, 0 ~ z ~ 1. Find the volume of fluid per unit time flowing out of V.
=
42. (Fluid Flux Across a Surface) The fluid velocity flow F xj + z k acts on the surface 5, which is the
=
portion ofthe plane 3x + 2y + z 6 in octant I (x,y,z > 0), oriented by the vector 3i + 2j + k. Find
the volume of fluid per unit time flowing across 5.
43. (Heat Flux) Let 5 be the surface bounded by the spherex2 +i +z2 = 16 with n the unit outer normal
i
to 5. Given the temperature function T = x 2 + + Z2 and the temperature gradient F = -'VT, find
the heat flux out of the sphere,
44. (Fluid Flux Across a Surface) The fluid velocity flow F = ecosx i + j + eX k acts on the surface 5 2i
given by r = ui + vj + v2 k on the rectangle D: 0 ~ u ~ 3, 0 ~ v ~ 2. Find the volume of fluid per
unit time flOwing across 5 in the direction of N ru x rv. =
45. (Net Flow) Let 5 be the surface bounded by Z = x 2 + i and z= 1 with n the unit outer normal to
5. Find the net flow
The goal of this section is to extend the fundamental theorem to integrals of scalar and
vector fields over curves, surfaces and volumes in E2 and E3. Several theorems will be
given that involve V j, V . F and V x F.
d
(3) dJ(r(t)) = V j(r(O) . r' (0.
Integrating (3) over a.::; t .::; b and using (1) gives (2).
where C is as before. If both (4) and (5) hold, then by subtracting (5) from (4) one gets
the formula
(6)
418 6 VECTOR ANALYSIS
Here we shall verify (4) and (5) for some domains of simple type. Then we shall extend
the class of domains for which (4) and (5) hold to a larger class that includes most of the
cases that are needed in applications.
x-Domains. A domain D in E2 is said to be an x-domain if and only if there is an
interval a :'S y :'S b and smooth functions g(y) and h(y) such that points in D satisfy
is the oriented boundary of D. The four components of C have the following parametric
representations, correctly oriented.
Proof. We shall begin with the left-hand side of (4) and show by direct calculation that
it equals the right-hand side. The first step is justified by the standard calculus method
of evaluating double integrals as iterated integrals. Thus
b
C2
...
C3 19(Yl Cl J~h(Yl
a
C4
'" x
FIGURE 29. An x-domain
6.5 INTEGRAL THEOREMS OF VECTOR CALCULUS 419
Then
(9) 12 = 14 = 0,
I) = lab Q(h(t), t) dt, 13 = -lab Q(g(r), r)dr,
because y = constant on C2 and C4 implies h = 14 = 0, while
I) = lab Q(x(t),y(t))y'(t)dt = lab Q(h(t), t)dt,
and
13 = i-a -b
Q(x(t),y(t))y'(t)dt = i-a
-b
Q(g( -t), -t)( -l)dt
= l a
Q(g(r), r)dr = -lab Q(g(r), r)dr.
Combining (8) and (9) gives (4).
y = hex)
y
y = g(x)
The proof will be left as an exercise. Careful work will verify that the minus sign
that appears in (5) is correct. In fact, it is a consequence of the way we have oriented the
boundary C of D.
The restriction to x-domains and y-domains is quite severe. Here we shall define a
much larger class, called regular domains, for which Green's formulas also hold.
Definition of Regular Domains. For the definition, Cl, C2 , ... , Cn denote n disjoint
simple closed curves such that C2 , ... , Cn lie inside C1 . The domain D then consists
of the points that are inside C1 and outside C2 , ... , Cn. The domain D is said to be a
regular domain if it has this form and each curve is regular (= sectionally smooth).
Orientation. In accordance with Green's rule we orient C1 , C2 , ... , Cn such that C1 is
counterclockwise and C2 , ... , Cn are clockwise. With this understanding we have the
following theorem.
Green's Theorem. Let D be a regular domain and let Q, Qx, P, and Py be continuous
in D. Then Green's formula (6) holds with
A formal detailed proof will not be given. However, the idea of the proof is Simple. We
must show that each regular domain is the union of a finite number of nonoverlapping x-
domains. Then equation (4) will hold for each x-domain. Adding these equations gives (4)
for D. To get (5) for D we represent D as a finite union of y-domains. Note that when two
x-domains, or two y-domains, share a common boundary arc, then the corresponding
terms in the integral over C cancel in pairs because they occur with opposite orientations.
y
.............. ·0···
........ ·.. ·
C2
.....................................
...... ·· .. 0·····
.. ·· .... ·.. ··· .. ····
C3
............................
Several formulas of vector integral calculus can be derived from Greens theorem by
suitable choices of the functions P(x,y) and Q(x,y). Two will be derived here.
The Divergence Theorem in E2. Let D be a regular domain in E2 and let
(0)
where C is the oriented boundary of D, and n is the exterior unit normal vector to D.
For the proof we apply Green's theorem with Q = Fl and P = -F2 . Then in Green's
formula (6) we have
oF oF
Qx - Py = ax-
l
+ ay = \7 . F,
2
which gives the left-hand side of (11). For the right-hand side we have
dy dx)
(12) Pdx + Qdy = ( Fl ds - F2 ds ds.
Now, writing
dx dy
= r (s) = -ds i + -ds j
!
T
(\7 x F) . k =-
aF2 - -oF l = \2x - P .
ox oy y
422 6 VECTOR ANALYSIS
Moreover,
The proof is an immediate consequence of Green's theorem. We shall see below that (14)
is a special case of Stokes' Theorem in E3 .
The Divergence Theorem in E3 . This theorem, also called Gauss' theorem, is the
three dimensional analogue of equation (11). Thus it states that
where D is a domain in E3 , 5 is its oriented boundary surface with outward unit normal
vector field nand F is a vector field on D. We shall seek conditions on D and F that
are sufficient to guarantee that (15) holds. As preparation let us note that if Cartesian
coordinates are introduced, so that
(16)
Moreover, the functions FI, F2 and F3 can be chosen independently, and hence (16) is
equivalent to the three equations
The divergence theorem (15) is easy to prove for some simple domains D such as
parallelepipeds, cylinders and spheres. We shall prove it for a large class of domains that
includes those that usually occur in applications.
Definition. A domain D in E3 is said to be a z-domain if there exist a domain R in E2
and sectionally smooth functions g(x,y) and h(x,y) with g(x,y) ~ h(x,y) for all (x,y)
in R such that
Theorem. Equation (19) holds whenever D is a z-domain and the component F3 (x,y, z)
is a smooth function in D.
= Ik F3(x,y,h(x,y»dxdy
-Ik F](x,y,g(x,y»)dxdy.
Let us compare this with the right-hand side of (19). Note that the boundary 5 of D
consists of three parts:
Next we calculate n3d5 for each of the three components 51, 52 and 53. For 51 we have
the parametric representation
Writing
r(u, v) = ui + vj + h(u, v) k,
we get
424 6 VECTOR ANALYSIS
o
FIGURE 32. A z-domain
so
Next,
(22)
(23)
6.5 INTEGRAL THEOREMS OF VECTOR CALCULUS 425
Next,
(24)
Corollary. The preceding theorem extends immediately to domains D that are finite
unions of z-domains. We shall not develop this in detail here, but it is clear that in such
a union, where portions of the boundary surfaces coincide, these portions occur in pairs
with opposite orientation and thus cancel from the boundary of the composite domain.
x-Domains and y-Domains. These special domains are defined in strict analogy
with z-domains: Just permute the coordinates x, y, Z in an obvious way. Thus we see
that equation (17) holds in unions of x-domains, and equation (18) holds in unions of
y-domains. This suggests the following definition.
Regular Domains in E3 . A domain D in E3 will be said to be a regular domain if
and only if
f f In V· F dV = f iF, n d5,
(26)
where 5 is an oriented surface in E3 with positive unit normal vector nand C is the
oriented space curve that bounds S.
It is understood that C is oriented such that an observer standing at a boundary point
with his hand in the direction of n and facing in the positive direction on C will find the
surface 5 to his left; see Figure 33.
In (26), F (x, y, z) denotes any smooth vector field that is defined in a neighborhood
of S. In words, (26) says that the flux of V x F across 5 Cleft-hand side of (26) equals
the circulation around C (right-hand side of (26)).
The validity of Stokes' theorem for plane surfaces 5 was shown above using Green's
theorem. Complete proofs for the general case are long. The details will not be given
here. Instead, a beginning of a proof will be given from which interested readers can
complete the proof. Details can be found in [D-S]; see Figure 34 for notation.
The idea of the proof is to use the chain rule
dr = ru du + rv dv
(x. y. 7) -space
(II . u)-plane
FIGURE 34. Notation for the proof of
Stokes' theorem
6.5 INTEGRAL THEOREMS OF VECTOR CALCULUS 427
to write
Jer F· dr = JEr [~
au
f(F . ru) - ~ (F . rv)]
av
du dv.
The remainder of the proof is to simplify the last integral to get (see page 271 in [D-S])
Line Integrals. Convert each line integral below by means of a parameterization of C into an ordinary
calculus integral and then evaluate.
15. The line segment joining 0,2, -1) to the nearest point of the plane x + y + z = 1.
16. The circle x 2 +i = 16 in the plane z= 0, oriented counterclockwise when viewed from above.
17. The circle x 2 + z2 = 16 in the plane y = 1, oriented counterclockwise when viewed from (0,2,0).
18. The ellipse x 2 + 4i = 4 in the plane z = 0, counterclockwise as viewed from the point (0,0,1).
19. The ellipse i + 4z 2 = 8 in the plane x = 2, counterclockwise as viewed from the point (4,0,0).
20. The square bounded by the lines x = ±1, Y = ±1 in the plane z = 0, counterclockwise when viewed
from (0,0,1).
21. The boundary of the triangle determined by the portion of the plane x + y + z = 1 in the first octant,
counterclockwise when viewed from (2,2,2).
Decomposition of a Regular Domain in E2. In the following exercises, decompose the given region D in
E2 into a finite number of x-domains or y-domains. Report a parameterization for each subdomain, including
its outward normal n. The decomposition is to be a regular domain, which implies careful definition of the
normals.
22. The set D is the region in E2 bounded by the x-axis, the line x = 1, and the curve y = x 3
23. The set D is the unit square in the first quadrant.
25. The set D is the disk at (0,0) with radius 1 in the upper half-plane plus the region inside the rectangle
y = 0, y = -1, x = -1, x = 1 in the lower half-plane.
26. The set D is the circle with center 0, -2) and radius 4.
27. The set D is the region remaining when the triangle bounded by x = 0, x = 2 andy = x is removed
from the square bounded by x = 0, x = 3, Y = 0, Y = 3.
Decomposition of a Domain in E3. In the following exercises, decompose the given domain D in E3 into
a finite number of z-domains. Report with each z-domain the outward normal n to the boundary surface.
28. Domain D is the parallelepiped bounded by the planes x = 0, x = 1, Y = ±1, z = ±l.
31. The domain D is the region between the cone 4x 2+4i = Z2 and the circular cylinder (x-l)2 +y2 = 1
in the half-space z ::: o.
Green's Theorem. In the following exercises, compute directly the two sides of Green's formula
and verify equality of II and 12. In each let D be the set x 2 +i ::: 1 and take C to be the circle ret) =
cos t i + sin t j, 0 ::: t ::: 2T(.
32. P = y, Q = -x.
33. P = -y, Q = 0
6.5 INTEGRAL THEOREMS OF VECTOR CALCULUS 429
and verify that hand 12 equal. In this formula, C is the oriented boundary of D and n is the outward unit
normal vector to D.
36. The vector field is F = (y - x)(i - j) and D is the square bounded by x = 0, x = 1, Y = 0, Y = l.
37. The vector field is F = (4y +x 2 )i +ij and Dis the square bounded byx = O,x = 1,y = O,y = l.
38. The vector field is F = (x + y)i + x 2j and D is the triangle bounded by y = 0, x = 2, Y = x.
39. The vector field is F = xi + (x + y)j and D is the triangle bounded by y = 0, x = 2, Y = x.
Stokes' Theorem in E2. In the following exercises, compute directly the integrals on the two sides in
Stokes' theorem
h= fin (V x F) . k dA, 12 = i (F . T) ds
and verify by direct integration that Ii and 12 are equal. In this formula, C is the oriented boundary of D
and T is the unit tangent vector to C.
40. The vector field is F = (y - x)(i + j) and D is the square bounded by x = 0, x = 1, y = 0, y = 1,
oriented counterclockwise as viewed from (0,0,1).
41. The vector field is F = (4y + x 2 )i + yj and D is the square bounded by x = 0, x = 1, Y = 0, Y = 1,
oriented counterclockwise as viewed from (0,0,1).
42. The vector field is F = (x + y)i - xj and D is the triangle bounded by y = 0, x = 2, y = x, oriented
counterclockwise as viewed from (0,0,1).
43. The vector field is F = xi + (x + y)j and D is the triangle bounded by y = 0, x = 2, Y = x, oriented
counterclockwise as viewed from (0, 0, 1).
Divergence Theorem in E3. In the following exercises, compute the surface integral in the Divergence
theorem of Gauss,
f f In div FdV.
In this formula,S is the oriented boundary of D and n is the outward unit normal vector to D.
44. The vector field is F = xi + yj + zk and D isx 2 + i + z2 ~ 1.
45. The vector field is F = 2xi - yj + zk and D is x 2 + i + z2 ~ 4.
46. The vector field is F = xzi + xyj + xyk and D is the parallelepiped bounded by the planes x = 0,
y=O,z=O,x= l,y= 1,z=2.
430 6 VECTOR ANALYSIS
47. The vector field is F = xyi + xzj + yzk and D is the parallelepiped bounded by the planes x = 0,
y = 0, Z = 0, x = 1,y = 1, z = 4.
Divergence Theorem in E3. In the following exercises, compute the triple integral and the surface integral
in the Divergence theorem of Gauss to show without appeal to the theorem that 11 = 12. In this formula, S
is the oriented boundary of D, and n is the outward unit normal vector to D.
48. The vector field is F = 2yj + zk and D is the interior of the sphere x 2 + i + z2 = 1.
49. The vector field is F = 3yj and D is the interior of the sphere x 2 + i + Z2 = 4.
fIs (V x F) . n dA
in Stokes' theorem. In this formula,S is a smooth oriented surface in E3 with boundary curve C oriented
counterclockwise relative to the outward unit normal vector n to the surface S.
52. The vector field is F = yi + zj and the oriented parabolic surface S is r(u, Y) = v cos u i + v sin u j +
(I - v2 )k on D: °
direction of ru x rv.
°
:<:: u :<:: 2n, :<:: v :<:: I, with boundary curve C x 2 + i
= 1 and normal in the
53. The vector field is F = xj and the oriented parabolic surface S is r(u, v) = v cos u i +v sin u j + (1- y2)k
° ° +i
on D: :<:: u :<:: 2n, :<:: v :<:: 2, with boundary curve C x 2 = 2 and normalin the direction of ru x rv.
54. The vector field is F = xj and the oriented surface S is the cylinder x 2 +
top x 2 + i :<:: 1, z = I, with normal n equal to k on the top.
i = I, °:<:: z :<:: I, plus its
55. The vector field is F = zyi, and the oriented surface S is the cylinder x 2 + i = I, 0:<:: z :<:: 2, plus its
bottom x 2 + i :<:: I, z = 0, with normal n equal to - k on the bottom.
Stokes' Theorem in E3. In the following exercises, compute the oriented line integral and the surface
integral
in Stokes' Theorem, in order to verify lj = b without appeal to the theorem. In this formula, S is a smooth
oriented surface in E3 with boundary curve C oriented counterclockwise relative to the outward unit normal
vector n to the surface S.
56. The vector field is F = yi, and the oriented parabolic surface S is r(u, v) = v cos u i + v sin u j + (1- v2 )k
onD:O:<:: u:<:: 2n,0:<:: v:<:: 1,withboundarycurveCx2 +i = landnormalinthedirectionofruxrv.
6.6 X-RAY DIFFRACTION AND CRYSTAL STRUCTURE 431
57. The vector field is F = xj, and the oriented parabolic surface 5 is r(u, v) = v cos u i +v sin u j +0- v2 ) k
on D: 0 :::: u :::: 2n,0 :::: v :::: 2, with boundary curve C:x 2+i
= 4 and nOTIllal in the direction of ru xrv.
58. The vector field is F = yi and the oriented surface 5 is the cylinder x 2 +i = 1, 0 :::: z :::: 1, plus its
top x 2 + i ::1, Z = 1, with outward normal n equal to k on the top.
59. The vector field is F = 2xj and the oriented surface 5 is the cylinder x 2 +i = 1,0 :::: Z :::: 2, plus its
bottom x 2 + i ::
1, Z = 0, with normal n equallo k on the bottom.
Green's Area Formulas. The next four exercises derive and apply the basic area formulas of Green, which
reduce two-dimensional area problems in E2 to a one-dimensional calculation of a line integral.
60. Verify that the area of region D in E2 is given by
Area of D = ~ { x dy - ydx.
2lc
61. Verify that the area of region D in E2 is given by
AreaofD= !cXdY.
62. Verify that the area of region D in E2 is given by
Area of D = !c -y dx.
63. Verify that the area enclosed in E2 by the astroid C defined by r = (cos 3 t)i + (sin 3 OJ, 0 :::: t :::: 2n, is
Additional Proofs. The text omits some details, which are filled in by the follOwing exercises.
64. (Green's Formula any-Domains) Let P(x,y) and Py(x,y) be continuous in a y-domain D of E2 with
boundary C. Prove Green's formula
65. (Green's Theorem Cancellations) Let D be a regular domain and let Q, Qx, P and Py be continuous
in D. Prove that when two x-domains, or two y-domains, share a common boundary arc then the
corresponding terms in the integral
[Pdx+Qdy
cancel in pairs.
a3
~
: .... ........ ... :::::;. :.........::..... ::::: FIGURE 35. A crystal lattice with atoms at
R = n,8, + nz8z + n383
the spacing of the atoms was much too small to be resolved by optical microscopes. In
1895 We. Roentgen discovered X-rays, which proved to be electromagnetic waves of
very short wavelengths. In 1912 the German physicist Max von Laue validated Bravaiss
model by showing that crystals diffract X-rays like a three-dimensional diffraction grating.
For example, it was found that salt (Nact) has a cubic lattice with spacing 4.08 A (where
1 A = 1 angstrom unit = 10-8 cm). Thus a 1 cm3 salt crystal has about 24.5 million
atoms along each edge, for a total of about 1.47 x 10 22 atoms. Here we shall sketch how
X-ray diffraction data for a crystal can be used to determine its crystal structure.
Crystal Lattices. A crystal lattice is a set of vectors in E3 of the form
where aI, a2, a3 are linearly independent (so al x a2 . a3 i- 0) and n1, n2, n3 run through
the integers. A portion of a crystal lattice is shown in Figure 35.
Figure 36 shows a portion of a 2-dimensional lattice and three families of parallel
lines that pass through the lattice points. The same phenomenon occurs in 3-dimensional
lattices but is more difficult to depiCt.
.
: __ F;ll1ilY
~ .....•.
1__ : ___ :_
. ..
~~---
.-.
. , , ,
, : : : :
---
--~-- -
" ----e
The calculation of a crystal lattice from X-ray diffraction data of a sample depends
on two facts.
(a) The atoms is a crystal lattice can be arranged in stacks of parallel equispaced families
in (infinitely) many different ways.
(b) Each such stack of parallel lines will reflect a beam of X-rays of a suitable wavelength.
The remainder of this section will show how this can be done.
(2) a l . al = 1, a l . a2 = 0, a l . a3 = 0,
a 2 . al = 0, a 2 . a2 = 1, a 2 . a3 = 0,
a 3 . al = 0, a 3 . a2 = 0, a3 . a3 = 1.
It is easy to see that these conditions determine a I , a 2, a 3 uniquely. To find them, note
that the vector a2 x a3 is orthogonal to a2 and a3, and hence
is orthonormal, then
434 6 VECTOR ANALYSIS
so orthonormal bases are self-dual. Secondly, equations (2) are symmetric in (aI, a2, a3)
and (aI, a 2, a 3). Hence we may interchange the two bases in (2), which implies that
(5)
To calculate the coefficients (A I, A 2 , A3 ) we take the scalar product of A with the dual
basis vectors and use (2). This gives
(6)
are the lattice points when nl, n2, n3 are integers. Let aI, a 2, a 3 be the dual basis. Then
(7)
defines a second lattice, called the dual lattice. The dual lattice is important for the
calculation of crystal structure because of the following theorem.
Theorem. Every dual lattice vector N = N(ml, m2, m3) is a normal vector to a family
of parallel crystal planes.
(8)
(9)
6.6 X-RAY DIFFRACTION AND CRYSTAL STRUCTURE 435
lie on a plane if
(0)
If L is any common multiple of ml, m2, m3 (so L is divisible by each) then (0) has
integral solutions fll, fl2, fl3 and the plane contains three lattice points. But then, by (0),
and similarly,
be a dual lattice vector whose components (ml, m2, m3) have no common factors. Then
the family of parallel crystal planes with normal N = N(ml, m2, m3) is given by the
solutions of
In particular, for L = 1,
1
(ll) d = d(ml,m2,m3) = ------
IN(ml,m2,m3)1
is the distance between adjacent parallel planes.
For the proof, we have
N 1
-·R=-=d.
INI INI
X-Ray Beams and Their Vectors. In physical optics a collimated beam of X-rays
may be characterized by a unit vector u (beam direction) and a wavelength A. However,
it is customary to use instead
2IT
k= - = the wave number
A
436 6 VECTOR ANALYSIS
and
0.2A:s A :s lo2A.
Reflection of an X-Ray Beam by a Crystal Plane. Let an X-ray beam with wave
vector k be reflected from a crystal plane with normal N. Then by the laws of geometrical
optics a reflected beam is produced with wave vector k', where k, N, and k' lie in a plane
e
and the angle of incidence equals the angle ofreflection; see Figure 37.
This can be written as
The Bragg Condition. Crystal planes occur as stacks of parallel planes. In general,
reflected beams from two or more parallel planes will tend to cancel by interference,
producing a weak reflected beam. A strong reflected beam will occur only when reflected
beams from adjacent planes are in phase and reinforce one another. A criterion for this
to happen is the equation
., ....
k .... ..... k'
t
d
adjacent
crystal
•
~
planes
FIGURE 38. Reflection by adjacent
+ crystal planes
The Bragg Condition in Vector Form. We may write the Bragg condition as
k cos e = nrr/d, or (see Figure 37)
k'.~=-k.~= nrr
INI INI d'
or, since d = 1I1NI,
or
where
(15)
Crystal Structure Determination. The vector Bragg condition (14) gives all
possible diffracted X-ray beams k' corresponding to a given incident beam k and a given
438 6 VECTOR ANALYSIS
IExercises.6.6. !
,. ""." . .·.,h .. , < ......~
1. (Counting Salt Crystal Atoms) Find approximately the number of atoms in a salt crystal having the
shape of a parallelepiped with edges i + j, i + 2j, j + 3k, all units in centimeters.
2. (Modeling of a Crystal Lattice) A portion of a crystal lattice with orthogonal basis vectors can be
modeled by stacking a number of identical blocks (parallelepipeds). How should the dimensions of a
block be related to the basis vectors aj, a2, a3 of the crystal lattice?
6.6 X-RAY DIFFRACTION AND CRYSTAL STRUCTURE 439
3. (Dual Basis of a Crystal Lattice) A crystal lattice is defined by the set of orthogonal vectors a1 = i +j.
a2 = i - j, a3 = k. Compute the dual basis a I , a 2 , a 3 .
4. (Dual Basis of a Crystal Lattice) A crystal lattice is defined by the set of vectors a1 = i +j + k, a2 = i -j,
a3 = k. Compute the dual basis aI, a 2 , a 3
5. (Crystal Plane Equation) A crystal lattice is defined by the set of vectors a1 = i + j, a2 = i - j, a3 = k.
Find a crystal plane N . r = L containing aI, 3a2 and 4a3.
6. (Crystal Plane Equation) A crystal lattice is defined by the set of vectors a1 = i + j + k, a2 = i - j,
a3 = k. Find a crystal plane N . r = L containing Sa], 3a2 and 4a3.
7. (Family of Parallel Crystal Planes) A crystal lattice is defined by the set of vectors a1 = i +j, a2 = i -j,
a3 = k. Find a family of parallel crystal planes N . r = L (L an integer) containing aI, 3a2 and 4a3.
8. (Family of Parallel Crystal Planes) A crystal lattice is defined by the set of vectors a1 = i +j + k, a2 =
i - j, a3 = k. Find a family of parallel crystal planes N . r = L (L an integer) containing Sa 1, 3a2 and 4a3.
9. (Reflected X-Ray Beam) A family of parallel crystal planes is given by 12n1 + 3n2 + 4n3 = L, where L
is an integer, for the crystal lattice defined by the set of vectors a1 = i + j + k, a2 = i - j, a3 = k. Find
a reflected X-ray beam k' given incident beam k = 2a 1 + a2, assuming that the Bragg condition holds.
10. (Reflected X-Ray Beam) A family of parallel crystal planes is given by 12n] + 20n2 + ISn3 = L, where
L is an integer, forthe crystal lattice defined by the set of vectors a1 = i +j +k, a2 = i -j, a3 = k. Find a
reflected X-ray beam k' given incident beam k = rra] + 2rra2, assuming that the Bragg condition holds.
CHAPTER 7
Partial Differential Equations
of Mathematical Physics
Chapters 2 through 4 of this text developed solution methods for physical problems that
are governed by ordinary differential equations. The purpose of Chapters 7 through 9 is
to extend these methods to problems that are governed by partial differential equations.
Partial differential equations have been the subject of vigorous mathematical research
for over 250 years and remain so today. A systematic and complete coverage of this
subject is far beyond the scope of this text. Instead, the equations to be solved here will
be introduced in the context of several explicit applications. This has the advantage of
leading to both the differential equations and the auxiliary conditions (initial conditions,
boundary conditions) that are needed to Single out the unique solutions to specific
physical problems. These ideas are developed in the present chapter. The actual solution
procedures are presented in Chapters 8 and 9.
442 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
These assumptions will now be discussed and made more precise. First, the thinness
assumption of (I) is interpreted to mean that the string position at any time can be
modeled as a mathematical curve in space. The uniformity assumption means that the
physical characteristics of the string (diameter, mass density, elastiCity, etc.) are constant
over the length of the string. It will be shown that such a string is characterized by three
parameters:
x=O x=L
FIGURE l. Small transverse vibrations of an elastic string
lJean Ie Rond d'Alembert 0717-1783), one of the leading French mathematicians of the eighteenth century.
7.1 VIBRATING STRINGS 443
To interpret assumption (II), let (x,y) be rectangular coordinates in the plane of the
motion, chosen so that when the string is at rest it occupies the interval 0 S x S L
of the x-axis. Then a lateral vibration of the string is one for which each particle of the
string with rest position (x,O) moves in time t to a new position (x, u(x, t)) with the
same x-coordinate. Thus each particle moves orthogonally to its rest position.
The Wave Function u(x, t). If observation of a vibrating string begins at time t = 0
then the prediction of its future motion is equivalent to computing the function u(x, t) of
the two independent variables x and t for all x with 0 S x S L (all points of the string)
and all times t > 0 (all future times) For fixed Xo the graph of y = u(xo, t), t :::: 0, gives
a history of the motion of particle Xo. In particular, the partial derivatives Ut(xo, t) and
Utt(xo, t) are the velocity and acceleration of that particle. For a fixed time to, the graph
of y = u(x, to), 0 s x S L, is a stroboscopic image, or "snapshot," of the position of the
string at time to. In particular, the partial derivative uxCx, to) is the slope of the string at
particle x and time to.
The Tension Vector T(x, t). The forces on any particle of the string are those
transmitted by their neighbors through the constant tension T. It will be convenient to
write
(2) T(xo, t) = force of portion x :::: Xo of the string on the portion x S Xo.
Then T(x, t) lies in the (x,y)-plane and its magnitude is the prescribed tension T:
Finally, assumption (IV) will be interpreted to mean that the slope (5) remains so small
that its square u; is negligible in comparison with 1(u; « 1) Combining this and
Taylor's theorem gives
(6) cos8
sin 8 Ux cos 8 = Ux + O(u~) ~ Ux .
444 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
This will be applied to an arbitrary portion [xo, xII of the string, where 0 :s Xo < Xl :s L.
A small segment [x,x + ~xl has mass p~x, velocity Ut(x, t) and momentum pUt~x.
Thus the usual calculus argument gives the integral
(9) M(t) =
IXo
XI pUt (x, t)dx
for the momentum of the portion [xo, xII. This momentum is entirely in the y-direction,
since the displacement of each particle is orthogonal to the x-axis by assumption (II).
The y-component of the net external force on the portion [xo, xII is
(10)
(ll)
or
(12)
string
x
FIGURE 2. Tension vector for a deflected
xo string at fixed time t
7.1 VIBRATING STRINGS 445
y = u(x, t)
T
T string
TUx(XQ, t)
x
FIGURE 3. Vertical forces on an
XQ arbitrary string portion [xo, x,]
(13)
(14)
If.
where
(15) c=
Some Product Solutions. It is easy to see that d'Alembert's equation (14) has
infinitely many solutions. In particular, if J.L is any positive constant, then the products
Only the first two of the product solutions (16) satisfy u(O, t) = O. These also satisfy
u(L, t) = 0 if and only if J.L = mr/L, where n is a positive integer. This gives the following
family of solutions of (14), (17) for n = 1, 2, 3, ... :
(20) u(x, t) =
n=l
where the an's and bn's are constants and Wn = mr/L. The corresponding string velocity
is obtained by t-differentiation:
For any choice of the an's and bn's (such that the series converge), (20) defines a possible
mode of vibration of the string. This is called the superposition principle.
Need for Initial Conditions. Experience suggests that to predict the future states
of the string it is necessary to know the initial state:
and
If an's and bn S can be found such that these equations hold then by inserting them in (20)
the solution u(x, t) is obtained. It is natural to ask which functions can be so represented.
7.1 VIBRATING STRINGS 447
The surprising answer is, essentially any function. This is shown in Chapter 8, where a
method for finding the an's and bn's is derived.
The Boundary Value Problem for u(x, t). The preceding discussion suggests that
the wave function u(x, t) for the vibrating string is characterized by the following
properties:
where j(x) and g(x) are prescribed. This is an example of a boundary value problem.
A solution of the partial differential equation in the strip °: :
x ::: L, t ~ is sought °
that satisfies the boundary conditions (27) and (28) along the boundary of the strip.
It is suggested above that these conditions determine u(x, t) uniquely However, it is
conceivable that other conditions necessary for uniqueness have been missed. It will be
shown next that this is not the case by proving the following uniqueness theorem.
Uniqueness Theorem. The boundary value problem (25)-(28) has at most one solution.
This means that if Ul (x, t) and Uz (x, t) are any two solutions of (25)-(28) with the same
j(x) and g(x) then
(29)
The uniqueness theorem is proved below. The proof is based on the following energy
calculation.
Total Energy of the Vibrating String. The energy of the string is a sum of the
kinetic energy (KE.) due to the velOcity v = Ut and the potential energy (PE.) due
to the stretching of the string from rest against the tension T. To calculate them, let
(x, x + ilx) be an element of the string. Then its mass is m = pilx and its velOCity is
v = Ut. Hence its K E. is ~ mv 2 = ~ pu~ ilx. Integrating over the string gives
For the same element (x,x + llx) the stretched length is ds - llx, where ds is given by
elementary calculus as
Conservation of Energy Theorem. The solution of the boundary value problem for
the vibrating string satisfies
For the proof, the derivative E'(t) will be calculated and shown to be zero for t ::: O.
Differentiating (33) and using U tt = (2Uxx and (2 = Tip gives
(35) E'(t) = i L
(pUtUtt + IUxuxt) dx
i L
(Iutu xx + Tuxuxt ) dx
T i L
(Utux)xdx
Now, the boundary condition (27) implies that Ut(O, t) = 0 and ut(L, t) = O. Hence
E' (t) = 0 and E(t) = constant, as stated.
Proof of the Uniqueness Theorem. Let Uj (x, t) and U2(X, t) be the two solutions of the
theorem and define
(36)
7.1 VIBRATING STRINGS 449
Then the linearity of the differential equation and boundary conditions implies that
u(x, t) is also a solution of the boundary value problem (25)-(28) but with zero initial
values for 0 :::; x :::; L:
(39) P i L
u;(x, Odx +T i L
u;(x, Odx =0 for t ~ O.
Here the two integrands are nonnegative and hence it must be that
This clearly implies that u(x, t) = k (a constant) for 0 :::; x :::; L, t ~ O. But u(x, 0) =0
and hence k = O. This shows that u(x, t) = Ut (x, t) - U2(X, t) = 0, or
x=O x=L
FIGURE 4. String fixed at the right end, free to slide at the left
450 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
rod. Since T =f. 0, the string must satisfy the boundary condition
(42) ux(O, t) = ° for t 2: 0.
Similarly, the other end of the string may be fixed as before (so u(L, t) = 0) or may also
°
be free to slide along a rod (so ux(L, t) = 0).
Still another way to attach the string at x = is to let it slide along a vertical rod,
as above, and attach it to the origin with a spring with Hooke constant k. Then the y-
°
component of the tension at x = must balance the restoring force of the spring, giving
the boundary condition
(46)
(47)
The possible boundary conditions are the same as for the uniform strings, as are the energy
theorem and the uniqueness theorem. Of course, it is more difficult to find solutions of
(47) if P and/or T are not constants.
7.2 HEAT DIFFUSION IN RODS: FOURIER'S HEAT EQUATION 451
I Exercises 7.1 I
1. Use Taylors theorem to show that if Ux = tane then cose = 1 + O(u~) and sine = Ux + O(u~).
2. Verify that each of the four products in equation (16) is a solution of d'Alembert's equation (26)
3. Show that the normal mode solutions of the vibrating string problem, defined by (18), are periodic
functions of t with common period P = 2L./plf.
° ° °
4. Prove the uniqueness theorem forthe vibrating string for the case where the boundary condition u(O, t) =
is replaced by ux(O, t) = and the boundary condition u(L, t) = is unchanged.
°
5. Prove the uniqueness theorem for the vibrating string for the case where the boundary conditions are
given by Tux (0, t) = ku(O, t) and ux(L, t) = 0, with k >
6. Prove the uniqueness theorem for the nonuniform string, governed by (47), with boundary conditions
u(O, t) = 0, u(L, t) = 0. Suggestion: show that
1 (L
E(t) = 2: 10 {p(x)ut(x,t) + T(x)u~(x,t)} dx = constant.
(I) The rod has length L and x is the distance along the rod axis measured from
one end, so that 0 ::: x ::: L.
(II) The temperature T in the rod is a function of the time t and the coordinate x only:
2Jean Baptiste Joseph Fourier (1768-1830), French physicist and mathematician. His book Theone analytique de
la Chaleur, published in 1822, had a decisive influence on the development of mathematics and mathematical physics in
the nineteenth century.
452 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
where j(x) is a prescribed function. Assumptions (I) and (II) define the temperature
function u(x, t). The goal is to calculate it. Assumption (III) is needed for consistency.
If there is heat flow across the lateral boundary then T = u(x,y, z, t) will vary with the
three spatial coordinates and t. Assumption (IV) means that the physical parameters
that characterize the rod material are all constants, independent of X and t. They are
listed below. Assumption (V) eliminates heat sources that, if present, would modify the
temperature distribution.
Fourier's Laws of Heat Conduction. Fourier's analysis of heat diffusion was based
on two physical laws that he discovered. To formulate them, let q(x, t) denote the heat
flux in the rod at position x and time t. Thus q(x, t) is the number of calories of heat
energy per unit area per unit time that are passing point x at time t. Then Fourier's first
law is the following:
Fourier's Heat Flux Law.
~u)~ ________________~)~
FIGURE 5. Homogeneous heat-conducting rod
x=o and coordinate x
7.2 HEAT DIFFUSION IN RODS: FOURIER'S HEAT EQUATION 453
A slice [x,x + ~xl of the rod has volume A~x, mass pA~x and hence cpA~x is the
number of calories needed to raise its temperature 10. Thus the heat content of the slice
at temperature u(x, t) is u(x, t)pcA~x and integration over Xo S X S Xl gives
with t-derivative
(7) d Q(t,xo,XI) =
-d
t
lX'
Xo
Ut(x,t)cpAdx.
(8)
°
for all Xc, Xl with S Xo S Xl S L. Differentiating (8) with respect to Xl and then
making Xl = X gives, after division by cpA,
-
Xo
x=O -
Aq(xo , b)
x=L
where
K
(10) K=-.
Cp
Equation (9) is Fourier's partial differential equation for heat diffusion. The positive
parameter K is known as the thermal diffusivity. It is a function of the rod material.
The Boundary Value Problem for the Rod. The temperature in the rod can now be
calculated by solving the boundary value problem of finding a function u(x, t) such that
where j(x) is prescribed. These properties completely determine the function u(x, t).
This is confirmed by the following uniqueness theorem.
Uniqueness Theorem. The boundary value problem (11)- (14) has at most one
solution.
Proof. As in the case of the vibrating string, consider two solutions of (11)-(14), say
Ul (x,t) and U2(X, t), having the same initial values j(x). To show that Ul (x, t) and U2(X, t)
must be identical, it is enough to show that the function u(x, t) = Ul (x, t) - U2 (x, t) is
zero for 0 S x S Land t :::: O. Now, u(x, t) is a solution of (l1)-(14) with initial values
u(x,O) = O. Thus if
(16)
The integrated terms are zero by the boundary condition (13). Thus
But J(t) ::: 0, and hence let) = 0 for 0 ::: t < 00. This can only happen if u(x, t) = 0 for
o ::: x ::: L, t ::: 0, which completes the proof.
Other Boundary Conditions. Other physical arrangements at the ends of the bar
lead to other boundary conditions. Thus if the bar is insulated at x = 0, then there is
no heat flow there. This means that q(O, t) = 0, which, by Fourier's heat flux law (4), is
equivalent to the boundary condition
Another possibility is that heat is radiated out at x = 0 following Newton's law of cooling.
The law states that Flux out = k[Uin - uoutl where k > 0 is a material constant. Applied
at x = 0 with U out = 0 the law gives
which is isomorphic to (44) in Section 7.l. Clearly these boundary conditions can be
prescribed independently at the two ends of the rod, leading to six different boundary
value problems. It can be shown that the uniqueness theorem and its proof extend to all
of them. Several will be solved in Chapters 8 and 9.
Nonuniform Rods. The heat equation must be modified if one or more of the bar
parameters c, p, A, or K varies with x. As an example, suppose that the cross-sectional
area A is variable:
(24)
456 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
(25)
Exercises 7.2
2. Show that nonzero product solutions of Exercise 1 satisfy the boundary conditions u(O, t) = °and
° °
u(L, t) = if and only if A = and /L = mrlL, where n is a positive integer.
3. Prove the uniqueness theorem for heat diffusion in a rod if the boundary conditions are u(O, t) = ° and
ux(L,t) = 0.
I(t) = -1
2 0
la L
u2 (x, t)A(x) dx
(I) Rectangular coordinates X and yare defined in the plane of the plate. The plate
occupies a connected region Q in the (x,y)-plane.
(II) The temperature T in the plate is a function of x, y and the time t:
7.3 HEAT DIFFUSION IN PLATES 457
Fourier's Heat Conservation Law. To express the conservation law, let D denote
any subset of the plate n and let Q(t, D) be the heat content of D at time t, measured
from temperature zero. Then Fourier's second law states that
d
(5) dt Q(t, D) = Net flux of heat into D.
x
FIGURE 7. Homogeneous heat-conducting plate Q in E2
458 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
A partial differential equation for u(x,y, t) will be derived by expressing the two sides of
(5) in terms of u. On the left,
in analogy with (7), Section 7.2, where A is the constant plate thickness, c is the specific
heat and p is the mass density (mass per unit volume). On the right, the net flux is obtained
by integrating the normal component of q over aD, the boundary curve of the region D.
If n = n(x,y) denotes a unit normal vector to aD at (x,y) pointing out of D, then
where
(10)
(II) Il (CPUt - KV 2 u) dx dy = 0.
The equation must hold for all subsets D of the plate n and all times t ::: O. This can
happen only if the integrand is zero. For if it were positive at some point (xo,Yo) in n
then by continuity it would be positive in some disk D centered on (xo,Yo) and (ll)
would fail. The same argument holds if it is negative. Thus (11) implies Fourier's heat
equation for plates:
Uniqueness Theorem. The boundary value problem (13)- (16) has at most one
solution.
Proof. The proof is almost the same as that in Section 7.2. If Uj and U2 are two solutions
of (13)-(16) with the samefex,y) then U = Uj -U2 is also a solution but withfex,y) = 0
in n. Hence it is enough to show that iff = 0 then U = O. To do so, consider the double
integral
so l(t) 2: 0 and 1(0) = O. Taking the t-derivative of let) and using the heat equation (14)
gives
au
(22) Vu . n = - =0
an
for (x,y) in an, t :::: 0,
where n = n(x,y) is the outward unit normal to an at (x,y). Equation (22) defines
au/an as the normal derivative of u at point (x, y) in an. Still another boundary condition
requires
(23) u o for(x,y)onC1,t::::0,
au
o for (x,y) on C2 , t :::: 0,
an
where an = C1+C 2 . This is called a mixed boundary condition. Yet another boundary
condition is to let the boundary radiate into an exterior at zero temperature. In this case
Newton's law of cooling gives the boundary condition
au
(24) K- + ku = 0
an
for (x,y) in an, t :::: 0,
where k > 0, in analogy with (22), Section 7.2. The uniqueness theorem extends to all
of these cases. The proofs are left for the Exercises.
I Exercises 7.3
;;0: ~
t
<4 G/WL, % ~~,~ lvrn"",,==
1. Prove the uniqueness theorem for heat diffusion in plates if the boundary condition (15) is replaced by
au/an = 0 on the boundary, t ::: 0
2. Same exercise as 1 if (15) is replaced by (23).
4. Show that the heat equation for plates, Ut = K(Uxx + Uyy), has product solutions
5. Show that if n is the unit square, defined by 0 ::: x ::: 1, 0 ::: y ::: 1, then the product solutions of
Exercise 4 satisfy the zero boundary condition on an if and only if )., = mr and fL = mJ!', where nand
m are positive integers.
7.4 STEADy-STATE HEAT DIFFUSION IN PLATES 461
where j(x,y) is a prescribed function on the boundary aQ. The following theorem
guarantees that these properties completely determine u(x, y).
Uniqueness Theorem. The boundary value problem (2) - (4) has at most one solution.
Proof. As in Section 7.3, it will suffice to show that if j(x,y) = 0 on aQ then u(x,y) = 0
in Q. To verify this, apply equation (19) of Section 7.3 to u. Since (3) holds, one gets
The boundary value problem (2)-(4) is usually called the Dirichlet problem. It will
be solved in Chapter 9 for plates of several special shapes including rectangles, circular
462 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
disks and related shapes. For more general shapes the problem is difficult and often can
be solved only by numerical methods.
Other Boundary Conditions. The boundary value problem obtained by replacing
the Dirichlet boundary condition (4) by the condition
(7)
au = j(x,y)
- for (x,y) on an
an
is called the Neumann problem. Physically, (7) is equivalent to specifying the heat flux
across the boundary an. A third possibility is
(9) K-
au + ku = kj(x,y) for (x,y) on an
an
defines the Robin problem. It corresponds phYSically to heat diffusion with radiation
into a medium with temperature j(x,y). The uniqueness theorem holds for the mixed
and Robin problems. The solution of the Neumann problem is not unique, but any two
solutions differ only by a constant. These questions are taken up in the Exercises.
Other Applications. The Laplace equation and the boundary value problem defined
above occur in many other applications. These include the analysis of gravitational,
electric and magnetic potentials, and theories of ideal fluid flow.
IExercises 7.41
1. Supply the details for identity (5).
2. Apply the divergence theorem carefully to obtain from identity (5) the resulting equation (6).
3. For the Neumann problem, show that if Ul(X,y) is a solution and c is a constant then U2(X,y) =
U1 (x, y) + c is a second solution. Hence the Neumann problem does not have a unique solution.
4. For the Neumann problem, show that if Ul (x,y) and U2(X,y) are two solutions with the same f(x,y)
then there is a constant c such that U2(X,y) - Ul (x,y) = c.
5. Prove the uniqueness theorem for the mixed problem.
6. Prove the uniqueness theorem for the Robin problem (assuming that K and k are positive).
7.5 VIBRATIONS OF DRUMS 463
The thinness assumption of (I) is interpreted to mean that the membrane position at
any time can be modeled as a mathematical surface in space. The uniformity assumption
means that the physical parameters of the membrane (mass denSity, tension) are constant.
To interpret assumption (II) let (x,y, z) be Cartesian coordinates in space, chosen such
that the membrane at eqUilibrium occupies a domain Q in the (x,y)-plane. Then (II) is
interpreted to mean that each point (x,y, 0) of the resting membrane is found at time t at
a point (x, y, u(x, y, t)). The problem of predicting the membrane motion then becomes
the mathematical problem of calculating the wave function u(x,y, t). The condition that
the membrane boundary be fixed becomes the boundary condition u(x,y, t) = 0 for
(x, y) on an and t ::: O. Assumption (III) is interpreted to mean that the membrane offers
no resistance to either bending or shearing. This hypothesis is used below to calculate
the tension. Finally, assumption (IV) will be interpreted to mean that the membrane
displacements are so small that the deviation of the surface normal vector
and N is the unit normal vector to the membrane, with positive z-component.
The Tension Vector T(x,y, t). Fix a time t and consider on the membrane surface
z = u(x,y, t) a small element of arc C with length ds and unit tangent vector t. If the
membrane were cut along C, the two sides would pull apart. Thus if the two sides of C
are denoted by 1 and 2, then side 1 exerts a force T(x,y, t)ds on side 2, and side 2 exerts
the balancing force - T(x,y, t)ds on side 1. It will be shown that the four assumptions
listed above determine T(x,y, t) completely. First, T(x,y, t) must lie in the tangent plane
to the membrane at the point (x,y, u(x,y, t)), because the membrane does not resist
bending This property is equivalent to the condition that T(x,y, t) is orthogonal to the
normal vector N at the point. Second, T(x,y, t) must be orthogonal to the unit tangent
vector t, because the membrane does not resist shearing. Finally, IT(x,y, t)1 = To is a
fixed positive constant by the uniformity assumption. It follows that
be the projection of C onto the (x,y)-plane, with arc length s as parameter (see Figure
9). Then
(4) to = (x'(s),y'(s),o)
and
are the unit tangent and normal vectors to Co, respectively It follows that C has the
representation
and hence
The vector t is also a unit vector because X'(S)2 + y'(s)2 = 1 and hence (uxX'(S) +
uyy'(S))2 « 1 by assumption (IV). A short calculation using u~ + u; « 1 gives
where aulan = Uxnl + uyn2 is the normal derivative of u on Co. Combining this with (3)
gives
Application of Newton's Second Law. Euler's wave equation for the vibrating
drum will now be derived by applying Newton's second law to an arbitrary portion of the
membrane. For this purpose let D be any portion of Q whose boundary aD is a simple
closed curve. The momentum in the z-direction of the portion of the membrane lying
466 7 PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS
over D is
where p is the mass density of the membrane. By (9), the z-component of the net force
on the membrane portion over D is
(ll) F(t) = 1
aD
au
To- ds,
an
(13)
By a now-familiar argument, this can hold for all subsets D of Q only if the integrand is
zero, or
(14)
where j(x,y) and g(x,y) are the initial displacement and velocity of the membrane,
respectively The completeness of these properties will be verified by proving the
uniqueness theorem below.
Uniqueness Theorem. The boundary value problem (15)-(18) has at most one solution.
7.5 VIBRATIONS OF DRUMS 467
Proof. As for the vibrating string problem, it is enough to show that if (15)-(18) hold
withf = g = 0 then U = O. This may be done by showing that the energy integral
(19) E(t) = ~ fin (PU~ + To(u; + U~») dx dy
is constant for all t ::: 0 and is zero iff = g = O. First, taking the t-derivative of E(t) and
using the wave equation (16) gives
To ( ut(V'u· n) ds,
Jar:!.
where the last step follows from the divergence theorem. Now (17) implies that
Ut(x,y, t) = 0 for (x,y) on aQ and t ::: O. Thus the last integral is zero and so E'(t) = 0
for t ::: O. Also
and in particular, E(O) = 0 iff = g = O. This proves that iff = g = 0 then E(t) = 0
for all t ::: o.
It follows that
In Chapter 9 the vibrating drum problem will be solved for rectangular and circular
drums and some related shapes. Solution methods for more general shapes may be found
in more advanced texts such as [Gal, [Ke].
1Exercises 7.51'
1. Show that Euler's wave equation (14) has product solutions
3. Prove the uniqueness theorem for the case that the Dirichlet boundary condition (17) is replaced by the
Neumann condition
3u(x,y, t) f
_--'.- = 0
an or Cx,y) on aQ, t >
- o.
4. Same as exercise 3 but with the Robin boundary condition
au
-dn + hu = 0 for (x,y) on aQ, t ::: 0,
where h > O.
(I) Rectangular coordinates in 3-dimensional Euclidean space are given by (x,y, z).
The body occupies a connected region n in (x,y, z)-space.
(II) The temperature T in the body is a function of x,y, z, and the time t.
(III) The body is homogeneous.
(IV) No heat is generated in the body
(V) The boundary an of the body is held at temperature zero.
(VI) The initial temperature is known.
7.6 HEAT DIFFUSION IN SOLIDS 469
Assumptions (I) and (II) define the temperature function u(x,y, z, t). Assumption
(III) means that the parameters K, p, and c for the body are constants. The remaining
assumptions are self-explanatory.
Fourier's Heat Flux Law. For 3-dimensional bodies the heat flux is a vector
q (x, y, z, t) in 3-dimensional space and Fourier's heat flux law takes the form
(5)
is the 3-dimensional Laplace operator. The validity of (4) for all subsets D of n gives
Fourier's heat equation for 3 dimensions:
The Boundary Value Problem for Solids. The temperature function u(x,y, z, t)
for a body n is characterized by the properties
Uniqueness Theorem. The boundary value problem (7)-(10) has at most one solution.
I,Exercises 7.6J
1. Prove the uniqueness theorem for the boundary value problem (7)-00) by the method of Section 7.3.
2. Prove the uniqueness theorem for the case that the Dirichlet condition (9) is replaced by the Neumann
condition au/an = 0 on the boundary
3. Prove the uniqueness theorem for the case that the Dirichlet condition (9) is replaced by the Robin
condition Kau/an + ku = 0 on the boundary (with K and k positive).
4. Show that the heat equation for solids, Ut = K(Uxx + Uyy + uzz), has product solutions
5. Show that if Q is the unit cube, defined by 0 :::: x :::: 1, 0 :::: y :::: 1, 0 :::: z :::: 1, then the product solutions
of Exercise 4 satisfy the zero boundary condition on aQ if and only if A = mr, fJ., = m1f, v = p1f, where
n, m and p are positive integers.
(1) U xx + Uyy + U zz = O.
Three-dimensional analogues of the boundary value problems of Section 7.4 can be
formulated, and the uniqueness of the solutions proved, just as in Section 7.4. These are
developed in the Exercises below.
7.7 STEADy-STATE HEAT DIFFUSION IN SOLIDS 471
The Dirichlet Problem. The Dirichlet problem for a connected domain n in 3-dimensional Euclidean
space asks for a function u(x, y, z) that satisfies
3. Show that if Ul (x,y, z) and U2 (x,y, z) are any two solutions of the Neumann problem with the same j
then there is a constant c such that Ul (x,y, z) = Ul (x,y, z) + c.
4. Prove the uniqueness theorem for the mixed problem in three dimensions.
5. Prove the uniqueness theorem for the Robin problem in three dimensions.
CHAPTER 8
Fourier Analysis and
Sturm-Liouville Theory
The principal method for solving the boundary value problems of mathematical physics
is the method of separation of variables. The method will be developed and applied
to a variety of problems in this chapter and Chapter 9.
Several examples of the method of separation of variables were given by D. Bernoulli,
L. Euler and others in the late eighteenth century. However, it was ].B. Fourier in his
book of 1822 who revealed the true scope of the method. Fourier based his method
on the convergence of certain infinite series, which are now called Fourier series, but
full proofs of the convergence of these series eluded him. The first rigorous proof of the
convergence of Fourier series was given by P.G.L. Dirichlet in 1829. The scope of the
method was extended to heat diffusion in nonhomogeneous media by C.E Sturm and
]. Liouville in the 1830s and 1840s. An interesting history is given in ]esper Llltzen's
article in the book Studies in the History of Mathematics, Volume 26 (1987), page 242,
published by the Mathematical Association of America.
This chapter is divided into three major parts. Part I treats heat diffusion in uniform
homogeneous rods and the corresponding theory of Fourier series. Part II treats heat
diffusion in the idealized, but important, problem of infinite rods. This leads to Fourier's
integral theorem. Part III treats heat diffusion in nonhomogeneous rods and Sturm and
Liouville's generalization of Fourier series.
474 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
where f(x) is a prescribed temperature distribution. It was shown in Section 7.2 that
there is at most one such function u(x, t). The problem is now to determine for which
f(x) (if any) the boundary value problem has a solution u(x, t) and to find a method for
calculating it. Of course, ifj(x) is the zero function then u(x, t) = 0 is the unique solution.
If f(x) I- 0 then it is not clear whether or not the problem is solvable. The solvability
problem will be approached here by Fourier's method of separation of variables. The
solvability will be shown by a construction of the solution.
Product Solutions. To begin, the initial condition (4) will be ignored, and the
solutions of conditions (1)-(3) of the special form
~u)~ ________________~)~
FIGURE 1. Heat diffusion in a finite rod; Dirichlet
x=o x=L boundary conditions
8.1 DIRICHLET BOUNDARY CONDITIONS AND FOURIER SINE SERIES 475
will be constructed. These are called product solutions. Substitution of (5) into the heat
equation (2) gives
Now, the left side of (7) is independent of x, and the right side is independent of t.
Hence the two sides must equal one and the same constant. Thus separate equations are
obtained for T(t) and X(x), namely
and
where A, called the separation constant, is an undetermined real number. If T(t) and
X(x) are any solutions of these equations then the product (5) is a solution of the heat
equation. The factors X and Y in (5) are linked by the separation constant which must
be the same for both.
Next consider the two boundary conditions
If T(t) is not the trivial solution of (8) then (10) holds only if x(a) = a and X(n) = a.
This leads to the following eigenvalue problem for A and X(x).
Dirichlet Eigenvalue Problem. Find the special values of A such that
has a nontrivial solution. This problem is analogous to the matrix eigenvalue problem of
Chapter 5. It will be shown to have infinitely many distinct eigenvalues and associated
eigenfunctions. The method of finding them is to write the general solution of (1),
containing A, and then to use the boundary conditions to find a nontrivial solution X(x).
The cases A < 0, A = 0 and A > 0 will be treated separately.
476 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
Case 1 (). < 0). Write)' = -IL 2 (IL > 0) Then (11) becomes X" - IL 2 X = °with
general solution
(l3)
where (] and (2 are arbitrary constants. The two boundary conditions give a pair oflinear
equations for (1 and (2:
(14) (1 + (2 = 0,
and A = -IL 2 is an eigenvalue if and only if (14) has a nontrivial solution «(], (2) =I- (0,0).
This is possible if and only if system (14) has a zero determinant:
1 1
(15)
But this is impossible for IL > 0, because e- J1T[ < 1 < el"Jr. Hence the eigenvalue problem
has no solution for A < 0.
Case 2 (A = 0). Proceeding similarly, (11) becomes X" = 0 with general solution
X(x) = (1 + C2X. Hence X(O) = Cl, X(n) = Cl + C2n, and A = 0 is an eigenvalue if and
only if
C] + n(2 = 0,
°
with determinant n =I- O. Thus A = is not an eigenvalue of problem (1)-(12).
°
Case 3 (A> 0). Write A = -IL 2 (IL > 0), so (11) becomes X" + IL 2X = with general
solution
(17) (] = 0,
with determinant
(18)
8.1 DIRICHLET BOUNDARY CONDITIONS AND FOURIER SINE SERIES 477
(19)
L b sin(nx)e-
N
(22) u(x, t) = n n2t
n=]
also satisfy (1)-(3). This function is a solution of the boundary value problem, provided
that j(x) is the linear combination of the eigenfunctions sin(nx)
N
(23) j(x) = I)n sin(nx).
n=]
L b sin(nx),
00
L b sin(nx)e-
00
Constructing solutions from the product solutions in this way is called superposition.
478 8 FOURIER ANALYSIS AND STURM-LIoUVILLE THEORY
I~Exercises 8.1 f:
Heat Diffusion and Graphs. Solve the linear heat diffusion problem and graph the solution surface
z = u(x, t) for 0::: x ::: Jr, 0 ::: t ::: 2.
1. j(x) = sin(2x)
2. j(x) = 2 sin(2x) - sin(x).
3. j(x) = sin(2x)cos(2x). Hint: 2sinecose = sin2e.
4. j(x) = sin(4x). Describe geometrically the difference between this solution surface and the solution
surface forj(x) = sin(2x), on 0::: x::: Jr, 0::: t ::: 2.
Heat Diffusion and Proofs. Solve the linear heat diffusion problem and verify directly that the formula
obtained for u(x, t) is a solution of the problem.
5. j(x) = sin(3x).
6. j(x) = 2 sin(3x) + sin(2x).
7. j(x) = sin(2x)(l + 2 cos(2x»
8. (Trigonometry) Establish the trigonometric identity 2 sin (a) sin(b) = costa - b) - costa + b), assuming
the sum identity cos(x + y) = cos x cos y - sin x sin y.
9. (Integration) Show that for m =1= n,
10. (Calculus) Derive If: sin2 (nx) dx = n12, for all integers n > O.
11. (Calculus) Derive If: cos 2 (nx) dx = Jr/2, [or all integers n > 0
12. (Integration Theory) Explain from basic rules of integration why
1b(N];fnC ) x ) dx = ] ;
N
1b fnCx)dx.
U(X, t) = L b sin(nx)e- n
N
n
2
t
n=l
satisfies the homogeneous boundary conditions u(O, t) = 0 = urn, I).
15. (Initial Heat Distribution) Let J be a finite sum of trigonometric functions,
N
j(x) = L bn sin(nx).
n=l
8.2 ORTHOGONALITY ANO FOURIER COEFFICIENTS 479
17. (Dirichlet Conditions) Show that the series L~=l Xn(x)Tn(t) satisfies the homogeneous bound-
ary conditions, provided that each product term Xn (x)Tn (t) satisfies the homogeneous boundary
conditions.
L b sin(nx),
00
Fourier's surprising discovery is that essentially any function can be so represented. The
key to the representation is the observation that the eigenfunctions sin(nx), n = 1,2,3,
... , satisfy
Check this by integration. We shall refer to this property by saying that the functions
{sin(nx)} are orthogonal on the interval 0 S x S :rr.
The orthogonality relation (2) and the integral identity
To verify this we multiply (1) by sin(mx), where m is any positive integer, and integrate
over 0 S x S :rr. The result is
(5) 1"o
j(x) sin(mx)dx = L b 1" sin(mx) sin(nx)dx.
00
n=1
n
0
Moreover, in the sum on the right of (5), all but one of the integrals are zero by the
orthogonality property: The single nonzero term occurs when n = m. It is equal to
(:rr/2)b m by (3). Thus (5) implies (4). The numbers bn defined by (4) are called the
Fourier sine series coefficients ofj(x).
480 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
l~xerci~~s8.2 ~
Fourier Sine Series. Compute the Fourier sine series coefficients bn for the given function j(x) on [0, n].
1. j(x) = 2 sin(2x) + sin(3x)(l + cos(3x)).
2. j(x) = x.
3. j(x) = e- X
L b sin(nx)
00
(1) f(x) = n
n=l
(2) bn = -
n
21"
0
f(x) sin(nx)dx, n = 1,2,3, ....
When is the converse true? That is, if j(x) is given and if the coefficients bn are defined
by (2), then does (1) hold? Dirichlet showed that this is true when j(x) and rex) are
sectionally continuous on 0 :S x :S n as described by the following definition.
Sectionally Continuous Functions and Derivatives. A function f(x) defined for
a :S x :S b is said to be sectionally continuous if there is a partition of a :S x :S b into
finitely many subintervals,
The values j(a) ,f(XI), ... ,f(Xm-l) ,f(b) may be assigned in any way. If in addition
then rex) is said to be sectionally continuous. Note that the definition prescribes a
procedure for constructing these functions. One need only select the partition, the
8.3 CONVERGENCE OF FOURIER SINE SERIES 481
a
FIGURE 2. A sectionally continuous function
functions.li Ci = 1,2, ... , m), and the values fCa),fCxl), ... ,f(xm~l),fCb). An example
is shown in Figure 2.
Right-Hand and Left-Hand Limits and Derivatives. The right-hand limit off
at any point x is defined to be the limit of fCt) as t approaches x from the right. It is
denoted by f(x + 0). Thus
For functions f with right-hand and left-hand limits, the right-hand and left-hand
derivatives off at a point x, denoted here by f~ ex) and f~ ex) respectively, are defined by
and
and the numbers f(Xi - O),f(Xi), andf(xi + 0) may be all distinct. Iff' is also sectionally
continuous on a::: x::: b, thenf has right-hand and left-hand derivatives at every point
of a < x < b, plus a right-hand derivative at x = a and a left-hand derivative at x = b.
Moreover, if x is not a partition point then l' (x) exists and
f~(Xi) = f(+l(Xi)'
With these preliminaries, Dirichlets convergence theorem for Fourier sine series may be
formulated as follows.
Convergence Theorem for Fourier Sine Series. Let f and l' be sectionally
continuous on 0 ::: x ::: Jr and let
Then the Fourier sine series off converges for every x and it has the sum
00 1
(16) I)n sin(nx) = "2 [r(x + 0) + f(x - 0)] for 0 < x< Jr.
n=l
L b sin(nx) = f(x)
00
Recall that the last equation holds automatically except at the finitely many points where
f(x) is discontinuous. For such f(x) the problem is solved by the following two steps.
8.3 CONVERGENCE OF FOURIER SINE SERIES 483
First, compute the Fourier sine series coefficients bn defined by (15) and write
00
L b sin(nx)e-
00
n=1
Note that after f(x) is expanded as a sine series (19), then u(x, t) is obtained by multiplying
each term bn sin(nx) by the factor e- n2t .
Examples. This section is concluded with four examples that illustrate the theory
developed above.
EXAMPLE 1. (Constant Initial Temperature). Letf(x) = 1 for 0 < x < Jr. Then
f(x) = ~
Jr
f
n=O
sin(2n + l)x ,
2n +1
0 < x < Jr.
EXAMPLE 2. (Constant Temperature Gradient). Take f(x) = x for 0 < x < rr, so
l' (x) = 1 is a constant, and f(O) = f(rr) = O. Then f and l' are sectionally continuous and
bn = -
rr
2i 0
Jf
x sin(nx)d.x =
2(_l)n-l
n
, n=1,2,3, ....
~=2(1-~+~-~+"-)
as before.
2i
bn = -
rr 0
Jf
8 sin(nrr/2),
f(x) sin(nx)d.x = 22
n rr
n=1,2,3, ....
y (nl2,1)
1= ~
n2
(1 + ~ + ~ + ~ + ... )
32 52 72
.
The graph of j(x) is a parabola through the points (0,0) and (n,O) with vertex at the
point (nI2,n 2/4). Integration gives the Fourier coefficients
bn =-
n
21 0
1C
x(n - x) sin(nx)dx = -43 (1 -
nn
(-It) .
Thus b2n = 0, b2n+1 = (8In)/(2n + 1)3 and the convergence theorem gives
x(n - x) = ~ (Sin(X) + sin(3x) + sin(5x) + ... )
n 13 33 53 '
°: : x :::: n.
Exercises 8.3
Linear Heat Diffusion. Solve the linear heat diffusion problem for the given initial heat distribution f and
Dirichlet boundary conditions. Display at least the first five terms of the series solution.
1. f(x) = 3.
2. f(x) = sin (x) - 2 sin(3x).
3. f(x) = 1 +x.
4. f(x) = sin(3x) cos(3x).
5. j(x) = x(rr - x)2.
x2 o :'S x :'S rr/2,
6. j(x) = { (rr _ x)2
rr/2 :'S x :'S rr.
Graphs of Heat Diffusion Solutions. Solve the linear heat diffusion problem with Dirichlet boundary
conditions for the given initial temperature distributionj. Graph the temperature surface on 0 :'S x :'S rr,
o :'S t :'S 2. For infinite series, graph the first three terms of the series (n = 1,2,3).
7. f(x) = 2.
8. j(x) = sin(3x).
Snapshots of Heat Diffusion Solutions. Solve the linear heat diffusion problem with Dirichlet boundary
conditions for the given initial temperature distribution f. Graph the snapshot u(x, t) on 0 :S x :s :rr for
t = 0, t = 0.05, t = 0.1. For infinite series, graph the first three terms (n = 1,2,3).
10. f(x) = sine 4x)
11. f(x) = x 2 (n - x)
I 0 :s x :s :rr12,
12. j(x) = { 0
:rr12 :s x :S n.
Numerical Series Identities. Compute the Fourier sine series for the given function f. Substitute a value
of x in 0 :S x :S :rr into the sine series and apply Dirichlet's convergence theorem to obtain the given numerical
series identity
3n 3 J2 _
14. pCx) -- x(:rr - x), ---r28 - 1 + J3 - 53 - f3
1 1 1
+ 931 + j'j3
1
- .. '.
where j(x) is a prescribed function. From Section 7.2 we know that the problem has
at most one solution. The solution will be constructed by the separation of variables
methods of Section 8.1. Only the boundary condition (3) is different. Hence, only the
main steps are given. Details are left for the Exercises.
The product solutions of (1)-(3) satisfy the same differential equations as before,
(8)-(9) of Section 8.1. The new boundary conditions (3) for a product solution u(x, t) =
X(x)T(t) are
For nontrivial solutions, X(x) must satisfy X/CO) = 0, X/(n) = O. This leads to a new
eigenvalue problem.
Neumann Eigenvalue Problem. Find the values of A such that
(8)
The details are left for the Exercises. Note that this time AO = 0 is an eigenvalue with
eigenfunction Xo(x) = 1 (a constant).
Superposition. The product solutions are
(9)
(12) l 1C
cos(mx) cos(nx)dx =0 for m =1= n,
Multiplying (11) by cos(mx) and integrating over 0 ::s x ::s 7r gives the coefficients
(15) ao = -liT( J(x)dx,
7r 0
(16)
2iT( J(x) cos(nx)dx,
an = - n = 1,2,3, ....
7r 0
The series in (11) with these coefficients is called the Fourier cosine series for J(x) on
o ::s x ::s 7r. The convergence theorem for the cosine series is analogous to that for the
sine series.
Convergence Theorem for Fourier Cosine Series. Let J(x) and J' (x) be section-
ally continuous on 0 ::s x ::s 7r and let the Fourier cosine series coefficients of J(x) be
defined by (15)-(16). Then the Fourier cosine series ofJ(x) converges for every x and it
has the sum
n=O
Examples. The following five examples illustrate the theory.
EXAMPLE 1. (Constant Initial Temperature). LetJex) = 1 for 0 < x < 7r. Then ao = 1
and
an = -
2iT( cos(nx)dx = 0 for n:::: 1.
7r 0
8.4 NEUMANN BOUNDARY CONDITIONS AND FOURIER COSINE SERIES 489
Thus f(x) = 1 = ao and the series has only one nonzero term.
EXAMPLE 2. (Constant Initial Gradient). Define f(x) = x for 0 < x < n so l' (x) = 1.
Then f and l' are continuous on 0 :::; x :::; nand
1 (" x21ll'
ao = -; 10 x dx = 2n 0 2'
_21" _
while for n :::: 1,
2(cos(nn) - 1)
an - - x cos(nx)dx - 2 .
n 0 nn
Hence a2 n = 0 and az n+! = -( 4In)/(2n + 1)2 Thus the convergence theorem gives
(21) x= ~ - .!.
(COS(x) + cos(3x) + cos(5x) + ... ) 0 :::; x :::; n.
2 n 12 32 52 '
ao = 2:. ("/2 dx = ~
n 10 2
and
for 0 < x < nl2 and nl2 < x < n. At the point x = nl2 one has
and cos ((2n + l)nI2) = O. Hence the convergence theorem (17) is verified at this point.
Then
ao =~
n 10r f(x)dx = ~n (::)
2
= ~.
2
Moreover, integration gives, after some work,
21][
an = -
4
f(x) cos(nx)dx = 22 [2 cos(nn/2) - cos(nn) - 1]
non n
for n = 1,2,3, .... This implies that a2n+l = 0, while
4
a2n = n 2(2n)2 [2 cos(nn) - 2] ,
so
-16 -4
'4n+2 = n2(4n + 2)2 = n 2(2n + 1)2 .
Since f(x) is continuous, and l' (x) is sectionally continuous, the convergence theorem
gives
EXAMPLE 5.The functionf(x) = sin(x), 0 ::: x ::: n, is its own Fourier sine series (so
bl = 1and all other bn = 0). Moreover,f(x) = sin (x) and1'(x) = cos(x) are continuous
and hence sin(x) also has a Fourier cosine series on 0 ::: x ::: n. Integration gives
ao = -1 17r sin(x)dx = -,
2
non
8.4 NEUMANN BOUNDARY CONDITIONS AND FOURIER COSINE SERIES 491
It is clear that this equation fails for x < 0 since sin (x) is odd and cos(2nx) is even. It is
not difficult to see that the sum of the above series is I sin (x) I for all x.
I Exercises 8.4 ~
Linear Heat Diffusion. Solve the linear heat diffusion problem for the given initial heat distribution j and
Neumann boundary conditions. Display at least the first five terms of the series solution.
1. j(x) = -1.
2. j(x) = cos (x) + 3 cos(2x)
3. j(x) = 1 +x
4. j(x) = cos(3x) cos(2x).
Hint: cos(a - b) - cos(a + b) = 2 cos(a) cos(b).
5. j(x) = x(n - x)2.
x2 o ~ x ~ nil.
6. j(x) = { (n _ x)2
nl2 ~ x ~ n.
Graphs of Heat Diffusion Solutions. Solve the linear heat diffusion problem with Neumann boundary
conditions for the given initial temperature distributionj. Graph the temperature surface on 0 ~ x ~ n,
o ~ t ~ 2. For infinite senes, graph the first three terms (n = 0, 1,2).
7. j(x) = 3.
8. j(x) = cos(5x).
o~ x
9. j(x) = {~l ~ n12,
nl2 ~ x ~ n.
Snapshots of Heat Diffusion Solutions. Solve the linear heat diffusion problem with Neumann boundary
conditions for the given initial temperature distribution j. Graph the snapshot u(x, t) on 0 ~ x ~ n for
t-values t = 0, t = 0.05 and t = 0.1.
10. j(x) = cos(3x).
11. jex) = x 2 (n - x)
l 0~x~nI2,
12. j(x) = { 0
nl2 ~ x ~ n.
492 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
Numerical Series Identities. Compute the Fourier cosine series for the given function f. Substitute a
value of x in 0 :::: x :::: 7f into the cosine series and apply Dirichlet's convergence theorem, to obtain the given
numerical series identity
x and x + 2n define the same point on the ring. Hence, if x is unrestricted, then
(1) u(x + 2n, t) = u(x, t), for all x and all t ::: 0.
The same is true of the partial derivatives of u(x, t). Condition (1) states that u(x, t) is a
periodic function of x with period 2n. In particular,
These conditions are called periodic boundary conditions. We shall show that the
temperature distribution in the ring is determined by the boundary value problem of
finding a function u(x, t) that satisfies
where j(x) is a prescribed initial temperature distribution. This problem has at most one
solution. This uniqueness theorem is readily verified by the method of Section 7.2. The
proof is left for the Exercises. Here the separation of variables method will be used to
construct the solution.
The product solutions of (3)-(5) satisfy the same differential equations as before.
Only the boundary conditions (5) are different. Thus T(t) = c- At and X" + AX = 0
where A is the separation constant. To determine the values of A and X(x) we have a new
eigenvalue problem, which is formulated as follows.
Periodic Eigenvalue Problem. Find the values of A such that
where C), C2 are arbitrary constants. For an eigenvalue we must choose fI and (c), C2) =f:.
(0,0) such that (9) satisfies the boundary conditions (8). Substituting (9) into (8) gives
494 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
a pair of linear homogeneous equations for (Cl, C2) that may be written
There is a solution (Cl, C2) =1= (0,0) if and only if the system has determinant zero. But
the determinant is
(11)
(e 21rfL - 1) (e- 2JrfL - 1)
(e l1rfL - 1) - (e- 21rfL - 1)
°
which is never zero for p., > (since e21rfL > 1 for p., > 0). Hence, there are no negative
eigenvalues for problem (7)-(8).
Case 2 (A = 0). Here X(x) = Cl +C2X, and the boundary conditions are X(21l') - X(O) =
21l'C2 = 0, X' (21l') - X' (0) = C2 - C2 = 0 with one nontrivial solution (Cl, C2) = (1,0).
This gives the pair
(12) A = Ao = 0,
X(x) = Xo(x) = l.
The boundary conditions (8) give, after simplification, the linear homogeneous equations
sin(21l'p.,) (cos(21l'p.,) - 1)
(15) = 2[cos(2rrp.,) - 1].
(cos(21l'p.,) - 1) - sin(21l'p.,)
Thus the eigenvalues A = p., 2 satisfy cos(21l' p.,) = l. The positive roots of this equation
are the positive integers p., = n (n = 1,2,3, ... ). Moreover, for p., = n the matrix of
system (n) has rank 0 (all four entries are zero), and hence there are no restrictions on
Cl and C2. This means that the solution space of (7)-(8) for the eigenvalue A = n2 is
two-dimensional. The eigenfunctions are sin(nx), cos(nx) and all their nontrivial linear
combinations. The positive eigenvalues, and their eigenfunctions, are
(16)
8.5 PERIODIC BOUNDARY CONDITIONS AND THE COMPLETE FOURIER SERIES 495
where an and bn are arbitrary constants not both zero. The associated factors TCt) = Tn (t)
satisfy T~ = -n 2 T" and hence Tn(t) = e- n2t A complete list of product solutions may
be written
(17)
°
for n = 0, 1,2, .... For n = this reduces to the constant function uo(x, t) = ao, while
for each n ~ 1 the product solution contains two parameters, an and bn.
Superposition. Addition of product solutions gives solutions of the form
(18) u(x, t) = ao + n
n2t •
n=l
This provides a solution of the Fourier ring problem (3)-(6) if the initial values j(x) satisfy
= j(x) = ao + L
00
°: :
°: :
of (7)-(8) is an orthogonal set on the interval x :::: 2](. This means that the product
of any two distinct members of this set integrates to zero over x :::: 2](. Explicitly,
All of these integrals are elementary Moreover, the integrals of the squares of members
of the set (20) are
(23) ao = -1
2n
1 0
2Jr
j(x)dx,
an = -1 1 2Jr
f(x) cos(nx)dx,
1
n 0
2Jr
bn = -1 f(x) sin(nx)dx, n=1,2,3, ....
n 0
The series in (19), with coefficients an, bn defined by (23), is called the complete Fourier
series for f(x) on 0 :s x :s 2n. The following analogue of the convergence theorem for
the sine and cosine series is proved in Section 8.6 below.
Convergence Theorem for Complete Fourier Series. Let f(x) and j'(x) be
sectionally continuous on 0 :s x :s 2n and let the Fourier coefficients off be defined by
(23) above. Then the complete Fourier series off converges for every x and has the sum
(24) ao + 0)]
"=1
The solution is given by (18), where the an and bn are the Fourier coefficients off, defined
by (13)
Examples. The following two examples illustrate the theory. Additional examples are
given in Section 8.6 and in Chapter 9.
EXAMPLE 1. Define
X for o :s x :s n,
f(x) = [
0 for n < x :s 2n.
8.5 PERIODIC BOUNDARY CONDITIONS AND THE COMPLETE FOURIER SERIES 497
Then J and l' are sectionally continuous and j(x) is continuous except at x = re. The
Fourier coefficients (23) are
ao = -
1 12;[ J(x)dx = -
1 121!" x dx = re
-
2re 0 2re 0 4
and, for n ::: 1,
an = -
re
11 0
2 ][
J(x) cos(nx)dx =
(-l)n-l
ren
2
and
1 121!" (_1)n+1
bn =- J(x) sin(nx)dx = ,
re 0 n
by elementary integration. Details are left to the Exercises. The complete Fourier series
can be written. It is
~ + (Sin(X) _ sin(2x) + sin(3x) _ sin(4x) ... )
4 1 2 3 4
2 (cosex) cos(3x) cos(5x) cos(7x) )
- -; -1-2- + 32 + 52 + 72 + . .. .
The convergence theorem implies that the sum of this series is J(x) for all 0 ::s x ::s 2re
except possibly at x = re. For example, taking x = 0 gives the equation
o = ~4 - 3.re (~
12
+~
32
+~
52
+~
72
+ ...) '
or
re 2 1 1 1 1
8 = 12 + 32 + 52 + 72 + ... ,
as shown above in Section 8.3 and Section 8.4. For x = re one has! [J(re+O)+J(re-O) 1=
rel2 and hence the convergence theorem gives
which leads back to the same equation. As a third special case take x = re12. Then
J(rel2) = rel2, and all the cosine terms are zero. Hence the convergence theorem gives
~ = ~ + (~- ~ + ~ - ~ + .. -),
as was shown above in (5) of Section 8.3.
498 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
EXAMPLE 2. Define
Clearly,f is continuous and J' is sectionally continuous, so that the convergence theorem
applies. Elementary integration gives
ao = 1
-
lJ( sin(x)dx = -,1
2n n
I1J(
0
and
an
1
= -no
lJ( sin(x) cos(nx)dx
1 + (_1)n
= nI-n
( 2) for n::: 2.
Similarly
bn = -
1 lJ( sin(x) sin(nx)dx,
n 0
so
1
bi = 2' bn =0 for n::: 2.
1 1. 2 ~ cos(2nx)
f(x) = - + - sm(x) - - ~ .
n 2 n n=1 4n 2 - 1
Making x ~ 0+ gives
1 2 1
L 4n2 _ 1 '
00
o= ; - ;
n=l
or
1 00 1
2 = ~ (2n - 1) (2n + 1)
I I I
=-+-+-+
1·3 3·5 5·7
....
8.5 PERIODIC BOUNDARY CONDITIONS AND THE COMPLETE FOURIER SERIES 499
This last equation is often proved in the first year calculus course. The choice x = nl2
leads to
1 1 2 (_l)n
L 4n
00
1 = -; + 1 - -; 2 - 1'
n=l
or
Complex Form of the Complete Fourier Series. The convergence theorem (24)
implies that ifj(x) satisfies (24) thenj(x) has the representation
where the coefficients an and bn are defined by (23). The expansion (26) takes a much
Simpler form if we use Euler's formula to express sin(nx) and cos(nx) in terms of the
complex trigonometric functions e±inx. Indeed,
1 ( 'nx
cos(nx) =1 e. + e- lnx
. )
,
1 ( 'nx
sin(nx) = 1 e. - e- lnx
. )
,
and so
where
1
(27) Co = lao,
1
Cn = 1 (an - ibn) ,
en = C_ n ,
This formula is called the complex Fourier series for j(x). Notice that by (23),
(29) en = 2~ 1 2Jr
j(y)e- iny dy, n=O,±1,±2, ....
1 2Jr
eimy einy dy = 1 2Jr
ei(m-n)y dy = 27l'omn,
where omm = 1 and omn = ° if m =I n.
IExercises 8.5 ~
JLV"S ~"b~" dt:Lii£i%bdi ¥iJih%'"
verify by direct integration of the Fourier coefficient formulas for an, bn that the complete Fourier series
off is
verify by direct integration of the Fourier coefficient formulas for bn , an that the complete Fourier series
off is
Convergence Theorem. Apply the Fourier convergence theorem to the functions f of the two preceding
exercises in order to verify the following results.
1 1 1 1
1" + 3I + 52 + 7f + ....
Jr2
3. 8' =
4. ~ = f - ~ + ! - t + " ..
1 ,",00 1
S. '2 = L-n=l (2n-l)(2n+l)'
where j(x) is a prescribed initial temperature distribution. Prove that this problem has at most one solution.
This procedure generates the unique p-periodic function f whose restriction to 0 :::: x < p
is fa. Conversely, any p-periodic function can be generated in this way from its values on
o :::: x < p or any other interval of length p.
502 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
(3) l a p
+ f(x)dx = i P
f(x)dx for all real a.
lea) = l
a
a+p
f(x)dx,
then
for all a by the periodicity property (1). Thus lea) = leO) (a constant) for all a.
Now letfo be defined for 0::: x < 2n and letf be its 2n-periodic extension. Suppose
that
= ao + L
00
Then
+L
00
This follows because both f(x) and the sum of the Fourier series are 2n-periodic
extensions offo(x), and we have seen that such an extension is unique.
The coefficients an and bn above are the Fourier coefficients of fo on 0 ::: x < 2n.
Since fo(x) = f(x) on this interval, we may write
ao = -
1 l2Jr f(x)dx,
2n 0
an = -
1 l2Jr fCx) cos(nx)dx, n ::: 1,
n 0
bn = -
1 l2Jr fCx) sin(nx)dx, n ::: 1.
n 0
Equation (3) with P = 2n implies that we may replace the integration interval [0,2nl
with any interval of length 2n. This freedom to change the interval will be used below.
Frequently, a convenient choice is the symmetric interval [-n, n I:
(5) ao = -
1 fJr f(x)dx,
2n -Jr
8.6 PROOFS OF THE CONVERGENCE THEOREMS 503
1 f11:
an = - JCx) cos(nx)dx, n ::: 1,
-11:
:rr:
The convergence theorem of Section 8.5 will be proved by showing that iff and l' are
sectionally continuous then
for all x. The first step is to replace ak and h in (6) by the integrals of (5) and simplify.
This gives
ak cos(kx) + bk sin(kx)
= (~ l>(U) COS(kU)dU) cos(kx) + (~ l>(U) Sin(kU)dU) sin(kx)
1 f11:
= - JCu) (cos(ku) cos(kx) + sin(ku) sin(kx» du
:rr: -11:
(8) 1
Sn(x) = - f11: feu)
:rr: -11:
(1 + L
-
2
n
k=l
cos k(u - x) ) duo
A Trigonometric Formula. The next step is to evaluate the sum of cosines in (8).
The result is
1 ~ sin(n + ~)t
(9) - + ~ cos(kt) = . I ' n=1,2,3, ....
2 k=l 2sm(2:t)
To verify this formula we recall the identity
+ (sin(~t) - sin(N)
+ ... + (sin(n + ~)t - sin(n - ~)t)
= sin(n + ~)t - sin(~t),
since all but the last two terms cancel (a telescoping sum). Dividing by 2sin(~t) and
adding 1/2 gives (9).
_ 1
Sn(x) - -
lr
l Jr
-Jr
J(u)
sin(n + ~)(u - x)
. 1
2sm2:(u-x)
duo
Sn (x) = -
1 l Jr - x
J(t + x)
sin(n
.
+ !)t
12 dt.
lr -Jr-X 2 sm( 2:t)
The integrand in this integral is periodic in t with period 2lr (check this). Hence property
(3) gives
To find it note that integration of (9) over [-lr, Oland [0, lr 1 gives
2
r
~ = .. !:. sin(n + ~ )t dt
lri-Jr 2sin(~t) ,
If we multiply these equations by J(x + 0) and J(x - 0), respectively, and subtract from
(10) then we get the representation
(11)
Sn(x)-~[J(x+O)+J(x-O)l = -
i
1 Jr J(x + t) - J(x + 0) . ( 1) d
sm n + - t t
lr a
+ -1
lr
1
2sin(~t)
0
-Jr
J(x
2
+ t) .-
2 sm(2:t)
J(x - 0) . (
1
d
sm n + 2: t t.
1)
8.6 PROOFS OF THE CONVERGENCE THEOREMS 505
The convergence theorem will now be verified by showing that the last two integrals
have limit 0 as n -+ 00. The proof will be based on a property of the Fourier coefficients
called Bessel's inequality:
Bessel's Inequality. Letfex) be sectionally continuous on -1T :::; X :::; 1T, and let an,
bn be its Fourier coefficients, defined by (5). Then
Proof. This result is called Bessel's inequality. To prove it, let Sn (x) be defined by (6),
as before, and consider the equality
Next, the definition of ak and bk and orthogonality esee (21)-(22), Section 8.5) imply that
Thus the partial sums of the infinite series in (12) form an increasing sequence with finite
upper bound. It follows that the series converges and satisfies Bessel's inequality (12).
The nth term in a convergent series must tend to zero as n -+ 00. Thus Bessel's
inequality implies that a~ + b~ -+ 0 as n -+ 00. This result is called Riemann's theorem.
It can be formulated as follows.
Riemann's Theorem. If fex) is sectionally continuous on -1T :::; X :::; 1T then
i: i:
The integrals in (11) have the form
+ i:
Combining this with Riemann's theorem gives the following result.
(g(t) cos(!t)) sin(nt)dt.
(17) lim
n-+-oo
11< get) sin(n + !)t dt = O.
-Jr
(18) Sn (x) - ! [J(x + 0) + J(x - 0)] = - 111< g(x, t) sin(n + !)t dt,
7r -1<
I
where g(x, t) is defined for all x and for 0 < t ::: 7r by
(20) ao + 0)].
n=l
Proof. For each fixed x we apply Riemanns theorem, in the form (17), to the integral
in (18). The hypothesis that J is sectionally continuous implies that g(x, t), as a function
of t, is sectionally continuous except possibly at t = o. the hypothesis thatJ andf' are
sectionally continuous implies that
Jex + t) - Jex + 0) J(x + t) - Jex + 0) t
2sin(!t) 2sin(!t)
has the limit
lim Jex + t) - J(x + 0) =l' (x)
HO+ 2 sine It)
2
+
Hence if we define g(x, 0 ± 0) = f~ (x) then g(x, t) is in fact sectionally continuous for
-Jr :::: t :::: Jr and the proof is complete.
The convergence theorems for Fourier sine and cosine series of Section 8.3 and
Section 8.4 may be proved as corollaries of the convergence theorem of this section. The
proof makes use of the notion of even and odd functions.
Even and Odd Functions. Consider functions f(x) that are defined on a symmetric
interval -L :::: x :::: L We say that
Examples of even functions are cos(nx), Ixl and x2n. Examples of odd functions are
sin(nx), X 2n+ 1 and x cos(nx). Several simple but useful rules are listed next. The proofs
are left for the Exercises.
Rule 1. Linear combinations of even (respectively, odd) functions are also even
(respectively, odd)
Rule 2. A product of two even functions is even. A product of two odd functions is even.
A product of an even function and an odd function is odd.
Rule 3.
Similarly; every function f(x) on 0 :::: x :::: L that satisfies f(O) = 0 has a unique odd
extension h to -L :::: x :::: L, defined by
Next, let J(x) satisfy the conditions of the convergence theorem for periodic functions
and letJ be even. Then for L = rr,
(26) ao = -1
2rr
1][ J(x)dx = -1
rr
1" J(x)dx,
an = -1 J(x) cos(nx)dx = -
rr _IT rr 0
an 1
= -
rr
1][ _IT
J(x) cos(nx)dx = 0 for n = 1,2,3, ... ,
L b sin(nx).
00
(27) n
n=l
(28) bn = -
1 1][ J(x) sin(nx)dx = -
21][ J(x) sin(nx)dx.
rr _IT rr 0
series of (17), Section 8.4, and the coefficients an given by (26) are the same as in Section
8.4.
Other Periods. The preceding results are easily extended to functions f(x) having
any period p > O. It will be convenient to write p = 2L, so that
The trigonometric functions with this period are sin(nrrxlL) and cos(nrrxlL). Hence we
expect the Fourier expansion of f(x) to have the form
8 (an
00
(34)
1
2: [g(v + 0) + g( v - 0) I = ao + L (an cos (nv) + bnsin (nv)) ,
00
n=l
where
(35) ao = - 1
2rr
l Jr
g(v)dv,
l
-Jr
an = -1 Jr
g(v) cos(nv)dv, n ::: 1,
rr
l
-Jr
Jr
bn = -1 g(v) sin(nv)dv, n ::: 1.
rr -Jr
where
(37) ao = 2~ l>(X)dx,
an =
L
i
~ Lj(x) cos (n1l'X) dx,
L
n ~ 1,
i
-L
bn = ~L L
-L
j(x) sin C1l'X) dx,
L
n ~ l.
Exercises 8.6
1. (Periodicity) Letf(x + p) = f(x) for allx. Show thatf(x + np) = f(x) for all x and all integers n.
2. (Partial Sum Identity) Verify the identity Sn(x) - i [{(x + 0) + f(x - O)J = II + 12, where
h=-
1 fO f(x + t) - f(x - 0) . (
1
1) d
smn+2: t t.
7r -1C 2sin(2:t)
+ 0) + f(x -
f(x
2
0)
= ao + L (an) cos (lLIn
00
X) + bn sin (w,.x),
n=1
PART II FOURIER INTEGRALS 511
where
Fourier Convergence Theorem. The following exercises establish a special case of Fouriers theorem,
given below The main feature of this result is an explicit convergence rate at points where F is very smooth.
Theorem. Let F be piecewise continuous and 2Jr-periodic. Assume x given and define
If F is twice continuously differentiable at x then the partial sums Sn of the Fourier series of F
satisfy the following identities:
I fIT (
sn (x) = - Fx + u)sin «2n + l)ul2)d
u,
2JT _" sin (ul2)
sn(x) - F(x) = - I
2Jr _"
f" C(u) sin«2n + l)ul2)du,
=
(2n +1 l)Jr f"-IT
C' (u) cos«2n + 1)ul2)du.
sin«2n+l)ul2) I ~ (h
Dn ()
u == = - + L..J cos u).
2 sin (ul2) 2 k=l
13. Integrate Dirichlet's trigonometric identity over [-Jr, Jr] to obtain f~" Dn (u)du = Jr.
14. Assume that x is fixed and C(u) == [F(x+u) - F(x)]/ sin(ul2) is continuously differentiable on [-Jr, Jr]
Establish using integration by parts that
-
1 f" C(u) sin«2n + l)ul2)du =
~ J~" c' (u) cost (2n + 1 )ul2)du
.
2Jr -IT 2n + 1
M
15. Conclude from the preceding exercises that in the convergence formula F(x) = sn(x) + - - the
2n +1
L:
value of M is the integral
heat flow in a semi-infinite rod with Dirichlet boundary conditions at the free end.
The analogous problem with Neumann boundary conditions leads to the Fourier cosine
integral. The doubly infinite rod leads to the full Fourier integral with both sine and
cosine terms.
where j(x) is a prescribed initial temperature distribution. For this problem there are no
explicit boundary conditions. However, the following assumptions are made.
To interpret (5) let c be the specific heat, p the mass density and A the cross-sectional
area of the rod. Then, as in Section 7.2, (equation (6)),
is the heat content of the portion a S x S b of the rod. Hence, (5) means that the heat
content of the rod at time t = 0 is finite.
Separation of Variables. As before, the search for a solution begins with the
construction of all product solutions u(x, t) = X(x)T(t) that satisfy 0), (2) and (4).
Conditions (3) and (5) are left for the superposition step. The usual separation argument
shows that T/(t) = -AT(t) and XI/(x) = -AX(x), where A is the separation constant.
This suggests the following eigenvalue problem.
8.7 HEAT DIFFUSION IN AN INFINITE Roo 513
X(x) = A + Bx,
which is bounded if and only if B = O. Thus A = 0 is an eigenvalue with eigenfunction
X(x) = 1 (a nonzero function).
(8) A= fJ2,
(9)
Superposition. It is natural to seek a solution of the boundary value problem (1)-(5)
by integrating over the parameter fJ in (9). Thus we seek to determine functions A(fL)
and B(fL), fL 2: 0, such that the problem has the solution
(10) u(x, t) = E OO
[A(fJ) cos(fJx) + B(fL) sin(fLx)]e-112t dfJ.
If A(fL) and B(fL) are integrable over 0 :::: fL < 00 then (10) satisfies all conditions of the
problem except the initial condition u(x, 0) = f(x). This will be satisfied as well if f(x)
514 8 FOURIER ANALYSIS AND STURM-LIOUVillE THEORY
has a representation
(ll) j(x) = 1 00
[A(JL) cOS(JLX) + B(JL) sin(JLx)] dJL, -00 <x< 00.
When is this possible? Fourier suggested that most integrable functions j(x) can be so
represented and he gave a method to calculate A(JL) and B(JL) whenj(x) is known. A
rigorous theorem and proof were first given by Dirichlet. His theorem will be stated and
proved below.
(1) j(x) = 1 00
[A(JL) cOS(JLX) + B(JL) sin (JLx) ] dJL, -00 < x< 00.
lt is assumed that j(x) is sectionally continuous and If(x) I is integrable over -00 < x <
00. It follows that the integrals
are finite for all real values of v. They will be calculated by multiplying (1) by cos(vx) or
sin(vx), integrating over -N ::::: x ::::: N and then calculating the limit as N -+ 00. The
first step gives
1
i:
00
(3) Lj(X) cos(vx)dx = (A(JL) L: cos(JLx) cos (vx) dx) dJL
+ 1 00
(B(JL) sin (JLx) cos (vx) dx ) dJL.
Now, cos(vx) and sin(vx) are even and odd functions, respectively. Hence
j
-N
N cos(JLx) cos( vx)dx = 2 r
10
cos(JLx) cos( vx)dx
= sin(JL - v)N
JL-V
sin(JL + v)N
+--'---
JL+V
8.8 ORTHOGONALITY CALCULATION 515
and
I -N
N sin(j.tx) cos( vx)d.x = o.
Substituting these into (3) gives
I N
j(x) cos(vx)d.x =
100
0
A(j.t)
sin(j.t - v)N
j.t-V
dj.t
1
~
)sin(j.t + v)N d
+ 00 (
A j.t j.t.
o j.t+V
If we define A(j.t) = 0 for j.t < 0 then the last two integrals have the form
(4) 1 00
-00
(
A j.t
) sin N(j.t - x) d
j.t - x
j.t, x = ±j.t.
Limits of such integrals were first studied by Dirichlet. The following theorem will be
proved in Section 8.9 below.
i:
Dirichlet Limit Theorem. Let A(j.t) and A'(j.t) be sectionally continuous and
Then
(5) lim
N..... oo
1 00
-00
A(j.t)
sin N(j.t - x)
j.t - x
dj.t
n
= -2 [A(x + 0) + A(x - 0) J.
(6) lim
N-+oo
IN j(x) cos(vx)d.x = nA(v).
-N
(7) lim
N-+oo
IN j(x) sin(vx)d.x = nB(v).
-N
(8) A(v) =; 11 00
_ooj(x) cos(vx)d.x,
B(v) = - 1/
n
00
-00
j(x) sin(vx)d.x.
516 8 FOURIER ANALYSIS AND STURM-LIOUVILLE THEORY
(9) J(x) = 1 00
[A(/L) cos (/LX) + B(/L) sin (/Lx) ] d/L, -00 <x < 00,
then A and B must be given by (9). In the next section the converse problem is solved.
1 00
[A(IL) COS(ILX) + B(/L) sin(/Lx)] d/L, -00 < x< 00,
where A(/L) and B(/L) are defined by (8) in Section 8.8, is called the Fourier integral of
J(x). The main result of this section is the follOwing theorem.
Fourier Integral Theorem. LetJ(x) satisfy the conditions
(1) J(x) andf'(x) are sectionally continuous for -00 < x < 00,
(2) J(x) = ~ [J(x + 0) +J(x - 0)] for -00 <x < 00,
(4)
where
(5) B(/L) = - 11
:n: -00
00
J(x) sin(ILX)dx.
Proof of the Corollary. The integral representations (5) for A(JL) and B(JL) give
1 [CO
A(JL) cos(JLx) + B(JL) sin(JLx) = ~ Lco fW [cos(JL~) cos(JLx) + sin(JL~) sin(JLx)] ~
1 [CO
= ~ J_cof(~) COSJL(x - ~)~.
I Ixi < 1,
(7) f(x) = { 1/2 Ixi = 1,
o Ixi > 1.
See Figure 5. It satisfies conditions (1)-(3) (check this). Moreover,f(x) is even, and so
(9) f(x) =~
n
l
0
co
sin JL cos(JLX) dJL
JL
for - 00 <x< 00.
f x
FIGURE 5. Asquare wave on (-00,00)
518 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
defined by
(ll) Si(x) == io
x sinlL
--dlL.
IL
It is an odd function and it satisfies
7r
(12) Si(O) = 0, lim Si(x) =
X~±OO
±-,
2
by (10).
Equation (9) means that
(13)
where
(14) IN X
( ) =_ -2iN sin IL COS(ILX) dIL.
7r 0 IL
To prove (9) recall that
Thus
(16) IN(x) =-
1 iNCl+ X) sin(t)
--dt + -
1 iNO-X) --dt
sin(t)
7r 0 t 7r 0 t
1 1
=- Si(N(l + x)) + - Si(N(l - x)).
7r 7r
The existence of the limit as N ~ 00 is now apparent from (12). To verify (13) for x ~ 0
three cases arise, as follows.
17r 17r
O::;x<l: lim IN(x) = - - + -- = 1 = j(x),
N~oo 7r 2 7r 2
1 7r 1 1
x = 1: lim IN(x) = --
N~oo 7r 2
+ -·0
7r
= -
2
=j(x),
1 7r 1 -7r
x> 1: lim IN(x) = - - + - - = 0 = j(x).
N ..... oo 7r 2 7r 2
This completes the verification.
B.9 THE FOURIER INTEGRAL 519
It is seen that as N increases, there are rapid oscillations near the discontinuities of
f(x) at x = ±1. This behavior is called the Gibbs phenomenon. It can be understood
from the representation (16). The maximum and minimum of Si(x) are easily seen to be
Si(±n) ~ ±1.8519.
Thus
1
= -[Si(2N - n) + Si(n)]
n
~~ [i + Si(n)] = 1.0895
~~ [i - Si(n)] = -0.0895.
Thus the maximum and minimum of IN(x) overshoot those of f(x) by a fixed amount
and they occur at the points 1 ± nlN, which approach 1 as N --+ 00. The total overshoot
in dropping fromf(1 - 0) = 1 to fO + 0) = 0 (left- and right-hand limits) is
IN ( 1 - ~) - IN ( 1 + ~) ~ 1.1790,
which exceeds the jump fO - 0) - fO + 0) = 1 by 17.9 percent.
EXAMPLE 2. Define f(x) by
Then j satisfies the conditions of the convergence theorem. The Fourier coefficients of
j(x) are given by
A(fL) = -1
n
1 00
j(x) COS(fLX)dx = 1 0
00
e- X cos(fLx)dx,
1 1
-00
B(fL) = -1 00
j(x) sin(fLX)dx = 00
e-X sin(fLX)dx.
n -00 0
These integrals can be calculated from tables. In fact, they were worked out in Chapter 4
as the Laplace transforms
(17) .c{COS(fLt)} = 1 o
00
e- SX COS(fLX)dx =
s
-2--2'
S + fL
.c{sin(fLt)} = roo e-
10
sx sin(fLx)dx = A.
+S fL
Setting S = 1 gives the results
B( ) _ _ fL_
fL - 1 fL2 +
and the convergence theorem gives
1 o
00 COS(fLX) + fL sin (fLX) d
1 + fL2
fL =
£()
J X =
I ne-
n12
°
x
x > 0,
x = 0,
X < 0.
EXAMPLE 3. Define j(x) = e- alxl for all x and a > 0. Then j is even, so B(fL) °
== and
A(fL) =- 21
e- ax COS(fLX)dx = - 2 a 2
00
n o n a +fL
2
by (17) with s = a. Hence the convergence theorem gives
1 !:..
00 a COS(fL
2
o n a +fL
X) d = £( ) = -alxl
2 fL JX e , -00 < x< 00.
l~-ax
for x> 0,
j(X) = for x =0,
_eax for x < 0,
8.9 THE FOURIER INTEGRAL 521
where a > O. Then f satisfies the conditions of the convergence theorem and f is odd.
Hence A(/I) = a and
B(/I) = -21
IT 0
00
e- ax sin(/Ix)dx = 2
- 2 /I 2
ITa +/I
by (17) with 5 = a. Hence
EXAMPLE 5. Define f(x) = e- Ixl cos(x) for all x. Then f satisfies the conditions of the
convergence theorem and f is even. Hence B(/I) = a and
A(/I) = - 21°C e- x cos(x) cos(/Ix)dx.
IT 0
The identity
implies that
A(/I) = - 11
IT 0
00
e- X [cos(x + /IX) + cos(x - /IX)] dx.
L (cosO ± fL)t) = 1o
00
e- st cosO ± /I)tdt =
5
2
5
+ (1 ± fL)
2'
The result is
1 1 1 1
A(fL) = - + -----
IT 1 + (l + /I)2 IT 1 + (l - /I)2 '
or, after some algebraiC manipulation,
1
o
00
--4--
IT/I
+
2 /I 2 2
+4
cos(/Ix)d/I = f(x) = e- Ixl COS(X), -00 < X < 00.
522 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
Then f satisfies the conditions of the convergence theorem and f is odd. Hence A(JL) = 0
and
B(JL) = - 21
rr 0
00
e- X cos (x) sin(JLx)dx.
The identity
gives
B(JL) =-11
rr 0
00
e- X [sin(JLx + x) + sin(JLx - x) 1dx.
.c (sin(JLt ± t» = 1 o
00
e- st sin(JL ± l)t dt = 2
s+JL±l
JL±l
( )2
gives
1 JL+1 1 JL-1
B(JL) = - +- ,
rr 1 + eJL + 1)2 rr 1 + eJL - 1)2
or, after algebraic simplification,
Finally,
1
e-X cos(x) x> 0,
2 JL3 {
00
- - 4- sin(JLx)dr = fex) = o x =0,
o rrJL +4
-r cosCx) x< o.
A(p,) = - 21 00
JT 0
e- ax 1 cos(p,x)dx.
u = sin(rx), du = p, cos(p,x)dx,
1 _ax"
V= -e
2a '
gives
~A'(r) = (
sin(p,x)e-ax2 00) - 100 (e-
--
ax2 )
p, cos(p,x)dx
2 2a 0 2a
1
o
- -p,
00 e- ax 2 cos(p,x)dx
2a 0
JTp,
- -A(p,).
4a
Thus
so
Now,
A(O) = -
JT
2100 0
e- ax2 dx,
lim -
M-+oo 11:
11 l°O0
M
-00
fm cOS(f.,LX - f.,L~)d~df.,L = f(x).
Moreover,
i:
00
11: -00
= ~ fm (1 M
cOS(jLX - f.,L~)df.,L) ~
= .!.1°O f(~) sin(Mx - M~) ~
11
11: -00 X - ~
=-
00
)'x+t
"( )sin(Mt)dt.
11: -00 t
Thus to prove the convergence theorem it will suffice to prove that
(I8) lim -
1
M-+oo 11:
1 -00
00
f(x + t) sin(Mt)
t
dt = f(x) for -oo<x<oo
whenever f satisfies (1)-(3). Moreover, this relation is equivalent to the Dirichlet limit
theorem of Section 8.8, differing only in the letters used. Hence both theorems will be
proved when (18) is verified.
The proof of (18) given below is based on the Riemann-Lebesgue theorem. This
extension of the Riemann theorem of Section 8.6 can be formulated as follows.
i:
Riemann-Lebesgue Theorem. Let g(x) satisfy
Then
(20) lim
M->oo
1 00
-00
g(x) sin(Mx)dx = M-+oo
lim 1 00
-00
g(x) cos(Mx)dx = o.
8.9 THE FOURIER INTEGRAL 525
A proof of this theorem is outlined below, at the end of this section. First it will be
used to prove the convergence theorem.
To prove (18) it will be convenient to write
(21)
1
-
/00 f(x + t) sin(Mt) dt - 1
- [f(x + 0) + J(x - 0) I
n -00 t 2
= II (M) + heM) + I3 (M) + I4 (M),
where
(22)
1
I 1 (M) = -
[00 f(x + t) sin(Mt) dt,
(23) I 2 (M) = -
n
nIt
1 /-1-00 f(x + t) sin(Mt) dt,
11
t
1 sin(Mt) 1
(24) I3 (M) = - f(x + t) dt - -f(x + 0),
(25)
not
I4 (M) = -
1
n
/0
-1
f(x
2
+ t) sin(Mt) dt -
t 2
1
-f(x - 0).
Then to prove (18) it will be sufficient to show that each of the functions II, ... , 14 has
limit zero as M --+ 00. The Riemann-Lebesgue theorem is directly applicable to II and
i:
12 . Indeed,
where
and
(26) -
1 11 sin(Mt)
dt = -
1 1M sines)
--ds = -Sl(M) --+-
1. 1
no t no s n 2
526 8 FOURIER ANALYSIS AND STURM-LIOUVILLE THEORY
(27) MM) = -
111
If(x + t) - f(x + 0)1
sin(Mt)
dt
+ (-111
n o t
sin(Mt)dt - -I)Ji(x+ 0),
not 2
and the last term has limit zero as M --+ 00 by (26). It remains to show that
(28) ·
I1m
M~oo
11
0
f(x + t) - f(x + 0) sm
t
. (M)d 0
t t= .
Hypothesis (1) (f and f' sectionally continuous) implies that the right -hand derivative
then get) is sectionally continuous and Ig(t) I is integrable over -00 < t < 00. Hence (28)
follows from the Riemann-Lebesgue theorem. This proves that I3(N) --+ 0 as M --+ 00.
A similar proof works for I4 (M). This completes the proof of the convergence theorem
for Fourier integrals.
Proof of the Riemann-Lebesgue Theorem. Only the first equality of (20) will be
proved. To do this it must be shown that given any E > 0 there is a number Mo = MO(E)
such that
(29) Ii: I
g(x) sin (Mx) dx < E for M > Mo.
For this purpose it will be convenient to introduce a parameter N > 0 and an auxiliary
i:
function ¢(x), - N :s x :s N. With this notation we may write
g(x) sin(Mx)dx = MM,N) + h<M,N) + I 3 (M,N) + I 4 (M,N),
where
II (M, N) = LOO
g(x) sin(Mx)dx,
B.9 THE FOURIER INTEGRAL 527
i:
i:
h(M,N) = g(x)sin(Mx)dx,
i:
13 (M, N) = (g(x) - ¢(x)) sin(Mx)dx,
It will be shown, for suitable choices of N and the function ¢(x) , that each of these four
integrals is less than E/4 for all M greater than some Mo = Mo(E).
Consider first the integrals II and h. The integrability condition (19) implies that
there is an N = N(E) such that
1N
00 Ig(x)ldx <:. and
4
j-N
-00
Ig(x)ldx < :..
4
It follows, since I sin (Mx) I ::s 1, that
E E
(30) IItCM,N)1 < 4' Ih(M,N)1 < 4
for all M. Next, a fundamental theorem of integration theory states that if Ig(x) I is
integrable on -N ::s x ::s N then there is a function ¢(x) with a continuous derivative
on - N ::s x ::s N such that
(31) j
-N
N
Ig(x) - ¢(x)ldx < -
E
4
and hence
This implies that if N = NO(E) is fixed as above then there is an Mo = MO(E) such that
(33)
(34) f(x) = ~
2n
1 (1 f(~)
-00
00 00
-00
cos fleX - ~) d~) dfl,
and
(35) o= ~
2n
1 (1 f(~)
00
-00 -00
00
sin fleX - ~) d~) dfl.
Multiplying (35) by i and adding the result to (34) gives
(36) j(x) = ~
2n
1 (1 f(~)ei/L(X-~) d~)
00
-00
00
-00
dfl,
or
(37)
This is the complex form of the Fourier integral representation of f(x). Note that if we
define J(fl) by
then
J
The function (x) is called the Fourier transform ofJ(x). Large tables of complex Fourier
transforms are available in [ER].
I~~~rcises 8.9J
Fourier Integral Theorem. Below, verify for the given f the three hypotheses of the Fourier integral
theorem.
8.10 FOURIER SINE ANO COSINE INTEGRALS 529
4. j(x) = _x2 for Ixl < l,f( -1) = -0.5,f(1) = -0.5,f(x) = 0 elsewhere.
Calculation of A(/t) and B(/t). &Iow, determine by direct integral evaluation explicit formulas for the
Fourier integral coefficient functions
A(/t) = ~
7f
1 00
-00
j(x) cos(/tx)dx, B(/t) = ~
7f
1
00
-00
j(x) sin(/tx)dx.
j(x) = 10 00
(A(/t) cos(/tx) + B(/t) sin(/tx)) d/t.
9. A(/t) = 4sin/tI/t, B(/t) = O,j(x) = 27f on Ixl < l,f(-l) = j(l) = 7f,f(x) = 0 elsewhere.
10. A(/t) = ~,B(/t) = 1-~~S/1- ,f(x) = 1 for 0 < x < 1,f(O) = j(l) = 1/2,f(x) = 0 elsewhere.
Fourier Integral Theorem. Apply the Fourier integral theorem in the form
j(x) = 10 00
(A(/t) cos(/tx) + B(/t) sin(/tx)) d/t
to obtain the follOwing new integration formulas, which are not easily obtained by any other method.
11. For x> 0,
while for x = 0 the integral evaluates to 7f, and for x < 0 it evaluates to O.
rOO (2COS(j.LX))
12 . For aII x, JO 1+/1-2 d/t = 7fe-Ixl .
13. For 0 < x < 7f roo (cosax+cosa(lI'-x)) da = sinx and for all other x it evaluates to O.
- - 'JO 1-a2 '
°
Thus if x is a distance coordinate, measured from the near end, then the temperature
is a function u(x, t) defined for all x :::: and t :::: 0. If the near end is held at zero
temperature then u(x, t) is the solution of the following boundary value problem.
Heat in a Semi-Infinite Rod (Dirichlet Boundary Condition). Find u(x, t) such
that
where f(x) is a prescribed initial temperature distribution. In analogy with the case of
the infinite rod, we shall seek a solution under the follOwing additional assumptions.
This problem can be solved by separation of variables. The steps parallel those for the
infinite rod in Section 8.7. The result is
(7) u(x, t) = 1 00
B(jL) sin(jLx)e-/12tdjL ,
where
(8) f(x) = 1 00
B(jL) sin(jLx)djL.
Details of the derivation are left for the Exercises. To complete the solution, (8) must be
solved for B(jL). This can be done by an orthogonality calculation, following the plan of
Section 8.8. However, a more direct approach may be based on the full Fourier integral
of Section 8.9. The key idea is to consider the odd extension ofj(x) , defined by
°
f(x) x> 0,
{
(9) rex) = x = 0,
-fe-x) x < 0.
8.10 FOURIER SINE AND COSINE INTEGRALS 531
and
(ll) rex) = -
Jr
21 0
DO
B*(x) sin(J.Lx)dJ.L
for all x. In particular, we have J*(x) = J(x) for x ::::: 0, and (ll) becomes (8) with
(12) B(fL) = - 21
Jr 0
DO
J(x) sin(fLx)dx.
(13) J and l' are sectionally continuous for 0 :::: x < 00,
(14) J(O + 0) = O,f(x) = ~[J(x + 0) + J(x - 0)] for 0 < x < 00,
Then
(17) J(x) = - 21
Jr 0
DO
sin (fLx) 1°C
0
J(~) sin(fL~)d~dfL for x::::: o.
where f(x) is a prescribed initial temperature distribution. Conditions (5), (6) are again
assumed. In this case separation of variables gives the solution
(22)
where
(23)
The last equation can be solved for A(fL) by applying the Fourier integral theorem to the
even extension off. This leads to the following theorem.
Fourier Cosine Integral Theorem. Let f(x) satisfy the conditions
(25) fCO) = f(O + 0) ,fcx) = ! [fCx + 0) + fCx - 0) 1 for 0 :::: x < 00,
(26) 10 00
If(x)ldx < 00.
Then
(28) A(fL) = ~ 10 00
f(x) cosCfLX)dx.
Details of the proof are left for the Exercises. Substitution of (28) into (27) leads to
the following corollary
Corollary. Under the hypotheses of the Fourier cosine integral theorem we have
(29) f(x) = - 21
:rr 0
00
COS(fLX) 1
0
00
f(~) COS(fL~)d~dfL for x 2: O.
8.11 HEAT DIFFUSION IN NONHOMOGENEOUS RODS 533
IExercises 8.10 I
l. (Heat Diffusion) Derive the solution (7)-(8) of the semi-infinite rod problem with Dirichlet boundary
conditions by the method of separation of variables.
2. (Heat Diffusion) Derive the solution (22)-(23) of the semi-infinite rod problem with Neumann boundary
conditions by the method of separation of variables.
3. (Fourier Sine Integral) Derive the representation (16), where B(IL) is defined by (12), using the Fourier
sine integral theorem.
4. (Fourier Cosine Integral) Derive the representation (27)-(28) using the Fourier cosine integral theorem.
where j(x) is a prescribed initial temperature distribution. The partial differential equation
(2) is equation (24) in Section 7.2 with coefficients p, q, w that are related to the physical
parameters p, c, A, K, H by
(7)
(p(x)X')' + [q(x) + AW(x)]X = 0 for a:::: x:::: b,
aIX(a) + a2X'(a) = 0, and bIX(b) + b2X'(b) = O.
This problem is called a Sturm-Liouville problem. The principal facts concerning such
problems are stated in the next section and used to solve the heat diffusion problem
(1)-(4).
(1) p(x), p'(x) , q(x) and w(x) are continuous for a::: x ::: b,
(2) p(x) > 0 and w(x) > 0 for a ::: x ::: b,
(3) (aI, a2) f. (0,0) and (b l , b2 ) f. (0,0).
(4)
and
(5) lim An =
n-+oo
+00.
Moreover, the eigenspace for any An is one-dimensional. This means that any two
eigenfunctions of the eigenvalue An differ at most by a constant factor.
Proofs of the existence theorem depend on methods of advanced calculus; see, for
example, [B-R]. Methods for calculating An and Xn(x) are given in Section S.13, together
with examples.
The success of the method of separation of variables depends on the orthogonality
of the eigenfunctions Xn(x). For the Sturm-Liouville problem of the above existence
theorem, the correct condition is described by the following theorem.
Orthogonality Theorem. Under the conditions of the existence theorem, the
eigenfunctions satisfy the equation
Condition (6) will be expressed by saying that Xm and Xn are orthogonal with weight
w(x) on a ::: x ::: b. When w(x) = 1, it reduces to the usual orthogonality of Fourier
theory. The follOwing proof of (6) is based directly on the Sturm-Liouville differential
equation and boundary conditions.
536 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
Proof of the Orthogonality Theorem. The eigenpairs (A'm, Xm) and (An, Xn) satisfy
(pX~)' + [q + AmwlXm = 0,
(pX~)' + [q + An W lXn = O.
If the first equation is multiplied by Xn , the second by Xm , and the second is subtracted
from the first, then the terms with q cancel. The result is
(8)
where
12 = p(b)[X~(b)Xn(b) - X~(b)Xm(b)l.
(9)
which again implies that It = O. Similarly, the boundary condition at x = b implies that
h = O. Thus
(10) (An - Am) lb Xm (x)Xn (x) w(x) dx = O.
If n i- m then An i- Am and (10) implies (6). This completes the proof.
Sturm-Liouville Series. The final ingredient in the method of separation of variables
is a convergence theorem that states that rather general functions f(x) have expansions
into an eigenfunction series:
00
If (11) is multiplied by Xm(x)w(x) and then integrated over a'::: x .::: b then orthogonality
condition (6) implies that
(12) =
f: f(x)X n (x)w(x)dx
X; (x)w(x)dx.
Cn b
fa
The series in (ll) with coefficients (2) is called the Sturm-Liouville series of/ex).
When is f(x) equal to its Sturm-Liouville series? Various sufficient conditions are known.
One classical theorem states that if
and
More General Boundary Conditions. The preceding theorems do not include the
case of periodic boundary conditions
(14) X(a) = X(b), X'(a) = X'(b).
It can be shown that most of the preceding theory holds in this case. An exception is
in the statement that the eigenvalues are simple (have a one-dimensional eigenspace).
In fact, for the full Fourier series it was shown that the eigenvalues An = n 2 > 0 had
two-dimensional eigenspaces with orthogonal eigenfunctions sin(nx) and cos(nx).
The separated boundary conditions of (7) and the periodic boundary conditions of
(14) are examples of more general linear homogeneous boundary conditions of the form
It is known that the orthogonality theorem holds for the Sturm-Liouville problem with
boundary conditions 05) if (and only iD the 2 x 2 coefficient matrices M and N satisfy
a1u(a, t) + a2ux(a, t) = 0,
(18) (
b1u(b, t) + b2 ux (b, t) = 0, t ~ 0,
(19) u(x,O) = f(x) for a .:::: x .:::: b,
where f(x) is a prescribed initial temperature distribution. The solution of the boundary
value problem (16)-(19) by the method of separation of variables can now be completed.
The product solutions of (16)-(18) are the functions
where An and Xn are the eigenvalues and eigenfunctions of the problem (7), Section 8.11.
The superposition step is to build the complete solution as a sum
L cnXn(x)e-Ant.
00
(20) u(x, t) =
n=l
To satisfy the initial condition (19) we need to represent f(x) as a Sturm-Liouville series
ell). This is possible if f(x) satisfies the conditions of a convergence theorem such as
(13). Then the solution is given by (20) with Cn defined by (12).
[~~ercises ~:12~
Weight Functions. Determine for the given problem the weight function w(x) and the interval of definition
of w(x).
11. The Legendre polynomials Xj (x) = x, X2(X) = (3x 2 - 1)/2, X3(X) = (5x 3 - 3x)/2, a = -1, b = 1,
w(x) = 1.
12. The Legendre polynomialsX2(x) = (3x 2 -1)12, X3(X) = (5x 3 -3x)!2, X4(X) = (35x 4 - 30x 2 +3)/8,
a = -1, b = 1, w(x) = 1.
13. (Separation of Variables) Let u(x, t) = X(x)T(t) be a product solution for the heat diffusion problem
in a nonhomogeneous rod. Supply the details in the method of separation of variables that show that
Tet) and X(x) satisfy, for some constant A,
14. (Orthogonality Theorem) Let (X n , An) and (Xm , Am) be two eigenpairs, as in the orthogonality
theorem. Supply the details in the derivation of the relation
(2)
are assumed with (al,a2) i= (0,0) and (b l ,b2) i= (0,0). If (2) is substituted into (3)
then the result is a homogeneous linear system for (Cl, C2),
is zero. Thus D(A) = 0 is an eigenvalue equation. The eigenvalues AI, A2, A3, ... , which
are guaranteed by the theory, are precisely the roots of this equation. In simple cases the
roots can be found exactly If this is not possible, then numerical methods must be used.
Examples of both types are given below. The corresponding eigenfunctions are given by
(7)
where (Cl, C2) is any nontrivial solution of (4). The solution (Cl, C2) is unique up to scalar
multiples. Indeed, the existence theorem implies that for each eigenvalue An one has
(9)
X" + AX = 0 for 0::: x::: rr,
X(O) =0 and X(rr) = o.
This is the special case of the general Sturm-Liouville problem (7) with P = w = 1, q = 0,
(a, b) = (0, rr), (aI, a2) = (1,0) and (b l , b2) = (1,0). As in Section 8.1, theform ofthe
solution basis VI (x, A), V 2 (x, A) changes with the sign of A. For A > 0 a suitable basis is
(10)
whence
This yields the eigenpairs An = n 2 , Xn(x) = sin(nx) as before. The cases A = 0 and
A < 0 yield no further eigenvalues.
This is the same as the preceding example except that the boundary conditions are
changed to (aI, a2) = (0,1) and (b l , b2 ) = (0,1). For A > 0 the solution basis (10)
may again be used and
whence
(16) DA
() = I JIcosJI
JI 0
-JIsinJI
= - 'Sl·U "I/A
I\.
".
542 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
This readily gives the eigenpairs ()..n,Xn(x» = (n 2 ,cos(nx», for n = 1,2,3, .... The
case A = 0 gives the additional eigenpair (AO,XO(X» = (0,1). The case A < 0 gives no
eigenvalues, as was shown in Section 8.4.
(17)
X" + AX = 0 for 0::::: x ::::: 1,
X(O) = 0 and X(1) + X'(l) = o.
Physically this corresponds to heat diffusion in a uniform rod with left end held at
temperature zero and Newton's law of cooling at the right end. The boundary conditions
are (aI,a2) = (1,0) and (b I ,b2) = (1,1).
To find convenient solution bases VI(X,A), V2 (X,A), the three cases A < 0, A = 0
and A > 0 are again treated separately:
Case 1 (A < 0). Write A = _J,t2 with J,t > O. Then a suitable basis is
whence
1 1
D(A) =
c fl- + J,te /L e-fl- - J,tc-fl-,
where A = _J,t2 Hence D(A) < 0 for all A < 0 and so there are no negative eigenvalues.
Case 2 (A = 0). In this case the differential equation is X" = 0 and a basis is
= 1,
VI (x, 0) V2 (x,0)= x,
V~(x,O) = 0, V~(x,O) = 1,
and so
AIl(O) = 1, A 12 (O) = 0,
A2l (0) = 1, A 22 (O) = 2,
8.13 CONSTRUCTION OF EIGENVALUES AND EIGENFUNCTIONS 543
and
1 0
D(A) = = 2 =F O.
1 2
All(A) = 0, A 12 (A) = 1,
A2l (A) = sin f.L + f.L cos f.L, An (A) = cos f.L - f.L sin f.L,
and hence
D(A) =
o 1
= - (sin f.L + f.L cos f.L).
sin f.L + f.L cos f.L cos f.L - f.L sin f.L
Note that if cos f.L = 0 then D(A) = ±l. Hence, the eigenvalue equation is sin f.L +
f.L cos f.L = 0, or, equivalently,
u_ 3rr
r- 2 FIGURE 10. Crossings of y = tan /1 and y = -/1
544 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
(2l)
(elxX')' + (e lx + Aelx ) X = ° for 0::::: x::::: n,
X(O) = 0, X(n) = 0,
is a Sturm-Liouville problem with p(x) = q(x) = w(x) = elx , (a,b) = (O,n),
(al,a2) = (1,0) and (h,b 2) = (1,0). The differential equation is equivalent to the
constant-coefficient equation
which can be solved by Euler's substitution X = e"x. It is easy to verify that there are no
°
eigenvalues A :s (Cases 1 and 2). For Case 3, write A = f-L2 with f-L > 0. Then Euler:S
method gives the solution basis
whence
D(A) =
elf cos(rrf-L)
1 °
e- rr sin (rrf-L) ,
Moreover, for A = An the system (4) has the solution (Cl, (2) = (0, 1) and hence
(23)
(xX')' + ).,x-IX = for ° 1:s x :s e,
XO) = 0, X(e) = 0,
is a Sturm-Liouville problem with p(x) = x, q(x) = 0, w(x) = 1/x, (a,b) = (1,e),
(aI, a2) = (1,0) and (b l , b2) = (1,0). The problem is solvable because the differential
equation is equivalent (use (xX')' = xX" + X') to the Euler equation
(24)
The equation may be solved by the substitution X = x", which produces a solution if
0i 2 + A = 0. It is easy to check that there are no eigenvalues A :s 0. For A = f-L 2 (f-L > 0)
546 8 FOURIER ANALYSIS AND STURM-LIOUVILLE THEORY
D(A) = 1
cOS(jL)
°
sin(jL)
= sin(-v'A\
Ie
1
Xm(x)Xn(x)x-1dx = Ie
1
dx
sin(mnlnx) sin(nnlnx)-
X
= 0, m#n.
The relation is easily verified by direct integration using the substitution t lnx,
dt = x-ldx.
(25)
(X2X')' + Ax- 2X = ° for I/2:s x :s 1,
X(l/2) = 0, X(l) = 0,
(26)
This has variable coefficients and is not an Euler equation. Hence a solution basis is not
obvious. However, the problem may be Simplified by a change of independent variable.
To do this, define new independent and dependent variables y and Y(y) by
8.13 CONSTRUCTION OF EIGENVALUES AND EIGENFUNCTIONS 547
whence
Y"+AY=O,
(27)
Y(l) = 0, Y(2) = 0.
The original problem will be solved by solving (27) and then transforming back.
It is easy to verify that (27) has no eigenvalues A ::::: 0. For A = ft2 (ft > 0) a solution
basis for (27) is
by the trigonometric identity cos a sin b - sin a cos b = sin (b - a). Thus the eigenvalues
are
Hence we may take ((1,(2) = (0,1) in system (4), which gives the eigenfunctions of
(27) as
An = rr 2n2 ,
Xn(x) = sin (7), n = 1,2,3, . ...
The corresponding orthogonality condition is
1 m
1
X m (x)X n (x)x- 2dx = 11
m
(mrr) sin (nrr)
sin -
x
- dx = 0,
2"
x x
m:f. n.
Exercises 8.13
Sturm-Liouville Problems. Solve the given Sturm-Liouville eigenvalue problem with Dirichlet conditions
for the eigenvalues and eigenfunctions and explicitly write out the orthogonality relations.
1. X" (x) + )'X(x) = 0, X( -rr) = 0, X(rr) = O.
2. (x-IX')' + x- 3(1 + )')X = 0, X(l) = 0, X(e) = O.
3. (xnX')' + h n- 2X = 0, X(a) = 0, X(b) = 0, 0 < a < b, n real. Do only the special case n = 1, a = 1,
b = 2.
If pea) = 0 then this equation has a singular point at x = a, in the sense of Chapter 3, and
solutions X(x) and their derivatives X' (x) may not have limits when x -+ a. In such cases
the boundary condition may be replaced by the weaker condition that XCx) and X'(x)
remain bounded as x -+ a. Recall the role of the boundary condition ajX(a) +a2X'(a) =
o in the proof of the orthogonality condition. It implied that (see (8) in Section 812)
pea) [X~(a)Xn(a) - X~(a)Xm(a)] = 0,
or, equivalently,
If p(x) -+ 0 and Xm(x), X~(x), Xn(x) and X~(x) remain bounded as x -+ a then (2)
again holds and the orthogonality theorem remains true.
Positivity of the Eigenvalues. It will be convenient to show first that any eigenvalue
must be positive. To do this let (A, X(x») be any eigenpair, then multiply (3) by XCx) and
550 8 FOURIER ANALYSIS AND STURM-lioUVILLE THEORY
Moreover,
11 xX 2 (x)dx > 0
because X(x) is a nontrivial solution of (3). Hence, to prove that A > 0, it will suffice
to show that the last integral in (6) is negative. This can be done by integration by parts
and use of conditions (4) and (5). The result is
Now, the integrated terms give zero because (l)X(l)X'(l) = 0, by (5), while
xl/ +X~lX' + AX = O.
For A = 1 this is the Bessel equation of order zero. To find the general solution for any
A > 0, make the change of variable
It follows that
where Jo(y) is the Bessel function of order zero, defined in Chapter 3, and Yo(y) is a
second independent solution. Hence the general solution of (3) for A > 0 may be written
(7)
and
The simplest verification is to repeat the calculation of Section 8.12, equation (8). The
details are left for the Exercises.
Fourier-Bessel Series. If a function j has a representation
L
00
fo1 j(x)JO(snx)x dx
(12) en = 1
fo 15 (SnX)X dx
The series in (11) with coefficients (12) is called the Fourier-Bessel series of j(x). The
convergence of the Fourier-Bessel series to its defining function is known to hold under
the same conditions as for regular Sturm-Liouville series. Convergence theorems and
full proofs are given in [Wal, Chapter 18.
EXAMPLE 2. Legendre Series. Consider the eigenvalue problem for the Legendre
equation
L cnPn(x)
00
(16) j(x) =
n=O
with coefficients
is called the Legendre series representation of j(x). It is known to be valid for -1 <
x < 1 when j(x) and l' (x) are sectionally continuous. The behavior at the singular points
x = ±1 is more complicated. See Ual for details and proofs.
EXAMPLE 3. A Sturm-Liouville Problem with Continuous Spectrum. Consider the
eigenvalue problem
It is a Sturm-Liouville problem with p(x) = x, q(x) = 0, w(x) = X-I and (a, b) = (0,1).
In particular, it is singular both because p(O) = 0 and also because w(x) has a singularity
at x = O. In Chapter 9 this problem arises in the analysis of steady-state heat diffusion
in a plate in the form of a circular sector. It is presented here as an example of a Singular
Sturm-Liouville problem that has continuous spectrum and is completely solvable.
(21)
A method for finding the solutions of an Euler equation was given in Section 3.4. It will
be used here to find the eigenvalues and eigenfunctions of (18)-(20). The three cases
A < 0, A = 0 and A > 0 will be treated separately.
Case 1 (A < 0). Write A = _J..L2 with J..L > O. Then the method of Section 3.4 gives the
general solution
This is bounded for 0 < x ::: 1 if and only if (2 = 0 and X(x) = (lXI". The boundary
condition X(1) = 0 then gives (] = 0 and hence X(x) == O. Thus the problem has no
eigenvalues A < O.
Case 2 (A = 0). In this case the general solution is
X(x) = (1 + (2lnx.
This is bounded for 0 < x :::: 1 if and only if (2 = O. Then XO) = (1 = 0 and so
X(x) == O. Thus A = 0 is not an eigenvalue.
554 8 FOURIER ANALYSIS AND STURM-LIOUVILLE THEORY
Case 3 (A > 0). Write A = p.,2 with p., > O. Then the method of Section 3.4 gives the
general solution
where C(p.,) is an integrable function. An answer to this question can be obtained from
the Fourier sine integral theorem. To derive it, make the change of variables x = e- t (so
lnx = -t) and write
-C(p.,) = B(p.,) = -
rr 0
21 00
pet) sin(p.,t)dt.
(25) C(p.,) = -
rr 0
211
f(x) sin(p., Inx)x- 1dx.
These calculations are valid when pet) = f(e- t ) satisfies hypotheses (13)-(15) of the
Fourier sine integral theorem. This implies the following representation theorem.
(26) f(x) and f' (x) are sectionally continuous for 0 < x ::::: 1,
(27) f(1- 0) = 0, f(x) = ![j(x + 0) + f(x - 0)] for 0 < x < 1,
Then f(x) has the representation (23), where C(p.,) is defined by (25).
8.14 SINGULAR STURM-lioUVILLE PROBLEMS 555
Proof. We need only verify that the conditions (26)-(28) for j(x) are equivalent to
conditions (13)-(15) for F(t) = fCe-t). The verification is left for the Exercises.
I Exercises 8.14 1
~"~"""'",",,,,,,~, ~'"'''''m, _" ,',,,,,b,,,,,il+,~,,',,"
1. (A Legendre Series on 0 ~ x ~ 1) Construct the eigenpairs (j"n, Xn) for the singular Sturm-Liouville
problem
2. (Another Legendre Series on 0 ~ X ~ 1) Construct the eigenpairs (.A.n, Xn) for the Singular
Sturm-Liouville problem
3. (Orthogonality of Fourier-Bessel Series) Supply the details in the proof of the orthogonality relation
The goal of this final chapter is to show how the boundary value problems of mathematical
physics can be solved by the methods of the preceding chapters. This will be done by
solving a variety of specific problems that illustrate the principal types of problems that
were formulated in Chapter 7. Additional applications are developed in the Exercises.
The primary solution method is Fourier's method of separation of variables and the
associated Sturm-Liouville theory of Chapter 8.
Chapter 9 is divided into seven sections, as indicated above. Section 9.1 presents
a final discussion of the one-dimensional heat diffusion problems treated in Chapter 8
(heat equation Ut = uxx} Section 9.2 treats boundary value problems for vibrating strings
(wave equation Utt = c2 u",) Section 9.3 develops Fourier's method for steady-state
temperatures in plates (two-dimensional Laplace equation Uxx + Uyy = 0). Section 9.4
treats transient temperatures in plates (heat equation U t = Uxx +Uyy ). Section 9.5 analyzes
vibrations of circular drums (wave equation un = c2 (uxx +uyy )). Section 9.6 treats steady-
state heat diffusion in solids (Laplace equation Uxx + Uyy + u zz = 0). Finally, Section 9.7
presents an alternative to the Fourier method that is based on the Laplace transform.
----{l )~
x= o X=](
FIGURE 1. Homogeneous heat-conducting rod
Some of these problems were already solved, or partially solved, in Chapter 8. In those
cases the results are summarized, and some extensions of the theory are presented.
Problem 9.1.1. Finite Rod, Ends at Zero Degrees. This problem was formulated, and
the uniqueness of the solution was shown, in Section 7.2, see also Figure 1. The problem
is to find a function u(x, t) such that
(5) u(x, t) =L
00
n=l
bn sin(nx)e- n1t , bn = - 21
]( 0
n
j(x) sin(nx) dx.
This satisfies (1) through (4) and hence defines the solution u(x,t) when u(x,O) = j(x)
and
L b sin(nx),
00
It was shown in Section 8.3 that sufficient conditions for this are that j and l' be
sectionally continuous and satisfy the conditions j(O) = j(Jr) = 0 and j(x) =
~ [r(x + 0) + j(x - 0)] for 0 < x < Jr. Clearly, the factors e- n2t in (5) make the series
converge very rapidly when t > 0, even if j(x) does not satisfy the conditions.
Under slightly stronger restrictions on j(x) one has
(7)
and hence the series in (6) converges absolutely and uniformly for allx. This is a corollary
of the following theorem of Churchill; see [Ch-B2J.
Churchill's Theorem. Let j(x) be defined for -Jr ::: x ::: Jr and assume that
(10) ak = -11:117<
_ f(x) cos(kx) dx, bk = -11:117<
_ f(x) sin(kx) dx.
Then
fJa~+b~ < 00
k=l
and the Fourier series for f(x) converges uniformly to f(x) for -11: ~ X ~ 11:.
If f(x) is given for 0 ~ x ~ 11: then one gets similar results for the sine and cosine
series of f(x) by applying Churchill's theorem to the odd and even extensions of f(x) ,
respectively. This gives the following corollaries.
Corollary. (Sine Series). Letf(x) be continuous, letf'(x) be sectionally continuous
on 0 ~ x ~ 11:, and letf(O) = f(11:) = O. Then (7) holds for
bn = -217< f(x) sin(nx) dx.
11: 0
Corollary. (Cosine Series). Let f(x) be continuous and let f'(x) be sectionally
continuous on 0 ~ x ~ 11:. Then
satisfies
(ll)
n=l
Computation of u(x, t). To compute u(x, t) from the series solution (5) we must truncate
the series; that is, replace (5) by a partial sum:
Lb sin(nx)e-
N
(12) UN(X,t) = n
n2t .
n=l
L b sin(nx).
N
(13) fN(X) = n
n=l
Similarly, the error u(x, t) - UN (X, t) is the solution of the diffusion problem with initial
values f(x) - fN(X) (the truncation error for (6». We wish to show that the truncation
error in u(x, t) does not exceed that for f(x):
(14)
560 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
This is a consequence of the maximum principle for the heat equation; see page 41 in
[Strj. Thus to achieve a satisfactory error bound for u(x, t) at any t it is enough to choose
N so as to achieve the same bound at t = O.
Problem 9.1.2. Finite Rod, Both Ends Insulated. For a rod of length L = Jr and
thermal diffusivity K = 1 this problem was formulated and solved by Fourier's method
in Section 8.4. The problem is to find a function u(x, t) such that
where j(x) is a prescribed function. The Fourier method led to the formal solution
00
where
(20) ao = - ll
Jr 0
1C
j(x)dx
and
(21) an = -
Jr
2i 0
1C
f(x) cos(nx) dx for n = 1,2,3, ....
In particular, setting t = 0 in (19) gives the Fourier cosine series representation of j(x):
00
Sufficient conditions for (19)-(22) to give the solution of the boundary value problem
were derived in Section 8.4. Under the stronger conditions of Churchill's theorem,
Corollary 2, one has
n=1
and the series in (19) converges absolutely and uniformly for 0 S x S Jr.
and
N
UN (X, t) = ao + Lan cos(nx)e-n 2t
, N = 1,2, ....
n=1
Ifj satisfies the hypotheses of Churchill's theorem, Corollary 2, then (23) and the triangle
inequality give the error estimate
L
00
since I cos(nx)e-n 2t lSI. The last sum is an error bound for the Fourier cosine series for
j(x):
L
00
°
To see how this may give a numerical error bound for the remainder (23), consider
Example 2 of Section 8.4: j(x) = x for Sx S Jr. Then
-4
a2n = 0, a2n+l = Jr(2n + 1)2' n = 0,1,2, ... ,
and hence, replacing N by 2N, we get
as an error bound for Iu(x, t) - U2N(X, t)l. The integral test for estimating the sum of a
series gives
N = 1,2, ....
Thus
1 1
RN S E if N > -
- 2
+ -.
JrE
This estimate, which holds for all t ::: 0, is pessimistic. For t > 0 the factors e- n2t make
the series in (19) converge much more rapidly.
Problem 9.1.3. Finite Rod, Mixed Boundary Conditions. This problem illustrates the
Sturm-Liouville theory for a homogeneous rod with different boundary conditions at
the two ends. The boundary value problem is to construct a function u(x, t) such that
where j(x) is a prescribed function. Physically, the boundary conditions (26) mean
that the end of the rod at x = 0 is insulated, while the end at x = 1 radiates
heat into an exterior medium at temperature zero. This problem illustrates the general
Sturm-Liouville theorem in a case where all functions can be calculated expliCitly.
Application of the separation of variables method leads to a series solution of the form
=L
00
(29)
Jo1 j(x)Xn (x)dx n=1,2,3, ....
C
n= Jo1 Xn(x)1 2 dx'
I
Here An and Xn(x) are the eigenvalues and corresponding eigenfunctions of the Sturm-
Liouville problem
(30)
X"(x) + AX(X) = 0 for 0::::: x::::: 1,
X/CO) = 0 and X/(l) + X(l) = O.
Case)... < O. Write)... = _/-L2 with /-L > O. Then the general solution of the differential
equation of (30) is
(31)
with derivative
where hI and hI are arbitrary constants. Thus)... = - /-L I is an eigenvalue if and only if
there is a pair (hI, h2 ) :j:. (0,0) such that
so
and hence
Hence the two boundary conditions give the following linear system for (hI, h2 ):
y
10
o
o 1f
"2
rr h 2rr
2 FIGURE 2. Graphical solution of tan JL = l/JL
Clearly, cos JL =f. O. Hence, (33) can also be written in the equivalent form
1
(34) tanJL = -.
JL
The corresponding eigenfunction is given by (32) with k2 = 0; that is, nonzero multiples
of
and
Accurate values of the eigenvalues can be obtained from equation (34) by standard
numerical methods, as developed in Chapter 1. In Table 1, the first ten crossings of
y = tan x with y = l/x (see Figure 2) were computed by Newton's method.
Eigenfunctions. These have the form (35) with fL = fLn:
This can also be verified by direct integration and use of the eigenvalue equation (34).
Problem 9.1.4. Fourier's Ring Problem. This is the physical problem of predicting the
diffusion of heat in a ring of unit radius when the initial temperature is given; see Figure
3. The problem was already discussed in some detail in Section 8.5. Here we note that
u(x, t), the temperature distribution at time t, is a 2rr-periodic function of the angular
°
coordinate x. Hence u(x, t) is determined by its values on any x-interval of length 2rr.
In Section 8.5 the interval ::s x ::s 2rr was used. It is often convenient to use instead
the symmetric interval -rr ::s x ::s rr.
In this formulation u(x, t) is defined by the follOwing properties.
Conditions for the validity of the solution (41), (42) are that f(x) and rex) be sectionally
continuous and satisfy the boundary conditions. A particularly favorable case occurs
when f(x) satisfies the conditions of Churchill's uniform convergence theorem, namely,
f( -n) = fen).
Then
and hence the series in (41) converges absolutely and uniformly for all x.
Error Estimates. Conditions (43) imply that the partial sums
N
(44) UN (x, t) = ao + L{an cos(nx) + bn sin(nx)}e- n2t
n=l
converge uniformly to u(x, t) as N --+ 00. Moreover, (41), (44) and the triangle inequality
give the error estimate
L
00
The sum on the right can often be estimated, as in Problem 9.1.2, to give practical error
estimates.
9.1 HEAT DIFFUSION IN ONE DIMENSION 567
Problem 9.1.5. Heat Diffusion in a Sphere (Symmetric Case). The boundary value
problem of heat diffusion is a solid body Q was formulated in Section 7.6. This problem
will treat the special case where Q is a sphere of unit radius defined by the inequality
(46)
See Figure 4. If the surface of Q is held at zero degrees and if the initial temperature is
a function fer) of the radial coordinate only, then we may expect that the temperature is
a function u(r, t) for all t 2: O. This conjecture will be proved by constructing a solution
of the form u = u(r, t) The uniqueness theorem for the problem (see Section 7.6) then
implies that there can be no other solution.
The heat equation for three-dimensional bodies is
where A is a separation constant. In particular, the time factor T(t) may be taken to be
(55)
For the radial factor R(r) we get the following singular Sturm-liouville problem:
2
(56) R"(r) + -R'(r) + AR(r) = 0 for 0 < r S 1,
r
R(l) = 0,
R(r) and R'(r) are bounded for 0 < r :::: 1.
This has the standard form of Section 8.11 with coefficients per) = w(r) = r2, q(r) = O.
The problem is singular because p(O) = O. The eigenfunctions will be orthogonal on
o :::: r :::: 1 with weight w(r)
= r2.
Solution of the Differential Equation. The differential equation of (56) can be
Simplified by the change of dependent variable
v"(r) + Av(r) = O.
Thus the differential equation for R(r) has the general solution
( )r - l_ k sin.JAr k cos.JAr
(58) R +2 .
r r
9.1 HEAT DIFFUSION IN ONE DIMENSION 569
Functions R(r) and R'(r) will be bounded on 0 < r ::: 1 if and only if k2 = O. Thus we
may take
sin ,J).r
R( r) = .
r
The boundary condition at r = 1 now gives the eigenvalue equation
An = n 2n 2 , n = 1,2, ... ,
sin(nnr)
RnCr) = , n = 1,2, ....
r
Product Solutions. Combining (53) with (55), we have
(59) Un
(r, t ) -_ sin(nnr) e
_n2n2(
, n = 1,2, ....
r
Superposition. The formal solution of the boundary value problem is
~ sin(nnr)
uCr, t ) =
_n2n2(
(60) ~ Cn e.
n=1 r
Orthogonality. The eigenfunctions are orthogonal on 0 ::: r ::: 1 with weight w(r) = r2,
so
(62) t
10
(SinCnmr») (SinCnnr») r2 dr =
r r
I ~ if m # n,
- If m = n.
2
This can be checked by direct integration.
Coefficients. The coefficients Cn are given by (61) and (62), namely
or
We see that the coefficients {en} are just the Fourier sine series coefficients of rf(r) on
o :s r :s 1. In particular, if rf(r) satisfies the conditions of Churchill's theorem, then the
series solution (60), (63) converges uniformly.
Problem 9.1.6. Heat Diffusion in a Disk (Symmetric Case). The problem of transient
heat diffusion in a plate Q was formulated in Section 7.3. Here we shall solve it for the
special case that Q is a disk of unit radius defined by
(64) r = -/x 2 + y2 :s 1,
(65) V 2 u(r, t) = ( a
ax2
2
+ a
ay 2
2
) u(r, t) = Urr + ~UT.
r
Hence the boundary value problem of Section 7.3 specializes to the search for a function
u(r, t) that satisfies
(o,~o-)---1 (1,0)
(72) (rR')' + hR = 0
and thus per) = w(r) = rand q(r) = 0 (see Section 8.11). Note that (72) is singular on
Os r S 1 because p(O) = O.
The eigenpairs (A, R(r)) are the solutions of the singular Sturm-Liouville problem
where S1, S2, ... are the positive zeros ofJo(x), enumerated in increasing order
and
Superposition. The solution of the disk problem will have the form
Initial Values. To complete the solution of the boundary value problem, we must have
oc
(77) u(r,O) = fer) = L enio (Snr) ,OS r S 1.
n=1
572 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
This is the Fourier-Bessel series of Section 8.14. The orthogonality property implies that
the coefficients {cn} are given by
(84)
Equations (83) and (84) give the solution of the boundary value problem when j(x)
satisfies the conditions of the Fourier integral theorem (see Section 8.9).
To compute the solution u(x, t) by means of (83), (84), it is necessary to carry out
three integrations. We shall now show that this can be reduced to a Single integration.
Equation (84) and the trigonometric identity
imply that
9.1 HEAT DIFFUSION IN ONE DIMENSION 573
Double Integral Representation. Combining (83) and (85) gives the equation
which requires only two integrations. We shall reverse the order of integration in (86)
and carry out the inner integration to get a simple integral representation for u(x, t).
The change of order of integration in (86) is justified by a theorem of advanced
calculus if
(87)
This gives
i: If(x) I dx < 00.
(88)
1
of the boundary value problem:
1 00
(90) u(x, t) = c;
(x_()2
j(~)e--4-t d~, t> o.
2y nt -00
This formula was derived under the integral assumption (87). However, the formula is
valid for a larger class of initial values. For example, it is enough that j(x) be bounded and
sectionally continuous, in which case it may be shown that if U is defined by (90), then
1
lim u(x, t) = - [[(x + 0) + j(x - 0)] for all x.
t--+O+ 2
The proof, which depends on methods of advanced calculus, will not be given here; see
[Wi2l.
An Alternative Formula. If in (90) we make the change of variables s = (~- x)/(2,.fi.) ,
we get
(91) U(x, t) =
y
1
r.;;.
n
1 00
-00
j(x + 2s,.fi.)e- S2 ds.
574 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
From this formula it is rather easy to show that limHo+ u(x, t) = j(x) whenever j(x) is
continuous and bounded (see the Exercises).
EXAMPLE 1. Consider the special case where the initial temperature is the Heaviside
function: j(x) = H(x). The solution is given by (91). Since x + 2s.Jt ::: 0 if and only if
s ::: -xle2.Jt), we have
(92) u(x, t) = 1
r;;. /00 e-s'ds.
'\j J( -xJ(2..!iJ
The Error Function. The integral in (92) can be expressed in terms of the error
function of probability theory The latter, denoted by erf (x), is defined by the equation
(93) 2
erf(x) = ~ l 0
x
e- s, ds.
The function is tabulated (see [A-S]) and is available via many computer algebra systems.
Numerical evaluation of erf(x) via numerical integration was discussed in Section 1.3.
The complementary error function is also useful. It is defined by
(94) 2
erfc(x) = 1 - erf(x) = ~ 100
x e- s, ds.
Example 1 (Concluded). The properties of the error function allow us to write the
solution u(x, t) of (92) in the following equivalent forms:
-x ) -x )
(96) u(x, t) = 2:1 erfc ( 2.Jt = 2:1 - 2:1 erf ( 2.Jt
(x)
= 2:1 + 2:1 erf 2.Jt .
From this and the properties of erf(x) we can derive the following results:
o 8
FIGURE 6. Graph of ~ + ~ erf(xI(2y't)) with t
-8 -4 4
fixed It = 1)
A graph of u(x, t) as a function of x, with t > 0 fixed, is shown in Figure 6. Recall from
Section 1.3 that erf(x) ::: 0.9999 for all x ::: 3.
The Fundamental Solution K(x, t). The solution (90) of the initial value problem
for the infinite rod is a convolution
(97)
where
U(x, t) = i: K(x - t OJ(/;) d/;,
°
Thus K(x, t) is the temperature at point x and time t > due to a quantity of heat placed
°
at x = at time t = 0. To determine how much heat is needed to generate K, we define
where (a, b) is an arbitrary interval. Physically, Q(a, b, t) is the amount of heat in the
segment a :::: x :::: b at time t > 0. We shall calculate its limit as t -+ O+. From the
definition
Q(a, b, t) = -1-
2JITt
lb a
e-x2/4t dx
576 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
(100) Q(a, b, t) =
1
'-
l b'(2.ji)
e- 52 ds
v 1T a/(2.ji)
limQ(a,b,t)=jo ifa<b<OorO<a<b,
[-+0+ 1 if a < 0 < b.
This makes precise the statement that the fundamental solution K(x, t) is the temperature
at point x and time t > 0 due to the release of one calorie of heat at x = 0 at time t = O.
Problem 9.1.8. Heat Diffusion in Semi-Infinite Rods. This problem was solved in
Section 8.10 for cases of zero temperature or heat flux (Dirichlet or Neumann boundary
condition) at the near end of the rod. See Figure 7. Here Single-integral formulas for the
solution are obtained.
Zero Temperature at the Boundary. For this case we found that if u(x, 0) = f(x) and
(101) B(fJ,) =- 21
1T 0
00
j(~) sin(fJ,~) d~
then
(102) u(x, t) = i oo
B(fJ,) sin(fJ,x)e- 1l2t dfJ,.
A single integral formula for u(x, t) in terms of f(x) can be derived by substitution of
(102) into (101), changing the order of integration and evaluation of the inner integral,
as in the case of the infinite rod problem. However, it is easier to apply the result for the
infinite rod to f*(x), the odd extension off(x), as in Section 8.10. Then f*(x) = -fe-x)
for x < 0 and the integral formula (90) of Problem 9.1. 7 gives
(103)
(u)________________~I_ ~
-,,=0 FIGURE 7. Heat diffusion in a semi-infinite rod
9.1 HEAT DIFFUSION IN ONE DIMENSION 577
(105)
where
(107) u(x, t) = 1
00
KN(x, T], t)J(T]) dT],
where
The function KN is called the fundamental solution for the semi-infinite rod with
Neumann boundary conditions. Formally, the fundamental solutions KD and KN differ
by a single sign.
I Exercises 9.1 I
Finite Rod. Solve the follOwing boundary value problems for u(x, t).
1. Dirichlet Problem 9.1.1 ,f(x) = sin 2x - sin 5x.
2. Dirichlet Problem 9. 1.1, f(x) = sin lOx.
3. Neumann Problem 9.1.2,f(x) = 1- cos2x + sin 2 x.
4. Neumann Problem 9.1.2,f(x) = 4 + sin2 2x.
5. Mixed Dirichlet-Robin Problem 9.1.3,f(x) = cos(/-lx), where /-l tan/-l = I and /-l R: 0.27411'.
6. Mixed Dirichlet-Robin Problem 9.1.3,f(x) = 3 cos(/-lx), where /-l tan/-l = 1 and /-l R: 2.04911'.
Fourier's Ring Problem. Solve for u(x, t).
7. Problem 9.1.4,f(x) = sin 2x.
8. Problem 9.1.4,f(x) = cos2x.
578 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
2
Ut = Urr + -Ur·
r
14. (Orthogonality) Prove by direct integration, using only calculus methods, that
11 (sin(rrmr)) (sin(rrnr)) 2
---
o r r
- - - r dr=
{O1 ifm i= n,
-
2
if m = n.
15. (Radial Heat Equation n = 2) Assume that the 2-dimensional heat equation
Ut = Uxx + Uyy
has a solution u = u(r, t) that depends on x and y only in the combination r = ,jx 2 + y2. Show by
the chain rule that the heat equation for U can be written as
1
Ut = Urr + -Ur·
r
16. (Heat Integral) Letf(x) be continuous and bounded on -00 < x < 00. In the integral formula
u(x, t) =
"rr
1
r.; 1 00
-00
f(x + 2s0)e-s2 ds,
show that for each fixed x, limHo+ u(x, t) = f(x) , by using the equality
lim
N->oo
1r.;
"rr
lN-N
e- s2 ds = 1.
K(x t) -_ _
'2v'iii1_ e-x2 /(4t)
, -00 < x < 00
,>.
t °
Verify using only calculus methods that K is a solution of the heat equation Kt = Kxx for -00 < x < 00,
t > 0.
9.2 VIBRATION OF STRINGS AND TRAVELING WAVES 579
(1)
where u(x, t) is the string displacement at point x and time t. In (1), c = .JTlP, Tis
the uniform tension in the string, and p is its mass density. We begin this section with a
discussion of d'Alembert's remarkable discovery that all solutions of (1) represent waves
that travel along the string with constant speed c.
Traveling Waves on Strings. We begin with the observation that if ¢(x) is any
function with a second derivative then
is a solution of (1). Indeed, two applications of the chain rule give Uxx = ¢"(X - ct) and
Utt = C2¢"(X - ct), which imply (1).
The graph of (2) in the ex, u)-plane, with t fixed, is identical to the graph of u = ¢(x),
but translated to the right by ct units, as in Figure 8 below.
Equation (2) represents a traveling wave, propagating to the right without change of
form. The propagation speed is clearly c (see the Exercises). In the same way, if u = y,(x)
is another function with second derivative then
is a solution of (1) that represents a traveling wave that moves to the left with constant
speed c. Moreover, since the wave equation (1) is linear and homogeneous, we may add
-- x
:;.
FIGURE 8. Graph of uex, 0) = rf>ex) and
xo x=xo+ct uex, t) = rf>ex - ct)
580 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
(5) r = x - ct, s= x + ct
and write
so that for a given u, (5) and (6) define v. Then a simple application of the chain rule
gives (see the Exercises)
Thus (1) holds if and only if in the (r, s)-plane one has
(7)
a2 v
-=0.
aras
Integrating first on r gives
av
as = 8(s),
Product Solutions-An Example. In Section 7.1 it was shown that the wave
equation (1) has product solutions such as
where jL is a constant. It is not immediately apparent that this is a sum of traveling waves,
as guaranteed by d'Alembert's theorem. However, the trigonometric identity
Problem 9.2.1. Traveling Waves on an Infinite String. If a long taut string is struck
at a point far from its ends then intuition and experience suggest that traveling waves
will propagate out from that point with speed c. To analyze this problem, it is natural to
consider the idealized problem of an infinite string. Thus the displacement u(x, t) of the
string will be characterized by the following properties (see Section 7.1).
where j(x) and g(x) are prescribed functions. We have already argued (see Section 7.1)
that to predict the future displacement, we must know the initial displacement and
speed, characterized by j(x) and g(x), respectively. The correctness of this model will
be shown by verifying that there is only one function u(x, t) with properties (10), (11)
and (12) (uniqueness). Then the solution will be constructed by two methods, based
on d'Alemberts' theorem and Fourier's integral theorem, respectively. Finally, the two
solutions will be shown to be the same, as required by the uniqueness theorem.
Uniqueness Theorem. For each choice of the functions j(x) and g(x) there is at
most one function u(x, t) that has properties (10), (11) and (12).
Proof. The linearity of the wave equation (11) implies that the uniqueness property is
equivalent to the statement that if j(x) = g(x) = 0 for -00 < x < 00 then u(x, t) = 0
for -00 < x < 00 and t :::: 0 (see the proof ofthe uniqueness theorem in Section 7.1).
582 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
whence
for all x. It follows easily that ¢'(x) = 1/I'(x) = 0 for all x, and hence ¢(x) = ki
and 1/I(x) = k2 (k i and k2 constants). Next, ki + k2 = 0 by (14), so ¢(x) = ki and
1/I(x) = - k1. Finally; by (13) we have
+ -l1
0
x
21/1(x) = j(x) g(;) d;.
c 0
9.2 VIBRATION OF STRINGS AND TRAVELING WAVES 583
Substituting this into (13) gives the solution (see the Exercises for details)
x-ct
+ g(~) d~.
This formula is called d'Alembert's solution of the initial value problem (10), (11), (12).
We have shown that if (10), (11), (12) has a solution then it must be given by this
formula. The converse statement, that (15) defines a function that satisfies (10), (11),
(12), is easily verified by differentiation and setting t = O.
Construction of the Solution via Separation of Variables. The solution (15) can also
be derived by the separation of variables method. To simplify the writing, we shall do
this for the special case g(x) = 0 only: Thus we seek a function u(x, t) that satisfies
where j(x) is a prescribed function. The first step of the separation of variables method
is to seek those product functions
while
(21) u(x, t) = 1 00
[A(j.L) cOS(j.Lx) + B(j.L) sin (j.Lx) ] cos(j.Lct) dj.L.
u(x,O) = f(x) = 1 00
[A(j.L) cOS(j.Lx) + B(j.L) sin(j.Lx)] dj.L.
This is a Fourier integral. Thus by the Fourier integral theorem we must choose A(j.L),
B(j.L) to be
Thus (21) and (22) define the solution of the problem. They require three integrations.
However, we shall show that all integrations can be carried out to recover d'Alembert's
solution (15) with g(x) = O. To do this we note that (22) implies
u(x, t) = -1
J'(
1 fOO
0
00
-00
f(~) cos j.L(X - ~) cOS(j.Lct) ~dj.L.
u(x, t) = -1100 foo f(~) [cos j.L(X - ~ - ct) + cos j.L(x - ~ + ct)] d~dj.L.
J'( 0 -00
f(x) = -1
J'(
1 foo
0
00
-00
f(~) cos j.L(x - ~) d~dj.L
9.2 VIBRATION OF STRINGS AND TRAVELING WAVES 585
gives
1
(23) u(x, t) = 2." [f(x - ct) +f(x + ct)].
This is just d'Alembert's solution in the case where g(x) = o.
EXAMPLE 1. (Plucked String). To pluck a taut string, one gives it a (small) displacement
f(x), holds the string motionless, and then releases it. Hence g(x) == 0 and the solution
is given by (23). Thus the initial profile splits into two equal parts that propagate to the
left and right with speed c. See Figure 9 below.
Problem 9.2.2. Traveling Waves on a Semi-Infinite String. This problem is to find the
waves generated on a long taut string with one end fixed. If the string is plucked or struck
at a point near the fixed end (and therefore far from the other end) then it is natural
to model the problem as a semi-infinite string, stretched along the positive x-axis, with
displacement zero atx = 0 (Dirichlet boundary condition). The boundary value problem
is to find the displacement u(x, t) characLerized by the following conditions.
Uniqueness. This problem has (at mosL) one solution. To show this, it is again enough
to show that if f(x) = g(x) = 0 for all x ::: 0 then u(x, t) = 0 for all x ::: 0 and t ::: O.
This is most readily done by using d'Alembert's theorem. Thus
(29) u(x, t) = </l(x - ct) + ljr(x + ct) for all x::: 0, t::: o.
-
FIGURE 9. Graph of
u(x,1) = Hr(x - e1) + f(x + ct)]
±f(x + ct) f(x) ±l(x - et) and u(x, 0) = f(x)
586 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
°
Note that (29) requires that 1/r(~) be defined for ~ ::: only, but cJ>(~) must be defined
for all real~. Conditions (26), (27), (28) withf = g = give the equations °
(30) U(O, t) °
= cJ>( -ct) + 1/r(ct) = for all t::: 0,
(31) u(x,O) = cJ>(x) + 1/r(x) = °
for all x::: 0,
(32) ut(x,O) = -c [cJ>'(x) -1/r'(x)] = °
for all x::: 0.
°
so 1/r'(x) == and 1/r(x) = k (a constant). Next, cJ>(x) = -k by (31) again. Returning to
(29) gives u(x, t) = k - k = 0, which completes the proof.
Solution of the Problem via d'Alembert's Theorem (g = only). The solution of the °
boundary value problem (24)-(28) can be written in d'Alembert's form (29) again. We
°
shall use the initial and boundary conditions (26), (27), (28) to find the functions cJ> and
1/r. To shorten the writing, only the special case g = is treated. The complementary
°
case f = 0, g of- can be done by the same method. Note that to define u(x, t) for all
° °
x ::: and t ::: we must determine cJ>(~) for all t and 1/r(~) for all ~ ::: 0.
Boundary Condition. By (26) we have
whence
and Ut(x, 0) = -c [cJ>'(x) -1/r'(x)] = 0. Thus cJ>' and 1/r' are defined by
(36) cJ>' (x) + 1/r' (x) = l' (x),
cJ>' (x) - 1/r' (x) = ° for all x::: 0.
whence
1
¢(x) = "2f (x) + hI,
1
1jJ(x) = "2f (x) + h2 ,
where hI and h2 are constants. But then (35) implies that hI + h2 = 0, so
1
¢(x) = "2f (x) + hI,
1
1jJ(x) = "2f (x) - hI·
The constant hI may be absorbed into the definition of ¢ and 1jJ, as in Problem 9.2.1, so
1
(37) ¢(x) = 1jJ(x) = "2f (x) for all x 2: o.
Finally, (34) gives
1
¢(x) = -1jJ( -x) = --f( -x) for all x:::: 0,
2
or
~f(x)
1-"2f (
for x 2: 0,
(38) ¢(x) = 21
-x) for x < O.
where fa (x) is the odd extension of f(x) to -00 < X < 00. Finally, the full solution of
(24)-(28) in the case g = 0 is
1 1
(39) u(x, t) = "2fa (x - ct) + "2f (x + ct).
The function ¢ will satisfy
1
¢(O+) = "2f (O+) ,
1
¢(O-) = -"2f (O+).
Thus ¢(x) will be continuous at x = 0 only if f(O+) = O. Physically, this means that
u(x,O) = f(x) must satisfy the boundary condition.
588 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
u
Stage 1 f(x)
_ _ _ _ _ _ _ _. - - ,_ _ _ _ _ _~----~--L-----~ X
Stage 2 -A B-
____________~~--~-L~~---L--~~ X
c-
Stage 3 B-
c-
------------------~ __~--------~~x
1 1
A = 2f (x +ct) B= -f(x - ct)
2
(41)
It was shown in Section 8.14 that this has the unique solution
(43) B(jJ,) = - 21
IT 0
00
f(~) sin(jJ,~) d~;
that is, B is the Fourier sine integral of f(x). The solution (41), (43) requires two
integrations. To see the identity of this solution with d'Alembert's solution (39), note that
the Fourier sine integral (42), (43) is meaningful for all x and gives the odd extension of
f(x):
(44) fo(x) = 1 00
B(jJ,) sin(jJ,x) djJ, for all x.
To write the solution (41) in traveling wave form, we make use of (44) and the
trigonometric identity
u(x, t) = - 11
2 0
00
B(jJ,) [sin jJ,(x - ct) + sin jJ,(x + ct)] djJ,.
1 I 1 I
ux(x, t) = "2fe (x - ct) + "2f (x + ct),
and hence
1 1
ux(O, t) = -f: (-ct) + -f' (ct) = 0,
2 2
590 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Problem 9.2.3. Vibrating String of Finite Length-Ends Fixed. This problem was
formulated as a boundary value problem in Section 7.1. The string displacement is
characterized by the following five conditions.
where j(x) and g(x) are prescribed initial displacement and speed, respectively. The
uniqueness of the solution was proved by the energy method in Section 7.1.
The solution of this boundary value problem can be constructed by the d'Alembert
traveling wave method, but the details are lengthy. We shall determine this solution by
an indirect method, beginning with the separation of variables method. The latter can
be simplified by noting that if u(f,g) is the solution of (46)-(50) then the linearity of
conditions (46)-(50) implies that
Thus it will be enough to calculate u(f,O) and u(O,g) separately and then add them to get
the general solution.
Construction of u(f,O) by Separation of Variables. We seek first the product functions
that satisfy (46)-(48) and (50) with g = O. These are seen to be
(52) sin
n1lX) cos (mrct)
(L L ' n=I,2,3, ...
and their scalar multiples. Superposition then gives the trial solutions
From Chapter 8 we know that the correct choice is the Fourier sine series coefficients
for j(x):
. (nrrx)
sm - cos (nrrct)
- - = -1.sm (nrr(x-ct)) + -1.sm (nrrex+ct))
L L 2 L 2 L
implies that
and
nrr(x+2L))
sin ( L = sin (nrrx
L + 2rrn
)
= sin (nrrx)
L ;
i.e., the functions sin(nrrxlL) have a common period of 2L. It follows that
where
These conditions characterize f*(x) as the (unique) odd periodic extension of f(x) with
period 2L. With this definition of f* (x), equation (56) is equivalent to
1 1
(60) u(f,O) (x, t) = "2J*(x - ct) + "2J*(x + ct).
Conversely, if u(f,O) is defined by (60), where 1* is defined by (59), then it is easy to check
that u(f,O) is the solution of (46)-(51) with g = O.
Construction of u(O,g). In this case we seek the product solutions that satisfy (46)-(49)
with f(x) = O. These are the scalar multiples of
and
(63) (O,g)
Ut (x,t ) -_ ~
~BnI (nnc)
- . (nnx)
Sin - COS
(nnct)
-- .
n=] L L L
(64) (0 )
U t ,g (x, t) = g(x) = L 00
BnI (nnc)
- sin (nnx)
- ,
n=] L L
which implies
nnc) 2 [L (nn~)
(65) B~ ( L = bn(g) = L 10 gmsin L ~.
We may sum the series in (63) via (65) and Dirichlet's convergence theorem to get
+ -1 ~b
~ n
(g) sm
. (nn(x+ct))
2 n=] L
where again g*(x) is the odd periodic extension of g with period 2L. To get u(O,g) itself,
we may integrate to get
Thus
= -
1 lx+ct g*(~)d~ - -
1 lX-ct g*(~) ~
2c x 2c x
= -2
1 lx+ct g*(~)~.
c x-ct
Finally, adding u(f,O) and u(O,g) gives the general solution
where f* and g* are the odd periodic extensions off and g with period 2L.
Remarks on String Instruments in Music. The vibrating string of finite length
with fixed endpoints is a model for the vibrating strings of musical instruments such as
violins and pianos. The product solution
un(x,t) nnx)
= sin (L [Bn cos (n::nct)
L +Bn, sin (n::nct)]
L '
with parameters Land c = "jf7p, is called the nth characteristic function (or the nth
normal mode) of the string; see Figure 11. It represents a synchronous vibration with
frequency
v, = ;LIf
In music the higher multiples of VI are called overtones. Thus
x
n=l n=2 n=3
The occurrence of these simple fractional ratios is the basis for the harmonious tones of
stringed instruments.
Periodicity of Vibrations of Strings. The solution formula (60) for u(j,O) gives
1 1
u(j,O\x, t + 2Uc) = "'21*(x - ct - 2L) + "'21*(x + ct + 2L)
1 1
= "'2f*(x - ct) + "'2f*(x + ct)
= u(j,O) (x, t),
since1* has period 2L. Thus for any initial displacement u(x, 0) = f(x) the corresponding
motion has a fixed period
2L =2L fE
r=
c VI
and frequency
VI = ~r = ~ fi
2Ly"-P hertz.
Thus every motion of the string is a musical note of frequency VI. The method of tuning
a string is to adjust the tension T until VI has the desired value (for example VI = 440
for middle A).
Problem 9.2.4. Finite Strings with Other Boundary Conditions. Changing the boundary
conditions has a profound effect on the vibrations of a taut string. As an example we
9.2 VIBRATION OF STRINGS AND TRAVELING WAVES 595
shall discuss the boundary value problem defined by the following conditions.
where j(x) is a prescribed initial displacement. Separation of variables leads to the same
Sturm-Liouville problem as in Problem 9.1.3. Thus the product solutions of (66)-(69)
are the scalar multiples of
(72)
L
00
=L
00
where
00
Traveling Waves on an Infinite String. Solve for u(x, t) by applying d'Alembert's formula, then graph
three snapshots of the solution on -5 ~ x ~ 5 at t = 0, t = 0.5 and t = 1. Assume c = 1.
5. f(x) = sinx.
7. (Speed of a Traveling Wave) Let </>(x) be any function with a second derivative and define u(x, t) =
</>(x - ct), where c > 0 is a constant. Let d(t) be the positive distance that the graph of rp(x) is translated
in order to coincide with the graph of u(x, t). Show that the translation is to the right and d(1) = ct, and
hence the speed of a traveling wave is d' (I) = c.
8. (Change of Variables) Let u(x, t) be a given twice-differentiable function. Introduce new coordinates
(r, s) by
r =x - ct, s = x + ct
and define vCr, s) = u«r + 5)12, (5 - r)l(2c)). Apply the chain rule to establish the identity
9. (D'Alembert's Solution) Given f, I' and g continuous, let rp and 1/1 be continuously differentiable
functions that satisfy the equations
Integrate two of these equations from ~ = 0 to ~ = x to obtain equations for t/> and 1/1, and then substitute
into the equation u(x, t) = t/>(x - ct) + 1/I(x + ct) to obtain d'Alembert's solution
{
2X 0 ~ x ~ 1/2,
f(x) = 2(I-x) 1/2 ~ x ~ 1.
Then f has an odd periodic extension h of period 2 and the exact solution y can be written in terms of h:
1
Y= "2 [hex - tH hex + t)j.
11. Graph the solution surface yon 0 ~ x ~ 1, 0 ~ t ~ 1 and also the "snapshots" of y at times t = 0,
t = 0.2, t = 0.4, t = 0.6, t = 0.8, t = 1.
12. Determine the first five terms of the series representation of the solution
L Bn sin(mrx) cos(mrt).
00
y(x, t) =
n=l
Let z be the partial sum of the first five terms. Graph the solution surface z on 0 ~ x ::: 1, 0 ::: t ::: 1
and also the "snapshots" of z at times t = 0, t = 0.2, t = 0.4, t = 0.6, t = 0.8, t = 1.
The Semi-Infinite String. Let fex) be a prescribed function with f twice continuously differentiable and
consider the problem
°
u(x,O) = f(x) for 0 ~ x < 00,
Ut(x,O) = for 0 ~ x < 00.
The exercises below refer to this problem as the semi-infinite string problem, even though it is a special case.
13. Show that the product solutions for the semi-infinite string problem are
15. (Even Extension) Assume that f is twice continuously differentiable on x ;:,: O. Let fe(x) be the even
extension off(x) to -00 < X < 00 and define
1 1
u(x, t) = "2fe(x - ct) + "2 f (x + ct).
Show that U is a solution of the wave equation Utt = c2uxx on x ;:,: 0, t ;:,: 0 with wave shape
u(x,O) = f(x) and wave velocity Ut(x, 0) = o.
Problem 9.3.1. Rectangular Plates, Dirichlet Boundary Condition. This problem asks
for the temperature distribution u(x,y) that satisfies Laplaces equation in a rectangle and
has prescribed values on its four edges. Before solving the general case, we shall illustrate
the method by the case of a square plate of side Jr with zero boundary conditions on
three of the four edges. Thus u(x, y) must satisfy
(1) u (x, y) is defined for 0 ::: x ::: Jr, 0 ::: y ::: Jr,
(2) Uxx + Uyy = 0 for 0::: x ::: Jr, 0 ::: y ::: Jr,
(3) u(O,y) = 0 and u(Jr,y) = 0 for 0::: y ::: Jr,
(4) u(x, 0) = 0 for 0::: x ::: Jr,
(5) u(x, Jr) = j(x) for 0::: x ::: Jr,
that satisfy all the linear homogeneous conditions of the problem. These are (1)-(4) in
the present case. Substituting (6) into (2) and separating variables gives
Y"(y) XI/(x)
--=---=A
Y(y) X(x) ,
where).. is a separation constant. We satisfy (3) by X(O) = X(Jr) = 0 and (4) by yeo) = 0.
Thus X is a solution of the familiar Sturm-Liouville problem
with solutions
An = n2 ,
Xn(X) = sin(nx), n = 1,2,3, ....
The corresponding Y -factors, denoted by Yn , satisfy
(7) u(x,y) =
n=1
where the coefficients Cn are chosen such that the series converges. We wish to choose
the cn's such that (7) satisfies (5); that is, for 0 :s x :s Jr,
L
oc
~ . sinh(ny)
(9) u(x,y)=L.,..bn(j)sm(nx). h( ).
n=1 SIn mr
Convergence. Equation (9) is only a formal solution of the boundary value problem
because the convergence of the series has not been shown. To analyze this we note that
the quotient of hyperbolic sine functions satisfies
Hence the series in (9) converges at least as well as the Fourier sine series for j(x). Criteria
for the convergence of Fourier sine series were given in Section 8.3. Note also that
sinh(ny) e ny e-ny
- -n(1(-y)
----~e for n -+ 00.
sinh(nJr) e n1( - e-nrr
Thus the series (9) converges very rapidly when 0 ~ y < Jr. The slowest convergence
occurs when y = Jr.
The General Case. Now consider the steady-state temperature u(x,y) in a rectangular
plate of arbitrary dimensions Land M units, respectively. If coordinates are chosen as in
Figure 12,
then u(x,y) is characterized by the boundary value problem
where 11 (x) ,h (x), gl (y) and g2 (y) are prescribed functions. Conditions (12)-(15), taken
together, specify u on the complete boundary of the rectangle.
Separation of variables is not directly applicable to the boundary value problem
(10)-(15) when all the functionsl1,h, gL g2 are nonzero. However, if
then the solution of the general case (10)-(15) is given by the sum (check this!)
where 11, h are given. To solve this by separation of variables we first construct the
product solutions of the homogeneous conditions (19), (20) only. Putting
(22) v(x,y) = X(x)¥(y)
An = (mrILi,
Xn(x) = sin(mrxlL), n = 1,2,3, ....
The eigenfunctions Xn are orthogonal on the interval 0 :::: x :::: L. The corresponding
orthogonal expansion is the Fourier sine series for that interval.
602 9 BOUNOARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Vn(X,y) = sin(nJl'xlL)YnCy),
where Yn is the general solution of
(23)
or
y~(o) = 0, Y~(M) = 1,
and so
We will use these coefficients to satisfy the nonhomogeneous boundary conditions (21).
Construction of v (Completed). Superposition gives
00
This satisfies Laplace's equation and the two homogeneous boundary conditions. To
satisfy the two nonhomogeneous boundary conditions, we have
00 nJrX
v(x,O) = flex) = LBnsin(T) for 0:::: x:::: L
n=l
9.3 STEADy-STATE DIFFUSION OF HEAT IN PLATES 603
and
IX) mrx
v(x,M) =f2(X) = "Ansin(-)
~ L for 0<
- x <
- L.
n=1
From this equation, conditions (19)-(21) that characterize v are readily verified.
Construction of w(x,y). The function w(x,y) is characterized by the conditions
These conditions are identical with conditions (19)-(21) for v(x,y) if we exchange x and
y, Land M and (h ,h) with (gl, g2). Thus w(x,y) may be obtained from (25) by making
these exchanges. This gives
where
Problem 9.3.2. Rectangular Plates- Neumann and Mixed Boundary Conditions. These
problems were formulated and the uniqueness of their solutions was discussed in
Section 7.4. Here two examples are solved by the separation of variables method. More
general cases are discussed in the Exercises.
604 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
(O,n)
Ux = a
x
FIGURE 13. Square plate-Neumann
(0,0) Uy =0 (n,a) boundary condition
A Neumann Problem. Consider a square plate of side units, with three sides Jr
insulated (au/an = 0) and a prescribed heat flux (au/an = j) across the fourth. With an
obvious choice of coordinates, see Figure 13,
u(x,y) is characterized by the following boundary value problem.
(26) U(x,y) is defined for 0::::: x::::: Jr, 0::::: y ::::: Jr,
(27) Uxx + Uyy = 0 for 0::::: x ::::: Jr, 0 ::::: Y ::::: Jr,
where j(x) is a prescribed function. We shall solve this problem by separation of variables.
Product Solutions. The product solutions u(x,y) = X(x)Y(y) must satisfy conditions
(26)-(29). This gives the Sturm-Liouville problem
for Y(y). Solving (31) in the usual way gives the eigenpairs
An = n2 ,
Xn(x) = cos(nx), n = 0,1,2,3, ....
9.3 STEADy-STATE DIFFUSION OF HEAT IN PLATES 605
Yn(y) = cosh(ny).
+L
00
We know from Section 7.4 that the solution of the Neumann problem is unique only up
to an additive constant. Thus Co is arbitrary We shall use the final boundary condition,
equation (30), to determine the remaining constants Cn. The y-derivative of u is
L nC
00
L nC sinh(n:rr) cos(nx).
00
This is a Fourier cosine series on the interval 0 :::: x :::: :rr. The Fourier cosine series
coefficients off(x) are defined by (see Section 8.4)
(36) ao(/) = - 11
:rr
7C
f(~) d~,
21
0
7C
an(/) = - f(~) cos(n~) d~, n= 1,2,3, ....
:rr 0
In equation (35) there is no constant term. Thus a solvability condition for f(x) is that
ao(/) = 0, or
~ cosh(ny)
(37) u(x,y) = Co + ~ an(/) cos(nx) . ( ).
n=l n smh mr
It is easy to show that if j(x) satisfies the conditions of Churchills theorem (see Section
9.1, Corollary 2), so that
then the series (37) for u, and the derived series for u y , converge absolutely and uniformly
for all x and y in the square 0 :::: x :::: Jr, 0 :::: y :::: Jr. Under these conditions (37) defines
a function u(x,y) that satisfies conditions (26)-(30) and is therefore a solution of the
Neumann problem.
A Mixed Problem. As a simple example of a mixed problem, we shall consider the
function u(x,y) characterized by the follOwing conditions.
(38) U(x,y) is defined for 0 :::: x:::: Jr, 0 :::: y :::: Jr,
(39) Uxx + Uyy = 0 for 0:::: x :::: Jr, °: : y :::: Jr,
= L Cn sin(nx) cosh(ny)
00
(44) u(x,y)
n=l
and
=L
00
L nC sinh(nrr) sin(nx).
00
Comparing this with the Fourier sine series for j(x) gives the coefficients cn:
Thus the solution of the mixed problem (38)-(42) is defined by the series
~ . cosh(ny)
(47) u(x,y) = L.,;bn(j)sm(nx) . h( ).
n=l nsm nrr
and the series for u(x, y) and uyCx, y) converge uniformly and satisfy (38)-(42), as in the
preceding example.
x
u(x,O) = j(x) FIGURE 14. Dirichlet problem for a half-plane
608 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
and
and
y" -)..Y = 0 for 0::: y < 00,
(54)
Y(y) bounded for 0::: y < 00.
The now familiar arguments imply that the eigenpairs for (53) are
A=f.1,2, f.1,~O,
(55)
X(x) = A(f.1,) cos(f.1,x) + B(f.1,) sin (f.1,x) ,
while
(56)
(58) u(x,y) = 1 00
{A(f.1,) cOS(f.1,x) + B(f.1,) sin(f.1,x)} e- ILY df.1,.
This is the Fourier integral representation of j(x) and the Fourier integral theorem of
Section 8.9 gives the solution
This completes the solution whenever j(x) satisfies the conditions of the Fourier integral
theorem.
The solution (58), (60) requires three integrations to evaluate u(x,y). We shall show
that two of these can be carried out, leaving a single integration. The analysis parallels
that of Problem 9.1. 7 in Section 9.1. Substituting the identity
U(x,y) = ~ 1 (i:j(~)
00
cos p.,(x - ~)d~) e- IlY dp.,.
Changing the order of integration (justified when (52) holds) gives
(1
i:
(61) u(x,y) = ~ i:j(~) 00
cosp.,(x - ~)e-llYdp.,) ~
= K(x - ~,y)j(~) dt
where the kernel K is defined by
(62) K(x,y) = - 11
Tr 0
00
cos(p.,x)e- IlY d~.
This integral may be calculated by recalling the Laplace transform (see Chapter 4)
10
[00 e-st cos(cvt) dt = A
s + cv
(s > 0).
y foo j(~)
(64) u(x,y) =; -00 (x _ ~)2 + y2 dt y > O.
Note that this is valid only for y > O. For Y = 0 the formula yields the meaningless form
Ox 00.
610 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
The integral in (64) is called Poisson's integral. A second form of the solution that
is sometimes useful is obtained by substituting
(66) u(x,y) = -1
Jr
1 7C/2
-7C12
j(x + y tan t) dt.
where M is a constant. It follows from (66) that lu(x,y)1 ::::: M for all x and all y :::: O.
Thus Poisson's integral assigns to each bounded boundary function j(x) a solution u(x, y)
with the same bound. We shall not prove the uniqueness of the bounded solution here.
However, note that uniqueness of the solution is lost if we do not require that u(x,y) be
bounded. For in that case we can add to the solution (66) any multiple of y and obtain
another (unbounded) solution.
A Simple Example. Define j(x) by
where H is the Heaviside function. Then the integrand in (66) is 1 when -1 < x +
y tan t < 1 and zero otherwise. Thus
Problem 9.3.4. Half-Plane with Neumann Boundary Condition. This problem is ob-
tained from the preceding one by replacing the Dirichlet boundary condition u(x, 0) =
j(x) by the Neumann boundary condition uyCx,O) = j(x). In this case the separation of
variables method leads to divergent integrals. However, note that the derivative
-00
f(~)
(x _ ~)2 + y2 ~.
Integrating with respect to y gives, after an elementary integration,
(72) u(x,y) = ~
1 1 00
-00 log [(x -~) 2 + l] 112 f(~) ~ + K,
where K is a constant. It can be shown that (72) defines a solution of the Neumann
i:
problem if
Problem 9.3.5. Infinite Strip-Dirichlet Boundary Condition. This problem models the
case of a long narrow rectangular plate when the steady-state temperature u(x, y) is sought
at points (x,y) far from the two narrow sides. As an example, the strip -00 < x < 00,
o : : y :::: 1, will be discussed; see Figure 15.
We shall consider the case where u(x, 0) = 0 and u(x, 1) =f(x) for -00 < x < 00.
Thus u(x,y) is characterized by the conditions
u= jex) y=l
where j(x) is a prescribed function. To obtain uniqueness we shall add the conditions
(77) U, U x , uy are bounded for -00 < x < 00,0 :::s Y :::s 1,
(78) J~oo [[(x) I dx < 00.
and
y" -).,y =0 for O:::s y:::s 1,
(80)
yeO) = o.
The product solutions have the form
(81)
(82) u(x,y) = 1 o
00
{A(jJ.,)cosjJ.,x+B(jJ.,)sinjJ.,x}
sinh(jJ.,Y)
'h
sm jJ.,
djJ.,.
The boundary condition (76) requires that for -00 < x < 00,
u(x,1) = j(x) = 1 00
{A(jJ.,) cos jJ.,X + B(jJ.,) sin jJ.,x} djJ.,.
As before, this is the Fourier integral ofj(x), and we must have
(83)
Equations (82) and (83) define the solution whenever j(x) satisfies the conditions of the
Fourier integral theorem (see Section 8.9). The method of Problem 9.3.3 leads to an
integral formula
(84)
where
u(x,y) = i: K(x - ~,y)j(~) d~,
K(x,y) =-
1 foo cos(jJ.,x).
sinh (jJ.,y)
h djJ.,.
n -00 sm jJ.,
9.3 STEADy-STATE DIFFUSION OF HEAT IN PLATES 613
However, evaluation of this integral is beyond the scope of this text. Infinite strip problems
with other boundary conditions are left for the Exercises.
Problem 9.3.6. Semi-Infinite Strips with Dirichlet Boundary Condition. This problem
models the case of a long narrow rectangle when the temperature u(x,y) is sought near
one of the narrow sides of the rectangle. With suitable choice of the unit oflength we may
assume that the strip is defined by the inequalities 0 ~ x < 00, 0 ~ y ~ Jr as in Figure 16.
The boundary value problem suggested by the figure asks for a function u(x, y) with
the follOwing properties.
(91) 10 00
Ig(x) I dx < 00, 10 00
Ih(x)1 dx < 00.
U(X, n) = g(x)
(O,n)
U(O,y) = fey)
in analogy with the case of a finite rectangle (see Problem 9.3.1 above). We shall show that
v = u(f,o,O), w = uCO,g,O) and z = uCO,O,h) can all be constructed by separation of variables.
Construction of v = u(f,o,O). The function v is characterized by the conditions
Product Solutions. The product solutions of (93)-(95) and (97) are the multiples of
The coefficients are to be chosen so as to satisfy the final boundary condition (96). Thus
00
This is the Fourier sine series representation of I(y) and is given by the sine series
coefficients
is the desired solution if I(y) satisfies the conditions of Churchill's theorem (see Section
9.1).
Construction of w = u CO,g,O) • The defining conditions for ware
(102) Wxx + Wyy = 0 for 0:::: x < 00,0 :::: y :::: Jr,
(107) fo
oo Ig(x)ldx < 00.
This can be solved by separation of variables, following the plan of Problem 5 above.
However, the results for that problem can be used much more directly. To see this, let
go(x) be the odd extension of g(x) and let u(x,y) be the solution of the infinite strip
problem (see Problem 5 above) in the strip 0 ~ x< 00, 0 ~ y ~ n. The result is
(l08) u(x,y) = 1 00
o
{A(JL) cOSJLX +B(JL)sinJLx}
sinh (JLY)
. h
sm JLn
dJL,
where
However, cos(JL~) and sin(JL~) are even and odd, respectively, and hence A(JL) = 0 and
(1l0)
It follows that
(Ill) W(X,y) = 1
o
00 sinh(JLY)
B(JL) sin(JLx) . h
sm JLK
dJL,
with B(JL) defined by (110), is the desired solution. Properties (101), (102) and (04)-
(06) hold because (108), (l09) solve the infinite strip problem. The final condition,
equation (03), is obvious from equation (Ill).
Construction of z = u(O,O,h). The case parallels that of wand is left for the Exercises.
Problem 9.3.7. Circular Disk-Dirichlet Boundary Condition. This problem asks for
the steady-state temperature distribution u(x, y) that satisfies the Laplace equation in a
circular disk and has prescribed values on the circumference. Thus if the disk has radius
a then u(x,y) must satisfy
Use of Polar Coordinates. It is natural to introduce polar coordinates (r, e), defined by
(1l6)
Hence if we write
e.
where f(8) is a prescribed function of There are other, implicit, conditions that are
needed to calculate vCr, 8). First, by (117) ,f(8) must have period 27l', as must vCr, e):
y
___--,--~u(a, e) =fee)
1 1
R"8 + -R'8
r
+ -R8"
r2
= 0,
or
(125)
r 2R"(r) + rR'(r) - "AR(r) = 0 for 0::::: r < a,
R(r) finite for 0::::: r ::::: a.
For 8(0), equation (124) and the periodicity requirement (120) imply
(126)
8"(0) + A8(0) = 0 for 0::::: 0 ::::: 2ll",
8(2ll") = 8(0) and 8 ' (2ll") = 8'(0).
This is just the Sturm-Liouville problem of Section 8.5 with periodic boundary
conditions. The eigenpairs are
(127)
Ao = 0, 8 0 (0) = Ao,
An = n 2 , 8 nCO) = An cos(ne) + Bn sin(nO), n = 1,2,3, ....
Turning to the radial factor R(r) defined by (125) with "A = n2 , we have an Euler equation
(128)
or Ol = ±n. Thus
This formally satisfies all the conditions except the boundary condition (119). For (19)
we have
yea, f))
n=1
The last equation must be the Fourier series for f(f)), and hence the coefficients An, Bn
must be determined as follows.
The Poisson Integral for the Disk. It is of both theoretical and practical interest that
the series in (132) can be summed to provide a simple integral representation for vCr, e).
The calculation is as follows. First, replace an (j) and bn (j) in (132) by their integral
representations from (131) to get
(133) vCr,e) = -
1 121t jC¢)d¢
2n 0
00 Crlat (1t
+ I:
n=1
--
n
Jo jC¢) Ccos n¢ cos ne + sin n¢ sin ne) d¢
0
= -
1 121t jC¢)d¢+ I00 :Crlat
- 121t jC¢)cosnce-¢)d¢
2n 0 n=1 n 0
= hJo
1(1t j(¢) (00
1+ ~2(~)rn cosnce-¢) ) d¢.
The second step is to sum the series in parentheses. It is essentially a geometric series.
The calculation goes as follows Cwhere R{z} means the real part of z):
Thus if
r .,/,
Z = _el~
a
then
= Z+Z + z 3 + ... =
2 Z
-- (if ria < 1)
l-z
~ei'" (1 - ~e-i"')
11 - ~ei"'12
+ (D
2 2 .
(1 - ~ cos 0/) sin20/
Hence
00 r)n r
- cos ,/.
'f' -
(r)2
-
( _ cos no/ =
~
a a
a 1- 2 (D cos 0/ + (D 2
620 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
and
(134)
(l35) v(r, e) = -1
2n
1
0
2JT
a2 + r2
a2 - r2
- 2ar cos(B - ¢)
f(¢) d¢,
°
valid for :s r < a. This result is called Poisson's integral for the disk.
Caution. Poisson's integral does not make sense when r = a!
(l36) vrr + lV
r r
+ l2r Vee = ° for 0 < r -< a, -00 < B< 00,
Product Solutions and Superposition. These are the same as for the preceding problem
and again lead to solutions of the form
(141) vCr, B) = Ao +
n=l
with arbitrary coefficients An and Bn. To calculate them we determine the radial derivative
Vr by formal differentiation of (141) to get
L nrn-
00
Use of the Neumann Boundary Condition. The boundary condition (137) and
equation (142) give
= fce) = L
00
This gives the Fourier series for fce). In particular, there is no constant term, and hence
a solution exists if and only if fce) satisfies
(144) ao(j) = -
1 12n fee) de = O.
2n 0
Since fce) = avCa, e)!an, we see that (144) means that there is no net flux of heat into or
out of the disk. This condition clearly is necessary for thermal equilibrium. Also, (143)
places no restrictions on Ao in (141). This agrees with the fact, proved in Chapter 7, that
the solution of the Neumann problem is unique only up to an additive constant.
The coefficients in (143) must coincide with the Fourier coefficients off(8). This
uniquely determines the constants An, Bn Cn 2: 1) by
(146) vCr, e) = Ao + a L --
00Crla)n
n=1 n
{an(j) cos ne + bn(j) sin ne},
where Ao is an arbitrary constant. The function vCr, e) defined by (146) satisfies each of
conditions (136)-(140). Condition (137) is verified by differentiating (146) with respect
to r and setting r = a to get (recall that ao(j) = 0)
vrCa,e) =
n=1
An Integral Representation for the Neumann Problem. The series solution (146) can
be summed by an indirect method to get an analogue of Poisson's integral for the disk.
To do this, note that from (146) one has
(147) L -
vr(r,e) = -a 00 (r)n {an(j) cosne + bn(j) sinne}.
r n=1 a
622 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Now, the series in (147) is exactly the series solution of the Dirichlet problem with
boundary values 1(0). Thus it coincides with Poisson's integral, and we can write
(148) vr(r,B)
a
= --
r 2n
1 1 0
2n
a2 + r2 -
a2 - r2
2arcos(B - ¢)
I(¢)d¢.
Integrating with respect to r gives the solution of the Neumann problem as the integral
This formula is more easily derived by complex variable methods - see page 318 in
[Ch-BJ for a proof.
Problem 9.3.9. Circular Sector-Dirichlet Boundary Condition. This problem asks for
the steady-state temperature distribution vCr, B) in a circular sector of radius 1 and angular
opening a (0 < a < 2n); see Figure 18.
The function vCr, B) is characterized by the follOwing conditions.
Here I, g and h are prescribed functions that specify the boundary values of v on the
complete boundary of the sector.
v = h(r)
v = j(8)
x
FIGURE 18. Dirichlet's problem for a
o v = g(r) circular sector
9.3 STEADy-STATE DIFFUSION OF HEAT IN PLATES 623
We shall show that each of the last three functions can be constructed by the separation
of variables method. When this has been done, the solution of the general problem will
be given by (156).
Construction of w = v(j,o,O). This function is characterized by the boundary value
problem
and
(164)
8" + A8 = °
for 0::::: e : : : a,
8(0) = 0, 8(a) = 0,
where A is a separation constant. The Sturm-Liouville problem (164) gives the eigenpairs
An = C:f
8 n (e) = sin(mrela), n = 1,2,3, ....
The value Rn (0) is finite only if h2 = O. Thus the product solutions are scalar multiples of
wCr,e) = Lbn(j)rnsinCne),
n=1
and
00
(173)
r2R" + rR' + AR = 0 for 0:::: r :::: 1,
R(l) = 0, R(r) bounded for 0:::: r:::: 1,
and
(174)
8" - A8 = 0 for 0:::: e : : a,
8(a) = 0,
The corresponding function 8 is the solution of (174) with A = fL2 A convenient choice
is
8(e) = sinh fLea - e)
sinh(fL e)
with 8(0) = 1. Thus the product solutions are the multiples of
sin(fL log r) sinh fL(a - e)
(176) z" (r, e) =
sinh(fLa)
, fL 2: 0,
o
CefL) sin(fL log r)
sinh fLea - e)
. h(
sm fLa
) dfL·
626 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
We wish to choose C(jL) so as to satisfy the final boundary condition (171). By (177)
and (171) we must have
(179) () 211 ()
C jL = - gr
nor
sin(jL log r) d
r.
Thus the solution is given by (177) with C(jL) given by (179). Sufficient conditions for
the validity of (178), (179) were given at the end of Section 8.14.
Construction of v(O,O,h). This may be obtained from (177) by the substitutions () ~ ex-()
andg ~ h.
Problem 9.3.10. Other Plates Describable by Polar Coordinates. The preceding ex-
amples do not exhaust the regions for which the Dirichlet problem can be solved by
separation of variables in polar coordinates. Further examples are rings defined by 0 <
a :::: r :::: b, 0 :::: () :::: 2n, the region exterior to a disk, defined by 0 < a :::: r < 00,
o :::: () :::: 2n, and truncated sectors, defined by 0 < a:::: r :::: b, 0 :::: () :::: ex. For further
discussions see the Exercises.
IExerc!!!~!.1
Square Plate. Compute the solution u(x,y) in series form.
1. Problem 9.3.1 with L = M = rr,fl (x) = O,h(x) = 1, g1 Cy) = g2(y) = o.
2. Problem 9.3.1 withL = M = rr,!1(x) =h(x) = 0,g1Cy) =g2(y) =y/rr.
Numerical Methods for the Square Plate. Compute the solution u(x,y) at x = y = rr/2 correct to 3
decimal places using Leibnitz's theorem for alternating series. The exact answer in each case is 1/4.
8. Problem 9.3.1 with L = M= n,fl(x) = hex) = xln, glCy) = 0, g2(y) = 1. Unique solution
u(x,y) =xln.
Square Plate with Dirichlet Boundary Condition. Find the Fourier sine series solution u(x,y).
9. Problem 9.3.1 with L = M = n,fj (x) = 0,12 (x) = f(x), gl (y) = g2 (y) = 0, where
10. Problem 9.3.1 withL = M = n,fl(x) = 0,f2(X) =f(x), gl(y) = g2Cy) = 0, wheref(x) = x(n -x)
for 0 :s: x :s: n.
Square Plate with Neumann Boundary Condition. Find the Fourier cosine series solution u(x,y) for
the given Neumann boundary condition.
11. Problem 9.3.2 with f(x) = x - nl2 for 0 :s: x :s: n.
16. Problem 9.3.3 withf(x) = H(x - a) - H(x - b) for -00 < x < oo,f(a) = feb) = 1/2, where a and b
are constants such that -00 < a < b < 00. The solution is
18. Assume that a constant M exists such that If(x) I :s: M for all -00 < x < 00. Verify that the Poisson
integral solution of Problem 9.3.3,
u (x, y) = -
y
n
1 00
-00
f(~) d~
(~)2
x - 5 +y
2'
L:
20. The solutions u(j,O) and u(O,g) are given by the formulas
= ~,y)g(~)~,
L:
u(O,g) (x,y) K) (x -
rrK) (x,y) = 1 00
cOS(JLX).
sinh(JL(l -
smhJL
y» dJL,
=1
-00
00 sinh(/lY)
rrK2(X,y) coS(/lX)-.- - d/l.
-00 smh/l
Semi-Infinite Strip with Dirichlet Boundary Condition. Let u(j,g,h)(x,y) denote the solution of the
semi-infinite strip problem 9.3.6. Verify the results below.
21. u(j,g,h) = u(j,o,O) + u(O,g,O) + u(O,O,h).
22. The solution u (O,O,h) is given by the formula
(0
u "
°h)
(x,y) = 1 00
-00
A(/l) cosC/lX).
sinh(/l(l -
smh/l
y» d/l,
where
A(/l) = 3:.
rr
roo h(~) cOS(JL~) d~.
10
Half-Plane Numerical Results. . Evaluate u(O, 1/2) by numerical integration, accurate to five digits.
2
23. Problem 9.3.3 withfCx) = e- x ,using the Poisson integral formula
( )- 1
u x,y - rr
~
-00
00
j(~) ~
(x _ ~)2 + y2 .
2
24. Problem 9.3.4 withf(x) = e- x ,using the integral formula
uCx,y) = - 11
rr -00
00
log[(x - ~)2 + ilf(~) d~.
Circular Disk. Determine explicitly the series solution of the given problem. The first two have Dirichlet
boundary conditions and the last two have Neumann boundary conditions.
25. Problem 9.3.7withfCO) = 1 + 5cosO - 3cos30 and a = 1.
26. Problem 9.3.7 withf(O) = cosO - 5 sin 30 and a = 2.
27. Problem 9.3.8 withfCO) = 4cosO - cos20 + cos 30 and a = 1, Ao = o.
28. Problem 9.3.8 withfCO) = cos20 and a = 2, Ao = o.
Circular Disk Numerical Methods. Determine numerically to five digits the solution u(r, 0) at r = ro,
o = 00, using the stated approximation method, given
fee) = {Io 0<< 00 << rr,
rr 2rr.
9.3 STEADy-STATE DIFFUSION OF HEAT IN PLATES 629
29. Problem 9.3.7 with a = 1. Use a series representation for u and solve for ro = 0.1, eo = Jr/2.
30. Problem 9.3.7 with a = 1. Use Poisson's integral representation for u and solve for ro = 0.1, eo = Jr/2.
31. Problem 9.3.7 with a = 1. Use the exact solution representation below and solve for ro = 0.1, eo = Jr/2.
u(r, e) = -
1 + -1 tan -I (2rSine)
--.
2 Jr 1 - r2
This formula should not be verified (it's difficult!). The arctangent function above is defined by -Jr/2 <
tan- I x < Jr/2.
32. Problem 9.3.8 with a = 1, AO = O. Use a series representation for u and solve for ro = 0.1, eo = Jr/2.
Change Ion Jr < e < 2Jr from its defined value of 0 to the new value -1 (needed for ao(f) = 0).
Circular Sector. Derive the results below for the quarter disk.
33. The solution w = uCf,O,O) of Problem 9.3.9 for the quarter disk a = Jr/2 is given by
L bn(f)r2n sin(2ne),
00
w(r, e) =
n=1
34. The solution w = uCf,O,O) of Problem 9.3.9 for the quarter disk a = Jr/2 andiCe) = 2 sin 6i1 is given by
w(r, e) = 2r 6 sin(6i1).
Circular Hole in an Infinite Plate, Consider a large plate with a circular hole that is far from the plate
edges. We shall model this as the idealized problem of finding the steady-state temperature near a circular
hole of radius a in an infinite plate.
If rand e are polar coordinates with origin at the center of the hole then the steady-state temperature
vCr, e) satisfies
1 1
(180) Vrr + -;:vr + ;:2vee = 0 for a ::'0 r < 00 and all e,
(181) v(a, e) = I(e) for all e,
(182) vCr, e + 2Jr) = vCr, e) for all a ::'0 r < 00 and all e,
(183) vCr, i1) is bounded for a ::'0 r < 00 and all e,
+L
00
Problem 9.4.1. Rectangular Plate-Zero Boundary Condition. For simplicity, the case
of a square plate of side Jr will be treated. The case of a rectangle of arbitrary size can be
dealt with by the same method. We seek a function u(x,y, t) that satisfies
Product Solutions. First we shall separate the variable t from x and y by constructing
product solutions of (1)-(4) of the form
(8)
Eigenvalue Problem for <I>(x,y). Equations (1)-(4) and (7) imply that <I> must satisfy
This problem is our first example of an eigenvalue problem for a partial differential
operator. Each nontrivial solution of (9) defines an eigenpair for the linear operator
L<t> = -<t>xx - <t>yy.
Program. Our plan for solving the heat diffusion problem (1)-(5) is to carry out the
following steps. First, find the eigenpairs O"n, <t>n (x, y)) of problem (9). Second, use
superposition to build a solution of (1)-(5) of the form
00
(13) j(x,y) = .2::>n<t>n(X,y) for 0::: x::: Jr, 0::: y ::: Jr.
n=]
<t>(x,y) = X(x)Y(y).
This second application of separation of variables gives
X" (x) Y"(y)
(14) X(x) = - Y(y) - A = -/L,
where /L is a separation constant. Equation (14) and the boundary conditions of (9) lead
to the following familiar Sturm-Liouville problems for X and Y:
and
y" + vY = 0 for 0::: y ::: Jr,
yeo) = Y(Jr) = 0,
632 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
where
]) = A - fl.
and
(15)
LL
00 00
where the Cmn are constants. We wish to choose them in such a way that the initial
condition (5) is satisfied.
Initial Condition. Equation (15) gives, for 0 S x S Jr, 0 S Y S Jr,
= j(x,y) = L L
00 00
Orthogonality. The orthogonality of the eigenfunctions (15) will be verified and used
to calculate the coefficients Cmn . The following calculation is elementary.
(21) u(x,y, t) =
m=ln=l
Convergence. The proposed solution (21) is only formal, because we have not shown
that the series converges. Equation (20) implies that
411'(11'( If(x,y)ldxdy
lamn (j)I.:::2' forall m,n=I,2,3, ....
Jr 0 0
It follows that if t > 0 then the series in (21) converges absolutely and uniformly in x
andy.
Double Fourier Series. For t = 0, (21) gives the formal expansion
00 00
where amn(j) is given by (20). This series is called the double Fourier sine series for
f(x,y). The convergence theory for such double series is an advanced topic that cannot
be developed here. For a careful treatment see [B-McGl.
Boundary Value Problem. Find the function u(r, e, t) that satisfies the conditions
that satisfy (23)-(25), (27) and (28). The usual separation argument gives
(30)
where A is a separation constant, while (A, <P(r, e)) is a solution of the following
eigenvalue problem.
Eigenvalue Problem. Find all eigenpairs (A, <p(r, e)) such that <P =f. 0 and
where JJ.. is a separation constant. For the 8-factor, (31) and (33) imply that we have the
periodic Sturm-Liouville problem
JJ..=JJ..m=m2,
(39)
8(e) = 8 m W) = Am cos(me) + Bm sin(me), m = 1,2,3, ... ,
9.4 TRANSIENT DIFFUSION OF HEAT IN PLATES 635
where Am and Bm are arbitrary constants. Next, the R-factor is a solution of the singular
Sturm-Liouville problem
(41)
A = Amn = s~n'
R(r) = Rmn(r) = ]m(smnr),
where Smn is the nth positive zero of the Bessel function]m(x), numbered in increasing
order: 0 < Sml < Sm2 < .. '.
Combining (39) and (41) gives the eigenpairs of the <I>-problem:
A = Amn = s~n'
(42) <I>(r, e) = ]m(smnr) cos(me), m = 0,1,2, ... ,
<I>(r, e) = ]m(smnr) sin(me) , m = 1,2,3, ... .
Note that for m = 0 the eigenspace has dimension 1, while for integers m ~ 1 it has
dimension 2. The eigenfunctions listed in (42) are linearly independent. This is most
easily proved by showing that they are orthogonal when integrated over the disk. For
example, we have
= (1 1
1m (Smnr)lp(spqr)r dr) (1" cos(me) cos(pe) de)
=0
if (m, n) i=- (p, q). Indeed, if m i=- p, then the cosine integral is zero, while if m = p but
n i=- q, then the Bessel function integral vanishes by the orthogonality theorem for the
singular Sturm-Liouville problem (40). A similar calculation holds for the product of
any two of the eigenfunctions (42).
Normalizing Constants. The eigenfunctions of problem (40), given by (41), are not
normalized. Instead, it is known that
(44) 1o
1 2 1 2
lm(smnr)rdr = - lm+l(smn).
2
636 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
where the coefficients Amn and Bmn are arbitrary constants, to be chosen such that the
initial condition (26) holds. This is true if
00
The orthogonality and normalizing constants give for n :::: 1 and m :::: 1,
Convergence. The convergence theory of the Fourier-Bessel series (46), (47) requires
a deeper study of the properties of Bessel functions. It will not be presented here.
The Symmetric Case. If u(r, (J, 0) = fer) is independent of (J then Amn = Bmn = 0 for
all m, n :::: 1 by (47). It follows from (46) that u = u(r, t) is independent of (J and
00
where
Apart from notation this result coincides with the solution of Problem 9.1.6, Section 9.1.
9.4 TRANSIENT DIFFUSION OF HEAT IN PLATES 637
Other Boundary Conditions. With minor changes the results of this section apply to
plates that satisfy the Neumann and Robin boundary conditions. See the Exercises.
Exercises 9.4
Square Plate. Solve the transient heat flow problem 9.4.1 for the prescribed functionJ and obtain the
formula for uCx,y, t) without the use of integration tables.
1. f(x,y) = sin 3x sin Sy - sin 4x sin 6y,
u = sin 3xsin Sye- 34t - sin 4xsin 6y e- 52t .
2. f(x,y) = lI'sin3xsiny + sinxsin2y,
u = lI'sin3xsinye- lOt + sin x sin 2ye- 5t .
Square Plate. Solve the transient heat flow problem 9.4.1 for the following special choices of j(x, y).
Express the answer as an infinite series with all coefficients evaluated through integration tables.
Rectangular Plate. Complete the details for the following generalization of Problem 9.4.1 to an arbitrary
rectangle of sides 0 ::: x ::: L, 0 ::: y ::: M.
9. Formulate the transient heat flow Problem 9.4.1 for the rectangle case.
10. Show that separation of variables leads to product solutions X(x)YCy) in which the Sturm-Uouville
problems have solutions Xm(x) = sin(mll'xlL) and Yn(y) = sin(nll'yIM).
11. Show that
m1l')2
Amn= ( L + (nll')2
M '
12. In the series solution u = L~=l L~l cmnXmYnTmn, verify the formula
Cmn = ~ {M (L f(x,y)Xm(X)Yn(t)dxdy.
LM 10 10
Neumann Conditions for the Square Plate. Complete the steps below to solve Problem 9.4.1 with the
Dirichlet conditions replaced by Neumann boundary conditions Ux = 0 on the vertical boundary and Uy = 0
on the horizontal boundary.
13. Apply separation of variables to obtain Sturm-Liouville problems with solutions Xm (x) = cosCmx) and
YnCy) = cos(ny).
14. Show that the time factors Tmn(t) and eigenvalues Amn use the same formulas as Problem 9.4.1.
17. Apply separation of variables to obtain Sturm-Liouville problems with solutions Xm (x) = cos(mx) and
Yn(y) = sin(ny).
18. Show that the time factors TmnW and eigenvalues Amn use the same formulas as Problem 9.4.1.
x
FIGURE 19. Drumhead over n
9.5 VIBRATIONS OF DRUMS 639
(5)
To verify (5), we note that the two sides of equation (5) satisfy conditions (1)-(4) (check
this!). Hence they are equal, by the uniqueness theorem. Equation (5) is Rayleigh's
principle. It allows us to concentrate on the special case of (1)-(4) for which f = O.
The remainder of this section treats the vibrations of a circular drumhead of unit
radius. It is natural to use polar coordinates with origin at the center of the drumhead,
so that the displacement is a function u(r, e, t). We shall consider first the symmetric
case where g = g(r) and u = u(r, t) are independent of e. Then the general case where
g = g(r, e) will be analyzed.
Of course, we expect this condition to hold for the physical problem of the drum.
However, we shall see that it must be used expliCitly to obtain a unique solution.
I Lord Rayleigh (john William Strutt), 1842-1919, English physicist and mathematician.
640 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Product Solutions. The product solutions of (6)-(9) and (11) are the scalar multiples of
u(O,g)(r, t) = f
n=1
an (g) JO(snr) sin(snt).
Sn
This result and the Rayleigh principle give
00
The function uCf,g) defined by (16) is easily seen to satisfy (6)-(11), provided that the
series involved are convergent. The convergence question involves a deeper study of
Fourier-Bessel series. It will not be pursued here.
9.5 VIBRATIONS OF DRUMS 641
Problem 9.5.2. Circular Drumhead-General Case. In the general case, the drum
displacement u(r, e, t) = u(o,~) (r, e, t) will be determined by the following conditions.
where g(r, e) is a prescribed function. In addition, the physical problem requires the
following.
Implicit Conditions.
(24)
L Aon]o(son r) sin(sont)
00
u(r, e, t) =
n=1
m=1 n=1
and hence
L Aonson]o(sonr) cos(SOnt)
00
ut(r, e, t) =
m=1 n=1
642 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
On setting t = 0 and using the orthogonality of the eigenfunctions Csee Section 9.4
above), we get
Amnsmn = amnCg),
Bmnsmn = bmnCg), m = 0,1,2, ... , n = 1,2,3, ... ,
where amn Cg), bmn Cg) are as in Section 9.4. Combining these results gives the solution
Similarly,
u(f,O) (r, e, t) =
n=1
m=1 n=1
The symmetric case and the general case produce Significantly different 3-D plots,
as shown in Figures 20 and 21.
Rayleigh's Principle. Let v = u(OJ) and w = u(O,g). Assume additionally that the partials Vttt. Vtxx and
Vtyyare continuous. The follOwing steps verify Rayleigh's principle.
1. Verify that u = Vt + w is a solution of the wave equation Utt = Uxx + Uyy .
2. Verify that U = Vt + w satisfies u(x,y, t) = 0 for t "= 0, (x,y) in aQ, and u(x,y, 0) = j(x,y) , Ut(x,y) =
g(x,y) for (x,y) in Q and t "= O.
Circular Drum. Verify the following results for the symmetric case in Problem 9.5.1.
3. The circular drumhead solution for the initial conditions fCr) = 0 and g(r) = JO(S2 r) is solved without
integration of Bessel functions
4. The circular drumhead solution for the initial conditions j(r) = 0 and g(r) = 1 is given by
u(r, t) = L
DO 2
-2--JO(snr) sin(snt).
n=1 s~h (sn)
The integrations use a change of variables x = Snr, the identity xJO(x) = (xl! (x)), and the identity from
Section 9.4, J~ J6(Snr)r dr = ±H(Sn).
Rectangular Drum. A membrane stretched over a rectangular frame is called a rectangular drumhead.
Establish the following extensions of the circular drumhead theory by using the rectangular methods of
Problem 9.4.1.
5. Formulate and solve the rectangular drumhead problem for shape 0:5 x :5 L, 0 :5 Y :5 M.
6. Formulate and solve the square drumhead problem for shape 0:5 x :5 IT, 0 :5 Y :5 IT.
Circular Drum. Formulate and solve the follOwing extensions for the circular drumhead of radius a > 0
and propagation speed c = y'Tlp, using the method of separation of variables.
un(r,t) =JO(snr)sin(snt)·
The nodal lines of mode Un are the roots of lo(snr) = 0 in the domain 0 :5 r < 1. The nodal lines are
stationary relative to the time oscillations of Un.
9. Compute and graph in polar coordinates the nodal lines of mode U3.
10. Show that the nodal lines of Un are exactly the n - 1 concentric circles r = sm/sn, m = 1,2, ... ,n - 1.
choices of shapes and boundary conditions are only illustrative. Many similar problems
with rectangular or spherical shapes can be solved by the same method.
° °
for :s x :s L, :s y :s M, °:s z :s N. It follows that each of the three quotients in (8)
is a constant, say
X" (x) yll(y) 2"(z)
(9) X(x) = -A, y(y) = -p., --=-v
2(z) ,
and
(10) A + p. + v = 0.
Use of the boundary conditions (3)-(5) gives
(ll)
X" + AX = ° for O:s x :s L,
X(O) = 0, X(L) = 0,
9.6 STEADy-STATE DIFFUSION OF HEAT IN SOLIDS 645
and
fhr 2
)... =)..., = - -
(13) < L2'
X(x) = Xe(x) = sin
(cnx)
L ' C = 1,2,3, ... ,
and
m2 n 2
jJ.,=jJ.,m=M2'
(14)
YCy) = YmCy) = sin (m~y) , m=1,2,3, ....
(15)
sinh (
Zem(Z) =
sinh
(
(2
iJ
m'
+ M2 nN )'
so that Zem (N) = 1. Then the product solutions (7) are the nonzero multiples of
c ) sinh ( G+ :~ nz)
(16) uemCx,y, z) =
(
sin ~x sin (":Y) -.--:(~=e2===m=2::-\) ,
smh iJ + M2 nN
whereC,m= 1,2,3, ....
646 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Superposition. Linear combinations of the product solutions (16) all satisfy (1)-(5).
To satisfy the final boundary condition (6), we try
L L ClmUem(X,y, z)
00 00
U(x,y, Z)=
e=1 m=l
&+ ~~ nz)
C:
(17)
8 ~ cem
00 00 (£) sinh (
= sin ~X sin
y
) -sl'-n-hl~e=2===m2=-""':}
L2 + M2 nN
j(x,y) =
(18) e=1 m=1
=
~~
f:r. ~ Cl m sin (£nx)
L sin (mny)
M
for 0 ::: x ::: L, 0 ::: y ::: M. This can be done by using the orthogonality of the
eigenfunctions sin(£nxlL) and sin(mnyIM) on 0 ::: x ::: Land 0 ::: y ::: M, respectively
The latter can be written
1o
M
j(x,y) sin (qny)
- dy = -M ~
M
~ Ceq sin (£nx)
2
-
l=1
,
L
and
(20)
These calculations are, of course, only formal. Questions of convergence of the series
and interchange of summation and integration are left for more advanced courses. For
details see [B-McGl.
An Example. Consider a cube of side n so L = M = N = n and take j(x, y) = xy. Then
Now,
p
so
4 (_l)p+1 (-l)q+l 4( -l)p+q
Cpq = 11"2 P q pq
Hence the formal solution is
This defines a mixed problem with the Neumann condition (6') on one face and Dirichlet
boundary conditions u = 0 on the remaining five faces.
Product Solutions. As before, the product solutions of (1)-(5) are the nonzero scalar
multiples of
u(x,y, z) = ~~
L.., L.., Cem sin (€rrx)
- sin (mrry)
- - sinh 2 + 2m rrz ) .
(g;g2 2
e=1 m=1 L M L M
Then formally,
~ L..,
uz(x,y, z) = L.." ~ cern
, sin (€rrx)
- sin (mrry)
- - cosh (g;g2 2
2 + m rrz ) , -2
f=1 m=1 L M L M
where c~m = Cfmrr &+ ~~ , whence boundary condition uz(x,y, N) = j(x,y) gives
(€rrx) (mrry)
j(x,y) = 8~Cfmsin
00 00"
L sin M '
648 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
where c~m = CCmn &+ ~~ cosh ( &+ ~~ nN). The coefficients are now calculated
by means of the orthogonality (9), as before. The result is
x = rsin4>cos8,
(22) y = r sin 4> sin 8,
Z = rcos4>,
Substituting (23) into the Laplace equation, we find after a lengthy but straightforward
calculation, based on (22)-(23) and the chain rule, that
(24)
With these assumptions, the boundary value problem asks for a function vCr, 4» that
satisfies
where 1(4)) is a prescribed function, defined for 0 S 4> S n. We note that the partial
differential equation (26) is Singular at r = 0 and at 4> = 0 and 4> = n. The physical
interpretation of the model requires that vCr, 4» be finite and single-valued at these points.
9.6 STEADy-STATE DIFFUSION OF HEAT IN SOLIDS 649
The boundary value problem (25)-(29) is more difficult than those considered
above because the partial differential equation (26) has variable coefficients that have
singularities on the z-axis. Nevertheless, we shall find that it can be solved by Fourier's
separation of variables method.
Product Solutions. We search for product solutions
(33)
_1_~
sin¢d¢
(sinA.~)
'f'd¢
+ A<P = 0 for 0 <
-
A. < n
'f' - ,
X(x) = <P(¢).
650 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
<p'=- sin¢X',
dx
<P"=(-sin¢)X" - - (cos¢)X'
d¢
=(-sin¢)2X" - (cos¢)X'
=(1 - x 2 )X" - xX'.
(1 - x 2 )X" - 2xX' + AX = 0,
d ( (1 - x 2 ) dx
dx dX) + AX = O.
Since x = cos ¢, the boundary conditions of (33) are transformed into
Xc -1) and X(l) finite.
Thus the transformed problem (33) is the well-known Singular Sturm-Liouville problem
A = An = n(n + 1),
(34)
X(x) = Xn(x) = PnCx), n = 0, 1,2,3, . .. ,
where Pn (x) is the nth Legendre polynomial. In particular, since x = cos ¢, the eigenpairs
of (33) are
where hI and h2 are arbitrary constants. For Rn(O) to be finite, we clearly must have
h2 = 0. This gives the following product solutions.
Product Solutions. The product solutions of (25), (26), (28) and (29) are the nonzero
multiples of
The eigenfunctions <'Pn(¢) = Pn(cos¢) are orthogonal on °<s ¢ <s 7i with weight
w(¢) = sin¢:
Jo
r Pm(cos¢)Pn(cos¢) sin¢d¢ = /1
-1
Pm (x)P n (x) dx = ° for m:j:. n.
Moreover,
Jo
r P~(COS¢) sin¢d¢ = 11 P~(x) dx = _2_,
-1 2m + 1
m = 0, 1,2, ....
(37) 2n -
Cn = - + 110" j(¢)Pn(cos¢)sin¢d¢, n = 0,1,2, ....
2 0
Of course, we have only shown that (36), (37) define a formal solution, since we have not
discussed the convergence of the series and their derivatives. For rigorous results see Ual.
2 2 2
j(¢) = 1 - x = -Po(x) - -P 2 (x).
3 3
652 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Problem 9.6.4. Sphere with Neumann Boundary Conditions, Axisymmetric Case. This
problem differs from the preceding one only in replacing the Dirichlet boundary
condition v( 1, ¢) = j(¢) by the Neumann boundary condition
The product solutions remain the same and hence vCr, ¢) again has the form
00
(41) l rrj(¢)sin¢d¢ = O.
Moreover, if (41) holds then Co may be any constant. This is consistent with the
nonuniqueness of the solutions of the Neumann problem, as stated in the Exercises of
Section 7.7.
Parallelepiped Problems. Solve the steady-state diffusion problem in the unit cube 0 =s x =s 1,0 =s Y =s 1,
o =s Z =s1 under the given boundary condition with zero Dirichlet conditions on the remaining five faces.
1. u(x,Y, 1) = sin(rrx)sin(2rry).
Sphere Problems. Solve the steady-state diffusion Problem 9.6.3 in the unit sphere under the given
=
boundary condition u( 1, ¢) j(¢) without use of integration tables.
5. j(¢) = sin2 ¢.
6. j(¢) = cos2¢.
Sphere Problem Formulas. Verify the given solution to the steady-state diffusion Problem 9.6.3 in the
unit sphere under the supplied boundary condition u(l, ¢) = j(¢). Assume the identity
r 1 (_l)k(2k)1
Jo P2k+l (x)dx = 22k(kl)2(2k + 2)'
7. j(¢) = 1 for 0:::: ¢ :::: lf/2,j(¢) = -1 for lfl2 < ¢ :::: If, and for x = cos¢,
3 73 115
u(r, ¢) = '2rP1 (x) - Sr P3(X) + 16 r P5(X) - ....
8. j(¢) = 1 for 0:::: ¢ :::: lf/2,f(¢) = 0 for lf/2 < ¢ :::: Jr, and
~ + 3) (2n)1
n(4n2n+2 2n+l
u(r,¢) = f;;o(-l) 22n+l(n!)2 r P2n+l(COS¢).
then U will satisfy an ordinary differential equation with respect to x. The Laplace
transform method is to solve this equation, by the methods of Chapter 2 and Chapter 3,
and then find the Laplace inverse u(x, t). Here we shall present three examples of the
method. The first example, heat diffusion in an infinite rod, was solved by Fourier's
method in Problem 9.1. 7 of Section 9.1. Here the results of Problem 9.1.7 will be obtained
by the Laplace transform method. The second and third examples treat boundary value
654 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
problems for the heat and wave equations with nonhomogeneous boundary conditions-
problems not hitherto studied in this text. Many additional examples can be found in
classical texts; see, for example, [Churl.
Problem 9.7.1. Heat Diffusion in an Infinite Rod via Laplace Transform. In Section 9.1
the problem was formulated as the search for a function u(x, t) that satisfies the following
four conditions.
implies that the Laplace transform U(x, s) exists for all s > a and
(7) IU(x, s)1 ~ 1 o
00
e-stlu(x, 01 dt ~ -.
M
s
The derivative rule for the Laplace transform implies that
(9) ..c (Uxx(x, t)) = roo e-stuxx(x, 0 dt = a22 roo e-stu(x, t) dt.
10 ax 10
Hence the transform of the heat equation (3) gives
We shall show that (10) and the boundedness condition determine U(x, s) and u(x, t)
completely
Solution of the Transformed Problem. The general solution of the ordinary differential
equation (10) is
(ll)
9.7 THE LAPLACE TRANSFORM METHOD 655
(12)
For Vp we shall use the method of variation of parameters and the basis VI and V 2 (see
Section 3.2). The Wronskian of VI and V 2 is
(13)
r eyS~f(~) d~)
(15)
+ (C2 + _1_ e-ySx.
2010
For vex, s) to be bounded when x ~ ±oo, we must have
CI = 1Ie
2-ys
1 00
e-yS~f(~) d~,
1
0
0
C2 = 1Ie eyS~f(~) d~.
2-y s -00
(16)
656 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
e- alt e- 2"f<iS
get) = 1-;
y rrt
G(s) = £(g) = ..;s .
Taking 2Ja = Ix - ~I, or a = (x - ~)2/4, gives
_(X_$)2
-1 e 4<
k(t) == £ (K(s)) = 1-; ,
2yrrt
which is the heat kernel of Section 9.1 (see Problem 9.1.7). Thus
(18) u(x, t) =
/00J(;)e
1 _(x_,)2
~,
2yJrt -00
1-; 4<
which is the solution formula of Section 9.1, Problem 9.1.7. Of course, once equation
(18) is obtained, it may be verified by direct calculation.
Problem 9.7.2. The Semi-Infinite Rod with End Temperature Given. Consider a laterally
insulated semi-infinite rod that is at constant temperature zero at time t = 0, as in Figure
22.
If the end of the rod is controlled to have temperature J(t) at each time t ~ 0, then the
internal temperature u(x, t) will be a solution of the following boundary value problem.
U -_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ ~
where f(t) is a prescribed function. For simplicity, we shall add the following
supplementary conditions.
where C1 and C2 are arbitrary constants. For V to be bounded when x ---+ 00, we must
take (1 = O. Then (28) implies that (2 = F(s), so
where
so that
x2
r=t--.
4z 2
Moreover, r -+ t as Z -+ +00, and r -+ 0 as Z -+ 2~' Thus the substitution gives
(33) u(x, t) =
2
,.fii /00
2-J (t - X2) e
4z 2
-Z2
dz.
20
The boundary conditions can be verified directly from (33). In particular, U(x,O) = 0
and u(O, t) = J(t).
EXAMPLE 1. The special case that J(t) = Ua (a constant) gives the solution
Problem 9.7.3. The Semi-infinite String with End Displacement Given. As a final
example of the Laplace transform method we shall calculate the motion of a semi-infinite
string, initially at rest, when the end of the string is moved in a prescribed way The
corresponding boundary value problem asks for a string displacement function u(x, t)
that satisfies
where J(t) is the prescribed displacement. Of course, we must assume that u and Ux
remain small enough for the linear wave equation (35) to hold. In particular, we require
the following supplementary conditions.
by the derivative rule of Laplace transform theory and (36). Also, we assume as before that
(43)
where Cl and C2 are arbitrary constants. Moreover, (41), (42) and (43) imply that Cl = °
and C2 = F(s), so
Finally, (44) is easily inverted by using the second shifting theorem of Laplace transform
theory The result is
!°(
where H is the Heaviside function. Equivalently, (45) can be written as
J X)
t -- for 0::: x < ct,
u(x, t) = C
for x> ct.
660 9 BOUNDARY VALUE PROBLEMS OF MATHEMATICAL PHYSICS
Thus the prescribed motion of the end of the string generates a wave that propagates out
along the string with speed c.
IExercises 9.71
Infinite Diffusion. Consider the boundary value problem
Solve by Laplace transform methods the following instances of this problem for the solution u(x, t). The
answers involve the error function
erf(x) =
2 r e-
-./7i 10 X
2
dx.
Solve by Laplace transform methods the following instances of this problem for the solution u(x, t). The
answers will be in unevaluated integral form.
Solve by Laplace transform methods the following instances of this problem for the solution u(x, t). Graph
snapshots of the solution for c = 1,0:::: x:::: 1, t = 0, t = 0.2, t = 0.6.
7. (Semi-infinite String with Distributed Load) Use Laplace transform methods to find u(x, t) for the
modified semi-infinite string problem below. Then verify that the answer is correct.
Section 1.2
I. Y = 0.209 + 0.52x,y = 0.443 + 0.26x
3. Y = -0.7 + 2.2x, y = 6 - 4.5x
5. ao = 0.469, al = 0.52, a2 = -0.3714285713
7. ao = 0.4, al = 2.2, a2 = -6.7.
9. Y = -15000x + 149000, y = 7000(x - 1) + 134000, y = 23000(x - 2) + 141000.
II. Check that the quadratics interpolate the data points
Then show that the polynomials can be pasted together to make a continuously differentiable function
f withj'(O) = o.
13. Choose Xo = 0, then (x - xo)(x - Xl)(X - X2)(X - X3) = h4 t(t - l)(t - 2)(t - 3) has its value of
largest magnitude on 0 :s t :s 3 at some point t (t = xlh). The possible values of t are ~, ~ + 1,
~ - 1. The largest magnitude is I - 11·
21. Item (b) follows by changing variable in the integral, u = -v, du = -dv.
27. Wild oscillation at the ends of the interval.
29. The proof is by induction. Given Zo and the relations mk = C(fk+l-ikl), Zk+l
Xk+l-Xk
= 2mk - Zk, for
k = 0, 1, ... , n - 1, it suffices to prove the relation Zk = F(k) where F(k) = 2 L~~~ (_l)k-l-i mi +
(-l)k zo .
35. Let j(x) = (x - xo)(x - Xl). The largest value of If(x) I is If((XQ + xl)l2)1 = (Xl - xo)2/4.
Section 1.3
1. 0.170825, exact 0.1666666667.
3. 0.3858779367, exact 2In(2) - 1 = 0.386294361.
5. 0.1667000000, exact 1/6 = 0.1666666667.
7. 0.9095424082, exact 3 In(3) - 1 - 2 In(2) = 0.909542506.
9. n = 17.
11. 1.999835504, exact answer 2.
l3. 25 divisions. Table of erf(O.I) to erfO):
15. 6 divisions or m = 3.
8x 2 - 4 2
19. j"(x) = -/ii e- x , n = 20 in trapeZOidal rule. The estimate is 0.2226989755 for exact erf(0.2) =
0.2227025892, error 0.0000036137.
23. Definej(x) = ]0 (x). Let C(h) = (f0 + h) - jO - h»/(2h), R = (4C(hl2) - C(h»/3 (Richardson
extrapolation). The computations, using a hand calculator:
Section 1.4
1. See Figure 1.
ANSWERS AND HINTS TO SELECTED EXERCISES 665
3. See Figure 2.
25. The derivative off is positive, because l' (x) is the sum of two positive terms (except at x = a). Apply
the previous exercise.
29.
Section 1.5
1.
Y X Y x
3.
Y x Y x
5.
Y x Y x
x y x y
11.
13.
13. FyCO,O) = 1.
15. FyCO,O) = 2.
ANSWERS AND HINTS TO SELECTED EXERCISES 669
19. A most interesting proof can be found in the classical analysis text of Dieudonne.
21. To = Qlh = lOQ/(lOh) = IOQlx.
Section 1.7
1. N = 66.
3. N=4.
5. N = 26.
Section 2.1
[24 24
1. Typically, 10 (1 + 20 dt = (t + (2 )lo = 600.
3.
1
i] = - (e- I +sint-cost),
2
1
i2 = - (11e- 1 + sm t - cos t); see Figure 3.
2
5. The cooling equation has formal solution u(O = 20 + SOc-hI because in this setting to = 0, T = u(t),
TO = 100. The additional condition, eg., u(10) = 60, determines h uniquely as h = In(2)/l0 The
cooling time t is the solution of 21 = 20 + SOe-ht for the given value of h.
7. To check the initial condition, set x = 0 to obtain yeO) = eO +0 2 = 1. To check the differential equation,
differentiate y to obtain y' = - 3e- 3x + 2x.
9. x = 0 and x = 2.
11. See Figure 4.
17. Isoclines are horizontal lines x = 3/2 ± V0.25 + c, when c takes on values c 2: -1/4.
Section 2.2
\
x
FIGURE 3. Plot of hand i2
610 ANSWERS AND HINTS TO SELECTED EXERCISES
I / /
/ /
-1
1.0
,
\
0.1
o 1 FIGURE 5. Direction field for XXi +t = 0
11. xCt) = et
1
1 /
-1
21. The plotted data points are (tk, Vk), k = 0 to k = 24. The recursion is Vv+1 - Vk = (%+1 - %)12
(k = 0 to 23), where the tk's and qk'S are given in the flow meter table.
23. 0.0008842 inches.
25. 135.12125 minutes.
27. Because a half-full tank empties at lower velocity, it takes longer to empty a half-full tank. Il
523.94332 seconds, I2 = 1788.8544 seconds.
29. t = 19.4334 by the trapeZOidal rule, t = 19.4334 by Simpson's rule.
31. 69.44 days.
Section 2.3
1. x(t) = xoe-Aot.
3. x(t) = e-1+C05(t)
5. x(t) = e-1+e- t
7. x(t) = 5/.
9. x(t) = t2 + 2.
t5 1
11. x(t) = "9 - 9t 4 '
1
13. x(t) = sin t + - (cos t - cos 1 - sin 1).
t
15. x(t) = t - 2 tanet) sec(t)
7. x 3y + 4x 2y2 + 4y 3 = c.
672 ANSWERS AND HINTS TO SELECTED EXERCISES
9. x sin(y) = c.
2
11. )1-x2 +L = c.
2
y4
13. xy3 + y3 - - = c.
4
15. (y+x)e'Y=c.
17. x 2y+y= 1.
Section 2.5
1. xCt) = Cjt + CI
3. x(t) = Cjt + (2 + t2 /2 + t3 /6.
~
17. v dy = -2v, v + 2y = C, or v = 0, xCt) = Cj + CIe- li .
21. ex =
~
yk 2 +4f.g, vo
-k - ex
= - - , VM =
-k +ex
- - 0- , v= VOVM
exte"t-1 ,J
-1
(t) = va +
VO(VO - VM)
e"t '
i
U h ~ -~ ~ -~
t -1 ( )d
J t t = vat + voCvo - VM) In VM - voe-at . I I
a voex VM - vo
gt 2 b
23. x(t) = Re + bt - - + -(M - at) In 11 - aUMI·
2 a
1
25. y(x) = 1 + .
eX 2- x (I; _tet2 - t dt + C)
ANSWERS AND HINTS TO SELECTED EXERCISES 673
5 ( 2g - 0.23 )
27. t) = l(v)) = - ( ) = 0.1568155886. For the others, employ Simpson's rule, for
2 g g - 0.23
Section 2.6
L Define D to be the region -00 < t < 00, X < L The solution sin t exits D at t = ±nI2, at which time
x' (0 ::: 0, so x(o = 1 for t > n12.
3. The issue is the continuity of the derivative x' (1) at the points t = a and t = b.
Section 2.7
3. The slope field is F(t,x) = x(l - x) Use grid (-2h,y), (-h,y), (O,y), (h,y), (2h,y) for y = 0.5 + 2h,
O.S + h, 0.5, 0.5 - h, 0.5 - 2h and h = 0.25.
x Exact
0
0.1 1.1 1.10S17
0.2 1.21 1.2214
0.3 1.331 1.34986
0.4 1.4641 1.49182
x Exact
0 0.5 0.5
0.1 0.525 0.52498
0.2 0.5499 0.54983
0.3 0.57469 0.57444
0.4 0.59913 0.59869
x Exact
0 1.0 1.0
0.1 1.11 1.1103
0.2 1.24205 1.2428
0.3 1.39847 1.3997
0.4 1.58180 1.5836
0 0.5 0.5
0.1 0.525 0.52498
0.2 0.54988 0.54983
0.3 0.57450 0.57444
0.4 0.59877 0.59869
x Exact
o l. 00000 l. 0
0.2 1.2428 1.2428
0.4 l.58364 l.5836
0 0.5 0.5
0.2 0.54983 0.54983
0.4 0.598686 0.59869
x Exact
0
0.2 1.22 1.2214
0.4 1.4884 1.49182
0 0.5 0.5
0.2 0.54975 0.54983
0.4 0.59852 0.59869
x Exact
o
0.4 1.49173 1.49182
o 0.5 0.5
0.4 0.598686 0.59869
4 0.7368351728 -0.0057765941
64 0.7314271978 -00003686183
256 0.7311508052 -0.0000922263
4096 0.7310643445 -0.0000057653
27. The exact value of x(t) = 1/(1 + e- t ) at t = I is xO) = 0.7310585787. The results are as follows,
where the last column of the table IS xO) - XN, and XN is in column 2.
0.7291666667 0.0018919127
2 0.7309788031 7.97757e-05
4 0.7310545273 4.0517e-06
8 0.7310583515 2.277e-07
29. The exact value of x(1) = 1/0 + e- 1) at t = 1 is xO) = 0.7310585787. The results are as follows,
where the last column of the table is xO) - XN, and XN is in column 2.
0.7309103404 0.0001482387
2 0.7310474077 1.1 1717e-05
4 0.7310578607 7.187e-07
31. The exact answer is f;/2 cos(tJdt = 1. The results for the four methods appear in the table below.
Exact 1. 000000000
Simpson 1.000003392
Euler 1.076482803
Heun 0.9979429868
Rk4 1000000212
676 ANSWERS AND HINTS TO SELECTED EXERCISES
Section 3.1
23. By Kirchhoff's laws, ij + i2 + i3 = i. Drop formulas give ij = CLi~ = Cu", i2 = u'IR, i3 = U/L
1 1
25. Ljq~ = --qj + R(i2 - ij), L2q~ = --q2 + R(ij - i2). The isomorphism is
Cj C2
(mj,m2,kj,k2,C) --+ (Lj,L2, 1/Cj, 1/C2,R).
Section 3.2
1. W = -2.
3. W = cosh2(t - to) - sinh 2(t - to) = 1.
5. W = _2e- 3tJ2
7. W = _3et2 /4 .
9. X2 (t) = sin t.
11. x2(t)=tlnt.
13. X2(t) = 1.
15. X2(t) = -1 + -tin
1 11-+-t I.
2 1- t
4
17. xp(t) = _t 7l2 et
35
1
19. xp(t) = '2t2(ln t)2 - t 2 ln t.
1
21. xp(t) = -'2tcost.
300 + 30.jl49
23. t = 49 = 13.59585.
ANSWERS AND HINTS TO SELECTED ExERCISES 677
Section 3.3
1. et , e2t .
3. e3tJ2 cos(J9lU2), e3t12 sin(J9lU2).
5. e3t11O cos(UlO), e3tJlO sin(UIO).
7. e- 2t15 , te- 2t15 .
3 4 1
9. xp(t) = - - cos(2t) +- sin(2t) + -.
50 50 2
1 t
11. xpW = -"2e- .
1 2t
13. Xp(t) = -te + (19
- - 3-t - -t
1 2) e.t
5 32 8 4
15. Q = CI cos(50t) + C2 sin(50t).
17. Q = CI cosCU20) + C2 sin(U20).
19. i(t) = -lOOe- t sin(20t).
kAo(k - mw2 ) -kAocw
21. Al (w) = ,A2(W) = .
(k - mw 2 )2 +
c2 w 2 (k - mw2 )2 c2w 2 +
k 2A2
23. A(w)2 =
(k - mw)
2 20
+c
2 2' A' = 0 when -[2(k - mw2 )( -2mw)
w
+ 2c2wl = 0, so at a rest point
4m 2 k 2A2
A(w)2 = 2 O2 ,but otherwise the maximum is at w = O. ThenAmax = Ao for 2km - c2 ::'S 0;
c (4km - c )
2mk
otherwise Amax = Ao . Resonance occurs at those frequencies w that maximize the
cJ4km - c2
amplitude A(w).
25. See Figure 7.
27. Let the roots be)..2 = -all + ../B12, )..2 = -al2 - ../B12, where D = a2 - 4b =discriminant. Then
(D> 0) W = be-at, (D = 0) W = e- at , (D < 0) W = Jb - a 2/4e- at .
33. LetfI(t) = eC- 33 +v'6s)t14 + /-33-v'6s)t14,h(t) = eC- I7+J33)t12 + /-17-J33)t12. They cross at
t = 0.1230872708. See Figure 9.
0.35
0.00
o FIGURE 7. Voltage Vet)
678 ANSWERS AND HINTS TO SELECTED EXERCISES
60
° -2 o 2 FIGURE 8. Plot of e- I + e- 21
15. Xl (t) = 1, x2 CO = 1.
17. XI (t) = 1, X2 CO = 1- e-1. For x2, co = 0, Cj = 1, ck+2 = -Ck+l/(k + 2), Then Cn = (-l)n+l/n!
for n ~ 1.
19. XI (t) = 1- t 3/6+ t 6/180 - " ' , X2(t) = t - t 4 /l2 + t 7/504 - ... , Use ck+2 = -ck-l/[(k + l)(k+ 2)]
°
and Co = 1, Cj = C2 = for Xl, similar for X2, Exact solutions involve Bessel functions, too complicated
to cite,
31. Roots 0 and 2. Xl (t) = cO, X2 (t) = qt 2. Recursion relation is k(k - 2)ck = 0 for the smaller root.
33. Roots -3 and -2.Xl (t) = co L~o t 2k - 2/(2k + 1)! = cot- 3 sinh(t).x2(t) = Co L~O t 2n - 3/(2n)! =
cot- 3 cosh(t).
35. Roots 0 and 2. Xl (t) = cot2 L~o( _1) n t 4n +2/(2n + 1)!. Recursion Cn = -4cn-4/[n(n + 2»), Co =I 0,
q = c2 = C3 = O. X2(t) = cOt2 L~0(_1)nt4nl(2n)!. Recursion Cn = -4cn-4/[n(n - 2)], Co =I 0,
q = Q = c3 = 0 (Q =I 0 reproduces the solution Xl with C2 replacing co).
37. The integrand is smooth; differentiate under the integral sign. Then apply integration by parts with
u = sin(t sin 0), du = t cos(t sin 0) cos 0 dO.
39. The integrands upon addition collect with factor [1 - cos 2 0 - sin 2 0] = o.
41. Expand both sides into a series, using term-by-term differentiation on the left. The two series are equal,
by making a change of variable k = m + 1 in the series on the right.
45. Truncate to L~~o( _1)k(xl2) 2k l(k!)2 for plotting, using Taylor estimates on 0.1 ~ X ~ 10. Plot with
a hand calculator or a computer algebra system.
47. When 11- = 0, then 0 - x2)y" - 2xy' = O. The roots are 0 and O. The recursion cn+! = -n(n +
1)cnl[2(n + 1)2] leads to y(x) = Co. Choose Co = 1 to make yO) = 1. When 11- = 1, then
(1- x2)y" - 2xy' + 2y = O. Theny(x) = Co + q (x - l);yO) = 1 gives q = Co = 1 andy = x.
Section 3.5
3. x(1) =t- sint, 0 ~ t ~ n:,x(t) = t 2/2 +sint+ (3 -n:)t+n: 2/2 - 2n:, n: ~ t ~ 2n:.
k tk Xh Yk Ek E'
k Exact
0 0 0 0 0
0.1 -1.6 -0.0789 0.0423 0.92106099
2 0.2 0.84 -3.2 -0.1433 0.3306 0.69670671
3 0.3 0.52 -4.914 -0.1576 0.8158 0.36235775
4 0.4 0.0656 -5.376 -0.0948 1.3777 -0.029199522
680 ANSWERS AND HINTS TO SELECTED EXERCISES
0 0 0 0 0 0 0
0.1 0 0.1 0.00452 -0.01404 0.004524
2 0.2 0.01 0.17048 0.00637 -0.02311 0.016375
3 0.3 0.027048 0.21726 0.006288 -0.02835 0.033337
4 0.4 0.04877 0.245185 0.00485 -0.03068 0.053626
0 0 0 0 0 0
0.1 0.1 -0.0265 -0.07894 0.09735
2 0.2 0.2 0.84 -0.02066 -0.14329 0.17934
3 0.3 0.284 0.52 -0.05099 -0.15764 0.23300
4 0.4 0.336 0.0656 -0.08611 -0.09480 0.24989
h tk xk Yk Ek E'
k Exact
0 0 0 0 0 0 0
1 0.1 0 0 0.000167 0.005 0.000167
2 0.2 0 0.010017 0.001336 0.01005 0.001336
3 0.3 0.001 0.03015 0.003520 0.01519 0.004520
4 0.4 0.004017 0.060602 0.006735 0.62047 0.010752
h tk xk Yk Mean error
0 0 0 0 0
0.2 0.00133 0.019933 0
2 0.4 0.01058 0.078939 0
h tk xk Yk Mean error
0 0 0 0 0
0.2 0.19933 1.98669 0.0000115
2 0.4 0.789389 3.894186 0.000025
ANSWERS AND HINTS TO SELECTED EXERCISES 681
0 0 0 0 0
0.2 0.000133 0.0026667 0
2 0.4 0.002133 0.0213333 0
23. Exactx = cos 4t. Data points (0, 0), (0.1, -0.0789), (0.2, -0.14329), (0.3, -0.15764), (0.4, -0.0948)
25. Exact x = t2 e- t /2. Data points (0,0), (0.1,0.0045), (0.2,0.0064), (0.3,0.0063), (0.4,0.0049).
27. q(t) = 10-10costonO ~ t ~ 10,q(t) = 100-coslO)cos(t-lOHlOsinlOsin(t-lO)ont > 10.
29. x(t) = 0 for 0 ~ t ~ to, xU) = (t - to)4/108 for t > to defines infinitely many solutions.
31. Use to = 0, Xo = 0, Yo = 1, Fl = y, F2 = -x, h = 77:/5 for 5 steps. The recursion starts with
Xl = h - h 3 /6, Yl = 1 - h2 /2 + h4/24
t5
27. x = 6000'
t6
29. x = 720'
682 ANSWERS AND HINTS TO SELECTED EXERCISES
1 1 3 1
31. x = _e- t + -tet - _et + -(cost + sint).
8 4 8 4
t3
33. x = q + C2 t + C3 t2 + 6.
2 t5
35. x = q +C2 t + Clt + 6000.
37. x=qet+Cle-t+c3cost+qsint-1.
Section 4.1
1. See Figure 10.
3. See Figure U.
Section 4.2
1. 2e-s~ (a cos at + s sin at) from t = 0 to t = 00.
s +a
-st . -at -2as -st a2 _5 2
3. e (A cos at + B Sin at), A = s2+a2 + (s2+a2)2' B = s2+a2 + (s2+a2)2 from t = 0 to t = 00.
-Xt
5. ;;f+b 2 (b cos bt + X sin bt), X = s - a, from t = 0 to t = 00.
7. y=25(1-cos2t).
9. y = i - i cos( ht) - H(t - 1) 0- i cos h(t - 1)), Y55 = i (cos h(t - 1) - cos h t)
1
11. (5-5)2'
6 I
13. (5-1)4 - (s+ 112)2 .
,,10 (-I)"nl
15. L...n=O (5-1)"+1 .
31.
e- 2s
2
(L +
5-1
e- 2 )
5+1 .
k ke- PS
67. ps2 - s(l-e-PS)'
69. Write OJ) = i (L~l a" ("~p + j(O») and compare with C(j').
+1 t 00 f(n) (0) ,
71. Since C(eatt") = n!/(s - a)n ,the transform of ea jet) is the series Ln=O -n!- (s_:jn+l . Compare
this series to the series for C(j).
Section 4.4
1. x(t) = 200e U20
19 . xCt) -- -Lte7U3
22
189 elU3
_ ill + ....2...
242
(63+38v'3 ev'3t + 63-38v'3 cv'3t)
6 6 .
ANSWERS AND HINTS TO SELECTED EXERCISES 685
23. i(t) = -WOe-lOt sin(20t), plot on [0, 2nI20[. See Figure 12.
Section 4.5
7. x(t) = e2t (cos 3t - j sin 3t), yet) = e2t (cos 3t + 3 sin 3t).
3.
7xI - 3X2 = 9,
-3xI + 10X2 - 2x3 = 0,
- 2x2 + lOX3 - X4 = 0,
- X3 + 12'4 - 2xs = 0,
- 2'4 + 6xs = O.
21. X = ~K - G, y = ~G - ~K.
n+l-k
23 . xk = Ii+l'
Section 5.3
1. Null space span{O}, range R2.
13. Verify v3 = Vj + 2V2. Then solve Cj Vj + C2V2 = dj V2 + d2V3 for Cj, Q in terms of dj, d2.
15. Any n + 1 vectors in V are dependent. Argue that any v in V is a linear combination of a basis Vj, ... ,
Vn of the subspace V.
17. There are ten separate proofs. For example, items (I) and (VI) hold by the definition of function addition
and function scalar multiplication.
Section 5.4
3. Yes.
9. G~).
11. AB=BA=diag(-1,-4,-2,-2)
19. The diag(aj, a2, ... , an) is Singular if and only if aj a2 ... an = O.
27. (11010)
001
0 ,(1
0010)
101
0 ,(1
010 0)
011
0 ,(1
01 0 0)
001
1,0(1
0
o1) , 0 1 0)
0 0 1
0 .
0 0 1
C
(
-9/5 2215
-115 )
29. -1 2 o .
4/5 -7/5 115
( )
-11 5 -2
31. 7 -3
5 -2
33.
(
315
-2/5
-1
-3/5
215
2
115
115
-1
).
( )
0 0 217 -117
-1 -1 1617 617
" -2
0
-1
0
1517
317
317
217
688 ANSWERS AND HINTS TO SELECTED EXERCISES
-1/2 113 -2
1/3 -1 713 )
4/3
37. ( -1/~ -1/3 3 -10/3 .
-113 -1/3
39. Entries on the left have the form Lk=l L~=l aikbkfcfj-
41. Entries on the left have the form Lk=l (aik + bik)ckj.
Section 5.5
1. Choose a, b to span the plane; then a x b completes a basis in R3 , and V-1 = span{a x b}.
7. A = (~ ~ ~). b= G)'
9. 4.
11. 1.
13. The null space is spanned by (- ~) , the nullspace of A* is the zero vector, the range of A is spanned
by G)' (~). and the range of A * is spanned by G) CD and
Section 5.6
1. -1.
3. -1.
5. c.
7. b2
9. -10.
11. 0.
13. 0.
15. 1.
17. -1.
19. abed.
21. -abed.
23. 21.
25. 4.
27. 1.
29. Xl =x2 =-1.
31. Xl = 0, x2 = 1, X3 = 1.
33. Xl = -1,X2 = 0,X3 = 0,X4 = 1.
35. °
37. Experiment with dimension 4. Subtract row I from the otherthree rows to reduce it to a 3 x 3 determinant
of the same form. Repeat.
39. Use the area equal to one half the area of the corresponding parallelogram, which is i la x hi.
41. The equation after expansion is linear in X, y, Z, hence it represents a plane. The points are in the plane
because of determinant rules.
43. 3.
45. 4x - Y + 2Z = 4.
47. 1.
49. 18.
51. Yes. Since det(A) = det(B) det(CB), then det(CB) =f. 0, and CB is invertible.
Section 5.7
1. 5 - 4)" + )., 2 , 2 + i, 2 - i.
3. ).,2-1,1,-1.
19. The eigenpairs are 1, (-1, -7,4); 2, (0, -1, 1); 0, (0, -3, 1).
21. Eigenvalue A = 0 produces only one eigenvector (1,0,0). Geometric multiplicity 1, algebraic
multiplicity 3.
23. Eigenpairs are 2, (0,1,0,0); 2, (1,0,0,0); 1, (0,0,1,0); -1, (0,0,0,1).
l
25. Let P be the matrix of eigenvectors of A; then p- 1AP = AI. Multiply by P and p- 1 to obtain A = AI.
1 0
37. It is equivalent to solving for q and C2 in the equation ( 2 -=- ~ 2 -=- ~ ) G~) = (~). Apply rref
methods.
39. A2 - 4A +5= 0, roots 2 + i, 2 - i.
41. A2(2 -A).
45. Directly expand the determinants, or else use the determinant expansion formulas for 2 x 2 and 3 x 3
characteristic polynomials given in earlier exercises.
47. (~ ~)-
49. Use the subspacecriterion: "If x and y are in V and q, C2 are constants, then qx + C2Y is in V."
51. Let x = (1,1,1). Show Ax = 7x.
53. Multiply Ax = AX by the inverse matrix.
55. Let y = Akx. Show Ay = Ay.
Section 6.1
11. 41!'.
13. 1.33,2.23,2.14.
15. The angles are 35.6 and 144.4 degrees (their sum must be 180°).
17. 1.25.
19. 2.145.
23. x = 1.
25. 2a· (b x c)
29. (2,3,-3)
43. Write out both sides explicitly in terms of components and compare terms.
Section 6.2
1. r(t)=0,-1,3)+t(2,5,4)
I(
5. ret) = 0 + 7 cos t, 2 + 7 sin t, 5).
1,0,0 Ht( 0,1,0), (l,0,0) --+ 0,1,0),
7. ret) = (1,1,0) + t( 1,2,1 ), (1,1,0) --+ (2,3,1),
( 2, 3, 1 ) + t( -1, - 3, -1 ), (2,3,1) --+ (1,0,0).
9. T = (l!J30)(i + 2j + 5k)
11. r'(t) =- sin(t)(i - j) + cos(t)(i + j) + k, r'(t) = .f3
13. rCt) = 2 cos e(t)i + 2 sin eWj, r' (I) = 20.
692 ANSWERS AND HINTS TO SELECTED EXERCISES
15.
r' (t) = 4 sin O(cos Oi + sin OJ)
+4(1 - cos 0) (- sin Oi + cos OJ),
rl/(t) = 16 cos O(cos Oi + sin OJ)
+ 16 sin O( - sin Iii + cos OJ)
+ 16 sin O( - sin Oi + cos OJ)
+ 16(1 - cos 0)( - cos Oi - sin OJ).
17. The product formula from calculus implies (aj(t)bj(t))' = aj(t)bjCt) + aj(t)bj(t),j = 1,2,3. Add.
19. 2.,f5n.
21. 3n.
23. Start with the result of the previous exercise; then insert KN = dT/ds, r' = r'T.
25. Use K = [r' (t) x rl/ (t)[l[r' (t)3[.
27. The circular helix is a space curve on the cylinder x 2 + l
= 1 that wraps around the cylinder like a
coil spring. The tangent vector points in the direction of the curve. The normal points to the center of
the circle of curvature, which is on the axis of the circular cylinder x 2 + y2 = 1, i.e., the z-axis.
29. Use rl/(t) = (d2s/dt2 )T + (ds/dt)2N and N = B x T.
31. Apply the previous exercise.
33. The triple product is -2. Then h3 =i xj = k; h2 = (i - j)/2; hi = (i + j - 2k)/2.
35. 5v'IO
37. O.
Section 6.3
1. ru = 2ui + vk, rv = 2vj + uk
3. r(u, v) = ui + vj + !C7 - 3u + 2v)k.
5. r(u, v) = (u + v)i + (1 - 2u)j + (u - l)k.
7. For y'3 :::: v :::: 3y'3, 0 :::: u :::: 2n, let r(u, v) = vcosui + v sin uj + (v/y'3) k.
9. r(u, v) = 4 sin u cos v i + 4 sin u sin v j + 4 cos u k for 0 :::: v :::: 2n, 0 :::: u :::: n/3.
11. (0,0,0).
13. No. The trouble is theconditionru xrv i= 0, whichfailsatu = v = 0, becauseru = rv = Oat u = v = 01
15. n = -2i - 4j + k.
17. -6(x - 1) - 2(y - 1) + z - 4 = O.
19. Show that r is in the plane of PQR. Show that (x,y, z) is inside triangle PQR using plane geometry.
*'
21. r(u, v) = sinucosvi + sinucosvj + cosuk, 0 = {(u, v) : 0 :::: u :::: n/2,0 :::: v :::: n/3}, ru
cos u COSy i + cos u cosvj - sin u k, rv = - sin usin vi - sin u sin vj, ru x rv = sin2 usin vU - i),
Area =
23. r(u, v) = u 2 i + uv j + 5:v2 k, 0 = {(u, v) : 0 :::: u :::: 1,0 :::: v :::: 3}, ru = 2u i + uj, rv = vj + v k,
ru x r; = (2u 2 + v2)2, Area = 11.
ANSWERS AND HINTS TO SELECTED EXERCISES 693
25. There are five faces to parameterize. Because of symmetry, only three need to be detailed. The base:
r(u, v) = ui + vj, 0 = leu, v) 0:::: u :::: 1,0 :::: v :::: 2}, ru = i, Iv = j, Iu X Iv = 1,
Area of base = f5 fOI
dudv = 2. The side in the (x, z)-plane: I(U, v) = ui + vk, 0 = {(u,v) : 0 ::::
u :::: 1,0 :::: v :::: 4}, Iu = i, Iv = k, Iu X Iv = 1, Area of (x, z)-side = f; f5
dudv = 4. The
side in the (y,z)-plane: I(U, v) = uj + vk, 0 = {(u,v) : 0 :::: u :::: 2,0 :::: v :::: 4}, Iu = j, Iv =
k, Iu X Iv = 1, Area of (y, z)-side = f04 f5 dudv = 8. The total surface area of the open box (no top)
is 2 + 4 + 4 + 8 + 8 = 26.
27. I(U,V) = vsinacosui + vsinasinuj + vcosak, 0 = {(u,v) : 0 :::: u :::: 2n,0 :::: v :::: h},
Iu = -vsinasinui + vsinacosuj, Iv = sinacosui + sinasinuj + cosak, Iu X Iv = vsina,
Area = nh 2 sin a.
29. I(U, v) = a cos u i + a sin uj + v k, 0 = leu, v) : 0 :::: u :::: 2n, hI :::: v :::: h2}, Iu = -asin u i + acos uj,
Iv = k, Iu X Iv = a, Area = 2an(h2 - hI)'
31. I(U,V) = ui+vj+ (u 2 +v 2 )k, 0 = {(u,v): 0:::: u:::: 1,0:::: v:::: I}, Iu =i+2uk,I v =j+2vk,
Iu X Iv = "II + 4u 2 + 4v 2 , Area = fOI f5 ,,11 + 4u 2 + 4v 2 dudv ="=' 1.86.
33. x= 1/3, Y = 1, Z = -213.
Section 6.4
1. See Figure 14.
3. See Figure 15.
5. See Figure 16.
7. Lift each contour to the height specified, making one curve in each of four planes. Shade the figure.
9. 36n.
11. 2.
13. 6.
15. I(t) = (2 + 2t)i + (4 + 3t)j.
29. O.
31. 0
33. 4xi - 6yj + 2zk.
35. -zj - (sin(x) cos(y»k.
37. Apply the calculus theorem on equality of mixed partials.
39. Apply the calculus theorem on equality of mixed partials.
41. 4.
43. -512n.
45. n.
Section 6.5
19. x = 2, Y = acos t, z = b sin t, 0 ::::: t ::::: 2n where a = v's and b = "fi are the semi-axes of the ellipse.
21. The answer is e = e) + e2 + e3, where
e): x = 1 - t, Y = t, Z = 0 0::::: t ::::: 1,
e2: x = 0, y = 1 - t, Z = to::::: t ::::: 1,
e3: x = t, Y = 0, z = 1 - to::::: t ::::: 1.
23. The set is described by 0 ::::: x ::::: 1,0 ::::: y ::::: 1. This is both an x-domain and a y-domain. The outer
normals: segment x = 0: n = -i, segment x = 1: n = i, segment y = 0: n = -j, segment y = 1:
n =j.
25. The domain can be described as 0 = {(x, y) : -1 ::::: x ::::: 1, -1 ::::: y ::::: "II - x2} with normals:
upper circle: n = xi + yj; segment x = -1: n = -i; segment x = 1: n = i; segment y = -1: n = -j.
27. To describe 0 we use three regions obtained by splitting the given region along lines y = 2 and x = 2.
Define y-domains D) = {(x,y) : 0 ::::: x::::: 2,2 ::::: y ::::: 3), 02 = {(x,y) : 0 ::::: x ::::: 2,x ::::: y ::::: 2), 0 3 =
{(x,y): 2::::: x::::: 3,0:::::y::::: 3). The normals are D)-segment x = 0: n = -i, D)-segment x = 2: n =
i, O)-segmenty = 2: n = -j, O)-segmenty = 3:n =j, 02-segmenty = 2: n =j, 02-segmentx = 0:
n = -i, 02-curvey = x: n = (i - j)/"fi, 03-segmentx = 2: n = -i, 03-segmentx = 3: n = i,
03-segment y = 0: n = -j, 03-segmenty = 3: n = j.
29. The domain is described by region R = {(x,y) : 0 ::::: x::::: 1, -1 ::::: y ::::: I} and the inequality -2 :::::
z ::::: 2. The latter has the form g(x,y) ::::: z ::::: hex, y), and therefore the given set 0 is already in the form
of a z-domain. The normals: face x = 0: n = -i, face x = 1: n = i, face y = -1: n = -j, face y = 1:
n = j, face z = -2: n = -k, face z = 2: n = k.
31. The domain is described by region
and the inequality 0 ;:0 z ;:oj4x 2 + 4y2. The latter has the form gex,y) ::::: z ::::: hex, y), and therefore the
given set 0 is in theform of a z-domain. The normals: face z = 0: n = -k, cylinder ex - 1)2 + y2 = 1:
8 2'+8 2'_k
n = ex - I)i + yj, cone Z = j4x 2 + 4y2: n = Xl YJ
v')6z 2 +!
33. n.
35. 0
37. -2.
39. 4.
41. -4.
43. 2.
45. 32JT/3.
47. 4.
49. 32n.
51. 4.
53. -2n.
55. -2n.
696 ANSWERS AND HINTS TO SELECTED EXERCISES
57. -4Jr.
59. 2Jr.
61. Let F = xi. If the divergence is constant, then one side of Greens theorem is a multiple of the area of D.
63. Apply a previous exercise to obtain the ordinary integral J5" cos 3 t(3 sin 2 t cos t)dt.
65. A shared boundary arc C* between two subdomains DI and D2 of D has to be oriented in opposite
ways, in order for D to be regular.
Section 6.6
1. 4.41 x 10 22
1 1 1 2 -I 1 3 -I
3. a =2 a2 xa 3=L: a l,a =Ta3xal=L:a2,a =T a l xa 2=a3·
5. The plane equation is 12nl + 4n2 + 3n3 = 12, which applies to triples (nl, n2, n3) corresponding to
+ n2a2 + n3a3.
vectors nl al
7. The family 12nl + 4n2 + 3n3 = L defines a family of parallel crystal planes.
1. Draw a right triangle with sides 1, ux, )1 + u~ to obtain formulas for sin/! and cos/! in ter.ms of ux.
Apply the binomial series formula to expand (l + u~)-1/2 in a Taylor series.
3. The period of UI or u2 in t is 2Jr divided by nJrclL Hence they also have period P.
5. Let u be the difference of two solutions. Define E(l) = ~ f~(pu~ + Tu;)dx. Show that E/(t)
-ka(u(O, t)2/2))lot and hence E(t) ::: E(O). Then E(O) = 0 implies E(t) ::: 0 or E(t) = 0 for all t. So the
integrand must vanish, which implies u(x, I)=constant. Since u(x, 0) = 0, then u == 0 as required.
Section 7.2
1. The cosine and sine terms satisfy a differential equationy" + JL2 y = O. Use it to find second derivatives.
3. Show that the difference u of two solutions is zero. Define I(t) = ~ f~ u 2(x, t)dx. Then show that
I' = -K J~ u~ dx ::: 0; hence I(t) ::: 1(0) = 0, and then 1(1) == O. Integrand zero implies u == O.
5. The details are the same as Exercise 3, with the exception of the treatment of the boundary terms in I' (I).
Here, the term KAu(x, t)ux(x, I)1~~6 appears, which is zero, so again I'et) ::: O.
Section 7.3
-*
1. Again Jar? u(V'(u) . n)dS = O. The rest ofthe proof is unchanged from the text.
3. In this case, Jar? u(V'(u) . n)dS = Jar? u 2dS ::: O. The rest of the proof is unchanged from the text
version.
5. A solution u = 0 if and only if one of the sine factors is zero at x = 1 or y = 1, which happens exactly
when A or JL is an integer multiple of Jr.
Section 7.4
1. Expand the dot product V' . Cu V' (u)). Compare with the right side, V' Cu)2 = u~ + u;.
3. A constant satisfies the differential equation, so the only question is the boundary condition. This is
settled quickly by observing that V' (u) = 0 when u is constant.
ANSWERS AND HINTS TO SELECTED EXERCISES 697
5. Let U be the difference of two solutions. Then u satisfies a Neumann problem with mixed boundary
conditions. By the proof of the uniqueness theorem, V(u) = 0 on n, because u(V(u) . n) = 0 on an.
Hence for connected n, u is a constant on n. Since q '" 0, then this constant is zero, and hence u is zero.
Section 7.5
1. The sine and cosine terms are solutions of an ordinary differential equation wI! + a2 w = 0, where
depending on the context, a = j.i, a = A, or a = we. Therefore, Utt = -w 2c2u, Uxx = -A 2U,
Uyy = _j.i 2u. It remains to compute both sides of the differential equation and compare terms.
° ° °
1. It suffices to show that the difference u of two solutions is zero. Let let) = ~ f f fa u 2 dx dydz. Then
1'(1) = gives I(t) = 1(0) = and finally u ==
-f f faa u2 dS :s: 0; hence I'(t) :s: °as before, to give 0 :s: let) :s: 1(0) = 0.
3. The method parallels that of the first exercise. The only new detail is that the boundary integral of
u(V(u) . n) evaluates to
5. A solution u vanishes on the boundary if and only ifthe product of the sine factors vanishes atx = y = z =
1, and this is exactly the condition sin Asin j.i sin v = 0. Therefore, A, j.i, and v are integer multiples of Jr.
Section 7.7
1. The proof uses the same integral method as in Section 7.4. Here, V . (uV(u)) = V(u)2, and by the
divergence theorem the integralofVu 2 over n is zero (u is zero on the boundary). Hence Vu = Oin n, and
then by connectness of n it follows that u is constant. The constant is zero because u = 0 on the boundary.
3. The difference u = U2 - ul satisfies L'.u = 0 and zero boundary conditions. Methods of the first exercise
produce Vu = 0, and finally u is some constant c on n by connectness. Then u = c gives u2 = ul + e.
5. The proof parallels that of the first exercise. The difference in detail is that V(u)2 = V· (uV(u)) = -hu 2
on the boundary; therefore, by the divergence theorem 0 :s: f f fa V(u)2dx dydz :s: 0. As before, u = c
°
on n. Then V(u) . n + hu = 0 on the boundary gives c = and finally u == 0.
Section 8.1
1. See Figure 17.
3. See Figure 18.
5. u = sin(2x)e- 4t + 2 sin(3x)e- 9t .
7. u = sin(2x)e- 4t + sin(4x)e- 16t
9. Integrate cos(m - n)x - cos(m + n)x over [0, Jr] and use a trigonometric identity similar to the one of
the previous exercise.
Section 8.2
7. u ( x, t) = ,",00 8 . (2k
L..k=O Jr(2k+l) sm + 1)x e-(2k+1)2 t . See Figure 20.
9.
U= L00
- 2
k=l n(2k + 1)
sine 4k + 2)x e-(4k+ 2)
2
t +L
00
k=O n(2k
6
+ 1)
sin(2k + l)x e-(2k+l)
2
t.
13 . f( x ) -_,",00 sin(2kx)
L..k=l -2-k- + ,",00 (1
L..k=O 2k+l + Jr(2k+l)2
2(_l)k ) . (2k
sm + 1)x. Ii0 ven·fy tel
hd . k -
ennty ta ex - n
12
.
Section 8.4
1. u(x, t) = -1.
~ sin(3x)e- 9t
b ( ) -n b rr3 b -rr b
5. u (x, t)
24 2rr
= b0 + ,",CO
2t
Ln=l n cos nx e ,0 = 12' 2k = 2k2' 2k+1 = (2kH)4 rr - (2k+l)2'
13. U(x, t) = ~ + L~o ~k~;' cos(2k + l)x e-(2k+1)2r. Select x = a to obtain the identity
15. For)" = fJ.,2, X = A cos fJ.,X + B sin fJ.,X and the boundary conditions lead to B = a and sin fJ.,JT = O.
Hence X = cos fJ.,X and fJ., = n for some integer n ::: 1. For)" < a there is no nontrivial solution X. For
)" = 0, X = A + Bx and B = O.
Section 8.5
1. Substitutef(x) into Euler's formulas to obtain bo = JT/4, bn = ((_1)n -l)/(JTn2 ), an = (_l)n+1/n.
3. Apply the theorem for x = 0 to Exercise 1.
5. Apply the theorem with x = 0 to Exercise 2.
//"--'~"-
I ///
~ -----"
7. Let U be the difference of two possible solutions. Then u satisfies the ring problem withf = O. Define
I(t) = i f~;r u 2 (x, t)dx. Show I'(t) == 0, hence I(t) = 1(0) = 0, and then u == O.
Section 8.6
1. Apply induction on n. The case n = 1 duplicates the periodicity hypotheSiS. Handle negative n similarly.
7. Write the integral as the sum of two integrals and change variables x = -t in one of them.
13. To evaluate the integral f"!.;r cos(ku)du, use orthogonality of f(x) = cos(Ox) = 1 and g(x) = cos(kx)
on [-n,nJ.
15. Start with the second e'}:tation in the theorem. Isolate F(x) in this formula and replace the integral by
f:
its eqUivalent fraction 2n+l ,where M is (-1) times the integral whose integrand contains Gl Estimates
of the size of M are obtained from the calculus inequalities cose ::5 1 and If: f(x)dxl ::5 j(x)dx.
Section 8.9
1. (1) Both f and J' are continuous (1' (0) has to be computed from the Newton quotient definition). (2)
Nothing to do for continuous f. (3) The integral off is twice fooo xe- X dx.
3. (1) Classic definition of a piecewise infinitely differentiable function. (2) Nothing to do unless x = -1
or x = 1. (3) The integral off is 2 f~ x 2 dx.
9. The integration 4 fooo Si~1' cos(/hx)d/h is done from Example 1 in the text.
11. The indefinite integrals for A(/h) and B(/h) can be found in most integral tables.
ANSWERS AND HINTS TO SELECTED EXERCISES 701
13. Use the trigonometric identity 2 sin a cos b = sin(a+b) +sin(a-b) to integrate A(JL). Use 2 sin asin b =
cos(a - b) - cos(a + b) to integrate B(JL). The addition theorem for the cosine is used on the integrand
in the Fourier integral theorem to simplify
Section 8.10
2
1. The product solutions are B(JL) sin(JLx)e- 1l t. Superposition produces the solution u(x, t), and B(JL) is
determined by the Fourier sine integral theorem.
3. The odd extension of f is defined by f*(x) = f(x) for x 2: 0 and f-(x) = -f( -x) for x < O. To be
i
shown: (1) f* and its derivative are sectionally continuous; (2) f*(x) = ([*(x + 0) + f*(x - 0)]; (3)
The integral of f* over (-00, (0) is finite. The integral relation for f results from applying the Fourier
integral theorem of Section 8.9 to f*, which agrees with f on 0 ~ x < 00. By oddness of f*, A(JL) O. =
The equation for B can be Simplified to involve only f on the interval 0 ~ x < 00, because f* is odd.
Section 8.12
1. w(x)=x 4 ,[0,2].
3. w(x) = (l + xle"x, [0, 1]
5. Compute X~, Xn(O) and Xn(rr). They are shown to equal -AnXn , 0, 0, respectively. To verify that all
solutions are represented, solve the differential equation for the three cases A = -JL 2 , A = 0, and A = JL 2 ,
with JL > O. The third case produces eigenpairs A = n 2 , X = sin(nx) for n = 1,2,3, ....
7. Change variables by x = et , UCO = X(e t ). Then U" + AU = 0, U(O) = U(l) = O. Let t = s/rr,
yes) = U(s/rr). Then yll + rr- 2AY = 0, yeO) = Y(rr) = O. Apply a previous exercise to solve for Y and
hence U; then change variables back to x.
9. Apply the trigonometric identity sin nx sin mx = i [cos(n - m)x - cos(n + m)x].
11. The products X1X2W and X2X3W are each odd functions on [-1, 11; hence their integrals are zero. The
integral of X1X3W = (5x 4 - 3x 2 )/2 is twice its value over [0,1], because it is even. Direct integration
gives zero.
13. Substitute X(x)TCt) into the differential equation and assume X(x) TCt) i= 0 to obtain the two differential
equations for X and T. One issue is TCt) i= 0 for all t, which is settled by solving the differential
equation for T. This allows cancellation of TCt) in the boundary conditions on u, hence producing
alX(a) + a2X'(a) = 0, blX(b) + b2X'(b) = O.
Section 8.13
1. Eigenpairs are A = (2k - 1) 2/4, X = cos((2k - l)xl2) and A = k 2 , X = sin(h) for k = 1,2,3, ....
Orthogonality uses weight w(x) = 1, and the relation is I~" XpXqdx = 0, where Xp and Xq are two
different eigenfunctions from the above list.
3. A = (rrnlln 2)2, X = sin(nrr In(x)lln(2)), n = 1,2,3, ... , w(x) = x-I,
112 x-I sin(nrr In(x)lln(2)) sin(mrr In(x)lln(2))dx = 0 for n i= m.
5. A = n 2/4, X = sin(nx), w(x) = 4, g sin(nx) sin(mx) (4)dx = 0 for n i= m.
Section 8.14
1. Eigenpairs are A = (2k + 1)(2k + 2), X = P2k+l (x), k = 0,1,2, ... , where Pn is the nth Legendre
polynomial.
3. A direct approach can be carried out following the general method of Section 8.12. The functions Xn
and Xm satisfy differential equations; hence we multiply one differential equation by Xn and the other
by X m , and then subtract to give
Integrate over [0, 1]; then integrate by parts on the left. Finally, show that the left side is zero.
Section 9.1
1. U(x, t) = sin(2x)e-4t - sin(5x)e- 25t .
3. u(x, t) = ~ - ~ cos(2x)e- 4t .
17. Verify Kt = K (tr - t) and Kx = -2Kx1(4t), Kxx = K (tr - -t); then Kt = Kxx·
Section 9.2
I
I~ i
\
Section 9.3
1.
u= ~ t
rr k=O
sin«2k + l)x) sinh((2k + 1)y).
(2k + 1) sinh((2k + 1)rr)
3. 0.2499040972, n = 2 terms.
Uj = ~ t
rr k=O
sin((2k + l)x) sinh((2k + l)y)
(2k+ 1)sinh((2k+ l)rr) ,
704 ANSWERS AND HINTS TO SELECTED EXERCISES
7. Show that u = y/rr satisfies Laplaces equation and the boundary conditions. By uniqueness, it is the only
solution.
9.
11.
13.
15. Even though u is bounded, it fails to be true that Ux and uy are bounded. However, by direct calculation,
u satisfies the differential equation and boundary conditions. The boundary evaluation uses the formula
limy~o+ tan-leafy) = ~~.
19. Let w be the difference of the left and right sides of the identity. Then w satisfies the differential equation
with zero boundary conditions. By uniqueness (Chapter 7), w == O.
21. Let w be the difference of the left and right sides of the identity. Then w satisfies the differential equation
with zero boundary conditions. By uniqueness (Chapter 7), w == O.
I N _.2 ~
23. u(O, 0.5) ~ IT fo /+0.25 ~ = 0.6156903104 for N = v Sin 10 = 3.393l. The exact answer is
0.6156903442.
29. The alternating series error is controlled for k = 2, giving vCO.l,rr/2) = 112 + 1495031750000rr =
0.5634510439.
35. The first two equations are verified by direct calculation of the left side. The last equation uses v
independent of e. Unboundedness of the logarithm is a basic property developed in calculus.
37. Product solutions v = R(r)e(e) satisfy for separation constant Je the equations r2R" + rR' - JeR = 0,
e" + Jee = 0, where e is 2rr-periodic and R is bounded at r = 00. Then R = qrn and e =
aI cos ne + bl sin ne, Je = n2 These formulas also apply for Je = O.
ANSWERS AND HINTS TO SELECTED EXERCISES 705
39.
Ao = - 1
2rr
1 0
2,,-
f(O) dO
a -nAn =-11
rr 0
2 ,,-
f(O) cos (nO) dO
Section 9.4
1.
U = sin 3x sin 5y e-(9+25)t - sin 4x sin 6y e-(l6+36lt.
3.
00 00 41+(-l)m+l1+(-l)n+l 2 2
U= " ,,- sinmxsinnye-(m +n )t.
L L rr2 m n
m=l n=l
5.
U = 4 LL
00 00 (_l)m+n
sin mx sin ny e-(m +n
2 2
)t.
m=l n=l mn
7.
_ 8 ~ ~ [1 + (-l)m+l][1 + (_l)n+1]. . _(m2+n2)t
U - 2 L L 3 Sill mx Sill ny e .
rr m=ln=l m n
9. Let u(x,y, t) be defined on domain D: 0 :::: x :::: L, 0 :::: y :::: M, t ::: O. Assume Ut = Uxx + U yy on D,
U = 0 on the boundary of D, and u(x,y, 0) = f(x,y) on 0 :::: x :::: L, 0 :::: y :::: M.
11. The details for I mn duplicate Problem 9.4.1. The other relation follows from the previous exercise and
A = It + v.
13. The problems are X" + ItX = 0, X'(O) = X'(rr) = 0, y" + vY = 0, y'(O) = yl(rr) = 0, with
A = It + v. Then It = m 2 , v = n2
15. Apply superposition to the product solutions of the previous exercises.
l7. The problems are X" + ItX = 0, X' (0) = X' (rr) = 0, yll + vY = 0, yeO) = y(rr) = 0, with A = It + v,
m = 01,2, ... and n = 1,2,3, ... Then It = m2 , v = n 2 In the t-variable, I' + AI = 0, rcO) -10.
19. Apply superposition to the product solutions of the preceding two exercises.
5. Let U be defined over the rectangle D: 0 :::s x :::s L, 0 :::s Y :::s M. Require Utt =
Uxx + Uyy on D for t ~ 0,
U = 0 on the boundary of D for t ~ 0, u(x,y, 0) = J(x,y), Ut(x,y, 0) = g(x,y) on D. Product solutions
X(x)Y(y)T(t) satisfy the problems X" + /LX = 0, X(O) = X(L) = 0, y" + vY = 0, yeO) = Y(M) = 0,
T" + AT = 0, nO) f. 0, A = /L + v. Then /L = (mrr/L) 2, v = (nrr/M) 2, Amn = (mrr/L)2 + (nrr/M) 2,
Xm = sin(mrrxlL), Yn = sin(nrryIM), Tmn = Amn cos(~t) + Bmn sin(~t), and
00 00
U= L LTmn(t)Xm(X)Yn(y),
m=l n=l
where
7. Let Q be the region 0 :::s r :::s a in polar coordinates. Assume that Utt = c2 (u rr + ~ur) on Q for t ~ 0,
u(a, t) = 0 for t ~ 0, u(r,O) = 0 for 0 :::s r < a, ut(r, 0) = g(r) on Q for t ~ 0, and u(r, t) is finite.
Product solutions are R = ]o(5 nrla) and T = sin(5nct). Then u = L~1 Cn.!O(5nrla) sin(5nct) with
Cn = (2I]r(5 n)) JJ g(ax)]o(5 nx)xdx. The solution u(f,Ol is obtained by differentiation as ui°J).
9. The roots in 0 :::s r < 1 of]O(53r) = 0 are exactly at 53r = 51 and 53r = 52.
Section 9.6
1.
sinh ( -v'srrz)
u = sin(rrx) sin(2rry) -v's'
sinh ( 5rr)
3.
sinh(v'4Irrz)
u = sin(4rrx)sin(5rry) r;;, .
-v41rr
ANSWERS AND HINTS TO SELECTED EXERCISES 707
5.
7. Verify it from
Cn 2n+-1
= -
2
(11 -1
Pn(x)dx - 2 1°)
-1
Pn(x)dx,
which gives c2k = 0 and c2k+! = (4k + 3)fd P2k+l(X)dx. In particular, Cj = 3/2, C3 = -7/8,
C5 = 11/16. Bonnet's theorem can be used to show that Pn+l (0) = -(n/(n + l))Pn-l (0), and then
f01 Pn(x)dx = Pn-l (O)l(n + 1) results in
4k + 3 (_1)k(2k)!
+
c2k+l = 2k 2 4k(kl)2 .
Section 9.7
1.
3.
5. u(x, t) = sin net - x) for 0::'0 x < t, u(x, t) = 0 otherwise. See Figure 29.
7. u(x, t) = -it2 - -i (t - xld H(t - xlc)
[A-S] M. Abramowitz and I.A. Stegun, Handbook of Mathematical Functions, New York, Dover
Publications, 1965.
[B-M] G. Birkhoff and S. Mac Lane, A Survey of Modem Algebra, 4th ed., New York, Macmillan, 1977.
[B-McG] PW Berg and].L. McGregor, Elementary Partial Differential Equations, Holden-Day, 1966.
[B-R] G. Birkhoff and G.C Rota, Ordinary Differential Equations, 3rd ed., New York, Wiley Publishing
Co., 1978.
[C-L] E.A. Coddington and N. levinson, Theory of Ordinary Differential Equations, New York, McGraw-
Hill, 1955.
[Ch-B2] R.V Churchill andj.W Brown, Complex Variables and Applications, 5th ed., New York, McGraw-
Hill, 1990.
[Ch-B] R.v Churchill and].W Brown, Fourier Series and Boundary Value Problems, 3rd ed., New York,
McGraw-Hill, 1978.
[Ch-K] W Cheney and D. Kincaid, Numerical Mathematics and Computing, 2nd ed., Monterey,
Brooks/Cole Publishing Co., 1985.
[Churl R.v Churchill, Operational Mathematics, 3rd ed., New York, McGraw-Hill, 1972.
[D-S] H.F. Davis and A.D. Snider, Introduction to Vector Analysis, 6th ed., Wm. C Brown Publishers,
1991.
[DenH] JP Den Hartog, Mechanical Vibrations, 4th ed., New York, Dover Publications, 1985.
[ER] A. Erdelyi et aI., Tables of Integral Transforms, Volumes I and II, McGraw-Hill, 1954.
[Fr] A. Friedman, Advanced Calculus, Holt, Rinehart and Winston, 1971.
[Ga] PR. Garabedian, Partial Differential Equations, New York, John Wiley and Sons, 1964.
[He] P Henrici, Elements of Numerical Analysis, New York, Wiley Publishing Co., 1965.
Ua] D. Jackson, Fourier Series and Othogonal Polynomials, Mathematical Association of America, 1941.
IKe] ].P Keener, Principles of Applied Mathematics, Addison-Wesley, 1988.
IKrey] E. Kreyszig, Advanced Engineering Mathematics, 7th ed., John Wiley and Sons, 1993.
[Mcl] N ,W Mclachlan, Bessel Functions for Engineers, 2nd ed., Oxford, Clarendon Press, 1961.
[No] B. Noble, Applied Linear Algebra, 3rd ed., Prentice Hall, 1988.
[Prj WHo Press et aI., Numerical Recipes, London, Cambridge University Press, 1986.
710 REFERENCES
Regular domain decomposition, 420 heat diffusion and Fourier series, 474
Regular domain in 3-space, 425 insulated at the ends, 455
Regular singular point, 189 nonuniform, 455
Reservoir product solutions, 474
consistency of volume data, 38 radiation at the ends, 455
depth data, 27 separation constant, 475
draining times, 90 series truncation, 559
filling of, 2 sine series, 559
numerical differentiation, 35 superposition, 477
sample depth and volume data, 30 truncation error, 559
water stored, 29 uniform convergence, 559
Resistive network, 283 uniqueness, 454, 486
Resonance, 174 zero heat flux, 486
Riccati, 140 Rod finite
equation, 115 ends insulated, 560
method, 164 heat content, 512
Richardson extrapolation, 33, 34, 40 mixed, 562, 565
Riemann's theorem, 505 mixed tangent equation, 564
Riemann-Lebesgue theorem, 526 nonhomogeneous, 533
Right-hand and left-hand limits, 481 numerical bounds and error, 561
Right hand rule, 376 separation, 534
Right-moving pulse, 588 Sturm-Liouville series, 537
Ring Rod infinite
eigenpairs, 566 alternate fonnula, 573
error estimates, 566 cosine integral, 573
orthogonality, 495 double integral, 573
periodic boundary conditions, 493 eigenvalue, 512
periodic conditions, 492 error function, 574
periodiC eigenvalue problem, 493 heat diffusion, 512, 572
problem, 492, 565 no explicit boundary conditions, 512
solution of Fourier's problem, 496 separation, 512, 572
superposition, 495 superposition, 513
uniqueness, 500 Rod Neumann
Rk4, see Runge-Kutta, 135 constant initial gradient, 489
Robin problem, 462 constant initial temperature, 488
Rocket, 122, 163 problem, 486-488
Rocket launch, 112 sawtooth, 490
Rod sectionally constant, 489
boundary value problem, 454, 474 superposition, 487
cosine series, 559 Rod semi-infinite
cosine series and orthogonality, 487 boundary at zero temperature, 576
cross-sectional area, 512 Dirichlet diffusion, 531
Dirichlet eigenvalue problem, 475 integral formula, 576
ends at zero, 558 Neumann problem, 531
even and odd extensions, 559 Rod semi-infinite, 529
fundamental solution K, 575 Dirichlet diffusion, 529
INDEX 725
t (N)a·~-
n=1
(a+b)N = •
• =0 k
Series formulas.
1 00
-=LU"'
I-u "=0
lul<I
00 x"
e'=L-
n=O n!
• 00 (-I)"x2"+!
smx = "
;0 -'-;::-'-:--:-:-.,-
(2n + I)!
00 (-I)"x2"
cosx=L---
n=O (2n)!
"=0 n+I
' -1 < x ~ 1
tan
-I
x= L00
"=0
(-I)"x2 "+!
2n+I
' Ixl ~ 1
Derivative formulas.
(u")' = nu·-1u'
(uv)' = U'y + Uy'
( ~)'
Y
= U'y - Uy'
y2
(eo)' = u'e"
(In u)' = u'/u
(sin u)' = u' cos u
(cos u)' = -u' sin u
(tan u)' = u' sec 2 u
(cot u)' = -u' csc 2 u
(sinh u)' = u' cosh u
(cosh u)' = u' sinh u
(arcsin u)' = u'(l- U 2 )-112
(arctan u)' = u'/(I + u2 )
Trigonometric formulas.
sin(-O) = -sine, cos(-9) = cos9
sin'9+cos'e = 1
sin (a + b) = sin a cos b + sinbcosa
cos (a + b) = cos a cos b - sinasinb
tan (a + b) = tan a + tanb
1- tanatanb
sin' u = 21 (1 - cos(2u»
1
cos' u = 2(1 + cos(2u»
u I-cosu
tan- = - - -
2 sinu
e"'=cos9+isin9 ( i = H )
Hyperbolic functions.
Integral formulas.
J cotxdx=lnlsinxl+C
J I
secxdx = In secx + tanxl + C
Jx'+a'
~ ~arctan~+C
=
a a
J.... x'+a'
~=sinh-l~+c
a
J'/a'-x' = a +
_dx
__ arcsin ~ C
J Inxdx=xlnx-x+C
J e""sin(bx)dx= a2:b2(aSinbx-bcosbx)+C
f t!"'·
e"" cos(bx)dx = a' + b' (acosbx + bsmbx) + C