0% found this document useful (0 votes)
3 views

Functions of more than one variable

Uploaded by

Rayhan
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Functions of more than one variable

Uploaded by

Rayhan
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

3 Functions of More Than One Variable

Learning Outcome:

3. Understand single-valued functions of two or three variables and their derivatives,


perform associated computations, and apply understanding and computations to
solve problems.

3.1 Introduction
During the first and second terms of calculus you worked with functions of one variable, which were
usually denoted y = f (x) (for example, y = x2 ). We could think of such a function as a “machine”
named f , with the variable x being the input to the machine and the variable y being the output.
We will now investigate functions of more than one input variable; the ones with which we will work
will have the form
z = f (x, y) or w = f (x, y, z).
Examples would be things like
f (x, y) = x2 + y 2 and w = xyz + 3y + z 2 .
Consider the function w. There are three input variables x, y and z, and one output variable,
w. We say that w is a function of three variables, the number of input variables. Similarly, f is
a function of two variables. OK, so those two examples are bit abstract. Some more concrete (and
practical!) examples would be the following:
• The perceived temperature (“wind-chill”) depends on the actual air temperature and the speed
of the wind.
• The temperature in a well depends on the depth of the well and the time of year.
• The pressure at a point in a room depends on the location (x, y, z) of the point.
• The temperature on a plate of metal depends on the location (x, y) of the point.

In the first two semesters of calculus the two main themes were differentiation and integration of
functions of one variable. We will now take up differentiation and, a bit later, integration of functions
of more than one variable.

3.2 Representing Functions of More Than One Variable

Learning Outcome:

3. (a) Determine a function value of a function of more than one variable from an
equation, a table of values, or a level curve plot.

In the past, when you were given a function of one variable, you often considered several repre-
sentations of such a function: an equation of the function, a table of corresponding input and output
values, a graph of the function. Each has their own advantages and disadvantages. Exact function
values can be obtained from either the equation or the table, but the table has a limited number of
values, whereas the equation can be used to get an unlimited number of input-output pairs. Exact
input-output values are impossible to obtain from a graph, but the overall behavior of a function can
be obtained quickly from looking at a graph.
Functions of three or more variables are challenging to represent in any form other than an equation,
so we will often focus on functions of two variables, which are more easily represented. There are four
common representations of such functions; we will use three of the four regularly.
• An equation can be given, from which function values can be obtained by simply inserting input
values and obtaining corresponding output values. The two equations f (x, y) = x2 + y 2 and
w = xyz + 3y + z 2 are examples of this.
An equation can easily be used to describe a function of any number of variables. The following three
methods are only possible for functions of two variables.

• A table of values can be given. Such a table lists values of one input variable horizontally across
the top or bottom of the table and values of the other input variable along one of the sides (usually
the left side). The output value is for a given pair of input values is then located in the cell of the
table where the row and column of the two input values intersect. An example is the table below,
which gives (in the “body” of the table) the wind-chill temperature W (in degrees Fahrenheit)
for various actual temperature/wind speed combinations (in that order). Wind speed is in miles
per hour and actual temperature is given in degrees Fahrenheit.

Actual Temperature (◦ F )
Wind Speed (mph)

T −10 0 10 20 30 40
v
5 −22 −11 1 13 25 36
10 −28 −16 −4 9 21 34
15 −32 −19 −7 6 19 32
20 −35 −22 −9 4 17 30

• A level curve plot (contour plot) is essentially a “map” of the two input variables, with points
having certain fixed output values connected by curves. The one below represents temperatures
T on a rectangular sheet of metal. The temperatures are in degrees Fahrenheit, and the distances
are in feet (with x-values being on the horizontal axis and y-values on the vertical, as usual).

40
10 90 50
80 60
70
70

60
5
50 90
80

40

30

5 10
• A surface plot, which is a three-dimensional representation of the function, where the input
values represent points in the xy-plane and the output values are z-values. When the z-values
are plotted above (or below) their corresponding x and y values, a surface is obtained. Such
plots are valuable for seeing the behavior of the function, but they are challenging to draw by
hand, and almost impossible to get input-output values from. We will rarely (never?) use these.
For examples of these, see page 792 1st Ed., page 779 2nd Ed.

3.3 Average Rates of Change

Learning Outcome:

3. (b) Determine the average rate of change of a function of two or three variables
from one point to another from either a level curve graph or the equation of
the function.
• Examples: 780: 8, 9 • Exercises: 786: 40, 41, 43, 45
Answer to 786: 40 is −1.67 × 10−4 ppt/◦ C

Looking at either the table of values or the level curve plot from the previous page, it is clear that
the values of the functions (wind chill temperatures for the table, temperatures on the metal plate for
the level curves) vary as the input values change. The main focus of our next couple weeks will be
the rates at which the function values change as the input values change. We will begin by calculating
average rates of change as one or the other (or perhaps both) of the input values change. Suppose that
for a function f (x, y) we go from inputs of (x1 , y1 ) to (x2 , y2). Then the corresponding outputs go
from f (x1 , y1 ) to f (x2 , y2). The average rate of change of the function is then the change in output
divided by the change in input, which we show symbolically like this:
f (x2 , y2) − f (x1 , y1)
Ave ROC =
“distance” from (x1 , y1) to (x2 , y2)
x1 , y1 , x2 and y2 should all be the appropriate numbers, and instead of f you should use the
letter for whatever function you are working with. The “distance” from (x1 , y1) to (x2 , y2) will
always be either a subtraction or an application of the Pythagorean Theorem. I’ve put distance in
quotes because in many cases it will not actually represent a geometric distance, but simply a distance
between numbers. Note that we generally find the difference by taking the final values minus the initial
values. Here are a few examples, demonstrating the way that I’d like you to show your calculations:
⋄ Example 3.3(a): For the temperature function with level curves shown on the previous page,
find the average rate of change of the temperature from (7, 1) to (7, 5).

T (7, 5) − T (7, 1) 65 − 38
Ave ROC = = = 6.75◦ F/foot ♠
5−1 4

⋄ Example 3.3(b): For the function given by the table on the previous page, find the rate of
change in wind-chill from a actual temperature/wind speed combination of 0◦ F and −10 mph
to 0◦ F and −20 mph.

W (0, −20) − W (0, −10) −22 − (−16)


Ave ROC = = = −0.6◦ F/mph ♠
60 − 30 10
Note that in this last example the “distance” is really the change in wind speed.
In both of the above example we have held one of the input variables fixed while the other changes.
In the next example we let both of the input variables vary. This only makes sense to do when both
input variables measure the same quantity and have the same units. (In this case the quantity is true
distance, measured in feet.

⋄ Example 3.3(c): For the temperature function from Exercise 1 of Assignment 8, find the average
rate of change in temperature from (5, 6) to (9, 1).
T (9, 1) − T (5, 6) 60 − 75
Ave ROC = p = = −2.3◦ F/foot ♠
2
(9 − 5) + (1 − 6) 2 6.4
p
The quantity (9 − 5)2 + (1 − 6)2 , is just the distance from (5, 6) to (9, 1), calculated using the
distance formula for two dimensions.

⋄ Example 3.3(d): In some location the temperature T in degrees Fahrenheit of the ground at
a depth of d feet and at time t days from some zero date is modeled by

T (d, t) = 30e−0.2d cos(0.0172t − 0.2d) + 60 .


Find the average rate of change of temperature at time fifty days, from a depth of 3 feet to a
depth of 40 feet.

T (40, 50) − T (3, 50) 60.0066 − 75.9110


Ave ROC = = = −0.4282◦ F/foot ♠
40 − 3 37

3.4 Partial Derivatives

Learning Outcome:

3. (c) Find and interpret (give location, direction, change in dependent variable per
unit of change in independent variable) partial derivatives of a function at a
point.
• Examples: 794: 1, 2, 3, 7
• Exercises: 802: 13 - 33 odd, 37, 41 43, 57, 59, 61

Consider Example 3.3(d). Suppose that we next computed the average rate of change from
(3, 50) to (10, 50), then from (3, 50) to (5, 50), then from (3, 50) to (3.5, 50), from (3, 50) to
(3.1, 50), and so on. Note that we are holding the day fixed at 50 and the initial depth at 3 feet.
The final depth, however, is getting closer and closer to 3 feet. What we should see is the average
rates of change approaching some value, which is the instantaneous rate of change of temperature with
respect to depth.
The process of computing all those average rates of change would get a bit tedious; fortunately it is
not necessary. All we need to do is compute the partial derivative of temperature T with respect
∂T
to depth d, denoted by either or Td (d, t). The computation of the partial derivative I’ll leave
∂d
to class and your textbook, but it turns out to be
Td (d, t) = 6e−0.2d sin(0.0172t − 0.2d) − 6e−0.2d cos(0.0172t − 0.2d)
= 6e−0.2d [sin(0.0172t − 0.2d) − cos(0.0172t − 0.2d)]
The reason for two parts is that we need to apply the product rule in order to obtain the derivative.
If we now evaluate Td (3, 50) we obtain Td (3, 50) = −2.34◦ F/ft. What does this tell us? Well,any
interpretation needs to include the following things:
(i) the physical quantity that we are investigating (output variable of the function)
(ii) the rate at which the quantity is increasing or decreasing
(iii) the input variable in which the change is occurring
(iv) the “place” where that rate of change is occurring
For our example we might say
On the fiftieth day, the temperature at a depth of three feet is decreasing with depth at a rate
of 2.34 degrees Fahrenheit per foot.
The physical quantity being studied is temperature, and the rate of change and “location” should be
clear. The variable in which the change is occurring is indicated by the words “with depth” (and,
less obviously,by the units). By convention, the rate of change for the output variable is always for a
positive change in the input variable.

3.5 Partial Differential Equations

Learning Outcome:

3. (d) Determine whether a given function is a solution to a given partial differential


equation.
• Examples: 800: 11 • Exercises: 804: 76, 77, 79, 80, 82

3.6 Partial Derivatives for a Discrete Function

Learning Outcome:

3. (e) Use a forward, backward or centered difference to approximate a partial


derivative of a function of two variables at a point, when the function is
given in table form.

Suppose that we desire a partial derivative of a function of two variables that is represented with a
table. It is impossible to determine the value of such a partial derivative, but we can find an average
rate of change that approximates the partial derivative. When approximating partial derivatives from
tables, there are three approximations we can get, a forward difference approximation, a backward
difference approximation, and a centered difference approximation:

• Forward Difference Approximation: Here we simply compute an average rate of change from
the point where we want the partial derivative to the next data point in the direction of increase
in the changing input variable.
• Backward Difference Approximation: In this case we compute an average rate of change
from the point “before” (in the sense of the changing input variable) where we want the partial
derivative to the point where we want the partial derivative.
• Centered Difference Approximation: This is just the average of the forward difference and
backward difference. It is equivalent to the average rate of change from the “before” point to the
ahead or “after” point.
Let’s look at some examples:

⋄ Example 3.6(a): For the wind-chill function, find the forward difference approximation of
WT (20, 10).
W (30, 10) − W (20, 10) 21 − 9
WT (20, 10) ≈ = = 1.2◦ F/◦ F ♠
30 − 20 10
Note that 30 is “forward from 20, so the forward approximation is as T goes from 20 to 30.
To obtain the backward difference, we need to compute the average rate of change as T goes from
10 to 20:

⋄ Example 3.6(b): For the wind-chill function, find the backward difference approximation of
WT (30, −15).
W (20, 10) − W (10, 10) 9 − (−4)
WT (20, 10) ≈ = = 1.3◦ F/◦F ♠
20 − 10 10

⋄ Example 3.6(c): For the wind-chill function, find the centered difference approximation of
WT (20, 10).
The centered difference approximation is just the average of the forward and backward differences:
1.2 + 1.4
WT (20, 10) ≈ = 1.3◦ F/◦ F
2
It can also be found by taking the average rate of change from (10, 10) to (30, 10):
W (30, 10) − W (10, 10) 22 − (−4)
WT (20, 10) ≈ = = 1.25◦ F/◦ F ♠
30 − 10 20

3.7 Linear Approximations of Functions of Several Variables

Learning Outcome:

3. (f) Find and use local linear approximations (“linearizations”) of functions of


one, two and three variables. Find vectors and a plane tangent to a surface
at a point.
(g) Find the error or percent error for a linear approximation.
• Examples: 807: 1, sort of • Exercises: 811: 17, 19, 21

The basic idea here is this: Consider again T (x, y) as the function giving the temperature T at
any point (x, y) on a sheet of metal, and suppose we know that
T (4, 5) = 78.3◦ F, Tx (4, 5) = −0.3◦ F/ft, Ty (4, 5) = 0.2◦ F/ft
What if we wanted to know the temperature at (4.7, 5.4)? Since this point is close to (4, 5), the
temperature should be somewhat close to 78.3◦ F.
The diagram to the right shows the two points under considera-
tion, and it shows how we can move from (4, 5) to (4.7, 5.4) by
going ∆x = 0.7 feet in the positive x-direction and then ∆y = 0.4
feet in the positive y-direction. Now we note that as we pass through y=6
(4, 5) in the positive x-direction the temperature is decreasing by
−0.3◦ F/ft, so as we move by ∆x = 0.7 feet we would expect the (4.7, 5.4) ∆y = 0.4
temperature to change by approximately
∆Tx = (0.7ft)(−0.3◦ F/ft) = −0.21◦ F (4, 5)
y=5

Similarly the change in temperature in the y-direction should be x=4 x=5


∆x = 0.7
∆Ty = (0.4ft)(0.2◦ F/ft) = 0.08◦ F
We can then approximate the temperature at (4.7, 5.4) by adding these tow changes to the
temperature at (4, 5):
T (4.7, 5.4) ≈ T (4, 5) + ∆Tx + ∆Ty = 78.3 + (−0.21) + 0.08 = 78.17◦ F
The reason this is an approximation rather than an actual value is that the rates of change only hold
right at (4, 5), but they probably don’t change much if we don’t get too far away from (4, 5). There
is a way to get a quantitative idea of what the error might be, but it fairly technical. At this point,
one should just try to understand what has happened here.
What we have done is use something called the linear approximation of the function T at (or
sometimes we say “near”) (4, 5). The formal way of defining the linear approximation f a function
f of two variables at a point (x0 , y0 ) is

f (x, y) ≈ f (x0 , y0 ) + fx (x0 , y0 )∆x + fy (x0 , y0)∆y


for any point (x, y) near (x0 , y0). Note what this says: To get a function value at a point
near (x0 , y0) we begin with the function value at that point, f (x0 , y0). We then adjust for the
point (x, y) by adding the changes in the x- and y-directions, which are given by the two terms
fx (x0 , y0)∆x and fy (x0 , y0)∆y.
We can actually create a new function L(x, y) that gives us our approximations near (x0 , y0 ),
using the facts that
∆x = x − x0 and ∆y = y − y0
This new function looks like

L(x, y) = f (x0 , y0 ) + fx (x0 , y0)(x − x0 ) + fy (x0 , y0)(y − y0 )


For our example above we would have
L(x, y) = 78.3 + (−0.3)(x − 4) + (0.2)(y − 5) = 78.5 − 0.3x + 0.2y
The last expression is obtained by distributing and combining like terms. The function L is called
the linearization of f at the point (x0 , y0):
Linearization of a Function at a Point
The linearization of a function f (x, y) at a point (x0 , y0) is the function
L(x, y) = f (x0 , y0 ) + fx (x0 , y0)(x − x0 ) + fy (x0 , y0)(y − y0 )

For points (x, y) “near” (x0 , y0 ), L(x, y) ≈ f (x, y). A similar formula
holds for functions of three or more variables.

⋄ Example 3.7(a): Give the linearization of f (x, y, z) = x2 y−3xz+y 2 at the point (2, 5, −1) and
use it to get an approximate value of f (2.07, 4.95, −1.03).
Because we have three variables, the linearization will look like
L(x, y, z) = f (2, 5, −1) + fx (2, 5, −1)(x − 2) + fy (2, 5, −1)(y − 5) + fz (2, 5, −1)(z − (−1)) (1)

Doing some calculations gives us


fx (x, y, z) = 2xy − 3z, fy (x, y, z) = x2 + 2y, fz (x, y, z) = −3x
and
f (2, 5, −1) = 51, fx (2, 5, −1) = 23, fy (2, 5, −1) = 14, fz (2, 5, −1) = −6.

Substituting into (1), the linearization is then


L(x, y, z) = 51 + 23(x − 2) + 14(y − 5) − 6(z + 1)
= 51 + 23x − 46 + 14y − 70 − 6z − 6
= −71 + 23x + 14y − 6z

The approximate value of f (2.07, 4.95, −1.03) obtained from the linearization is then
f (2.07, 4.95, −1.03) ≈ L(2.07, 4.95, −1.03) = −71 + 23(2.07) + 14(4.95) − 6(−1.03) = 52.09 ♠

Let’s make it clear that the linearization does not give us exact function values except at the point
where the linearization was done. However, the values it gives us should be close to the function values
as long as we are at points “near” the point where the linearization was done. Exactly how near we
need to be to be how close is a subject for another class! The absolute value of the difference between
the actual function value at a point and the value given by the linearization at the same point is
usually called the error. What is more useful than the error, however, is the percent error, which
we’ll define by
|actual value − approximate value|
percent error = × 100
|actual value|
⋄ Example 3.7(b): Find the error and percent error when using the linearization of the function
f (x, y, z) = x2 y − 3xz + y 2 from Example 3.7(a) at (2, 5, −1) to approximate the value of
f (2.07, 4.95, −1.03).
The actual value of f (2.07, 4.95, −1.03) is
f (2.07, 4.95, −1.03) = (2.07)2(4.95) − 3(2.07)(−1.03) + (4.95)2 = 52.109055,
so the error is

|f (2.07, 4.95, −1.03) − L(2.07, 4.95, −1.03)| = |52.109055 − 52.09| = 0.019055


The percent error is
|f (2.07, 4.95, −1.03) − L(2.07, 4.95, −1.03)|
× 100 = 0.037% ♠
|52.109055|

⋄ Example 3.7(c): The temperature T (in degrees Fahrenheit) at each point (x, y) (both in
feet) on a plate of metal is a function of where on the plate one is; that is, T = T (x, y). Suppose
we know that
T (4, 2) = 93◦ F, Tx (4, 2) = −3◦ F/ft, Ty (4, 2) = 2◦ F/ft
Use this information to approximate the temperature at (4.1, 1.7).
There are two approaches we can use here, one utilizing the linearization of the function and the other
a bit less formal. Using the second approach, the temperature at (4.1, 1.7) should be approximately
the temperature at (4, 2), but with an adjustment based on each of the partial derivatives. The
adjustment in the x-direction will be −3◦ F/ft × 0.1ft = −0.3◦ F and the adjustment in the y-
direction will be 2◦ F/ft×−0.3ft = −0.6◦ F. The temperature at (4.1, 1.7) will then be approximately
93 − 0.3 − 0.6 = 92.1◦ F.
The more formal approach is to find the linearization
L(x, y) = 93 − 3(x − 4) + 2(y − 2) = 101 − 3x + 2y
and, using it, obtain

f (4.1, 1.7) ≈ L(4.1, 1.7) = 101 − 3(4.1) + 2(1.7) = 92.1◦ F ♠


3.8 Directional Derivatives

Learning Outcome:

3. (h) Find and interpret (location, direction, change in dependent variable per unit
of change in independent variables) directional derivatives of a function at a
point.
(i) Determine the direction in which a function has the greatest rate of increase
or decrease at a point, and give that rate. Determine the directions from a
point in which a function remains constant.
• Examples: 818: 6(b), 7 • Exercises: 823: 21 - 33 odd

Recently we have investigated partial derivatives, which represent the rates of change of a function
as we travel in the positive x or y direction. You may have wondered why we don’t determine the
rate of change of a function in some other direction. Well, now we are going to do that; such a rate of
change is called a directional derivative. The authors of the two books we are referring to define this
in slightly different ways than each other; we’ll go with the definition that I’m going to give here.
Recall the definition of a derivative of a function f of one variable at a point x0 :
f (x0 + h) − f (x0 )
f ′ (x0 ) = lim .
h→0 h
f (x0 + h) − f (x0 )
Note that the quantity is the average rate of change of the function from a point
h
x0 to the point x0 + h. If we then look at such rates of change for h getting closer and closer to
zero, the values we get approach some number that we call the derivative of f at x0 , denoted by
f ′ (x0 ).
Now suppose that we have a function f of two variables. The partial derivative of f with respect
to x at a point (x0 , y0) is then defined by
f (x0 + h, y0 ) − f (x0 , y0 )
fx (x0 , y0 ) = lim .
h→0 h
f (x0 + h, y0 ) − f (x0 , y0 )
Here the quantity is the average rate of change of f from (x0 , y0 ) to
h
(x0 + h, y0 ). Note that the y value of the function is fixed at y0 , and only the x value changes as
we move from the first point to the second. The limit as the distance moved is reduced to zero is then
the derivative of f with respect to x at (x0 , y0 ).
Suppose now that we want to find the derivative of a function f of two variables in the direction
of a vector v = ai + bj. The idea is this: If we place the tail of some scalar multiple hv of the vector
v at the point (x0 , y0 ), its tip will be at the point (x0 + ha, y0 + hb). We then compute the average
rate of change of the function from (x0 , y0 ) to (x0 + ha, y0 + hb) and take its limit as h goes to
zero. The notation we use for this derivative is Dv f (x0 , y0 ), and from this discussion we get that it is
f (x0 + ha, y0 + hb) − f (x0 , y0 )
Dv f (x0 , y0 ) = lim
h→0 h
The value obtained is independent of the magnitude of the vector v, depending only on its direction.
The diagram to the right shows the point (x0 , y0 ) and the points (x0 + a, y0 + b)
(x0 + ah, y0 + hb) for several values of h. Recognize that the notation
h=1
Dv (x0 , y) contains four pieces of information:

• The letter D tells us it is a derivative.


1
• f is the function that it is a derivative of. h= 2

• (x0 , y0 ) is the point where the derivative is being taken. h= 1


4

• v gives the direction in which the derivative is taken. (x0 , y0 )

OK, so that’s all fine and dandy, but how do we go about actually computing directional derivatives?
The key to this is a vector function, called the gradient associated with the function f . The notation
for the gradient function is ∇f (x, y). It is a function of the same number of variables as f , but its
“output” is a vector with the same number of components as the variables of f . We will use it to
find directional derivatives, but it has some special characteristics also.

The Gradient of a Function


The gradient of a function f (x, y) at a point (x0 , y0) is the vector
∇f (x0 , y0 ) = fx (x0 , y0 )i + fy (x0 , y0 )j
The gradient vector at a point has the following characteristics:
• It is perpendicular to the level curve (or, for a function of three vari-
ables, the level surface) through the point (x0 , y0).
• It points in the direction of greatest increase of the function at the
point (x0 , y0 ).
• Its magnitude is the greatest rate of increase that can be experienced
when passing through the point (x0 , y0 ).

Functions of three (or more) variables have gradients that are defined in the same way, but with the
additional components corresponding to the additional variables.
Here’s how the gradient is used to calculate directional derivatives:

Computing a Directional Derivative


The directional derivative of f (x, y) at (x0 , y0 ) and in the direction
of a unit vector u is
Du f (x0 , y0 ) = ∇f (x0 , y0) · u

Note that one must use a unit vector when computing a directional derivative this way.
⋄ Example 3.7(b): Find the directional derivative of f (x, y, z) = x2 y − 3xz + y 2 at the point
(2, 5, −1) in the direction of the vector v = h2, −2, 1i.
Doing some calculations gives us
fx (x, y, z) = 2xy − 3z, fy (x, y, z) = x2 + 2y, fz (x, y, z) = −3x
and
fx (2, 5, −1) = 23, fy (2, 5, −1) = 14, fz (2, 5, −1) = −6,
so ∇f (2, 5, −1) = h23, 14, −6i. We now need the unit vector u in the direction of v:
 
v h2, −2, 1i 2 2 1
u= =p = ,− ,
kvk 22 + (−2)2 + 12 3 3 3
The directional derivative is then
 
2 2 1 12
Du (2, 5, −1) = ∇f (2, 5, −1) · u = h23, 14, −6i · ,− , = =4 ♠
3 3 3 3

3.9 Relative Minima and Maxima of Functions of Several Variables

Learning Outcome:

3. (j) Find relative minima and maxima of a function of one or two variables. (This
includes both the function values and where they occur.)
• Examples: 836: 3, 4 • Exercises: 843: 7 - 21 odd

Suppose that we wished to find where the function f (x) = x2 + 6x − 3 had a maximum or a
minimum. First we would take the derivative and set it equal to zero - this would tell us where the
graph of the function had a horizontal tangent, which is then a place where the function might have
a maximum or a minimum:
f (x) = x2 + 6x − 3 =⇒ f ′ (x) = 2x + 6 =⇒ f ′ (x) = 0 when x = −3
So we have identified that the function might have a maximum or minimum at the critical point
x = −3. How do we determine whether it has one of those and, if it does, which one? In first-term
calculus you learned two methods for doing this:
• The First Derivative Test: For this test we determine the sign of the derivative on either side
of the critical point and use that information to determine what is happening at the critical point.
In the case of our example, we can see that the derivative will be negative for values of x less
than −3, and it will be positive for values of x greater than −3. This means the graph of
f (x) is sloping downward to the left of x = −3 and upward to the right of −3, so the function
has a minimum at x = −3.
• The Second Derivative Test: For this test we determine the sign of the second derivative
at the critical point; if the sign of the second derivative there is negative, the graph is concave
down and there is a maximum at the critical point, and if the sign of the second derivative at
the critical point is negative, then the graph is concave up and there is a minimum at the critical
point. For the above example, f ′′ (x) = 2 > 0 for all values of x, so the function is concave up
“everywhere” (for all values of x). Therefore f has a minimum at x = −3.
The minimum at x = −3 is actually an absolute minimum, meaning that the function has a smaller
value there than anywhere else. All that the first and second derivative tests really establish, however,
is that there is a relative minimum at x = −3, meaning that the function value there is smaller
than at nearby values of x.
The situation for a function of two variables is more complicated than it is for a function of just
one variable, because such a function has two first derivatives (considering only partial derivatives)
and three second derivatives. For such a function we first define

Critical Points of a Function of Two Variables


Let f be a function of two variables. Then any point (x0 , y0 ) at which
BOTH
fx (x0 , y0 ) = 0 and fy (x0 , y0 ) = 0
is called a critical point of the function.

Determining critical points is essentially finding where the derivative is zero. After locating all such
points, we must then determine whether there is a minimum or maximum value of the function at
each such point, or whether there is a saddle point there. (A saddle point is roughly analogous to an
inflection point of a function of one variable.) To do this we use the following second derivative test.

Second Partials Test


Let f be a function of two variables with continuous second-order partial
derivatives in some circle centered at a critical point (x0 , y0 ), and let
D = fxx (x0 , y0 )fyy (x0 , y0) − [fxy (x0 , y0 )]2 .
(a) If D > 0 and fxx (x0 , y0 ) < 0, then f has a relative maximum at
(x0 , y0 ).
(b) If D > 0 and fxx (x0 , y0 ) > 0, then f has a relative minimum at
(x0 , y0 ).
(c) If D < 0, then f has a saddle point at (x0 , y0).
(d) If D = 0, then no conclusion can be drawn from this test.

When considering maxima and minima, it is important to distinguish where the maxima or minima
occur and what the maxima or minima are. The maxima and minima are the values of the function
obtained by evaluating the function at all points (x0 , y0 ) where maxima or minima occur. The
maxima and minima that we find using this procedure are what we call relative maxima and relative
minima. Sometimes you will see the word “local” used instead of “relative.” These occur at points
where the function value is higher or lower than it is at all “nearby” points.
3.10 Minima and Maxima of Functions on Closed and Bounded Regions

Learning Outcome:

3. (k) Given a level curve plot of a function of two variables, determine the locations
and approximate values of absolute maxima and minima on a closed region.
(l) Use calculus to find absolute maxima and minima of a function of two vari-
ables on a closed region.
• Examples: 838: 5, 6 • Exercises: 845: 37, 39, 41

In the last section we discussed determining where a function has a relative maximum or minimum.
Some functions don’t have any relative maxima or minima (plural for maximum and minimum),
however. The simplest example of such a function is a sloping plane. But if we restrict ourselves to
what is called a closed and bounded region in R2 , a function f (x, y) will have an absolute maximum
and minimum for that region. What do we mean by closed and bounded? Bounded means essentially
that our region will fit inside of some circle, and closed means the region included its boundary.
Let’s take a look at these ideas using the temperature function with level curves shown below:

40
10 90 50
80 60
70
70

60
5
50 90
80

40

30

5 10

First of all we note that the function definitely has a relative maximum of about 95◦ F, located some-
where around (11, 5.5); there is no relative minimum that we can see in the region for which we have
information.
Now consider the region enclosed by the dashed lines (including the lines themselves), which is
described by
1 ≤ x ≤ 8, 0 ≤ y ≤ 4.
There is no relative maximum or minimum of the function inside the rectangle. However, you should
be able to see that the minimum temperature is perhaps 25◦ F and occurs along the bottom edge of
the rectangle at about (5, 0). The maximum temperature is about 65◦ F at the point (8, 4). Note
that the maximum value occurs at a corner of the region, and the minimum value occurs at a low
point along one of the edges.
Now look at the region described by
10 ≤ x ≤ 13, 3 ≤ y ≤ 10,
which is the region enclosed by the rectangle with dotted sides. The maximum temperature in the
region is about 95◦ F at about (10.9, 5.4), which is a relative maximum of the function as well. The
minimum value is about 44◦ F at (13, 10); there is not a relative minimum at that point.
Everything that can possibly happen when finding the minimum and maximum of a function on a
closed and bounded region has occurred in the two situations that we just looked at. To summarize,
the minimum and maximum values of the function will occur at one of three locations:

• an interior point where there is a relative minimum or maximum


• a low or high point along an edge
• a corner of the region
Now suppose that we are given the equation of a function of two variables, and a rectangular
region to which the two variables are restricted. Here is the process for determining the minimum and
maximum values of the function in the region:
1) Determine all points within the region where both first partial derivatives are zero. We will not
need to conduct the second derivative test!
2) a. Create a function of one variable by fixing one of the variables with the value it has along
one edge of the rectangle.
b. Take the derivative of the function with respect to the remaining variable and determine
whether it is zero at any point(s) along that edge of the rectangle. If it is, make note of the
point(s).
c. Repeat for the other three edges of the rectangle.
3) Evaluate the function at each of the points you found in parts (1) and (2), and at each of the
four corners. The largest value you obtain is the maximum value of the function in the region,
and the minimum value you obtain is the minimum value of the function in the region.

⋄ Example 3.10(a): Find the absolute minimum and maximum and their locations for
T (x, y) = x2 + xy + y 2 − 6x + 2 on the region bounded by 0 ≤ x ≤ 5 and −3 ≤ y ≤ 0.
First we find critical points on the interior of the region. The two partial derivatives are
Tx (x, y) = 2x + y − 6 and Ty (x, y) = x + 2y.

We set each of these equal to zero and solve the resulting system of equations:
2x + y − 6 = 0 =⇒ 2x + y = 6 2x + (−2) = 6
x + 2y = 0 =⇒ −2x − 4y = 20 2x = 8
times − 2
−3y = 6 2x = 8
y = −2 x = 4

The solution to the system of equations is (4, −2), so that is the interior critical point that must be
checked.
Now we need to find critical points along each edge of the rectangle. When x = 0 we have
T (0, y) = y 2 + 2 ⇒ T ′ (0, y) = 2y

T ′ (0, y) = 0 when y = 0, so the point along the edge where x = 0 that we must check is (0, 0).
(This is a corner point, so we would have checked it anyway.) We do the same thing for the other
three edges:
x=5: T (5, y) = y 2 + 5y − 3 ⇒ T ′ (5, y) = 2y + 5 = 0 when y = −2.5, so check (5, −2.5)

y = −3 : T (x, −3) = x2 − 9x + 11 ⇒ T ′ (x, −3) = 2x − 9 = 0 when x = 4.5, so check (4.5, −3)


y=0: T (x, 0) = ⇒ T ′ (x, 0) = 2x − 6 = 0 when x = 3, so check (3, 0)
Those are all of the points we have to check along each edge.
We must also check the four corner points, (0, 0), (5, 0),
(0, −3) and (5, −3). So now we have eight points where (x, y) T (x, y)
we must check the function value; this is generally best
(4, −2) −10
organized in a table, as shown to the right. We conclude
that the minimum value of the function (0, 0) 2
(5, −2.5) −9.25
f (x, y) = x2 + xy + y 2 − 6x + 2
(4.5, −3) −9.25
on the rectangle bounded by (3, 0) −7
(5, 0) −3
0 ≤ x ≤ 5 and −3≤y ≤0
(0, −3) 11
is −10 at (4, −2). The maximum value is 11 at (0, −3). (5, −3) −9

You might also like