Openstax - Multivariable Functions & Partial Derivatives
Openstax - Multivariable Functions & Partial Derivatives
Openstax - Multivariable Functions & Partial Derivatives
4 | DIFFERENTIATION OF
FUNCTIONS OF SEVERAL
VARIABLES
Figure 4.1 Americans use (and lose) millions of golf balls a year, which keeps golf ball manufacturers in business. In this
chapter, we study a profit model and learn methods for calculating optimal production levels for a typical golf ball manufacturing
company. (credit: modification of work by oatsy40, Flickr)
334 Chapter 4 | Differentiation of Functions of Several Variables
Chapter Outline
4.1 Functions of Several Variables
4.2 Limits and Continuity
4.3 Partial Derivatives
4.4 Tangent Planes and Linear Approximations
4.5 The Chain Rule
4.6 Directional Derivatives and the Gradient
4.7 Maxima/Minima Problems
4.8 Lagrange Multipliers
Introduction
In Introduction to Applications of Derivatives (http://cnx.org/content/m53602/latest/) , we studied how to
determine the maximum and minimum of a function of one variable over a closed interval. This function might represent
the temperature over a given time interval, the position of a car as a function of time, or the altitude of a jet plane as it travels
from New York to San Francisco. In each of these examples, the function has one independent variable.
Suppose, however, that we have a quantity that depends on more than one variable. For example, temperature can depend
on location and the time of day, or a company’s profit model might depend on the number of units sold and the amount
of money spent on advertising. In this chapter, we look at a company that produces golf balls. We develop a profit model
and, under various restrictions, we find that the optimal level of production and advertising dollars spent determines the
maximum possible profit. Depending on the nature of the restrictions, both the method of solution and the solution itself
changes (see Example 4.41).
When dealing with a function of more than one independent variable, several questions naturally arise. For example, how
do we calculate limits of functions of more than one variable? The definition of derivative we used before involved a limit.
Does the new definition of derivative involve limits as well? Do the rules of differentiation apply in this context? Can we
find relative extrema of functions using derivatives? All these questions are answered in this chapter.
Our first step is to explain what a function of more than one variable is, starting with functions of two independent variables.
This step includes identifying the domain and range of such functions and learning how to graph them. We also examine
ways to relate the graphs of functions in three dimensions to graphs of more familiar planar functions.
Definition
A function of two variables z = f ⎛⎝x, y⎞⎠ maps each ordered pair (x, y) in a subset D of the real plane ℝ 2 to a
unique real number z. The set D is called the domain of the function. The range of f is the set of all real numbers
z that has at least one ordered pair (x, y) ∈ D such that f (x, y) = z as shown in the following figure.
Determining the domain of a function of two variables involves taking into account any domain restrictions that may exist.
Let’s take a look.
Example 4.1
b. g(x, y) = 9 − x 2 − y 2
Solution
a. This is an example of a linear function in two variables. There are no values or combinations of x and
y that cause f (x, y) to be undefined, so the domain of f is ℝ 2. To determine the range, first pick
a value for z. We need to find a solution to the equation f (x, y) = z, or 3x − 5y + 2 = z. One such
solution can be obtained by first setting y = 0, which yields the equation 3x + 2 = z. The solution
⎛ ⎞
to this equation is x = z − 2 , which gives the ordered pair ⎝z − 2 , 0⎠ as a solution to the equation
3 3
f (x, y) = z for any value of z. Therefore, the range of the function is all real numbers, or ℝ.
b. For the function g(x, y) to have a real value, the quantity under the square root must be nonnegative:
9 − x 2 − y 2 ≥ 0.
|
⎧ ⎫
Therefore, the domain of g(x, y) is ⎨⎩(x, y) ∈ ℝ 2 x 2 + y 2 ≤ 9⎬⎭. The graph of this set of points can be
described as a disk of radius 3 centered at the origin. The domain includes the boundary circle as shown
in the following graph.
336 Chapter 4 | Differentiation of Functions of Several Variables
To determine the range of g(x, y) = 9 − x 2 − y 2 we start with a point (x 0, y 0) on the boundary of the
domain, which is defined by the relation x 2 + y 2 = 9. It follows that x 02 + y 20 = 9 and
This is the maximum value of the function. Given any value c between 0 and 3, we can find an entire
set of points inside the domain of g such that g(x, y) = c:
9 − x2 − y2 = c
9 − x2 − y2 = c2
x 2 + y 2 = 9 − c 2.
Since 9 − c 2 > 0, this describes a circle of radius 9 − c 2 centered at the origin. Any point on this
circle satisfies the equation g(x, y) = c. Therefore, the range of this function can be written in interval
notation as [0, 3].
Example 4.2
a. g(x, y) = 9 − x 2 − y 2
b. f (x, y) = x 2 + y 2
Solution
a. In Example 4.1, we determined that the domain of g(x, y) = 9 − x 2 − y 2 is
| |
⎧ 2 2 2 ⎫ ⎧ 2 ⎫ 2 2
⎨
⎩
(x, y) ∈ ℝ x + y ≤ 9 ⎬
⎭
and the range is z ∈ ℝ 0 ≤ z ≤ 3 . When x + y = 9 we have
⎨
⎩
⎬
⎭
g(x, y) = 0. Therefore any point on the circle of radius 3 centered at the origin in the x, y-plane maps
to z = 0 in ℝ 3. If x 2 + y 2 = 8, then g(x, y) = 1, so any point on the circle of radius 2 2 centered
at the origin in the x, y-plane maps to z = 1 in ℝ 3. As x 2 + y 2 gets closer to zero, the value of z
approaches 3. When x 2 + y 2 = 0, then g(x, y) = 3. This is the origin in the x, y-plane. If x 2 + y 2 is
equal to any other value between 0 and 9, then g(x, y) equals some other constant between 0 and 3.
The surface described by this function is a hemisphere centered at the origin with radius 3 as shown in
the following graph.
338 Chapter 4 | Differentiation of Functions of Several Variables
b. This function also contains the expression x 2 + y 2. Setting this expression equal to various values
starting at zero, we obtain circles of increasing radius. The minimum value of f (x, y) = x 2 + y 2 is
zero (attained when x = y = 0.). When x = 0, the function becomes z = y 2, and when y = 0,
then the function becomes z = x 2. These are cross-sections of the graph, and are parabolas. Recall from
Introduction to Vectors in Space that the name of the graph of f (x, y) = x 2 + y 2 is a paraboloid.
The graph of f appears in the following graph.
Example 4.3
where x is the number of nuts sold per month (measured in thousands) and y represents the number of bolts sold
per month (measured in thousands). Profit is measured in thousands of dollars. Sketch a graph of this function.
Solution
This function is a polynomial function in two variables. The domain of f consists of (x, y) coordinate pairs that
yield a nonnegative profit:
16 − (x − 3) 2 − ⎛⎝y − 2⎞⎠ 2 ≥ 0
(x − 3) 2 + ⎛⎝y − 2⎞⎠ 2 ≤ 16.
This is a disk of radius 4 centered at (3, 2). A further restriction is that both x and y must be nonnegative.
When x = 3 and y = 2, f (x, y) = 16. Note that it is possible for either value to be a noninteger; for example,
it is possible to sell 2.5 thousand nuts in a month. The domain, therefore, contains thousands of points, so we
340 Chapter 4 | Differentiation of Functions of Several Variables
can consider all points within the disk. For any z < 16, we can solve the equation f (x, y) = 16:
16 − (x − 3) 2 − ⎛⎝y − 2⎞⎠ 2 = z
(x − 3) 2 + ⎛⎝y − 2⎞⎠ 2 = 16 − z.
Since z < 16, we know that 16 − z > 0, so the previous equation describes a circle with radius 16 − z
centered at the point (3, 2). Therefore. the range of f (x, y) is {z ∈ ℝ|z ≤ 16}. The graph of f (x, y) is also a
paraboloid, and this paraboloid points downward as shown.
Level Curves
If hikers walk along rugged trails, they might use a topographical map that shows how steeply the trails change. A
topographical map contains curved lines called contour lines. Each contour line corresponds to the points on the map that
have equal elevation (Figure 4.7). A level curve of a function of two variables f (x, y) is completely analogous to a
contour line on a topographical map.
Figure 4.7 (a) A topographical map of Devil’s Tower, Wyoming. Lines that are close together indicate very steep terrain. (b) A
perspective photo of Devil’s Tower shows just how steep its sides are. Notice the top of the tower has the same shape as the
center of the topographical map.
Definition
Given a function f (x, y) and a number c in the range of f , a level curve of a function of two variables for the
value c is defined to be the set of points satisfying the equation f (x, y) = c.
Returning to the function g(x, y) = 9 − x 2 − y 2, we can determine the level curves of this function. The range of g is
the closed interval [0, 3]. First, we choose any number in this closed interval—say, c = 2. The level curve corresponding
to c = 2 is described by the equation
9 − x 2 − y 2 = 2.
Now, multiply both sides of the equation by −1 and add 9 to each side:
x 2 + y 2 = 5.
This equation describes a circle centered at the origin with radius 5. Using values of c between 0 and 3 yields other
circles also centered at the origin. If c = 3, then the circle has radius 0, so it consists solely of the origin. Figure 4.8
is a graph of the level curves of this function corresponding to c = 0, 1, 2, and 3. Note that in the previous derivation it
may be possible that we introduced extra solutions by squaring both sides. This is not the case here because the range of the
square root function is nonnegative.
342 Chapter 4 | Differentiation of Functions of Several Variables
Example 4.4
Given the function f (x, y) = 8 + 8x − 4y − 4x 2 − y 2, find the level curve corresponding to c = 0. Then
create a contour map for this function. What are the domain and range of f ?
Solution
To find the level curve for c = 0, we set f (x, y) = 0 and solve. This gives
0 = 8 + 8x − 4y − 4x 2 − y 2.
We then square both sides and multiply both sides of the equation by −1:
4x 2 + y 2 − 8x + 4y − 8 = 0.
Now, we rearrange the terms, putting the x terms together and the y terms together, and add 8 to each side:
4x 2 − 8x + y 2 + 4y = 8.
Next, we group the pairs of terms containing the same variable in parentheses, and factor 4 from the first pair:
Then we complete the square in each pair of parentheses and add the correct value to the right-hand side:
4⎛⎝x 2 − 2x + 1⎞⎠ + ⎛⎝y 2 + 4y + 4⎞⎠ = 8 + 4(1) + 4.
Next, we factor the left-hand side and simplify the right-hand side:
4(x − 1) 2 + ⎛⎝y + 2⎞⎠ 2 = 16.
We can repeat the same derivation for values of c less than 4. Then, Equation 4.1 becomes
4.2 Find and graph the level curve of the function g(x, y) = x 2 + y 2 − 6x + 2y corresponding to c = 15.
Another useful tool for understanding the graph of a function of two variables is called a vertical trace. Level curves are
always graphed in the xy-plane, but as their name implies, vertical traces are graphed in the xz - or yz-planes.
Definition
Consider a function z = f (x, y) with domain D ⊆ ℝ 2. A vertical trace of the function can be either the set of points
that solves the equation f (a, y) = z for a given constant x = a or f (x, b) = z for a given constant y = b.
Example 4.5
Find vertical traces for the function f (x, y) = sin x cos y corresponding to x = − π , 0, and π , and
4 4
y = − π , 0, and π .
4 4
Solution
First set x = − π in the equation z = sin x cos y:
4
⎛ ⎞ 2 cos y
z = sin⎝− π ⎠cos y = − ≈ −0.7071 cos y.
4 2
This describes a cosine graph in the plane x = − π . The other values of z appear in the following table.
4
−π 2 cos y
4 z= −
2
0 z=0
π 2 cos y
4 z=
2
Table 4.1
Vertical Traces Parallel to the xz-Plane
for the Function f (x, y) = sin x cos y
In a similar fashion, we can substitute the y-values in the equation f (x, y) to obtain the traces in the yz-plane,
as listed in the following table.
−π z = − 2 sin x
4 2
0 z = sin x
π
4 z = 2 sin x
2
Table 4.2
Vertical Traces Parallel to the yz-Plane
for the Function f (x, y) = sin x cos y
The three traces in the xz-plane are cosine functions; the three traces in the yz-plane are sine functions.
These curves appear in the intersections of the surface with the planes x = − π , x = 0, x = π and
4 4
y = − π , y = 0, y = π as shown in the following figure.
4 4
346 Chapter 4 | Differentiation of Functions of Several Variables
Figure 4.11 Vertical traces of the function f (x, y) are cosine curves in the xz-planes (a) and sine curves in the
yz-planes (b).
4.3 Determine the equation of the vertical trace of the function g(x, y) = −x 2 − y 2 + 2x + 4y − 1
corresponding to y = 3, and describe its graph.
Functions of two variables can produce some striking-looking surfaces. The following figure shows two examples.
Figure 4.12 Examples of surfaces representing functions of two variables: (a) a combination of a power function and a sine
function and (b) a combination of trigonometric, exponential, and logarithmic functions.
and
g(x, y, t) = ⎛⎝x 2 − 4xy + y 2⎞⎠sin t − ⎛⎝3x + 5y⎞⎠cos t.
In the first function, (x, y, z) represents a point in space, and the function f maps each point in space to a fourth quantity,
such as temperature or wind speed. In the second function, (x, y) can represent a point in the plane, and t can represent
time. The function might map a point in the plane to a third quantity (for example, pressure) at a given time t. The method
for finding the domain of a function of more than two variables is analogous to the method for functions of one or two
variables.
Example 4.6
b. g(x, y, t) = 2t −4
x2 − y2
Solution
3x − 4y + 2z
a. For the function f (x, y, z) = to be defined (and be a real value), two conditions must
9 − x2 − y2 − z2
hold:
1. The denominator cannot be zero.
2. The radicand cannot be negative.
Combining these conditions leads to the inequality
9 − x 2 − y 2 − z 2 > 0.
Moving the variables to the other side and reversing the inequality gives the domain as
|
⎧ ⎫
domain⎛⎝ f ⎞⎠ = ⎨⎩(x, y, z) ∈ ℝ 3 x 2 + y 2 + z 2 < 9⎬⎭,
which describes a ball of radius 3 centered at the origin. (Note: The surface of the ball is not included in
this domain.)
Functions of two variables have level curves, which are shown as curves in the xy-plane. However, when the function has
three variables, the curves become surfaces, so we can define level surfaces for functions of three variables.
Definition
Given a function f (x, y, z) and a number c in the range of f , a level surface of a function of three variables is
defined to be the set of points satisfying the equation f (x, y, z) = c.
Example 4.7
Solution
The level surface is defined by the equation 4x 2 + 9y 2 − z 2 = 1. This equation describes a hyperboloid of one
sheet as shown in the following figure.
Figure 4.13 A hyperboloid of one sheet with some of its level surfaces.
4.1 EXERCISES
For the following exercises, evaluate each function at the 15. z(x, y) = y 2 − x 2, c=4
indicated values.
16. g(x, y) = x 2 + y 2; c = 4, c = 9
1. W(x, y) = 4x 2 + y 2. Find W(2, −1), W(−3, 6).
17. g(x, y) = 4 − x − y; c = 0, 4
2. W(x, y) = 4x 2 + y 2. Find W(2 + h, 3 + h).
18. f (x, y) = xy; c = 1; c = −1
3. The volume of a right circular cylinder is calculated by
a function of two variables, V(x, y) = πx 2 y, where x is 19. h(x, y) = 2x − y; c = 0, −2, 2
the radius of the right circular cylinder and y represents the
height of the cylinder. Evaluate V(2, 5) and explain what 20. f (x, y) = x 2 − y; c = 1, 2
this means.
x ; c = −1, 0, 2
21. g(x, y) = x + y
4. An oxygen tank is constructed of a right cylinder of
height y and radius x with two hemispheres of radius x
mounted on the top and bottom of the cylinder. Express 22. g(x, y) = x 3 − y; c = −1, 0, 2
the volume of the cylinder as a function of two variables,
xy
x and y, find V(10, 2), and explain what this means. 23. g(x, y) = e ; c = 1 , 3
2
For the following exercises, find the domain of the
function. 24. f (x, y) = x 2; c = 4, 9
⎛y⎞
7. f (x, y) = 4 ln(y 2 − x) 27. g(x, y) = ln
⎝x 2 ⎠; c = −2, 0, 2
8. g(x, y) = 16 − 4x 2 − y 2
28. z = f (x, y) = x 2 + y 2, c=3
9. z(x, y) = y 2 − x 2 y+2
29. f (x, y) = , c = any constant
x2
y+2
10. f (x, y) =
x2 For the following exercises, find the vertical traces of the
functions at the indicated values of x and y, and plot the
Find the range of the functions. traces.
For the following exercises, find the level curves of each Find the domain of the following functions.
function at the indicated value of c to visualize the given
function. 33. z = 100 − 4x 2 − 25y 2
41. Use technology to graph z = x 2 y. 58. Refer to the preceding problem. Using the temperature
function found there, determine the proportionality
constant if the temperature at point P(1, 2) is 50°C. Use
Sketch the following by finding the level curves. Verify the
graph using technology. this constant to determine the temperature at point
Q(3, 4).
2 2
42. f (x, y) = 4 − x − y
59. Refer to the preceding problem. Find the level curves
for T = 40°C and T = 100°C, and describe what the
43. f (x, y) = 2 − x 2 + y 2
level curves represent.
−x 2 − y 2
44. z = 1 + e
45. z = cos x 2 + y 2
46. z = y 2 − x 2
48. w(x, y, z) = x − 2y + z, c = 4
49. w(x, y, z) = x 2 + y 2 + z 2, c = 9
50. w(x, y, z) = x 2 + y 2 − z 2, c = −4
51. w(x, y, z) = x 2 + y 2 − z 2, c = 4
We have now examined functions of more than one variable and seen how to graph them. In this section, we see how to take
the limit of a function of more than one variable, and what it means for a function of more than one variable to be continuous
at a point in its domain. It turns out these concepts have aspects that just don’t occur with functions of one variable.
x→a
lim f (x) = L
if for every ε > 0, there exists a δ > 0, such that if 0 < |x − a| < δ for all x in the domain of f , then
| f (x) − L| > ε.
Before we can adapt this definition to define a limit of a function of two variables, we first need to see how to extend the
idea of an open interval in one variable to an open interval in two variables.
Definition
Consider a point (a, b) ∈ ℝ 2. A δ disk centered at point (a, b) is defined to be an open disk of radius δ centered
at point (a, b) —that is,
|
⎧ ⎫
⎨
⎩
(x, y) ∈ ℝ 2 (x − a) 2 + (y − b) 2 < δ 2⎬⎭
The idea of a δ disk appears in the definition of the limit of a function of two variables. If δ is small, then all the points
(x, y) in the δ disk are close to (a, b). This is completely analogous to x being close to a in the definition of a limit of
a function of one variable. In one dimension, we express this restriction as
a − δ < x < a + δ.
Definition
Let f be a function of two variables, x and y. The limit of f (x, y) as (x, y) approaches (a, b) is L, written
lim f (x, y) = L
(x, y) → (a, b)
if for each ε > 0 there exists a small enough δ > 0 such that for all points (x, y) in a δ disk around (a, b), except
possibly for (a, b) itself, the value of f (x, y) is no more than ε away from L (Figure 4.15). Using symbols, we
write the following: For any ε > 0, there exists a number δ > 0 such that
Figure 4.15 The limit of a function involving two variables requires that f (x, y)
be within ε of L whenever (x, y) is within δ of (a, b). The smaller the value of
ε, the smaller the value of δ.
Proving that a limit exists using the definition of a limit of a function of two variables can be challenging. Instead, we use
the following theorem, which gives us shortcuts to finding limits. The formulas in this theorem are an extension of the
formulas in the limit laws theorem in The Limit Laws (http://cnx.org/content/m53492/latest/) .
statements holds:
Constant Law:
lim c=c (4.2)
(x, y) → (a, b)
Identity Laws:
lim x=a (4.3)
(x, y) → (a, b)
lim y=b (4.4)
(x, y) → (a, b)
Sum Law:
⎛
lim ⎝ f (x, y) + g(x, y)⎞⎠ = L + M (4.5)
(x, y) → (a, b)
Difference Law:
⎛
lim ⎝ f (x, y) − g(x, y)⎞⎠ = L − M (4.6)
(x, y) → (a, b)
Product Law:
⎛
lim ⎝ f (x, y)g(x, y)⎞⎠ = LM (4.8)
(x, y) → (a, b)
Quotient Law:
f (x, y) (4.9)
lim = L for M ≠ 0
(x, y) → (a, b) g(x, y) M
Power Law:
lim ⎛
⎝ f (x, y)⎞⎠ n = L n (4.10)
(x, y) → (a, b)
for all L if n is odd and positive, and for L ≥ 0 if n is even and positive.
The proofs of these properties are similar to those for the limits of functions of one variable. We can apply these laws to
finding limits of various functions.
Example 4.8
2x + 3y
b. lim
(x, y) → (2, −1) 4x − 3y
Solution
a. First use the sum and difference laws to separate the terms:
⎛ 2
lim ⎝x − 2xy + 3y 2 − 4x + 3y − 6⎞⎠
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
(x, y) → (2, −1)
=⎝ lim x 2⎠ − ⎝ lim 2xy⎠ + ⎝ lim 3y 2⎠ − ⎝ lim 4x⎠
(x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1)
⎛ ⎞ ⎛ ⎞
+⎝ lim 3y⎠ − ⎝ lim 6⎠.
(x, y) → (2, −1) (x, y) → (2, −1)
Next, use the constant multiple law on the second, third, fourth, and fifth limits:
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
=⎝ lim x 2⎠ − 2⎝ lim xy⎠ + 3⎝ lim y 2⎠ − 4⎝ lim x⎠
(x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1)
⎛ ⎞
+3⎝ lim y⎠ − lim 6.
(x, y) → (2, −1) (x, y) → (2, −1)
Now, use the power law on the first and third limits, and the product law on the second limit:
⎛ ⎞ ⎛ ⎞⎛ ⎞ ⎛ ⎞
2 2
=⎝
(x, y) → (2, −1) ⎠
x − 2⎝
(x, y) → (2, −1) ⎠⎝(x, y) → (2, −1) ⎠
y + 3⎝
(x, y) → (2, −1) ⎠
lim lim x lim lim y
⎛ ⎞ ⎛ ⎞
−4⎝ lim x⎠ + 3⎝ lim y⎠ − lim 6.
(x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1)
Last, use the identity laws on the first six limits and the constant law on the last limit:
⎛ 2
lim ⎝x − 2xy + 3y 2 − 4x + 3y − 6⎞⎠ = (2) 2 − 2(2)(−1) + 3(−1) 2 − 4(2) + 3(−1) − 6
(x, y) → (2, −1)
= −6.
b. Before applying the quotient law, we need to verify that the limit of the denominator is nonzero. Using
the difference law, constant multiple law, and identity law,
⎛
lim ⎝ 4x − 3y⎞⎠ = lim 4x − lim 3y
(x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1)
⎛ ⎞ ⎛ ⎞
= 4⎝ lim x⎠ − 3⎝ lim y⎠
(x, y) → (2, −1) (x, y) → (2, −1)
= 4(2) − 3(−1) = 11.
Since the limit of the denominator is nonzero, the quotient law applies. We now calculate the limit of the
numerator using the difference law, constant multiple law, and identity law:
⎛
lim ⎝ 2x + 3y⎞⎠ = lim 2x + lim 3y
(x, y) → (2, −1) (x, y) → (2, −1) (x, y) → (2, −1)
⎛ ⎞ ⎛ ⎞
= 2⎝ lim x⎠ + 3⎝ lim y⎠
(x, y) → (2, −1) (x, y) → (2, −1)
= 2(2) + 3(−1)
= 1.
⎛
lim ⎝2x + 3y⎞⎠
2x + 3y
= 1.
(x, y) → (2, −1)
lim = ⎛
(x, y) → (2, −1) 4x − 3y lim ⎝4x − 3y⎞⎠ 11
(x, y) → (2, −1)
Since we are taking the limit of a function of two variables, the point (a, b) is in ℝ 2, and it is possible to approach this
point from an infinite number of directions. Sometimes when calculating a limit, the answer varies depending on the path
taken toward (a, b). If this is the case, then the limit fails to exist. In other words, the limit must be unique, regardless of
path taken.
Example 4.9
4xy 2
b. lim
(x, y) → (0, 0) x 2 + 3y 4
Solution
2xy
a. The domain of the function f (x, y) = consists of all points in the xy-plane except for the
3x 2 + y 2
point (0, 0) (Figure 4.16). To show that the limit does not exist as (x, y) approaches (0, 0), we note
that it is impossible to satisfy the definition of a limit of a function of two variables because of the fact
that the function takes different values along different lines passing through point (0, 0). First, consider
the line y = 0 in the xy-plane. Substituting y = 0 into f (x, y) gives
2x(0)
f (x, 0) = =0
3x 2 + 0 2
for any value of x. Therefore the value of f remains constant for any point on the x-axis, and as y
approaches zero, the function remains fixed at zero.
Next, consider the line y = x. Substituting y = x into f (x, y) gives
2x(x) 2
f (x, x) = = 2x 2 = 1 .
2
3x + x 2
4x 2
This is true for any point on the line y = x. If we let x approach zero while staying on this line, the
equal to 1 .
2
In a similar fashion to a., we can approach the origin along any straight line passing through the origin. If
we try the x-axis (i.e., y = 0), then the function remains fixed at zero. The same is true for the y-axis.
Suppose we approach the origin along a straight line of slope k. The equation of this line is y = kx.
Then the limit becomes
4xy 2 4x(kx) 2
lim = lim
(x, y) → (0, 0) x 2 + 3y 4 (x, y) → (0, 0) x 2 + 3(kx) 4
= lim 4k 2 x 3
(x, y) → (0, 0) x 2 + 3k 4 x 4
= lim 4k 2 x
(x, y) → (0, 0) 1 + 3k 4 x 2
⎛ 2 ⎞
lim ⎝4k x⎠
(x, y) → (0, 0)
= ⎛
lim ⎝1 + 3k 4 x 2⎞⎠
(x, y) → (0, 0)
=0
regardless of the value of k. It would seem that the limit is equal to zero. What if we chose a curve
passing through the origin instead? For example, we can consider the parabola given by the equation
x = y 2. Substituting y 2 in place of x in f (x, y) gives
358 Chapter 4 | Differentiation of Functions of Several Variables
4y 4
= lim
(x, y) → (0, 0) y 4 + 3y 4
= lim 1
(x, y) → (0, 0)
= 1.
By the same logic in a., it is impossible to find a δ disk around the origin that satisfies the definition of
4xy 2
the limit for any value of ε < 1. Therefore, lim does not exist.
(x, y) → (0, 0) x 2 + 3y 4
Definition
Let S be a subset of ℝ 2 (Figure 4.17).
1. A point P 0 is called an interior point of S if there is a δ disk centered around P 0 contained completely in
S.
2. A point P 0 is called a boundary point of S if every δ disk centered around P 0 contains points both inside
and outside S.
Definition
Let S be a subset of ℝ 2 (Figure 4.17).
1. S is called an open set if every point of S is an interior point.
2. S is called a closed set if it contains all its boundary points.
An example of an open set is a δ disk. If we include the boundary of the disk, then it becomes a closed set. A set that
contains some, but not all, of its boundary points is neither open nor closed. For example if we include half the boundary of
a δ disk but not the other half, then the set is neither open nor closed.
Definition
Let S be a subset of ℝ 2 (Figure 4.17).
1. An open set S is a connected set if it cannot be represented as the union of two or more disjoint, nonempty
open subsets.
2. A set S is a region if it is open, connected, and nonempty.
The definition of a limit of a function of two variables requires the δ disk to be contained inside the domain of the function.
However, if we wish to find the limit of a function at a boundary point of the domain, the δ disk is not contained inside
the domain. By definition, some of the points of the δ disk are inside the domain and some are outside. Therefore, we need
only consider points that are inside both the δ disk and the domain of the function. This leads to the definition of the limit
of a function at a boundary point.
Definition
Let f be a function of two variables, x and y, and suppose (a, b) is on the boundary of the domain of f . Then,
the limit of f (x, y) as (x, y) approaches (a, b) is L, written
lim f (x, y) = L,
(x, y) → (a, b)
360 Chapter 4 | Differentiation of Functions of Several Variables
if for any ε > 0, there exists a number δ > 0 such that for any point (x, y) inside the domain of f and within a
suitably small distance positive δ of (a, b), the value of f (x, y) is no more than ε away from L (Figure 4.15).
Using symbols, we can write: For any ε > 0, there exists a number δ > 0 such that
Example 4.10
Prove lim 25 − x 2 − y 2 = 0.
(x, y) → (4, 3)
Solution
|
⎧ ⎫
The domain of the function f (x, y) = 25 − x 2 − y 2 is ⎨⎩(x, y) ∈ ℝ 2 x 2 + y 2 ≤ 25⎬⎭, which is a circle of
radius 5 centered at the origin, along with its interior as shown in the following graph.
We can use the limit laws, which apply to limits at the boundary of domains as well as interior points:
⎛
lim 25 − x 2 − y 2 = lim ⎝25 − x 2 − y 2⎞⎠
(x, y) → (4, 3) (x, y) → (4, 3)
= 25 − 4 2 − 3 2
= 0.
See the following graph.
lim 29 − x 2 − y 2.
(x, y) → (5, −2)
These three conditions are necessary for continuity of a function of two variables as well.
Definition
A function f (x, y) is continuous at a point (a, b) in its domain if the following conditions are satisfied:
1. f (a, b) exists.
Example 4.11
3x + 2y
Show that the function f (x, y) = is continuous at point (5, −3).
x+y+1
Solution
There are three conditions to be satisfied, per the definition of continuity. In this example, a = 5 and b = −3.
1. f (a, b) exists. This is true because the domain of the function f consists of those ordered pairs
for which the denominator is nonzero (i.e., x + y + 1 ≠ 0). Point (5, −3) satisfies this condition.
Furthermore,
3(5) + 2(−3) 15 − 6
f (a, b) = f (5, −3) = = = 3.
5 + (−3) + 1 2+1
2. lim f (x, y) exists. This is also true:
(x, y) → (a, b)
3x + 2y
lim f (x, y) = lim
(x, y) → (a, b) (x, y) → (5, −3) x + y+1
⎛ ⎞
lim ⎝3x + 2y⎠
(x, y) → (5, −3)
= ⎛
lim ⎝ x + y + 1⎞⎠
(x, y) → (5, −3)
= 15 − 6
5−3+1
= 3.
3. lim f (x, y) = f (a, b). This is true because we have just shown that both sides of this equation
(x, y) → (a, b)
equal three.
4.9 Show that the function f (x, y) = 26 − 2x 2 − y 2 is continuous at point (2, −3).
Continuity of a function of any number of variables can also be defined in terms of delta and epsilon. A function of two
variables is continuous at a point (x 0, y 0) in its domain if for every ε > 0 there exists a δ > 0 such that, whenever
(x − x 0) 2 + (y − y 0) 2 < δ it is true, | f (x, y) − f (a, b)| < ε. This definition can be combined with the formal definition
(that is, the epsilon–delta definition) of continuity of a function of one variable to prove the following theorems:
Let’s now use the previous theorems to show continuity of functions in the following examples.
Example 4.12
Show that the functions f (x, y) = 4x 3 y 2 and g(x, y) = cos⎛⎝4x 3 y 2⎞⎠ are continuous everywhere.
Solution
The polynomials g(x) = 4x 3 and h(y) = y 2 are continuous at every real number, and therefore by the product
of continuous functions theorem, f (x, y) = 4x 3 y 2 is continuous at every point (x, y) in the xy-plane. Since
f (x, y) = 4x 3 y 2 is continuous at every point (x, y) in the xy-plane and g(x) = cos x is continuous at every
real number x, the continuity of the composition of functions tells us that g(x, y) = cos⎛⎝4x 3 y 2⎞⎠ is continuous
at every point (x, y) in the xy-plane.
4
Show that the functions f (x, y) = 2x 2 y 3 + 3 and g(x, y) = ⎛⎝2x 2 y 3 + 3⎞⎠ are continuous everywhere.
4.10
Definition
Let (x 0, y 0, z 0) be a point in ℝ 3. Then, a δ ball in three dimensions consists of all points in ℝ 3 lying at a distance
of less than δ from (x 0, y 0, z 0) —that is,
|
⎧ ⎫
⎨
⎩
(x, y, z) ∈ ℝ 3 (x − x 0) 2 + (y − y 0) 2 + (z − z 0) 2 < δ⎬⎭.
To define a δ ball in higher dimensions, add additional terms under the radical to correspond to each additional
dimension. For example, given a point P = (w 0, x 0, y 0, z 0) in ℝ 4, a δ ball around P can be described by
|
⎧ ⎫
⎨
⎩
(w, x, y, z) ∈ ℝ 4 (w − w 0) 2 + (x − x 0) 2 + (y − y 0) 2 + (z − z 0) 2 < δ⎬⎭.
To show that a limit of a function of three variables exists at a point (x 0, y 0, z 0), it suffices to show that for any point in a
δ ball centered at (x 0, y 0, z 0), the value of the function at that point is arbitrarily close to a fixed value (the limit value).
All the limit laws for functions of two variables hold for functions of more than two variables as well.
Example 4.13
x 2 y − 3z
Find lim .
(x, y, z) → (4, 1, −3) 2x + 5y − z
Solution
Before we can apply the quotient law, we need to verify that the limit of the denominator is nonzero. Using the
difference law, the identity law, and the constant law,
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
2x + 5y − z⎞⎠ = 2⎝
(x, y, z) → (4, 1, −3) ⎠
x + 5⎝
(x, y, z) → (4, 1, −3) ⎠ ⎝(x, y, z) → (4, 1, −3) ⎠
⎛
lim ⎝ lim lim y − lim z
(x, y, z) → (4, 1, −3)
= 2(4) + 5(1) − (−3)
= 16.
Since this is nonzero, we next find the limit of the numerator. Using the product law, difference law, constant
multiple law, and identity law,
⎛ ⎞ ⎛ ⎞
2
⎛ 2
− 3z⎞⎠ = ⎝
(x, y, z) → (4, 1, −3) ⎠ ⎝(x, y, z) → (4, 1, −3) ⎠
lim ⎝x y lim x lim y −3 lim z
(x, y, z) → (4, 1, −3) (x, y, z) → (4, 1, −3)
= 25 .
16
4.2 EXERCISES
For the following exercises, find the limit of the function. xy + 1
75. lim
(x, y) → (0, 0) x 2 + y 2 + 1
60. lim x
(x, y) → (1, 2)
x2 + y2
76. lim
5x 2 y (x, y) → (0, 0) x2 + y2 + 1 − 1
61. lim
(x, y) → (1, 2) x 2 + y 2
x−y−1
1 80. lim
64. lim (x, y) → (2, 1) x−y−1
(x, y) → (11, 13) xy
x 4 − 4y 4
y 2 sin x 81. lim
65. lim (x, y) → (0, 0) x 2 + 2y 2
(x, y) → (0, 1) x
⎛ x8 + y7 ⎞ x3 − y3
lim
sin⎜ ⎟
82.
66. lim (x, y) → (0, 0) x − y
(x, y) → (0, 0) ⎝ x − y + 10 ⎠
y tan x x 2 − xy
lim 83. lim
67.
(x, y) → (π/4, 1) y + 1 (x, y) → (0, 0) x − y
⎛1 ⎞ xz 2 − y 2 z
lim − 5y ⎠ 84. lim
(x, y) → (2, 5)⎝ x
69. (x, y, z) → (1, 2, 3) xyz − 1
70. lim x ln y x2 − y2 − z2
(x, y) → (4, 4) 85. lim
(x, y, z) → (0, 0, 0) x 2 + y 2 − z 2
−x 2 − y 2
71. lim e For the following exercises, evaluate the limit of the
(x, y) → (4, 4)
function by determining the value the function approaches
along the indicated paths. If the limit does not exist, explain
72. lim 9 − x2 − y2 why not.
(x, y) → (0, 0)
⎛ 2 3
73. lim ⎝x y − x 3 y 2 + 3x + 2y⎞⎠
(x, y) → (1, 2)
⎛x + y ⎞
lim x sin⎝
4 ⎠
74.
(x, y) → (π, π)
xy + y 3 x2 − y2
86. lim 97. Determine whether g(x, y) = 2 is continuous
(x, y) → (0, 0) x 2 + y 2 x + y2
a. Along the x-axis (y = 0) at (0, 0).
b. Along the y-axis (x = 0)
98. Create a plot using graphing software to determine
c. Along the path y = 2x
where the limit does not exist. Determine the region of
the coordinate plane in which f (x, y) = 1 is
xy + y 3 x2 − y
87. Evaluate lim using the results of
(x, y) → (0, 0) x 2 + y 2
continuous.
previous problem.
99. Determine the region of the xy-plane in which the
2
x y ⎛ xy 2 ⎞
88. lim composite function g(x, y) = arctan⎜ x + y ⎟ is
(x, y) → (0, 0) x 4 + y 2 ⎝ ⎠
a. Along the x-axis (y = 0) continuous. Use technology to support your conclusion.
b. Along the y-axis (x = 0)
100. Determine the region of the xy-plane in which
c. Along the path y = x 2
f (x, y) = ln(x 2 + y 2 − 1) is continuous. Use technology
to support your conclusion. (Hint: Choose the range of
x2 y
89. Evaluate lim using the results of values for x and y carefully!)
(x, y) → (0, 0) x 4 + y 2
x2 y 105. [T]
94. f (x, y) = a. Use a CAS to draw a contour map of
x + y2
2
z = 9 − x 2 − y 2.
⎧ ⎫
⎪ x2 y ⎪
b. What is the name of the geometric shape of the
if (x, y) ≠ (0, 0)
95. f (x, y) = ⎨ x2 + y2 ⎬ (Hint: level curves?
⎪ ⎪ c. Give the general equation of the level curves.
⎩ 0 if (x, y) = (0, 0)⎭ d. What is the maximum value of z ?
Show that the function approaches different values along e. What is the domain of the function?
two different paths.) f. What is the range of the function?
Now that we have examined limits and continuity of functions of two variables, we can proceed to study derivatives.
Finding derivatives of functions of two variables is the key concept in this chapter, with as many applications in
mathematics, science, and engineering as differentiation of single-variable functions. However, we have already seen that
limits and continuity of multivariable functions have new issues and require new terminology and ideas to deal with them.
This carries over into differentiation as well.
Definition
Let f (x, y) be a function of two variables. Then the partial derivative of f with respect to x, written as ∂ f /∂ x,
or f x, is defined as
This definition shows two differences already. First, the notation changes, in the sense that we still use a version of Leibniz
notation, but the d in the original notation is replaced with the symbol ∂ . (This rounded “d” is usually called “partial,” so
∂ f /∂ x is spoken as the “partial of f with respect to x.”) This is the first hint that we are dealing with partial derivatives.
Second, we now have two different derivatives we can take, since there are two different independent variables. Depending
on which variable we choose, we can come up with different partial derivatives altogether, and often do.
Example 4.14
Use the definition of the partial derivative as a limit to calculate ∂ f /∂ x and ∂ f /∂ y for the function
Solution
370 Chapter 4 | Differentiation of Functions of Several Variables
∂f
To calculate , first calculate f ⎛⎝x, y + h⎞⎠:
∂y
h→0
= −3x + 4y + 5.
4.12 Use the definition of the partial derivative as a limit to calculate ∂ f /∂ x and ∂ f /∂ y for the function
f (x, y) = 4x 2 + 2xy − y 2 + 3x − 2y + 5.
The idea to keep in mind when calculating partial derivatives is to treat all independent variables, other than the variable
with respect to which we are differentiating, as constants. Then proceed to differentiate as with a function of a single
variable. To see why this is true, first fix y and define g(x) = f (x, y) as a function of x. Then
Example 4.15
Calculate ∂ f /∂ x and ∂ f /∂ y for the following functions by holding the opposite variable constant then
differentiating:
a. f (x, y) = x 2 − 3xy + 2y 2 − 4x + 5y − 12
Solution
a. To calculate ∂ f /∂ x, treat the variable y as a constant. Then differentiate f (x, y) with respect to x
using the sum, difference, and power rules:
∂f
= ∂ ⎡⎣x 2 − 3xy + 2y 2 − 4x + 5y − 12⎤⎦
∂x ∂x
= ∂ ⎡⎣x 2⎤⎦ − ∂ ⎡⎣3xy⎤⎦ + ∂ ⎡⎣2y 2⎤⎦ − ∂ [4x] + ∂ ⎡⎣5y⎤⎦ − ∂ [12]
∂x ∂x ∂x ∂x ∂x ∂x
= 2x − 3y + 0 − 4 + 0 − 0
= 2x − 3y − 4.
The derivatives of the third, fifth, and sixth terms are all zero because they do not contain the variable x,
so they are treated as constant terms. The derivative of the second term is equal to the coefficient of x,
which is −3y. Calculating ∂ f /∂ y:
∂f
= ∂ ⎡⎣x 2 − 3xy + 2y 2 − 4x + 5y − 12⎤⎦
∂y ∂y
= ∂ ⎡⎣x 2⎤⎦ − ∂ ⎡⎣3xy⎤⎦ + ∂ ⎡⎣2y 2⎤⎦ − ∂ [4x] + ∂ ⎡⎣5y⎤⎦ − ∂ [12]
∂y ∂y ∂y ∂y ∂y ∂y
= −3x + 4y − 0 + 5 − 0
= −3x + 4y + 5.
= ∂ ⎡⎣sin⎛⎝x 2 y − 2x + 4⎞⎠⎤⎦
∂g
∂x ∂x
= cos⎛⎝x 2 y − 2x + 4⎞⎠ ∂ ⎡⎣x 2 y − 2x + 4⎤⎦
∂x
= ⎝2xy − 2⎠cos⎝x y − 2x + 4⎞⎠.
⎛ ⎞ ⎛ 2
372 Chapter 4 | Differentiation of Functions of Several Variables
To calculate ∂ g/∂ y, treat the variable x as a constant. Then differentiate g(x, y) with respect to y
using the chain rule and power rule:
= ∂ ⎡⎣sin⎛⎝x 2 y − 2x + 4⎞⎠⎤⎦
∂g
∂y ∂y
= cos⎛⎝x 2 y − 2x + 4⎞⎠ ∂ ⎡⎣x 2 y − 2x + 4⎤⎦
∂y
= x 2 cos⎛⎝x 2 y − 2x + 4⎞⎠.
4.13 Calculate ∂ f /∂ x and ∂ f /∂ y for the function f (x, y) = tan⎛⎝x 3 − 3x 2 y 2 + 2y 4⎞⎠ by holding the
opposite variable constant, then differentiating.
How can we interpret these partial derivatives? Recall that the graph of a function of two variables is a surface in ℝ 3. If
we remove the limit from the definition of the partial derivative with respect to x, the difference quotient remains:
In Figure 4.21, the value of h is positive. If we graph f (x, y) and f ⎛⎝x + h, y⎞⎠ for an arbitrary point (x, y), then the
slope of the secant line passing through these two points is given by
f ⎛⎝x + h, y⎞⎠ − f (x, y)
.
h
This line is parallel to the x-axis. Therefore, the slope of the secant line represents an average rate of change of the function
f as we travel parallel to the x-axis. As h approaches zero, the slope of the secant line approaches the slope of the tangent
line.
If we choose to change y instead of x by the same incremental value h, then the secant line is parallel to the y-axis and
so is the tangent line. Therefore, ∂ f /∂ x represents the slope of the tangent line passing through the point ⎛⎝x, y, f (x, y)⎞⎠
parallel to the x-axis and ∂ f /∂ y represents the slope of the tangent line passing through the point ⎛⎝x, y, f (x, y)⎞⎠ parallel
to the y-axis. If we wish to find the slope of a tangent line passing through the same point in any other direction, then we
need what are called directional derivatives, which we discuss in Directional Derivatives and the Gradient.
We now return to the idea of contour maps, which we introduced in Functions of Several Variables. We can use a
contour map to estimate partial derivatives of a function g(x, y).
Example 4.16
⎛
Use a contour map to estimate ∂ g/∂ x at the point ⎝ 5, 0⎞⎠ for the function g(x, y) = 9 − x 2 − y 2.
Solution
The following graph represents a contour map for the function g(x, y) = 9 − x 2 − y 2.
The inner circle on the contour map corresponds to c = 2 and the next circle out corresponds to c = 1. The first
circle is given by the equation 2 = 9 − x 2 − y 2; the second circle is given by the equation 1 = 9 − x 2 − y 2.
The first equation simplifies to x 2 + y 2 = 5 and the second equation simplifies to x 2 + y 2 = 8. The
⎛
x-intercept of the first circle is ⎝ 5, 0⎞⎠ and the x-intercept of the second circle is ⎛⎝2 2, 0⎞⎠. We can estimate
⎛
the value of ∂ g/∂ x evaluated at the point ⎝ 5, 0⎞⎠ using the slope formula:
∂g
∂x | (x, y) = ⎛
⎝ 5, 0⎞⎠ ≈
g⎛⎝ 5, 0⎞⎠ − g⎛⎝2 2, 0⎞⎠
5−2 2
= 2−1 =
5−2 2
1
5−2 2
≈ −1.688.
⎛
To calculate the exact value of ∂ g/∂ x evaluated at the point ⎝ 5, 0⎞⎠, we start by finding ∂ g/∂ x using the
374 Chapter 4 | Differentiation of Functions of Several Variables
1/2
chain rule. First, we rewrite the function as g(x, y) = 9 − x 2 − y 2 = ⎛⎝9 − x 2 − y 2⎞⎠ and then differentiate
with respect to x while holding y constant:
∂ g 1⎛ −1/2
= ⎝9 − x 2 − y 2⎞⎠ (−2x) = − x .
∂x 2 9 − x2 − y2
∂g
∂x | (x, y) = ⎛⎝ 5, 0⎞⎠ = − 5
9 − ⎛⎝ 5⎞⎠ 2 − (0) 2
= − 5 = − 5 ≈ −1.118.
4 2
The estimate for the partial derivative corresponds to the slope of the secant line passing through the points
⎛ ⎛ ⎞⎞ ⎛ ⎛ ⎞⎞
⎝ 5, 0, g⎝ 5, 0⎠⎠ and ⎝2 2, 0, g⎝2 2, 0⎠⎠. It represents an approximation to the slope of the tangent line to the
⎛
surface through the point ⎝ 5, 0, g⎛⎝ 5, 0⎞⎠⎞⎠, which is parallel to the x-axis.
4.14 Use a contour map to estimate ∂ f /∂ y at point ⎛⎝0, 2⎞⎠ for the function
f (x, y) = x 2 − y 2.
Definition
Let f (x, y, z) be a function of three variables. Then, the partial derivative of f with respect to x, written as ∂ f /∂ x,
or f x, is defined to be
We can calculate a partial derivative of a function of three variables using the same idea we used for a function of two
variables. For example, if we have a function f of x, y, and z, and we wish to calculate ∂ f /∂ x, then we treat the other
two independent variables as if they are constants, then differentiate with respect to x.
Example 4.17
Use the limit definition of partial derivatives to calculate ∂ f /∂ x for the function
Then, find ∂ f /∂ y and ∂ f /∂ z by setting the other two variables constant and differentiating accordingly.
Solution
We first calculate ∂ f /∂ x using Equation 4.14, then we calculate the other two partial derivatives by holding
the remaining variables constant. To use the equation to find ∂ f /∂ x, we first need to calculate f ⎛⎝x + h, y, z⎞⎠:
and recall that f (x, y, z) = x 2 − 3xy + 2y 2 − 4zx + 5yz 2 − 12x + 4y − 3z. Next, we substitute these two
expressions into the equation:
∂f ⎡ x 2 + 2xh + h 2 − 3xy − 3hy + 2y 2 − 4xz − 4hz + 5yz 2 − 12x − 12h + 4y − 3z
= lim ⎢
∂x h → 0⎣ h
x 2 − 3xy + 2y 2 − 4xz + 5yz 2 − 12x + 4y − 3z ⎤⎥
−
h ⎦
⎡ 2xh + h 2 − 3hy − 4hz − 12h ⎤
= lim ⎢ ⎥
h → 0⎣ h ⎦
⎡h⎛2x + h − 3y − 4z − 12⎞⎠ ⎤
= lim ⎣ ⎝ ⎦
h→0 h
= lim ⎛⎝2x + h − 3y − 4z − 12⎞⎠
h→0
= 2x − 3y − 4z − 12.
Then we find ∂ f /∂ y by holding x and z constant. Therefore, any term that does not include the variable y
is constant, and its derivative is zero. We can apply the sum, difference, and power rules for functions of one
variable:
∂ ⎡x 2 − 3xy + 2y 2 − 4xz + 5yz 2 − 12x + 4y − 3z⎤
∂ y⎣ ⎦
∂ ⎡ 2⎤ ∂ ⎡ ⎤ ∂ ⎡ 2⎤ ∂ ∂ ⎡
= ⎣x ⎦ − ⎣3xy⎦ + ⎣2y ⎦ − [4xz] + ⎣5yz 2⎤⎦ − ∂ [12x] + ∂ ⎡⎣4y⎤⎦ − ∂ [3z]
∂y ∂y ∂y ∂y ∂y ∂y ∂y ∂y
= 0 − 3x + 4y − 0 + 5z 2 − 0 + 4 − 0
= −3x + 4y + 5z 2 + 4.
To calculate ∂ f /∂ z, we hold x and y constant and apply the sum, difference, and power rules for functions of
one variable:
∂ ⎡x 2 − 3xy + 2y 2 − 4xz + 5yz 2 − 12x + 4y − 3z⎤
∂ z⎣ ⎡ ⎤ ⎦
= ⎣x ⎦ − ⎣3xy⎦ + ⎣2y ⎦ − [4xz] + ⎣5yz 2⎤⎦ − ∂ [12x] + ∂ ⎡⎣4y⎤⎦ − ∂ [3z]
∂ 2 ∂ ⎡ ⎤ ∂ ⎡ 2⎤ ∂ ∂ ⎡
∂z ∂z ∂z ∂z ∂z ∂z ∂z ∂z
= 0 − 0 + 0 − 4x + 10yz − 0 + 0 − 3
= −4x + 10yz − 3.
376 Chapter 4 | Differentiation of Functions of Several Variables
4.15 Use the limit definition of partial derivatives to calculate ∂ f /∂ x for the function
f (x, y, z) = 2x 2 − 4x 2 y + 2y 2 + 5xz 2 − 6x + 3z − 8.
Then find ∂ f /∂ y and ∂ f /∂ z by setting the other two variables constant and differentiating accordingly.
Example 4.18
Solution
In each case, treat all variables as constants except the one whose partial derivative you are calculating.
∂f ⎡ x 2 y − 4xz + y 2 ⎤
= ∂ ⎢ ⎥
∂x ∂ x⎣ x − 3yz ⎦
∂ ⎛x 2 y − 4xz + y 2⎞⎛x − 3yz⎞ − ⎛x 2 y − 4xz + y 2⎞ ∂ ⎛x − 3yz⎞
= ∂ x⎝ ⎠⎝ ⎠ ⎝ ⎠∂ x ⎝ ⎠
⎛ ⎞2
⎝ x − 3yz⎠
⎛ 2
⎛
2xy − 4z⎞⎠⎛⎝x − 3yz⎞⎠ −
⎝ ⎝x y − 4xz + y 2⎞⎠(1)
a. =
x − 3yz⎞⎠ 2
⎛
⎝
∂f ⎡ x 2 y − 4xz + y 2 ⎤
= ∂ ⎢ ⎥
∂y ∂ y⎣ x − 3yz ⎦
∂ ⎛x 2 y − 4xz + y 2⎞⎛x − 3yz⎞ − ⎛x 2 y − 4xz + y 2⎞ ∂ ⎛x − 3yz⎞
∂ y⎝ ⎠⎝ ⎠ ⎝ ⎠∂ y ⎝ ⎠
= ⎛ ⎞2
⎝x − 3yz⎠
⎛ 2
⎝x + 2y⎞⎠⎛⎝x − 3yz −⎞
⎠
⎛ 2
⎝x y − 4xz + y 2⎞⎠(−3z)
=
x − 3yz⎞⎠ 2
⎛
⎝
x 3 − 3x 2 yz + 2xy − 6y 2 z + 3x 2 yz − 12xz 2 + 3y 2 z
= ⎛ ⎞2
⎝x − 3yz⎠
x 3 + 2xy − 3y 2 z − 12xz 2
= ⎛ ⎞2
⎝x − 3yz⎠
∂f ⎡ x 2 y − 4xz + y 2 ⎤
= ∂⎢ ⎥
∂z ∂ z⎣ x − 3yz ⎦
∂ ⎛x 2 y − 4xz + y 2⎞⎛x − 3yz⎞ − ⎛x 2 y − 4xz + y 2⎞ ∂ ⎛x − 3yz⎞
∂ z⎝ ⎠⎝ ⎠ ⎝ ⎠∂ z ⎝ ⎠
= ⎛ ⎞2
⎝x − 3yz⎠
⎛ 2
⎛
(−4x) x − 3yz −
⎝
⎞
⎠ ⎝x y − 4xz + y 2⎞⎠⎛⎝−3y⎞⎠
=
x − 3yz⎞⎠ 2
⎛
⎝
−4x 2 + 3x 2 y 2 + 3y 3
= ⎛ ⎞2
⎝x − 3yz⎠
∂f
= ∂ ⎡⎣sin⎛⎝x 2 y − z⎞⎠ + cos⎛⎝x 2 − yz⎞⎠⎤⎦
∂x ∂x
= ⎝cos⎛⎝x 2 y − z⎞⎠⎞⎠ ∂ ⎛⎝x 2 y − z⎞⎠ − ⎛⎝sin⎛⎝x 2 − yz⎞⎠⎞⎠ ∂ ⎛⎝x 2 − yz⎞⎠
⎛
∂x ∂x
⎛ 2 ⎞ ⎛ 2 ⎞
= 2xy cos⎝x y − z⎠ − 2x sin⎝x − yz⎠
∂f
= ∂ ⎡⎣sin⎛⎝x 2 y − z⎞⎠ + cos⎛⎝x 2 − yz⎞⎠⎤⎦
∂y ∂y
b. = ⎝cos⎛⎝x 2 y − z⎞⎠⎞⎠ ∂ ⎛⎝x 2 y − z⎞⎠ − ⎛⎝sin⎛⎝x 2 − yz⎞⎠⎞⎠ ∂ ⎛⎝x 2 − yz⎞⎠
⎛
∂y ∂y
= x 2 cos⎛⎝x 2 y − z⎞⎠ + z sin⎛⎝x 2 − yz⎞⎠
∂f
= ∂ ⎡⎣sin⎛⎝x 2 y − z⎞⎠ + cos⎛⎝x 2 − yz⎞⎠⎤⎦
∂z ∂z
= ⎝cos⎛⎝x 2 y − z⎞⎠⎞⎠ ∂ ⎛⎝x 2 y − z⎞⎠ − ⎛⎝sin⎛⎝x 2 − yz⎞⎠⎞⎠ ∂ ⎛⎝x 2 − yz⎞⎠
⎛
∂z ∂z
= −cos⎛⎝x 2 y − z⎞⎠ + y sin⎛⎝x 2 − yz⎞⎠
4.16 Calculate ∂ f /∂ x, ∂ f /∂ y, and ∂ f /∂ z for the function f (x, y, z) = sec⎛⎝x 2 y⎞⎠ − tan⎛⎝x 3 yz 2⎞⎠.
Each of these partial derivatives is a function of two variables, so we can calculate partial derivatives of these functions.
Just as with derivatives of single-variable functions, we can call these second-order derivatives, third-order derivatives, and
so on. In general, they are referred to as higher-order partial derivatives. There are four second-order partial derivatives
for any function (provided they all exist):
∂2 f ⎡∂ f ⎤ ∂2 f ⎡∂ f ⎤ ∂2 f ⎡∂ f ⎤ ∂2 f ⎡∂ f ⎤
= ∂ ⎣ ⎦, = ∂ = ∂ = ∂ ⎣ ⎦.
∂ x∂ y ∂ x ⎣ ∂ y ⎦ ∂ y∂ x ∂ y ⎣ ∂ x ⎦
, ,
∂x 2 ∂ x ∂x ∂y 2 ∂ y ∂y
378 Chapter 4 | Differentiation of Functions of Several Variables
An alternative notation for each is f xx, f yx, f xy, and f yy, respectively. Higher-order partial derivatives calculated with
respect to different variables, such as f xy and f yx, are commonly called mixed partial derivatives.
Example 4.19
Solution
To calculate ∂ 2 f /dx 2 and ∂ 2 f /∂ y∂ x, we first calculate ∂ f /∂ x:
∂f −3y
=e + 2 cos⎛⎝2x − 5y⎞⎠.
∂x
∂2 f ⎡∂ f ⎤
= ∂ ⎣ ⎦
∂x 2 ∂ x ∂x
⎡ −3y ⎤
= ∂ ⎣e + 2 cos⎛⎝2x − 5y⎞⎠⎦
∂x
= −4 sin⎛⎝2x − 5y⎞⎠.
∂2 f ⎡∂ f ⎤
= ∂ ⎣ ⎦
∂ y∂ x ∂y ∂x
⎡ −3y ⎤
= ∂ ⎣e + 2 cos⎛⎝2x − 5y⎞⎠⎦
∂y
−3y
= −3e + 10 sin⎛⎝2x − 5y⎞⎠.
∂f −3y
= −3xe − 5 cos⎛⎝2x − 5y⎞⎠.
∂y
∂2 f ⎡∂ f ⎤
= ∂ ⎣ ⎦
∂ x∂ y ∂x ∂y
⎡ −3y ⎤
= ∂ ⎣−3xe − 5 cos⎛⎝2x − 5y⎞⎠⎦
∂x
−3y
= −3e + 10 sin⎛⎝2x − 5y⎞⎠.
∂2 f ⎡∂ f ⎤
= ∂ ⎣ ⎦
∂ y2 ∂y ∂y
⎡ −3y ⎤
= ∂ ⎣−3xe − 5 cos⎛⎝2x − 5y⎞⎠⎦
∂y
−3y
= 9xe − 25 sin⎛⎝2x − 5y⎞⎠.
4.17 Calculate all four second partial derivatives for the function
f (x, y) = sin⎛⎝3x − 2y⎞⎠ + cos⎛⎝x + 4y⎞⎠.
At this point we should notice that, in both Example 4.19 and the checkpoint, it was true that ∂ 2 f /∂ x∂ y = ∂ 2 f /∂ y∂ x.
Under certain conditions, this is always true. In fact, it is a direct consequence of the following theorem.
Clairaut’s theorem guarantees that as long as mixed second-order derivatives are continuous, the order in which we choose
to differentiate the functions (i.e., which variable goes first, then second, and so on) does not matter. It can be extended to
higher-order derivatives as well. The proof of Clairaut’s theorem can be found in most advanced calculus books.
Two other second-order partial derivatives can be calculated for any function f (x, y). The partial derivative f xx is equal
to the partial derivative of f x with respect to x, and f yy is equal to the partial derivative of f y with respect to y.
Example 4.20
Verify that
u(x, y, t) = 5 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt)
Solution
First, we calculate u tt, u xx, and u yy :
⎡ ⎤
u tt = ∂ ⎣∂ u ⎦
∂t ∂t
= ∂ ⎡⎣5 sin(3πx)sin⎛⎝4πy⎞⎠⎛⎝−10π sin(10πt)⎞⎠⎤⎦
∂t
= ∂ ⎡⎣−50π sin(3πx)sin⎛⎝4πy⎞⎠sin(10πt)⎤⎦
∂t
= −500π 2 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt)
⎡ ⎤
u xx = ∂ ⎣∂ u ⎦
∂x ∂x
= ∂ ⎡⎣15π cos(3πx)sin⎛⎝4πy⎞⎠cos(10πt)⎤⎦
∂x
= −45π 2 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt)
∂ ⎡∂ u ⎤
∂ y ⎣∂ y ⎦
u yy =
Next, we substitute each of these into the right-hand side of Equation 4.20 and simplify:
4⎛⎝u xx + u yy⎞⎠ = 4⎛⎝−45π 2 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt) + − 80π 2 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt)⎞⎠
= 4⎛⎝−125π 2 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt)⎞⎠
= −500π 2 sin(3πx)sin⎛⎝4πy⎞⎠cos(10πt)
= u tt.
⎛ ⎞ ⎛y ⎞
4.18 Verify that u(x, y, t) = 2 sin⎝ x ⎠sin⎝ ⎠e −25t/16 is a solution to the heat equation
3 4
Since the solution to the two-dimensional heat equation is a function of three variables, it is not easy to create a visual
representation of the solution. We can graph the solution for fixed values of t, which amounts to snapshots of the heat
distributions at fixed times. These snapshots show how the heat is distributed over a two-dimensional surface as time
progresses. The graph of the preceding solution at time t = 0 appears in the following figure. As time progresses, the
extremes level out, approaching zero as t approaches infinity.
Figure 4.23
If we consider the heat equation in one dimension, then it is possible to graph the solution over time. The heat equation in
one dimension becomes
u t = c 2 u xx,
where c 2 represents the thermal diffusivity of the material in question. A solution of this differential equation can be written
in the form
2 m2 c2 t (4.22)
u m (x, t) = e −π sin(mπx)
where m is any positive integer. A graph of this solution using m = 1 appears in Figure 4.24, where the initial
temperature distribution over a wire of length 1 is given by u(x, 0) = sin πx. Notice that as time progresses, the wire
cools off. This is seen because, from left to right, the highest temperature (which occurs in the middle of the wire) decreases
and changes color from red to blue.
382 Chapter 4 | Differentiation of Functions of Several Variables
Figure 4.24 Graph of a solution of the heat equation in one dimension over time.
Figure 4.25 (a) William Thomson (Lord Kelvin), 1824-1907, was a British physicist and
electrical engineer; (b) Kelvin used the heat diffusion equation to estimate the age of Earth
(credit: modification of work by NASA).
During the late 1800s, the scientists of the new field of geology were coming to the conclusion that Earth must be
“millions and millions” of years old. At about the same time, Charles Darwin had published his treatise on evolution.
Darwin’s view was that evolution needed many millions of years to take place, and he made a bold claim that the
Weald chalk fields, where important fossils were found, were the result of 300 million years of erosion.
At that time, eminent physicist William Thomson (Lord Kelvin) used an important partial differential equation, known
as the heat diffusion equation, to estimate the age of Earth by determining how long it would take Earth to cool from
molten rock to what we had at that time. His conclusion was a range of 20 to 400 million years, but most likely
about 50 million years. For many decades, the proclamations of this irrefutable icon of science did not sit well with
geologists or with Darwin.
Read Kelvin’s paper (http://www.openstaxcollege.org/l/20_KelEarthAge) on estimating the age of
the Earth.
Kelvin made reasonable assumptions based on what was known in his time, but he also made several assumptions
that turned out to be wrong. One incorrect assumption was that Earth is solid and that the cooling was therefore
via conduction only, hence justifying the use of the diffusion equation. But the most serious error was a forgivable
one—omission of the fact that Earth contains radioactive elements that continually supply heat beneath Earth’s mantle.
The discovery of radioactivity came near the end of Kelvin’s life and he acknowledged that his calculation would have
to be modified.
Kelvin used the simple one-dimensional model applied only to Earth’s outer shell, and derived the age from graphs
and the roughly known temperature gradient near Earth’s surface. Let’s take a look at a more appropriate version of
the diffusion equation in radial coordinates, which has the form
∂ T = K ⎡∂ 2 T + 2 ∂ T ⎤. (4.23)
∂t ⎣ ∂2 r r ∂ r ⎦
384 Chapter 4 | Differentiation of Functions of Several Variables
Here, T(r, t) is temperature as a function of r (measured from the center of Earth) and time t. K is the heat
conductivity—for molten rock, in this case. The standard method of solving such a partial differential equation is by
separation of variables, where we express the solution as the product of functions containing each variable separately.
In this case, we would write the temperature as
T(r, t) = R(r) f (t).
1. Substitute this form into Equation 4.13 and, noting that f (t) is constant with respect to distance (r) and
R(r) is constant with respect to time (t), show that
1 ∂ f = K ⎡∂ 2 R + 2 ∂ R ⎤.
f ∂t R⎣ ∂r2 r ∂r ⎦
2. This equation represents the separation of variables we want. The left-hand side is only a function of t and
the right-hand side is only a function of r, and they must be equal for all values of r and t. Therefore, they
both must be equal to a constant. Let’s call that constant −λ 2. (The convenience of this choice is seen on
substitution.) So, we have
1 ∂ f = −λ 2 K ⎡∂ 2 R + 2 ∂ R ⎤ = −λ 2.
R⎣ ∂r2 r ∂r ⎦
and
f ∂t
2
Now, we can verify through direct substitution for each equation that the solutions are f (t) = Ae −λ t and
⎛ ⎞ 2t
R(r) = B⎝sinrαr ⎠ + C⎛⎝ cosr αr ⎞⎠, where α = λ/ K. Note that f (t) = Ae +λn is also a valid solution, so we
could have chosen +λ 2 for our constant. Can you see why it would not be valid for this case as time increases?
3. Let’s now apply boundary conditions.
a. The temperature must be finite at the center of Earth, r = 0. Which of the two constants, B or C,
must therefore be zero to keep R finite at r = 0 ? (Recall that sin(αr)/r → α = as r → 0, but
cos(αr)/r behaves very differently.)
b. Kelvin argued that when magma reaches Earth’s surface, it cools very rapidly. A person can often touch
the surface within weeks of the flow. Therefore, the surface reached a moderate temperature very early
and remained nearly constant at a surface temperature T s. For simplicity, let’s set T = 0 at r = R E
and find α such that this is the temperature there for all time t. (Kelvin took the value to be
300 K ≈ 80°F. We can add this 300 K constant to our solution later.) For this to be true, the sine
argument must be zero at r = R E. Note that α has an infinite series of values that satisfies this
condition. Each value of α represents a valid solution (each with its own value for A). The total or
general solution is the sum of all these solutions.
c. At t = 0, we assume that all of Earth was at an initial hot temperature T 0 (Kelvin took this to be
about 7000 K.) The application of this boundary condition involves the more advanced application of
Fourier coefficients. As noted in part b. each value of α n represents a valid solution, and the general
solution is a sum of all these solutions. This results in a series solution:
⎛T R ⎞ (−1) n − 1 −λn 2 t sin(α n r)
T(r, t) = ⎝ 0π E ⎠∑ n e r , where α n = nπ/R E.
n
n−1
Note how the values of α n come from the boundary condition applied in part b. The term −1 n is the constant
A n for each term in the series, determined from applying the Fourier method. Letting β = π , examine the first
RE
few terms of this solution shown here and note how λ 2 in the exponential causes the higher terms to decrease quickly
as time progresses:
Near time t = 0, many terms of the solution are needed for accuracy. Inserting values for the conductivity K and
β = π/R E for time approaching merely thousands of years, only the first few terms make a significant contribution.
Kelvin only needed to look at the solution near Earth’s surface (Figure 4.26) and, after a long time, determine what
time best yielded the estimated temperature gradient known during his era (1°F increase per 50 ft). He simply chose
a range of times with a gradient close to this value. In Figure 4.26, the solutions are plotted and scaled, with the
300 − K surface temperature added. Note that the center of Earth would be relatively cool. At the time, it was thought
Earth must be solid.
Figure 4.26 Temperature versus radial distance from the center of Earth. (a) Kelvin’s results, plotted to scale. (b) A
close-up of the results at a depth of 4.0 mi below Earth’s surface.
Epilog
On May 20, 1904, physicist Ernest Rutherford spoke at the Royal Institution to announce a revised calculation that
included the contribution of radioactivity as a source of Earth’s heat. In Rutherford’s own words:
“I came into the room, which was half-dark, and presently spotted Lord Kelvin in the audience, and realised that I was
in for trouble at the last part of my speech dealing with the age of the Earth, where my views conflicted with his. To
my relief, Kelvin fell fast asleep, but as I came to the important point, I saw the old bird sit up, open an eye and cock a
baleful glance at me.
Then a sudden inspiration came, and I said Lord Kelvin had limited the age of the Earth, provided no new source [of
heat] was discovered. That prophetic utterance referred to what we are now considering tonight, radium! Behold! The
old boy beamed upon me.”
Rutherford calculated an age for Earth of about 500 million years. Today’s accepted value of Earth’s age is about 4.6
billion years.
386 Chapter 4 | Differentiation of Functions of Several Variables
4.3 EXERCISES
For the following exercises, calculate the partial derivative 125. Let z = tan(2x − y). Find ∂ z and ∂ z .
using the limit definitions only. ∂x ∂y
∂ z for z = x 2 − 3xy + y 2
112.
∂x 126. Let z = sinh⎛⎝2x + 3y⎞⎠. Find ∂ z and ∂ z .
∂x ∂y
∂2 f
130. Given f (x, y, z) = x 3 yz 2, find and
∂ x∂ y
136. f yx for z = ln(x − y) 150. Show that f (x, y) = ln⎛⎝x 2 + y 2⎞⎠ solves Laplace’s
2 2
2 2 equation ∂ z2 + ∂ z2 = 0.
137. Let z = x 2 + 3xy + 2y 2. Find ∂ z2 and ∂ z2 . ∂x ∂y
∂x ∂y
Find f xyz.
156. The function P(T, V) = nRT gives the pressure at
V
3 2 2 3 a point in a gas as a function of temperature T and volume
144. Let F(x, y, z) = x yz − 2x yz + 3xz − 2y z.
Find F xyz. V. The letters n and R are constants. Find ∂ P and ∂ P ,
∂V ∂T
and explain what these quantities represent.
145. Given f (x, y) = x 2 + x − 3xy + y 3 − 5, find all
points at which f x = f y = 0 simultaneously. 157. The equation for heat flow in the xy-plane is
∂f ∂2 f ∂2 f
= + . Show that
146. Given f (x, y) = 2x 2 + 2xy + y 2 + 2x − 3, find ∂t ∂ x2 ∂ y2
∂f ∂f f (x, y, t) = e −2t sin x sin y is a solution.
all points at which = 0 and = 0 simultaneously.
∂x ∂y
158. The basic wave equation is f tt = f xx. Verify that
147. Given f (x, y) = y 3 − 3yx 2 − 3y 2 − 3x 2 + 1, find f (x, t) = sin(x + t) and f (x, t) = sin(x − t) are
all points on f at which f x = f y = 0 simultaneously. solutions.
148. Given f (x, y) = 15x 3 − 3xy + 15y 3, find all 159. The law of cosines can be thought of as a function
of three variables. Let x, y, and θ be two sides of any
points at which f x(x, y) = f y(x, y) = 0 simultaneously.
triangle where the angle θ is the included angle between
149. Show that z = e x sin y satisfies the equation the two sides. Then, F(x, y, θ) = x 2 + y 2 − 2xy cos θ
In this section, we consider the problem of finding the tangent plane to a surface, which is analogous to finding the equation
of a tangent line to a curve when the curve is defined by the graph of a function of one variable, y = f (x). The slope of the
tangent line at the point x = a is given by m = f ′(a); what is the slope of a tangent plane? We learned about the equation
of a plane in Equations of Lines and Planes in Space; in this section, we see how it can be applied to the problem at
hand.
Tangent Planes
Intuitively, it seems clear that, in a plane, only one line can be tangent to a curve at a point. However, in three-dimensional
space, many lines can be tangent to a given point. If these lines lie in the same plane, they determine the tangent plane at
that point. A tangent plane at a regular point contains all of the lines tangent to that point. A more intuitive way to think of
a tangent plane is to assume the surface is smooth at that point (no corners). Then, a tangent line to the surface at that point
in any direction does not have any abrupt changes in slope because the direction changes smoothly.
Definition
Let P 0 = (x 0, y 0, z 0) be a point on a surface S, and let C be any curve passing through P 0 and lying entirely in
S. If the tangent lines to all such curves C at P 0 lie in the same plane, then this plane is called the tangent plane to
S at P 0 (Figure 4.27).
390 Chapter 4 | Differentiation of Functions of Several Variables
Figure 4.27 The tangent plane to a surface S at a point P 0 contains all the tangent lines
to curves in S that pass through P 0.
For a tangent plane to a surface to exist at a point on that surface, it is sufficient for the function that defines the surface to
be differentiable at that point. We define the term tangent plane here and then explore the idea intuitively.
Definition
Let S be a surface defined by a differentiable function z = f (x, y), and let P 0 = (x 0, y 0) be a point in the domain
of f . Then, the equation of the tangent plane to S at P 0 is given by
To see why this formula is correct, let’s first find two tangent lines to the surface S. The equation of the tangent
line to the curve that is represented by the intersection of S with the vertical trace given by x = x 0 is
z = f (x 0, y 0) + f y (x 0, y 0)(y − y 0). Similarly, the equation of the tangent line to the curve that is represented by the
intersection of S with the vertical trace given by y = y 0 is z = f (x 0, y 0) + f x (x 0, y 0)(x − x 0). A parallel vector to the
first tangent line is a = j + f y (x 0, y 0)k; a parallel vector to the second tangent line is b = i + f x (x 0, y 0)k. We can take
the cross product of these two vectors:
| |
i j k
= 0 1 f y (x 0, y 0)
1 0 f x (x 0, y 0)
= f x (x 0, y 0)i + f y (x 0, y 0)j − k.
This vector is perpendicular to both lines and is therefore perpendicular to the tangent plane. We can use this vector as a
normal vector to the tangent plane, along with the point P 0 = ⎛⎝x 0, y 0, f (x 0, y 0)⎞⎠ in the equation for a plane:
Example 4.21
Solution
First, we must calculate f x (x, y) and f y (x, y), then use Equation 4.24 with x 0 = 2 and y 0 = −1:
f x (x, y) = 4x − 3y + 2
f y (x, y) = −3x + 16y − 4
f (2, −1) = 2(2) 2 − 3(2)(−1) + 8(−1) 2 + 2(2) − 4(−1) + 4 = 34.
f x (2, −1) = 4(2) − 3(−1) + 2 = 13
f y (2, −1) = −3(2) + 16(−1) − 4 = −26.
Figure 4.28 Calculating the equation of a tangent plane to a given surface at a given point.
4.19 Find the equation of the tangent plane to the surface defined by the function
f (x, y) = x 3 − x 2 y + y 2 − 2x + 3y − 2 at point (−1, 3).
Example 4.22
Find the equation of the tangent plane to the surface defined by the function f (x, y) = sin(2x)cos⎛⎝3y⎞⎠ at the point
(π/3, π/4).
Solution
First, calculate f x (x, y) and f y (x, y), then use Equation 4.24 with x 0 = π/3 and y 0 = π/4:
f x (x, y) = 2 cos(2x)cos⎛⎝3y⎞⎠
f y (x, y) = −3 sin(2x)sin⎛⎝3y⎞⎠
⎛ ⎞ ⎛ ⎛ ⎞⎞ ⎛ ⎛ ⎞⎞ ⎛ ⎞⎛ ⎞
f ⎝π , π ⎠ = sin⎝2⎝π ⎠⎠cos⎝3⎝π ⎠⎠ = ⎝ 3 ⎠⎝− 2 ⎠ = − 6
3 4 3 4 2 2 4
⎛ ⎞ ⎛ ⎛ ⎞⎞ ⎛ ⎛ ⎞⎞ ⎛ ⎞⎛ ⎞
f x ⎝π , π ⎠ = 2 cos⎝2⎝π ⎠⎠cos⎝3⎝π ⎠⎠ = 2⎝− 1 ⎠⎝− 2 ⎠ = 2
3 4 3 4 2 2 2
⎛ ⎞ ⎛ ⎛ ⎞⎞ ⎛ ⎛ ⎞⎞ ⎛ ⎞⎛ ⎞
f y ⎝π , π ⎠ = −3 sin⎝2⎝π ⎠⎠sin⎝3⎝π ⎠⎠ = −3⎝ 3 ⎠⎝ 2 ⎠ = − 3 6 .
3 4 3 4 2 2 4
z = f (x 0, y 0) + f x (x 0, y 0)(x − x 0) + f y (x 0, y 0)(y − y 0)
⎛ ⎞ ⎛ ⎞
z = − 6 + 2 ⎝x − π ⎠ − 3 6 ⎝y − π ⎠
4 2 3 4 4
z = 2 x − 3 6 y − 6 − π 2 + 3π 6 .
2 4 4 6 16
A tangent plane to a surface does not always exist at every point on the surface. Consider the function
⎧ xy (x, y) ≠ (0, 0)
f (x, y) = ⎨ x 2 + y 2
⎩0 (x, y) = (0, 0).
Figure 4.29 Graph of a function that does not have a tangent plane at the
origin.
If either x = 0 or y = 0, then f (x, y) = 0, so the value of the function does not change on either the x- or y-axis.
Therefore, f x (x, 0) = f y ⎛⎝0, y⎞⎠ = 0, so as either x or y approach zero, these partial derivatives stay equal to zero.
Substituting them into Equation 4.24 gives z = 0 as the equation of the tangent line. However, if we approach the origin
from a different direction, we get a different story. For example, suppose we approach the origin along the line y = x. If
we put y = x into the original function, it becomes
x(x) 2
f (x, x) = = x = | x| .
2
x + (x) 2
2x 2 2
When x > 0, the slope of this curve is equal to 2/2; when x < 0, the slope of this curve is equal to −⎛⎝ 2/2⎞⎠. This
presents a problem. In the definition of tangent plane, we presumed that all tangent lines through point P (in this case, the
origin) lay in the same plane. This is clearly not the case here. When we study differentiable functions, we will see that this
function is not differentiable at the origin.
Linear Approximations
Recall from Linear Approximations and Differentials (http://cnx.org/content/m53605/latest/) that the formula
394 Chapter 4 | Differentiation of Functions of Several Variables
The diagram for the linear approximation of a function of one variable appears in the following graph.
The tangent line can be used as an approximation to the function f (x) for values of x reasonably close to x = a. When
working with a function of two variables, the tangent line is replaced by a tangent plane, but the approximation idea is much
the same.
Definition
Given a function z = f (x, y) with continuous partial derivatives that exist at the point (x 0, y 0), the linear
approximation of f at the point (x 0, y 0) is given by the equation
Notice that this equation also represents the tangent plane to the surface defined by z = f (x, y) at the point (x 0, y 0). The
idea behind using a linear approximation is that, if there is a point (x 0, y 0) at which the precise value of f (x, y) is known,
then for values of (x, y) reasonably close to (x 0, y 0), the linear approximation (i.e., tangent plane) yields a value that
is also reasonably close to the exact value of f (x, y) (Figure 4.31). Furthermore the plane that is used to find the linear
approximation is also the tangent plane to the surface at the point (x 0, y 0).
Example 4.23
Given the function f (x, y) = 41 − 4x 2 − y 2, approximate f (2.1, 2.9) using point (2, 3) for (x 0, y 0).
What is the approximate value of f (2.1, 2.9) to four decimal places?
Solution
To apply Equation 4.25, we first must calculate f (x 0, y 0), f x (x 0, y 0), and f y (x 0, y 0) using x 0 = 2 and
y 0 = 3:
4.20 5 − 2x + 3y
Given the function f (x, y) = e , approximate f (4.1, 0.9) using point (4, 1) for (x 0, y 0).
What is the approximate value of f (4.1, 0.9) to four decimal places?
Differentiability
When working with a function y = f (x) of one variable, the function is said to be differentiable at a point x = a if f ′ (a)
exists. Furthermore, if a function of one variable is differentiable at a point, the graph is “smooth” at that point (i.e., no
corners exist) and a tangent line is well-defined at that point.
The idea behind differentiability of a function of two variables is connected to the idea of smoothness at that point. In this
case, a surface is considered to be smooth at point P if a tangent plane to the surface exists at that point. If a function is
differentiable at a point, then a tangent plane to the surface exists at that point. Recall the formula for a tangent plane at a
point (x 0, y 0) is given by
For a tangent plane to exist at the point (x 0, y 0), the partial derivatives must therefore exist at that point. However, this is
not a sufficient condition for smoothness, as was illustrated in Figure 4.29. In that case, the partial derivatives existed at
the origin, but the function also had a corner on the graph at the origin.
Definition
A function f (x, y) is differentiable at a point P(x 0, y 0) if, for all points (x, y) in a δ disk around P, we can
write
f (x, y) = f (x 0, y 0) + f x (x 0, y 0)(x − x 0) + f y (x 0, y 0)(y − y 0) + E(x, y), (4.26)
The last term in Equation 4.26 is referred to as the error term and it represents how closely the tangent plane comes to the
surface in a small neighborhood (δ disk) of point P. For the function f to be differentiable at P, the function must be
smooth—that is, the graph of f must be close to the tangent plane for points near P.
Example 4.24
Demonstrating Differentiability
Solution
First, we calculate f (x 0, y 0), f x (x 0, y 0), and f y (x 0, y 0) using x 0 = 2 and y 0 = −3, then we use Equation
4.26:
f (2, −3) = 2(2) 2 − 4(−3) = 8 + 12 = 20
f x (2, −3) = 4(2) = 8
f y (2, −3) = −4.
E(x, y)
Next, we calculate lim :
(x, y) → (x 0, y 0) (x − x 0) 2 + (y − y 0) 2
lim
E(x, y)
= lim 2x 2 − 8x + 8
(x, y) → (x 0, y 0) (x − x 0) 2 + (y − y 0) 2 (x, y) → (2, −3) (x − 2) 2 + ⎛⎝y + 3⎞⎠ 2
2⎛⎝x 2 − 4x + 4⎞⎠
= lim
(x, y) → (2, −3) (x − 2) 2 + ⎛⎝y + 3⎞⎠ 2
2(x − 2) 2
= lim
(x, y) → (2, −3) (x − 2) 2 + ⎛⎝y + 3⎞⎠ 2
2⎛⎝(x − 2) 2 + ⎛⎝y + 3⎞⎠ 2⎞⎠
≤ lim
(x, y) → (2, −3) (x − 2) 2 + ⎛⎝y + 3⎞⎠ 2
= lim 2 (x − 2) 2 + ⎛⎝y + 3⎞⎠ 2
(x, y) → (2, −3)
= 0.
Since E(x, y) ≥ 0 for any value of x or y, the original limit must be equal to zero. Therefore,
2
f (x, y) = 2x − 4y is differentiable at point (2, −3).
4.21 Show that the function f (x, y) = 3x − 4y 2 is differentiable at point (−1, 2).
398 Chapter 4 | Differentiation of Functions of Several Variables
⎧ xy (x, y) ≠ (0, 0)
The function f (x, y) = ⎨ x 2 + y 2 is not differentiable at the origin. We can see this by calculating
⎩0 (x, y) = (0, 0)
the partial derivatives. This function appeared earlier in the section, where we showed that f x (0, 0) = f y (0, 0) = 0.
Substituting this information into Equation 4.26 using x 0 = 0 and y 0 = 0, we get
E(x, y)
Calculating lim gives
(x, y) → (x 0, y 0) (x − x 0) 2 + (y − y 0) 2
xy
E(x, y) x + y2
2
lim = lim
(x, y) → (x 0, y 0) (x − x 0) 2 + (y − y 0) 2 (x, y) → (0, 0) x 2 + y 2
xy
= lim .
(x, y) → (0, 0) x 2 + y 2
Depending on the path taken toward the origin, this limit takes different values. Therefore, the limit does not exist and the
function f is not differentiable at the origin as shown in the following figure.
Differentiability and continuity for functions of two or more variables are connected, the same as for functions of one
variable. In fact, with some adjustments of notation, the basic theorem is the same.
Differentiability Implies Continuity shows that if a function is differentiable at a point, then it is continuous there.
However, if a function is continuous at a point, then it is not necessarily differentiable at that point. For example,
⎧ xy (x, y) ≠ (0, 0)
f (x, y) = ⎨ x 2 + y 2
⎩0 (x, y) = (0, 0)
is continuous at the origin, but it is not differentiable at the origin. This observation is also similar to the situation in single-
variable calculus.
Continuity of First Partials Implies Differentiability further explores the connection between continuity and
differentiability at a point. This theorem says that if the function and its partial derivatives are continuous at a point, the
function is differentiable.
was not differentiable at the origin. Let’s calculate the partial derivatives f x and f y :
∂f y3 ∂f x3
= and = .
∂x ⎛ 2 ⎞
3/2 ∂y ⎛ 2
3/2
⎝x + y 2⎠ ⎝x + y 2⎞⎠
The contrapositive of the preceding theorem states that if a function is not differentiable, then at least one of the hypotheses
must be false. Let’s explore the condition that f x (0, 0) must be continuous. For this to be true, it must be true that
lim f x (0, 0) = f x (0, 0):
(x, y) → (0, 0)
y3
lim f x (x, y) = lim 3/2
.
(x, y) → (0, 0) (x, y) → (0, 0) ⎛ 2 2⎞
⎝x +y ⎠
y3 y3
lim 3/2
= lim 3/2
(x, y) → (0, 0) ⎛ 2
+ y 2⎞⎠
y → 0 ⎛⎛
⎝x ⎝⎝ky⎠
⎞ 2
+ y 2⎞⎠
y3
= lim 3/2
y → 0⎛ 2 2
⎝k y + y 2⎞⎠
y3
= lim 3/2
y→0 ⎛ ⎞
|y| 3 ⎝k 2 + 1⎠
y
= 1 lim | | .
3/2 y → 0 y
⎛ 2 ⎞
⎝k + 1⎠
3/2 ⎛ 3/2⎞
If y > 0, then this expression equals 1/⎛⎝k 2 + 1⎞⎠ ; if y < 0, then it equals − 1/⎛⎝k 2 + 1⎞⎠
⎝ ⎠. In either case, the value
400 Chapter 4 | Differentiation of Functions of Several Variables
Differentials
In Linear Approximations and Differentials (http://cnx.org/content/m53605/latest/) we first studied the concept
of differentials. The differential of y, written dy, is defined as f ′ (x)dx. The differential is used to approximate
Δy = f (x + Δx) − f (x), where Δx = dx. Extending this idea to the linear approximation of a function of two variables
at the point (x 0, y 0) yields the formula for the total differential for a function of two variables.
Definition
Let z = f (x, y) be a function of two variables with (x 0, y 0) in the domain of f , and let Δx and Δy be chosen so
that ⎛⎝x 0 + Δx, y 0 + Δy⎞⎠ is also in the domain of f . If f is differentiable at the point (x 0, y 0), then the differentials
dx and dy are defined as
dx = Δx and dy = Δy.
The differential dz, also called the total differential of z = f (x, y) at (x 0, y 0), is defined as
Notice that the symbol ∂ is not used to denote the total differential; rather, d appears in front of z. Now, let’s define
Δz = f ⎛⎝x + Δx, y + Δy⎞⎠ − f (x, y). We use dz to approximate Δz, so
Δz ≈ dz = f x (x 0, y 0)dx + f y (x 0, y 0)dy.
Therefore, the differential is used to approximate the change in the function z = f (x 0, y 0) at the point (x 0, y 0) for
given values of Δx and Δy. Since Δz = f ⎛⎝x + Δx, y + Δy⎞⎠ − f (x, y), this can be used further to approximate
f ⎛⎝x + Δx, y + Δy⎞⎠:
One such application of this idea is to determine error propagation. For example, if we are manufacturing a gadget and are
off by a certain amount in measuring a given quantity, the differential can be used to estimate the error in the total volume
of the gadget.
Example 4.25
Approximation by Differentials
Find the differential dz of the function f (x, y) = 3x 2 − 2xy + y 2 and use it to approximate Δz at point
(2, −3). Use Δx = 0.1 and Δy = −0.05. What is the exact value of Δz ?
Solution
First, we must calculate f (x 0, y 0), f x (x 0, y 0), and f y (x 0, y 0) using x 0 = 2 and y 0 = −3:
This is the approximation to Δz = f ⎛⎝x 0 + Δx, y 0 + Δy⎞⎠ − f (x 0, y 0). The exact value of Δz is given by
4.22 Find the differential dz of the function f (x, y) = 4y 2 + x 2 y − 2xy and use it to approximate Δz at
point (1, −1). Use Δx = 0.03 and Δy = −0.02. What is the exact value of Δz ?
Definition
A function f (x, y, z) is differentiable at a point P(x 0, y 0, z 0) if for all points (x, y, z) in a δ disk around P we
can write
f (x, y) = f (x 0, y 0, z 0) + f x (x 0, y 0, z 0)(x − x 0) + f y (x 0, y 0, z 0)(y − y 0) (4.28)
+ f z (x 0, y 0, z 0)(z − z 0) + E(x, y, z),
E(x, y, z)
lim = 0.
(x, y, z) → (x 0, y 0, z 0) 2
(x − x 0) + (y − y 0) 2 + (z − z 0) 2
If a function of three variables is differentiable at a point (x 0, y 0, z 0), then it is continuous there. Furthermore, continuity
of first partial derivatives at that point guarantees differentiability.
4.4 EXERCISES
For the following exercises, find a unit normal vector to the 181. z = x 2 − 2xy + y 2, P(1, 2, 1)
surface at the indicated point.
For the following exercises, find parametric equations for
163. f (x, y) = x 3, (2, −1, 8) the normal line to the surface at the indicated point. (Recall
that to find the equation of a line in space, you need a
⎛ ⎞
x = 0 when x = y = 1 point on the line, P 0 (x 0, y 0, z 0), and a vector
164. ln⎝ y − z⎠
n = 〈 a, b, c 〉 that is parallel to the line. Then the
For the following exercises, as a useful review for equation of the line is
techniques used in this section, find a normal vector and a x − x 0 = at, y − y 0 = bt, z − z 0 = ct.)
tangent vector at point P.
182. −3x + 9y + 4z = −4, P(1, −1, 2)
165. x 2 + xy + y 2 = 3, P(−1, −1)
2
183. z = 5x 2 − 2y 2, P(2, 1, 18)
⎛ 2 2⎞
166. ⎝x +y ⎠ = 9⎛⎝x 2 2⎞
− y ⎠, P( 2, 1)
184. x 2 − 8xyz + y 2 + 6z 2 = 0, P(1, 1, 1)
167. xy 2 − 2x 2 + y + 5x = 6, P(4, 2)
185. z = ln⎛⎝3x 2 + 7y 2 + 1⎞⎠, P(0, 0, 0)
168. 2x 3 − x 2 y 2 = 3x − y − 7, P(1, −2)
4x 2 + 6y 2
186. z = e , P(0, 0, 1)
x2 − y2
169. ze − 3 = 0, P(2, 2, 3)
187. z = x 2 − 2xy + y 2 at point P(1, 2, 1)
For the following exercises, find the equation for the
tangent plane to the surface at the indicated point. (Hint: For the following exercises, use the figure shown here.
Solve for z in terms of x and y.)
7x 2 + 4y 2
174. z = e , P(0, 0, 1)
176. x 2 + 4y 2 = z 2, P(3, 2, 5)
⎛ ⎞
177. x 3 + y 3 = 3xyz, P⎝1, 2, 3 ⎠
2
188. The length of line segment AC is equal to what
178. z = axy, P⎛⎝1, 1 , 1⎞
a ⎠
mathematical expression?
For the following exercises, complete each task. 200. The centripetal acceleration of a particle moving
2
xy in a circle is given by a(r, v) = vr , where v is the
191. Show that f (x, y) = e x is differentiable at point
(1, 0). velocity and r is the radius of the circle. Approximate
the maximum percent error in measuring the acceleration
192. Find the total differential of the function resulting from errors of 3% in v and 2% in r. (Recall
w = e y cos(x) + z 2. that the percentage error is the ratio of the amount of error
over the original amount. So, in this case, the percentage
error in a is given by da
a .)
193. Show that f (x, y) = x 2 + 3y is differentiable at
every point. In other words, show that
201. The radius r and height h of a right circular
Δz = f (x + Δx, y + Δy) − f (x, y) = f x Δx + f y Δy + ε 1 Δx + ε 2 Δy,
cylinder are measured with possible errors of 4% and 5%,
⎛
where both ε 1 and ε 2 approach zero as Δx, Δy⎞⎠
⎝
respectively. Approximate the maximum possible
approaches (0, 0). percentage error in measuring the volume (Recall that the
percentage error is the ratio of the amount of error over the
xy original amount. So, in this case, the percentage error in V
194. Find the total differential of the function z = y + x
is given by dV .)
where x changes from 10 to 10.5 and y changes from V
15 to 13.
202. The base radius and height of a right circular cone
y are measured as 10 in. and 25 in., respectively, with
195. Let z = f (x, y) = xe . Compute Δz from P(1, 2)
a possible error in measurement of as much as 0.1 in.
to Q(1.05, 2.1) and then find the approximate change
each. Use differentials to estimate the maximum error in the
in z from point P to point Q. Recall calculated volume of the cone.
Δz = f (x + Δx, y + Δy) − f (x, y), and dz and Δz are
203. The electrical resistance R produced by wiring
approximately equal.
resistors R 1 and R 2 in parallel can be calculated from the
196. The volume of a right circular cylinder is given by formula 1 = 1 + 1 . If R 1 and R 2 are measured to be
V(r, h) = πr 2 h. Find the differential dV. Interpret the R R1 R2
formula geometrically. 7Ω and 6Ω, respectively, and if these measurements are
accurate to within 0.05Ω, estimate the maximum possible
197. See the preceding problem. Use differentials to
error in computing R. (The symbol Ω represents an ohm,
estimate the amount of aluminum in an enclosed aluminum
can with diameter 8.0 cm and height 12 cm if the the unit of electrical resistance.)
aluminum is 0.04 cm thick. 204. The area of an ellipse with axes of length 2a and
2b is given by the formula A = πab. Approximate the
198. Use the differential dz to approximate the change in
percent change in the area when a increases by 2% and
z = 4 − x 2 − y 2 as (x, y) moves from point (1, 1) to b increases by 1.5%.
point (1.01, 0.97). Compare this approximation with the
actual change in the function. 205. The period T of a simple pendulum with small
oscillations is calculated from the formula T = 2π L
g,
199. Let z = f (x, y) = x 2 + 3xy − y 2. Find the exact
where L is the length of the pendulum and g is the
change in the function and the approximate change in the
function as x changes from 2.00 to 2.05 and y changes acceleration resulting from gravity. Suppose that L and g
from 3.00 to 2.96. have errors of, at most, 0.5% and 0.1%, respectively.
Use differentials to approximate the maximum percentage
error in the calculated value of T.
2
206. Electrical power P is given by P = V , where
R
V is the voltage and R is the resistance. Approximate
the maximum percentage error in calculating power if 120
V is applied to a 2000 − Ω resistor and the possible
percent errors in measuring V and R are 3% and 4%,
respectively.
213. [T] Find the equation for the tangent plane to the
surface at the indicated point, and graph the surface and the
tangent plane: z = ln(10x 2 + 2y 2 + 1), P(0, 0, 0).
In single-variable calculus, we found that one of the most useful differentiation rules is the chain rule, which allows us to
find the derivative of the composition of two functions. The same thing is true for multivariable calculus, but this time we
have to deal with more than one form of the chain rule. In this section, we study extensions of the chain rule and learn how
to take derivatives of compositions of functions of more than one variable.
dz = ∂ z · dx + ∂ z · dy , (4.29)
dt ∂ x dt ∂ y dt
where the ordinary derivatives are evaluated at t and the partial derivatives are evaluated at (x, y).
Proof
The proof of this theorem uses the definition of differentiability of a function of two variables. Suppose that f is
differentiable at the point P(x 0, y 0), where x 0 = g(t 0) and y 0 = h(t 0) for a fixed value of t 0. We wish to prove that
z = f ⎛⎝x(t), y(t)⎞⎠ is differentiable at t = t 0 and that Equation 4.29 holds at that point as well.
E(x, y)
where lim = 0. We then subtract z 0 = f (x 0, y 0) from both sides of this equation:
(x, y) → (x 0, y 0) (x − x 0) 2 + (y − y 0) 2
E⎛⎝x(t), y(t)⎞⎠
+ lim t − t0 .
t → t0
⎛ ⎞ ⎛ (x − x ) 2 + (y − y ) 2 ⎞
= lim ⎜ ⎟ lim ⎜ 0 ⎟.
E(x, y) 0
⎝ (x − x 0) 2 + (y − y 0) 2 ⎠t → t 0⎝ ⎠
t → t0 t − t0
As t approaches t 0, x(t), y(t)⎞⎠ approaches ⎛⎝x(t 0), y(t 0)⎞⎠, so we can rewrite the last product as
⎛
⎝
⎛ ⎞ ⎛ (x − x ) 2 + (y − y ) 2 ⎞
lim ⎜ E(x, y)
⎟ lim ⎜ 0 0 ⎟.
(x, y) → (x 0, y 0)⎝ (x − x ) 2 + (y − y ) 2 ⎠(x, y) → (x 0, y 0)⎝ t − t0 ⎠
0 0
Since the first limit is equal to zero, we need only show that the second limit is finite:
⎛ (x − x ) 2 + (y − y ) 2 ⎞ ⎛ (x − x ) 2 + (y − y ) 2 ⎞
lim ⎜ 0 0 ⎟ = lim ⎜ 0 0 ⎟
(x, y) → (x 0, y 0)⎝ t − t 0 ⎠ (x, y) → (x 0 0⎝
, y ) (t − t ) 2
⎠
0
⎛ x−x 2 y − y 2⎞
= lim ⎜ ⎛⎝ t − t 0 ⎞⎠ + ⎛⎝ t − t 0 ⎞⎠ ⎟
(x, y) → (x 0, y 0)⎝ 0 0 ⎠
⎛ ⎛ x − x 0 ⎞⎞ ⎛ ⎛ y − y 0 ⎞⎞
2 2
=
⎝(x, y) →lim(x 0, y 0)⎝ t − t 0 ⎠⎠ + ⎝(x, y) →lim(x 0, y 0)⎝ t − t 0 ⎠⎠ .
Since x(t) and y(t) are both differentiable functions of t, both limits inside the last radical exist. Therefore, this value
is finite. This proves the chain rule at t = t 0; the rest of the theorem follows from the assumption that all functions are
differentiable over their entire domains.
□
∂ f dx
Closer examination of Equation 4.29 reveals an interesting pattern. The first term in the equation is · and the
∂ x dt
∂ f dy
second term is · . Recall that when multiplying fractions, cancelation can be used. If we treat these derivatives
∂ y dt
as fractions, then each product “simplifies” to something resembling ∂ f /dt. The variables x and y that disappear in
this simplification are often called intermediate variables: they are independent variables for the function f , but are
dependent variables for the variable t. Two terms appear on the right-hand side of the formula, and f is a function of two
variables. This pattern works with functions of more than two variables as well, as we see later in this section.
Example 4.26
408 Chapter 4 | Differentiation of Functions of Several Variables
Solution
a. To use the chain rule, we need four quantities— ∂ z/∂ x, ∂ z/∂ y, dx/dt, and dy/dt:
∂ z = 8x ∂ z = 6y
∂x ∂y
dx = cos t dy
= −sin t
dt dt
This answer has three variables in it. To reduce it to one variable, use the fact that
x(t) = sin t and y(t) = cos t. We obtain
dz = 8x cos t − 6y sin t
dt
= 8(sin t)cos t − 6(cos t)sin t
= 2 sin t cos t.
This derivative can also be calculated by first substituting x(t) and y(t) into f (x, y), then
differentiating with respect to t:
z = f (x, y)
= f ⎛⎝x(t), y(t)⎞⎠
= 4(x(t)) 2 + 3⎛⎝y(t)⎞⎠ 2
= 4sin 2 t + 3cos 2 t.
Then
dz = 2(4 sin t)(cos t) + 2(3 cos t)(−sin t)
dt
= 8 sin t cos t − 6 sin t cos t
= 2 sin t cos t,
which is the same solution. However, it may not always be this easy to differentiate in this form.
b. To use the chain rule, we again need four quantities— ∂ z/∂ x, ∂ z/dy, dx/dt, and dy/dt:
∂z = x ∂z = −y
∂x x − y2
2 ∂y x − y2
2
dx = 2e 2t dx = −e −t.
dt dt
To reduce this to one variable, we use the fact that x(t) = e 2t and y(t) = e −t. Therefore,
2t −t
dz = 2xe + ye
dt x2 − y2
2⎛⎝e 2t⎞⎠e 2t + ⎛⎝e −t⎞⎠e −t
=
e 4t − e −2t
4t −2t
= 2e + e .
4t −2t
e −e
To eliminate negative exponents, we multiply the top by e 2t and the bottom by e 4t:
dz = 2e 4t + e −2t · e 2t
dt e 4t − e −2t e 4t
6t
= 2e + 1
e 8t − e 2t
6t
= 2e + 1
e ⎝e − 1⎞⎠
2t ⎛ 6t
6t
= 2e + 1 .
e t e 6t − 1
Again, this derivative can also be calculated by first substituting x(t) and y(t) into f (x, y), then
differentiating with respect to t:
z = f (x, y)
= f ⎛⎝x(t), y(t)⎞⎠
= (x(t)) 2 − ⎛⎝y(t)⎞⎠ 2
= e 4t − e −2t
1/2
= ⎛⎝e 4t − e −2t⎞⎠ .
Then
410 Chapter 4 | Differentiation of Functions of Several Variables
4.23 Calculate dz/dt given the following functions. Express the final answer in terms of t.
It is often useful to create a visual representation of Equation 4.29 for the chain rule. This is called a tree diagram for the
chain rule for functions of one variable and it provides a way to remember the formula (Figure 4.34). This diagram can be
expanded for functions of more than one variable, as we shall see very shortly.
In this diagram, the leftmost corner corresponds to z = f (x, y). Since f has two independent variables, there are two lines
coming from this corner. The upper branch corresponds to the variable x and the lower branch corresponds to the variable
y. Since each of these variables is then dependent on one variable t, one branch then comes from x and one branch
comes from y. Last, each of the branches on the far right has a label that represents the path traveled to reach that branch.
The top branch is reached by following the x branch, then the t branch; therefore, it is labeled (∂ z/∂ x) × (dx/dt). The
bottom branch is similar: first the y branch, then the t branch. This branch is labeled ⎛⎝∂ z/∂ y⎞⎠ × ⎛⎝dy/dt⎞⎠. To get the formula
for dz/dt, add all the terms that appear on the rightmost side of the diagram. This gives us Equation 4.29.
In Chain Rule for Two Independent Variables, z = f (x, y) is a function of x and y, and both x = g(u, v) and
y = h(u, v) are functions of the independent variables u and v.
∂z = ∂z ∂x + ∂z ∂x (4.31)
∂u ∂x ∂u ∂y ∂u
and
∂ z = ∂ z ∂ x + ∂ z ∂ y. (4.32)
∂v ∂x ∂v ∂y ∂v
We can draw a tree diagram for each of these formulas as well as follows.
∂y
Figure 4.35 Tree diagram for ∂ z = ∂ z · ∂ x + ∂ z · and
∂u ∂x ∂u ∂y ∂u
∂ z = ∂ z · ∂ x + ∂ z · ∂ y.
∂v ∂x ∂v ∂y ∂v
To derive the formula for ∂ z/∂ u, start from the left side of the diagram, then follow only the branches that end with u
and add the terms that appear at the end of those branches. For the formula for ∂ z/∂ v, follow only the branches that end
with v and add the terms that appear at the end of those branches.
There is an important difference between these two chain rule theorems. In Chain Rule for One Independent Variable,
the left-hand side of the formula for the derivative is not a partial derivative, but in Chain Rule for Two Independent
Variables it is. The reason is that, in Chain Rule for One Independent Variable, z is ultimately a function of t
alone, whereas in Chain Rule for Two Independent Variables, z is a function of both u and v.
Example 4.27
Solution
To implement the chain rule for two variables, we need six partial derivatives—
∂ z/∂ x, ∂ z/∂ y, ∂ x/∂ u, ∂ x/∂ v, ∂ y/∂ u, and ∂ y/∂ v:
412 Chapter 4 | Differentiation of Functions of Several Variables
∂ z = 6x − 2y ∂ z = −2x + 2y
∂x ∂y
∂x = 3 ∂x = 2
∂u ∂v
∂y ∂y
=4 = −1.
∂u ∂v
To find ∂ z/∂ u, we use Equation 4.31:
∂z = ∂z · ∂x + ∂z · ∂y
∂u ∂x ∂u ∂y ∂u
= 3⎛⎝6x − 2y⎞⎠ + 4⎛⎝−2x + 2y⎞⎠
= 10x + 2y.
∂ z = 10x + 2y
∂u
= 10(3u + 2v) + 2(4u − v)
= 38u + 18v.
To find ∂ z/∂ v, we use Equation 4.32:
∂z = ∂z ∂x + ∂z ∂y
∂v ∂x ∂v ∂y ∂v
= 2⎛⎝6x − 2y⎞⎠ + (−1)⎛⎝−2x + 2y⎞⎠
= 14x − 6y.
∂ z = 14x − 6y
∂v
= 14(3u + 2v) − 6(4u − v)
= 18u + 34v.
∂ w = ∂ w ∂ x1 + ∂ w ∂ x2 + ⋯ + ∂ w ∂ xm (4.33)
∂ t j ∂ x1 ∂ t j ∂ x2 ∂ t j ∂ xm ∂ t j
In the next example we calculate the derivative of a function of three independent variables in which each of the three
variables is dependent on two other variables.
Example 4.28
w = f (x, y, z) = 3x 2 − 2xy + 4z 2
x = x(u, v) = e u sin v
y = y(u, v) = e u cos v
z = z(u, v) = e u.
Solution
The formulas for ∂ w/∂ u and ∂ w/∂ v are
∂w = ∂w · ∂x + ∂w · ∂y + ∂w · ∂z
∂u ∂x ∂u ∂y ∂u ∂z ∂u
∂w = ∂w · ∂ x + ∂w · ∂ y + ∂w · ∂z.
∂v ∂x ∂v ∂y ∂v ∂z ∂v
Therefore, there are nine different partial derivatives that need to be calculated and substituted. We need to
calculate each of them:
∂ w = 6x − 2y ∂ w = −2x ∂ w = 8z
∂x ∂y ∂z
∂ x = e u sin v ∂y ∂ z = eu
= e u cos v
∂u ∂u ∂u
∂ x = e u cos v ∂y ∂ z = 0.
= −e u sin v
∂v ∂v ∂v
Now, we substitute each of them into the first formula to calculate ∂ w/∂ u:
∂w = ∂w · ∂x + ∂w · ∂y + ∂w · ∂z
∂u ∂x ∂u ∂y ∂u ∂z ∂u
= ⎝6x − 2y⎞⎠e u sin v − 2xe u cos v + 8ze u,
⎛
then substitute x(u, v) = e u sin v, y(u, v) = e u cos v, and z(u, v) = e u into this equation:
∂w = ∂w · ∂x + ∂w · ∂y + ∂w · ∂z
∂v ∂x ∂v ∂y ∂v ∂z ∂v
= ⎝6x − 2y⎞⎠e u cos v − 2x(−e u sin v) + 8z(0),
⎛
then we substitute x(u, v) = e u sin v, y(u, v) = e u cos v, and z(u, v) = e u into this equation:
x + 2y − 4z
w = f (x, y, z) =
2x − y + 3z
x = x(u, v) = e 2u cos 3v
y = y(u, v) = e 2u sin 3v
z = z(u, v) = e 2u.
Example 4.29
and write out the formulas for the three partial derivatives of w.
Solution
Starting from the left, the function f has three independent variables: x, y, and z. Therefore, three branches
must be emanating from the first node. Each of these three branches also has three branches, for each of the
variables t, u, and v.
Figure 4.36 Tree diagram for a function of three variables, each of which is
a function of three independent variables.
and write out the formulas for the three partial derivatives of w.
Implicit Differentiation
Recall from Implicit Differentiation (http://cnx.org/content/m53585/latest/) that implicit differentiation provides a
method for finding dy/dx when y is defined implicitly as a function of x. The method involves differentiating both sides
of the equation defining the function with respect to x, then solving for dy/dx. Partial derivatives provide an alternative
to this method.
Consider the ellipse defined by the equation x 2 + 3y 2 + 4y − 4 = 0 as follows.
416 Chapter 4 | Differentiation of Functions of Several Variables
This equation implicitly defines y as a function of x. As such, we can find the derivative dy/dx using the method of
implicit differentiation:
d ⎛x 2 + 3y 2 + 4y − 4⎞ = d (0)
dx ⎝ ⎠
dx
dy dy
2x + 6y + 4 = 0
dx dx
⎛ ⎞ dy
⎝6y + 4⎠ = −2x
dx
dy
= − x .
dx 3y + 2
We can also define a function z = f (x, y) by using the left-hand side of the equation defining the ellipse. Then
f (x, y) = x 2 + 3y 2 + 4y − 4. The ellipse x 2 + 3y 2 + 4y − 4 = 0 can then be described by the equation f (x, y) = 0.
Using this function and the following theorem gives us an alternative approach to calculating dy/dx.
provided f y (x, y) ≠ 0.
∂z = − ∂ f /∂ x and ∂z = − ∂ f /∂ y (4.35)
∂x ∂ f /∂ z ∂y ∂ f /∂ z
as long as f z (x, y, z) ≠ 0.
Equation 4.34 is a direct consequence of Equation 4.31. In particular, if we assume that y is defined implicitly as a
function of x via the equation f (x, y) = 0, we can apply the chain rule to find dy/dx:
d f (x, y) = d (0)
dx dx
∂ f dx ∂ f dy
· + · = 0
∂ x dx ∂ y dx
∂ f ∂ f dy
+ · = 0.
∂ x ∂ y dx
Solving this equation for dy/dx gives Equation 4.34. Equation 4.35 can be derived in a similar fashion.
Let’s now return to the problem that we started before the previous theorem. Using Implicit Differentiation of a
Function of Two or More Variables and the function f (x, y) = x 2 + 3y 2 + 4y − 4, we obtain
∂f
= 2x
∂x
∂f
= 6y + 4.
∂y
which is the same result obtained by the earlier use of implicit differentiation.
Example 4.30
Solution
f x = 6x − 2y + 4
a. Set f (x, y) = 3x 2 − 2xy + y 2 + 4x − 6y − 11 = 0, then calculate f x and f y :
f y = −2x + 2y − 6.
The derivative is given by
dy ∂ f /∂ x 6x − 2y + 4 3x − y + 2
= − = − = .
dx ∂ f /∂ y −2x + 2y − 6 x−y+3
dy
dx | (x, y) = (2, 1) =
3(2) − 1 + 2 7
2−1+3
= .
4
To find the equation of the tangent line, we use the point-slope form (Figure 4.38):
y − y 0 = m(x − x 0)
y − 1 = 7 (x − 2)
4
y = 7x − 7 + 1
4 2
y = x − 5.
7
4 2
418 Chapter 4 | Differentiation of Functions of Several Variables
∂f
= 2xe y − yze x
∂x
∂f
= x 2 e y − ze x
∂y
∂f
= −ye x.
∂z
4.5 EXERCISES
For the following exercises, use the information provided 228. Let z = x 2 y, where x = t 2 and y = t 3. Find dz .
to solve the problem. dt
find
∂f
and express the answer in terms of r and θ. 234. xe y + ye x − 2x 2 y = 0
∂r
235. x 2/3 + y 2/3 = a 2/3
220. Suppose f (x, y) = x + y, u = e x sin y, x = t 2,
and y = πt, where x = r cos θ and y = r sin θ. Find 236. x cos(xy) + y cos x = 2
∂f
. xy
∂θ 237. e + ye y = 1
df
For the following exercises, find using the chain rule 238. x 2 y 3 + cos y = 0
dt
and direct substitution.
239. Find dz using the chain rule where
dt
221. f (x, y) = x 2 + y 2, x = t, y = t 2
z = 3x 2 y 3, x = t 4, and y = t 2.
222. f (x, y) = x 2 + y 2, y = t 2, x = t
240. Let z = 3 cos x − sin(xy), x = 1t , and y = 3t.
f (x, y) = xy, x = 1 − t, y = 1 + t
Find dz .
223.
dt
224. f (x, y) = xy , x = e t, y = 2e t
, x = t 1/3, and y = t 3. Find dz .
1 − xy
241. Let z = e
dt
225. f (x, y) = ln(x + y), x = e t, y = e t
242. Find dz by the chain rule where
4 dt
226. f (x, y) = x , x = t, y = t
z = cosh 2(xy), x = 1 t, and y = e t.
2
227. Let w(x, y, z) = x 2 + y 2 + z 2,
x = cos t, y = sin t, and z = e t. Express w as a 243. Let z = xy , x = 2 cos u, and y = 3 sin v. Find ∂ z
∂u
function of t and find dw directly. Then, find dw using and ∂ z .
dt dt ∂v
the chain rule.
420 Chapter 4 | Differentiation of Functions of Several Variables
z = rst 2.
In Partial Derivatives we introduced the partial derivative. A function z = f (x, y) has two partial derivatives: ∂ z/∂ x
and ∂ z/∂ y. These derivatives correspond to each of the independent variables and can be interpreted as instantaneous rates
of change (that is, as slopes of a tangent line). For example, ∂ z/∂ x represents the slope of a tangent line passing through
a given point on the surface defined by z = f (x, y), assuming the tangent line is parallel to the x-axis. Similarly, ∂ z/∂ y
represents the slope of the tangent line parallel to the y-axis. Now we consider the possibility of a tangent line parallel to
neither axis.
Directional Derivatives
We start with the graph of a surface defined by the equation z = f (x, y). Given a point (a, b) in the domain of
f , we choose a direction to travel from that point. We measure the direction using an angle θ, which is measured
counterclockwise in the x, y-plane, starting at zero from the positive x-axis (Figure 4.39). The distance we travel is h and
the direction we travel is given by the unit vector u = (cos θ)i + (sin θ)j. Therefore, the z-coordinate of the second point
on the graph is given by z = f (a + h cos θ, b + h sin θ).
Figure 4.39 Finding the directional derivative at a point on the graph of z = f (x, y). The
slope of the black arrow on the graph indicates the value of the directional derivative at that
point.
We can calculate the slope of the secant line by dividing the difference in z-values by the length of the line segment
connecting the two points in the domain. The length of the line segment is h. Therefore, the slope of the secant line is
f (a + h cos θ, b + h sin θ) − f (a, b)
m sec = .
h
To find the slope of the tangent line in the same direction, we take the limit as h approaches zero.
Definition
Suppose z = f (x, y) is a function of two variables with a domain of D. Let (a, b) ∈ D and define
u = cos θi + sin θj. Then the directional derivative of f in the direction of u is given by
Equation 4.36 provides a formal definition of the directional derivative that can be used in many cases to calculate a
directional derivative.
Example 4.31
Let θ = arccos(3/5). Find the directional derivative D u f (x, y) of f (x, y) = x 2 − xy + 3y 2 in the direction of
u = (cos θ)i + (sin θ)j. What is D u f (−1, 2) ?
Solution
First of all, since cos θ = 3/5 and θ is acute, this implies
⎛ ⎞
2
sin θ = 1 − ⎝3 ⎠ = 16 = 4 .
5 25 5
f ⎛⎝x + h cos θ, y + h sin θ⎞⎠ = (x + h cos θ) 2 − (x + h cos θ)⎛⎝y + h sin θ⎞⎠ + 3⎛⎝y + h sin θ⎞⎠ 2
= x 2 + 2xh cos θ + h 2 cos 2 θ − xy − xh sin θ − yh cos θ
−h 2 sin θ cos θ + 3y 2 + 6yh sin θ + 3h 2 sin 2 θ
⎛ ⎞ 2 3yh 12h 2
= x 2 + 2xh⎝3 ⎠ + 9h − xy − 4xh − − + 3y 2
5 25 5 5 25
⎛ ⎞ ⎛ ⎞
+6yh⎝4 ⎠ + 3h 2 ⎝16 ⎠
5 25
2 21yh
= x 2 − xy + 3y 2 + 2xh + 9h + .
5 5 5
We substitute this expression into Equation 4.36:
f (a + h cos θ, b + h sin θ) − f (a, b)
D u f (a, b) = lim
h→0 h
⎛ 2 2 2xh 9h 2 21yh ⎞ ⎛ 2 2⎞
⎝x − xy + 3y + 5 + 5 + 5 ⎠ − ⎝x − xy + 3y ⎠
= lim
h→0 h
2 21yh
+ 9h5 + 5
2xh
5
= lim
h→0 h
2x 9h 21y
= lim + +
h→0 5 5 5
2x + 21y
= .
5
To calculate D u f (−1, 2), we substitute x = −1 and y = 2 into this answer:
2(−1) + 21(2)
D u f (−1, 2) =
5
= −2 + 42
5
= 8.
(See the following figure.)
Figure 4.40 Finding the directional derivative in a given direction u at a given point on a surface.
The plane is tangent to the surface at the given point (−1, 2, 15).
Another approach to calculating a directional derivative involves partial derivatives, as outlined in the following theorem.
Proof
Equation 4.36 states that the directional derivative of f in the direction of u = cos θi + sin θj is given by
Let x = a + t cos θ and y = b + t sin θ, and define g(t) = f (x, y). Since f x and f y both exist, we can use the chain
rule for functions of two variables to calculate g′ (t):
∂ f dx ∂ f dy
g′ (t) = +
∂ x dt ∂ y dt
= f x (x, y)cos θ + f y (x, y)sin θ.
If t = 0, then x = x 0 and y = y 0, so
g(t) − g(0)
g′ (0) = lim t
t→0
f ⎛⎝x 0 + t cos θ, y 0 + t sin θ⎞⎠ − f (x 0, y 0)
= lim t .
t→0
Example 4.32
Let θ = arccos(3/5). Find the directional derivative D u f (x, y) of f (x, y) = x 2 − xy + 3y 2 in the direction of
u = (cos θ)i + (sin θ)j. What is D u f (−1, 2) ?
Solution
First, we must calculate the partial derivatives of f :
f x = 2x − y
f y = −x + 6y,
2(−1) + 21(2) −2 + 42
D u f (−1, 2) = = = 8.
5 5
This is the same answer obtained in Example 4.31.
4.28 Find the directional derivative D u f (x, y) of f (x, y) = 3x 2 y − 4xy 3 + 3y 2 − 4x in the direction of
⎛ ⎞ ⎛ ⎞
u = ⎝cos π ⎠i + ⎝sin π ⎠j using Equation 4.37. What is D u f (3, 4) ?
3 3
If the vector that is given for the direction of the derivative is not a unit vector, then it is only necessary to divide by the norm
of the vector. For example, if we wished to find the directional derivative of the function in Example 4.32 in the direction
of the vector 〈 −5, 12 〉 , we would first divide by its magnitude to get u. This gives us u = 〈 −(5/13), 12/13 〉 .
Then
D u f (x, y) = ∇ f (x, y) · u
= − 5 ⎛⎝2x − y⎞⎠ + 12 ⎛⎝−x + 6y⎞⎠
13 13
22
= − x + y. 17
13 13
Gradient
The right-hand side of Equation 4.37 is equal to f x (x, y)cos θ + f y (x, y)sin θ, which can be written as the dot
product of two vectors. Define the first vector as ∇ f (x, y) = f x (x, y)i + f y (x, y)j and the second vector as
u = (cos θ)i + (sin θ)j. Then the right-hand side of the equation can be written as the dot product of these two vectors:
The first vector in Equation 4.38 has a special name: the gradient of the function f . The symbol ∇ is called nabla and
the vector ∇ f is read “del f .”
Definition
Let z = f (x, y) be a function of x and y such that f x and f y exist. The vector ∇ f (x, y) is called the gradient of
f and is defined as
Example 4.33
Finding Gradients
a. f (x, y) = x 2 − xy + 3y 2
Solution
For both parts a. and b., we first calculate the partial derivatives f x and f y, then use Equation 4.39.
a.
f x (x, y) = 2x − y and f y (x, y) = −x + 6y, so
∇ f (x, y) = f x (x, y)i + f y (x, y)j
= ⎛⎝2x − y⎞⎠i + ⎛⎝−x + 6y⎞⎠j.
b.
f x (x, y) = 3 cos 3x cos 3y and f y (x, y) = −3 sin 3x sin 3y, so
∇ f (x, y) = f x (x, y)i + f y (x, y)j
= ⎛⎝3 cos 3x cos 3y⎞⎠i − ⎛⎝3 sin 3x sin 3y⎞⎠j.
428 Chapter 4 | Differentiation of Functions of Several Variables
The gradient has some important properties. We have already seen one formula that uses the gradient: the formula for
the directional derivative. Recall from The Dot Product that if the angle between two vectors a and b is φ, then
a · b = ‖ a ‖ ‖ b ‖ cos φ. Therefore, if the angle between ∇ f (x 0, y 0) and u = (cos θ)i + (sin θ)j is φ, we have
D u f (x 0, y 0) = ∇ f (x 0, y 0) · u = ‖ ∇ f (x 0, y 0) ‖ ‖ u ‖ cos φ = ‖ ∇ f (x 0, y 0) ‖ cos φ.
The ‖ u ‖ disappears because u is a unit vector. Therefore, the directional derivative is equal to the magnitude of the
gradient evaluated at (x 0, y 0) multiplied by cos φ. Recall that cos φ ranges from −1 to 1. If φ = 0, then cos φ = 1
and ∇ f (x 0, y 0) and u both point in the same direction. If φ = π, then cos φ = −1 and ∇ f (x 0, y 0) and u point in
opposite directions. In the first case, the value of D u f (x 0, y 0) is maximized; in the second case, the value of D u f (x 0, y 0)
is minimized. If ∇ f (x 0, y 0) = 0, then D u f (x 0, y 0) = ∇ f (x 0, y 0) · u = 0 for any vector u. These three cases are
outlined in the following theorem.
Figure 4.41 The gradient indicates the maximum and minimum values of the
directional derivative at a point.
Example 4.34
Find the direction for which the directional derivative of f (x, y) = 3x 2 − 4xy + 2y 2 at (−2, 3) is a maximum.
What is the maximum value?
Solution
The maximum value of the directional derivative occurs when ∇ f and the unit vector point in the same direction.
Therefore, we start by calculating ∇ f (x, y):
We need to find a unit vector that points in the same direction as ∇ f (−2, 3), so the next step is to divide
∇ f (−2, 3)
= −24 i + 20 j = −6 61 i + 5 61 j.
‖ ∇ f (−2, 3) ‖ 4 61 4 61 61 61
This is the unit vector that points in the same direction as ∇ f (−2, 3). To find the angle corresponding to this
430 Chapter 4 | Differentiation of Functions of Several Variables
The maximum value of the directional derivative at (−2, 3) is ‖ ∇ f (−2, 3) ‖ = 4 61 (see the following
figure).
4.30 Find the direction for which the directional derivative of g(x, y) = 4x − xy + 2y 2 at (−2, 3) is a
maximum. What is the maximum value?
Figure 4.43 shows a portion of the graph of the function f (x, y) = 3 + sin x sin y. Given a point (a, b) in the domain
of f , the maximum value of the gradient at that point is given by ‖ ∇ f (a, b) ‖ . This would equal the rate of greatest
ascent if the surface represented a topographical map. If we went in the opposite direction, it would be the rate of greatest
descent.
Figure 4.43 A typical surface in ℝ 3. Given a point on the surface, the directional derivative
can be calculated using the gradient.
When using a topographical map, the steepest slope is always in the direction where the contour lines are closest together
(see Figure 4.44). This is analogous to the contour map of a function, assuming the level curves are obtained for equally
spaced values throughout the range of that function.
But g′ (t) = 0 because g(t) = k for all t. Therefore, on the one hand,
Therefore,
∇ f (x, y) · 〈 x′ (t), y′ (t) 〉 = 0.
Thus, the dot product of these vectors is equal to zero, which implies they are orthogonal. However, the second vector is
tangent to the level curve, which implies the gradient must be normal to the level curve, which gives rise to the following
theorem.
We can use this theorem to find tangent and normal vectors to level curves of a function.
Example 4.35
For the function f (x, y) = 2x 2 − 3xy + 8y 2 + 2x − 4y + 4, find a tangent vector to the level curve at point
(−2, 1). Graph the level curve corresponding to f (x, y) = 18 and draw in ∇ f (−2, 1) and a tangent vector.
Solution
First, we must calculate ∇ f (x, y):
f x (x, y) = 4x − 3y + 2 and f y = −3x + 16y − 4 so ∇ f (x, y) = ⎛⎝4x − 3y + 2⎞⎠i + ⎛⎝−3x + 16y − 4⎞⎠j.
This vector is orthogonal to the curve at point (−2, 1). We can obtain a tangent vector by reversing the
components and multiplying either one by −1. Thus, for example, −18i − 9j is a tangent vector (see the
following graph).
4.31 For the function f (x, y) = x 2 − 2xy + 5y 2 + 3x − 2y + 4, find the tangent to the level curve at point
(1, 1). Draw the graph of the level curve corresponding to f (x, y) = 8 and draw ∇ f (1, 1) and a tangent
vector.
Definition
Let w = f (x, y, z) be a function of three variables such that f x, f y, and f z exist. The vector ∇ f (x, y, z) is called
the gradient of f and is defined as
Calculating the gradient of a function in three variables is very similar to calculating the gradient of a function in two
variables. First, we calculate the partial derivatives f x, f y, and f z, and then we use Equation 4.40.
Example 4.36
Solution
434 Chapter 4 | Differentiation of Functions of Several Variables
For both parts a. and b., we first calculate the partial derivatives f x, f y, and f z, then use Equation 4.40.
a.
f z (x, y, z) = 10x − 2y + 3z, f y (x, y, z) = −2x + 2y − 4z and f z (x, y, z) = 3x − 4y + 2z, so
∇ f (x, y, z) = f x (x, y, z)i + f y (x, y, z)j + f z (x, y, z)k
= ⎛⎝10x − 2y + 3z⎞⎠i + ⎛⎝−2x + 2y − 4z⎞⎠j + ⎛⎝−4x + 3y + 2z⎞⎠k.
b.
f x (x, y, z) = −2e −2z cos 2x cos 2y, f y (x, y, z) = −2e −2z sin 2x sin 2y and
f z (x, y, z) = −2e −2z sin 2x cos 2y, so
∇ f (x, y, z) = f x (x, y, z)i + f y (x, y, z)j + f z (x, y, z)k
⎛ −2z
= ⎝2e cos 2x cos 2y⎞⎠i + ⎛⎝−2e −2z⎞⎠j + ⎛⎝−2e −2z⎞⎠
4.32 x 2 − 3y 2 + z 2
Find the gradient ∇ f (x, y, z) of f (x, y, z) = .
2x + y − 4z
The directional derivative can also be generalized to functions of three variables. To determine a direction in three
dimensions, a vector with three components is needed. This vector is a unit vector, and the components of the unit vector are
called directional cosines. Given a three-dimensional unit vector u in standard form (i.e., the initial point is at the origin),
this vector forms three different angles with the positive x − , y − , and z-axes. Let’s call these angles α, β, and γ.
Then the directional cosines are given by cos α, cos β, and cos γ. These are the components of the unit vector u; since
u is a unit vector, it is true that cos 2 α + cos 2 β + cos 2 γ = 1.
Definition
Suppose w = f (x, y, z) is a function of three variables with a domain of D. Let (x 0, y 0, z 0) ∈ D and let
u = cos αi + cos βj + cos γk be a unit vector. Then, the directional derivative of f in the direction of u is given by
We can calculate the directional derivative of a function of three variables by using the gradient, leading to a formula that is
analogous to Equation 4.38.
The three angles α, β, and γ determine the unit vector u. In practice, we can use an arbitrary (nonunit) vector, then divide
by its magnitude to obtain a unit vector in the desired direction.
Example 4.37
Solution
First, we find the magnitude of v:
‖ v ‖ = (−1) 2 + (2) 2 = 3.
4.33 Calculate D u f (x, y, z) and D u f (0, −2, 5) in the direction of v = −3i + 12j − 4k for the function
f (x, y, z) = 3x 2 + xy − 2y 2 + 4yz − z 2 + 2xz.
436 Chapter 4 | Differentiation of Functions of Several Variables
4.6 EXERCISES
y
For the following exercises, find the directional derivative 275. f (x, y) = ,θ= −π
using the limit definition only. x + 2y 4
274. f (x, y) = x 2 + 2y 2, θ = π
6
289. [T] Use technology to sketch the level curve of 303. f (x, y, z) = xy + yz + xz = 3 at point (1, 1, 1)
f (x, y) = x 2 + 4y 2 that passes through P(−2, 0) and
draw the gradient vector at P. 304. f (x, y, z) = xyz = 6 at point (1, 2, 3)
For the following exercises, find the gradient vector at the 305. f (x, y, z) = xe y cos z − z = 1 at point (1, 0, 0)
indicated point.
For the following exercises, solve the problem.
290. f (x, y) = xy 2 − yx 2, P(−1, 1)
306. The temperature T in a metal sphere is inversely
y proportional to the distance from the center of the sphere
291. f (x, y) = xe − ln(x), P(−3, 0)
(the origin: (0, 0, 0)). The temperature at point (1, 2, 2)
292. f (x, y, z) = xy − ln(z), P(2, −2, 2) is 120°C.
a. Find the rate of change of the temperature at point
(1, 2, 2) in the direction toward point (2, 1, 3).
293. f (x, y, z) = x y 2 + z 2, P(−2, −1, −1)
b. Show that, at any point in the sphere, the direction
For the following exercises, find the derivative of the of greatest increase in temperature is given by a
function. vector that points toward the origin.
One of the most useful applications for derivatives of a function of one variable is the determination of maximum and/or
minimum values. This application is also important for functions of two or more variables, but as we have seen in earlier
sections of this chapter, the introduction of more independent variables leads to more possible outcomes for the calculations.
The main ideas of finding critical points and using derivative tests are still valid, but new wrinkles appear when assessing
the results.
Critical Points
For functions of a single variable, we defined critical points as the values of the function when the derivative equals zero
or does not exist. For functions of two or more variables, the concept is essentially the same, except for the fact that we are
now working with partial derivatives.
Definition
Let z = f (x, y) be a function of two variables that is defined on an open set containing the point (x 0, y 0). The point
(x 0, y 0) is called a critical point of a function of two variables f if one of the two following conditions holds:
1. f x (x 0, y 0) = f y (x 0, y 0) = 0
Example 4.38
b. g(x, y) = x 2 + 2xy − 4y 2 + 4x − 6y + 4
Solution
a. First, we calculate f x (x, y) and f y (x, y):
−1/2
f x (x, y) = 1 (−18x + 36)⎛⎝4y 2 − 9x 2 + 24y + 36x + 36⎞⎠
2
= −9x + 18
4y − 9x 2 + 24y + 36x + 36
2
−1/2
f y (x, y) = 1 ⎛⎝8y + 24⎞⎠⎛⎝4y 2 − 9x 2 + 24y + 36x + 36⎞⎠
2
4y + 12
= .
2 2
4y − 9x + 24y + 36x + 36
4y + 12
= 0.
2 2
4y − 9x + 24y + 36x + 36
This equation represents a hyperbola. We should also note that the domain of f consists of points
satisfying the inequality
4y 2 − 9x 2 + 24y + 36x + 36 ≥ 0.
Therefore, any points on the hyperbola are not only critical points, they are also on the boundary of the
domain. To put the hyperbola in standard form, we use the method of completing the square:
4y 2 − 9x 2 + 24y + 36x + 36 = 0
4y 2 − 9x 2 + 24y + 36x = −36
4y 2 + 24y − 9x 2 + 36x = −36
4⎛⎝y 2 + 6y⎞⎠ − 9⎛⎝x 2 − 4x⎞⎠ = −36
4⎛⎝y 2 + 6y + 9⎞⎠ − 9⎛⎝x 2 − 4x + 4⎞⎠ = −36 + 36 − 36
4⎛⎝y + 3⎞⎠ 2 − 9(x − 2) 2 = −36.
g x (x, y) = 2x + 2y + 4
g y (x, y) = 2x − 8y − 6.
Next, we set each of these expressions equal to zero, which gives a system of equations in x and y:
2x + 2y + 4 = 0
2x − 8y − 6 = 0.
Subtracting the second equation from the first gives 10y + 10 = 0, so y = −1. Substituting this into
the first equation gives 2x + 2(−1) + 4 = 0, so x = −1. Therefore (−1, −1) is a critical point of g
(Figure 4.46). There are no points in ℝ 2 that make either partial derivative not exist.
Figure 4.46 The function g(x, y) has a critical point at (−1, −1, 6).
4.34 Find the critical point of the function f (x, y) = x 3 + 2xy − 2x − 4y.
The main purpose for determining critical points is to locate relative maxima and minima, as in single-variable calculus.
When working with a function of one variable, the definition of a local extremum involves finding an interval around the
critical point such that the function value is either greater than or less than all the other function values in that interval.
When working with a function of two or more variables, we work with an open disk around the point.
Definition
Let z = f (x, y) be a function of two variables that is defined and continuous on an open set containing the point
(x 0, y 0). Then f has a local maximum at (x 0, y 0) if
f (x 0, y 0) ≥ f (x, y)
for all points (x, y) within some disk centered at (x 0, y 0). The number f (x 0, y 0) is called a local maximum value.
If the preceding inequality holds for every point (x, y) in the domain of f , then f has a global maximum (also
called an absolute maximum) at (x 0, y 0).
f (x 0, y 0) ≤ f (x, y)
for all points (x, y) within some disk centered at (x 0, y 0). The number f (x 0, y 0) is called a local minimum value. If
the preceding inequality holds for every point (x, y) in the domain of f , then f has a global minimum (also called
an absolute minimum) at (x 0, y 0).
If f (x 0, y 0) is either a local maximum or local minimum value, then it is called a local extremum (see the following
figure).
(x 0, y 0). Suppose f x and f y each exists at (x 0, y 0). If f has a local extremum at (x 0, y 0), then (x 0, y 0) is a
critical point of f .
In this graph, the origin is a saddle point. This is because the first partial derivatives of f (x, y) = x 2 − y 2 are both equal to
zero at this point, but it is neither a maximum nor a minimum for the function. Furthermore the vertical trace corresponding
to y = 0 is z = x 2 (a parabola opening upward), but the vertical trace corresponding to x = 0 is z = −y 2 (a parabola
opening downward). Therefore, it is both a global maximum for one trace and a global minimum for another.
Definition
Given the function z = f (x, y), the point ⎛⎝x 0, y 0, f (x 0, y 0)⎞⎠ is a saddle point if both f 0 (x 0, y 0) = 0 and
f y (x 0, y 0) = 0, but f does not have a local extremum at (x 0, y 0).
The second derivative test for a function of one variable provides a method for determining whether an extremum occurs at
a critical point of a function. When extending this result to a function of two variables, an issue arises related to the fact that
there are, in fact, four different second-order partial derivatives, although equality of mixed partials reduces this to three.
The second derivative test for a function of two variables, stated in the following theorem, uses a discriminant D that
replaces f ″(x 0) in the second derivative test for a function of one variable.
D = f xx (x 0, y 0) f yy (x 0, y 0) − ⎛⎝ f xy (x 0, y 0)⎞⎠ 2. (4.43)
Figure 4.49 The second derivative test can often determine whether a function of two variables has a local minima (a), a
local maxima (b), or a saddle point (c).
To apply the second derivative test, it is necessary that we first find the critical points of the function. There are several
steps involved in the entire procedure, which are outlined in a problem-solving strategy.
Problem-Solving Strategy: Using the Second Derivative Test for Functions of Two Variables
Let z = f (x, y) be a function of two variables for which the first- and second-order partial derivatives are continuous
on some disk containing the point (x 0, y 0). To apply the second derivative test to find local extrema, use the following
steps:
1. Determine the critical points (x 0, y 0) of the function f where f x (x 0, y 0) = f y (x 0, y 0) = 0. Discard any
points where at least one of the partial derivatives does not exist.
2. Calculate the discriminant D = f xx (x 0, y 0) f yy (x 0, y 0) − ⎛⎝ f xy (x 0, y 0)⎞⎠ 2 for each critical point of f .
444 Chapter 4 | Differentiation of Functions of Several Variables
3. Apply Second Derivative Test to determine whether each critical point is a local maximum, local
minimum, or saddle point, or whether the theorem is inconclusive.
Example 4.39
Find the critical points for each of the following functions, and use the second derivative test to find the local
extrema:
a. f (x, y) = 4x 2 + 9y 2 + 8x − 36y + 24
b. g(x, y) = 1 x 3 + y 2 + 2xy − 6x − 3y + 4
3
Solution
a. Step 1 of the problem-solving strategy involves finding the critical points of f . To do this, we first
calculate f x (x, y) and f y (x, y), then set each of them equal to zero:
f x (x, y) = 8x + 8
f y (x, y) = 18y − 36.
f xx (x, y) = 8
f xy (x, y) = 0
f yy (x, y) = 18.
Figure 4.50 The function f (x, y) has a local minimum at (−1, 2, −16).
b. For step 1, we first calculate g x (x, y) and g y (x, y), then set each of them equal to zero:
g x (x, y) = x 2 + 2y − 6
g y (x, y) = 2y + 2x − 3.
To solve this system, first solve the second equation for y. This gives y = 3 − 2x . Substituting this into
2
the first equation gives
x 2 + 3 − 2x − 6 = 0
x 2 − 2x − 3 = 0
(x − 3)(x + 1) = 0.
Therefore, x = −1 or x = 3. Substituting these values into the equation y = 3 − 2x yields the critical
2
⎛ ⎞ ⎛ ⎞
points ⎝−1, 5 ⎠ and ⎝3, − 3 ⎠.
2 2
Step 2 involves calculating the second partial derivatives of g:
g xx (x, y) = 2x
g xy (x, y) = 2
g yy (x, y) = 2.
In step 3, we note that, applying Fermat’s Theorem for Functions of Two Variables to point
⎛ 5⎞ ⎛ 5⎞
⎝−1, 2 ⎠ leads to case 3, which means that ⎝−1, 2 ⎠ is a saddle point. Applying the theorem to point
⎛ 3⎞ ⎛ 3⎞
⎝3, − 2 ⎠ leads to case 1, which means that ⎝3, − 2 ⎠ corresponds to a local minimum as shown in the
following figure.
Figure 4.51 The function g(x, y) has a local minimum and a saddle point.
4.35 Use the second derivative to find the local extrema of the function
f (x, y) = x 3 + 2xy − 6x − 4y 2.
Now that we know any continuous function f defined on a closed, bounded set attains its extreme values, we need to know
how to find them.
The proof of this theorem is a direct consequence of the extreme value theorem and Fermat’s theorem. In particular, if
either extremum is not located on the boundary of D, then it is located at an interior point of D. But an interior point
(x 0, y 0) of D that’s an absolute extremum is also a local extremum; hence, (x 0, y 0) is a critical point of f by Fermat’s
theorem. Therefore the only possible values for the global extrema of f on D are the extreme values of f on the interior
or boundary of D.
3. Determine the maximum and minimum values of f on the boundary of its domain.
4. The maximum and minimum values of f will occur at one of the values obtained in steps 2 and 3.
Finding the maximum and minimum values of f on the boundary of D can be challenging. If the boundary is a rectangle
or set of straight lines, then it is possible to parameterize the line segments and determine the maxima on each of these
segments, as seen in Example 4.40. The same approach can be used for other shapes such as circles and ellipses.
If the boundary of the set D is a more complicated curve defined by a function g(x, y) = c for some constant c, and
the first-order partial derivatives of g exist, then the method of Lagrange multipliers can prove useful for determining the
extrema of f on the boundary. The method of Lagrange multipliers is introduced in Lagrange Multipliers.
Example 4.40
Use the problem-solving strategy for finding absolute extrema of a function to determine the absolute extrema of
each of the following functions:
a. f (x, y) = x 2 − 2xy + 4y 2 − 4x − 2y + 24 on the domain defined by 0 ≤ x ≤ 4 and 0 ≤ y ≤ 2
Solution
a. Using the problem-solving strategy, step 1 involves finding the critical points of f on its domain.
Therefore, we first calculate f x (x, y) and f y (x, y), then set them each equal to zero:
f x (x, y) = 2x − 2y − 4
f y (x, y) = −2x + 8y − 2.
The solution to this system is x = 3 and y = 1. Therefore (3, 1) is a critical point of f . Calculating
f (3, 1) gives f (3, 1) = 17.
The next step involves finding the extrema of f on the boundary of its domain. The boundary of its
domain consists of four line segments as shown in the following graph:
L 1 is the line segment connecting (0, 0) and (4, 0), and it can be parameterized by the equations
x(t) = t, y(t) = 0 for 0 ≤ t ≤ 4. Define g(t) = f ⎛⎝x(t), y(t)⎞⎠. This gives g(t) = t 2 − 4t + 24.
Differentiating g leads to g′ (t) = 2t − 4. Therefore, g has a critical value at t = 2, which corresponds
to the point (2, 0). Calculating f (2, 0) gives the z-value 20.
L 2 is the line segment connecting (4, 0) and (4, 2), and it can be parameterized by the equations
x(t) = 4, y(t) = t for 0 ≤ t ≤ 2. Again, define g(t) = f ⎛⎝x(t), y(t)⎞⎠. This gives g(t) = 4t 2 − 10t + 24.
⎛ ⎞
Then, g′ (t) = 8t − 10. g has a critical value at t = 5 , which corresponds to the point ⎝0, 5 ⎠.
4 4
⎛ ⎞
Calculating f ⎝0, 5 ⎠ gives the z-value 27.75.
4
L 3 is the line segment connecting (0, 2) and (4, 2), and it can be parameterized by the equations
x(t) = t, y(t) = 2 for 0 ≤ t ≤ 4. Again, define g(t) = f ⎛⎝x(t), y(t)⎞⎠. This gives g(t) = t 2 − 8t + 36.
The critical value corresponds to the point (4, 2). So, calculating f (4, 2) gives the z-value 20.
L 4 is the line segment connecting (0, 0) and (0, 2), and it can be parameterized by the equations
x(t) = 0, y(t) = t for 0 ≤ t ≤ 2. This time, g(t) = 4t 2 − 2t + 24 and the critical value t = 1
4
⎛ 1⎞ ⎛ 1⎞
correspond to the point ⎝0, ⎠. Calculating f ⎝0, ⎠ gives the z-value 23.75.
4 4
We also need to find the values of f (x, y) at the corners of its domain. These corners are located at
(0, 0), (4, 0), (4, 2) and (0, 2):
The absolute maximum value is 36, which occurs at (0, 2), and the global minimum value is 20,
which occurs at both (4, 2) and (2, 0) as shown in the following figure.
Figure 4.53 The function f (x, y) has two global minima and one global maximum over its
domain.
450 Chapter 4 | Differentiation of Functions of Several Variables
b. Using the problem-solving strategy, step 1 involves finding the critical points of g on its domain.
Therefore, we first calculate g x (x, y) and g y (x, y), then set them each equal to zero:
g x (x, y) = 2x + 4
g y (x, y) = 2y − 6.
The next step involves finding the extrema of g on the boundary of its domain. The boundary of its
domain consists of a circle of radius 4 centered at the origin as shown in the following graph.
The boundary of the domain of g can be parameterized using the functions x(t) = 4 cos t, y(t) = 4 sin t
for 0 ≤ t ≤ 2π. Define h(t) = g⎛⎝x(t), y(t)⎞⎠:
⎛ ⎞
This equation has two solutions over the interval 0 ≤ t ≤ 2π. One is t = π − arctan⎝3 ⎠ and the other is
2
⎛ ⎞
t = 2π − arctan⎝3 ⎠. For the first angle,
2
⎛ ⎛ ⎞⎞ ⎛ ⎛ ⎞⎞
sin t = sin⎝π − arctan⎝3 ⎠⎠ = sin⎝arctan⎝3 ⎠⎠ = 3 13
2 2 13
⎛ ⎛3 ⎞⎞ ⎛ ⎛3 ⎞⎞
cos t = cos⎝π − arctan⎝ ⎠⎠ = −cos⎝arctan⎝ ⎠⎠ = − 2 13 .
2 2 13
⎛ ⎞
Therefore, x(t) = 4 cos t = − 8 13 and y(t) = 4 sin t = 12 13 , so ⎝− 8 13 , 12 13 ⎠ is a critical
13 13 13 13
point on the boundary and
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
2 2
g⎝− 8 13 , 12 13 ⎠ = ⎝− 8 13 ⎠ + ⎝12 13 ⎠ + 4⎝− 8 13 ⎠ − 6⎝12 13 ⎠
13 13 13 13 13 13
= 144 + 64 − 32 13 − 72 13
13 13 13 13
= 208 − 104 13 ≈ −12.844.
13
⎛ ⎞
Therefore, x(t) = 4 cos t = 8 13 and y(t) = 4 sin t = − 12 13 , so ⎝8 13 , − 12 13 ⎠ is a critical
13 13 13 13
point on the boundary and
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
2 2
g⎝8 13 , − 12 13 ⎠ = ⎝8 13 ⎠ + ⎝− 12 13 ⎠ + 4⎝8 13 ⎠ − 6⎝− 12 13 ⎠
13 13 13 13 13 13
= 144 + 64 + 32 13 + 72 13
13 13 13 13
= 208 + 104 13 ≈ 44.844.
13
The absolute minimum of g is −13, which is attained at the point (−2, 3), which is an interior point
of D. The absolute maximum of g is approximately equal to 44.844, which is attained at the boundary
452 Chapter 4 | Differentiation of Functions of Several Variables
⎛ ⎞
point ⎝8 13 , − 12 13 ⎠. These are the absolute extrema of g on D as shown in the following figure.
13 13
Figure 4.55 The function f (x, y) has a local minimum and a local maximum.
4.36 Use the problem-solving strategy for finding absolute extrema of a function to find the absolute extrema
of the function
f (x, y) = 4x 2 − 2xy + 6y 2 − 8x + 2y + 3
Example 4.41
Pro- T company has developed a profit model that depends on the number x of golf balls sold per month
(measured in thousands), and the number of hours per month of advertising y, according to the function
z = f (x, y) = 48x + 96y − x 2 − 2xy − 9y 2,
where z is measured in thousands of dollars. The maximum number of golf balls that can be produced and sold
is 50,000, and the maximum number of hours of advertising that can be purchased is 25. Find the values of x
and y that maximize profit, and find the maximum profit.
Solution
Using the problem-solving strategy, step 1 involves finding the critical points of f on its domain. Therefore, we
first calculate f x (x, y) and f y (x, y), then set them each equal to zero:
f x (x, y) = 48 − 2x − 2y
f y (x, y) = 96 − 2x − 18y.
The solution to this system is x = 21 and y = 3. Therefore (21, 3) is a critical point of f . Calculating
f (21, 3) gives f (21, 3) = 48(21) + 96(3) − 21 2 − 2(21)(3) − 9(3) 2 = 648.
L 1 is the line segment connecting (0, 0) and (50, 0), and it can be parameterized by the equations
x(t) = t, y(t) = 0 for 0 ≤ t ≤ 50. We then define g(t) = f x(t), y(t)⎞⎠:
⎛
⎝
L 2 is the line segment connecting and (50, 25), and it can be parameterized by the equations
x(t) = 50, y(t) = t for 0 ≤ t ≤ 25. Once again, we define g(t) = f ⎛⎝x(t), y(t)⎞⎠:
L 3 is the line segment connecting (0, 25) and (50, 25), and it can be parameterized by the equations
x(t) = t, y(t) = 25 for 0 ≤ t ≤ 50. We define g(t) = f x(t), y(t)⎞⎠: ⎛
⎝
The maximum critical value is 648, which occurs at (21, 3). Therefore, a maximum profit of $648,000 is
realized when 21,000 golf balls are sold and 3 hours of advertising are purchased per month as shown in the
following figure.
Figure 4.58 The profit function f (x, y) has a maximum at (21, 3, 648).
456 Chapter 4 | Differentiation of Functions of Several Variables
4.7 EXERCISES
For the following exercises, find all critical points. 329. f (x, y) = y 2 + xy + 3y + 2x + 3
310. f (x, y) = 1 + x 2 + y 2
330. f (x, y) = x 2 + xy + y 2 − 3x
322. f (x, y) = 8xy(x + y) + 7 Find the absolute extrema of the given function on the
indicated closed and bounded set R.
323. f (x, y) = x 2 + 4xy + y 2 344. f (x, y) = xy − x − 3y; R is the triangular region
with vertices (0, 0), (0, 4), and (5, 0).
324. f (x, y) = x 3 + y 3 − 300x − 75y − 3
345. Find the absolute maximum and minimum values
325. f (x, y) = 9 − x 4 y 4 of f (x, y) = x 2 + y 2 − 2y + 1 on the region
|
⎧ ⎫
R = ⎨⎩(x, y) x 2 + y 2 ≤ 4⎬⎭.
326. f (x, y) = 7x 2 y + 9xy 2
2 2
328. f (x, y) = 3x + 2xy + y
−2y
347. f (x, y) = on
x + y2 + 1
2
⎧ ⎫
R = ⎨⎩(x, y): x 2 + y 2 ≤ 4⎬⎭
Solving optimization problems for functions of two or more variables can be similar to solving such problems in single-
variable calculus. However, techniques for dealing with multiple variables allow us to solve more varied optimization
problems for which we need to deal with additional conditions or constraints. In this section, we examine one of the more
common and useful methods for solving optimization problems with constraints.
Lagrange Multipliers
Example 4.41 was an applied situation involving maximizing a profit function, subject to certain constraints. In that
example, the constraints involved a maximum number of golf balls that could be produced and sold in 1 month (x),
and a maximum number of advertising hours that could be purchased per month (y). Suppose these were combined
into a budgetary constraint, such as 20x + 4y ≤ 216, that took into account the cost of producing the golf balls
and the number of advertising hours purchased per month. The goal is, still, be to maximize profit, but now there
is a different type of constraint on the values of x and y. This constraint, when combined with the profit function
f (x, y) = 48x + 96y − x 2 − 2xy − 9y 2, is an example of an optimization problem, and the function f (x, y) is called
the objective function. A graph of various level curves of the function f (x, y) follows.
In Figure 4.59, the value c represents different profit levels (i.e., values of the function f ). As the value of c increases,
the curve shifts to the right. Since our goal is to maximize profit, we want to choose a curve as far to the right as possible.
If there was no restriction on the number of golf balls the company could produce, or the number of units of advertising
available, then we could produce as many golf balls as we want, and advertise as much as we want, and there would be
not be a maximum profit for the company. Unfortunately, we have a budgetary constraint that is modeled by the inequality
20x + 4y ≤ 216. To see how this constraint interacts with the profit function, Figure 4.60 shows the graph of the line
20x + 4y = 216 superimposed on the previous graph.
As mentioned previously, the maximum profit occurs when the level curve is as far to the right as possible. However, the
level of production corresponding to this maximum profit must also satisfy the budgetary constraint, so the point at which
this profit occurs must also lie on (or to the left of) the red line in Figure 4.60. Inspection of this graph reveals that this
point exists where the line is tangent to the level curve of f . Trial and error reveals that this profit level seems to be
around 395, when x and y are both just less than 5. We return to the solution of this problem later in this section.
From a theoretical standpoint, at the point where the profit curve is tangent to the constraint line, the gradient of both of
the functions evaluated at that point must point in the same (or opposite) direction. Recall that the gradient of a function
of more than one variable is a vector. If two vectors point in the same (or opposite) directions, then one must be a constant
multiple of the other. This idea is the basis of the method of Lagrange multipliers.
Proof
Assume that a constrained extremum occurs at the point (x 0, y 0). Furthermore, we assume that the equation g(x, y) = 0
can be smoothly parameterized as
x = x(s) and y = y(s)
where s is an arc length parameter with reference point (x 0, y 0) at s = 0. Therefore, the quantity z = f ⎛⎝x(s), y(s)⎞⎠ has a
relative maximum or relative minimum at s = 0, and this implies that dz = 0 at that point. From the chain rule,
ds
dz = ∂ f · ∂ x + ∂ f · ∂ y = ⎛∂ f i + j ∂ x ⎞ + ⎛∂ f · ∂ y ⎞ = 0,
ds ∂ x ∂ s ∂ y ∂ s ⎝ ∂ x ∂ s ⎠ ⎝∂ y ∂ s ⎠
where the derivatives are all evaluated at s = 0. However, the first factor in the dot product is the gradient of f , and the
second factor is the unit tangent vector T(0) to the constraint curve. Since the point (x 0, y 0) corresponds to s = 0, it
follows from this equation that
460 Chapter 4 | Differentiation of Functions of Several Variables
∇ f (x 0, y 0) · T(0) = 0,
which implies that the gradient is either 0 or is normal to the constraint curve at a constrained relative extremum. However,
the constraint curve g(x, y) = 0 is a level curve for the function g(x, y) so that if ∇ g(x 0, y 0) ≠ 0 then ∇ g(x 0, y 0) is
normal to this curve at (x 0, y 0) It follows, then, that there is some scalar λ such that
∇ f (x 0, y 0) = λ∇ g(x 0, y 0)
□
To apply Method of Lagrange Multipliers: One Constraint to an optimization problem similar to that for the golf
ball manufacturer, we need a problem-solving strategy.
4. The largest of the values of f at the solutions found in step 3 maximizes f ; the smallest of those values
minimizes f .
Example 4.42
Use the method of Lagrange multipliers to find the minimum value of f (x, y) = x 2 + 4y 2 − 2x + 8y subject to
the constraint x + 2y = 7.
Solution
Let’s follow the problem-solving strategy:
1. The optimization function is f (x, y) = x 2 + 4y 2 − 2x + 8y. To determine the constraint function, we
must first subtract 7 from both sides of the constraint. This gives x + 2y − 7 = 0. The constraint
function is equal to the left-hand side, so g(x, y) = x + 2y − 7. The problem asks us to solve for the
minimum value of f , subject to the constraint (see the following graph).
2x 0 − 2 = λ
8y 0 + 8 = 2λ.
The equation g(x 0, y 0) = 0 becomes x 0 + 2y 0 − 7 = 0. Therefore, the system of equations that needs
to be solved is
2x 0 − 2 = λ
8y 0 + 8 = 2λ
x 0 + 2y 0 − 7 = 0.
3. This is a linear system of three equations in three variables. We start by solving the second equation for
λ and substituting it into the first equation. This gives λ = 4y 0 + 4, so substituting this into the first
equation gives
2x 0 − 2 = 4y 0 + 4.
Solving this equation for x 0 gives x 0 = 2y 0 + 3. We then substitute this into the third equation:
462 Chapter 4 | Differentiation of Functions of Several Variables
⎛
2y 0 + 3⎞⎠ + 2y 0 − 7 = 0
⎝
4y 0 − 4 = 0
y 0 = 1.
4.37 Use the method of Lagrange multipliers to find the maximum value of
f (x, y) = 9x 2 + 36xy − 4y 2 − 18x − 8y subject to the constraint 3x + 4y = 32.
Let’s now return to the problem posed at the beginning of the section.
Example 4.43
The golf ball manufacturer, Pro-T, has developed a profit model that depends on the number x of golf balls sold
per month (measured in thousands), and the number of hours per month of advertising y, according to the function
z = f (x, y) = 48x + 96y − x 2 − 2xy − 9y 2,
where z is measured in thousands of dollars. The budgetary constraint function relating the cost of the production
of thousands golf balls and advertising units is given by 20x + 4y = 216. Find the values of x and y that
maximize profit, and find the maximum profit.
Solution
Again, we follow the problem-solving strategy:
1. The optimization function is f (x, y) = 48x + 96y − x 2 − 2xy − 9y 2. To determine the constraint
function, we first subtract 216 from both sides of the constraint, then divide both sides by 4, which gives
5x + y − 54 = 0. The constraint function is equal to the left-hand side, so g(x, y) = 5x + y − 54. The
problem asks us to solve for the maximum value of f , subject to this constraint.
48 − 2x 0 − 2y 0 = 5λ
96 − 2x 0 − 18y 0 = λ.
The equation g(x 0, y 0) = 0 becomes 5x 0 + y 0 − 54 = 0. Therefore, the system of equations that needs
to be solved is
48 − 2x 0 − 2y 0 = 5λ
96 − 2x 0 − 18y 0 = λ
5x 0 + y 0 − 54 = 0.
3. We use the left-hand side of the second equation to replace λ in the first equation:
4. We then substitute (10, 4) into f (x, y) = 48x + 96y − x 2 − 2xy − 9y 2, which gives
Therefore the maximum profit that can be attained, subject to budgetary constraints, is $540,000 with
a production level of 10,000 golf balls and 4 hours of advertising bought per month. Let’s check to
make sure this truly is a maximum. The endpoints of the line that defines the constraint are (10.8, 0) and
(0, 54) Let’s evaluate f at both of these points:
The second value represents a loss, since no golf balls are produced. Neither of these values exceed 540,
so it seems that our extremum is a maximum value of f .
464 Chapter 4 | Differentiation of Functions of Several Variables
4.38 A company has determined that its production level is given by the Cobb-Douglas function
f (x, y) = 2.5x 0.45 y 0.55 where x represents the total number of labor hours in 1 year and y represents the total
capital input for the company. Suppose 1 unit of labor costs $40 and 1 unit of capital costs $50. Use the
method of Lagrange multipliers to find the maximum value of f (x, y) = 2.5x 0.45 y 0.55 subject to a budgetary
constraint of $500,000 per year.
In the case of an optimization function with three variables and a single constraint function, it is possible to use the method
of Lagrange multipliers to solve an optimization problem as well. An example of an optimization function with three
variables could be the Cobb-Douglas function in the previous example: f (x, y, z) = x 0.2 y 0.4 z 0.4, where x represents
the cost of labor, y represents capital input, and z represents the cost of advertising. The method is the same as for the
method with a function of two variables; the equations to be solved are
∇ f (x, y, z) = λ∇ g(x, y, z)
g(x, y, z) = 0.
Example 4.44
Solution
1. The optimization function is f (x, y, z) = x 2 + y 2 + z 2. To determine the constraint function, we
subtract 1 from each side of the constraint: x + y + z − 1 = 0 which gives the constraint function as
g(x, y, z) = x + y + z − 1.
⎛ ⎞
4. Then, we evaluate f at the point ⎝1 , 1 , 1 ⎠:
3 3 3
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
2 2 2
f ⎝1 , 1 , 1 ⎠ = ⎝1 ⎠ + ⎝1 ⎠ + ⎝1 ⎠ = 3 = 1 .
3 3 3 3 3 3 9 3
Therefore, an extremum of the function is 1 . To verify it is a minimum, choose other points that satisfy
3
the constraint and calculate f at that point. For example,
f (1, 0, 0) = 1 2 + 0 2 + 0 2 = 1
f (0, −2, 3) = 0 2 + + (−2) 2 + 3 2 = 13.
Both of these values are greater than 1 , leading us to believe the extremum is a minimum.
3
4.39 Use the method of Lagrange multipliers to find the minimum value of the function
f (x, y, z) = x + y + z
There are two Lagrange multipliers, λ 1 and λ 2, and the system of equations becomes
∇ f (x 0, y 0, z 0) = λ 1∇ g(x 0, y 0, z 0) + λ 2∇ h(x 0, y 0, z 0)
g(x 0, y 0, z 0) = 0
h(x 0, y 0, z 0) = 0.
Example 4.45
Solution
466 Chapter 4 | Differentiation of Functions of Several Variables
2x 0 = 2λ 1 x 0 + λ 2
2y 0 = 2λ 1 y 0 + λ 2
2z 0 = −2λ 1 z 0 − λ 2.
The two equations that arise from the constraints are z 0 2 = x 0 2 + y 0 2 and x 0 + y 0 − z 0 + 1 = 0.
Combining these equations with the previous three equations gives
2x 0 = 2λ 1 x 0 + λ 2
2y 0 = 2λ 1 y 0 + λ 2
2z 0 = −2λ 1 z 0 − λ 2
z0 2 = x0 2 + y0 2
x 0 + y 0 − z 0 + 1 = 0.
3. The first three equations contain the variable λ 2. Solving the third equation for λ 2 and replacing into
the first and second equations reduces the number of equations to four:
2x 0 = 2λ 1 x 0 − 2λ 1 z 0 − 2z 0
2y 0 = 2λ 1 y 0 − 2λ 1 z 0 − 2z 0
z0 2 = x0 2 + y0 2
x 0 + y 0 − z 0 + 1 = 0.
x +z
Next, we solve the first and second equation for λ 1. The first equation gives λ 1 = x 0 − z 0 , the second
0 0
y +z
equation gives λ 1 = y 0 − z 0 . We set the right-hand side of each equation equal to each other and cross-
0 0
multiply:
x0 + z0 y0 + z0
x0 − z0 = y0 − z0
(x 0 + z 0)(y 0 − z 0) = (x 0 − z 0)(y 0 + z 0)
x 0 y 0 − x 0 z 0 + y 0 z 0 − z 0 2 = x 0 y 0 + x 0 z 0 − y 0 z 0 − z 0 2.
2y 0 z 0 − 2x 0 z 0 = 0
2z 0 (y 0 − x 0) = 0.
We substitute the first equation into the second and third equations:
z0 2 = x0 2 + x0 2
x 0 + x 0 − z 0 + 1 = 0.
Then, we solve the second equation for z 0, which gives z 0 = 2x 0 + 1. We then substitute this into the
first equation,
z 0 2 = 2x 0 2
⎛
⎝2x 0 + 1⎞⎠ 2 = 2x 0 2
4x 0 2 + 4x 0 + 1 = 2x 0 2
2x 0 2 + 4x 0 + 1 = 0,
−4 ± 4 2 − 4(2)(1) −4 ± 8 −4 ± 2 2
x0 = = = = −1 ± 2 .
2(2) 4 4 2
⎛ ⎞
z 0 = 2x 0 + 1 = 2⎝−1 ± 2 ⎠ + 1 = −2 + 1 ± 2 = −1 ± 2.
2
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
2 2
f ⎝−1 + 2 , −1 + 2 , −1 + 2⎠ = ⎝−1 + 2 ⎠ + ⎝−1 + 2 ⎠ + ⎛⎝−1 + 2⎞⎠ 2
2 2 2 2
⎛ ⎞ ⎛ ⎞
= ⎝1 − 2 + 1 ⎠ + ⎝1 − 2 + 1 ⎠ + ⎛⎝1 − 2 2 + 2⎞⎠
2 2
= 6 − 4 2.
⎛ ⎞
Then, we substitute ⎝−1 − 2 , −1 − 2 , −1 − 2⎠ into f (x, y, z) = x 2 + y 2 + z 2, which gives
2 2
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
2 2
f ⎝−1 − 2 , −1 − 2 , −1 − 2⎠ = ⎝−1 − 2 ⎠ + ⎝−1 − 2 ⎠ + (−1 − 2) 2
2 2 2 2
⎛ ⎞ ⎛ ⎞
= ⎝1 + 2 + 1 ⎠ + ⎝1 + 2 + 1 ⎠ + ⎛⎝1 + 2 2 + 2⎞⎠
2 2
= 6 + 4 2.
6 + 4 2 is the maximum value and 6 − 4 2 is the minimum value of f (x, y, z), subject to the given
constraints.
4.40 Use the method of Lagrange multipliers to find the minimum value of the function
f (x, y, z) = x 2 + y 2 + z 2
4.8 EXERCISES
For the following exercises, use the method of Lagrange 376. Maximize f (x, y) = 6 − x 2 − y 2, x + y − 2 = 0.
multipliers to find the maximum and minimum values of
the function subject to the given constraints.
377. Minimize
2 2 2
358. f (x, y) = x 2 y; x 2 + 2y 2 = 6 f (x, y, z) = x + y + z , x + y + z = 1.
362. f (x, y, z) = x 2 + y 2 + z 2, x 4 + y 4 + z 4 = 1 For the next group of exercises, use the method of Lagrange
multipliers to solve the following applied problems.
363. f (x, y, z) = yz + xy, xy = 1, y 2 + z 2 = 1 380. A pentagon is formed by placing an isosceles triangle
on a rectangle, as shown in the diagram. If the perimeter of
364. f (x, y) = x 2 + y 2, (x − 1) 2 + 4y 2 = 4 the pentagon is 10 in., find the lengths of the sides of the
pentagon that will maximize the area of the pentagon.
2 y2
365. f (x, y) = 4xy, x + =1
9 16
366. f (x, y, z) = x + y + z, 1x + 1y + 1z = 1
367. f (x, y, z) = x + 3y − z, x 2 + y 2 + z 2 = 4
384. Show that, of all the triangles inscribed in a circle 393. [T] By investing x units of labor and y units of
of radius R (see diagram), the equilateral triangle has the capital, a watch manufacturer can produce
0.4 0.6
largest perimeter. P(x, y) = 50x y watches. Find the maximum
number of watches that can be produced on a budget of
$20,000 if labor costs $100/unit and capital costs $200/
unit. Use a CAS to sketch a contour plot of the function.
CHAPTER 4 REVIEW
KEY TERMS
boundary point a point P 0 of R is a boundary point if every δ disk centered around P 0 contains points both inside
and outside R
connected set an open set S that cannot be represented as the union of two or more disjoint, nonempty open subsets
constraint an inequality or equation involving one or more variables that is used in an optimization problem; the
constraint enforces a limit on the possible solutions for the problem
contour map a plot of the various level curves of a given function f (x, y)
critical point of a function of two variables the point (x 0, y 0) is called a critical point of f (x, y) if one of the two
following conditions holds:
1. f x (x 0, y 0) = f y (x 0, y 0) = 0
directional derivative the derivative of a function in the direction of a given unit vector
discriminant the discriminant of the function f (x, y) is given by the formula
D = f xx(x 0, y 0) f yy(x 0, y 0) − ⎛⎝ f xy(x 0, y 0)⎞⎠ 2
function of two variables a function z = f (x, y) that maps each ordered pair (x, y) in a subset D of ℝ 2 to a unique
real number z
generalized chain rule the chain rule extended to functions of more than one independent variable, in which each
independent variable may depend on one or more other variables
gradient the gradient of the function f (x, y) is defined to be ∇ f (x, y) = ⎛⎝∂ f /∂ x⎞⎠i + ⎛⎝∂ f /∂ y⎞⎠ j, which can be
generalized to a function of any number of independent variables
graph of a function of two variables a set of ordered triples (x, y, z) that satisfies the equation z = f (x, y) plotted
in three-dimensional Cartesian space
higher-order partial derivatives second-order or higher partial derivatives, regardless of whether they are mixed
partial derivatives
interior point a point P 0 of R is a boundary point if there is a δ disk centered around P 0 contained completely in R
intermediate variable given a composition of functions (e.g., f ⎛⎝x(t), y(t)⎞⎠), the intermediate variables are the
variables that are independent in the outer function but dependent on other variables as well; in the function
f ⎛⎝x(t), y(t)⎞⎠, the variables x and y are examples of intermediate variables
Lagrange multiplier the constant (or constants) used in the method of Lagrange multipliers; in the case of one constant,
it is represented by the variable λ
level curve of a function of two variables the set of points satisfying the equation f (x, y) = c for some real
472 Chapter 4 | Differentiation of Functions of Several Variables
level surface of a function of three variables the set of points satisfying the equation f (x, y, z) = c for some real
number c in the range of f
linear approximation given a function f (x, y) and a tangent plane to the function at a point (x 0, y 0), we can
approximate f (x, y) for points near (x 0, y 0) using the tangent plane formula
method of Lagrange multipliers a method of solving an optimization problem subject to one or more constraints
mixed partial derivatives second-order or higher partial derivatives, in which at least two of the differentiations are
with respect to different variables
objective function the function that is to be maximized or minimized in an optimization problem
open set a set S that contains none of its boundary points
optimization problem calculation of a maximum or minimum value of a function of several variables, often using
Lagrange multipliers
partial derivative a derivative of a function of more than one independent variable in which all the variables but one are
held constant
partial differential equation an equation that involves an unknown function of more than one independent variable
and one or more of its partial derivatives
region an open, connected, nonempty subset of ℝ 2
saddle point given the function z = f (x, y), the point ⎛⎝x 0, y 0, f (x 0, y 0)⎞⎠ is a saddle point if both f x (x 0, y 0) = 0
and f y (x 0, y 0) = 0, but f does not have a local extremum at (x 0, y 0)
tangent plane given a function f (x, y) that is differentiable at a point (x 0, y 0), the equation of the tangent plane to
the surface z = f (x, y) is given by z = f (x 0, y 0) + f x (x 0, y 0)(x − x 0) + f y (x 0, y 0)(y − y 0)
total differential the total differential of the function f (x, y) at (x 0, y 0) is given by the formula
dz = f x (x 0, y 0)dx + f y (x 0, y 0)dy
tree diagram illustrates and derives formulas for the generalized chain rule, in which each independent variable is
accounted for
vertical trace the set of ordered triples (c, y, z) that solves the equation f (c, y) = z for a given constant x = c or the
set of ordered triples (x, d, z) that solves the equation f (x, d) = z for a given constant y = d
KEY EQUATIONS
• Vertical trace
f (a, y) = z for x = a or f (x, b) = z for y = b
• Tangent plane
z = f (x 0, y 0) + f x (x 0, y 0)(x − x 0) + f y (x 0, y 0)(y − y 0)
• Linear approximation
L(x, y) = f (x 0, y 0) + f x (x 0, y 0)(x − x 0) + f y (x 0, y 0)(y − y 0)
• Total differential
dz = f x (x 0, y 0)dx + f y (x 0, y 0)dy.
• Discriminant
474 Chapter 4 | Differentiation of Functions of Several Variables
D = f xx (x 0, y 0) f yy (x 0, y 0) − ⎛⎝ f xy (x 0, y 0)⎞⎠ 2
KEY CONCEPTS
4.1 Functions of Several Variables
• The graph of a function of two variables is a surface in ℝ 3 and can be studied using level curves and vertical traces.
• A set of level curves is called a contour map.
• To study limits and continuity for functions of two variables, we use a δ disk centered around a given point.
• A function of several variables has a limit if for any point in a δ ball centered at a point P, the value of the
function at that point is arbitrarily close to a fixed value (the limit value).
• The limit laws established for a function of one variable have natural extensions to functions of more than one
variable.
• A function of two variables is continuous at a point if the limit exists at that point, the function exists at that point,
and the limit and function are equal at that point.
• A partial derivative is a derivative involving a function of more than one independent variable.
• To calculate a partial derivative with respect to a given variable, treat all the other variables as constants and use the
usual differentiation rules.
• Higher-order partial derivatives can be calculated in the same way as higher-order derivatives.
• The analog of a tangent line to a curve is a tangent plane to a surface for functions of two variables.
• Tangent planes can be used to approximate values of functions near known values.
• A function is differentiable at a point if it is ”smooth” at that point (i.e., no corners or discontinuities exist at that
point).
• The total differential can be used to approximate the change in a function z = f (x 0, y 0) at the point (x 0, y 0) for
given values of Δx and Δy.
• The chain rule for functions of more than one variable involves the partial derivatives with respect to all the
independent variables.
• Tree diagrams are useful for deriving formulas for the chain rule for functions of more than one variable, where
each independent variable also depends on other variables.
• An objective function combined with one or more constraints is an example of an optimization problem.
• To solve optimization problems, we apply the method of Lagrange multipliers using a four-step problem-solving
strategy.
then f xy = f yx.
For the following exercises, find the largest interval of
continuity for the function.
396. The linear approximation to the function of
f (x, y) = 5x 2 + x tan(y)at (2, π) is given by 402. f (x, y) = x 3 sin −1 (y)
L(x, y) = 22 + 21(x − 2) + (y − π).
403. g(x, y) = ln⎛⎝4 − x 2 − y 2⎞⎠
⎛3 9 ⎞
397. ⎝4 , 16 ⎠ is a critical point of
For the following exercises, find all first partial derivatives.
g(x, y) = 4x 3 − 2x 2 y + y 2 − 2.
404. f (x, y) = x 2 − y 2
For the following exercises, sketch the function in one
graph and, in a second, sketch several level curves. 405. u(x, y) = x 4 − 3xy + 1, x = 2t, y = t 3
⎛ ⎞
−⎝x 2 + 2y 2⎠
398. f (x, y) = e .
For the following exercises, find all second partial
derivatives.
399. f (x, y) = x + 4y 2. 406. g(t, x) = 3t 2 − sin(x + t)
476 Chapter 4 | Differentiation of Functions of Several Variables
x + y2
415. f (x, y) = xy
418. f (x, y) = x 2 − y 2, x + 6y = 4