Multivariable Calculus: 1. The Derivative
Multivariable Calculus: 1. The Derivative
1. The Derivative
3. The limit definition for DFa is equivalent to the following condition: for every
> 0, there exists a δ > 0 so that
o(h)
lim = 0.
h→0 khk
2. Partial Derivatives
∂F F (a + hei ) − F (a)
(a) = lim .
∂xi h→0 h
Here ei denotes a unit vector in the xi direction.
PROOF By Theorem 1, we know that DFa (ei ) is the partial derivative (∂F/∂xi )(a).
But for any matrix A, the vector Aei is simply the i’th column of A, and therefore
DFa is the matrix whose i’th column is (∂F/∂xi )(a). From the definition of the
partial derivative, it is easy to see that
∂F ∂f1 ∂fn
(a) = (a), . . . , (a)
∂xi ∂xi ∂xi
and the corollary follows.
Dfx = f 0 (x)
for any x ∈ (a, b). Indeed, it can be shown that f 0 (x) exists if and only if Dfx exists.
More generally, a parametric curve is any function γ : (a, b) → Rn . The deriva-
tive of such a curve is the vector γ 0 (t) defined by the formula
γ(t + h) − γ(t)
γ 0 (t) = lim .
h→0 h
That is, γ 0 (t) is equal to ∂γ/∂t. Again it follows from Corollary 2 that
Dγt = γ 0 (t)
for any t ∈ (a, b). Moreover, one can show that γ 0 (t) exists if and only if Dγt exists.
Note that Dfa is a row vector, not an ordinary (column) vector. As a linear trans-
formation, Dfa is the linear functional Rm → R defined by
∂f
Dfa (ei ) = (a).
∂xi
The transpose of Dfa is vector, known as the gradient of f :
T ∂f ∂f
∇f (a) = (Dfa ) = (a), . . . , (a)
∂x1 ∂xm
Multivariable Calculus 6
From Corollary 2, we might be tempted to think that the derivative DFa is the
same as the Jacobian matrix of partial derivatives, but this is not always the case.
In particular, it is possible for the partial derivatives of F to exist without F being
differentiable:
4. The function f is differentiable at (0, 0) if and only if f is linear, i.e. if and only
if ϕ has the form ϕ(θ) = a cos θ + b sin θ for some constants a, b ∈ R.
In particular, if ϕ satisfies conditions (2) and (3) but not (4), then the partial deriva-
tives of f will exist but f will not be differentiable.
For a specific example, consider the case where ϕ(θ) = sin(3θ). In Cartesian
coordinates, the resulting function f can be written
3x2 y − y 3
f (x, y) = ,
x2 + y 2
we have (∂f /∂x)(0, 0) = 0 and (∂f /∂y)(0, 0) =−1, soboth partial derivatives exist
at (0, 0). However, it is not true that Df(0,0) = 0 −1 . For example, along the line
y = x we have f (x, x) = x, so the directional derivative in this direction satisfies
∂f 1
= 1 6= 0 −1 .
∂(1, 1) 1
An interesting feature of this example is that the partial derivatives of f are not
continuous at the point (0, 0). For example, one can show that
(
∂f 3 if x 6= 0
(x, 0) =
∂y 0 if x = 0,
so ∂f /∂y is not continuous at the point (0, 0).
The following theorem gives conditions on the partial derivatives that guarantee
the differentiability of a function. We shall not prove this theorem here:
Note that the maximum in the above definition always exists since the unit sphere
in Rm is compact. Though we shall not prove it here, the norm kAk can also be defined
as the largest eigenvalue of the symmetric matrix ATA.
The most important property of the matrix norm is the following inequality:
We will also need the following inequality, which is similar in spirit to the Mean
Value Theorem from single variable calculus. Its proof is left as an exercise to the
reader:
Multivariable Calculus 9
2. For the second term, fix a positive constant N > kDGb k. Since F is differentiable
at a, there exists a δ3 > 0 so that
kx − ak < δ3 ⇒ kF (x) − LF (x)k ≤ kx − ak.
2N
If kx − ak < δ3 , it follows that
k(LG ◦ F )(x) − (LG ◦ LF )(x)k =
DGb F (x) − LF (x)
≤ N kF (x) − LF (x)k
≤ kx − ak.
2
4. Diffeomorphisms
One of the most important types of differentiable maps is the diffeomorphism:
Definition: Diffeomorphism
Let U and V be open subsets of Rn . A diffeomorphism from U to V is a map
F : U → V with the following properties:
1. F is bijective.
2. F is differentiable, and
3. F −1 is differentiable.
Multivariable Calculus 11
F
−−−−−→
U V
Figure 2: A diffeomorphism between two open subsets of R2 .
EXAMPLE 4 Diffeomorphisms on R
Let f : R → R be a diffeomorphism. According to the above theorem, the derivative
f 0 for such a function must satisfy f 0 (a) 6= 0 for each a ∈ R. Moreover, the derivatives
of f and f −1 are related by the formula
1
(f −1 )0 (b) = 0
f (a)
Multivariable Calculus 12
for each a ∈ R, where b = f (a). Indeed, it follows from the Inverse Function Theorem
(see below) that any differentiable bijection f : R → R with nonzero derivative is a
diffeomorphism.
If f has a critical point, then it cannot be a diffeomorphism. For example, the
map f : R → R defined by f (x) = x3 is a differentiable√ bijection, but is not a
−1
diffeomorphism, since the inverse function f (x) = x is not differentiable at 0.
3
1. F is not differentiable at a, or
Curvilinear Coordinates
Though diffeomorphisms might seem like an entirely new concept, they are closely
related to the familiar notion of curvilinear coordinates.
Multivariable Calculus 13
ϕ
0 −−−−−→
-¼
0 2 4 6 8
U
V
Figure 3: The polar coordinates diffeomorphism.
for all x ∈ V .
Note that the coordinat functions u1 , . . . , un are essentially the components of the
inverse diffeomorphism ϕ−1 : V → U . That is,
Change of Variables
In this section we discuss the change of variables formula for integration. We begin
by reviewing the geometric meaning of determinants.
Multivariable Calculus 15
v3
v2
a v1
Definition: Parallelepiped
A parallelepiped in Rn is a set of the form
Let a, v1 , . . . , vn ∈ Rn , and let A be the matrix whose columns are the vectors
v1 , . . . , vn . Then the volume of the parallelepiped P (a, v1 , . . . , vn ) is |det A|.
Here “volume” refers to area in the case where n = 2, or length in the case where
n = 1. As a result of the above theorem, we obtain the following interpretation for
the determinant of a linear transformation:
Let A be the matrix whose columns are v1 , . . . , vn , and let B be the matrix whose
columns are T (v1 ), . . . , T (vn ). Then B is equal to T A, the product of the matrix for
T with A, so
|det B| = |(det T )(det A)| = |det T | |det A|.
Sketch of Proof The first statement follows from the fact that
P S
µ(S) = inf µ(Pn ) P1 , P2 , . . . are parallelepipeds in Rn and S ⊂ Pn
for any measurable set S. Then the second statement is clearly true for simple
functions, and can be extended to any L1 function using the Dominated Convergence
Theorem.
Definition: Jacobian
Let U be an open subset of Rn , and let F : U → Rn be a differentiable map. The
Jacobian of F is the function JF : U → [0, ∞) defined by
Roughly speaking, the Jacobian measures the amount by which F multiplies vol-
umes at a point a. That is, if S is a measurable set in a small neighborhood of a,
then
µ f (S) ≈ |JF (a)| µ(S).
Incidentally, one can prove that the Jacobian is always a measurable function.
We are now ready to state the change of variables formula, though we are not in
a position to prove it: