Diff Notes
Diff Notes
1.1 Theorem. Suppose U and V are open sets with f and g complex-valued func-
tions defined on U and V respectively, where f (U ) ⊂ V . Suppose that z0 ∈ U (so
that f (z0 ) ∈ V ). If f is (complex) differentiable at z0 and g is differentiable at f (z0 ),
then g ◦ f : U → C is differentiable at z0 , and (g ◦ f )0 (z0 ) = g 0 (f (z0 ))f (z0 ).
|R(z)|
f (z) − f (z0 ) = f 0 (z0 )(z − z0 ) + R(z) where → 0 as z → z0 , (1)
|z − z0 |
and
|R(w)|
g(w) − g(w0 ) = g 0 (w0 )(w − w0 ) + S(w) where → 0 as w → w0 . (2)
|w − w0 |
|T (z)|
h(z) − h(z0 ) = g 0 (f (z0 ))(z − z0 ) + T (z) where → 0 as z → z0 . (3)
|z − z0 |
and then substitute the result of (1) into this equation to get:
h(z) − h(z0 ) = g 0 (f (z0 ))f 0 (z0 )(z − z0 ) + g 0 (f (z0 ))R(z) + S(f (z)) (4)
The second term on the right side of (4) is o(|z − z0 |) as z → z0 , so all we have to do
is show that the same is true for the third term on the right.
For this, let ε(w) = S(w)/|w − w0 |, so that S(w) = ε(w)|w − w0 | where ε(w) → 0
as w → w0 . Then substituting w = f (z) and using (1) one last time:
S(f (z)) = ε(f (z))|f (z) − f (z0 )| = ε(f (z))|f 0 (z0 )| |z − z0 | + ε(f (z))R(z),
where, on the right-hand side of the equation, the first term on the is o(|z − z0 |)
because f (z) → f (z0 ) as z → z0 (remember: differentiability at a point implies
continuity there), and the second term is o(|z − z0 |) because R(z) has this property,
and ε is bounded. Thus S(f (z)) is o(|z − z0 |) and z → z0 , and the proof of the chain
rule is complete. ¤
Math 829 Spring 1999
1.2 Exercise. Suppose f obeys the hypotheses above, [a, b] is a finite, closed real
interval contained in f (U ), and γ : [a, b] → C is (real) differentiable at a point t0 ∈
(a, b). show that f ◦γ is (real) differentiable at t0 , and that (f ◦γ)0 (t0 ) = f 0 (γ(t0 ))γ 0 (t0 ).
2.1 Theorem. If the first partial derivatives of u exist at every point of G and are
continuous at z0 , the u is differentiable at z0 .
(for h = (h1 , h2 ) ∈ R2 with |h| sufficiently small, then |R(h)| = o(|h|) as |h| → 0.
The first step is to write
and then apply the (one-variable) Mean Value Theorem of differential calculus to each
of the square-bracketed terms on the right. With this you see that the right-hand
side of the equation above is:
where t1 lies between 0 and h1 and t2 lies between 0 and h2 . Thus z1 = (x0 +h1 , y0 +t1 )
and z2 = (x0 , y0 + t2 ) both → z0 as h → 0, and
R(h) = [ux (z1 ) − ux (z0 )]h1 + [uy (z2 ) − yy (z0 )]h2 , (5)
|R(h)| ≤ ε(h)|h|,
-2-
Math 829 Spring 1999
2.2 Exercise. Define “differentiable” for real-valued functions defined on open sub-
sets of Rn . State and prove a sufficient condition for differentiability of such functions
that generalizes Theorem 2.1.
3.3 Exercises. You should go back to the definition of differentiability for functions
R2 → R and identify this linear transformation. Do the same for functions from an
open subset of Rn to R. Do the same for differentiable functions from intervals of
the real line to R2 , or more generally Rn (the so-called “vector-valued functions).
Finally, how do you fit the definition of “complex differentiability” into this “linear
transformation” context?
3.5 The matrix of Df(p0 ). Let {e1 , e2 , . . . en } be the standard unit vector basis
for Rn (i.e., ej is the vector with 1 in the j-th position and zeros elsewhere). Then
upon fixing j and substituting h = tej into (6) and letting t → 0, an argument entirely
similar to the one we used in class for the case n = 2, m = 1 shows that
µ ¶
∂f ∂f1 ∂fm
T ej = (p0 ) = (p0 ), . . . , (p0 ) ,
∂xj ∂xj ∂xj
where fj : U → R is the j-th coordinate function of f :
f = (f1 , f1 , . . . , fn ).
Thus the matrix of Df (p0 ) with respect to the standard bases in Rn and Rm respec-
∂f
tively is the one whose j-th column is ∂x j
(p0 ) (written as a column vector, rather
than as the usual row vector). Let’s call this matrix [Df (p0 )]. Thus [Df (p0 )] =
i=1,j=1 . The image Df (p0 )h of a vector h ∈ R is the m-dimensional vector
(p0 )]m,n
∂fi n
[ ∂xj
found from the equation
[Df (p0 )h] = [Df (p0 )][h],
-3-
Math 829 Spring 1999
where square brackets around a vector denote the corresponding column matrix, i.e.
the transpose of the original (row) vector.
3.6 The Chain Rule revisited. Now suppose that, in addition to the setup above,
V is an open subset of Rm contained in f (U ), and g : V → Rp is differentiable at
f (p0 ). Then g ◦ f : U → Rp is differentiable at p0 , and
where the product between the derivatives on the right is the product of linear linear
transformations, i.e. their composition.
3.7 Exercise. Adopt the argument of §1 to prove this version of the Chain Rule.
Proof. A tangent vector to a curve γ at one of its points γ(t0 ) is just γ 0 (t0 ), which you
can think of as a vector (with coordinates equal to the derivatives of the coordinate
functions). (As a linear transformation R → Rn , this derivative would just be the
map that takes h ∈ R to h times the tangent vector.) With this in hand, the Corollary
becomes a restatement of the Chain Rule—try it! ¤
-4-
Math 829 Spring 1999
Corollary 3.8 says, roughly, that the linear map Df (p0 ) takes each vector tangent
to a given curve through p0 into a vector tangent to the image curve at f (p0 ). In
this context, when we assert that Df (p0 ) is a linear transformation from Rn to Rn we
should actually think of Rn as the space of vectors tangent to all possible differentiable
curves through p0 , and Rm as the corresponding “tangent space” at f (p0 ). This point
of view will show up again in the next section.
From our discussion of the chain rule, the problem is to show that, for every point
p = (ξ, η, ζ) ∈ S 2 \{N }, the linear transformation Dσ(p) preserves angles between
vectors tangent to S 2 at p! In other words, if v and w are three-dimensional vectors
tangent to S 2 at p (analytically: their inner products with p are both zero), then the
angle from v to w is the same as the angle from Dσ(p)v to Dσ(p)w.
-5-
Math 829 Spring 1999
4.3 Analytic statement of the problem. It’s pretty clear that the stereographic
projection preserves the sense of angles, so we will concentrate on preservation of
magnitudes of angles. Here is the analytic expression of what needs to be done.
(i.e. v and w are orthogonal to the line from the origin to p, and hence tangent to S 2
at p). Let A = [Dσ(p)], the matrix of Df (p) with respect to the standard basis. We
desire to show that
hAv, Awi hv, wi
= . (10)
|Av| |Aw| |v| |w|
The quantities that show up on the left and right hand sides of this last equation
are, you will recall, the cosines of the angles between the respective pairs of vectors1 .
4.4 Exercise—reduction of problem. Show that in order to prove (10) you need
only show that there is a positive constant c such that
= hw, A∗ Avi
from which the desired result follows by the symmetry of the inner product (it is
unchanged if the order of its entries is reversed). Note that this calculation works as
well for any m × n matrix and any pair of vectors in Rn .
1
Remember that the inner product on the left-hand side of the equation is the one for R3 , and
the one on the right is that of R2 .
-6-
Math 829 Spring 1999
Suppose for the moment that v = (v1 , v2 , v3 ) and w = (w1 , w2 , w3 ) are any vectors
in R3 . Then using the fact that hAv, Awi = hA∗ Av, wi = [w]T [A∗ A][v] we obtain
after a little calculation:
c−1 hAv, Awi = hA∗ Av, wi = [w]T [A∗ A][v]
= (v1 + xv3 )w1 + (v2 + yv3 )w2 + [xv1 + yv2 + (x2 + y 2 )v3 ]w3
= hv, wi + ∆,
where
∆ = (xw1 + yw2 )v3 + [xv1 + yv2 + (x2 + y 2 − 1)v3 ]w3 .
We claim that if v and w are tangent to S 2 at p then ∆ = 0, which will finish our proof.
For this, go back to equation (12) describing x and y in terms of the coordinates of
p, and note that
2ζ
x2 + y 2 − 1 = ,
1−ζ
hence
(1 − ζ)∆ = (ξw1 + ηw2 )v2 + (ξv1 + ηv2 + 2ζv3 )w3
-7-
Math 829 Spring 1999
5.1 Mercator, conformality, and calculus. In order to make a map of the earth
one has to address the problem of representing the sphere S 2 with some accuracy
on a flat plane. The Flemish map maker Gerardus Mercator attacked the problem
of making a map that would represent S 2 \{|ζ| < 1 − ε}, the unit sphere of R3
with equal spherical caps removed around the north and south poles, on a rectangle
[−π, π] × [−h, h], where ε is some small positive number, and h is positive. Mercator
wanted his map to have the following properties:
(a) The circles of latitude should be represented by horizontal lines of length 2h,
with the equator (latitude zero degrees) the horizontal axis of symmetry.
(b) Equally spaced circles of longitude (great circles on S 2 through the north and
south poles) should be represented by equally spaced vertical lines.
(c) The correspondence between points of S 2 and points of Mercator’s map should
be conformal.
-8-
Math 829 Spring 1999
but note that it wasn’t until 1614, fifteen years after Wright’s treatise on Mercator’s
projection, that Napier published his invention of logarithms!
In 1645 Henry Bond, based on both the work of Wright and recently published
tables of logarithms, conjectured (15), and this became one of the outstanding mathe-
matical problems of the latter half of the seventeenth century. James Gregory proved
(15) in 1668, but his proof was so complicated that it was viewed as suspicious at
best. Two years later Isaac Barrow, Newton’s predecessor at Cambridge, gave a com-
prehensible proof—essentially the same one you find in the calculus books of today.
For this Barrow invented the technique of partial fraction decompositions, which he
applied to the evaluation of many other integrals.
5.2 The Mercator Projection and the complex logarithm. The work we have
done in these notes on conformality of the stereographic projection, the corresponding
conformality of holomorphic functions done in class, and the holomorphicness of the
Principal Branch of the logarithm function result in a quick solution of Mercator’s
problem:
def
(a) First note that the stereographic projection maps the spherical region Sε =
S 2 \{|ζ| < 1 − ε} onto the annular open subset of C described by
(b) Recall that, under the stereographic projection, the circles of longitude on the
sphere go to rays in the plane, and the circles of latitude go to circles in the
plane centered at the origin.
-9-
Math 829 Spring 1999
(c) The conformal map w = φ(z) = iLog z takes the slit annulus AR \(−R, −1/R)
one-to-one onto the rectangle Rh = [−π, π] × [− ln h, ln h], taking the concentric
circles that are images of the latitudes on the sphere onto horizontal lines in the
rectangle, and the rays that are the images of the circles of longitude to vertical
lines in the rectangle.
(d) Since both the stereographic projection σ and the logarithm map φ defined
above are conformal, so is their composition φ◦σ : Sε → Rh .This is the Mercator
projection.
Ν
γ
γ P*
α
λ
O 1 P
α + λ = π/2
2γ + α = π
therefore:
γ = λ/2 + π/4
-10-