Introdiffgeo PDF
Introdiffgeo PDF
DIFFERENTIAL GEOMETRY
18 March 2013
ii
Preface
These are notes for the lecture course “Differential Geometry I” held by the
second author at ETH Zürich in the fall semester 2010. They are based
on a lecture course held by the first author at the University of Wisconsin–
Madison in the fall semester 1983.
In the present manuscript the sections are roughly in a one-to-one corre-
spondence with the 26 lectures at ETH, each lasting two times 45 minutes.
(Exceptions: Of Section 2.6 only the existence of a Riemannian metric was
covered in one of the earlier lectures; Sections 1.9/1.10 together were two
lectures, as well as 4.4/4.5; some of the material in the longer Sections like
1.6 and 2.4 was left as exercises.)
iii
iv
Contents
1 Foundations 1
1.1 Manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.1 Some examples . . . . . . . . . . . . . . . . . . . . . . 1
1.1.2 Recollections about smooth maps and derivatives . . . 2
1.1.3 Submanifolds of Euclidean space . . . . . . . . . . . . 4
1.1.4 Examples and exercises . . . . . . . . . . . . . . . . . 9
1.2 Tangent spaces and derivatives . . . . . . . . . . . . . . . . . 12
1.2.1 Tangent spaces . . . . . . . . . . . . . . . . . . . . . . 12
1.2.2 The derivative . . . . . . . . . . . . . . . . . . . . . . 16
1.2.3 The inverse and implicit function theorems . . . . . . 18
1.3 Submanifolds and embeddings . . . . . . . . . . . . . . . . . . 19
1.4 Vector fields and flows . . . . . . . . . . . . . . . . . . . . . . 23
1.4.1 Vector fields . . . . . . . . . . . . . . . . . . . . . . . . 23
1.4.2 The flow of a vector field . . . . . . . . . . . . . . . . 26
1.4.3 The group of diffeomorphisms . . . . . . . . . . . . . . 28
1.5 The Lie bracket . . . . . . . . . . . . . . . . . . . . . . . . . . 29
1.6 Lie groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
1.6.1 Definition and examples . . . . . . . . . . . . . . . . . 35
1.6.2 The Lie algebra of a Lie group . . . . . . . . . . . . . 38
1.6.3 Lie group homomorphisms . . . . . . . . . . . . . . . . 40
1.6.4 Lie groups and diffeomorphisms . . . . . . . . . . . . . 44
1.6.5 Vector fields and derivations . . . . . . . . . . . . . . . 45
1.7 Vector bundles and submersions . . . . . . . . . . . . . . . . . 47
1.7.1 Submersions . . . . . . . . . . . . . . . . . . . . . . . 47
1.7.2 Vector bundles . . . . . . . . . . . . . . . . . . . . . . 48
1.8 The theorem of Frobenius . . . . . . . . . . . . . . . . . . . . 52
1.9 The intrinsic definition of a manifold . . . . . . . . . . . . . . 59
1.9.1 Definition and examples . . . . . . . . . . . . . . . . . 59
1.9.2 Paracompactness . . . . . . . . . . . . . . . . . . . . . 64
v
vi CONTENTS
2 Geodesics 81
2.1 The length of a curve . . . . . . . . . . . . . . . . . . . . . . . 81
2.2 Geodesics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
2.2.1 The orthogonal projection onto Tp M . . . . . . . . . . 88
2.2.2 The covariant derivative . . . . . . . . . . . . . . . . . 89
2.2.3 The space of paths . . . . . . . . . . . . . . . . . . . . 90
2.2.4 Critical points of E and L . . . . . . . . . . . . . . . . 92
2.3 Christoffel symbols . . . . . . . . . . . . . . . . . . . . . . . . 95
2.4 The geodesic flow . . . . . . . . . . . . . . . . . . . . . . . . . 102
2.4.1 The second fundamental form . . . . . . . . . . . . . . 102
2.4.2 The tangent bundle of the tangent bundle . . . . . . . 105
2.4.3 The exponential map . . . . . . . . . . . . . . . . . . . 105
2.4.4 Examples and exercises . . . . . . . . . . . . . . . . . 107
2.4.5 Geodesics minimize the length . . . . . . . . . . . . . 108
2.4.6 Convexity . . . . . . . . . . . . . . . . . . . . . . . . . 112
2.5 The Hopf–Rinow theorem . . . . . . . . . . . . . . . . . . . . 114
2.6 Riemannian metrics . . . . . . . . . . . . . . . . . . . . . . . 122
2.6.1 Riemannian metrics . . . . . . . . . . . . . . . . . . . 122
2.6.2 Covariant derivatives . . . . . . . . . . . . . . . . . . . 128
2.6.3 Geodesics . . . . . . . . . . . . . . . . . . . . . . . . . 130
4 Curvature 163
4.1 Isometries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
4.2 The Riemann curvature tensor . . . . . . . . . . . . . . . . . 171
4.2.1 Definition and Gauss–Codazzi . . . . . . . . . . . . . . 171
4.2.2 The covariant derivative of a global vector field . . . . 173
4.2.3 A global formula . . . . . . . . . . . . . . . . . . . . . 176
4.2.4 Symmetries . . . . . . . . . . . . . . . . . . . . . . . . 178
4.2.5 Examples and exercises . . . . . . . . . . . . . . . . . 180
4.3 Generalized Theorema Egregium . . . . . . . . . . . . . . . . 182
4.3.1 Pushforward . . . . . . . . . . . . . . . . . . . . . . . 182
4.3.2 Theorema Egregium . . . . . . . . . . . . . . . . . . . 183
4.3.3 The Riemann curvature tensor in local coordinates . . 186
4.3.4 Gaussian curvature . . . . . . . . . . . . . . . . . . . . 187
4.3.5 Gaussian curvature in local coordinates . . . . . . . . 191
4.4 The Cartan–Ambrose–Hicks theorem . . . . . . . . . . . . . . 192
4.4.1 Homotopy . . . . . . . . . . . . . . . . . . . . . . . . . 193
4.4.2 The global C-A-H theorem . . . . . . . . . . . . . . . 194
4.4.3 The local C-A-H theorem . . . . . . . . . . . . . . . . 200
4.5 Flat spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203
4.6 Symmetric spaces . . . . . . . . . . . . . . . . . . . . . . . . . 206
4.6.1 Symmetric spaces . . . . . . . . . . . . . . . . . . . . . 207
4.6.2 The covariant derivative of the curvature . . . . . . . 209
4.6.3 Examples and exercises . . . . . . . . . . . . . . . . . 212
4.7 Constant curvature . . . . . . . . . . . . . . . . . . . . . . . . 214
4.7.1 Sectional curvature . . . . . . . . . . . . . . . . . . . . 214
4.7.2 Constant sectional curvature . . . . . . . . . . . . . . 215
4.7.3 Examples and exercises . . . . . . . . . . . . . . . . . 218
4.7.4 Hyperbolic space . . . . . . . . . . . . . . . . . . . . . 219
4.8 Nonpositive sectional curvature . . . . . . . . . . . . . . . . . 224
4.8.1 The theorem of Hadamard and Cartan . . . . . . . . . 225
4.8.2 Positive definite symmetric matrices . . . . . . . . . . 231
viii CONTENTS
Chapter 1
Foundations
1.1 Manifolds
1.1.1 Some examples
Let k, m, n be positive integers. Throughout we denote by
q
|x| := x21 + · · · + x2n
S m := x ∈ Rm+1 | |x| = 1 .
1
2 CHAPTER 1. FOUNDATIONS
Here 1l denotes the identity matrix and AT denotes the transposed matrix
of A. An example of a manifold that is not (in an obvious way) embedded
in some Euclidean space is the complex Grassmannian
Gk (Cn ) := {E ⊂ Cn | E is a complex linear subspace of dimension k}
of k-planes in Cn .
for every x ∈ U . Moreover the identity map is always smooth and its differ-
ential at every point is the identity matrix. This implies that, if f : U → V
is a diffeomorphism (i.e. f is bijective and f and f −1 are both smooth)
then k = ` and the Jacobi matrix df (x) ∈ Rk×k is nonsingular for every
x ∈ U . A partial converse is the inverse function theorem which we restate
below. For a proof see [12] or any textbook on first year analysis.
Inverse Function Theorem. Let Ω ⊂ Rk be an open set, f : Ω → Rk
be a smooth map, and x0 ∈ Ω. If det(df (x0 )) 6= 0 then there is an open
neighborhood U ⊂ Ω of x0 such that V := f (U ) is an open subset of Rk and
f |U : U → V is a diffeomorphism.
Implicit Function Theorem. Let Ω ⊂ Rk × R` be open and f : Ω → R`
be a smooth map. Let (x0 , y0 ) ∈ Ω such that
∂f
f (x0 , y0 ) = 0, det (x0 , y0 ) 6= 0.
∂y
(Here ∂f
∂y (x0 , y0 ) ∈ R
`×` denotes the Jacobi matrix of the map y 7→ f (x , y)
0
at the point y = y0 .) Then there are open sets V ⊂ Rk and W ⊂ R` and a
smooth map g : V → W such that (x0 , y0 ) ∈ V × W ⊂ Ω, g(x0 ) = y0 , and
f (x, y) = 0 ⇐⇒ y = g(x).
Exercise 1.2. Show that the relative topology satisfies the axioms of a
topology (i.e. arbitrary unions and finite intersections of M -open sets are
M -open, and the empty set and M itself are M -open). Show that the
complement of an M -open set in M is M -closed and vice versa.
U M M φ Ω
dΦ(ψ(x))dψ(x) = id : Rm → Rm
M := S 2 = (x, y, z) ∈ R3 | x2 + y 2 + z 2 = 1
φ(x, y, z) := (x, y)
U ∩ M = f −1 (0) = {q ∈ U | f (q) = 0} .
det(dψ0 (x0 ) B) 6= 0.
e ∩ W,
U := U e ∩ ψ −1 (W ).
Ω := Ω
q ∈U ∩M =U
e ∩W ∩M =U
e0 ⊂ U0
Since the pairs (x, y) and (φ0 (q), 0) both belong to the set Ω
e and the restric-
tion of ψ to Ωe is injective we obtain x = φ0 (q) and y = 0. This proves (1.1).
It follows from (1.1) that the map φ := (ψ|Ω )−1 : U → Ω satisfies (ii) and
agrees with the map q 7→ (φ0 (q), 0) on U ∩ M . Thus we have proved that (i)
implies (ii).
That (ii) implies (iii) is obvious. Just define f : U → Rk−m as the
composition of φ with the projection of Rk onto the last k − m coordinates.
We prove that (iii) implies (i). Let f : U → Rk−m be as in (iii) and
denote
X := ker df (p) ⊂ Rk .
By (iii) this is an m-dimensional linear subspace of Rk and we choose a
(k − m)-dimensional linear subspace Y ⊂ Rk such that
Rk = X ⊕ Y.
8 CHAPTER 1. FOUNDATIONS
f (p + x + y) = 0 ⇐⇒ y = g(x).
U0 := Q ∩ M = {p + x + g(x) | x ∈ V } , Ω0 := V ⊂ X ∼
= Rm ,
Rf := R` \ Sf .
M := f −1 (c) = {p ∈ U | f (p) = c}
Sard’s theorem implies that the set of singular values does not contain
any open set and hence the set R` \ Sf of regular values is everywhere dense
in R` . In other words, for every c ∈ R` there is a sequence of regular values
converging to c. In fact, it follows from Sard’s theorem that the set of
regular values is residual in the sense of Baire, i.e. it contains a countable
intersection of open and dense sets. By Baire’s category theorem residual
sets (in a complete metric space) are always dense and, by definition, a
countable intersection of residual sets is still residual. Thus any countable
collection of smooth maps into the same target space R` still has a dense
set of common regular values.
We also emphasize that, if ` > k the differential df (p) : Rk → R` can
never be surjective and so the set of singular values is just the image of f . In
this case Sard’s theorem asserts that the image of f has Lebesgue measure
zero and this remains valid for every continuously differentiable function.
When k ≥ ` Sard’s theorem continues to hold for maps of class C k−`+1 (see
Abraham–Robbin [1]).
is a smooth 5-manifold.
10 CHAPTER 1. FOUNDATIONS
onto the last three coordinates is called the Roman surface and was dis-
covered by Jakob Steiner. The Roman surface can also be represented as
the set of solutions (ξ, η, ζ) ∈ R3 of the equation η 2 ζ 2 + ζ 2 ξ 2 + ξ 2 η 2 = ξηζ.
It is not a submanifold of R3 .
Exercise: Prove this. Show that M is diffeomorphic to a submanifold of R4 .
1.1. MANIFOLDS 11
for every g ∈ GL(n, R) and every v ∈ Rn×n . Deduce that the special linear
group
SL(n, R) := {g ∈ GL(n, R) | det(g) = 1}
is a smooth submanifold of Rn×n .
Example 1.19. The orthogonal group
O(n) := g ∈ Rn×n | g T g = 1l
Sn := S ∈ Rn×n | S T = S
f (g) := g T g.
M := (x, y) ∈ R2 | xy = 0
Tp M v
p
Tp M = ker df (p).
Bε (x0 ) := {x ∈ Rm | |x − x0 | < ε} ⊂ Ω0 .
This shows that v ∈ dφ(p)−1 (Rm × {0}) and thus we have proved (1.2).
Now the sets im dψ0 (x0 ) and dφ(p)−1 (Rm × {0}) are both m-dimensional
linear subspaces of Rk . Hence it follows from (1.2) that these subspaces agree
and that they both agree with Tp M . Thus we have proved assertions (i),
(ii), and (iv).
We prove (iii). If v ∈ Tp M then there is a smooth curve γ : R → M
such that γ(0) = p and γ̇(0) = v. For t sufficiently small we have γ(t) ∈ U ,
where U ⊂ Rk is the open set in (iii), and hence f (γ(t)) = 0. This implies
d
df (p)v = df (γ(0))γ̇(0) = f (γ(t)) = 0.
dt t=0
Thus we have proved that Tp M ⊂ ker df (p). Since the kernel of df (p)
and Tp M are both m-dimensional linear subspaces of Rk we deduce that
Tp M = ker df (p). This proves (iii) and the theorem.
Tx S m = x⊥ = ξ ∈ Rm+1 | hx, ξi = 0 .
M := (x, y) ∈ R3 × R3 | |x − y| = r
Example 1.27. Let H(x, y) := 12 |y|2 + V (x) be as in Exercise 1.17 and let
c be a regular value of H. If (x, y) ∈ M := H −1 (c) Then
Exercise 1.28. The tangent space of SL(n, R) at the identity matrix is the
space
sl(n, R) := T1l SL(n, R) = ξ ∈ Rn×n | trace(ξ) = 0
Tg O(n) = v ∈ Rn×n | g T v + v T g = 0 .
In particular, the tangent space of O(n) at the identity matrix is the space
of skew-symmetric matrices
df (p)v ∈ R`
by
d f (γ(h)) − f (p)
df (p)v := f (γ(t)) = lim . (1.3)
dt t=0 h→0 h
That the limit on the right in equation (1.3) exists follows from our
assumptions. We must prove, however, that the derivative is well defined,
i.e. that the right hand side of (1.3) depends only on the tangent vector
v but not on the choice of the curve γ used in the definition. This is the
content of the first assertion in the next theorem.
df (p)Tp M ⊂ Tf (p) N.
df (p) : Tp M → Tf (p) N
P := f −1 (q) = {p ∈ M | f (p) = q}
for every p ∈ P .
1.3. SUBMANIFOLDS AND EMBEDDINGS 19
U0 φ 0(U0)
M φ0
p
0
P
is injective. Hence there is a linear map B : Rm−n → Rm such that the map
F (q, z) := φ−1
0 (φ0 (f (q)) + Bz) .
This map is smooth, it satisfies F (q, 0) = f (q) for all q ∈ f −1 (U0 ), and
the derivative dF (q0 , 0) : Tq0 N × Rm−n → Tp0 M is the composition of the
map (1.6) with dφ0 (p0 )−1 : Rm → Tp0 M and hence is a vector space iso-
morphism. Thus the Inverse Function Theorem 1.34 asserts that there is an
N -open neighborhood V0 ⊂ N of q0 and an open neighborhood W0 ⊂ Rm−n
of the origin such that V0 × W0 ⊂ Ω, the set U0 := F (V0 × W0 ) ⊂ M is
M -open, and the restriction of F to V0 × W0 is a diffeomorphism onto U0 .
Thus we have constructed a diffeomorphism F : V0 × W0 → U0 that satis-
fies (1.4). We claim that its restriction to the product V × W of sufficiently
small open neighborhoods V ⊂ N of q0 and W ⊂ Rm−n of the origin also
satisfies (1.5). Otherwise there are sequences qi ∈ V0 converging to q0 and
zi ∈ W0 \ {0} converging to zero such that F (qi , zi ) ∈ P . Hence there is a
sequence qi0 ∈ N such that
F (qi , zi ) = f (qi0 ).
Exercise 1.47. Prove that the set of smooth vector fields on M is a real
vector space.
Example 1.48. Denote the standard cross product on R3 by
x2 y3 − x3 y2
x × y := x3 y1 − x1 y3
x1 y2 − x2 y1
For x, y ∈ R3 . Fix a vector ξ ∈ S 2 and define the maps X, Y : S 2 → R3 by
X(p) := ξ × p, Y (p) := (ξ × p) × p.
24 CHAPTER 1. FOUNDATIONS
These are vector fields with zeros ±ξ. Their integral curves (see Defini-
tion 1.51 below) are illustrated in Figure 1.7.
This vector field has a single zero at the origin and its integral curves are
illustrated in Figure 1.8.
K := {φ(t, p0 ) | 0 ≤ t ≤ t0 }
K ⊂ U, (−ε, ε) × U ⊂ D,
Here the last implication follows from (1.9). Moreover, for p ∈ Uk and
t0 /N − ε < t < kt0 /N + ε, we have, by (1.10), that
Since φ(t0 /N, p) ∈ Uk−1 for p ∈ Uk the right hand side is a smooth map on
the open set (t0 /N − ε, kt0 /N + ε) × Uk . Since Uk ⊂ U , φ is also a smooth
map on (−ε, ε) × Uk and hence on (−ε, kt0 /N + ε) × Uk . This completes
the induction. With k = N we have found an open neighborhood of (t0 , p0 )
contained in D, namely the set (−ε, t0 + ε) × UN , on which φ is smooth. The
case t0 ≤ 0 is treated similarly. This proves (i) and (ii) and the theorem.
28 CHAPTER 1. FOUNDATIONS
X is complete ⇐⇒ I(p) = R ∀ p ∈ M ⇐⇒ D = R × M.
φs+t = φs ◦ φt , φ0 = id (1.11)
Diff(M ) := {φ : M → M | φ is a diffeomorphism} .
This is a group. The group operation is composition and the neutral element
is the identity. Now equation (1.11) asserts that the flow of a complete vector
field X ∈ Vect(M ) is a group homomorphism
R → Diff(M ) : t 7→ φt .
1.5. THE LIE BRACKET 29
I × M → Rk : (t, p) 7→ Xt (p)
be a smooth map such that Xt ∈ Vect(M ) for every t. Prove that there is
a smooth family of diffeomorphisms I × M → M : (t, p) 7→ φt (p) satisfying
d
φt = Xt ◦ φt , φ0 = id (1.13)
dt
for every t ∈ I. Such a family of diffeomorphisms I → Diff(M ) : t 7→ φt
is called an isotopy of M . Conversely prove that every smooth isotopy
I → Diff(M ) : t 7→ φt is generated (uniquely) by a smooth family of vector
fields I → Vect(M ) : t 7→ Xt .
for q ∈ N .
Exercise 1.60. Prove that φ∗ X = (φ−1 )∗ X. Prove that for two diffeomor-
phism φ : M → N and ψ : N → P and two vector fields X ∈ Vect(M ) and
Z ∈ Vect(P ) we have (ψ ◦ φ)∗ X = ψ∗ φ∗ X and (ψ ◦ φ)∗ Z = φ∗ ψ ∗ Z.
30 CHAPTER 1. FOUNDATIONS
R → Diff(M ) : t 7→ φt , R → Diff(M ) : t 7→ ψ t
γ̇(0) = 0.
√
Prove that the curve [0, ∞) → Rk : t 7→ γ( t) is differentiable at t = 0 and
√
d 1
γ( t) = γ̈(0).
dt t=0
2
for s, t ∈ R. Then
γ(t) = β(t, t)
1.5. THE LIE BRACKET 31
and
∂β
(0, t) = X(p) − dψ t (ψ −t (p))X(ψ −t (p)), (1.16)
∂s
∂β
(s, 0) = dφs (φ−s (p))Y (φ−s (p)) − Y (p). (1.17)
∂t
Hence
∂β ∂β
γ̇(0) = (0, 0) + (0, 0) = 0.
∂s ∂t
Moreover,
β(s, 0) = β(0, t) = p
for all s and t. This implies
∂2β ∂2β
(0, 0) = (0, 0) = 0
∂s2 ∂t2
and hence
∂2β
γ̈(0) = 2 (0, 0). (1.18)
∂s∂t
Combining (1.17) and (1.18) we find
1 d ∂β
γ̈(0) = (s, 0)
2 ds s=0 ∂t
d
= dφs (φ−s (p))Y (φ−s (p))
ds s=0
d
= ((φs )∗ Y ) (p)).
ds s=0
Note that the right hand side is the derivative of a smooth curve in the tan-
gent space Tp M and hence is itself an element of Tp M . Likewise, combining
equations (1.16) and (1.18) we find
1 d ∂β
γ̈(0) = (0, t)
2 dt t=0 ∂s
d
= − dψ t (ψ −t (p))X(ψ −t (p))
dt t=0
d
= dψ −t (ψ t (p))X(ψ t (p))
dt t=0
d
= dψ t (p)−1 X(ψ t (p))
dt t=0
d ∗
= ψ t X (p)).
dt t=0
32 CHAPTER 1. FOUNDATIONS
Moreover, we have
1 ∂
γ̈(0) = dφs (φ−s (p))Y (φ−s (p))
2 ∂s s=0
∂ ∂
= φs ◦ ψ t ◦ φ−s (p)
∂s s=0 ∂t t=0
∂ ∂
= φs ◦ ψ t ◦ φ−s (p)
∂t t=0 ∂s s=0
∂
X(ψ t (p)) − dψ t (p)X(p)
=
∂t t=0
∂ ∂
= dX(p)Y (p) − ψ t ◦ φs (p)
∂t t=0 ∂s s=0
∂ ∂
= dX(p)Y (p) − ψ t ◦ φs (p)
∂s s=0 ∂t t=0
∂
= dX(p)Y (p) − Y (φs (p))
∂s s=0
= dX(p)Y (p) − dY (p)X(p).
ψ ∗ [X, Y ] = [ψ ∗ X, ψ ∗ Y ], (1.20)
This proves equation (1.20). Equation (1.21) is obvious and the proof
of (1.22) is left to the reader. (Hint: Prove this in local coordinates.)
[ξ, η] := ξη − ηξ.
o(n) := ξ ∈ gl(n, R) | ξ T + ξ = 0
Rm×m → Vect(Rm ) : ξ 7→ Xξ
(See Exercise 1.62.) Geometrically this means following first the backward
flow of Y for time ε, then the backward flow of X for time ε, then the
forward flow of Y for time ε, and finally the forward flow of X for time ε,
we will not, in general, get back to the original point p where we started but
approximately obtain an “error” ε2 [X, Y ](p). An example of this (which we
learned from Donaldson) is the mathematical formulation of parking a car.
Example 1.68 (Parking a car). The configuration space for driving a car
in the plane is the manifold M := C × S 1 , where S 1 ⊂ C denotes the unit
circle. Thus a point in M is a pair p = (z, λ) ∈ C × C with |λ| = 1. The
point z ∈ C represents the position of of the car and the unit vector λ ∈ S 1
represents the direction in which it is pointing. The left turn is represented
by a vector field X and the right turn by a vector field Y on M . These
vector field are given by
This vector field represents a sideways move of the car to the right. And a
sideways move by 2ε2 can be achieved by following a backward right turn
for time ε, then a backward left turn for time ε, then a forward right turn
for time ε, and finally a forward left turn for time ε.
This example can be reformulated by identifying C with R2 via z = x+iy
and representing a point in the unit circle by the angle θ ∈ R/2πZ via
λ = eiθ . In this formulation the manifold is M = R2 × R/2πZ, a point in M
is represented by a triple (x, y, θ) ∈ R3 , the vector fields X and Y are
are Lie groups. In fact every orthogonal matrix has determinant ±1 and so
SO(n) is an open subset of O(n) (in the relative topology).
In a similar vein the group
G × G → G : (g, h) 7→ gh
and
G → G : g 7→ g −1
Here v and h are both matrices in Rn×n and vh denotes the matrix product.
In fact, if v ∈ Tg G then, since G is a manifold, there is a smooth curve
γ : R → G with γ(0) = g and γ̇(0) = v. Since G is a group we obtain a
smooth curve β : R → G given by β(t) := γ(t)h. It satisfies β(0) = gh and
so vh = β̇(0) ∈ Tgh G.
The linear map (1.25) is obviously a vector space isomorphism with
inverse Tgh G → Tg G : w 7→ wh−1 . It is sometimes convenient to define the
map Rh : G → G by Rh (g) := gh (right multiplication by h). This is a
diffeomorphism and the linear map (1.25) is just the derivative of Rh at g
so that
dRh (g)v = vh
Tg G → Tg−1 G : v 7→ −g −1 vg −1 .
g = Lie(G) := T1l G.
This terminology is justified by the fact that g is in fact a Lie algebra, i.e. it
is invariant under the standard Lie bracket operation [ξ, η] := ξη − ηξ on the
space Rn×n of square matrices (see Lemma 1.76 below). The proof requires
the notion of the exponential matrix. For ξ ∈ Rn×n and t ∈ R we define
∞ k k
X t ξ
exp(tξ) := . (1.27)
k!
k=0
It was proved in Analysis II (see [12]) that this series converges absolutely
(and uniformly on compact t-intervals), that the map t 7→ exp(tξ) is smooth
and satisfies the differential equation
d
exp(tξ) = ξ exp(tξ) = exp(tξ)ξ, (1.28)
dt
and that
exp((s + t)ξ) = exp(sξ) exp(tξ), exp(0ξ) = 1l (1.29)
for all s, t ∈ R.
Xξ (g) := ξg ∈ Tg G, g ∈ G. (1.30)
By (1.28), the curve (−ε, ε) → Rn×n : t 7→ exp(tξ) satisfies the same initial
value problem and hence, by uniqueness, we have exp(tξ) = γ(t) ∈ G for
|t| < ε. Now let t ∈ R and choose N ∈ N such that Nt < ε. Then
exp( Nt ξ) ∈ G and hence it follows from (1.29) that
N
t
exp(tξ) = exp ξ ∈ G.
N
γ(t) := g exp(tη)g −1 .
gηg −1 = γ̇(0) ∈ g.
Hence γ is the integral curve of the vector field Xξ in (1.30) with γ(0) = 1l.
This implies γ(t) = exp(tξ) for every t ∈ R, as claimed.
40 CHAPTER 1. FOUNDATIONS
Example 1.78. Since the general linear group GL(n, R) is an open subset
of Rn×n its Lie algebra is the space of all real n × n-matrices
(see Exercise 1.28) and the Lie algebra of the special orthogonal group is
ρ̇ := dρ(1l) : g → h.
Example 1.85. Let g be a finite dimensional Lie algebra. Then the set
Φ is a bijective linear map,
Aut(g) := Φ : g → g
Φ[ξ, η] = [Φξ, Φη] ∀ ξ, η ∈ g
Now suppose that g = Lie(G) is the Lie algebra of a Lie group G. Then
there is a map
for g ∈ G and η ∈ g. Lemma 1.76 (ii) asserts that ad(g) is indeed a linear
map from g to itself for every g ∈ G. The reader may verify that the map
ad(g) : g → g is a Lie algebra automorphism for every g ∈ G and that the
map ad : G → Aut(g) is a Lie group homomorphism. The associated Lie
algebra homomorphism is the map
GL(n, R) → Diff(Rn ) : g 7→ φg
extend formally the notion of a tangent vector (as the derivative of a path
through a given element of Diff(Rn )) to this setting. In particular, the tan-
gent space of Diff(Rn ) at the identity can then be identified with the space
of vector fields Tid Diff(Rn ) = Vect(Rn ). Differentiating the map g → φg one
then obtains a linear map
gl(n, R) → Vect(Rn ) : ξ 7→ Xξ
compact so that every vector field on M is complete. The next exercise gives
another parallel between the Lie bracket on the Lie algebra of a Lie group
and the Lie bracket of two vector fields.
Exercise 1.88. Let G ⊂ GL(n, R) be a Lie group with Lie algebra g and
let ξ, η ∈ g. Define the smooth curve γ : R → G by
Prove that γ̇(0) = 0 and 21 γ̈(0) = [ξ, η]. Compare this with Lemma 1.61.
Exercise 1.89. Let G ⊂ GL(n, R) be a Lie group with Lie algebra g and let
ξ, η ∈ g. Show that [ξ, η] = 0 if and only if exp(sξ) exp(tη) = exp(tη) exp(sξ)
for all s, t ∈ R. How can this result be deduced from Lemma 1.69?
F (M ) := C ∞ (M, R)
φ∗ f := f ◦ φ.
where φt denotes the flow of X. Since the map (1.36) is the derivative of
the “Lie group” anti-homomorphism (1.35) we expect it to be a Lie algebra
anti-homomorphism. Indeed, one can show that
for X, Y ∈ Vect(M ). This confirms that our sign in the definition of the
Lie bracket is consistent with the standard conventions in the theory of
Lie groups. In the literature the difference between a vector field and the
associated derivation LX is sometimes neglected in the notation and many
authors write Xf := df · X = LX f , thus thinking of a vector field on a
manifold M as an operator on the space of functions. With this notation one
obtains the equation [X, Y ]f = Y (Xf ) − X(Y f ) and here lies the origin for
the use of the opposite sign for the Lie bracket in many books on differential
geometry.
Exercise 1.90. Prove that the map (1.35) is bijective. Hint: An ideal in
F (M ) is a linear subspace J ⊂ F (M ) satisfying the condition
f ∈ F (M ), g ∈ J =⇒ fg ∈ J .
N q0
g f
M U p0
Proof. We prove that (i) implies (ii). Since df (q0 ) : Tq0 N → Tp0 M is surjec-
tive we have n ≥ m and dim ker df (q0 ) = n − m. Hence there is a linear map
A : R` → Rn−m whose restriction to the kernel of df (q0 ) is bijective. Now
define the map ψ : N → M × Rn−m by
ψ(q) := (f (q), A(q − q0 ))
for q ∈ N . Then ψ(q0 ) = (p0 , 0) and dψ(q0 ) : Tq0 N → Tp0 M × Rn−m sends
w ∈ Tq0 N to (df (q0 )w, Aw) and is therefore bijective. Hence it follows from
the inverse function theorem for manifolds (Theorem 1.34) that there is an
N -open neighborhood V ⊂ N of q0 such that W := ψ(N ) ⊂ M × Rn−m is
an open neighborhood of (p0 , 0) and ψ|V : V → W is a diffeomorphism. Let
U := {p ∈ M | (p, 0) ∈ W }
and define the map g : U → N by g(p) := ψ −1 (p, 0). Then p0 ∈ U , g
is smooth, and (p, 0) = ψ(g(p)) = (f (g(p)), A(g(p) − q0 )). This implies
f (g(p)) = p for all p ∈ U and g(p0 ) = ψ −1 (p0 , 0) = q0 . Thus we have proved
that (i) implies (ii). The converse is an easy consequence of the chain rule
and is left to the reader.
48 CHAPTER 1. FOUNDATIONS
π:E→M
π ◦ σ = id.
Π = Π2 = ΠT , im Π = V. (1.38)
Prove that there is a unique matrix Π ∈ R`×` satisfying (1.38). Prove that,
for every symmetric matrix S = S T ∈ R`×` , the kernel of S is the orthogonal
complement of the image of S. If D ∈ R`×n is any injective matrix whose
image is V , prove that det(DT D) 6= 0 and
Corollary 1.102. The pullback and normal bundles are vector bundles.
These maps are smooth and hence it follows again from Theorem 1.100 that
f ∗ E and E ⊥ are vector bundles.
1.7. VECTOR BUNDLES AND SUBMERSIONS 51
To prove that (iii) also implies (i) we fix a point p0 ∈ M and choose
D : U → R`×n as above. Shrinking U if necessary, we may assume that
there is a coordinate chart φ : U → Ω with values in an open set Ω ⊂ Rm .
Consider the open subset
π −1 (U ) = {(p, v) | p ∈ U, v ∈ Ep }
This map is a diffeomorphism with Φ−1 (x, ξ) = φ−1 (x), D(φ−1 (x))ξ for
Remark 1.103. Let D : U → R`×n be as in the above proof. Then the map
Φ : π −1 (U ) → φ(U ) × Rn in the above proof is called a local trivialization
of the vector bundle E. It fits into a commutative diagram
π −1 (U )
Φ / Ω × Rn .
π pr1
φ
U /Ω
U φ Ω
Lemma 1.107 implies Theorem 1.106. We prove that (iii) implies (ii). Let
p0 ∈ M , choose a foliation box φ : U → Ω for E with p0 ∈ U , and define
N := (p ∈ U | φ(p) ∈ Rn × {y0 }}
where (x0 , y0 ) := φ(p0 ) ∈ Ω. Then N satisfies the requirements of (ii).
We prove that (ii) implies (i). Choose two vector fields X, Y ∈ Vect(M )
that satisfy X(p), Y (p) ∈ Ep for all p ∈ M and fix a point p0 ∈ M . By (ii)
there is a submanifold N ⊂ M containing p0 such that Tp N = Ep for every
p ∈ N . Hence the restrictions X|N and Y |N are vector fields on N and so is
the restriction of the Lie bracket [X, Y ] to N . Hence [X, Y ](p0 ) ∈ Tp0 N =
Ep0 as claimed.
We prove that (i) implies (iii). Thus we assume that E is an involutive
subbundle of T M and fix a point p0 ∈ M . By Theorem 1.100 there exist
vector fields X1 , . . . , Xn ∈ Vect(M ) such that Xi (p) ∈ Ep for all i and p and
the vectors X1 (p0 ), . . . , Xn (p0 ) form a basis of Tp0 E. Using Theorem 1.100
again we find vector fields Y1 , . . . , Ym−n ∈ Vect(M ) such that the vectors
X1 (p0 ), . . . , Xn (p0 ), Y1 (p0 ), . . . , Ym−n (p0 )
form a basis of Tp0 M . Using cutoff functions as in the proof of Theorem 1.100
we may assume without loss of generality that the vector fields Xi and Yj
have compact support and hence are complete (see Exercise 1.58). Denote
by φt1 , . . . , φtn the flows of the vector fields X1 , . . . , Xn , respectively, and by
ψ1t , . . . , ψm−n
t the flows of the vector fields Y1 , . . . , Ym−n . Define the map
ψ : R × Rm−n → M by
n
y
ψ(x, y) := φx1 1 ◦ · · · ◦ φxnn ◦ ψ1y1 ◦ · · · ◦ ψm−n
m−n
(p0 ).
By Lemma 1.107, this map satisfies
∂ψ
(x, y) ∈ Eψ(x,y) (1.44)
∂xi
for all x ∈ Rn and y ∈ Rm−n . Moreover,
∂ψ ∂ψ
(0, 0) = Xi (p0 ), (0, 0) = Yj (p0 ).
∂xi ∂yj
Hence the derivative dψ(0, 0) : Rn × Rm−n → Tp0 M is bijective. By the
inverse function theorem 1.34 it follows that there is an open neighborhood
Ω ⊂ Rn ×Rm−n of the origin such that the set U := ψ(Ω) ⊂ M is an M -open
neighborhood of p0 and ψ|Ω : Ω → U is a diffeomorphism. Thus the vectors
∂ψ/∂xi (x, y) are linearly independent for every (x, y) ∈ Ω and, by (1.44),
form a basis of Eψ(x,y) . Hence the inverse map φ := (ψ|Ω )−1 : U → Ω is a
foliation box. This proves the theorem.
1.8. THE THEOREM OF FROBENIUS 55
Define β : R2 → M by
β(s, t) := φt (γ(s))
for s, t ∈ R. Then
∂β
(s, 0) = γ̇(s) = Y (γ(s)) ∈ Eβ(s,0)
∂s
and
∂β
(s, t) = X(β(s, t)) ∈ Eβ(s,t)
∂t
for all s, t ∈ R. Hence it follows from Lemma 1.108 that
∂β t ∂β
dφt (p0 )v0 = φ (γ(0))γ̇(0) = (0, t) ∈ Eφt (p0 )
∂s ∂s
for every t ∈ R. This proves the lemma.
E|U := {(p, v) | p ∈ U, v ∈ Ep }
e is involutive the Lie bracket [ζ, ζ 0 ] must take values in the graph
Since E
of A. Hence the right hand side vanishes and this proves Claim 1.
1.8. THE THEOREM OF FROBENIUS 57
φβα := φβ ◦ φ−1
α : φα (Uα ∩ Uβ ) → φβ (Uα ∩ Uβ ) (1.52)
Uα Uβ
φα φβ
φβα
Exercise 1.118. Prove that for every atlas A there is a unique maximal
atlas Amax containing A and that Amax induces the same topology as A .
Remark 1.119. One can consider manifolds where the transition maps
φβα in (1.52) satisfy additional conditions. For example we can require
the transition maps to be real analytic or even polynomials, leading to the
subject of real algebraic geometry. Or we can consider the case where
m = 2n is even, identify R2n with Cn , and require the transition maps
to be holomorphic, leading to the subject of complex geometry. Still
in the case m = 2n we could require the transition maps to be canonical
transformations in the sense of classical mechanics and this is the subject
of symplectic geometry. One can also weaken the requirements on the
transition maps and only ask that they are of class C k . In the case k = 0
there can be dramatic differences. For example, there are 4-dimensional
C 0 -manifolds that do not admit any smooth structure.
Example 1.120. The complex projective space CPn is the set
and the associated line is ` = Cz. This space is equipped with the quotient
topology. Namely, if π : Cn+1 \ {0} → CPn denotes the obvious projection,
a subset U ⊂ CPn is open by definition if and only if π −1 (U ) is an open
subset of Cn+1 \ {0}. The atlas on CPn is given by the open cover
Ui := {[z0 : · · · : zn ] | zi 6= 0}
of real lines in Rn+1 . It can again be identified with the quotient space
Ui := {[x0 : · · · : xn ] | xi 6= 0}
Tn := Rn /Zn
equipped with the quotient topology. Thus two vectors x, y ∈ Rn are equiv-
alent if their difference x − y ∈ Zn is an integer vector and we denote by
π : Rn → Tn the obvious projection which assigns to each vector x ∈ Rn its
equivalence class
π(x) := [x] := x + Zn .
Then a set U ⊂ Tn is open if and only if the set π −1 (U ) is an open subset
of Rn . An atlas on Tn is given by the open cover
denotes the set of unitary k-frames in Cn and the group U(k) acts on Fk (Cn )
contravariantly by D 7→ Dg for g ∈ U(k). The projection
π : Fk (Cn ) → Gk (Cn )
Example 1.124 (The real line with two zeros). A topological space
M is called Hausdorff if any two points in M can be separated by disjoint
open neighborhoods. This example shows that a manifold need not be a
Hausdorff space. Consider the quotient space
M := R × {0, 1}/ ≡
either y1 = y2 6= 0, t1 + x1 y1 = t2 + x2 y2
[t1 , x1 , y2 ] ≡ [t2 , x2 , y2 ] ⇐⇒
or y1 = y2 = 0, t1 = t2 , x1 = x2 .
64 CHAPTER 1. FOUNDATIONS
For y 6= 0 we have [0, x, y] ≡ [t, x − t/y, y], however, each point (x, 0) on the
x-axis gets replaced by the uncountable set R×{(x, 0)}. Our manifold is the
quotient space M := X/ ≡. This time we do not use the quotient topology
but the topology induced by our atlas via (1.53). The coordinate charts are
parametrized by the reals: for t ∈ R the set Ut ⊂ M and the coordinate
chart φt : Ut → R2 are given by
1.9.2 Paracompactness
The existence of a countable atlas is of fundamental importance for almost
everything we will prove about manifolds. The next two remarks describe
several equivalent conditions.
Exercise 1.129. Prove that every manifold is locally compact. Find an ex-
ample of a manifold M and a point p0 ∈ M such that every closed neighbor-
hood of p0 is non-compact. Hint: The example is necessarly non-Hausdorff.
Our next goal is to carry over all the definitions from embedded manifolds
in Euclidean space to the intrinsic setting.
The reader may check that the notion of a smooth map is independent
of the atlas used in the definition, that compositions of smooth maps are
smooth, and that sums and products of smooth maps from M to R are
smooth.
1.9. THE INTRINSIC DEFINITION OF A MANIFOLD 67
−1
where (α, x) ∼ (β, y) iff φ−1
α (x) = φβ (y). Define an atlas on M by
f
eα := {[α, x] | x ∈ φα (Uα )} ,
U φeα ([α, x]) := x.
Prove that M
f is a smooth m-manifold and that it is diffeomorphic to M .
1.9.4 Submanifolds
Definition 1.137. Let M be an m-manifold and n ∈ {0, 1, . . . , m}. A subset
N ⊂ M is called an n-dimensional submanifold of M if, for every p ∈ N ,
there is a local coordinate chart φ : U → Ω for M , defined on an an open
neighborhood U ⊂ M of p and with values in an open set Ω ⊂ Rn × Rm−n ,
such that φ(U ∩ N ) = Ω ∩ (Rn × {0}) .
By Theorem 1.10 an m-manifold M ⊂ Rk in the sense of Definition 1.3
is a submanifold of Rk in the sense of Definition 1.137. By Theorem 1.40 the
notion of a submanifold N ⊂ M of a manifold M ⊂ Rk in Definition 1.37
agrees with the notion of a submanifold in Definition 1.137.
68 CHAPTER 1. FOUNDATIONS
for a smooth curve γ : R → M with γ(0) = p. Hence the set (1.56) is a vector
space as well. The reader may check that the map (1.58) is well defined and is
indeed a bijection between the quotient spaces (1.56) and (1.57). Moreover,
for each smooth curve γ : R → M with γ(0) = p we can now define the
derivative γ̇(0) ∈ Tp M simply as the equivalence class
∼ d
γ̇(0) := [γ]p = α, φα (γ(t)) ∈ Tp M.
dt t=0 p
is given by
φeβα (x, ξ) = (φβα (x), dφβα (x)ξ)
for x ∈ φα (Uα ∩ Uβ ) and ξ ∈ Rm where φβα := φβ ◦ φ−1
α . Thus the transition
maps are all diffeomorphisms and so the coordinate charts φeα define an
atlas on T M . The topology on T M is determined by this atlas via (1.53).
If M has a countable atlas so does T M . The remaining assertions are easy
exercises.
x1 , . . . , xm : Uα → R.
∂ ∂
1
(p), . . . , m (p) ∈ Tp M (1.67)
∂x ∂x
so that
∂ 1, if i = j,
dxi (p) (p) = δji :=
∂xj 0, if i 6= j.
Thus ∂/∂xi is a vector field on the coordinate patch Uα . For each p ∈ Uα it
is the canonical basis of Tp M determined by φα . In the notation of (1.57)
and Remark 1.143 we have
∂
(p) = [α, ei ]p = dφα (p)−1 ei
∂xi
where ei = (0, . . . , 0, 1, 0, . . . , 0) (with 1 in the ith place) denotes the stan-
dard basis vector of Rm . In other words, for every vector
ξ = (ξ 1 , . . . , ξ m ) ∈ Rm
where ξ1, . . . , ξm
: Uα → R are smooth real valued functions. If the map
Xα : φα (Uα ) → Rm is defined by (1.61) then Xα ◦ φ−1 1 m
α = (ξ , . . . , ξ ).
The above notation is motivated by the observation that the derivative of
a smooth function f : M → R in the direction of a vector field X on a
coordinate patch Uα is given by
m
X ∂f
LX f |Uα = ξi .
∂xi
i=1
If {Uα }α∈A is an open cover of M then a partition of unity {θα }α∈A (in-
dexed by the same set A) is called subordinate to the cover if each θα is
supported in Uα , i.e.
supp(θα ) := {p ∈ M | θα (p) 6= 0} ⊂ Uα .
76 CHAPTER 1. FOUNDATIONS
I1 := {i ∈ I | Vi ∩ U 6= ∅}
S
is a finite set. Hence the set U0 := U \ i∈I0 ∩I1 V i is an open neighborhood
S
of p0 and we have U0 ∩ Vi = ∅ for every i ∈ I0 . Hence p0 ∈ / i∈I0 Vi .
1.10. PARTITIONS OF UNITY 77
J → I : j 7→ ij
such that
W j ⊂ V ij ∀ j ∈ J.
Let {χi }i∈I be the partition of unity constructed in Step 3. By the axiom
of choice there is a map I → A : i 7→ αi such that Vi ⊂ Uαi for every i ∈ I.
For α ∈ A define θα : M → [0, 1] by
X
θα := χi .
αi =α
Here the sum runs over all indices i ∈ I with αi = α. This sum is locally
finite and hence is a smooth function on M . Moreover, each point in M has
an open neighborhood in which only finitely many of the θα do not vanish.
Hence the sum of the θα is a well defined function on M and
X X X X
θα = χi = χi ≡ 1.
α∈A α∈A αi =α i∈I
This shows that the functions θα form a partition of unity. To prove the
inclusion supp(θα ) ⊂ Uα we consider the open sets
Wi := {p ∈ M | χi (p) > 0}
and for i ∈ I we have θi (p0 ) = θi (p1 ), hence φi (p0 ) = φi (p1 ), and hence
p0 = p1 . Moreover, for every p ∈ M the derivative df (p) : Tp M → Rk is
injective, and f is proper because M is compact. Hence f is an embedding
as claimed.
The number ` in the proof of Theorem 1.152 can actually be chosen less
than or equal to m + 1. However, this is a deep fact in algebraic topology
and we shall not address this question here. Assuming this, the proof of
Theorem 1.152 shows that every compact m-manifold M can be embedded
in Rk with k = (m + 1)2 . Using Sard’s theorem one can in fact reduce the
dimension of the ambient space to k = 2m + 1 and a further trick, due to
Whitney, shows that a compact m-manifold can always be embedded into
R2m . Moreover, Theorem 1.152 can in fact be extended to noncompact
manifolds and one can show that a manifold admits an embedding into a
finite dimensional vector space if and only if it is Hausdorff and second
countable. However, we will not address this issue here.
Remark 1.153. The manifold RP2 cannot be embedded into R3 . The same
is true for the the Klein bottle K := R2 / ≡ where the equivalence relation
is given by [x, y] ≡ [x + k, ` − y] for x, y ∈ R and k, ` ∈ Z.
80 CHAPTER 1. FOUNDATIONS
Standing assumption
We have seen that all the results in the first chapter carry over to the in-
trinsic setting, assuming that the topology of M is Hausdorff and paracom-
pact. In fact, in most cases it is enough to assume the Hausdorff property.
However, these results mainly deal with introducing the basic concepts like
smooth maps, embeddings, submersions, vector fields, flows, and verifying
their elementary properties, i.e. with setting up the language for differen-
tial geometry and topology. When it comes to the substance of the subject
we shall deal with Riemannian metrics and they only exist on paracompact
Hausdorff manifolds. Another central ingredient in differential topology is
the theorem of Sard and that requires second countability. To quote Moe
Hirsch [6]: “Manifolds that are not paracompact are amusing, but they never
occur naturally and it is difficult to prove anything about them.” Thus we
will set the following convention for the remaining chapters.
We assume from now on that each intrinsic manifold M
is Hausdorff and second countable and hence is also paracompact.
For most of this text we will in fact continue to develop the theory for
submanifolds of Euclidean space and indicate, wherever necessary, how to
extend the definitions, theorems, and proofs to the intrinsic setting.
Chapter 2
Geodesics
For γ(t) := p+t(q−p) we have equality and hence the straight lines minimize
the length among all curves from p to q.
L(γ ◦ α) = L(γ).
81
82 CHAPTER 2. GEODESICS
Here second equation follows from the chain rule and the fact that α̇(t) ≥ 0
for all t and the third equation follows from change of variables formula for
the Riemann integral. This proves the lemma.
Remark 2.3. Choosing α in Remark 2.2 such that α(t) = 0 for t sufficiently
close to 0 and α(t) = 1 for t sufficiently close to 1, we can reparametrize γ
to obtain a curve that is constant near t = 0 and near t = 1. And the
reparametrized curve has the same endpoints, the same image, and the
same length as the original curve.
p
q
M p q
Since M is connected the set Ωp,q is nonempty for all p, q ∈ M . (Prove this!)
We define the intrinsic distance function d : M × M → [0, ∞) by
d(p, q) ≥ |p − q|
for all p, q ∈ Rn and this proves (i). Assertion (ii) follows from the fact that
the curve γ e(t) := γ(1 − t) has the same length as γ and belongs to Ωq,p
whenever γ ∈ Ωp,q . To prove (iii) fix a constant ε > 0 and choose γ0 ∈ Ωp,q
and γ1 ∈ Ωq,r such that
γ0 (1 − t) = γ1 (t) = q
is smooth and has endpoints γ(0) = p and γ(1) = r. Thus γ ∈ Ωp,r and
Hence d(p, r) < d(p, q) + d(q, r) + 2ε for every ε > 0. This proves (iii) and
the lemma.
Example 2.5. Let M = S 2 be the unit sphere in R3 and fix two points
p, q ∈ S 2 . Then d(p, q) is the length of the shortest curve on the 2-sphere
connecting p and q. Such a curve is a segment on a great circle through p
and q (see Figure 2.2) and its length is
0 ≤ d(p, q) ≤ π.
(See Example 2.23 below for details.) By Lemma 2.4 this defines a metric
on S 2 . Exercise: Prove directly that (2.4) is a distance function on S 2 .
84 CHAPTER 2. GEODESICS
q
p
A natural question is now if these two metrics d and d0 induce the same
topology on M . In other words is a subset U ⊂ M open with respect to d0
if and only if it is open with respect to d? Or, equivalently, does a sequence
pν ∈ M converge to p0 ∈ M with respect to d if and only if it converges to
p0 with respect to d0 ? Lemma 2.7 answers this question in the affirmative.
Exercise 2.6. Prove that every translation of Rn and every orthogonal
transformation preserves the lengths of curves.
Lemma 2.7. For every p0 ∈ M we have
d(p, q)
lim = 1.
p,q→p0 |p − q|
In other words, for every p0 ∈ M and every ε > 0 there is an open neigh-
borhood U ⊂ M of p0 such that, for all p, q ∈ U , we have
(1 − ε) |p − q| ≤ d(p, q) ≤ (1 + ε) |p − q| .
|p − q| ≤ d(p, q)
for all p, q ∈ M (see Remark 2.1). Now fix a point p0 ∈ M . By Exercise 2.6
we may assume without loss of generality that
p0 = 0, T0 M = Rm × {0}.
2.1. THE LENGTH OF A CURVE 85
and define
U := {(x, f (x)) | x ∈ Rm , |x| < δ} .
This is an open subset M . We prove that δ and U satisfy the assertion of
the lemma. Thus let p = (x0 , f (x0 )) and q = (x1 , f (x1 )) be two points in U
and consider the curve γ : [0, 1] → U defined by
This implies
d(p, q) ≤ L(γ) ≤ (1 + ε) |p − q|
and the lemma is proved.
A next question one might ask is: Can we choose a coordinate chart
φ:U →Ω
ψ := φ−1 : Ω → U.
Given a smooth curve [0, 1] → Ω : t 7→ c(t) = (c1 (t), . . . , cm (t)) we can write
the length of the composition γ = ψ ◦ c : [0, 1] → M in the form
Z 1
d
L(ψ ◦ c) = dt ψ(c(t)) dt
0
v
Z 1u n 2
uX d
= t ψ ν (c(t)) dt
0 dt
ν=1
v !2
u
Z 1 uX n m
X ∂ψ ν
= t (c(t))ċi (t) dt
0 ∂xi
ν=1 i=1
v
u n X m
Z 1 uX
∂ψ ν ∂ψ ν
= t (c(t)) (c(t))ċi (t)ċj (t) dt
0 ∂xi ∂xj
ν=1 i,j=1
v
u m
Z 1uX
= t ċi (t)gij (c(t))ċj (t) dt,
0 i,j=1
such that Z 1q
L(ψ ◦ c) = ċ(t)T g(c(t))ċ(t) dt (2.6)
0
for every smooth curve c : [0, 1] → Ω. Thus the condition L(ψ ◦ c) = L(c)
for every such curve is equivalent to
This means that ψ preserves angles and areas. The next example shows that
for M = S 2 it is impossible to find such coordinates.
2.2. GEODESICS 87
α + β + γ = π + A.
α
β Aγ
2.2 Geodesics
Let M ⊂ Rn be an m-dimensional connected submanifold. We will address
the following question. Given two points p, q ∈ M is there is a smooth curve
γ : [0, 1] → M with endpoints γ(0) = p and γ(1) = q satisfying
L(γ) = d(p, q)
The advantage is that the function E is smooth in the appropriate sense (to
be made more precise below) while the length fnctional L is only smooth
at those points γ ∈ Ωp,q that satisfy γ̇(t) 6= 0 for all t. Our goal is to
examine the critical points of the functionals L and E. This requires some
preparation.
88 CHAPTER 2. GEODESICS
and
Π(p)v = v ⇐⇒ v ∈ Tp M (2.9)
for p ∈ M and v ∈ Rn (see Exercise 1.99). We have seen in Theorem 1.100
and Corollary 1.101 that the map Π : M → Rn×n is smooth.
Example 2.9. Consider the case n = m + 1. Thus M ⊂ Rm+1 is a sub-
manifold of codimension 1. By Corollary 1.102 the normal bundle T M ⊥
is a vector bundle of rank 1 over M and hence each fiber Ep = Tp M ⊥ is
spanned by a single unit vector ν(p) ∈ Rm . Note that the vector ν(p) ∈ S m
is determined by the tangent space Tp M up to a sign. By Theorem 1.100
each point p0 ∈ M has an open neighborhood U ⊂ M on which there is a
smooth function ν : U → Rm+1 satisfying
for p ∈ U (see Figure 2.4). Such a map ν is called a Gauss map. The
function Π : M → Rn×n is in this case given by
ν( p)
TpM
1 − x2
−xy −xz
T
Π(p) = 1l − pp = −yx 1 − y 2 −yz
−zx −zy 1 − z 2
for ε > 0 sufficiently small. Show that there does not exist a global smooth
function ν : M → R3 satisfying (2.10).
X(t)
γ(t)
M
This is a smooth vector field along γ. Thus the covariant derivative defines
a linear operator ∇ : Vect(γ) → Vect(γ). A vector field X ∈ Vect(γ) is
called parallel if it belongs to the kernel of this operator so that ∇X ≡ 0.
Remark 2.14. For every vector field X ∈ Vect(γ) and every t ∈ I we have
R → Ωp,q : s 7→ γs
Thus we obtain a smooth vector field along γ. Since γs (0) = p and γs (1) = q
for all s, this vector field must vanish at t = 0, 1. This suggests the formula
That every tangent vector of the path space Ωp,q at γ is a vector field along γ
vanishing at the endpoints follows from the above discussion. The converse
inclusion is the content of the next exercise.
Exercise 2.16. Verify equation (2.14) for γ ∈ Ωp,q . Namely, for every
smooth vector field X ∈ Vect(γ) along γ satisfying
X(0) = 0, X(1) = 0,
dE(γ) : Tγ Ωp,q → R.
|γ̇(t)| ≡ c 6= 0
X ∈ Tγ Ωp,q
That (iii) implies (i) follows immediately from this identity. Conversely,
suppose that γ is a critical point of E and that there is a point t0 ∈ [0, 1]
such that ∇γ̇(t0 ) 6= 0. Then, by Remark 2.14, γ̈(t0 ) is not orthogonal to
Tγ(t0 ) M . We assume without loss of generality that 0 < t0 < 1. and choose
a vector v0 ∈ Tγ(t0 ) M such that hγ̈(t0 ), v0 i > 0. Hence there is a constant
ε > 0 such that 0 < t0 − ε < t0 + ε < 1 and
Choose a smooth cutoff function β : [0, 1] → [0, 1] such that β(t) = 0 for
|t − t0 | ≥ ε and β(t0 ) = 1. Define X ∈ Tγ Ωp,q by
X(t) := β(t)Π(γ(t))v0 .
Then hγ̈(t), X(t)i ≥ 0 for all t and hγ̈(t0 ), X(t0 )i > 0. Hence
Z 1
dE(γ)X = − hγ̈(t), X(t)i dt < 0
0
and this contradicts (i). Thus we have proved that (i) is equivalent to (iii).
94 CHAPTER 2. GEODESICS
We prove that (i) is equivalent to (ii). Assume first that γ satisfies (i).
Then γ also satisfies (iii) and hence γ̈(t) ⊥ Tγ(t) M for all t ∈ [0, 1]. This
implies
1d
0 = hγ̈(t), γ̇(t)i = |γ̇(t)|2 .
2 dt
Hence the function t 7→ |γ̇(t)|2 is constant. Choose c ≥ 0 such that |γ̇(t)| ≡ c.
If c = 0 then γ(t) is constant and thus γ(t) ≡ p = q. If c > 0 we have
d
dL(γ)X = L(γs )
ds s=0
Z 1
d
= |γ̇s (t)| dt
ds s=0 0
Z 1
∂
= |γ̇s (t)| dt
0 ∂s s=0
∂
Z 1
γ̇(t), ∂s s=0 s
γ̇ (t)
= dt
0 |γ̇(t)|
1 1D
Z E
= γ̇(t), Ẋ(t) dt
c 0
1
= dE(γ)X.
c
Thus, in the case c > 0, γ is a critical point of E if and only if it is a critical
point of L. Hence (i) is equivalent to (ii) and this proves the theorem.
Theorem 2.17 still does not answer the question of the existence of a
curve that minimizes the length among all curves with the same endpoints.
However it characterizes the critical points of the energy and length function-
als and thus gives us a necessary condition that energy-minimizing curves
must satisfy.
2.3. CHRISTOFFEL SYMBOLS 95
c = (c1 , . . . , cm ) : I → Ω
γ =ψ◦c:I →M
(see Figure 2.6). Our goal is to understand the equation ∇γ̇ = 0 or, more
generally, to describe the operator X 7→ ∇X on the space of vector fields
along γ in local coordinates.
X(t)
γ(t)
U
M ψ
φ
Ω
ξ(t)
c(t)
ξ = (ξ 1 , . . . , ξ m ) : I → Rm
such that
m
X ∂ψ
X(t) = dψ(c(t))ξ(t) = ξ i (t) (c(t)). (2.18)
∂xi
i=1
96 CHAPTER 2. GEODESICS
for x ∈ Ω and i, j ∈ {1, . . . , m}. The coefficients Γkij are called the Christof-
fel symbols. To sum up we have proved the following.
Lemma 2.20. Let c : I → Ω be a smooth curve and denote γ := ψ ◦ c : I →
M . If ξ : I → Rm is a smooth map and X ∈ Vect(γ) is given by (2.18) then
its covariant derivative at time t ∈ I is given by
m
X ∂ψ
∇X(t) = ξ˙k (t) + Γkij (c(t))ξ i (t)ċj (t) (c(t)), (2.21)
∂xk
k=1
for k = 1, . . . , m.
Proof. This follows immediately from the definition of geodesics and equa-
tion (2.21) in Lemma 2.20 with X = γ̇ and ξ = ċ.
2.3. CHRISTOFFEL SYMBOLS 97
for p, q ∈ S m .
98 CHAPTER 2. GEODESICS
Our next goal is to understand how the Christoffel symbols are deter-
mined by the metric in local coordinates. Recall from equation (2.5) that the
inner products on the tangent spaces inherited from the standard Euclidean
inner product on the ambient space Rn are in local coordinates represented
by the matrix function
g = (gij )m
i,j=1 : Ω → R
m×m
given by
∂ψ ∂ψ
gij := , . (2.25)
∂xi ∂xj Rn
We shall see that the Christoffel symbols are completely determined by the
functions gij : Ω → R. Here are first some elementary observations.
Remark 2.24. The matrix g(x) ∈ Rm×m is symmetric and positive definite
for every x ∈ Ω. This follows from the fact that the matrix dψ(x) ∈ Rn×m
has rank m and
hξ, g(x)ηiRm = hdψ(x)ξ, dψ(x)ηiRn
for all x ∈ Ω and ξ, η ∈ Rm .
Since g(x) is symmetric and positive definite, so is its inverse matrix g(x)−1 .
In particular we have g k` (x) = g `k (x) for all x ∈ Ω and k, ` ∈ {1, . . . , m}.
Remark 2.26. Suppose X, Y ∈ Vect(γ) are vector fields along our curve
γ = ψ ◦ c : I → M and ξ, η : I → Rm are defined by
m m
X ∂ψ X ∂ψ
X(t) = ξ i (t) (c(t)), Y (t) = η j (t) (c(t)).
∂xi ∂xj
i=1 j=1
gij : Ω → R, i, j = 1, . . . , m,
be smooth functions such that each matrix (gij (x))m i,j=1 is symmetric and
positive definite. Let Γkij : Ω → R be smooth functions for i, j, k = 1, . . . , m.
Then the Γkij satisfy the conditions
m
∂gij X
Γkij = Γkji , = gi` Γ`
jk + gj` Γ`
ik (2.26)
∂xk
`=1
If the Γkij are defined by (2.20) and the gij by (2.25), then the Γkij sat-
isfy (2.26) and hence are given by (2.27).
Proof. Suppose that the Γkij are given by (2.20) and the gij by (2.25). Let
c : I → Ω and ξ, η : I → Rm be smooth functions and suppose that the
vector fields X, Y along the curve
γ := ψ ◦ c : I → M
are given by
m m
X ∂ψ X ∂ψ
X(t) := ξ i (t) (c(t)), Y (t) := η j (t) (c(t)).
∂xi ∂xj
i=1 j=1
Here we have dropped the argument t in each term. Hence it follows from
equation (2.13) in Remark 2.15 that
d
0 = hX, Y i − hX, ∇Y i − h∇X, Y i
dt !
X X ∂gij
= gij ξ˙ η + gij ξ η̇ +
i j i j i j
ξ η ċk
∂xk
i,j k
X X
i ` ` i j k
− gi` ξ η̇ − gi` Γjk ξ η ċ
i,` i,j,k,`
X X
− g`j ξ˙` η j − g`j Γ`ik ξ i η j ċk
j,` i,j,k,`
!
X ∂gij X `
X
= − gi` Γjk − gj` Γ`ik ξ i η j ċk .
∂xk
i,j,k ` `
Taking the sum of the last two minus the first of these equations we obtain
∂gki ∂gkj ∂gij
+ − = 2Γkij .
∂xj ∂xi ∂xk
Thus we have proved that (2.29) is equivalent to (2.30) and hence (2.26) is
equivalent to (2.27). This proves the theorem.
and that the resulting map Π : M → Rn×n is smooth (see Theorem 1.100
and Corollary 1.101). Differentiating this map at a point p ∈ M we obtain
a linear map
dΠ(p) : Tp M → Rn×n
which, as usual, is defined by
d
dΠ(p)v := Π(γ(t)) ∈ Rn×n
dt t=0
dΠ(p)v w = dΠ(p)w v ∈ Tp M ⊥ .
X(t) = Π(γ(t))X(t)
Hence
hp : Tp M × Tp M → Tp M ⊥ ,
defined by
hp (v, w) := (dΠ(p)v)w = (dΠ(p)w)v (2.36)
for p ∈ M and v, w ∈ Tp M is called the second fundamental form on M .
The first fundamental form on M is the family of inner products on the
tangent spaces Tp M induced by the Euclidean inner product on Rn .
hp (v, w) = −ν(p)hdν(p)v, wi
for p ∈ M and v, w ∈ Tp M .
104 CHAPTER 2. GEODESICS
for v, w ∈ Tp M = Rm × {0}).
Exercise 2.35. Let M ⊂ Rn be an m-manifold. Fix a point p ∈ M and a
unit tangent vector v ∈ Tp M so that |v| = 1 and define
L := {p + tv + w | t ∈ R, w ⊥ Tp M } .
Let γ : (−ε, ε) → M ∩ L be a smooth curve such that γ(0) = p, γ̇(0) = v,
and |γ̇(t)| = 1 for all t. Prove that
γ̈(0) = hp (v, v).
Draw a picture of M and L in the case n = 3 and m = 2.
Equation (2.34) in the proof of Lemma 2.31 shows that the derivative of
a vector field X along a curve γ is given by Gauss–Weingarten formula
Ẋ(t) = ∇X(t) + hγ(t) (γ̇(t), X(t)). (2.37)
Here the first summand is tangent to M and the second summand is orthog-
onal to the tangent space of M at γ(t). Applying the Gauss–Weingarten
formula to the vector field X = γ̇ we obtain
γ̈ = ∇γ̇ + hγ (γ̇, γ̇). (2.38)
By definition γ is a geodesic if and only if ∇γ̇ = 0 and, by equation (2.38),
this means that γ satisfies the equation
γ̈ = hγ (γ̇, γ̇). (2.39)
2.4. THE GEODESIC FLOW 105
is smooth.
106 CHAPTER 2. GEODESICS
p
M
expp (v) = p + v
Example 2.42. Consider the orthogonal group O(n) ⊂ Rn×n with the
standard inner product
hv, wi := trace(v T w)
on Rn×n . The orthogonal projection Π(g) : Rn×n → Tg O(n) is given by
1
Π(g)v := (v − gv T g)
2
and the second fundamental form by hg (v, v) = −gv T v. Hence a curve
γ : R → O(n) is a geodesic if and only if γ T γ̈ + γ̇ T γ̇ = 0 or, equivalently,
γ T γ̇ is constant. So geodesics in O(n) have the form γ(t) = g exp(tξ) for
g ∈ O(n) and ξ ∈ o(n). It follows that the exponential map is given by
expg (v) = g exp(g −1 v) = exp(vg −1 )g
for g ∈ O(n) and v ∈ Tg O(n). In particular, for g = 1l the exponential map
exp1l : o(n) → O(n) agrees with the exponential matrix.
Exercise 2.43. What is the injectivity radius of the 2-torus T2 = S 1 × S 1 ,
the punctured 2-plane R2 \ {(0, 0)}, and the orthogonal group O(n)?
Ur
Proof. For every t ∈ I we have α(0, t) = expp (0) = p and so the assertion
holds for s = 0:
∂α ∂α
(0, t), (0, t) = 0.
∂s ∂t
Moreover, each curve s 7→ α(s, t) is a geodesic, i.e.
∂α ∂2α
∇s = Π(α) 2 ≡ 0.
∂s ∂s
∂α
By Theorem 2.17, the function s 7→ ∂s (s, t) is constant for every t, so that
∂α
(s, t) = ∂α (0, t) = |w(t)| = r
∂s ∂s
is open in the relative topology of (0, 1]. Thus I is a union of open intervals
in (0, 1) and one half open interval containing 1. Define β : [0, 1] → [0, 1]
and w : I → Tp M by
|v(t)| v(t)
β(t) := , w(t) := .
ε β(t)
Then β is continuous, both β and w are smooth on I, β(0) = 0, β(1) = 1,
w(1) = v, and
Thus I is a single half open interval containing 1 and on this interval the
condition ∂α∂t (β(t), t) = 0 implies ẇ(t) = 0. Since w(1) = v we have w(t) = v
for every t ∈ I. Hence γ(t) = expp (β(t)v) for every t ∈ [0, 1]. It follows
that β is smooth on the closed interval [0, 1] (and not just on I). Thus we
have proved that every γ ∈ Ωp,q with values in U ε has length L(γ) ≥ ε with
equality if and only if γ is a reparametrized geodesic. But if γ ∈ Ωp,q does not
take values only in U ε , there must be a T ∈ (0, 1) such that γ([0, T ]) ⊂ U ε
and γ(T ) ∈ ∂Uε . Then L(γ|[0,T ] ) ≥ ε, by what we have just proved, and
L(γ|[T,1] ) > 0 because the restriction of γ to [T, 1] cannot be constant; so in
this case we have L(γ) > ε. This proves the theorem.
The next corollary gives a first answer to our problem of finding length
minimizing curves. It asserts that geodesics minimize the length locally.
Corollary 2.46. Let M ⊂ Rn be a smooth m-manifold, I ⊂ R be an open
interval, and γ : I → M be a geodesic. Fix a point t0 ∈ I. Then there is a
constant ε > 0 such that
|γ̇(t)| ≡ c
(see Theorem 2.17). Choose δ > 0 so small that the interval [t0 − δ, t0 + δ]
is contained in I. Then there is a constant r > 0 such that r ≤ inj(γ(t))
whenever |t − t0 | ≤ δ. Choose ε > 0 such that
and
|(t − s)γ̇(s)| = |t − s| c < 2εc < r ≤ inj(γ(s)).
Hence it follows from Theorem 2.44 that
Exercise 2.47. How large can you choose the constant ε in Corollary 2.46
in the case M = S 2 ? Compare this with the injectivity radius.
112 CHAPTER 2. GEODESICS
2.4.6 Convexity
Definition 2.48. Let M ⊂ Rn be a smooth m-dimensional manifold. A sub-
set U ⊂ M is called geodesically convex if, for any two points p0 , p1 ∈ U ,
there is a unique geodesic γ : [0, 1] → U such that γ(0) = p0 and γ(1) = p1 .
Then Q0 (ξ) = |ξ|2 and so Qx continues to be positive definite for |x| suf-
ficiently small. In other words there is a constant ρ > 0 such that, for all
x, ξ ∈ Rm , we have
d2 |c|2 d
2
= hċ, ci = |ċ|2 + hc̈, ci = Qc (ċ) ≥ 0
dt 2 dt
Now choose δ > 0 and r0 > 0 so small that, for all p, q ∈ M , we have
p ∈ Ur0 , d(p, q) < δ =⇒ q ∈ Uρ , (2.42)
p ∈ Uρ =⇒ δ < inj(p), (2.43)
p, q ∈ Ur0 =⇒ d(p, q) < δ. (2.44)
We prove that Ur is geodesically convex for 0 < r ≤ r0 . Fix two points
p, q ∈ Ur . Then p, q ∈ Uρ by (2.42) and, by (2.43) and (2.44), we have
d(p, q) < δ < inj(p).
Hence, by Theorem 2.44, there is a vector v ∈ Tp M such that |v| = d(p, q)
and expp (v) = q. Define
γ(t) := expp (tv), 0 ≤ t ≤ 1.
Then γ is a geodesic with endpoints γ(0) = p and γ(1) = q. Moreover, by
Theorem 2.44, we have
d(p, γ(t)) = t |v| = td(p, q) < δ
for 0 ≤ t ≤ 1 and hence γ(t) ∈ Uρ by (2.42). This implies that the function
t 7→ |φ(γ(t))|2 is convex and therefore
|φ(γ(t))|2 ≤ (1 − t) |φ(γ(p))|2 + t |φ(γ(q))|2 ≤ r2
for 0 ≤ t ≤ 1. Hence γ([0, 1]) ⊂ Ur .
We prove that there is no other geodesic from p to q whose image is
contained in Ur . To see this, let γ 0 : [0, 1] → Ur be any geodesic connecting
p to q. Then there is a vector v 0 ∈ Tp M such that
expp (v 0 ) = q, γ 0 (t) = expp (tv 0 )
for 0 ≤ t ≤ 1. Since γ 0 (t) ∈ Ur ⊂ Ur0 it follows from (2.43) and (2.44) that
d(p, γ 0 (t)) < δ < inj(p)
for all t. If |v 0 | ≥ inj(p) then there is a t ∈ [0, 1] such that δ < t |v 0 | < inj(p)
and then, by Theorem 2.44, we have
d(p, γ 0 (t)) = d(p, expp (tv 0 )) = t v 0 > δ,
M = S2, M = T2 = S 1 × S 1 , M = R2 , M = R2 \ {0}.
Compare this with the injectivity radius. If the set Ur in these examples is
geodesically convex, does it follows that every geodesic in Ur is minimizing?
Exercise 2.56. Let (M, d) be a metric space. Prove that every compact
subset K ⊂ M is closed and bounded. Find an example of a metric space
that contains a closed and bounded subset that is not compact.
Theorem 2.57 (Completeness). Let M ⊂ Rn be a connected m-dimen-
sional manifold and let d : M × M → [0, ∞) be the distance function defined
by (2.1), (2.2), and (2.3). Then the following are equivalent.
(i) There is a point p ∈ M such that M is geodesically complete at p.
(ii) M is geodesically complete.
(iii) (M, d) is a complete metric space.
(iv) Every closed and bounded subset of M is compact.
Theorem 2.58 (Hopf–Rinow). Let M ⊂ Rn be a connected m-manifold
and let p ∈ M . Assume M is geodesically complete at p. Then, for every
q ∈ M , there is a geodesic γ : [0, 1] → M such that
γ(0) = p, γ(1) = q, L(γ) = d(p, q).
Lemma 2.59. Let X be a vector field on M and γ : (0, T ) → M be an
integral curve of X such that the limit
p0 = lim γ(t)
t→0
p ∈ M, |p − p0 | ≤ ρ =⇒ |X(p) − X(p0 )| ≤ ε,
0<t≤δ =⇒ |γ(t) − p0 | ≤ ρ.
Theorem 2.58 implies Theorem 2.57. We prove that (iv) implies (iii). Thus
we assume that every closed and bounded subset of M is compact and choose
a Cauchy sequence pi ∈ M . Choose i0 such that, for all i, j ∈ N, we have
i, j ≥ i0 =⇒ d(pi , pj ) ≤ 1,
and define
c := max d(p1 , pi ) + 1.
1≤i≤i0
Thus d(p1 , pi ) ≤ c for every i ∈ N. Hence the set {pi | i ∈ I} is bounded and
so is its closure. By (iv) this implies that the sequence pi has a convergent
subsequence. Since pi is a Cauchy sequence, this implies that pi converges.
Thus we have proved that (iv) implies (iii).
2.5. THE HOPF–RINOW THEOREM 117
|γ̇(t)| = |v|
for 0 < s < t < T . By (iii) this implies that the limit
p1 := lim γ(t)
t→T
v1 := lim γ̇(t)
t→T
Hence it follows from Theorem 2.58 that, for every q ∈ K there is a tangent
vector v ∈ Tp0 M such that |v| = d(p0 , q) ≤ r and expp0 (v) = q. Thus
This is a closed and bounded subset of the Euclidean space Tp0 M . Hence
Ke is compact and K is its image under the exponential map. Since the
exponential map expp0 : Tp0 M → M is continuous it follows that K is
compact. Thus (i) implies (iv) and this proves the theorem.
Sε (p) := p0 ∈ M | d(p, p0 ) = ε .
Σ1 (p) := {v ∈ Tp M | |v| = 1} ,
Proof. By Theorem 2.44 we have d(p, exp p (v)) = |v| for every v ∈ Tp M
with
0 0
|v| ≤ ε and d(p, p ) > ε for every p ∈ M \ expp (v) | v ∈ Tp M, |v| ≤ ε . This
shows that Sε (p) = expp (εΣ1 (p)) and, since ε is smaller than the injectivity
radius, the map Σ1 (p) → Sε (p) : v 7→ expp (εv) is a diffeomorphism. To
prove the second assertion let q ∈ M such that
r := d(p, q) > ε.
Fix a constant δ > 0 and choose a smooth curve γ : [0, 1] → M such that
γ(0) = p, γ(1) = q, and L(γ) ≤ r + δ. Choose t0 > 0 such that γ(t0 ) is
the last point of the curve on Sε (p), i.e. γ(t0 ) ∈ Sε (p) and γ(t) ∈
/ Sε (p) for
t0 < t ≤ 1. Then
d(Sε (p), q) ≤ r − ε.
Moreover,
d(p0 , q) ≥ d(p, q) − d(p, p0 ) = r − ε
for every p0 ∈ Sε (p). Hence d(Sε (p), q) = r − ε as claimed.
expp : Tp M → M
d(Sε (p), q) = r − ε.
d(γ(t), q) = r − t.
and
d(γ(s), q) ≥ d(p, q) − d(p, γ(s)) ≥ r − s.
Hence d(γ(s), q) = r − s and hence s ∈ I. This proves (2.45).
Sε (γ (t)) exp (ε w)
γ(t)
γ(t)
ε ε q
p r−t−ε
We prove that I is open (in the relative topology of [0, r]). Let t ∈ I
be given with t < r. Choose a constant ε > 0 smaller than the injectivity
radius of M at γ(t) and smaller than r − t. Then, by Lemma 2.60 with p
replaced by γ(t), we have
d(Sε (γ(t)), q) = r − t − ε.
Then
Since γ(t − ε) = expγ(t) (−εγ̇(t)) it follows from Lemma 2.61 below that
w = γ̇(t). Hence expγ(t) (sw) = γ(t + s) and this implies that
d(γ(t + ε), q) = r − t − ε.
2.5. THE HOPF–RINOW THEOREM 121
Then |v − w| < 2 (Figure 2.10). Thus by (2.46) there is a δ > 0 such that
It remains to prove (2.46). For this we use the ambient space Rn and
observe that
d(expp (δv), expp (δw))
lim
δ→0 δ
d(expp (δv), expp (δw)) expp (δv) − expp (δw)
= lim
δ→0 expp (δv) − expp (δw) δ
expp (δv) − expp (δw)
= lim
δ→0 δ
expp (δv) − p expp (δw) − p
= lim −
δ→0 δ δ
= |v − w| .
Here the second equality follows from Lemma 2.7.
gα = (gα,ij )m
i,j=1 : φα (Uα ) → R
m×m
,
∂
[α, ei ]p = dφα (p)−1 ei =: (p)
∂xi
and hence
∂ −1 ∂ −1
gα,ij (x) = (φ (x)), (φ (x)) .
∂xi α ∂xj α
For different coordinate charts the maps gα and gβ are related through the
transition map φβα := φβ ◦ φ−1
α : φα (Uα ∩ Uβ ) → φβ (Uα ∩ Uβ ) via
where the sum runs over all α ∈ A with p ∈ Uα and the inner product is
the standard inner product on Rm . Since supp(θα ) ⊂ Uα for each α and the
sum is locally finite we find that the function M → R : p 7→ hX(p), Y (p)ip is
smooth for every pair of vector fields X, Y ∈ Vect(M ). Moreover, the right
hand side of (2.49) is symmetric in v and w and is positive for v = w 6= 0
because each summand is nonnegative and each summand with θα (p) > 0 is
positive. Thus equation (2.49) defines a Riemannian metric on M .
The second method is to define the functions gα : φα (Uα ) → Rm×m by
X
gα (x) := θγ (φ−1 T
α (x))dφγα (x) dφγα (x) (2.50)
γ∈A
∂ψ ∂ψ
gij (x) := (x), j (x)
∂xi ∂x ψ(x)
Let x0 := φ(p0 ) ∈ Ω and choose r > 0 such that B r (x0 ) ⊂ Ω. Then there is
a constant δ ∈ (0, 1] such that
q
δ |ξ| ≤ ξ T g(x)ξ ≤ δ −1 |ξ| (2.53)
Now let p ∈ V and denote x := φ(p) ∈ Br (x0 ). Then, for every smooth
curve γ : [0, 1] → V with γ(0) = p0 and γ(1) = p, the curve c := φ ◦ γ takes
values in Br (x0 ) and satisfies c(0) = x0 and c(1) = x. Hence, by (2.52)
and (2.53), we have
1 1
Z Z
L(γ) ≥ δ |ċ(t)| dt ≥ δ ċ(t) dt = δ |x − x0 | .
0 0
L(γ) ≥ δr.
Refine the estimate (2.51) in the proof of Lemma 2.66 and show that
d(p, q)
lim = 1.
p,q→p0 |φ(p) − φ(q)|
This is the intrinsic analogue of Lemma 2.7. Use this to prove that equa-
tion (2.46) continues to hold for all Riemannian manifolds, i.e.
Define the inner product on this tangent space to be the restriction of the
standard inner product on Cn×k to this subspace. Exercise: Prove that
the unitary group U(n) acts on Gk (Cn ) by isometries.
gα = (gα,ij )m
i,j=1 : φα (Uα ) → R
m×m
d
hX, Y i = h∇X, Y i + hX, ∇Y i. (2.56)
dt
∇s ∂t γ = ∇t ∂s γ. (2.57)
Exercise 2.73. Prove that the covariant derivatives of vector fields along
curves in an embedded manifold M ⊂ Rn satisfy the axioms of Remark 2.72.
Prove the assertion of Remark 2.72. Hint: Use Theorem 2.27.
130 CHAPTER 2. GEODESICS
(see Theorem 2.27). Prove that they are related by the equation
X ∂φkβα
0
∂ 2 φkβα
0
X 0
∂φi0 ∂φj 0
βα βα
Γk = + Γkβ,i0 j 0 ◦ φβα .
∂xk α,ij ∂xi ∂xj ∂xi ∂xj
k i0 ,j 0
for all α, β ∈ A.
Exercise 2.75. Denote ψα := φ−1
α : φα (Uα ) → M . Prove that the covariant
derivative of a vector field
m
X ∂ψα
X(t) = ξαi (t) (cα (t))
∂xi
i=1
along γ = ψα ◦ cα : I → M is given by
m m
X
ξ˙αk (t) +
X ∂ψα
∇X(t) = Γkα,ij (c(t))ξαi (t)ċjα (t) (cα (t)). (2.58)
∂xk
k=1 i,j=1
2.6.3 Geodesics
With the covariant derivative understood, we can again define geodesics
on M as smooth curves γ : I → M that satisfy the equation ∇γ̇ = 0. Then
all the results we have proved about geodesics will translate word for word to
general manifolds. In particular, geodesics are critical points of the energy
functional on the space of paths with fixed endpoints, through each point
and in each direction there is a unique geodesic on some time interval, and
a manifold is complete as a metric space if and only if the geodesics through
some (and hence every) point in M exist in each direction for all time.
Exercise 2.76. The real projective space RPn inherits a Riemannian metric
from S n as it is a quotient of S n by an isometric involution. Prove that each
geodesic in S n with its standard metric descends to a geodesic in RPn .
2.6. RIEMANNIAN METRICS 131
R → Fk (Rn ) : t 7→ D(t)
DT Ḋ ≡ 0, Ḋ 6≡ 0.
Λ = im D, DT D
b = 0, Λ b : Rk → Λ⊥ = ker DT .
b ◦D =D
Prove that the group O(n) acts on Gk (Rn ) by isometries. Which subgroup
acts trivially?
Exercise 2.81. Carry over Exercises 2.79 and 2.80 to the complex Grass-
mannian Gk (Cn ). Prove that the group U(n) acts on Gk (Cn ) by isometries.
Which subgroup acts trivially?
Chapter 3
∇ : Vect(γ) → Vect(γ),
one for every smooth curve γ : I → M , and these operators are uniquely
characterized by the axioms of Remark 2.72 in Section 2.6. This family
of linear operators is the Levi-Civita connection. It can be extended in a
natural manner to vector fields along any smooth map with values in M
and, in particular, to vector fields along the identity map, i.e. to the space
of vector fields on M . However, in this chapter we will focus attention on the
covariant derivatives of vector fields along curves, show how they give rise
to parallel transport, examine the frame bundle, discuss motions without
“sliding, twisting, and wobbling”, and prove the development theorem.
133
134 CHAPTER 3. THE LEVI-CIVITA CONNECTION
˙ + A(t)ξ(t) = 0,
ξ(t) A(t) := .. ..
.
. .
am
1 (t) · · ·
m
am (t)
for the pullback tangent bundle. This set is a smooth submanifold of I ×Rn .
(See Theorem 1.100 and Corollary 1.102.) The next theorem summarizes
the properties of parallel transport. In particular, the last assertion shows
that the covariant derivative can be recovered from the parallel transport
maps.
Theorem 3.4 (Parallel Transport). Let γ : I → M be a smooth curve
on an interval I ⊂ R.
(i) The map Φγ (t, s) : Tγ(s) M → Tγ(t) M is linear for all s, t ∈ I.
(ii) For all r, s, t ∈ I we have
Φγ (t, s) ◦ Φγ (s, r) = Φγ (t, r), Φγ (t, t) = id.
Proof. Assertion (i) is obvious because the sum of two parallel vector fields
along γ is again parallel and the product of a parallel vector field with a
constant real number is again parallel. Assertion (ii) follows directly from
the uniqueness statement in Theorem 3.3.
We prove (iii). Fix a number s ∈ I and two tangent vectors
v.w ∈ Tγ(s) M.
d
hX, Y i = h∇X, Y i + hX, ∇Y i = 0.
dt
Hence the function I → R : t 7→ hX(t), Y (t)i is constant and this proves (iii).
We prove (iv). Fix an element s ∈ J and a tangent vector v ∈ Tγ(α(s)) M .
Define the vector field X along γ by
for t ∈ I. Thus X is the unique parallel vector field along γ that satisfies
X(α(s)) = v.
Denote
e := γ ◦ α : J → M,
γ e := X ◦ α : I → Rn
X
Then X
e is a vector field along γ
e and, by the chain rule, we have
d e d
X(t) = X(α(t)) = α̇(t)Ẋ(α(t)).
dt dt
Projecting orthogonally onto the tangent space Tγ(α(t)) M we obtain
∇X(t)
e = α̇(t)∇X(α(t)) = 0
for all i, j ∈ {1, . . . , m} and all t ∈ I. Hence the vectors X1 (t), . . . , Xm (t)
form an orthonormal basis of Tγ(t) M for every t ∈ I. This implies that, for
each s ∈ I and each tangent vector v ∈ Tγ(s) M , we have
m
X
v= hXi (s), viXi (s).
i=1
Since each vector field Xi is parallel it satisfies Xi (t) = Φγ (t, s)Xi (s). Hence
m
X
Φγ (t, s)v = hXi (s), viXi (t) (3.2)
i=1
Evidently, the derivative of the first sum with respect to t is equal to the
second sum. This proves (vi) and the theorem.
3.1. PARALLEL TRANSPORT 139
as a real n × n matrix. The formula (3.2) in the proof of (v) shows that this
matrix can be expressed in the form
m
X
Φγ (t, s)Π(γ(s)) = Xi (t)Xi (s)T ∈ Rn×n .
i=1
The right hand side defines a smooth matrix valued function on I × I and
this is equivalent to the assertion in (v).
Remark 3.6. It follows from assertions (ii) and (iii) in Theorem 3.4 that
for all s, t ∈ I. Here the linear map Φγ (t, s)∗ : Tγ(t) M → Tγ(s) M is under-
stood as the adjoint operator of Φγ (t, s) : Tγ(s) M → Tγ(t) M with respect to
the inner products on the two subspaces of Rn inherited from the Euclidean
inner product on the ambient space.
Exercise 3.7. Carry over the proofs of Theorems 3.3 and 3.4 to the intrinsic
setting.
The two theorems in this section carry over verbatim to any smooth
vector bundle E ⊂ M × Rn over a manifold. As in the case of the tangent
bundle one can define the covariant derivative of a section of E along γ as
the orthogonal projection of the ordinary derivative in the ambient space Rn
onto the fiber Eγ(t) . Instead of parallel vector fields one then speaks about
horizontal sections and one proves as in Theorem 3.3 that there is a unique
horizontal section along γ through any point in any of the fibers Eγ(t0 ) . This
gives parallel transport maps from Eγ(s) to Eγ(t) for any pair s, t ∈ I and
Theorem 3.4 carries over verbatim to all vector bundles E ⊂ M × Rn . We
spell this out in more detail in the case where E = T M ⊥ ⊂ M × Rn is the
normal bundle of M .
Let γ : I → M is a smooth curve in M . A normal vector field along
γ is a smooth map Y : I → Rn such that Y (t) ⊥ Tγ(t) M for every t ∈ M .
The set of normal vector fields along γ will be denoted by
the ordinary derivative onto the orthogonal complement of Tγ(t) M and will
be denoted by
∇⊥ Y (t) := 1l − Π(γ(t)) Ẏ (t).
(3.3)
Thus the covariant derivative defines a linear operator
∇⊥ : Vect⊥ (γ) → Vect⊥ (γ).
There is a version of the Gauss–Weingarten formula for the covariant deriva-
tive of a normal vector field. This is the content of the next lemma.
Lemma 3.8. Let M ⊂ Rn be a smooth m-manifold. For p ∈ M and
u ∈ Tp M define the linear map hp (u) : Tp M → Tp M ⊥ by
hp (u)v := hp (u, v) = dΠ(p)u v (3.4)
for v ∈ Tp M . Then the following holds.
(i) The adjoint operator hp (u)∗ : Tp M ⊥ → Tp M is given by
hp (u)∗ w = dΠ(p)u w, w ∈ Tp M ⊥ .
(3.5)
for t ∈ I. Here the last equation follows from (i) and the definition of ∇⊥ .
This proves the lemma.
3.2. THE FRAME BUNDLE 141
Theorem 3.3 and its proof carry over to the normal bundle T M ⊥ . Thus,
if γ : I → M is a smooth curve then, for all s ∈ I and w ∈ Tγ(s) M ⊥ ,
there is a unique normal vector field Y ∈ Vect⊥ (γ) such that ∇⊥ Y ≡ 0 and
Y (s) = w. This gives rise to parallel transport maps
Φ⊥ ⊥
γ (t, s) : Tγ(s) M → Tγ(t) M
⊥
defined by Φ⊥ ⊥
γ (t, s)w := Y (t) for s, t ∈ I and w ∈ Tγ(s) M , where Y is
the unique normal vector field along γ satisfying ∇⊥ Y ≡ 0 and Y (s) = w.
These parallel transport maps satisfy exactly the same conditions that have
been spelled out in Theorem 3.4 for the tangent bundle and the proof carries
over verbatim to the present setting.
a∗ b∗ e = (ba)∗ e, 1l∗ e = e
for a, b ∈ GL(m) and e ∈ Liso (Rm , V ). Moreover, the action is free, i.e. for
all a ∈ GL(m) and e ∈ Liso (Rm , V ), we have
a∗ e = e ⇐⇒ a = 1l.
142 CHAPTER 3. THE LEVI-CIVITA CONNECTION
π : F(M ) → M
which sends to each pair (p, e) ∈ F(M ) to the base boint π(p, e) := p. The
fiber of F(M ) over p ∈ M is the set
The frame bundle admits a right action of the group GL(m) via
This group action preserves the fibers of the frame bundle and its action on
each fiber F(M )p = Liso (Rm , Tp M ) is free and transitive. Thus each fiber
of F(M ) can be identified with the group GL(m) but the fibers of F(M )
are not themselves groups.
Then e(t) ∈ Liso (Rm , Tγ(t) M ) and the map β : R → F(M ) defined by
β(t) := (γ(t), e(t)) is a smooth curve in F(M ) satisfying π(β(t)) = γ(t).
Differentiating this equation at t = 0 we obtain
Thus we have proved that the linear map dπ(p0 , e0 ) : T(p0 ,e0 ) F(M ) → Tp0 M
is surjective for every pair (p0 , e0 ) ∈ F(M ). Hence π is a submersion, as
claimed. This proves the lemma.
If we denote by
ei := e(0, . . . , 0, 1, 0, . . . , 0)
the basis of Tp M induced by the isomorphism e : Rm → Tp M then we have
e1 , . . . , em is an
eT e = 1l ⇐⇒ hei , ej i = δij ⇐⇒
orthonormal basis.
Exercise 3.10. Prove that O(M ) is a submanifold of F(M ) and that the
obvious projection π : O(M ) → M is a submersion. Prove that the action
of GL(m) on F(M ) restricts to an action of the orthogonal group O(m) on
O(M ) whose orbits are the fibers
Lemma 3.12. (i) The tangent space of F(M ) at (p, e) ∈ F(M ) is the direct
sum
T(p,e) F(M ) = H(p,e) ⊕ V(p,e)
of the horizontal space
H(p,e) := (v, hp (v)e) v ∈ Tp M (3.8)
(ii) The vertical space V(p,e) is the kernel of the linear map
Proof. If β : R → F(M ) is a vertical curve with π◦β ≡ p then β(t) = (p, e(t))
where e(t) ∈ Liso (Rm , Tp M ). Hence ė(0) ∈ L(Rm , Tp M ). Conversely, for
every ê ∈ L(Rm , Tp M ), the curve
takes values in the open set Liso (Rm , Tp M ) for t sufficiently small and sat-
isfies ė(0) = ê. This shows that V(p,e) ⊂ T(p,e) F(M ).
146 CHAPTER 3. THE LEVI-CIVITA CONNECTION
−1
F(M)p = π (p)
F(M)
(p,e)
M p
The reason for the terminology introduced in Definition 3.11 is that one
draws the extremely crude picture of the frame bundle displayed in Fig-
ure 3.1. One thinks of F(M ) as “lying over” M . One would then represent
the equation γ = π ◦ β by the following commutative diagram:
F(M ) ;
8
β
ppppp
p
ppp
π
ppp γ /M
R
hence the word “lift”. The vertical space is tangent to the vertical line in
Figure 3.1 while the horizontal space is transverse to the vertical space. This
crude imagery can be extremely helpful.
Exercise 3.13. The group GL(m) acts on F(M ) by diffeomorphisms. Thus
for each a ∈ GL(m) the map
Bξ (p, e) ∈ H(p,e) ,
Proof. That (iii) implies (ii) is obvious. Just take I = R and let ξ : R → Rm
be a constant map.
We prove that (ii) implies (i). Let p0 ∈ M and v0 ∈ Tp0 M be given. Let
e0 ∈ Liso (Rm , Tp0 M ) be any isomorphism and choose ξ ∈ Rm such that
e 0 ξ = v0 .
By (ii) the vector field Bξ has a unique integral curve β : R → F(M ) with
β(0) = (p0 , e0 ). Write β in the form β(t) = (γ(t), e(t)). Then
and hence
We prove that (i) implies (iii). Thus we assume that T > t0 is such that
T ∈ I and the solution of (3.11) exists on the interval [t0 , T ) but cannot be
extended to [t0 , T + ε) for any ε > 0. We shall use the metric completeness
of M to prove the existence of the limit
(p1 , e1 ) = lim β(T ) ∈ F(M ) (3.12)
t→T
t<T
and this implies, by Lemma 2.59, that the solution can be extended to the
interval [t0 , T ], a contradiction. To prove the existence of the limit (3.12)
we write β(t) =: (γ(t), e(t)) so that γ and e satisfy the equations
γ̇(t) = e(t)ξ(t), ė(t) = hγ(t) (γ̇(t))e(t), γ(0) = p0 , e(0) = e0 . (3.13)
on the half-open interval [t0 , T ). In particular, ė(t)η is orthogonal to Tγ(t) M
and e(t)ζ ∈ Tγ(t) M for all η, ζ ∈ Rm and t ∈ [t0 , T ). Hence
d
he(t)η, e(t)ζi = hė(t)η, e(t)ζi + he(t)η, ė(t)ζi = 0. (3.14)
dt
In particular, the function t 7→ |e(t)η| is constant. This implies
|e(t)η| |e0 η|
ke(t)k := sup = sup = ke0 k (3.15)
η6=0 |η| η6=0 |η|
and thus
|γ̇(t)| = |e(t)ξ(t)| ≤ ke0 k |ξ(t)| ≤ ke0 k sup |ξ(s)| =: cT (3.16)
t0 ≤s≤T
This implies
ke(t) − e(s)k ≤ ccT ke0 k (t − s)
for t0 < s < t < T . Hence e(t) converges to a matrix e1 ∈ Rn×m as t tends
to T . The image of this limit matrix is contained in Tp1 M because
Π(p1 )e1 = lim Π(γ(t))e(t) = lim e(t) = e1 .
t→T t→T
3.3.1 Motion
Definition 3.18. A motion of M along M 0 (on an interval I ⊂ R) is
a triple (Ψ, γ, γ 0 ) of smooth maps
Ψ : I → O(n), γ : I → M, γ0 : I → M 0
such that
Ψ(t)Tγ(t) M = Tγ 0 (t) M 0 ∀ t ∈ I.
Note that a motion also matches normal vectors, i.e.
⊥
Ψ(t)Tγ(t) M ⊥ = Tγ 0 (t) M 0 ∀ t ∈ I.
Remark 3.19. Associated to a motion (Ψ, γ, γ 0 ) of M along M 0 is a family
of (affine) isometries ψt : Rn → Rn defined by
ψt (p) := γ 0 (t) + Ψ(t) p − γ(t)
(3.18)
for t ∈ I and p ∈ Rn . These isometries satisfy
ψt (γ(t)) = γ 0 (t), dψt (γ(t))Tγ(t) M = Tγ 0 (t) M 0 ∀ t ∈ I.
152 CHAPTER 3. THE LEVI-CIVITA CONNECTION
We now give the three simplest examples of “bad” motions; i.e. motions
which do not satisfy the concepts we are about to define. In all three of
these examples, p is a point of M and M 0 is the affine tangent space to M
at p:
M 0 := p + Tp M = {p + v | v ∈ Tp M } .
M’
p
M := (x, y, z) ∈ R3 | x2 + (y − 1)2 + z 2 = 1 ,
p := (0, 0, 0).
Then M 0 is the (x, z)-plane and A(t) is any curve of rotations in the (x, z)-
plane, i.e. about the y-axis Tp M ⊥ . (See Figure 3.3.)
3.3. MOTIONS AND DEVELOPMENTS 153
M’
p
Example 3.23 (Pure wobbling). This is the same as pure twisting except
that Ψ(t) is the identity on Tp M and any curve of rotations on Tp M ⊥ . As
a concrete example with m = 1 and n = 3 one can take M to be the circle
of radius one in the (x, y)-plane centered at the point (0, 1, 0) and p to be
the origin:
M := (x, y, 0) ∈ R3 | x2 + (y − 1)2 = 1 ,
p := (0, 0, 0).
Then M 0 is the x-axis and Ψ(t) is any curve of rotations in the (y, z)-plane,
i.e. about the axis M 0 . (See Figure 3.4.)
M’ p
3.3.2 Sliding
When a train slides on the track (e.g. in the process of stopping suddenly),
there is a terrific screech. Since we usually do not hear a screech, this means
that the wheel moves along without sliding. In other words the velocity of
the point of contact in the train wheel M equals the velocity of the point
of contact in the track M 0 . But the track is not moving; hence the point of
contact in the wheel is not moving. One may explain the paradox this way:
the train is moving forward and the wheel is rotating around the axle. The
velocity of a point on the wheel is the sum of these two velocities. When
the point is on the bottom of the wheel, the two velocities cancel.
Definition 3.24. A motion (Ψ, γ, γ 0 ) is without sliding if
Ψ(t)γ̇(t) = γ̇ 0 (t)
for every t.
154 CHAPTER 3. THE LEVI-CIVITA CONNECTION
acting on the manifold M with γ(t) ∈ M being the point of contact with M 0 .
Differentiating the curve t 7→ ψt (p) which describes the motion of the point
p ∈ M in the space Rn we obtain
d
ψt (p) = γ̇ 0 (t) − Ψ(t)γ̇(t) + Ψ̇(t) p − γ(t) .
dt
Taking p = γ(t0 ) we find
d
ψt (γ(t0 )) = γ̇ 0 (t0 ) − Ψ(t0 )γ̇(t0 ).
dt t=t0
Exercise 3.25. Give an example of a motion where |γ̇(t)| = |γ̇ 0 (t)| for every
t but which is not without sliding.
M := (x, y) ∈ R2 | x2 + (y − 1)2 = 1 .
3.3. MOTIONS AND DEVELOPMENTS 155
Choose
γ(t) := (cos(t − π/2), 1 + sin(t − π/2))
= (sin(t), 1 − cos(t)),
0
γ (t) := (t, 0),
The reader can easily verify that this is a motion without sliding. A fixed
point p0 on M , say p0 = (0, 0), sweeps out a cycloid with parametric equa-
tions
x = t − sin(t), y = 1 − cos(t).
(Check that (ẋ, ẏ) = (0, 0) when y = 0; i.e. for t = 2nπ.)
x2 + (y − 1)2 + z 2 = 1,
the plane is y = 0 and the line is the x-axis. The z-coordinate of a point is
unaffected by the motion. Note that the curve γ 0 traces out a straight line
in the plane M 0 and the curve γ traces out a great circle on the sphere M .
∇0 (ΨX) = Ψ∇X.
(iii) Ψ transforms parallel vector fields along γ into parallel vector fields
along γ 0 , i.e. for X ∈ Vect(γ)
∇X = 0 =⇒ ∇0 (ΨX) = 0.
Ψ(t)Π(γ(t)) = Π0 (γ 0 (t))Ψ(t)
for every t ∈ I. This restates the condition that Ψ(t) maps tangent vectors
of M to tangent vectors of M 0 and normal vectors of M to normal vectors
of M 0 . Differentiating the equation X 0 (t) = Ψ(t)X(t) we obtain
∇0 X 0 = Ψ∇X + Π0 (γ 0 )Ψ̇X.
(iii) Ψ transforms parallel normal vector fields along γ into parallel normal
vector fields along γ 0 , i.e. for Y ∈ Vect⊥ (γ)
⊥
∇⊥ Y = 0 =⇒ ∇0 (ΨY ) = 0.
Ψ0 Tγ(t0 ) M = Tγ 0 (t0 ) M 0
Ψ(t0 ) = Ψ0 .
X 0 (t) = Ψ0 v0 , Ψ0 := Ψ(t0 ).
Ψ(t)X(t) = Ψ0 v0
Then X(t) = Φγ (t, t0 )v0 . To put it another way, imagine that M is a ball. To
define parallel transport along a given curve γ roll the ball (without sliding)
along a plane M 0 keeping the curve γ in contact with the plane M 0 . Let γ 0
be the curve traced out in M 0 . If a constant vector field in the plane M 0 is
drawn in wet ink along the curve γ 0 it will mark off a (covariant) parallel
vector field along γ in M .
3.3.4 Development
A development is an intrinsic version of motion without sliding or twisting.
Proof. That (iii) implies (ii) and (ii) implies (i) is obvious. To prove that (i)
implies (iii) choose any t0 ∈ I and any orthogonal matrix Ψ0 ∈ O(n) such
that Ψ0 |Tγ(t0 ) M = Φ(t0 ) and define Ψ(t) : Rn → Rn by (3.19).
Φ0 : Tp0 M → Tp00 M 0
(ii) Any two developments (Φ1 , γ1 , γ 0 |I1 ) and (Φ2 , γ2 , γ 0 |I2 ) as in (i) on two
intervals I1 and I2 agree on the intersection I1 ∩ I2 , i.e. γ1 (t) = γ2 (t) and
Φ1 (t) = Φ2 (t) for every t ∈ I1 ∩ I2 .
(iii) If M is complete then (i) holds with I = R.
Proof. Let γ : R → M be any smooth map such that γ(t0 ) = p0 and define
Φ(t) : Tγ(t) M → Tγ 0 (t) M 0 by
We can think of e(t) as a real k×m-matrix and the map R → Rn×m : t 7→ e(t)
is smooth. In fact, the map t 7→ (γ(t), e(t)) is a smooth path in the frame
bundle F(M ). Define the smooth map ξ : R → Rm by
for every t ∈ I, where Bξ(t) ∈ Vect(F(M )) denotes the basic vector field
associated to ξ(t) ∈ Rm (see equation (3.10)).
3.3. MOTIONS AND DEVELOPMENTS 161
Curvature
4.1 Isometries
Let M and M 0 be submanifolds of Rn . An isometry is an isomorphism of
the intrinsic geometries of M and M 0 . Recall the definition of the intrinsic
distance function d : M × M → [0, ∞) (in Section 2.1) by
Z 1
d(p, q) := inf L(γ), L(γ) = |γ̇(t)| dt
γ∈Ωp,q 0
for p, q ∈ M . Let d0 denote the intrinisic distance function on M 0 .
Theorem 4.1 (Isometries). Let φ : M → M 0 be a bijective map. Then
the following are equivalent.
(i) φ intertwines the distance functions on M and M 0 , i.e. for all p, q ∈ M
d0 (φ(p), φ(q)) = d(p, q)
(ii) φ is a diffeomorphism and dφ(p) : Tp M → Tφ(p) M 0 is an orthogonal
isomorphism for every p ∈ M .
(iii) φ is a diffeomorphism and L(φ ◦ γ) = L(γ) for every smooth curve
γ : [a, b] → M .
φ is called an isometry if it satisfies these equivalent conditions. In the
case M = M 0 the isometries φ : M → M form a group denoted by I(M )
and called the isometry group of M .
Lemma 4.2. For every p ∈ M there is a constant ε > 0 such that, for all
v, w ∈ Tp M with 0 < |w| < |v| < ε, we have
|w|
d(expp (w), expp (v)) = |v| − |w| =⇒ w= v. (4.1)
|v|
163
164 CHAPTER 4. CURVATURE
Remark 4.3. It follows from the triangle inequality and Theorem 2.44 that
d(expp (v), expp (w)) ≥ d(expp (v), p) − d(expp (w), p) = |v| − |w|
whenever 0 < |w| < |v| < inj(p). Lemma 4.2 asserts that equality can only
hold when w is a positive multiple of v or, to put it differently, that the
distance between expp (v) and expp (w) must be strictly bigger that |v| − |w|
whenever w is not a positive multiple of v.
By Theorem 2.44 and the definition of the injectivity radius, the exponential
map at p is a diffeomorphism expp : Bε (p) → Uε (p) for ε < inj(p). Choose
0 < r < inj(p). Then the closure of Ur (p) is a compact subset of M . Hence
there is a constant ε > 0 such that ε < r and ε < inj(p0 ) for every p0 ∈ Ur (p).
Since ε < r we have
Since p1 ∈ Uε (p) it follows from our choice of ε that ε < inj(p1 ). Hence there
is a unique tangent vector v1 ∈ Tp1 M such that
Following first the shortest geodesic from p to p1 and then the shortest
geodesic from p1 to p2 we obtain (after suitable reparametrization) a smooth
γ : [0, 2] → M such that
and
Since w and β(1)v are both elements of Bε (p) and expp is injective on Bε (p),
this implies w = β(1)v. Since β(1) ≥ 0 we have β(1) = |w| / |v|. This
proves (4.1) and the lemma.
Proof of Theorem 4.1. That (ii) implies (iii) follows from the definition of
the length of a curve. Namely
Z b
d
L(φ ◦ γ) = φ(γ(t)) dt
dt
a
Z b
= |dφ(γ(t))γ̇(t)| dt
a
Z b
= |γ̇(t)| dt
a
= L(γ).
In the third equation we have used (ii). That (iii) implies (i) follows imme-
diately from the definition of the intrinsic distance functions d and d0 .
We prove that (i) implies (ii). Fix a point p ∈ M and choose ε > 0
so small that ε < inj(p) and that the assertion of Lemma 4.2 holds for the
point p0 := φ(p) ∈ M 0 . Then there is a unique homeomorphism
Φp : Bε (p) → Bε (φ(p))
Φp
Tp M ⊃ Bε (p) / Bε (φ(p)) ⊂ Tφ(p) M 0 .
expp exp0φ(p)
φ
M ⊃ Uε (p) / Uε (φ(p)) ⊂ M0
Claim 1. The map Φp satisfies the following equations for every v ∈ Bε (p)
and every t ∈ [0, 1]:
exp0φ(p) (Φp (v)) = φ(expp (v)), (4.3)
|Φp (v)| = |v| , (4.4)
Φp (tv) = tΦp (v). (4.5)
Since Φp is linear, this shows that the restriction of φ to the open set Uε (p)
is smooth. Moreover, for every v ∈ Tp M we have
d d
dφ(p)v = φ(expp (tv)) = exp0φ(p) (tΦp (v)) = Φp (v).
dt t=0 dt t=0
Here we have used equations (4.3) and (4.5) as well as Lemma 2.36. This
proves Claim 3 and the theorem.
ψ(p) = Ap + b
II. Given M and M 0 when are they intrinsically isomorphic, i.e. when is
there an isometry φ : M → M 0 from M onto M 0 ?
As we have noted, both the first and second fundamental forms are
preserved by extrinsic isomorphisms while only the first fundamental form
need be preserved by an intrinsic isomorphism (i.e. an isometry).
A question which occurred to Gauss (who worked for a while as a car-
tographer) is this: Can one draw a perfectly accurate map of a portion of
the earth? (i.e. a map for which the distance between points on the map is
proportional to the distance between the corresponding points on the surface
of the earth). We can now pose this question as follows: Is there an isom-
etry from an open subset of a sphere to an open subset of a plane? Gauss
answered this question negatively by associating an invariant, the Gaussian
curvature K : M → R, to a surface M ⊂ R3 . According to his Theorema
Egregium
K0 ◦ φ = K
for an isometry φ : M → M 0 . The sphere has positive curvature; the plane
has zero curvature; hence the perfectly accurate map does not exist. Our
aim is to explain these ideas.
We shall need a concept slightly more general than that of “isometry”.
R → S 1 : θ 7→ eiθ
We close this section with a result which asserts that two local isometries
that have the same value and the same derivative at a single point must agree
everywhere, provided that the domain is connected.
0
Lemma 4.10. Let M ⊂ Rn and M 0 ⊂ Rn be smooth m-manifolds and
assume that M is connected. Let φ : M → M 0 and ψ : M → M 0 be local
isometries and let p0 ∈ M such that
The proof of Theorem 4.1 shows that there is a constant ε > 0 such that
Uε (p) ⊂ U , Uε (p0 ) ⊂ U 0 , and
Rp : Tp M × Tp M → L(Tp M, Tp M )
We must prove that R is well defined, i.e. that the right hand side of
equation (4.6) is independent of the choice of γ and Z. This follows from the
Gauss–Codazzi formula which we prove next. Recall that the second fun-
damental form can be viewed as a linear map hp : Tp M → L(Tp M, Tp M ⊥ )
and that, for u ∈ Tp M , the linear map hp (u) ∈ L(Tp M, Tp M ⊥ ) and its dual
hp (u)∗ ∈ L(Tp M ⊥ , Tp M ) are given by
hp (u)∗ w = dΠ(p)u w
hp (u)v = dΠ(p)u v,
for v ∈ Tp M and w ∈ Tp M ⊥ .
172 CHAPTER 4. CURVATURE
Theorem 4.12. The Riemann curvature tensor is well defined and given
by the Gauss–Codazzi formula
for u, v ∈ Tp M .
∇t Z = ∂t Z − hγ (∂t γ)Z
= ∂t Z − dΠ(γ)∂t γ Z
= ∂t Z − ∂t Π ◦ γ Z.
Hence
∂s ∇t Z = ∂s ∂t Z − ∂s ∂t Π ◦ γ Z
= ∂s ∂t Z − ∂s ∂t Π ◦ γ Z − ∂t Π ◦ γ ∂s Z
= ∂s ∂t Z − ∂s ∂t Π ◦ γ Z − dΠ(γ)∂t γ ∇s Z + hγ (∂s γ)Z
∂s ∂t Z − ∂s ∂t Π ◦ γ Z − hγ (∂t γ)∇s Z − hγ (∂t γ)∗ hγ (∂s γ)Z.
=
Here the first two terms on the right are tangent to M and the last two
terms on the right are orthogonal to Tγ M . Hence
∇s ∇t Z − ∇t ∇s Z = Π(γ) ∂s ∇t Z − ∂t ∇s Z
= hγ (∂s γ)∗ hγ (∂t γ)Z − hγ (∂t γ)∗ hγ (∂s γ)Z.
This proves the Gauss–Codazzi equation and shows that the left hand side
is independent of the choice of γ and Z. This proves the theorem.
4.2. THE RIEMANN CURVATURE TENSOR 173
∇v X(p) := Π(p)dX(p)v ∈ Tp M,
∇X : Vect(M ) → Vect(M ),
∇f X (Y ) = f ∇X Y, (4.11)
∇X (f Y ) = f ∇X Y + (LX f ) Y (4.12)
LX hY, Zi = h∇X Y, Zi + hY, ∇X Zi, (4.13)
∇Y X − ∇X Y = [X, Y ], (4.14)
for all X, Y, Z ∈ Vect(M ) and f ∈ F (M ), where LX f = df ◦ X and
[X, Y ] ∈ Vect(M ) denotes the Lie bracket of the vector fields X and Y .
The next lemma asserts that the Levi-Civita connection is uniquely deter-
mined by (4.13) and (4.14).
The same equation holds for the Levi-Civita connection and hence
ξ = (ξ 1 , . . . , ξ m ) : Ω → Rm
defined by
ξ(φ(p)) = dφ(p)X(p)
m
X
k` 1 ∂g`i ∂g`j ∂gij
Γkij := g j
+ i
− , (4.16)
2 ∂x ∂x ∂x`
`=1
where gij is the metric tensor and g ij is the inverse matrix so that
X
gij g jk = δik
j
(see Theorem 2.27). This formula can be used to prove the existence state-
ment in Lemma 4.17 and hence define the Levi-Civita connection in the
intrinsic setting.
Exercise 4.19. In the proof of Lemma 4.17 we did not actually use that the
operator DX : Vect(M ) → Vect(M ) is linear nor that the operator X 7→ DX
is linear. Prove directly that if a map
DX : L(M ) → L(M )
satisfies (4.13) for all Y, Z ∈ Vect(M ) then DX is linear. Prove that every
map
Vect(M ) → L(Vect(M ), Vect(M )) : X 7→ DX
Proof. Fix a point p ∈ M . Then the right hand side of equation (4.17) at
p remains unchanged if we multiply each of the vector fields X, Y, Z by a
smooth function f : M → [0, 1] that is equal to one near p. Choosing f with
compact support we may therefore assume that the vector fields X and Y
are complete. Let φs denote the flow of X and ψ t the flow of Y . Define the
map γ : R2 → M by
γ(s, t) := ψ s ◦ ψ t (p), s, t ∈ R.
Then
∂s γ = X(γ), ∂t γ = (φs∗ Y )(γ).
This implies
∇s ∇t (Z ◦ γ) = ∇∂s γ ∇φs∗ Y Z (γ) + ∇∂s φs∗ Y Z (γ).
Since
∂
φs Y = [X, Y ]
∂s s=0 ∗
and ∂s γ = X(γ) we obtain
Hence
Rp (X(p), Y (p))Z(p) = ∇s ∇t (Z ◦ γ) − ∇t ∇s (Z ◦ γ) (0, 0)
= ∇X ∇Y Z(p) − ∇Y ∇X Z(p) + ∇[X,Y ] Z(p).
Remark 4.22. Equation (4.17) can be used to define the Riemann cur-
vature tensor. To do this one must again prove that the right hand side
of equation (4.17) at p depends only on the values X(p), Y (p), Z(p) of the
vector fields X, Y, Z at the point p. For this it suffices to prove that the map
4.2.4 Symmetries
Theorem 4.23. The Riemann curvature tensor satisfies
R(Y, X) = −R(X, Y ) = R(X, Y )∗ , (4.21)
R(X, Y )Z + R(Y, Z)X + R(Z, X)Y = 0, (4.22)
hR(X, Y )Z, W i = hR(Z, W )X, Y i, (4.23)
for X, Y, Z, W ∈ Vect(M ). Equation (4.22) is the first Bianchi identity.
Proof. The first equation in (4.21) is obvious from the definition and the
second equation follows immediately from the Gauss–Codazzi formula (4.8).
Alternatively, we may choose a smooth map γ : R2 → M and two vector
fields Z, W along γ. Then
0 = ∂s ∂t hZ, W i − ∂t ∂s hZ, W i
= ∂s h∇t Z, W i + ∂s hZ, ∇t W i − ∂t h∇s Z, W i − ∂t hZ, ∇s W i
= h∇s ∇t Z, W i + hZ, ∇s ∇t W i − h∇t ∇s Z, W i − hZ, ∇t ∇s W i
= hR(∂s γ, ∂t γ)Z, W i − hZ, R(∂s γ, ∂t γ)W i.
This proof has the advantage that it carries over to the intrinsic setting. We
prove the first Bianchi identity using (4.14) and (4.17):
R(X, Y )Z + R(Y, Z)X + R(Z, X)Y
= ∇X ∇Y Z − ∇Y ∇X Z + ∇[X,Y ] Z + ∇Y ∇Z X − ∇Z ∇Y X + ∇[Y,Z] X
+∇Z ∇X Y − ∇X ∇Z Y + ∇[Z,X] Y
= ∇[Y,Z] X − ∇X [Y, Z] + ∇[Z,X] Y − ∇Y [Z, X] + ∇[X,Y ] Z − ∇Z [X, Y ]
= [X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y ]].
The last term vanishes by the Jacobi identity. We prove (4.23) by combining
the first Bianchi identity with (4.21):
hR(X, Y )Z, W i − hR(Z, W )X, Y i
= −hR(Y, Z)X, W i − hR(Z, X)Y, W i − hR(Z, W )X, Y i
= hR(Y, Z)W, Xi + hR(Z, X)W, Y i + hR(W, Z)X, Y i
= hR(Y, Z)W, Xi − hR(X, W )Z, Y i
= hR(Y, Z)W, Xi − hR(W, X)Y, Zi.
Note that the first line is related to the last by a cyclic permutation. Re-
peating this argument we find
hR(Y, Z)W, Xi − hR(W, X)Y, Zi = hR(Z, W )X, Y i − hR(X, Y )Z, W i.
combining these two identities we obtain (4.23). This proves the theorem
4.2. THE RIEMANN CURVATURE TENSOR 179
g := Lie(G) = T1l G
be its Lie algebra. Fix any inner product on the Lie algebra g (not neces-
sarily invariant under conjugation) and consider the Riemannian metric on
G defined by
hv, wig := hvg −1 , wg −1 i
1
hA(ξ)η, ζi = hξ, [η, ζ]i − hη, [ζ, ξ]i − hζ, [ξ, η]i
2
for ξ, η, ζ ∈ g. Then A is the unique linear map that satisfies
for all ξ, η ∈ g. Here A(ξ)∗ denotes the adjoint operator with respect to the
given inner product on g. Note that A(ξ)η = − 12 [ξ, η] whenever the inner
product on g is invariant under conjugation.
(b) Let γ : R → G be a smooth curve and X ∈ Vect(γ) be a smooth vector
field along γ. Then the covariant derivative of X is given by
d −1 −1 −1
∇X = (Xγ ) + A(γ̇γ )Xγ γ.
dt
d
(γ̇γ −1 ) + A(γ̇γ −1 )γ̇γ −1 = 0.
dt
4.3.1 Pushforward
0
We assume throughout this section that M ⊂ Rn and M 0 ⊂ Rn are smooth
submanifolds of the same dimension m. As in Section 4.1 we denote objects
on M 0 by the same letters as objects in M with primes affixed. In particular,
g 0 denotes the first fundamental form on M 0 and R0 denotes the Riemann
curvature tensor on M 0 .
Let φ : M → M 0 be a diffeomorphism. Using φ we can move objects on
M to M 0 . For example the pushforward of a smooth curve γ : I → M is the
curve
φ∗ γ := φ ◦ γ : I → M 0 ,
the pushforward of a smooth function f : M → R is the function
φ∗ f := f ◦ φ−1 : M 0 → R,
∇0 (φ∗ X) = φ∗ ∇X (4.27)
(v) φ∗ R = R0 .
184 CHAPTER 4. CURVATURE
Hence it follows from Theorem 2.27 that Γ0 kij = Γkij for all i, j, k. This implies
that the covariant derivative of φ∗ X is given by
m m
X X ∂ψ 0
∇0 (φ∗ X) = ξ˙k + Γkij (c)ċi ξ j k (c)
∂x
k=1 i,j=1
m m
X
ξ˙k +
X ∂ψ
= dφ(ψ(c)) Γkij (c)ċi ξ j k (c)
∂x
k=1 i,j=1
∗
= φ ∇X.
This proves (4.27). Equation (4.28) follows immediately from (4.27) and
Remark 4.14.
4.3. GENERALIZED THEOREMA EGREGIUM 185
DX Y := φ∗ (∇φ∗ X φ∗ Y ) .
E(φ ◦ γ) = E(γ)
for every smooth map γ : [0, 1] → M . Hence γ is a critical point of the energy
functional if and only if φ ◦ γ is a critical point of the energy functional.
Alternatively it follows from (ii) that
0 d
∇ φ ◦ γ = ∇0 φ∗ γ̇ = φ∗ ∇γ̇
dt
By (ii) the vector fields X 0 and φ∗ X along φ ◦ γ are both parallel and they
agree at t = t0 . Hence X 0 (t) = φ∗ X(t) for all t ∈ I and this proves (4.29).
186 CHAPTER 4. CURVATURE
Warning: Care must be taken with the ordering of the indices. Some
`
authors use the notation Rkij `
for what we call Rijk and R`kij for what we
call Rijk` .
Exercise 4.31. Prove equations (4.31), (4.32), and (4.33). Use (4.31) to
give an alternative proof of Theorem 4.30.
ν : M → Sm.
dν(p) : Tp M → Tν(p) S m = Tp M
Here we use the fact that Tν(p) S m = ν(p)⊥ and, by definition of the Gauss
map ν, the tangent space of M at p is also equal to ν(p)⊥ . Thus dν(p) is
linear map from the tangent space of M at p to itself.
Definition 4.32. The Gaussian curvature of the hypersurface M is the
real valued function K : M → R defined by
K(p) := det dν(p) : Tp M → Tp M
AreaS (ν(B))
|K(p)| = lim .
B→p AreaM (B)
This says that the curvature at p is roughly the ratio of the (m-dimensional)
area of the spherical image ν(B) to the area of B where B is a very small open
neighborhood of p in M . The sign of K(p) is positive when the linear map
dν(p) : Tp M → Tp M preserves orientation and negative when it reverses
orientation.
Remark 4.34. We see that the Gaussian curvature is a natural general-
ization of Euler’s curvature for a plane curve. Indeed if M ⊂ R2 is a
1-manifold and p ∈ M we can choose a curve γ = (x, y) : (−ε, ε) → M such
that γ(0) = p and |γ̇(s)| = 1 for every s. This curve parametrizes M by the
arclength and the unit normal vector pointing to the right with respect to
the orientation of γ is ν(x, y) = (ẏ, −ẋ). This is a local Gauss map and its
derivative (ÿ, −ẍ) is tangent to the curve. The inner product of the latter
with the unit tangent vector γ̇ = (ẋ, ẏ) is the Gaussian curvature. Thus
dx d2 y dy d2 x dθ
K := − =
ds ds2 ds ds2 ds
where s is the arclength parameter and θ is the angle made by the normal
(or the tangent) with some constant line. With this convention K is positive
at a left turn and negative at a right turn.
Exercise 4.35. The Gaussian curvature of a sphere of radius r is constant
and has the value r−m .
Exercise 4.36. Show that the Gaussian curvature of the surface z = x2 −y 2
is −4 at the origin.
We now restrict to the case of surfaces, i.e. of 2-dimensional submani-
folds of R3 . Figure 4.1 illustrates the difference between positive and nega-
tive Gaussian curvature in dimension two.
4.3. GENERALIZED THEOREMA EGREGIUM 189
hR(u, v)v, ui
K(p) = . (4.34)
|u|2 |v|2 − hu, vi2
Moreover, for all u, v, w ∈ Tp M , we have
Π(p) = 1l − ν(p)ν(p)T .
Hence
dΠ(p)u = −ν(p)(dν(p)u)T − dν(p)u ν(p)T .
Here the first summand is the second fundamental form, which maps Tp M
to Tp M ⊥ , and the second summand is its dual, which maps Tp M ⊥ to Tp M .
Thus
T
hp (v) = ν(p) dν(p)v : Tp M → Tp M ⊥ ,
hp (u)∗ = dν(p)u ν(p)T : Tp M ⊥ → Tp M.
and hence
A B C
R3 −→ R3 −→ R3 −→ R3
Aξ := ξ 1 ν(p) + ξ 2 u + ξ 3 v,
dν(p)η, if η ⊥ ν(p),
Bη :=
η, if η ∈ Rν(p),
hζ, ν(p)i
Cζ := hζ, zi .
hζ, wi
We abbreviate
D := EG − F 2 .
192 CHAPTER 4. CURVATURE
∂y F − 21 ∂x G
E F
1 1
K ◦ψ = det F G 2 ∂y G
D2 1 1 1 2 1 2
2 ∂x E ∂x F − 2 ∂y E − 2 ∂y E + ∂x ∂y F − 2 ∂x G
1
E F 2 ∂y E
1 1
− 2 det F G 2 ∂x G
D 1 1
∂y E 2 ∂x G 0
2
1 ∂ E∂x G − F ∂y E
= − √ √
2 D ∂x E D
1 ∂ 2E∂x F − F ∂x E − E∂y E
+ √ √ .
2 D ∂y E D
This expression simplifies dramatically when F = 0 and we get
1 ∂ ∂x G ∂ ∂y E
K ◦ψ =− √ √ + √ (4.37)
2 EG ∂x EG ∂y EG
Exercise 4.40. Prove that the Riemannian metric
4
E=G= , F = 0,
(1 + x2 + y 2 )2
4.4.1 Homotopy
Definition 4.41. Let M be a manifold and I = [a, b] be a compact interval.
A (smooth) homotopy of maps from I to M is a smooth map
γ : [0, 1] × I → M.
We often write
γλ (t) = γ(λ, t)
for λ ∈ [0, 1] and t ∈ I and call γ a (smooth) homotopy between γ0
and γ1 . We say the homotopy has fixed endpoints if γλ (a) = γ0 (a) and
γλ (b) = γ0 (b) for all λ ∈ [0, 1]. (See Figure 4.2.)
We remark that a homotopy and a variation are essentially the same
thing, namely a curve of maps (curves). The difference is pedagogical. We
used the word “variation” to describe a curve of maps through a given
map; when we use this word we are going to differentiate the curve to find
a tangent vector (field) to the given map. The word “homotopy” is used
to describe a curve joining two maps; it is a global rather than a local
(infinitesimal) concept.
γ1
M
γ0
Example 4.45. The Euclidean space Rm is simply connected, for any two
curves γ0 , γ1 : [a, b] → Rm with the same endpoints can be joined by the
homotopy
γλ (t) := γ0 (t) + λ(γ1 (t) − γ0 (t)).
The punctured plane C \ {0} is not simply connected, for the curves
γn (t) := e2πint , 0 ≤ t ≤ 1,
then
γ(1) = p0 =⇒ γ 0 (1) = p00 , Φ(1) = Φ0
(iii) If (Φ0 , γ0 , γ00 ) and (Φ1 , γ1 , γ10 ) are developments satisfying the initial
condition (4.39) then
for every t.
4.4. THE CARTAN–AMBROSE–HICKS THEOREM 195
Example 4.49. Before giving the proof let us interpret the conditions in
case M and M 0 are two-dimensional spheres of radius r and r0 respectively in
three-dimensional Euclidean space R3 . Imagine that the spheres are tangent
at p0 = p00 . Clearly the spheres will be isometric exactly when r = r0 .
Condition (ii) says that if the spheres are rolled along one another without
sliding or twisting then the endpoint γ 0 (1) of one curve of contact depends
only on the endpoint γ(1) of the other and not on the intervening curve γ(t).
By Theorem 4.37 the Riemann curvature of a 2-manifold at p is determined
by the Gaussian curvature K(p); and for spheres we have K(p) = 1/r.
p0
M’
Exercise 4.50. Let γ be the closed curve which bounds an octant as shown
in the diagram for Example 4.49. Find γ 0 .
for t ∈ I. Then γ̇ 0 = Φγ̇ by the chain rule and, for every vector field X
along γ, we have
Φ∇X = ∇0 (ΦX)
by Theorem 4.30. Hence it follows from Lemma 3.29 and Lemma 3.36
that (Φ, γ, γ 0 ) is a development. Since (Φ, γ, γ 0 ) satisfies the initial condi-
tion (4.39) the assertion follows from the uniqueness result for developments
in Theorem 3.38. This proves the lemma.
196 CHAPTER 4. CURVATURE
for every t, by Lemma 4.48. Hence it follows from Theorem 4.30 that
from γ0 to γ1 with endpoints fixed. By Theorem 3.38 there is, for each λ, a
development (Φλ , γλ , γλ0 ) on the interval [0, 1] with initial conditions
(because M 0 is complete). The proof of Theorem 3.38 also shows that γλ (t)
and Φλ (t) depend smoothly on both t and λ. We must prove that
To see this we will show that, for each fixed t, the curve
as required.
198 CHAPTER 4. CURVATURE
First choose a basis e1 , . . . , em for Tp0 M and extend to get vector fields
Ei ∈ Vect(γ) along the homotopy γ by imposing the conditions that the
vector fields t 7→ Ei (λ, t) be parallel, i.e.
Then the vectors E1 (λ, t), . . . Em (λ, t) form a basis of Tγλ (t) M for all λ and t.
Second, define the vector fields Ei0 along γ 0 by
so that
∇0t Ei0 = 0.
Third, define the functions ξ 1 , . . . , ξ m : [0, 1]2 → R by
m
X m
X
∂t γ =: ξ i Ei , ∂t γ 0 = ξ i Ei0 . (4.42)
i=1 i=1
Here the second equation follows from (4.41) and the fact that Φλ ∂t γ = ∂t γ 0 .
Now consider the vector fields
∇0t X 0 = ∇0t ∂λ γ 0
= ∇0λ ∂t γ 0
X m
0
ξ i Ei0
= ∇λ
i=1
m
X
∂λ ξ i Ei0 + ξ i Yi0
=
i=1
and
∇0t Yi0 = ∇0t ∇0λ Ei0 − ∇0λ ∇0t Ei0 = R0 (∂t γ 0 , ∂λ γ 0 )Ei0 .
To sum up we have X 0 (λ, 0) = Yi0 (λ, 0) = 0 and
m
X
∇0t X 0 = ∂λ ξ i Ei0 + ξ i Yi0 , ∇0t Yi0 = R0 (∂t γ 0 , ∂λ γ 0 )Ei0 .
(4.44)
i=1
4.4. THE CARTAN–AMBROSE–HICKS THEOREM 199
X 0 := Φλ ∂λ γ, Yi0 := Φλ ∇λ Ei (4.45)
∇0t X 0 = Φλ ∇t ∂λ γ
= Φλ ∇λ ∂t γ
m
!
X
= Φλ ∇λ ξ i Ei
i=1
m
X
∂λ ξ i Ei + ξ i ∇λ Ei
= Φλ
i=1
m
X
∂λ ξ i Ei0 + ξ i Yi0
=
i=1
and
∇0t Yi0 = Φλ ∇t ∇λ Ei − ∇λ ∇t Ei
= Φλ R(∂t γ, ∂λ γ)Ei
= R0 (Φλ ∂t γ, Φλ ∂λ γ)Φλ Ei
= R0 (∂t γ 0 , X 0 )Ei0 .
∂λ γ 0 = Φλ ∂λ γ, ∇0λ Ei0 = Φλ ∇λ Ei
for i = 1, . . . , m. This says that λ 7→ (Φλ (t), γλ (t), γλ0 (t)) is a development.
For t = 1 we obtain ∂λ γ 0 (λ, 1) = 0 as required.
Now the modified theorem (where φ is a local isometry) is proved. The
original theorem follows immediately. Condition (iv) is symmetric in M
and M 0 ; hence if we assume (iv) we have local isometries φ : M → M 0 ,
ψ : M 0 → M with
But then ψ ◦ φ is a local isometry with ψ ◦ φ(p0 ) = p0 and d(ψ ◦ φ)(p0 ) = id.
Hence ψ ◦ φ is the identity. Similarly φ ◦ ψ is the identity so φ is bijective
(and ψ = φ−1 ) as required.
200 CHAPTER 4. CURVATURE
Remark 4.52. The proof of Theorem 4.47 shows that the various implica-
tions in the weak version of the theorem (where φ is only a local isometry)
require the following conditions on M and M 0 :
(i) always implies (ii), (iii), and (iv);
(ii) implies (iii) whenever M 0 is complete;
(iii) implies (i) whenever M 0 is complete and M is connected;
(iv) implies (iii) whenever M 0 is complete and M is simply connected.
then
γ(1) = p0 =⇒ γ 0 (1) = p00 , Φ(1) = Φ0 .
(iii) If (Φ0 , γ0 , γ00 ) and (Φ1 , γ1 , γ10 ) are developments as in (ii) then
γ(t) := expp0 (tv), γ 0 (t) := exp0p0 (tΦ0 v), Φ(t) := Φ0γ 0 (t, 0)Φ0 Φγ (0, t),
0
Lemma 4.54. Let p ∈ M and v, w ∈ Tp M such that |v| < inj(p). For
0 ≤ t ≤ 1 define
∂
γ(t) := exp(tv), X(t) := expp t(v + λw) ∈ Tγ(t) M.
∂λ λ=0
Then
∇t ∇t X = R(γ̇, X)γ̇, X(0) = 0, ∇t X(0) = w. (4.46)
A vector field along γ satisfying the first equation in (4.46) is called a Jacobi
field along γ.
Proof. Write γ(λ, t) := expp (t(v + λw)) and X := ∂λ γ to all λ and t. Since
γ(λ, 0) = p for all λ we have X(λ, 0) = 0 and
d
∇t X(λ, 0) = ∇t ∂λ γ(λ, 0) = ∇λ ∂t γ(λ, 0) = v + λw = w.
dλ
Moreover, ∇t ∂t γ = 0 and hence
∇t ∇t X = ∇t ∇t ∂λ γ
= ∇t ∇λ ∂t γ − ∇λ ∇t ∂t γ
= R(∂t γ, ∂λ γ)∂t γ
= R(∂t γ, X)∂t γ.
Proof of Theorem 4.53. The proofs (i) =⇒ (ii) =⇒ (iii) =⇒ (i) =⇒ (iv)
are as before; the reader might note that when L(γ) ≤ r we also have
L(γ 0 ) ≤ r for any development so that there are plenty of developments
with γ : [0, 1] → Ur and γ 0 : [0, 1] → Ur0 . The proof that (iv) implies (i) is
a little different since (iv) here is somewhat weaker than (iv) of the global
theorem: the equation Φ∗ R = R0 is only assumed for certain developments.
Hence assume (iv) and define φ : Ur → Ur0 by
is flat.
Exercise 4.62. For b ≥ a > 0 and c ≥ 0 define M ⊂ C3 by
for every t. We assume that u̇(t) 6= 0 for every t so that u(t) and u̇(t) are
linearly independent. Show that
M = {q + tv | q ∈ Γ, t ∈ R}
where H and Γ are as in (i) and v is a fixed vector not parallel to H. (This
is a cone with the cone point p at infinity.)
(iii) The tangent developable to a space curve γ : R → R3 , i.e.
where γ̇(t0 ) and γ̈(t0 ) are linearly independent and ε > 0 is sufficiently small.
(iv) The normal developable to a space curve γ : R → R3 , i.e.
Remark 4.68. The proof of Theorem 4.70 below will show that, if M is
locally symmetric, the isometry φ : Ur (p, M ) → Ur (p, M ) with φ(p) = p and
dφ(p) = −id exists whenever 0 < r ≤ inj(p).
Corollary 4.71. Let M and M 0 be locally symmetric spaces and fix two
points p0 ∈ M and p00 ∈ M 0 , and let Φ0 : Tp0 M → Tp00 M 0 be an orthogonal
linear isomorphism. Let r > 0 be less than the injectivity radius of M at p0
and the injectivity radius of M 0 at p00 . Then the following holds.
(i) There is an isometry φ : Ur (p0 , M ) → Ur (p00 , M 0 ) with φ(p0 ) = p00 and
dφ(p0 ) = Φ0 if and only if Φ0 intertwines R and R0 :
(Φ0 )∗ Rp0 = Rp0 0 . (4.55)
0
such that
(∇R)(X)(X1 , X2 )Y = ∇X R(X1 , X2 )Y − R(∇X X1 , X2 )Y
(4.56)
− R(X1 , ∇X X2 )Y − R(X1 , X2 )∇X Y
defined by the right hand side of equation (4.56), is multi-linear over the
ring of functions F (M ). Hence it follows as in Remark 4.22 that ∇R is well
defined, i.e. that the right hand side of (4.56) at p ∈ M depends only on the
tangent vectors X(p), X1 (p), X2 (p), Y (p).
Remark 4.76. Let γ : I → M be a smooth curve on an interval I ⊂ R and
X1 , X2 , Y ∈ Vect(γ)
(ii) The covariant derivative of the Riemann curvature tensor satisfies the
second Bianchi identity
(∇X R)(Y, Z) + (∇Y R)(Z, X) + (∇Z R)(X, Y ) = 0. (4.59)
Proof. We prove (i). Let v1 , v2 , w ∈ Tp M and choose parallel vector fields
X1 , X2 , Y ∈ Vect(γ) along γ satisfying the initial conditions X1 (0) = v1 ,
X2 (0) = v2 , Y (0) = w. Thus
X1 (t) = Φγ (t, 0)v1 , X2 (t) = Φγ (t, 0)v2 , Y (t) = Φγ (t, 0)w.
Then the last three terms on the right vanish in equation (4.57) and hence
(∇v R)(v1 , v2 )w = ∇(R(X1 , X2 )Y )(0)
d
= Φ R (X1 (t), X2 (t))Y (t)
dt t=0 γ(0,t) γ(t)
d
= Φ R (Φγ (t, 0)v1 , Φγ (t, 0)v2 )Φγ (t, 0)w
dt t=0 γ(0,t) γ(t)
d
= Φγ (0, t)∗ Rγ(t) (v1 , v2 )w.
dt t=0
Here the second equation follows from Theorem 3.4. This proves (i).
We prove (ii). Choose a smooth function γ : R3 → M and denote by
(r, s, t) the coordinates on R3 . If Y is a vector field along γ we have
(∇∂r γ R)(∂s γ, ∂t γ)Y = ∇r R(∂s γ, ∂t γ)Y − R(∂s γ, ∂t γ)∇r Y
−R(∇r ∂s γ, ∂t γ)Y − R(∂s γ, ∇r ∂t γ)Y
= ∇r (∇s ∇t Y − ∇t ∇s Y ) − (∇s ∇t − ∇t ∇s ) ∇r Y
+R(∂t γ, ∇r ∂s γ)Y − R(∂s γ, ∇t ∂r γ)Y.
Permuting the variables r, s, t cyclically and taking the sum of the resulting
three equations we obtain
(∇∂r γ R)(∂s γ, ∂t γ)Y + (∇∂s γ R)(∂t γ, ∂r γ)Y + (∇∂t γ R)(∂r γ, ∂s γ)Y
= ∇r (∇s ∇t Y − ∇t ∇s Y ) − (∇s ∇t − ∇t ∇s ) ∇r Y
+∇s (∇t ∇r Y − ∇r ∇t Y ) − (∇t ∇r − ∇r ∇t ) ∇s Y
+∇t (∇r ∇s Y − ∇s ∇r Y ) − (∇r ∇s − ∇s ∇r ) ∇t Y.
The terms on the right hand side cancel out. This proves the theorem.
4.6. SYMMETRIC SPACES 211
Proof of Theorem 4.70. We prove that (iii) implies (i). This follows from
the local Cartan–Ambrose–Hicks Theorem 4.53 with
p00 = p0 = p, Φ0 = −id : Tp M → Tp M.
This isomorphism satisfies
(Φ0 )∗ Rp = Rp .
Hence it follows from the discussion in the beginning of this section that
Φ∗ R = R 0
for every development (Φ, γ, γ 0 ) of M along itself satisfying
γ(0) = γ 0 (0) = p, Φ(0) = −id.
Hence, by the local C-A-H Theorem 4.53, there is an isometry
φ : Ur (p, M ) → Ur (p, M )
satisfying
φ(p) = p, dφ(p) = −id
whenever 0 < r < inj(p, M ).
We prove that (i) implies (ii). By Theorem 4.30 (Theorema Egregium),
every isometry φ : M → M 0 preserves the Riemann curvature tensor and
covariant differentiation, and hence also the covariant derivative of the Rie-
mann curvature tensor, i.e.
φ∗ (∇R) = ∇0 R0 .
Applying this to the local isometry φ : Ur (p, M ) → Ur (p, M ) we obtain
∇dφ(q)v R φ(q) (dφ(q)v1 , dφ(q)v2 ) = dφ(q) (∇v R) (v1 , v2 )dφ(q)−1 .
∂ψ
Ei (x) := (x) ∈ Tψ(x) M, x ∈ Ω, i = 1, . . . , m,
∂xi
be the local frame of the tangent bundle determined by this coordinate
chart. Let Γkij : Ω → R denote the Christoffel symbols and Rijk
` : Ω → R the
Given i, j, k, ` ∈ {1, . . . , m} we can express the vector field (∇Ei R)(Ej , Ek )E`
along ψ for each x ∈ Ω as a linear combination of the basis vectors Ei (x).
This gives rise to functions ∇i Rjk` ν : Ω → R defined by
X
ν
(∇Ei R)(Ej , Ek )E` =: ∇i Rjk` Eν . (4.60)
ν
K(p) := K(p, Tp M ).
This follows from equation (4.36) in the proof of Theorem 4.37 which holds
in all dimensions. In particular, when M = S m , we have ν(p) = p and hence
K(p, E) = 1 for all p and E. For a sphere of radius r we have ν(p) = p/r
and hence K(p, E) = 1/r2 .
4.7. CONSTANT CURVATURE 215
Example 4.89. Let G ⊂ O(n) be a Lie subgroup equipped with the Rie-
mannian metric
hv, wi := trace(v T w)
for v, w ∈ Tg G ⊂ Rn×n . Then, by Example 4.27, the sectional curvature of
G at the identity matrix 1l is given by
1
K(1l, E) = |[ξ, η]|2
4
for every 2-dimensional linear subspace E ⊂ g = Lie(G) = T1l G with an
orthonormal basis ξ, η.
Exercise 4.90. Let E ⊂ Tp M be a 2-dimensional linear subspace, let r > 0
be smaller than the injectivity radius of M at p, and let N ⊂ M be the
2-dimensional submanifold given by
N := expp ({v ∈ E | |v| < r}) .
Show that the sectional curvature K(p, E) of M at (p, E) agrees with the
Gauss curvature of N at p.
Exercise 4.91. Let p ∈ M ⊂ Rn and let E ⊂ Tp M be a 2-dimensional lin-
ear subspace. For r > 0 let L denote the ball of radius r in the (n − m + 2)
dimensional affine subspace of Rn through p and parallel to the vector sub-
space E + Tp M ⊥ :
n o
L = p + v + w | v ∈ E, w ∈ Tp M ⊥ , |v|2 + |w|2 < r2 .
Proof. The “only if” statement follows immediately from the definition with
v1 = v2 = u and v2 = v3 = v. To prove the converse, we assume that M has
constant curvature k. Fix a point p ∈ M and define the multi-linear map
Q : Tp M 4 → R by
Q(v1 , v2 , v3 , v4 ) := hRp (v1 , v2 )v3 , v4 i − k hv1 , v4 ihv2 , v3 i − hv1 , v3 ihv2 , v4 i .
Q(p, q) := −x0 y0 + x1 y1 + · · · + xm ym
Tp Hm = v ∈ Rm+1 | Q(p, v) = 0 .
We remark that the manifold Hm does not quite fit into the extrinsic
framework of most of this manuscript as it is not exhibited as a submanifold
of Euclidean space but rather of “pseudo-Euclidean space”: the positive
definite inner product hv, wi of the ambient space Rm+1 is replaced by a
nondegenerate symmetric bilinear form Q(v, w). However, all the theory
developed thus far goes through (reading Q(v, w) for hv, wi) provided we
make the additional hypothesis (true in the example M = Hm ) that the
first fundamental form gp = Q|Tp M is positive definite. For then Q|Tp M is
nondegenerate and we may define the orthogonal projection Π(p) onto Tp M
as before. The next lemma summarizes the basic observations; the proof is
an exercise in linear algebra.
Lemma 4.105. Let Q be a symmetric bilinear form on a vector space V
and for each subspace E of V define its orthogonal complement by
E ⊥Q := {w ∈ V | Q(u, v) = 0 ∀v ∈ E} .
V = E ⊕ E ⊥Q ⇐⇒ E ∩ E ⊥Q = {0},
∇X(t) = Π(γ(t))Ẋ(t)
= Ẋ(t) + Q(Ẋ(t), γ(t))γ(t)
= Ẋ(t) − Q(X(t), γ̇(t))γ(t).
4.7. CONSTANT CURVATURE 221
where the first summand on the right is tangent to Hm and the second
summand is Q-orthogonal to Tp Hm . Hence
Ẋ = ∇X + hγ (γ̇)X
extends to the present setting. The reader may verify that the operators
∇ : Vect(γ) → Vect(γ) thus defined satisfy the axioms of Remark 2.72 and
hence define the Levi-Civita connection on Hm .
Now a smooth curve γ : I → Hm is a geodesic if and only if it satisfies
the equivalent conditions
d
Q(γ̇, γ̇) = 2Q(γ̈, γ̇) = 0
dt
because γ̈ is a scalar multiple of γ, and so Q(γ̇, γ̇) is constant. Let p ∈ Hm
and v ∈ Tp Hm be given with
Q(v, v) = 1.
222 CHAPTER 4. CURVATURE
where
et + e−t et − e−t
cosh(t) := , sinh(t) := .
2 2
In fact we have γ̈(t) = γ(t) ⊥Q Tγ(t) Hm . It follows that the geodesics
exist for all time and hence Hm is geodesically complete. Moreover, being
diffeomorphic to Euclidean space, Hm is connected and simply connected.
It remains to prove that Hm has constant sectional curvature k = −1.
To see this we use the Gauss–Codazzi formula in the hyperbolic setting, i.e.
Exercise 4.106. Prove that the pullback of the metric on Hm under the
diffeomorphism p
Rm → Hm : x 7→ 1 + |x|2 , x
is given by s
hx, ξi2
|ξ|x = |ξ|2 −
1 + |x|2
or, equivalently, by the metric tensor,
xi xj
gij (x) = δij − (4.74)
1 + |x|2
for x = (x1 , . . . , xm ) ∈ Rm .
4.7. CONSTANT CURVATURE 223
Exercise 4.107. The Poincaré model of hyperbolic space is the open unit
disc Dm ⊂ Rm equipped with the Poincaré metric
2 |η|
|η|y =
1 − |y|2
4δij
gij (y) = 2 , y ∈ Dm . (4.75)
2
1 − |y|
Interpret this map as a stereographic projection from the south pole (−1, 0).
x 2y p 1 + |y|2
y=p , x= , 1 + |x|2 = .
1 + |x|2 + 1 1 − |y|2 1 − |y|2
expp : Tp Hm → Hm
is given by
p
p sinh Q(v, v)
expp (v) = cosh Q(v, v) p + p v (4.76)
Q(v, v)
Then the tuple M, M 0 , p0 , p00 , Φ0 satisfies condition (i) of the weak version of
Theorem 4.47. Hence this tuple also satisfies condition (iv) of Theorem 4.47.
Since M and M 0 are connected, simply connected, and complete we may
apply the strong version of Theorem 4.47 to obtain an isometry ψ : M → M 0
satisfying
ψ(p0 ) = p00 , dψ(p0 ) = Φ0 .
Since every isometry is also a local isometry and M is connected it follows
from Lemma 4.10 that φ(p) = ψ(p) for all p ∈ M . Hence φ is an isometry,
as required.
226 CHAPTER 4. CURVATURE
Remark 4.114. Refining the argument in the proof of Lemma 4.113 one can
show that a local isometry φ : M → M 0 must be surjective whenever M is
complete and M 0 is connected. None of these assumptions can be removed.
(Take an isometric embedding of a disc in the plane or an embedding of
a complete space M into a space with two components, one of which is
isometric to M .)
Likewise, one can show that a local isometry φ : M → M 0 must be
injective whenever M is complete and connected and M 0 is simply connected.
Again none of these asumptions can be removed. (Take a covering R → S 1 ,
or a covering of a disjoint union of two isometric complete simply connected
spaces onto one copy of this space, or some noninjective immersion of a disc
into the plane and choose the pullback metric on the disc.)
ξ(0) = 0, ˙
ξ(0) 6= 0, ξ(t) 6= 0 ∀ t > 0.
f (t) := |ξ(t)|
˙
hη, ξi
f˙ = .
|η|
ξ(0) = 0, ¨ = 0.
ξ(0)
Prove that there are constant ε > 0 and c > 0 such that, for all t ∈ R:
˙
Hint: Write ξ(t) = tv + η(t) and ξ(t) = v + η̇(t) with η(t) = O(t3 ) and
2
η̇(t) = O(t ). Show that the terms of order 2 and 4 cancel in the Taylor
expansion at t = 0.
4.8. NONPOSITIVE SECTIONAL CURVATURE 227
Proof of Theorem 4.112. We prove that (i) implies (ii). Let p ∈ M and
v, v̂ ∈ Tp M be given. Assume without loss of generality that v̂ 6= 0 and
define γ : R → M and X ∈ Vect(γ) by
∂
γ(t) := expp (tv), X(t) := expp (t(v + λv̂)) ∈ Tγ(t) M (4.78)
∂λ λ=0
for t ∈ R. Then
It follows from Exercise 4.115 with ξ(t) := Φγ (0, t)X(t) that the function
[0, ∞) → R : t 7→ |X(t)| is smooth and
d
|X(t)| = |∇X(0)| = |v̂| .
dt t=0
Moreover, for t > 0, we have
d2 d hX, ∇Xi
2
|X| =
dt dt |X|
|∇X|2 + hX, ∇∇Xi hX, ∇Xi2
= −
|X| |X|3 (4.81)
|X|2 |∇X|2 − hX, ∇Xi2 hX, R(γ̇, X)γ̇i
= +
|X|3 |X|
≥ 0.
Here the third equation follows from the fact that X is a Jacobi field along γ,
and the last inequality follows from the nonpositive sectional curvature con-
dition in (i) and from the Cauchy–Schwarz inequality. Thus the second
derivative function [0, ∞) → R : t 7→ |X(t)| − t |v̂| is nonnegative; so its first
derivative is nondecreasing and it vanishes at t = 0; thus
|X(t)| − t |v̂| ≥ 0
We prove that (ii) implies (i). Assume, by contradiction, that (ii) holds
but there is a point p ∈ M and a pair of vectors v, v̂ ∈ Tp M such that
X(0) = 0, ∇X(0) = v̂ 6= 0.
that there is a constant c > 0 such that, for t > 0 sufficiently small, we have
the inequality
Moreover,
for t sufficiently small, where the second inequality follows from (4.82).
Hence, by (4.81), we have
Integrating this inequality over an interval [0, t] with ct2 < εδ 2 we get
d d
|X(t)| < |X(t)| = |∇X(0)|
dt dt t=0
Here the third inequality follows from (ii). This shows that (ii) implies (iii).
230 CHAPTER 4. CURVATURE
We prove that (iii) implies (ii). Fix a point p ∈ M and a tangent vector
v ∈ Tp M and denote
q := expp (v).
φ := exp−1
q ◦ expp : Tp M → Tq M.
and hence
|φ(v + v̂))| = d(expp (v), expp (v + v̂)) ≥ |v̂| ,
Thus we have proved that (iii) implies (ii). This completes the proof of the
theorem.
4.8. NONPOSITIVE SECTIONAL CURVATURE 231
P := P ∈ Rn×n | P T = P > 0
hS1 , S2 iP := trace S1 P −1 S2 P −1
(4.84)
for P ∈ P and S1 , S2 ∈ S = TP P.
g∗ P := gP g T (4.85)
P0 := {P ∈ P | det(P ) = 1}
R(S, T )A = ∇s ∇t A − ∇t ∇s A
for every matrix X ∈ Rn×n . Hence hS, RP (S, T )T iP ≤ 0 for all P ∈ P and
all S, T ∈ S . This proves Step 4.
Step 5. P is a symmetric space.
Given A ∈ P define the map φ : P → P by
φ(P ) := AP −1 A.
This map is a diffeomorphism, fixed the matrix A = φ(A), and satisfies
dφ(P )S = −AP −1 SP −1 A
for P ∈ P and S ∈ S . Hence dφ(A) = −id and, for all P ∈ P and all
S ∈ S , we have
(dφ(P )S)φ(P )−1 = −AP −1 SA−1
and therefore
2 2
|dφ(P )S|2φ(P ) = trace AP −1 SA−1 = trace P −1 S = |S|2P .
Remark 4.123. The space P can be identified with the quotient space
GL(n, R)/O(n) via polar decomposition.
Remark 4.124. Theorem 4.117 carries over verbatim to the complex set-
ting. Just replace P by the space H of positive definite Hermitian matrices
H = H ∗ > 0,
[1] Ralph Abraham and Joel W. Robbin, Transversal Mappings and Flows.
Benjamin Press, 1967.
[2] Lars V. Ahlfors and Leo Sario, Riemann Surfaces. Princeton University
Press, 1960.
[4] David B.A. Epstein, Foliations with all leaves compact. Annales de
l’institute Fourier 26 (1976), 265–282.
[8] John W. Milnor, Topology from the Differentible Viewpoint. The Uni-
versity Press of Vorginia 1969.
[10] Mary Ellen Rudin, A new proof that metric spaces are paracompact.
Proceedings of the AMS 20 (1969), 603.
237
Index
238
INDEX 239
surface, 188
symmetric space, 207
unitary group, 40
Urysohn metrization theorem, 65
vector bundle, 48
section, 48
vector field, 23, 71
along a curve, 89
along a map, 128
basic, 148
normal, 139
parallel, 90
with compact support, 28
Veronese embedding, 70
vertical
tangent space, 145