DiffTop Lecture Notes
DiffTop Lecture Notes
L ECTURE NOTES
G EREON Q UICK
L AST UPDATE: 02 FEBRUARY 2024
Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2 Smooth manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.1 Topology in ℝ𝑛 14
3.3 Embeddings 69
4.1 Submersions 75
3
4.5 Exercises and more examples 93
6 Transversality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
14 Orientation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 284
15.4 Linking number and the Hopf invariant via exercises 320
16 Hopf Degree Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 323
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 448
Preface
These are the lecture notes accompanying a one-semester course. The aim of the course is to
pick up the readers with a background in multi-variable calculus and linear algebra and to take
them on a direct path to the fascinating world of differential topology.
In order to make the ideas and techniques as accessible as possible the arguments are ex-
plained in great detail. Thus instead of aiming for the most elegant and shortest argument we
often take a longer walk and pick up every flower along the way by hand. We hope that the
amount of detail will make it easier for a relatively unexperienced mathematician to witness
and understand what is happening and to appreciate how some relatively straight-forward ideas
lead to exciting and deep results.
This has the consequence that these notes are not brief and do not just summarise the main
ideas and theorems in differential topology. For a brief and comprehensive account we recom-
mend the excellent books by Milnor [13] and Guillemin–Pollack [5] on which these notes are
based on.
The chapters are accompanied by a list of exercises. We highly recommend to work through
all the exercises. At the end of the book there are suggestions for solutions to all exercises. For
some it may be tempting to glimpse at the solutions before trying to solve the problems, other
will rather try on their own first. We think that the readers may decide for themselves how they
prefer to learn new mathematics.
We are grateful to many colleagues and students who helped to improve these notes. In
particular, we would like to thank Torgeir Aambø, Johanna Aigner, Anders Alexander Ander-
sen, Torgeir Aambø, Daria Barjaktarevic, Denis Bergmann, Erlend Bergtun, Dagmar Coelle,
Eivind Xu Djurhuus, Robin Fissum, Sigurd Gaukstad, Knut Bjarte Haus, Håvard Skjetne Lille-
heie, Abigail Linton, Martin Löcsei, Chileshe Mwamba, Trygve Poppe Oldervoll, Leona Ro-
denkirchen, Sindre Strøm, Marius Thaule, Inga Maria Tillmann, Melvin Vaupel, and Glen
Matthew Wilson.
9
1. Introduction
∙ Geometry vs topology
Classical geometers were interested in measuring angles and distances. For example, two
things are the ‘same’ – more mathematically speaking they are congruent – in classical geom-
etry1 if you can transform one into the other by moving or rotating them. A first variation to
allow flexibility, is projective geometry: Two things are considered the same if they are both
views of the same object. For example, an ellipse and a circle can be projectively equivalent:
for one can look like the other when you look at them from the right perspective. In topology,
we take this idea a big step further and consider two objects the same if we can continuously
transform one into the other. For example, a triangle is equivalent to a circle and both are
equivalent to a square. In differential topology, the part we study in this class, we only allow
smooth transformations. Then square and circle are different, because a square has vertices
which are not smooth points while the circle does not. As a slogan we may say that differential
topology is the study of properties that do not change under smooth transformations. Smooth-
ness of an object is something we check locally, i.e., by looking at every point and a small
neighborhood around it. The smoothest space we know is Euclidean space. This leads to the
following first working definition of what kind of objects we are going to study:
𝕊1 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑥2 + 𝑦2 = 1} ⊂ ℝ2 .
The circle is something one-dimensional. But how do we describe that precisely when
we need two coordinates to describe its points. One way is that if we zoom in at any
point it just looks like a bended line segment. Looking very closely it even looks almost
like a straight line segment. See Figure 1.1. So, locally, and we will make precise what
that means very soon, the circle looks like a segment of ℝ1 . The unit circle 𝕊1 , more
generally, the 𝑛-dimensional sphere
10
Chapter 1. Introduction 11
Figure 1.1: Each segment of the circle looks like an open interval in ℝ.
While not being precise at all, our working definition may sound at the same time quite strict.
In a small neighborhood every point looks the same. How can this lead to interesting objects?
We will see that there is in fact a universe of examples of smooth manifolds of very different
kind. The point is that even though all points look pretty much the same locally, manifolds may
look very different globally. As a first simple example consider the two-dimensional sphere
and the two-dimensional torus. They look the same locally, but they are quite different globally:
the torus has a whole in the middle while the sphere does not. In fact, one of the main goals in
topology is to classify all types of manifolds.
A key method to analyse spaces and maps, or more generally any kind of complicated object
in mathematics, is to attach to them interesting invariants. These are usually numbers, groups,
vector spaces, …, any sort of algebraic objects which are much easier to understand and to
distinguish than the spaces and maps we started with. The name invariant comes from the
fact that we require that the algebraic objects, for example numbers 𝑖(𝑋) and 𝑗(𝑓 ) we attach to
each object 𝑋 and each map 𝑓 , does not change under the geometric transformations we allow.
For smooth manifolds, the transformations we allow are diffeomorphisms, or, later on, smooth
homotopy equivalences. For example, assume we would like to understand two manifolds 𝑋
and 𝑌 . Let us assume they are defined in complicated ways so that it is not easy at all to check
if they are maybe the same objects after all. If we can calculate the invariants 𝑖(𝑋) and 𝑖(𝑌 ) and
get 𝑖(𝑋) ≠ 𝑖(𝑌 ), then 𝑋 and 𝑌 could not have been the same to begin with. If 𝑋 and 𝑌 were
vector spaces, then we are very familiar with the idea of an invariant, namely the dimension.
That is, if dim 𝑋 ≠ dim 𝑌 , then we know that there is no linear isomorphism between 𝑋 and
𝑌 . If dim 𝑋 = dim 𝑌 , then we can construct an isomorphism 𝑋 ≅ 𝑌 by choosing bases for
both 𝑋 and 𝑌 . In fact, a smooth manifold has a tangent space at every point. The tangent
space is a real vector space, and it will be one of the main tools for our study. However, tangent
spaces and the dimension are not sufficient to study smooth manifolds and we will need more
sophisticated tools and invariants. For example, we will develop intersection theory and will
show that the smooth manifolds 𝕊𝑛 , the 𝑛-sphere, and ℝP𝑛 , the 𝑛-dimensional real projective
space, are not diffeomorphic for 𝑛 ≥ 2. In fact, we will show that they are not even homotopy
equivalent for 𝑛 ≥ 2.
12
∙ dimension
∙ Brouwer degree
∙ intersection number
∙ Euler characteristic
After developing the theory of smooth manifolds with and without boundaries we will prove
the following famous theorem: Let 𝔻𝑛 be the 𝑛-dimensional unit disc
𝔻𝑛 = {(𝑥1 , … , 𝑥𝑛 ) ∈ ℝ𝑛 ∶ 𝑥21 + ⋯ + 𝑥2𝑛 ≤ 1}.
This theorem has a lot of very important applications. In particular, one way to apply it is
to first transform a problem into finding a solution 𝑥0 of an equation of the form 𝑓 (𝑥) = 𝑥 and
to use Brouwer’s theorem to show that such a solution exists. It will require a good amount of
exciting work to prove the theorem.
Then we will introduce the most important invariant in differential topology: the Brouwer
degree of a smooth map. It will turn out to be an extremely powerful tool. We will first introduce
a mod 2-version of the degree and use it for example to show the following important and deep
result of Hopf:
Theorem 1.4 (A Hopf invariant one primer) There is a smooth map 𝕊3 → 𝕊2 which
is not homotopic to a constant map.
The map 𝕊3 → 𝕊2 of the theorem is an example of Hopf fibration which only exists in
certain dimensions. We will use this important map as a running example throughout these
notes. The way we talked about invariants so far may suggest that they only allow to show that
something does not exist. However, there are also some very important situations where an
equality of invariants implies that spaces are equivalent. A very famous example of such a case
is the following theorem which uses the integer-valued Brouwer degree which we will study in
this course after introducing orientations:
A vector field on a smooth manifold 𝑋 is a map which assigns to each point 𝑥 a tangent
vector to 𝑋 at 𝑥. A zero of a vector field is a point to which the field assigns the zero vector.
The Brouwer degree allows us to define the index of a zero which is an integer that characterises
the geometry of the vector field around the zero. The sum of the indices of the zeros of a vector
field a priori depends on the smooth structure of the manifold. On the other hand, the manifold
𝑋 is also equipped with an integer, the Euler characteristic 𝜒(𝑋). This is a purely topological
invariant which means it does not change if we transform 𝑋 continuously and it does not depend
on the fact that 𝑋 is not just a space but a smooth manifold. One of the highlights of this course
is the proof of the following famous result which relates the geometry of vector fields on a
smooth manifold to its Euler characteristic. This is a first example of an index theorem which
is part of a fascinating and very influential area in mathematics:
Before we are able to prove these exciting results we set out to develop the basic theory of
smooth manifolds in the following chapters.
2. Smooth manifolds
2.1 Topology in ℝ𝑛
Recall from Calculus that the norm of a vector 𝑥 = (𝑥1 , … , 𝑥𝑛 ) ∈ ℝ𝑛 is defined as the non-
negative real number √
|𝑥| = 𝑥21 + 𝑥22 + ⋯ + 𝑥2𝑛 ∈ ℝ≥0 .
The norm defines a distance between two points 𝑥, 𝑦 in ℝ𝑛 by taking the norm |𝑥 − 𝑦| of the
difference of 𝑥 and 𝑦. For any 𝑛, the space ℝ𝑛 together with this norm is called 𝑛-dimensional
Euclidean space. It is a topological space in the following way:
∙ Let 𝑥 be a point in ℝ𝑛 and 𝑟 > 0 a real number. We define the 𝑛-dimensional open
ball with radius 𝑟 around 𝑥
∙ Arbitrary unions of open sets are open and finite intersections of open sets are
open.
∙ Familiar examples of open sets in ℝ are open intervals, e.g., the open interval (−2, 1).
∙ The cartesian product of 𝑛 open intervals (an open ‘rectangle’) is open in ℝ𝑛 .
∙ Similarly, closed intervals are examples of closed sets in ℝ, .e.g., the closed interval
[−2, 1].
14
Chapter 2. Smooth manifolds 15
∙ The empty set ∅ and ℝ𝑛 itself are by definition both open and closed sets.
∙ Not every subset of ℝ𝑛 is open or closed. There are a lot of subsets which are neither
open nor closed. For example, the interval (0, 1] in ℝ or the product of an open and a
closed interval in ℝ2 .
Definition 2.1 (Relative open sets) Let 𝑋 be a subset in ℝ𝑛 . Then we say that 𝑉 ⊆ 𝑋
is open in 𝑋 (or relatively open) if there is an open subset 𝑈 ⊆ ℝ𝑛 which is open in ℝ𝑛
with 𝑉 = 𝑈 ∩ 𝑋. More concretely: 𝑉 ⊆ 𝑋 is open in 𝑋 if and only if for every point
𝑥 ∈ 𝑉 there exists a real number 𝜀 > 0 such that 𝔹𝑛𝜀 (𝑥) ∩ 𝑋 ⊆ 𝑉 . See Figure 2.3.
In order to decide whether a subspace is open or closed it is very important to take the
ambient space into account:
Remark 2.2 (Warning) It is important to note that the property of being an open
subset very much depends on the bigger space we are looking at. Hence open always
refers to being open in some given space. For example, a set can be open in a space
𝑋 ⊂ ℝ2 , but not be open in ℝ2 , see Figure 2.4.
16 2.1. Topology in ℝ𝑛
Figure 2.4: The relative open subset of 𝑋 ∩ 𝑈 is open in 𝑋, but is not open in ℝ2 .
Chapter 2. Smooth manifolds 17
Examples:
{ }
1
𝕊2 ∩ 𝔹31 (𝑥0 ) with 𝔹31 (𝑥0 ) = 𝑦 ∈ ℝ3 ∶ |𝑦| < .
2 2 2
In fact, every subset which is open in 𝕊2 and contains 𝑥0 contains a subset if the form
𝕊2 ∩ 𝔹3𝜀 (𝑥0 ) with 𝜀 > 0 sufficiently small.
∙ However, the set 𝕊2 ∩ 𝔹31 (𝑥0 ) is not open in ℝ3 . For there is no three-dimensional open
2
ball 𝔹3𝜀 (𝑥0 ) which is completely contained in 𝕊2 ∩ 𝔹31 (𝑥0 ).
2
Open sets are nice for a lot of reasons. First of all, they provide us with a way to talk about
things that happen close to a point.
Definition 2.4 (Spaces) We establish the following convention for the use of the word
space:
∙ A set together with a topology, i.e., a collection of open sets, is called a topological
space.
∙ From now on, when we talk about a space, we mean a topological space, i.e., a
set together with a specified topology.
Here we observe that the word topology is used in several ways. On the one hand, it is
the name of a whole area in mathematics. On the other hand, it is the name for an additional
structure on a set. We are familiar with this phenomenon: for example, the word algebra
denotes both an area in mathematics and a structure on a set.
The type of maps that preserve open sets, i.e., respect the topology on a set, are called contin-
uous maps:
18 2.1. Topology in ℝ𝑛
Definition 2.5 (Continuous maps: abstract definition) Continuous maps are charac-
terized as follows:
Remark 2.6 Just in case you have heard of categories before: topological spaces form
a category with morphisms given by continuous maps.
We are familiar with continuous maps from Calculus. The 𝜀-𝛿-characterization of continuity
looks as follows:
In our new fancy notation, we can reformulate the last condition as follows: for every
𝜀 > 0, there is a 𝛿 > 0 such that
Proof of Lemma 2.7: First, assume 𝑓 satisfies 𝜀-𝛿-continuity. Let 𝑈 ⊆ ℝ𝑚 be an open set
in ℝ𝑚 . If 𝑓 −1 (𝑈 ) is empty, it is open by definition. So let 𝑎 ∈ 𝑓 −1 (𝑈 ) be a point in 𝑓 −1 (𝑈 ).
The fact that 𝑈 is open means that there is an 𝜀 > 0 such that 𝔹𝑚 𝜖 (𝑓 (𝑎)) ⊆ 𝑈 . Given this 𝜀, the
fact that 𝑓 is continuous means that
But
𝑓 (𝑥) ∈ 𝔹𝑚
𝜀 (𝑓 (𝑎)) ⊆ 𝑈
which implies 𝑓 (𝑥) ∈ 𝑈 and hence 𝑥 ∈ 𝑓 −1 (𝑈 ). Thus, for every 𝑥 ∈ 𝔹𝑛𝛿 (𝑎) ∩ 𝐴, we have
𝑥 ∈ 𝑓 −1 (𝑈 ). In other words,
𝔹𝑛𝛿 (𝑎) ∩ 𝐴 ⊆ 𝑓 −1 (𝑈 ).
Since 𝑎 was an arbitrary point in 𝑓 −1 (𝑈 ), this shows that 𝑓 −1 (𝑈 ) is open in 𝐴.
Next we specify the maps which have inverses in the category of topological spaces:
Examples:
𝕊1 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑥2 + 𝑦2 = 1} ⊂ ℝ2
We know that 𝑓 is bijective and continuous from calculus and trigonometry. However, [ the )
function 𝑓 is not continuous. For example, the image of the open subset 𝑈 = 0, 41
−1
[ )
under 𝑓 , i.e., the subset 𝑓 (𝑈 ), is not open in 𝕊1 . Let us first remark that 𝑈 = 0, 14 is
( )
indeed open in [0, 1), since for examples 𝑈 = [0, 1) ∩ − 14 , 14 . Now we look at the point
𝑓 (0) = (1, 0) ∈ 𝕊1 . Since 0 ∈ 𝑈 , 𝑓 (0) is a point in 𝑓 (𝑈 ). However, for every open ball
𝔹2𝜀 ((1, 0)) ⊂ ℝ2 , the intersection 𝔹2𝜀 ((1, 0)) ∩ 𝕊1 contains points with strictly negative
𝑦-coordinate. In particular, 𝔹2𝜀 ((1, 0)) ∩ 𝕊1 contains points which are not in 𝑓 (𝑈 ), i.e.,
Alternatively, we could observe that 𝑓 −1 (𝔹2𝜀 ((1, 0)) ∩ 𝕊1 ) ⊂ [0, 1) contains points which
are close to 1 in [0, 1) and therefore do not lie in 𝑈 , i.e.,
𝑉 ∩ 𝕊1 = 𝑓 (𝑈 ).
Figure 2.5: Wrapping the interval around the circle via 𝑓 is a bijection, but not a homeomor-
phism, since 𝑓 (𝑈 ) may not be open in 𝕊1 .
One could characterize Topology as the study of properties which are preserved under homeo-
morphisms. Hence we may call a property that is preserved under homeomorphisms a topolog-
ical property. We often refer such a topological property as a global property, since we cannot
check that a space has it just by looking at small neighborhoods of all points. Many interesting
phenomena in differential topology require an interplay of local and global properties. We will
now recall some such topological properties that will play an important role for our study of
smooth manifolds. We will recall here what it means for a space to be
∙ compact,
∙ connected,
∙ path-connected.
∙ Compactness
𝑍 = 𝑈𝑖1 ∪ … ∪ 𝑈𝑖𝑛 .
Recall that a subset 𝑍 ⊂ ℝ𝑛 is called bounded if there is some, possibly big, 𝑟 > 0 such
that 𝑍 ⊂ 𝐵𝑟 (0). For subspaces in Euclidean space we then have the following important char-
acterization of compact subsets.
∙ Theorem 2.11 tells us that open subsets in ℝ𝑛 cannot be compact. For example, open
balls in ℝ𝑛 are never compact.
∙ Compactness makes a lot of things easier. On the one hand, it makes it easier to keep
track of things, as we can cover the space with finitely many open sets. On the other
hand, the previous theorem tells us that points cannot lie too far off, at least for subspaces
in ℝ𝑛 , since compact spaces in ℝ𝑛 are bounded. Hence we can think of compactness as
a general condition which helps to avoid trouble.
Lemma 2.12 (Compact and discrete implies finite) Every compact and discrete
subset 𝑆 of ℝ𝑛 is finite.
Proof: Assume 𝑆 was not finite. Compact subsets of ℝ𝑛 are bounded. Hence there is an
𝜀 > 0 such that 𝑆 is contained in the 𝑛-dimensional box with edges of length 𝜀 and center 0.
Divide this box into 2𝑛 many 𝑛-dimensional boxes of equal size. The length of their edges is
𝜀∕2. If 𝑆 was infinite there must be at least one small box which still contains infinitely many
points of 𝑆. We take this box and divide it again into 2𝑛 many 𝑛-dimensional boxes of equal
size. The length of their edges is now 𝜀∕4. Again, if 𝑆 was infinite there must be at least one of
the smaller boxes which still contains infinitely many points of 𝑆. By repeating the argument,
we see that we can find an infinite sequence of points in 𝑆 which converges. Since 𝑆 is also
closed, any convergent infinite sequence of points in 𝑆 must have a limit in 𝑆. Call this limit
𝑠. But then the subset {𝑠} would not be open in 𝑆, since every open subset of ℝ𝑛 containing 𝑠
would also contain other points of 𝑆. Hence 𝑆 would not be discrete.
∙ Connectedness
22 2.1. Topology in ℝ𝑛
Definition 2.13 (Connected spaces) Recall that a topological space 𝑋 is called con-
nected if 𝑋 cannot be written as the union of two nonempty disjoint open subsets; or
equivalently, if 𝑋 and ∅ are the only subsets which are both open and closed in 𝑋.
∙ Familiar examples of connected spaces are intervals in ℝ. For example, the closed inter-
val [0, 1] is connected.
∙ If 𝑋 is not connected, it has subsets 𝑍𝛼 ⊂ 𝑋 which are both open and closed. Each such
𝑍𝛼 is called a connected component of 𝑋. Hence 𝑋 can be considered as the possibly
infinite union of its connected components.
∙ The previous argument is often used indirectly in the following way: if 𝑋 and 𝑌 have
the same number connected components, we remove a suitable point 𝑥 ∈ 𝑋 and count
the number of connected components of 𝑋 ⧵ {𝑥} and 𝑌 ⧵ {𝑓 (𝑥)}. If these numbers are
𝑓
different, there cannot be a homeomorphism 𝑋 ←←←→
← 𝑌 . For if such an 𝑓 exists, then the
𝑓
restriction 𝑋 ⧵ {𝑥} ←←←→
← 𝑌 ⧵ {𝑓 (𝑥)} is still a homeomorphism. Continuing this or a similar
process often leads to the desired conclusion.
Proof: Let 𝑠 ∈ 𝑆 be a value of 𝑓 , i.e., 𝑠 = 𝑓 (𝑥) for some 𝑥 ∈ 𝑋. We can write 𝑋 as the
disjoint union of the sets
Since 𝑓 is locally constant, both 𝐴 and 𝐵 are open. For, if 𝑎 ∈ 𝐴, then there is an open
neighborhood 𝑈𝑎 ⊂ 𝐴 with 𝑓 (𝑈𝑎 ) = {𝑠}, i.e. 𝑈𝑎 ⊂ 𝐴. Similarly, if 𝑏 ∈ 𝐵, then there is an
open neighborhood 𝑈𝑏 ⊂ 𝑋 with 𝑓 (𝑈𝑏 ) = {𝑓 (𝑏)}, i.e. 𝑈𝑏 ⊂ 𝐵. But since 𝑋 is connected and
𝐴 ≠ ∅, we must have 𝐴 = 𝑋, and 𝑓 is constant.
∙ Path-connectedness
Chapter 2. Smooth manifolds 23
The criterion for connectedness is elegant to state, but also rather abstract. For example, it
does not tell us if we can walk, i.e., draw a line without interruptions, from one point to another,
as one would intuitively expect for a connected space. This leads to a related and more concrete
property:
Proof: Suppose 𝑋 is path-connected. If 𝑋 was not connected, then there would be two
disjoint nonempty open subsets 𝐴 and 𝐵 with 𝑋 = 𝐴 ∪ 𝐵. Since 𝐴 and 𝐵 are nonempty, we
can choose two points 𝑎 ∈ 𝐴 and 𝑏 ∈ 𝐵. Since 𝑋 is path-connected, there is a continuous map
𝛾 ∶ [0, 1] → 𝑋 with 𝛾(0) = 𝑎 and 𝛾(1) = 𝑏. Hence 0 ∈ 𝛾 −1 (𝐴) ⊂ [0, 1] and 1 ∈ 𝛾 −1 (𝐵) ⊂ [0, 1].
Since 𝐴 and 𝐵 are disjoint and open, the subsets 𝛾 −1 (𝐴) and 𝛾 −1 (𝐵) are disjoint and open in
[0, 1]. Since 𝑋 = 𝐴 ∪ 𝐵, we would have [0, 1] = 𝛾 −1 (𝐴) ∪ 𝛾 −1 (𝐵) which contradicts the fact
that [0, 1] is connected. Hence 𝑋 must be connected.
∙ We will show later that smooth manifolds, however, are connected if and only if they are
path-connected.
∙ But be aware that there are connected spaces which are not path-connected. A standard
example, illustrated in Figure 2.7, is the subspace
|𝑓 (𝑎 + ℎ) − 𝑓 (𝑎) − 𝐿𝑎 (ℎ)|
lim = 0.
ℎ→0 |ℎ|
Note that if such an 𝐿𝑎 exists, it is unique and is the best possible linear approximation of 𝑓 at
𝑎. Moreover, if 𝐿𝑎 exists it can be represented in the standard bases of ℝ𝑛 and ℝ𝑚 , respectively,
𝜕𝑓
by the Jacobian matrix at 𝑎, the 𝑚 × 𝑛-matrix with (𝑖, 𝑗)-entry the partial derivative 𝜕𝑥 𝑖 (𝑎).
𝑗
Recall that a differentiable map is in particular also continuous.
Conversely, if we know that all partial derivatives at 𝑎 exist and are continuous, then 𝑓 is
differentiable at 𝑎. We say that 𝑓 is differentiable if it is differentiable at every point 𝑎 in 𝑈 .
In differential topology we usually require that maps are not just once but infinitely many
times differentiable. In this case, we call them smooth. More precisely, we define:
Examples:
∙ The familiar maps exp, cos, sin and all polynomials are smooth maps from ℝ to ℝ.
𝜕 𝑘 𝑃𝑖
The partial derivative 𝜕𝑥𝑗1 …𝜕𝑥𝑗𝑘
(𝑎) is just the partial derivative of the polynomial 𝑝𝑖 . The
latter always exists and is continuous.
⎛𝑥1 ⎞ ⎛ 2𝑥 𝑥 + 2𝑥 𝑥 ⎞
⎜𝑥 ⎟ 1 3 2 4
𝐹 ∶ ℝ4 → ℝ3 , ⎜ 2 ⎟ → ⎜ 2𝑥2 𝑥3 − 2𝑥1 𝑥4 ⎟ . (2.1)
⎜𝑥3 ⎟ ⎜⎝𝑥2 + 𝑥2 − 𝑥2 − 𝑥2 ⎟⎠
⎝𝑥4 ⎠ 1 2 3 4
To convince ourselves let us calculate some partial derivatives. For example, at a point
𝑎 = (𝑎1 , 𝑎2 , 𝑎3 , 𝑎4 ) ∈ ℝ4 we get
𝜕𝐹1 𝜕 2 𝐹1
(𝑎) = 2𝑎1 , (𝑎) = 0,
𝜕𝑥3 𝜕𝑥3 𝜕𝑥4
𝜕𝐹2 𝜕 2 𝐹2
(𝑎) = −2𝑎4 , (𝑎) = −2,
𝜕𝑥1 𝜕𝑥1 𝜕𝑥4
𝜕𝐹3 𝜕 2 𝐹3
(𝑎) = 2𝑎2 , (𝑎) = 2, …
𝜕𝑥2 𝜕𝑥2 𝜕𝑥2
Now we would like to extend smoothness to maps between arbitrary sets subsets of ℝ𝑛 . But
there is an issue we need to discuss:
Figure 2.8: Smoothness of a map 𝑓 with an arbitrary domain is defined by finding at each point
a smooth map 𝐹 that restrict to 𝑓 on relatively open subset.
Note that smoothness at a point 𝑥 is a local property, i.e., we need to check it only in a
small neighborhood of 𝑥.
⎛𝑥1 ⎞ ⎛ 2𝑥 𝑥 + 2𝑥 𝑥 ⎞
⎜𝑥 ⎟ 1 3 2 4
𝑓 ∶ 𝕊3 → 𝕊2 , ⎜ 2 ⎟ → ⎜ 2𝑥2 𝑥3 − 2𝑥1 𝑥4 ⎟ .
⎜𝑥3 ⎟ ⎜⎝𝑥2 + 𝑥2 − 𝑥2 − 𝑥2 ⎟⎠
⎝𝑥4 ⎠ 1 2 3 4
Chapter 2. Smooth manifolds 27
This map is the restriction of the map 𝐹 defined in (2.1) and hence smooth. Since 𝐹 is
smooth, it remains to check that if 𝑥 = (𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ) is in 𝕊3 ⊂ ℝ4 , then 𝑓 (𝑥) ∈ 𝕊2 ⊂
ℝ2 .1
2.2.3 Diffeomorphisms
Note that every diffeomorphism is a homeomorphism, but not the other way around. Here
are some examples for which it is an exercise to verify the assertions.
Examples:
∙ The map ℝ2 ⧵ {(0, 0)} → ℝ2 ⧵ {(0, 0)}, (𝑥, 𝑦) → (𝑥2 − 𝑦2 , 2𝑥𝑦), is not a diffeomorphism
– even though its derivative is invertible everywhere – because it is not one-to-one.
1
𝑓 ∶ 𝕊2 ⧵ {(0, 0, 1)} → ℝ2 , (𝑥, 𝑦, 𝑧) → (𝑥, 𝑦)
1−𝑧
is a diffeomorphism. We will meet it again soon and see that it is quite useful.
⎛𝑥1 ⎞ ⎛ 2𝑥 𝑥 + 2𝑥 𝑥 ⎞
⎜𝑥 ⎟ 1 3 2 4
𝑓 ∶ 𝕊3 → 𝕊2 , ⎜ 2 ⎟ → ⎜ 2𝑥2 𝑥3 − 2𝑥1 𝑥4 ⎟
⎜𝑥3 ⎟ ⎜⎝𝑥2 + 𝑥2 − 𝑥2 − 𝑥2 ⎟⎠
⎝𝑥4 ⎠ 1 2 3 4
is not a diffeomorphism, since it is not one-to-one. For example, the whole unit circle
in the 𝑥1 -𝑥2 -plane on 𝕊3 , i.e., points on 𝕊3 with 𝑥3 = 𝑥4 = 0, is mapped to the north
pole (0, 0, 1) on 𝕊2 . The whole unit circle in the 𝑥3 -𝑥4 -plane on 𝕊3 , i.e., points on 𝕊3
with 𝑥1 = 𝑥2 = 0, is mapped to the south pole (0, 0, −1) on 𝕊2 . In fact, we will see
in Exercise 2.8 that each fiber of the Hopf fibration 𝑓 is diffeomorphic to a circle on
𝕊3 . However, as we will show later, none of these circles intersect even though they are
all linked into each other. This is a fascinating and very rare phenomenon. More on
1
We will study this map further though in different disguise in the exercises and will meet it many times in these
notes. But let us remark anyway that if 𝑥21 + 𝑥22 + 𝑥23 + 𝑥24 = 1, then
(2𝑥1 𝑥3 + 2𝑥2 𝑥4 )2 + (2𝑥2 𝑥3 − 2𝑥1 𝑥4 )2 + (𝑥21 + 𝑥22 − 𝑥23 − 𝑥24 )2 = (𝑥21 + 𝑥22 + 𝑥23 + 𝑥24 )2 = 1.
28 2.3. Smooth manifolds
this later. For the moment, we conclude this example with the remark that after defining
tangent spaces for manifolds we will see that, in fact, there cannot exist a diffeomorphism
between 𝕊3 and 𝕊2 .
Remark 2.20 (Diffeomorphic spaces are equivalent) From the point of view of dif-
ferential topology, diffeomorphic spaces are equivalent, and we may (and will) consider
them as copies of the same abstract space, which may happen to be differently situated
in their surrounding Euclidean spaces.
Many interesting spaces are given as the set of solutions of an equation of the form
𝑓 (𝑥) = 𝑏
𝐒 = {𝑥 ∈ 𝑋 ∶ 𝑓 (𝑥) = 𝑏} ⊂ 𝑋?
∙ (Goal) We would like to describe the space 𝐒 in a simple and efficient way while
still expressing all its interesting properties.
𝐴(𝐱) = 𝟎.
We can use, for example, Gauss elimination to get the set of solutions
𝐋 = {𝐱 ∈ ℝ4 ∶ 𝑥2 = 0, 𝑥3 = 𝑥1 , 𝑥4 = −𝑥2 } ⊂ ℝ4 .
This is a line in ℝ4 . In particular, it is something one-dimensional, i.e., we can describe all the
points in 𝐋 by using just one variable, say 𝑡:
⎧ ⎛1⎞⎫
⎪ ⎜0⎟⎪
𝐋 = ⎨𝐱 ∈ ℝ4 ∶ 𝐱 = 𝑡 ⋅ ⎜ ⎟⎬ .
⎪ ⎜1⎟⎪
⎩ ⎝0⎠⎭
Chapter 2. Smooth manifolds 29
We think of the variable 𝑡 as a parameter and would like to say that 𝑡 parametrizes the set of
solutions 𝐋. In a more formal way we have the map
⎛1⎞
⎜0⎟
𝜓 ∶ ℝ → 𝐋, 𝑡 → 𝑡 ⋅ ⎜ ⎟ .
⎜1⎟
⎝0⎠
⎛𝑥1 ⎞ ⎛ 2𝑥 𝑥 + 2𝑥 𝑥 ⎞
⎜𝑥 ⎟ 1 3 2 4
𝑓 ∶ ℝ4 → ℝ3 , ⎜ 2 ⎟ → ⎜ 2𝑥2 𝑥3 − 2𝑥1 𝑥4 ⎟ .
⎜𝑥3 ⎟ ⎜⎝𝑥2 + 𝑥2 − 𝑥2 − 𝑥2 ⎟⎠
⎝𝑥4 ⎠ 1 2 3 4
This map is not linear, since we multiply variables. W have seen such maps in multivariable
calculus and know how to calculate their derivatives. In fact, 𝑓 is a smooth map, since each
of its coordinates is a polynomial.The equation 𝑓 (𝐱) = 𝟎 has only a single solution – the zero
vector. So let us rather determine the solutions the equation
⎛0⎞
𝑓 (𝐱) = 𝐛 =∶ ⎜1⎟ .
⎜ ⎟
⎝0⎠
This is not a straight line in ℝ4 . However, it looks like something one-dimensional, since
knowing one of the variables, say 𝑥1 , determines 𝑥2 , 𝑥3 and 𝑥4 , and thereby 𝐱. Well, hold on:
Let us fix a value 𝑥1 = 𝑡. Then we get
√
1∕2 = 𝑥21 + 𝑥22 = 𝑡2 + 𝑥22 ⇒ 𝑥2 = ± 1∕2 − 𝑡2 .
Hence 𝑥2 is only determined up to a choice. To remedy this defect, let us restrict our attention
to points 𝐱 ∈ 𝐒 with 𝑥2 ≥ 0, then 𝑥1 = 𝑡 determines 𝐱 completely. We write 𝐒𝑥2 ≥0 for the set
of such points.
2
The map 𝑓 is modelled on the famous Hopf fibration 𝕊3 → 𝕊2 which is one of the very few smooth maps
between spheres whose fibers are all spheres themselves. We will meet the Hopf map many times during this class.
30 2.3. Smooth manifolds
In addition, we need
[ to make sure that
] the square root is defined, i.e., we need that 𝑡 only
√ √
varies in the range 𝑡 ∈ − 1∕2, 1∕2 . Hence we can use the map
⎛ 𝑡 ⎞
[ √ ] ⎜√ ⎟
√ 1∕2 − 𝑡2 ⎟
𝜙̃ + ∶ − 1∕2, 1∕2 → 𝐒𝑥2 ≥0 ⊂ ℝ4 , 𝑡 → ⎜√
⎜ 1∕2 − 𝑡2 ⎟
⎜ −𝑡 ⎟
⎝ ⎠
to describe one part of the set of solutions 𝐒. And we check that this map is a bijection.
This is very similar to the parametrization we used to describe 𝐋. However, this map is
far from linear. There is no way to fix this, since 𝑓 was not linear in the first place. But we can
check that 𝑓 is differentiable at many points. In particular, it is continuously differentiable at
all points in 𝐒. Hence we would like our map 𝜙̃ + to be differentiable as well. In fact, we would
like it to be smooth, since 𝑓 is smooth at all points in 𝐒.
To achieve this, we need to make sure that the domain of 𝜙̃ + is open and the partial deriva-
tives are defined. Hence we replace 𝜙̃ + with the map
⎛ 𝑡 ⎞
( √ √ ) ⎜√ 2 ⎟
1∕2 − 𝑡 ⎟
𝜙+ ∶ − 1∕2, 1∕2 → 𝐒𝑥2 >0 ⊂ ℝ4 , 𝑡 → ⎜√
⎜ 1∕2 − 𝑡2 ⎟
⎜ −𝑡 ⎟
⎝ ⎠
The map 𝜙+ is now a diffeomorphism, the best we can hope for, and does a similar job as
the parametrization 𝜓 above: it expresses
Since 𝜙+ describes only some part of 𝐒, we call the map 𝜙+ a local parametrization of 𝐒.
Finally, we observe that we are missing out on some points of 𝐒, in particular where 𝑥2 ≤ 0.
Hence we need further local parametrizations similar to 𝜙+ to cover all of 𝐒. The collection
of such maps will then express 𝐒 as a one-dimensional and smooth subspace of ℝ4 . In fact,
these maps give 𝐒 the structure of a one-dimensional smooth manifold, a notion we will now
define rigorously based on what we learned from this example.
Manifolds are now spaces that locally look like Euclidean spaces in the following sense:
The natural number 𝑁 in the previous definition is not specified. We just assume that there
is some ℝ𝑁 big enough to fit 𝑋 into it. We are going to discuss what we can say about the
minimal 𝑁 later. It is actually a very interesting question.
Figure 2.9: Points on 𝕊2 and on 𝕋 2 have both open nighborhoods diffeomorphic to open subsets
in ℝ2 . However, 𝕊2 and 𝕋 2 have different global properties.
32 2.3. Smooth manifolds
Figure 2.10: A hyperboloid is an example of a smooth 2-manifold. The cone, however, is not
a manifold, since it has a point without an open neighborhood diffeomorphic to an open subset
of ℝ2 . More about these two spaces in Exercise 2.5.
Remark 2.24 (Local coordinates) The set 𝑈 in the definition of a local parametrization
is a subset of ℝ𝑘 , and it may therefore seem plausible to express a point 𝑢 ∈ 𝑈 by its
coordinates 𝑢 = (𝑢1 , 𝑢2 , … , 𝑢𝑘 ). More precisely, given a coordinate system
𝜙−1 ∶ 𝑉 → 𝑈
talk about the coordinates (𝑢1 (𝑥), 𝑢2 (𝑥), … , 𝑢𝑘 (𝑥)) of 𝑥. Hence we need to remember
that the 𝑢1 , … , 𝑢𝑘 are really coordinate functions.
A fundamental example that will play an important role during the whole semester is the
𝑛-dimensional sphere. We start with the one-dimensional case: the unit circle.
𝕊1 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑥2 + 𝑦2 = 1} ⊂ ℝ2
be the unit circle. We are going to show that 𝕊1 is a one-dimensional manifold. First,
suppose that (𝑥, 𝑦) lies in the upper semicircle where 𝑦 > 0. Then
√
𝜙1 (𝑥) = (𝑥, 1 − 𝑥2 )
maps the open interval 𝑊 = (−1, 1) ⊂ ℝ bijectively onto the upper semicircle. It is a
smooth map (−1, 1) → ℝ2 , since its partial derivatives exist and are continuous. Here
it is important that we do not include the endpoints of the interval (−1, 1). Its inverse is
the projection map
𝜙−1
1
(𝑥, 𝑦) = 𝑥
which is defined on the upper semicircle. This 𝜙−1
1
is smooth, since it extends to a smooth
map of all of ℝ to ℝ . Therefore, 𝜙1 is a parametrization.
2 1
These two maps give local parametrizations of 𝕊1 around any point except the two points
(1, 0) and (−1, 0). To cover these points, we can use the maps
√ √
𝜙3 (𝑦) = ( 1 − 𝑦2 , 𝑦) and 𝜙4 (𝑦) = (− 1 − 𝑦2 , 𝑦)
∙ Another example is the set of solutions 𝐒 ⊂ ℝ4 of the equation 𝑓 (𝐱) = 𝐛 that we have
seen at the beginning of the chapter. We can check that the map 𝜙+ we defined is a local
parametrization of 𝐒. Using what we learned from the local parametrizations of 𝕊1 it
should not be too difficult to write down the missing local parametrizations for 𝐒. We
just need to adjust for how 𝐒 sits inside ℝ4 . It is a good exercise to work this out on your
own. Note that we will meet the map 𝑓 and the set 𝐒 again during this course.
34 2.3. Smooth manifolds
∙ In fact, we will see later that many smooth manifolds arise as the set of solutions of a
suitable equation involving a smooth function.4
The definition of a manifold requires parametrizations that cover the whole space. It is
a natural question, what the minimal number of such maps is. The answer depends on the
manifold we look at. Here is a first thought about this number for the sphere:
Remark 2.28 (Need at least two parametrizations on the sphere) Note that we
have used four parametrization maps in the above example. It is an exercise to show
that it is possible to cover 𝕊1 with only two parametrizations. But note that just one
parametrization cannot be enough, because 𝕊1 is compact. For, if there was a dif-
feomorphism 𝜙 ∶ 𝕊1 → 𝑈 ⊂ ℝ1 to an open subset, it would mean that 𝑈 is compact
contradicting the Theorem of Heine-Borel which says that the compact subsets of ℝ1 are
closed and bounded. This argument actually holds for the 𝑛-sphere in every dimension
𝑛 ≥ 1.
There are many different ways to choose parametrizations for a sphere. There is a very
economical one which shows that two parametrizations suffice to cover the 𝑛-sphere:
This is an illustration of the stereographic projection for the 2-sphere 𝕊2 . We study the
formulae of the maps involved in the exercises and will show that this actually defines a sufficient
parametrization.
4
If you cannot wait, you may want to fast-forward to Section 4.2.1 on regular values to have a first glimpse.
Chapter 2. Smooth manifolds 35
Manifolds have subsets. We are interested in those subsets which are manifolds on their
own, possibly of lower dimension:
∙ Similarly, we have basically two ways of considering a copy of the circle on the two-
dimensional torus: once as a horizontal circle, once as a vertical circle.
To find submanifolds in already existing manifolds is an important way to define and study new
manifolds. But there are also other ways to produce new manifolds:
Lemma 2.32 (Creating new manifolds out of old ones) Let 𝑋 ⊆ ℝ𝑁 and 𝑌 ⊆ ℝ𝑀
be manifolds of dimensions 𝑘 and 𝑙, respectively. Then 𝑋 × 𝑌 ⊆ ℝ𝑁+𝑀 is a manifold
of dimension 𝑘 + 𝑙.
Example 2.33 (Torus) One way to define the two-dimensional torus is to think of
it as the product 𝕋 2 = 𝕊1 × 𝕊1 . Then the general statement above implies that 𝕋 2 is a
two-dimensional manifold. This is convenient. However, this way we consider 𝕋 2 as a
subset of ℝ4 , since 𝕊1 being a subspace of ℝ2 forces us to take the product of ℝ2 with
itself to embed 𝕋 2 . Since we are more used to visualise the torus as a subspace in three
dimensions, we will discuss a way to describe 𝕋 2 as a subspace in ℝ3 in the exercises.
2.3.5 A non-example
Finally, we are now going to discuss the case of a space which is not a manifold:
Let 𝑋 denote the union of the 𝑥- and the 𝑦-axis in ℝ2 , in other words,
𝑋 = {(𝑥, 𝑦) ∈ ℝ2 such that 𝑥𝑦 = 0}.
The critical point is the origin 𝑂 = (0, 0), as every other point on 𝑋 has an open neighborhood
which is diffeomorphic to an open interval in ℝ. But no point in ℝ𝑑 with 𝑑 ≥ 2 has an
open neighborhood in ℝ𝑑 diffeomorphic to an open interval in ℝ1 .5 Hence 𝑋 could only be
1-dimensional.
Now let us check the point 𝑂 = (0, 0). If 𝑋 was a manifold of dimension one, there would
be an open subset 𝑉 ⊆ 𝑋 around 𝑂 diffeomorphic to an open interval in ℝ1 . By definition of
open sets in a subset of ℝ2 , there must be an open ball 𝔹2𝜀 (𝑂) such that 𝔹2𝜀 (𝑂) ∩ 𝑋 is contained
in 𝑉 . Let 𝐼 be the open interval in ℝ homeomorphic to 𝔹2𝜀 (𝑂) ∩ 𝑋.
5
We can prove this fact when we have introduced tangent spaces.
Chapter 2. Smooth manifolds 37
The subset 𝔹𝜀 (𝑂) ∩ 𝑋 looks like a cross and contains, in particular, the points
In 𝔹2𝜀 (𝑂) ∩ 𝑋, there are paths, i.e., continuous maps 𝛾 ∶ [0, 1] → 𝔹2𝜀 (𝑂) ∩ 𝑋,
But there is no triple of distinct points with this property in the open intervall 𝐼 ⊂ ℝ. In
more detail, we can argue as follows:
Since 𝑃1 , 𝑃2 , 𝑃3 are pairwise distinct points, their images under 𝜙 must be pairwise distinct
as well. Since ℝ is a totally ordered set, we can order these three points. Assume first 𝜙(𝑃1 ) <
𝜙(𝑃2 ) < 𝜙(𝑃3 ). Then the Intermediate Value Theorem of Calculus implies that for the path
there is an 𝑠 ∈ (0, 1) such that (𝜙◦𝛾2 )(𝑠) = 𝜙(𝑃2 ). This would imply 𝛾2 (𝑠) = 𝑃2 contradicting
the choice of 𝛾2 . Thus the diffeomorphism 𝜙 with the assumed ordering 𝜙(𝑃1 ) < 𝜙(𝑃2 ) < 𝜙(𝑃3 )
cannot exist.
Now we can adjust and repeat this argument for any ordering of the three points 𝜙(𝑃1 ),
𝜙(𝑃2 ) and 𝜙(𝑃3 ) and get contradictions to the choices of paths 𝛾1 , 𝛾2 and 𝛾3 .
Hence the homeomorphism 𝜙 ∶ 𝔹2𝜀 (𝑂) ∩ 𝑋 → 𝐼 cannot exist. We conclude that 𝑂 does
not have a neighborhood homeomorphic to an open interval in ℝ, and 𝑋 is not a manifold.
In the discussion above, we used implicitly that we were looking at path-connected spaces.
Recall that a topological space 𝑋 is called path-connected if for any two points 𝑥, 𝑦 ∈ 𝑋 there
is a continuous map 𝛾 ∶ [0, 1] → 𝑋 from the unit interval to 𝑋 with 𝛾(0) = 𝑥 and 𝛾(1) = 𝑦.
Path-connectedness is a topological property, i.e. it is preserved under homeomorphisms.
The union of the coordinate axes in ℝ2 is an example of a path-connected space and every
interval in ℝ is path-connected. Now assume 𝜙 ∶ 𝔹2𝜀 (𝑂) ∩ 𝑋 → 𝐼 was a homeomorphism to
an interval 𝐼 ⊂ ℝ. Let 𝜙(𝑂) ∈ 𝐼 be the image of the origin. If we remove 𝜙(𝑂) from 𝐼, we
get two components of the interval. Points in the same component can be connected by a path,
whereas points from different components cannot be connected to each other via a path without
crossing 𝜙(𝑂).
If we remove 𝑂 from 𝔹2𝜀 (𝑂) ∩ 𝑋 we get a space with four components. Again, points in the
same component can be connected by a path, whereas points from different components cannot
be connected to each other via a path.
We call these subsets the path-components of the spaces 𝐼 ⧵{𝜙(𝑂)} and (𝔹2𝜀 (𝑂)∩𝑋)⧵{𝑂}.
The key observation is that if 𝜙 was a homeomorphism, 𝜙|(𝔹2 (𝑂)∩𝑋)⧵{𝑂} would still be a home-
𝜀
omorphism. But homeomorphic spaces need to have the same number of path-components,
assuming that number is finite. This is the background for the argument we used above.
38 2.4. Tangent spaces and the derivative
Figure 2.13: Three points on the coordinate axes, two of which can be connected without pass-
ing through the other one.
We are now going to introduce one of the key tools to study smooth manifolds.
Let 𝑥 be a point on the smooth manifold 𝑋. By definition, we can choose a local parametrization
𝜙 ∶ 𝑈 → 𝑋 around 𝑥 which tells us that, at least locally at 𝑥, 𝑋 is the image of 𝑈 ⊂ ℝ𝑘 under
the diffeomorphism 𝜙. Images under diffeomorphism are nice, but images under linear maps
are even better since the latter are vector spaces.
So how could we describe 𝑋, at least locally at 𝑥, via a vector space? Well, 𝑋 is itself not a
vector space, but what we can look for is a linear approximation of 𝑋 at 𝑥. This is the purpose
of the tangent space at 𝑥.
This map is smooth since 𝑓 is smooth, and 𝜙 is injective because of the first coordinate and
surjective by definition of Γ(𝑓 ). Moreover, the projection (𝑥, 𝑓 (𝑥)) → 𝑥 defines a smooth
inverse. Thus, 𝜙 is a diffeomorphism and yields us a parametrization for all points of Γ(𝑓 ).
Chapter 2. Smooth manifolds 39
The tangent line at the point (𝑥, 𝑓 (𝑥)) is the prototype of an example of a tangent space of
smooth manifold. More precisely, we prefer to consider the parallel translate of the tangent
line to the origin, since we want the tangent space to be a vector space. See Figure 2.14. How
can we describe the tangent line 𝐿𝑥 passing through the origin? It is determined by its slope,
i.e., { ( ) }
1 2
𝐿𝑥 = 𝑡 ⋅ ∈ ℝ ∶ 𝑡 ∈ ℝ .
𝑓 ′ (𝑥)
( )
1
Now we observe that is exactly the derivative at 𝑥 of the map 𝜙 we defined above,
𝑓 ′ (𝑥)
( )
1
i.e., 𝑑𝜙𝑥 = ′ . Thus, the tangent line 𝐿𝑥 is the image in ℝ2 of the linear map
𝑓 (𝑥)
𝑑𝜙𝑥 ∶ ℝ → ℝ2 .
Figure 2.14: The tangent line at the graph Γ(𝑓 ) of 𝑓 is the parallel translate of the tangent space
of Γ(𝑓 ).
𝑑𝜙𝑢 ∶ ℝ𝑛 → ℝ𝑁
sending a vector ℎ ∈ ℝ𝑛 to the vector 𝑑𝜙𝑢 (ℎ) ∈ ℝ𝑁 . In Calculus we learned that this map is
ℝ-linear, i.e., 𝑑𝜙𝑢 (ℎ + 𝑔) = 𝑑𝜙𝑢 (ℎ) + 𝑑𝜙𝑢 (𝑔) and 𝑑𝜙𝑢 (𝜆ℎ) = 𝜆𝑑𝜙𝑢 (ℎ) for all ℎ, 𝑔 ∈ ℝ𝑛 and
40 2.4. Tangent spaces and the derivative
𝜆 ∈ ℝ. In particular, the derivative of 𝜙 is a map on its own which is defined on all of ℝ𝑛 even
when 𝜙 may not be. Recall that we can calculate 𝑑𝜙𝑢 in the standard bases of Euclidean spaces
𝜕𝜙
as the Jacobian matrix. Its entry in row 𝑖 and column 𝑗 is the partial derivative 𝜕𝑥 𝑖 (𝑢).
𝑗
Remark 2.34 (The derivative is a linear approximation) One way to appreciate the
significance of the derivative is to think of it as a simple and useful approximation to 𝜙
at 𝑢, i.e., knowing 𝜙(𝑢) and 𝑑𝜙𝑢 gives us a good guess for what 𝜙(𝑢 + ℎ) might, namely
something close to 𝜙(𝑢) + 𝑑𝜙𝑢 (ℎ).
Figure 2.15: The tangent space is the isomorphic image of a ℝ𝑘 in ℝ𝑁 . We visualize it as the
parallel translate of this plane attached to the point of the manifold.
∙ Tangent spaces are useful: While tangent spaces may look quite boring, since they are
just vector spaces, we will see very soon that they are extremely useful for understanding
manifolds. Many important geometric conditions can be stated in terms of tangent spaces.
The most important example for us might be transversality, a key condition for making
intersection theory work.
We can find an answer to this question by taking another local parametrization and check
whether 𝑇𝑥 (𝑋) changes. So let 𝜓 ∶ 𝑉 → 𝑋 be another local parametrization around 𝑥 with
𝜓(0) = 𝑥. If necessary, we shrink 𝑈 and 𝑉 , i.e., we replace 𝑈 with 𝜙−1 (𝜙(𝑈 ) ∩ 𝜓(𝑉 )) ⊂ 𝑈
and 𝑉 with 𝜓 −1 (𝜙(𝑈 ) ∩ 𝜓(𝑉 )) ⊂ 𝑉 . After doing this we can assume
𝜙(𝑈 ) = 𝜓(𝑉 ).
This implies that the image of 𝑑𝜙0 is contained in the image of 𝑑𝜓0 :
Hence 𝑑𝜙0 (ℝ𝑘 ) = 𝑑𝜓0 (ℝ𝑘 ) in ℝ𝑁 . This shows that whatever local parametrization around 𝑥
we start with, the vector subspace 𝑇𝑥 (𝑋) ⊆ ℝ𝑁 is always the same. In mathematical terms we
say that 𝑇𝑥 (𝑋) is well-defined.
Example 2.37 (Tangent spaces of the unit circle) Let 𝑝 = (𝑎, 𝑏) ∈ 𝕊1 be a point with
𝑏 > 0. A local parametrization around 𝑝 with 𝜙(0) = 𝑝 is given by
√
𝜙 ∶ (−𝜀, 𝜀) → 𝕊1 , 𝑡 → (𝑡 + 𝑎, 1 − (𝑡 + 𝑎)2 )
42 2.4. Tangent spaces and the derivative
for some small enough real number 𝜀 > 0. The derivative at 𝑡 is the linear map
( )
2
1
𝑑𝜙𝑡 ∶ ℝ → ℝ , 𝑑𝜙𝑡 = − √ 𝑡+𝑎 .
2 1−(𝑡+𝑎)
Hence
√ the image of ℝ under 𝑑𝜙0 in ℝ is the line spanned by (−𝑏, 𝑎) where we use
2
𝑏 = 1 − 𝑎2 .
⎛1 − 𝑥2 + 𝑦2 −2𝑥𝑦 ⎞
2 ⎜ −2𝑥𝑦
𝑑(𝜙𝑁 )(𝑥,𝑦) = 1 + 𝑥2 − 𝑦2 ⎟ .
(1 + 𝑥2 + 𝑦2 )2 ⎜ ⎟
⎝ 2𝑥 2𝑦 ⎠
The image of ℝ2 under the linear map 𝑑(𝜙𝑁 )(𝑥,𝑦) is the tangent space 𝑇𝜙𝑁 (𝑥,𝑦) 𝕊2 . This
image is spanned by the two column vectors of the matrix 𝑑(𝜙𝑁 )(𝑥,𝑦) . Let us check that
we get the space we would have expected, i.e., the plane which is orthogonal to the
vector 𝜙𝑁 (𝑥, 𝑦):
) ⎛1 − 𝑥 + 𝑦 ⎞
2 2
(
2𝑥, 2𝑦, 𝑥 + 𝑦 − 1 ⋅ ⎜ −2𝑥𝑦 ⎟
2 2
⎜ ⎟
⎝ 2𝑥 ⎠
= 2𝑥(1 − 𝑥2 + 𝑦2 ) − 4𝑥𝑦2 + 2𝑥(𝑥2 + 𝑦2 − 1)
= 2𝑥 − 2𝑥3 + 2𝑥𝑦2 − 4𝑥𝑦2 + 2𝑥3 + 2𝑥𝑦2 − 2𝑥
= 0.
Similarly,
( ) ⎛ −2𝑥𝑦 ⎞
2𝑥, 2𝑦, 𝑥2 + 𝑦2 − 1 ⋅ ⎜1 + 𝑥2 − 𝑦2 ⎟
⎜ ⎟
⎝ 2𝑦 ⎠
= −4𝑥2 𝑦 + 2𝑦(1 + 𝑥2 − 𝑦2 ) + 2𝑦(𝑥2 + 𝑦2 − 1)
= −4𝑥2 𝑦 + 2𝑦 + 2𝑥2 𝑦 − 2𝑦3 + 2𝑥2 𝑦 + 2𝑦3 − 2𝑦
= 0.
Hence the plane spanned by the column vectors is orthogonal to 𝜙𝑁 (𝑥, 𝑦).
a
We do not have to translate first to get 𝜙𝑁 (0) = 𝑝. That is up to us.
Chapter 2. Smooth manifolds 43
∙ Open subsets and tangent spaces
as vector subspaces of ℝ𝑁 .
Lemma 2.39 (Tangent spaces and open subsets) Let 𝑋 be a 𝑘-dimensional manifold
and 𝑊 be an open subset. At any point 𝑥 ∈ 𝑊 , we have
𝑇𝑥 (𝑋) = 𝑇𝑥 (𝑊 ).
A simple way to produce new manifolds is by taking products. We have already met an
important example of this construction: the two-dimensional torus 𝕋 2 = 𝕊1 × 𝕊1 . The tangent
space of such a product behaves as nicely as we can imagine:
𝑇(𝑥,𝑦) (𝑋 × 𝑌 ) = 𝑇𝑥 (𝑋) × 𝑇𝑦 (𝑌 ).
Proof: This follows from the fact that we can choose neighborhoods in 𝑋 × 𝑌 by taking
the product of neighborhoods in 𝑋 and 𝑌 , respectively. Moreover, it is easy to check that
𝑓 ∶ 𝑋 → 𝑋 ′ and 𝑔 ∶ 𝑌 → 𝑌 ′ are smooth maps, then the derivative of the product map is the
product of the derivatives, i.e.,
Now we will turn to the effect of smooth maps on tangent spaces. In fact, every smooth map
between manifolds induces a linear map between tangent spaces. These linear maps are very
useful.
𝑓
𝑋O /𝑌
O
𝜙 𝜓 𝜓 −1
𝑈 /𝑉.
𝜃=𝜓 −1 ◦𝑓 ◦𝜙
Definition 2.41 (The derivative 𝑑𝑓𝑥 ) Taking derivatives yields a diagram of linear
maps and we define 𝑑𝑓𝑥 to be the linear map which makes the diagram commutative:
𝑑𝑓𝑥
𝑇𝑥 (𝑋) / 𝑇 (𝑌 )
O 𝑦
O
𝑑𝜙0 𝑑(𝜙−1 )𝑥 𝑑𝜓0
ℝ𝑘 / ℝ𝑙 .
𝑑𝜃0
Remark 2.42 (Why so complicated?) You may wonder why we need to take this
detour to define 𝑑𝑓𝑥 when we could also consider 𝑓 as a map of subsets of Euclidean
spaces and take the derivative of that map, since we assume 𝑋 ⊂ 𝑅𝑁 and 𝑌 ⊂ ℝ𝑀
for some 𝑁 and 𝑀 anyway. This works nicely if 𝑋 is an open subset in ℝ𝑁 . For if
𝑋 ⊂ ℝ𝑁 is open, we can choose 𝜙 as the identity map and have 𝑇𝑥 (𝑋) = 𝑇𝑥 (ℝ𝑁 ).
Then the derivative 𝑑𝐹𝑥 ∶ 𝑇𝑥 (𝑋) = 𝑇𝑥 (ℝ𝑁 ) = ℝ𝑁 → 𝑇𝑦 (𝑌 ) is actually the derivative
as a smooth map between Euclidean space.
However, if 𝑋 is not open in ℝ𝑁 we need to work a bit harder. By definition of
smoothness, there is an open subset 𝑊 ⊂ ℝ𝑁 and a smooth map 𝐹 ∶ 𝑊 → ℝ𝑀 such
that 𝐹|𝑊 ∩𝑋 = 𝑓|𝑊 ∩𝑋 . The derivative of 𝐹 at 𝑥 is a linear map 𝑑𝐹𝑥 ∶ ℝ𝑁 → ℝ𝑀 . But
what we want is a linear map defined on 𝑇𝑥 (𝑋) ⊂ ℝ𝑁 and with image in 𝑇𝑦 (𝑌 ) ⊂ ℝ𝑀 .
When we look at the gymnastics we do to define 𝑑𝑓𝑥 , we see that this is exactly what
we do: we restrict and adjust 𝑑𝐹𝑥 to the vector subspace 𝑇𝑥 (𝑋) ⊂ ℝ𝑁 such that it has
image in 𝑇𝑦 (𝑌 ). Thus, in the end, the seemingly complicated definition is just the linear
algebra necessary to assure that 𝑑𝑓𝑥 has the correct domain and codomain. We will see
later when we learn about regular values and transversal intersections that there is often
a short cut to make our first intuition work.
6
which means that it does not matter which way we walk around from 𝑈 to 𝑌 .
Chapter 2. Smooth manifolds 45
2.4.6 The derivative is well-defined:
We should check that the derivative is well-defined, i.e., that 𝑑𝑓𝑥 does not depend on the choice
of local parametrizations around 𝑥 and 𝑦 = 𝑓 (𝑥). So let 𝜙′ ∶ 𝑈 → 𝑋 and 𝜓 ′ ∶ 𝑉 ′ → 𝑌
be another choice of local parametrizations around 𝑥 and 𝑦, respectively. Again after possibly
shrinking both 𝑈 , 𝑈 ′ , 𝑉 and 𝑉 ′ we can assume that 𝜙(𝑈 ) = 𝜙′ (𝑈 ′ ) ⊆ 𝑋 and 𝜓(𝑉 ) = 𝜓 ′ (𝑉 ′ ) ⊆
𝑌.
Then 𝑑𝜙0 and 𝑑𝜙′0 differ by a linear isomorphism of ℝ𝑘 , say 𝛼: 𝑑𝜙0 = 𝑑𝜙′0 ◦𝛼. Similarly,
there is a linear isomorphism 𝛽 of ℝ𝑙 such that 𝑑𝜓0 = 𝑑𝜓0′ ◦𝛽. Let 𝜃 ′ ∶ 𝑈 → 𝑉 be defined
similarly to 𝜃, i.e., we set 𝜃 ′ = 𝜓 ′ −1 ◦𝑓 ◦𝜙′ . This gives us the following diagram in which each
square commutes
𝑑𝑓𝑥
𝑇𝑥 (𝑋) / 𝑇 (𝑌 )
@ O 𝑦
O ]
𝑑𝜙′0 𝑑𝜓0′
𝑑𝜙0 ℝO 𝑘 / ℝ𝑙 𝑑𝜓0
𝑑𝜃0′ O
𝛼 𝛽
ℝ𝑘 / ℝ𝑙 .
𝑑𝜃0
𝑑𝜃0′ = 𝛽◦𝑑𝜃0 ◦𝛼 −1 .
This implies the desired identity for the a priori different constructions of 𝑑𝑓𝑥 .
Before we look at an example, we would like to know that the new derivative satisfies a
chain rule, since this is a very useful rule.
𝑓 𝑔
𝑋O /𝑌 /𝑍
O O
𝜙 𝜓 𝜂
𝑈 /𝑉 /𝑊
𝜃=𝜓 −1 ◦𝑓 ◦𝜙 𝜄=𝜂 −1 ◦𝑔◦𝜓
46 2.4. Tangent spaces and the derivative
which gives us the commutative square
𝑔◦𝑓
𝑋O /𝑍
O
𝜙 𝜂
𝑈 /𝑊.
𝜄◦𝜃
Thus, by definition,
𝑑(𝑔◦𝑓 )𝑥 = 𝑑𝜂0 ◦𝑑(𝜄◦𝜃)0 ◦𝑑(𝜙−1 )𝑥 .
The chain rule from Calculus for maps of open sets of Euclidean spaces, then gives
𝑑(𝜄◦𝜃)0 = (𝑑𝜄0 )◦(𝑑𝜃0 ).
Thus
𝑑(𝑔◦𝑓 )𝑥 = (𝑑𝜂0 ◦𝑑𝜄0 ◦𝑑(𝜓 −1 )𝑦 )◦(𝑑𝜓0 ◦𝑑𝜃0 ◦𝑑(𝜙−1 )𝑥 ) = 𝑑𝑔𝑦 ◦𝑑𝑓𝑥 .
Hence we have in fact the desired rule.
𝑓 𝑔
Theorem 2.43 (Chain Rule) If 𝑋 ←←←→ ← 𝑍 are smooth maps of manifolds, then
← 𝑌 ←←→
We conclude this section with another important example and some concrete calculations. Re-
call the Hopf map 𝑓 ∶ 𝕊3 → 𝕊2 defined by
⎛𝑥1 ⎞ ⎛ 2𝑥 𝑥 + 2𝑥 𝑥 ⎞
⎜𝑥 ⎟ 1 3 2 4
𝑓 ∶ 𝕊3 → 𝕊2 , ⎜ 2 ⎟ → ⎜ 2𝑥2 𝑥3 − 2𝑥1 𝑥4 ⎟ .
⎜𝑥3 ⎟ ⎜⎝𝑥2 + 𝑥2 − 𝑥2 − 𝑥2 ⎟⎠
⎝𝑥4 ⎠ 1 2 3 4
We will now compute the derivative of 𝑓 at two points concrete points in 𝕊3 . Later we will be
able to appreciate the relevance of these computations and the choice of points much better. For
the moment, we consider this just as training and illustration.
First, let us pick a point which is mapped to the south pole 𝐬2 = (0, 0, −1) ∈ 𝕊2 . The
formula for 𝑓 shows that all points (𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ) ∈ 𝕊3 with 𝑥1 = 𝑥2 = 0 are mapped to 𝐬2 .
Hence, in particular, the north pole 𝐧3 = (0, 0, 0, 1) ∈ 𝕊3 is mapped to 𝐬2 = (0, 0, −1) ∈ 𝕊2 . So
let us look at the point 𝐧3 which is sent to 𝐬2 .
Since the formula for the stereographic projections can be quite involved when the number
of variables increases, we use the local parametrizations given as the inverse of the projection
onto the first coordinates. We used them in Example
√ 2.26 for the 𝕊 . We choose the open ball
1
𝔹3 (𝟎3 ) around the origin 𝟎3 in ℝ3 of radius7 1∕ 2 and use the local parametrization
( √ )
𝜙 ∶ 𝑈 = 𝔹3 √ (𝟎3 ) → 𝑊3 , (𝑥, 𝑦, 𝑧) → 𝑥, 𝑦, 𝑧, 1 − (𝑥2 + 𝑦2 + 𝑧2 )
1∕ 2
7
The choice of the radius will become apparent in a minute when we compute 𝑓 ◦𝜙 which needs to have image
in the subset of points in 𝕊2 with 𝑥3 < 0.
Chapter 2. Smooth manifolds 47
where 𝑊3 ⊂ 𝕊3𝑥 ⊂ 𝕊3 denotes the open subset of 𝕊3 consisting of points with coordinate
4 >0
𝑥4 > 0 and 2𝑥21 + 2𝑥22 < 1. Note that 𝜙 maps 𝟎3 to 𝐧3 . Similarly, we choose the open ball
𝔹21 (𝟎2 ) around the origin 𝟎2 in ℝ2 of radius 1 and use the local parametrization
( √ )
𝜓 ∶ 𝑉 = 𝔹21 (𝟎2 ) → 𝑊2 , (𝑥, 𝑦) → 𝑥, 𝑦, − 1 − (𝑥2 + 𝑦2 )
where 𝑊2 = 𝕊2𝑥 <0 ⊂ 𝕊2 denotes the open subset of 𝕊2 consisting of points with coordinate
3
𝑥3 < 0. Note that 𝜓 maps 𝟎2 to 𝐬2 .
Now we need to calculate the induced map 𝜃 ∶ ℝ3 → ℝ2 such that the diagram commutes
𝑓
𝑊3 /𝑊
O O2
𝜙 𝜓 𝜓 −1
𝔹3 √ (𝟎3 ) / 𝔹2 (𝟎 ).
1 2
1∕ 2 𝜃=𝜓 −1 ◦𝑓 ◦𝜙
We calculate the effect of the maps step by step. Recall that we write |𝐱| for the norm of
points 𝐱 ∈ ℝ𝑘 . First we apply 𝜙 to a point in 𝔹3 (𝟎3 ):
⎛ 𝑥 ⎞
⎛𝑥⎞ ⎜ 𝑦 ⎟
𝜙 ∶ 𝐱 = ⎜𝑦⎟ → ⎜ ⎟.
⎜ ⎟ ⎜√ 𝑧 ⎟
⎝𝑧⎠ ⎜ ⎟
⎝ 1 − |𝐱| ⎠
2
Recall that the inverse 𝜓 −1 is just the projection onto the first two coordinates. Hence applying
the composite 𝜓 −1 ◦𝜙 to a point in ℝ3 yields:
( √ )
⎛𝑥⎞
2𝑥𝑧 + 2𝑦 1 − |𝐱| 2
𝜓 −1 ◦𝑓 ◦𝜙 ∶ 𝐱 = ⎜𝑦⎟ → √ .
⎜ ⎟ 2𝑦𝑧 − 2𝑥 1 − |𝐱|2
⎝ ⎠
𝑧
Now we need to apply this result to compute the horizontal map 𝑑𝑓𝑛3 such that following
diagram for tangent spaces commutes:
𝑑𝑓𝐧3
𝑇𝐧3 (𝕊3 ) / 𝑇 (𝕊2 )
𝐬2
O O
𝑑𝜙𝟎3 𝑑(𝜙−1 )𝐧3 𝑑𝜓𝟎2
ℝ3 / ℝ2 .
𝑑𝜃𝟎3
8
A good reality check is that 𝜃 does map the origin 𝟎3 to the origin 𝟎2 as we claimed.
48 2.4. Tangent spaces and the derivative
In particular, we need to calculate the derivative 𝑑𝜃𝟎3 ∶ ℝ3 → ℝ2 at the origin 𝟎3 . We do
this by first computing the Jacobian matrix 𝐽𝜃 of 𝜃:
√ 2𝑦2
⎛ 2𝑧 − √ 2𝑥𝑦 2 2 1 − |𝐱|2 − √ 2𝑥 − √
2𝑦𝑧 ⎞
𝐽𝜃 = ⎜ √ 1−|𝐱|
2
1−|𝐱|2 1−|𝐱|2 ⎟
.
⎜−2 1 − |𝐱|2 + √ 2𝑥 2𝑧 + √
2𝑥𝑦
2𝑦 + √
2𝑥𝑧 ⎟
⎝ 1−|𝐱|2 1−|𝐱|2 1−|𝐱|2 ⎠
This looks annoyingly complicated. However, there is good news. We want to calculate the
matrix representing 𝑑𝜃𝟎3 at the origin. Hence we set 𝑥 = 𝑦 = 𝑧 = 0 and see that 𝑑𝜃𝟎3 ∶ ℝ3 → ℝ2
is given by the matrix
( )
0 2 0
𝑑𝜃𝟎3 =
−2 0 0
By our definition of 𝑇𝐧3 (𝕊3 ) as the image of 𝑑𝜙𝟎3 ∶ ℝ3 → ℝ4 . As a basis of 𝑇𝐧3 (𝕊3 ) we
can hence choose the images of the standard basis 𝐞31 , 𝐞32 , 𝐞33 of ℝ3 under 𝑑𝜙𝟎3 . With respect this
basis 𝑑𝜙𝟎3 (𝐞31 ), 𝑑𝜙𝟎3 (𝐞32 ), 𝑑𝜙𝟎3 (𝐞33 ) for 𝑇𝐧3 (𝕊3 ) and the standard basis for ℝ3 , the derivative
Similarly, for 𝑇𝐬2 (𝕊2 ) we can choose the image of the standard basis 𝐞21 , 𝐞22 of ℝ2 under
𝑑𝜓𝟎2 . With respect to the standard basis for ℝ2 and the basis 𝑑𝜓𝟎2 (𝐞21 ), 𝑑𝜓𝟎2 (𝐞22 ) for 𝑇𝐬2 (𝕊2 ) the
derivative
𝑑𝜓𝟎2 ∶ ℝ2 → 𝑇𝐬2 (𝕊2 )
Hence — with respect to these bases for 𝑇𝐧3 (𝕊3 ) and 𝑇𝐬2 (𝕊2 ) — we see that the composition
Note that our choices of bases make it very easy to compute the matrix for 𝑑𝑓𝐧3 . So in the
end, there is not as much to compute as one might fear. To make things even more explicit we
observe that 𝑑𝑓𝐧3 has the effect on the basis vectors:
Remark 2.44 (Something we learn from this example) Among other things we see
in this example that — once we have computed 𝜃 — there is a straight-forward way to
compute a matrix which describes 𝑑𝑓𝑥 . For, in this setting, there is a canonical choice
for the bases of 𝑇𝑥 (𝑋) and 𝑇𝑦 (𝑌 ): the images of the standard basis vectors of ℝ𝑛 and ℝ𝑚
under the isomorphisms 𝑑𝜙0 and 𝑑𝜓0 , respectively.
Then we can compute the matrix which represents the linear map 𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝑋) →
𝑇𝑦 (𝑌 ) with respect to these bases just as the matrix which represents 𝑑𝜃0 with respect to
the standard bases of ℝ𝑛 and ℝ𝑚 . And we get this matrix as the Jacobian matrix of 𝜃 at
the origin.
Second, let us apply what we just learned and practice a bit more. So let us look at a point
on 𝕊3 which is mapped to the north pole 𝐧2 = (0, 0, 1) ∈ 𝕊2 . The formula for 𝑓 shows that all
points (𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ) ∈ 𝑆 3 with 𝑥3 = 𝑥4 = 0 are mapped to 𝐧2 . Hence, in particular, the point
𝐪3 = (1, 0, 0, 0) ∈ 𝑆 3 is mapped to 𝐧2 = (0, 0, 1) ∈ 𝕊2 . So let us pick that 𝐪3 mapping to 𝐧2 .
Much of the calculation is the same as in the previous case. However, there are some in-
teresting changes, in particular, of some signs. Again, we will be able to appreciate this more
later.
√
We choose the open ball 𝔹3 (𝟎3 ) around the origin 𝟎3 in ℝ3 of radius 1∕ 2 and use the local
parametrization
(√ )
𝜙 ∶ 𝑈 = 𝔹3 √ (𝟎3 ) → 𝑊3 , (𝑥, 𝑦, 𝑧) → 1 − (𝑥2 + 𝑦2 + 𝑧2 ), 𝑥, 𝑦, 𝑧
1∕ 2
where 𝑊3 ⊂ 𝕊3𝑥 ⊂ 𝕊3 denotes the open subset of 𝕊3 consisting of points with coordinate
1 >0
𝑥1 > 0 and 2𝑥23 + 2𝑥24 < 1. We choose the open ball 𝔹21 (𝟎2 ) around the origin 𝟎2 in ℝ2 of radius
1 and use the local parametrization
( √ )
𝜓 ∶ 𝑉 = 𝔹21 (𝟎2 ) → 𝑊2 , (𝑥, 𝑦) → 𝑥, 𝑦, 1 − (𝑥2 + 𝑦2 )
where 𝑊2 ⊂ 𝕊2𝑥 denotes the open subset of 𝕊2 consisting of points with coordinate 𝑥3 > 0.
3 >0
We need to calculate the induced map 𝜃 ∶ ℝ3 → ℝ2 such that the diagram commutes
𝑓
𝑊3 /𝑊
O O2
𝜙 𝜓 𝜓 −1
𝔹3 √ (𝟎3 ) / 𝔹2 (𝟎 ).
1∕ 2 𝜃=𝜓 −1 ◦𝑓 ◦𝜙 1 2
We calculate the effect of the maps step by step. Recall that we write |𝐱| for the norm of
points 𝐱 ∈ ℝ𝑘 . First we apply 𝜙 to a point in 𝔹3 (𝟎3 ):
√
⎛ 1 − |𝐱|2 ⎞
⎛𝑥⎞ ⎜ ⎟
𝑥
𝜙 ∶ 𝐱 = ⎜𝑦⎟ → ⎜ ⎟.
⎜ ⎟ ⎜ 𝑦 ⎟
⎝𝑧⎠ ⎜ ⎟
⎝ 𝑧 ⎠
50 2.4. Tangent spaces and the derivative
Next we apply the composite 𝑓 ◦𝜙:
√
⎛𝑥⎞ ⎛ 2𝑦 1 −√ |𝐱|2 + 2𝑥𝑧 ⎞
𝑓 ◦𝜙 ∶ 𝐱 = ⎜𝑦⎟ → ⎜ 2𝑥𝑦 − 2 1 − |𝐱|2 𝑧 ⎟
⎜ ⎟ ⎜ ⎟
⎝𝑧⎠ ⎝1 − |𝐱|2 + 𝑥2 − 𝑦2 − 𝑧2 ⎠
√
⎛2𝑦 1 −√|𝐱|2 + 2𝑥𝑧⎞
= ⎜2𝑥𝑦 − 2 1 − |𝐱|2 𝑧⎟ .
⎜ ⎟
⎝ 1 − 2𝑦2 − 2𝑧2 ⎠
Now we need to apply this result to compute the horizontal map 𝑑𝑓𝑞3 such that following
diagram for tangent spaces commutes:
𝑑𝑓𝐪3
𝑇𝐪3 (𝕊3 ) / 𝑇 (𝕊2 )
𝐧2
O O
𝑑𝜙𝟎3 𝑑(𝜙−1 )𝐪3 𝑑𝜓𝟎2
ℝ3 / ℝ2 .
𝑑𝜃𝟎3
At the origin we set 𝑥 = 𝑦 = 𝑧 = 0 and see that 𝑑𝜃𝟎3 ∶ ℝ3 → ℝ2 is given by the matrix
( )
0 2 0
𝑑𝜃𝟎3 =
0 0 −2
Now we make our standard choice of bases of 𝑇𝐪3 (𝕊3 ), i.e., 𝑑𝜙𝟎3 (𝐞31 ), 𝑑𝜙𝟎3 (𝐞32 ), 𝑑𝜙𝟎3 (𝐞33 ),
and of 𝑇𝐧2 (𝕊2 ), i.e., 𝑑𝜓𝟎2 (𝐞21 ), 𝑑𝜓𝟎2 (𝐞22 ). Then — with respect to these bases for 𝑇𝐪3 (𝕊3 ) and
𝑇𝐧2 (𝕊2 ) — the composition
Remark 2.45 (Outlook to orientations) We will appreciate this example even more
when we have learned about orientations. For the above computation shows that 𝑑𝑓𝐪3
sends a positively oriented basis to a negatively oriented basis. In other words, 𝑑𝑓𝐪3
reverses orientations. More on this later.
In this section we look at yet another example of an interesting space, the tangent bundle, which
is formed by the collection of tangent spaces of a given smooth manifold 𝑋. We will see that
it is itself a smooth manifold. Later on we will learn that it tells us quite a lot about the
geometry of 𝑋. Moreover, the tangent bundle will turn out to be an extremely useful tool for
many constructions. See for example Section 9.5.
Advice: The reader who may not feel comfortable yet with tangent spaces and what they
are may want to skip this section first and get back to it later when we use the tangent bundle.
Hence in order to be able to keep track of the information contained in all the different
tangent spaces we need a smart device that keeps those spaces apart:
Definition 2.46 (Tangent bundle) The tangent bundle of 𝑋, denoted 𝑇 (𝑋), is the
subset of 𝑋 × ℝ𝑁 defined by
In particular, 𝑇 (𝑋) contains a natural copy of 𝑋 consisting of the points (𝑥, 0).
In the direction perpendicular to 𝑋0 , it contains copies of each tangent space 𝑇𝑥 (𝑋)
embedded as the sets
𝜋 ∶ 𝑇 (𝑋) → 𝑋, (𝑥, 𝑣) → 𝑥.
52 2.5. Tangent Bundle
Any smooth map 𝑓 ∶ 𝑋 → 𝑌 induces a global derivative map
Note that, since 𝑋 ⊂ ℝ𝑁 and 𝑇𝑥 (𝑋) ⊂ ℝ𝑁 for every 𝑥, 𝑇 (𝑋) is also a subset of Euclidean
space:
𝑇 (𝑋) ⊂ ℝ𝑁 × ℝ𝑁 .
Therefore, if 𝑌 ⊂ ℝ𝑀 , then 𝑑𝑓 maps a subset of ℝ2𝑁 to ℝ2𝑀 .
Claim: 𝑑𝑓 is smooth.
We can also say something about the derivative of the composition of smooth maps: Given
smooth maps 𝑓 ∶ 𝑋 → 𝑌 and 𝑔 ∶ 𝑌 → 𝑍, the global derivative of the composite is equal to
the composite of global derivatives:
For, the chain rule implies that, for any (𝑥, 𝑣) ∈ 𝑇 (𝑋),
Theorem 2.47 (Tangent bundles are intrinsic) Diffeomorphic manifolds have dif-
feomorphic tangent bundles. As a result, 𝑇 (𝑋) is an object intrinsically associated to
𝑋, i.e., it does not depend on the ambient Euclidean space.
Finally, we are going to show that 𝑇 (𝑋) is in fact itself a smooth manifold. We will use this
fact in the proof of Whitney’s theorem below.
Let 𝑊 be an open set of 𝑋. In particular, 𝑊 is also a manifold, and we can consider its
tangent bundle 𝑇 (𝑊 ). Since 𝑇𝑥 (𝑊 ) = 𝑇𝑥 (𝑋) for every 𝑥 ∈ 𝑊 , 𝑇 (𝑊 ) is by definition
Theorem 2.48 (Tangent bundles are manifolds) The tangent bundle of a manifold
𝑋 is a smooth manifold of dimension dim 𝑇 (𝑋) = 2 dim 𝑋.
In the next section, we are going to use the tangent bundle as a tool to construct new maps.
The key will be that the tangent bundle gives us extra space for manoeuvring. But before we
do this we take a brief detour.
Tangent bundles are examples of a more general class of spaces, called vector bundles. They
can be defined on any topological space. But let us assume we have a manifold 𝑋. Roughly
speaking, an 𝑛-dimensional vector bundle 𝐸 consists of two data:
∙ a rule for how to glue all these vector spaces together in a nice way.
If it is possible to choose the open subset around 𝑥 in the above condition to be all of 𝑋,
then we call 𝐸 → 𝑋 a trivial bundle.
We can refine this definition and say that 𝐸 → 𝑋 is a smooth vector bundle if we require
in addition that
∙ 𝐸 is a smooth manifold
∙ 𝜋 ∶ 𝐸 → 𝑋 is a smooth map
We conclude the detour with a famous example of a problem which can be phrased in terms
of vector bundles:
Remark 2.49 (Parallelizable spheres) A manifold for which the tangent bundle is
trivial is called parallelizable. Examples of manifolds which are parallelizable are 𝕊1 ,
𝕊3 and 𝕊7 , whereas 𝕊2 is not parallelizable. In fact, it is a famous and deep result that 𝕊𝑛
is parallelizable if and only if 𝑛 = 0, 1, 3 or 7. This is a consequence of the famous and
fundamental result on the possible multiplicative structures on ℝ𝑛 . For the above
statement follows from: Let ℝ𝑛 × ℝ𝑛 → ℝ𝑛 be a map with two-sided identity element
and no zero-divisors. Then 𝑛 must be either 1, 2, 4 or 8.
Chapter 2. Smooth manifolds 55
2.6 Exercises and more examples
Exercise 2.3 For a real number 𝑟 > 0, let 𝔹𝑟 = 𝔹𝑘𝑟 (0) = {𝑥 ∈ ℝ𝑘 ∶ |𝑥| < 𝑟} be the
open ball around the origin with radius 𝑟 in ℝ𝑘 .
is a diffeomorphism from 𝔹𝑟 to ℝ𝑘 .
Hint: Compute the inverse directly, and use the previous exercise to show smooth-
ness.
(b) Suppose that 𝑋 is a 𝑘-dimensional manifold. Show that every point in 𝑋 has a
neighborhood diffeomorphic to an open ball in ℝ𝑘 around the origin.
(c) Suppose that 𝑋 is a 𝑘-dimensional manifold. Show that every point in 𝑋 has a
neighborhood diffeomorphic to all of ℝ𝑘 .
Comment: Recall that choosing a basis for 𝑉 corresponds to choosing a linear iso-
morphism 𝜙 ∶ ℝ𝑘 → 𝑉 . Expressing a vector in 𝑉 in terms of this basis means to attach
coordinates to this vector. Since 𝜙 is linear, we refer to the corresponding coordinates as
linear coordinates.
56 2.6. Exercises and more examples
Exercise 2.5 Recall the hyperboloid and the cone drawn in Figure 2.10:
Exercise 2.6 The torus 𝕋 (𝑎, 𝑏) is the set of points in ℝ3 at distance 𝑏 from the circle
of radius 𝑎 in the 𝑥𝑦-plane, where 0 < 𝑏 < 𝑎. Prove that each 𝕋 (𝑎, 𝑏) is diffeomorphic to
𝕊1 × 𝕊1 ⊂ ℝ4 . What happens when 𝑏 = 𝑎?
1
(𝑥1 , … , 𝑥𝑘+1 ) → (𝑥 , … , 𝑥𝑘 ).
1 − 𝑥𝑘+1 1
(c) Let 𝑆 = (0, … , 0, −1) ∈ 𝕊𝑘 be the ‘south pole’. Describe the parametrization
using the stereographic projection starting in 𝑆 instead of 𝑁, and conclude that 𝕊𝑘
is a 𝑘-dimensional manifold.
(b) Show that 𝜋(𝑧0 , 𝑧1 ) = 𝜋(𝑤0 , 𝑤1 ) if and only if there is a complex number 𝛼 with
|𝛼|2 = 𝛼 𝛼̄ = 1 such that (𝑤0 , 𝑤1 ) = (𝛼𝑧0 , 𝛼𝑧1 ).
(c) Show that, for every point 𝑝 ∈ 𝕊2 , the fiber 𝜋 −1 (𝑝) is diffeomorphic to 𝕊1 .
We conclude that the Hopf map 𝜋 realizes 𝕊3 as a disjoint union of fibers which each
look like 𝕊1 .
Chapter 2. Smooth manifolds 57
2.6.2 Tangent spaces
𝑧2 = 𝑎 at ( 𝑎, 0, 0) for 𝑎 > 0.
Define 𝐹 ∶ 𝑋 → Γ(𝑓 ) by 𝐹 (𝑥) = (𝑥, 𝑓 (𝑥)). We assume that 𝑋 and 𝑌 are smooth
manifolds and 𝑓 is a smooth map.
(b) We also write 𝐹 for the composite map 𝐹 ∶ 𝑋 → 𝑋 × 𝑌 , 𝑥 → (𝑥, 𝑓 (𝑥)). Show
that 𝑑𝐹𝑥 (𝑣) = (𝑣, 𝑑𝑓𝑥 (𝑣)). (You can use 𝑇(𝑥,𝑦) (𝑋 × 𝑌 ) = 𝑇𝑥 (𝑋) × 𝑇𝑦 (𝑌 ).)
(c) Show that the tangent space to Γ(𝑓 ) at the point (𝑥, 𝑓 (𝑥)) is the graph of
𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝑋) → 𝑇𝑓 (𝑥) (𝑌 ).
For understanding smooth manifolds, it can be smart to study maps between manifolds even
though it sounds like making things even more difficult. But assume we know something about
𝑋 and about a map 𝑓 ∶ 𝑋 → 𝑌 , then we might be able to say something interesting about 𝑌 .
In addition, there are a lot of interesting problems which can be stated in terms of properties of
maps.
So let 𝑓 ∶ 𝑋 → 𝑌 be a smooth map between smooth manifolds. Remember that the deriva-
tive at 𝑥 ∈ 𝑋, 𝑑𝑓𝑥 ∶ 𝑇𝑥 𝑋 → 𝑇𝑓 (𝑥) 𝑌 , is a linear map between vector spaces. We have learned
that we may think of the derivative as the best linear approximation at a point. Since it is eas-
ier to understand linear maps, it would be nice if we could classify maps like 𝑓 by the behavior
of 𝑑𝑓𝑥 (with 𝑥 varying in 𝑋).
For the behaviour 𝑑𝑓𝑥 , there are three cases which we are going to study:
∙ dim 𝑋 = dim 𝑌 in which case the nicest possible behaviour of 𝑓 at 𝑥 is that 𝑑𝑓𝑥 an
isomorphism.
∙ dim 𝑋 < dim 𝑌 in which case the nicest possible behaviour of 𝑓 at 𝑥 is that 𝑑𝑓𝑥 one-to-
one.
∙ dim 𝑋 > dim 𝑌 in which case the nicest possible behaviour of 𝑓 at 𝑥 is that 𝑑𝑓𝑥 onto.
∙ First case: 𝑑𝑓𝑥 is an isomorphism. We begin with the nicest case when 𝑑𝑓𝑥 is an
isomorphism. This implies in particular: dim 𝑋 = dim 𝑌 .
Manifolds are characterized by the way they look in a neighborhood around any point. So
let us think locally. In the nicest case, 𝑓 sends a neighborhood of a point 𝑥 diffeomorphically
to a neighborhood of 𝑦 = 𝑓 (𝑥). In this case, 𝑓 is called a local diffeomorphism at 𝑥. More
precisely, we define:
58
Chapter 3. The Inverse Function Theorem, immersions and embeddings 59
subset 𝑈 ⊂ 𝑋 containing 𝑥 such that 𝑓 (𝑈 ) ⊂ 𝑌 is open in 𝑌 and
𝑓|𝑈 ∶ 𝑈 → 𝑓 (𝑈 )
But we obviously have 𝑑(Id𝑋 ) = Id𝑇𝑥 (𝑋) for any manifold 𝑋 and any point 𝑥 ∈ 𝑋. Hence 𝑑𝑓𝑥
is an isomorphism with inverse 𝑑(𝑓 −1 )𝑓 (𝑥) .
Remark 3.3 (It’s a map not a fraction) Note that this is exactly the formula you are
used to from Calculus 1 where we learned
(𝑓 −1 )′ (𝑦) = (𝑓 ′ (𝑓 −1 (𝑦)))−1 .
1
You may be used to this formula as (𝑓 −1 )′ (𝑦) = 𝑓 ′ (𝑓 −1 (𝑦))
from Calculus. But the fraction
here is misleading, since (𝑓 −1 )′ (𝑦) is a linear map. The superscript “to the −1" really
means take the inverse map! In dimension 1, the inverse map happens to be given by
multiplication by the inverse number. But for linear maps or matrices in dimensions > 1,
we cannot write the inverse as a fraction.
Theorem 3.4 (Inverse Function Theorem) Let 𝑋 and 𝑌 be smooth manifolds. Sup-
pose that 𝑓 ∶ 𝑋 → 𝑌 is a smooth map whose derivative
Idea of Proof: We can assume that 𝑋 and 𝑌 are subsets in ℝ𝑁 for some large 𝑁. Let
𝜙 ∶ 𝑈 → 𝑋 be a local parametrization around 𝑥 ∈ 𝑋, and 𝜓 ∶ 𝑊 → 𝑌 a local parametrization
around 𝑦 = 𝑓 (𝑥) ∈ 𝑌 with 𝑈 ⊂ ℝ𝑛 and 𝑊 ⊂ ℝ𝑛 open and 𝜙(0) = 𝑥 and 𝜓(0) = 𝑦.1
Then recall that 𝑑𝑓𝑥 is defined such that the following diagram commutes
𝑑𝑓𝑥
𝑇𝑥 (𝑋) / 𝑇 (𝑌 )
O 𝑦
O
𝑑𝜙0 𝑑(𝜙−1 )𝑥 𝑑𝜓0
ℝ𝑘 / ℝ𝑙 .
𝑑𝜃0
Our assumption is that 𝑑𝑓𝑥 is an isomorphism which implies that 𝑑𝜃0 is an isomorphism.
By the IFT in Calculus, this implies that
∙ 𝜃|𝑉 ∶ 𝑉 → 𝑉 ′ is a diffeomorphism.
Since 𝜙 and 𝜓 are diffeomorphisms, 𝜙(𝑉 ) ⊆ 𝑋 and 𝜓(𝑉 ′ ) ⊆ 𝑌 are open neighborhoods
of 𝑥 and 𝑦, respectively. Moreover, 𝜙|𝑉 and 𝜓|𝑉 ′ are local parametrizations around 𝑥 and 𝑦,
respectively, and
𝑓|𝜙(𝑉 ) ∶ 𝜙(𝑉 ) → 𝜓(𝑉 ′ )
is a diffeomorphism.
Note that this is a local statement, i.e., if 𝑑𝑓𝑥 is invertible, it only tells us that 𝑓 is invertible
in a neighborhood of 𝑥. Even if 𝑑𝑓𝑥 is invertible for every 𝑥 ∈ 𝑋, one cannot conclude that
𝑓 ∶ 𝑋 → 𝑌 is globally a diffeomorphism. But such an 𝑓 is a local diffeomorphism for every
point 𝑥 ∈ 𝑋. We call such a map a local diffeomorphism (without having to refer to a point).
is a global diffeomorphism.
1
Note that the dimension has to be the same when the tangent spaces are isomorphic.
Chapter 3. The Inverse Function Theorem, immersions and embeddings 61
Figure 3.1: Locally, the map 𝑓 is a diffeomorphism at any point. But it cannot be a global
diffeomorphism, since it is not injective.
Remark 3.8 (𝑑𝑓𝑥 looks like the identity) In some situations it would be nice if we
could assume that the linear isomorphism 𝑑𝑓𝑥 was the identity. This is usually not the
case of course. But our freedom of choosing local parametrizations allows us to do the
following. Assume that 𝑑𝑓𝑥 is an isomorphism as in the IFT. Then we can choose local
parametrizationsa 𝜙 ∶ 𝑈 → 𝑋 and 𝜓 ∶ 𝑈 → 𝑌 around 𝑥 and 𝑓 (𝑥), respectively, with
the same open domain 𝑈 ⊂ ℝ𝑛 , such that the diagram commutes:
𝑓
𝑋O /𝑌
O
𝜙 𝜓
𝑈 / 𝑈.
Id𝑈
∙ 𝜓 with
√
𝜓 ∶ 𝑡 → (− 1 − sin2 𝑡, sin 𝑡) = (cos 𝑡, sin 𝑡).
a
We are going to explain how to choose suitable parametrizations in the next sections.
Chapter 3. The Inverse Function Theorem, immersions and embeddings 63
3.2 Immersions and embeddings
3.2.1 Immersions
We continue our study of smooth maps between manifolds using the behaviour of their deriva-
tive. Let 𝑓 ∶ 𝑋 → 𝑌 be a smooth map. We would like to understand how much do we know
about 𝑓 if the derivative is injective. Note that this is only possible if dim 𝑋 ≤ dim 𝑌 , so this
is a silent assumption in this chapter.
∙ Every linear injective map 𝐿 ∶ ℝ𝑘 → ℝ𝑛 is an immersion. This follows from the fact
that 𝐿 equals its own derivative 𝑑𝐿𝑥 at every point 𝑥 ∈ ℝ𝑘 .
is smooth, since all its components are smooth functions. The derivative of 𝑓 at a point
(𝑠, 𝑡) can be described in the standard bases of ℝ2 and ℝ3 , respectively, by the matrix
Before we study more examples and interesting phenomena, we pause for a moment and
show a technical and useful result about immersions.
We have all seen many injective maps before. In fact, among all the injective maps ℝ𝑘 →
ℝ𝑛 with 𝑘 ≤ 𝑛 there is a simplest one, namely the map
which sends the coordinates of a point 𝐱 ∈ ℝ𝑘 to the first 𝑘 coordinates in ℝ𝑛 and adds 0 at the
remaining 𝑛 − 𝑘 positions. This corresponds to the inclusion ℝ𝑘 × {𝟎} ⊂ ℝ𝑛 where 𝟎 denotes
the (𝑛 − 𝑘)-tuple of zeros. The map 𝜄𝑘𝑛 is called the canonical immersion from ℝ𝑘 into ℝ𝑛 .
The matrix which represents 𝜄𝑘𝑛 with respect to the standard bases of ℝ𝑘 and ℝ𝑛 , respectively,
is the 𝑛 × 𝑘-matrix 𝐽𝑛𝑘 which has the 𝑘 × 𝑘-identity matrix in the first 𝑘 rows and only zeros in
the remaining 𝑛 − 𝑘 rows at the bottom. Not all injective maps ℝ𝑘 → ℝ𝑛 are as simple as 𝜄𝑘𝑛 .
However, we will see very soon that it does not get that worse either, at least locally.
To motivate the next theorem let us start with a linear injective map 𝐿 ∶ ℝ𝑘 → ℝ𝑛 . Say
we do not like transformations between spaces of different dimensions. The only one we think
is OK is the canonical one 𝜄𝑘,𝑛 . And say we do like isomorphisms of ℝ𝑛 to itself. Then we could
write 𝐿 as the composition
𝜄𝑘𝑛 𝛼
ℝ𝑘 ←←←→
← ℝ𝑛 ←←→
← ℝ𝑛
where 𝛼 is a linear isomorphism determined by sending the first 𝑘 standard basis vectors 𝐞𝑛1 , … , 𝐞𝑛𝑘
of ℝ𝑛 to the images 𝐯1 = 𝐿(𝐞𝑘1 ), … , 𝐯𝑘 = 𝐿(𝐞𝑘𝑘 ) in ℝ𝑛 of the standard basis vectors of ℝ𝑘 and
by sending the remaining basis vectors 𝐞𝑛𝑘+1 , … , 𝐞𝑛𝑛 of ℝ𝑛 to basis vectors 𝐰𝑘+1 , … , 𝐰𝑛 of the
orthogonal complement of the image of 𝐿 in ℝ𝑛 . Since 𝐿 is injective, this complement has
dimension 𝑛 − 𝑘.
So what happened is that we placed a copy of ℝ𝑘 into ℝ𝑛 as the 𝑘-dimensional plane con-
taining the origin via 𝜄𝑘𝑛 and then we move this plane inside ℝ𝑛 to the position of 𝐿(ℝ𝑘 ).
Actually, we are used to such a manoeuvre. For what 𝛼 does is changing the basis of ℝ𝑛
from the standard basis to the new basis which consists of the vectors 𝐯1 , … , 𝐯𝑘 , 𝐰𝑘+1 , … , 𝐰𝑛 .
With respect to this new basis for ℝ𝑛 and the standard basis for ℝ𝑘 , the matrix which repre-
sents 𝐿 is the 𝑛 × 𝑘-matrix 𝕊𝑘𝑛 described above.
We can translate this idea to immersions between smooth manifolds. In other words — up
to diffeomorphisms — the canonical immersion is locally the only immersion:
Chapter 3. The Inverse Function Theorem, immersions and embeddings 65
𝑓
𝑋O /𝑌
O
𝜙 𝜓
canonical immersion /𝑊
𝑈
𝜃=𝜓 −1 ◦𝑓 ◦𝜙
Proof of the Local Immersion Theorem 3.10: We start by choosing any local parametriza-
tion 𝜙 ∶ 𝑈 → 𝑋 with 𝜙(0) = 𝑥 and 𝜓 ∶ 𝑊 → 𝑌 with 𝜓(0) = 𝑦:
𝑓
𝑋O /𝑌
O
𝜙 𝜓
𝑈 /𝑊.
𝜃=𝜓 −1 ◦𝑓 ◦𝜙
The plan is to manipulate 𝜙 and 𝜓 such that 𝜃 becomes the canonical immersion.
By the assumption, 𝑑𝜃0 ∶ ℝ𝑘 → ℝ𝑛 is injective. Hence, after choosing a suitable basis for
ℝ𝑛 , we can assume that 𝑑𝜃0 is given by the 𝑛 × 𝑘-matrix 𝐽𝑛𝑘 which has the 𝑘 × 𝑘-identity matrix
in the first 𝑘 rows and only zeros in the remaining 𝑛 − 𝑘 rows. Now we define a new map
canonical / 𝑈 × ℝ𝑛−𝑘
𝑈
immersion
𝜃 Θ
y
ℝ𝑛 .
Since 𝜃 is a local diffeomorphism at 0, we can choose 𝑈 and 𝑉 small enough such that 𝜃 sends
open sets to open sets. By the assumption on 𝑑𝜃0 and the construction of Θ, 𝑑Θ0 is represented
by the 𝑛 × 𝑛-identity matrix. By the Inverse Function Theorem, this implies that Θ is a local
diffeomorphism of ℝ𝑛 to itself at 0. Hence we can find an open subset 𝑉 ⊂ 𝑈 × ℝ𝑛−𝑘 and
𝑊 ′ ⊂ ℝ𝑛 such that Θ|𝑉 is a diffeomorphism. After possibly shrinking 𝑈 to an open subset 𝑈 ′
66 3.2. Immersions and embeddings
we get the commutative diagram
𝑓
𝑋O /𝑌
O
𝜙 𝜓
𝑈′
𝜃 /𝑊
O
canonical Θ
immersion ,𝑊′
Since 𝜓 and Θ are local diffeomorphisms at 0, so is the composition 𝜓◦Θ. Hence we can
use 𝜓◦Θ as a local parametrization around 𝑦.
Remark 3.11 We observe from the proof that to be an immersion is a local condition,
i.e., if 𝑓 ∶ 𝑋 → 𝑌 is an immersion at 𝑥, then it is also an immersion for all points
in a neighborhood of 𝑥. For, the local parametrization 𝜙 ∶ 𝑈 → 𝑋 of the proof also
parametrizes any point in the image of 𝜙. This is an open subset around 𝑥 because 𝜙 is
a diffeomorphism onto its image. Hence in order to say more about 𝑓 we need to add
some global topological properties to the local differential data. Recall that for a local
diffeomorphism to be a global one, it suffices to require to be one-to-one and onto. We
will now study what kind of additional property we wish to impose on immersions to
behave nicely.
Even though an immersion does not have to be one-to-one itself, the injectivity of the derivative
does not leave much room for different phenomena in the fibers. So let us have a closer look at
the image of an immersion 𝑓 ∶ 𝑋 → 𝑌 , i.e., the subset 𝑓 (𝑋) ⊂ 𝑌 .
First observations:
Chapter 3. The Inverse Function Theorem, immersions and embeddings 67
∙ Recall the map 𝑓 ∶ ℝ2 → ℝ3 in the above examples for which the image is the two-
dimensional torus. This provides an example where 𝑓 (𝑋) is a submanifold. However, 𝑓
is not a diffeomorphism onto its image, since 𝑓 is not one-to-one.
∙ Note also that we can, in principle, answer the second question independently of the
first. For we can check if 𝑓 has an inverse which is defined on the subset 𝑓 (𝑋) ⊂ 𝑌 and
check if this map is smooth. However, we will see below that if 𝑓 is a diffeomorphism
onto 𝑓 (𝑋), then this will provide the subspace 𝑓 (𝑋) ⊂ with the structure of a smooth
manifold and hence this will be a submanifold in 𝑌 .
We have seen that 𝑓 (𝑋) often is a manifold. But there are also examples where this is not
the case. We will have a look at them now:
Example 3.12 (Figure eight as twist) We just look at Figure 3.2 without making the
formula for 𝑓 explicit. We see here that 𝑓 (𝑋) is not a manifold as a subset of ℝ2 , since
the intersection point of the two branches does not have a local parametrization. We also
see that 𝑓 is not a diffeomorphism onto its image, since 𝑓 is not one-to-one.
Figure 3.2: The map 𝑓 twists the circle once. It is an immersion, but it is not injective. The
image of 𝑓 , considered as a subspace of ℝ2 , is not a submanifold.
Example 3.13 (Figure eight immersion as wrap) So let us modify the map to make it
one-to-one. Consider the map
𝑓 ∶ (−𝜋, 𝜋) → ℝ2 , 𝑡 → (sin 2𝑡, sin 𝑡).
The image of 𝑓 is called a lemniscate, the locus of points (𝑥, 𝑦) satisfying 𝑥2 = 4𝑦2 (1 −
𝑦2 ). See Figure 3.3.
We can check that 𝑓 is smooth, one-to-one and an immersion. For the latter note
that 𝑑𝑓𝑡 can be represented by the 2 × 1-matrix
( )
2 cos 2𝑡
𝐽𝑓 (𝑡) =
cos 𝑡
which is never zero for 𝑡 ∈ (−𝜋, 𝜋) and hence, as a linear map between one-dimensional
vector spaces, 𝑑𝑓𝑡 is an isomorphism for all 𝑡.
Nevertheless, 𝑓 is still not a diffeomorphism onto its image. In fact, we see that 𝑓 is
not a homeomorphism onto its image 𝑓 (𝑋). Moreover, the image 𝑓 (𝑋) is the same as
above and not a manifold.
68 3.2. Immersions and embeddings
Figure 3.3: The map 𝑓 wraps ℝ along the figure eight. It is an immersion and one-to-one.
However, the image of 𝑓 , considered as a subspace of ℝ2 , is still not a submanifold. Note that,
even though 𝑓 is a local diffeomorphism and bijective onto its image, this is not a contradiction
to Lemma 3.7, since the image of 𝑓 , i.e., the figure eight, is not a manifold in ℝ2 .
Let us look at a classical example of another case of a map which is a one-to-one immersion,
but not a homeomorphism — and hence not a diffeomorphism — onto its image. We will not,
however, show that 𝑓 (𝑋) is not a submanifold.
The map 𝐺 is a local diffeomorphism from the plane onto the two-dimensional torus 𝕋 2 .
Figure 3.4: The map 𝛾 wraps a line around the torus. If the slope is irrational, then the image
will never meet itself and is dense on 𝕋 2 .
Chapter 3. The Inverse Function Theorem, immersions and embeddings 69
Example 3.14 (Image of a line with irrational slope) The map 𝛾 is an immersion
because 𝑑𝛾𝑡 is nonzero for every 𝑡, and, as above, a nonzero linear map from a one-
dimensional vector space to another is automatically injective. Moreover, 𝛾 itself is in-
jective, since 𝛾(𝑡1 ) = 𝛾(𝑡2 ) implies
Since 𝛼 is irrational, this implies 𝑡1 = 𝑡2 . One can show that the image of 𝛾 is a dense
subset in 𝕋 2 .
However, 𝛾 is not a diffeomorphism onto its image, since it is not even a homeo-
morphism:
For, look at the set 𝛾(ℤ) = {𝛾(𝑛) ∶ 𝑛 ∈ ℤ}. By Dirichlet’s Approximation Theorem,
for every 𝜀 > 0, there are integers 𝑛 and 𝑚 such that
|𝛼𝑛 − 𝑚| < 𝜖.
Since the line segment between two points 𝑒2𝜋𝑖𝑡1 and 𝑒2𝜋𝑖𝑡2 on the unit circle is shorter
than the circular arc of length |𝑡1 − 𝑡2 |, we have
|𝛾(𝑛) − 𝛾(0)|
( )
= |(𝑒2𝜋𝑖𝑛 , 𝑒2𝜋𝑖𝛼𝑛 ) − (𝑒0 , 𝑒0 )| = | 1, 𝑒2𝜋𝑖𝛼𝑛 − (1, 1) |
= |𝑒2𝜋𝑖𝛼𝑛 − 𝑒2𝜋𝑖𝑚 | ≤ 2𝜋|𝛼𝑛 − 𝑚| < 2𝜋𝜀.
Thus, there is a sequence of integers such that 𝛾(𝑛) converges to 𝛾(0), i.e., 𝛾(0) is a limit
point in 𝛾(ℤ). The image of a convergent sequence under a continuous map is again
a convergent sequence.a Hence if 𝛾 −1 was continuous, then 0 = 𝛾 −1 (𝛾(0)) had to be a
limit point as well. However, ℤ does not have any limit points in ℝ. Hence 𝛾 is not a
homeomorphism onto its image.
We can also show that 𝛾(ℝ) with the subspace topology in 𝕋 2 is not a manifold. We
leave this as an exercise for the moment.
a
This is actually an alternative way to define what continuity means.
3.3 Embeddings
3.3.1 Embeddings
We have seen that both our questions we asked earlier may have negative answers. Let us now
focus on the situation when they do have a positive answer. We give this case a name:
70 3.3. Embeddings
Note that an embedding must be one-to-one, since it is a diffeomorphism onto its image.
Now let 𝑓 ∶ 𝑋 → 𝑌 be an immersion which is also one-to-one. Let us try to show what 𝑓
is an embedding to see which additional assumption we have to make:
𝑓
𝑋O / 𝑓 (𝑋)
O
𝐨𝐩𝐞𝐧 ✓ 𝐨𝐩𝐞𝐧 ?
? 𝑓|𝑊 ≅ ?
𝑊O / 𝑓 (𝑊 )
4 O
𝑓|𝑊 ◦𝜙 𝜓|𝑉 ∩𝜓 −1 (𝑓 (𝑊 ))
𝜙 ≅
≅
canonical / 𝑉 ∩ 𝜓 −1 (𝑓 (𝑊 )).
𝑈
immersion
Since the bottom horizontal map is the canonical immersion, the restriction 𝑓|𝑊 is a
diffeomorphism. As 𝜙 is one as well, we see that the composite 𝑓|𝑊 ◦𝜙 ∶ 𝑈 → 𝑓 (𝑊 ) is
a diffeomorphism. Thus 𝑓 (𝑊 ) is diffeomorphic to an open subset in ℝ𝑛 .
∙ Since we can do this for every point 𝑦 ∈ 𝑓 (𝑋), we would like to say that the collection
of diffeomorphisms 𝑓|𝑊 ◦𝜙 for varying 𝜙 provide local parametrizations for 𝑓 (𝑋).
∙ However, we do not know that 𝑓 (𝑊 ) is open in 𝑓 (𝑋). This does not follow from the
given assumptions on 𝑓 .
∙ Let us assume for a moment that we were lucky and 𝑓 (𝑊 ) was open for each open subset
𝑊 ⊂ 𝑋. Then we could conclude that 𝑓 is a diffeomorphism onto its image. For then 𝑓
would be a bijective and a local diffeomorphism, since for every point 𝑥 ∈ 𝑋 there would
be an open subset 𝑥 ∈ 𝑊 ⊂ 𝑋 and an open subset 𝑓 (𝑊 ) ⊂ 𝑓 (𝑋) such that 𝑓|𝑊 is a
diffeomorphism 𝑊 → 𝑓 (𝑊 ). A bijective local diffeomorphism is a diffeomorphism
by Lemma 3.7 and Exercise 3.2.
We learn from this discussion that we need to find conditions which ensure that 𝑓 (𝑊 ) is open
in 𝑓 (𝑋). This is the case if 𝑓 is an open map, i.e., if 𝑓 sends every open subset in 𝑋 to an
Chapter 3. The Inverse Function Theorem, immersions and embeddings 71
Figure 3.5: One might think that 𝑓 (𝑊 ) provides a local parametrization, but it is not neces-
sarily open in 𝑓 (𝑋). Hence it is notguaranteed that 𝑓 (𝑋), considered as a subspace of 𝑌 , is a
manifold.
open subset in 𝑌 . Equivalently, we could require that 𝑓 is a closed map, i.e., 𝑓 sends every
closed subset in 𝑋 to a closed subset in 𝑌 . Note that this fits well into what we have seen in
the last two examples. For, there 𝑓 was neither closed nor open and 𝑓 ∶ 𝑋 → 𝑓 (𝑋) failed to
be a homeomorphism. A condition that is often easier to test is the following: Recall that for a
general continuous map, the image of any compact set is compact. However, the preimage of
a compact subset is, in general, not compact.
Being proper turns out to be a sufficient global topological constraint for our purposes:
∙ 𝑓 is an open map, or
∙ 𝑓 is a closed map, or
∙ 𝑓 is a proper map.
Then 𝑓 is an embedding.
Before we prove the theorem, let us assume for a moment that 𝑋 is compact. Then every
continuous map 𝑓 ∶ 𝑋 → 𝑌 is proper. This follows from the fact that closed subsets of compact
sets are compact. Hence we can deduce from the theorem the following important special case:
Corollary 3.18 (Compact domain) For a compact smooth manifold 𝑋, every one-to-
one immersion 𝑓 ∶ 𝑋 → 𝑌 is an embedding.
72 3.3. Embeddings
Proof of the Embedding Theorem 3.17:
2
We will use a little bit more general topology in this proof than we recalled so far. We hope that is ok.
Chapter 3. The Inverse Function Theorem, immersions and embeddings 73
3.4 Exercises and more examples
Exercise 3.1 Let 𝐴 ∶ ℝ𝑛 → ℝ𝑛 be a linear map, and 𝑏 ∈ ℝ𝑛 . Show that the mapping
𝑓 ∶ ℝ𝑛 → ℝ𝑛 , 𝑥 → 𝐴𝑥 + 𝑏
is an embedding.
is an immersion. Is it an embedding?
Exercise 3.6 Consider the map 𝑓 ∶ (0, 3𝜋∕4) → ℝ2 , 𝑡 → sin(2𝑡)(cos 𝑡, sin 𝑡).
(c) To test your understanding answer the following questions (and give reasons for
your answer):
74 3.4. Exercises and more examples
∙ What is the difference between Im (𝑓 ) and the graph Γ(𝑓 )?
∙ Is the map 𝐹 ∶ (0, 3𝜋∕4) → (0, 3𝜋∕4) × ℝ2 , 𝑡 → (𝑡, 𝑓 (𝑡)), an embedding?
∙ Would 𝑓 be an embedding if it was defined on the closed interval [0, 3𝜋∕4]?
∙ Is the map 𝑔 ∶ (0, 3𝜋∕4) → ℝ3 , 𝑡 → sin(2𝑡)(cos 𝑡, sin 𝑡, 𝑡) an embedding?
∙ Is the map ℎ ∶ [0, 3𝜋∕4] → ℝ3 , 𝑡 → (sin(2𝑡) cos 𝑡, sin(2𝑡) sin 𝑡, 2𝑡) an embed-
ding?
4.1 Submersions
4.1.1 Submersions
Let 𝑓 ∶ 𝑋 → 𝑌 be a smooth map. We will now turn to the question how much do we know
about 𝑓 if the derivative is surjective. Note that this will require that dim 𝑋 ≤ dim 𝑌 , and
this is a silent assumption in this chapter. We will see that this case will lead to a very useful
observation about the fibers of smooth maps.
∙ Every linear surjective map 𝐿 ∶ ℝ𝑘 → ℝ𝑛 is a submersion. This follows from the fact
that 𝐿 equals its own derivative 𝑑𝐿𝑥 at every point 𝑥 ∈ ℝ𝑘 .
𝑔 ∶ ℝ2 → ℝ, (𝑥, 𝑦) → 𝑥2 − 𝑦2 .
75
76 4.1. Submersions
∙ We define the map
𝑓 ∶ ℝ4 → ℝ,
(𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ) → 𝑥1 + 𝑥22 + 𝑥33 + 𝑥44 .
Since 𝑑𝑓𝑧 is a linear map with values in ℝ, it suffices to observe that 𝑑𝑓𝑧 is not the zero
map to conclude that 𝑑𝑓𝑧 is surjective for all 𝑧 ∈ ℝ4 . Hence 𝑓 is a submersion.
2
∙ Let 𝑀(𝑛) denote the space of real 𝑛×𝑛-matrices. It is isomorphic as a vector space to ℝ𝑛 ,
since we can write every 𝑛 × 𝑛-matrix as a column vector of length 𝑛2 . Hence 𝑀(𝑛) is
smooth 𝑛2 -dimensional manifold. Let 𝐺𝐿(𝑛) denote the group of invertible 𝑛×𝑛-matrices
with group operation given by matrix multiplication. This is an open subset and hence a
submanifold of 𝑀(𝑛), also of dimension 𝑛2 . The determinant map
det ∶ 𝑀(𝑛) → ℝ
is a submersion at every matrix 𝐴 ∈ 𝐺𝐿(𝑛). We will discuss this claim later in this
section and in Exercise 4.9.
As for immersions, we can — at least locally — make submersions look as simple as pos-
sible. We will discuss this useful observation next.
which sends the coordinates of a point 𝐱 ∈ ℝ𝑛 to the point in ℝ𝑚 with the first 𝑚 coordinates
and forgets the remaining 𝑛 − 𝑚 ones. The map 𝜎𝑚𝑛 is called the canonical submersion from
ℝ𝑛 onto ℝ𝑚 . The matrix which represents 𝜎𝑚𝑛 with respect to the standard bases of ℝ𝑛 and ℝ𝑚 ,
respectively, is the 𝑚 × 𝑛-matrix 𝑆𝑚𝑛 which has the 𝑚 × 𝑚-identity matrix in the first 𝑚 columns
and only zeros in the remaining 𝑛 − 𝑚 columns on the right hand side.
Again, this is a familiar manoeuvre. For 𝛼 changes the basis of ℝ𝑛 from the standard basis
to a new basis which consists of the vectors 𝐯1 , … , 𝐯𝑚 , 𝐰1 , … , 𝐰𝑛−𝑚 . With respect to this new
basis for ℝ𝑛 and the standard basis for ℝ𝑚 , the matrix which represents 𝐿 is the 𝑚 × 𝑛-matrix
𝑆𝑚𝑛 described above.
We can translate this idea to submersions between smooth manifolds. In other words — up
to diffeomorphisms — the canonical submersion is locally the only submersion:
𝑓
𝑋O /𝑌
O
𝜙 𝜓
canonical submersion /𝑊
𝑈
𝜃=𝜓 −1 ◦𝑓 ◦𝜙
Proof of the Local Submersion Theorem: We start by choosing any local parametrizations
𝜙 ∶ 𝑈 → 𝑋 with 𝜙(0) = 𝑥 and 𝜓 ∶ 𝑉 → 𝑌 with 𝜓(0) = 𝑦:
𝑓
𝑋O /𝑌
O
𝜙 𝜓
𝑈 /𝑉
𝜃=𝜓 −1 ◦𝑓 ◦𝜙
Now we are going to manipulate 𝜙 and 𝜓 such that 𝜃 becomes the canonical submersion.
𝑈
𝜃 / ℝ𝑚
<
Θ ! canonical submersion
ℝ𝑛 .
By the construction, the derivative 𝑑Θ0 at 0 is given by the 𝑛 × 𝑛-identity matrix. Hence Θ
is a local diffeomorphism at 0. Thus we can find a small enough neighborhood 𝑈 ′ around 0
in ℝ𝑛 such that Θ−1 exists as a diffeomorphism from 𝑈 ′ ⊂ ℝ𝑛 onto some small neighborhood
around 0 in 𝑈 . By construction, 𝜃 equals the composition of the canonical submersion with Θ,
i.e., we have 𝜃◦Θ−1 = 𝜎𝑚𝑛 on 𝑈 ′ . This gives us the commutative diagram
𝑓
/𝑌
A 𝑋O O
𝜙 𝜓
𝜙◦Θ−1 𝜃 /6 𝑉 .
>𝑈
canonical
Θ−1
submersion
𝑈′
Hence it suffices to replace 𝑈 with 𝑈 ′ and 𝜙 with 𝜙◦Θ−1 to get the desired commutative diagram
𝑓
𝑋O /𝑌
O
𝜙◦𝚯−𝟏 𝜓
canonical /𝑉
𝑈′
submersion
𝑓 −1 (𝑦) = {𝑥 ∈ 𝑋 ∶ 𝑓 (𝑥) = 𝑦} ⊆ 𝑋.
For example, we would very much like 𝑓 −1 (𝑦) to be a smooth manifold itself. Remember
that this was a situation we started with when we defined manifolds. The fiber 𝑓 −1 (𝑦) is the set
of solutions of the equation 𝑓 (𝑥) = 𝑦.
Chapter 4. Submersions and regular values 79
Remark 4.4 (Be aware) Unfortunately, in general, there is no reason for the set 𝑓 −1 (𝑦)
to be a manifold.
We now write ( )
𝜙−1 (𝑥) =∶ 𝑢(𝑥) = 𝑢1 (𝑥), 𝑢2 (𝑥), … , 𝑢𝑛 (𝑥)
to make the local coordinates in 𝑈 of points in 𝑥 ∈ 𝑉 explicit. With this notation, we can
rewrite the above equivalence as
( )
𝑥 ∈ 𝑉 ∩ 𝑓 −1 (𝑦) ⇐⇒ 𝜃 𝑢1 (𝑥), 𝑢2 (𝑥), … , 𝑢𝑛 (𝑥) = 0.
Now, since 𝜃 is the canonical submersion, we can rewrite this yet again as
However, on 𝑓 −1 (𝑦) ∩ 𝑉 , the remaining local coordinate functions 𝑢𝑚+1 (𝑥), … , 𝑢𝑛 (𝑥) do not
have to satisfy any additional condition. Hence we can use them to define a local coordinate
system in ℝ𝑛−𝑚 for points 𝑥 ∈ 𝑓 −1 (𝑦) ∩ 𝑉 . In fact, the subset 𝑓 −1 (𝑦) ∩ 𝑉 ⊂ 𝑓 −1 (𝑦) is open
in 𝑓 −1 (𝑦), since 𝑉 is open in 𝑋 and 𝑓 −1 (𝑦) has the subspace topology. Similarly, the subset
({0} × ℝ𝑛−𝑚 ) ∩ 𝑈 is open in {0} × ℝ𝑛−𝑚 , since 𝑈 is open in ℝ𝑛 . We now identify {0} × ℝ𝑛−𝑚
with ℝ𝑛−𝑚 and write 𝑈 ′ ⊂ ℝ𝑛−𝑚 for the open subset corresponding to ({0} × ℝ𝑛−𝑚 ) ∩ 𝑈 .
We would like this to be possible for every point in the fiber 𝑓 −1 (𝑦). This is not always the
case. So let us give the desired case a name:
Remark 4.7 The above Theorem 4.6 is actually one of the main tools to show that a
space is a smooth manifold. It is an important mile stone on our journey and we should
always have it in mind whenever we are asked to show that a space has the structure of
a smooth manifold.
Before we look at some examples, we determine the tangent spaces of the submanifold given
as the preimage of a regular value:
Lemma 4.8 (Tangent space of a regular fiber) Let 𝑍 = 𝑓 −1 (𝑦) be the fiber over
a regular value 𝑦 ∈ 𝑌 under the smooth map 𝑓 ∶ 𝑋 → 𝑌 . Then the tangent space to
𝑇𝑥 (𝑍) at a point 𝑥 ∈ 𝑍 is the kernel of the derivative
𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝑋) → 𝑇𝑦 (𝑌 ),
Since 𝜎𝑚𝑛 is the canonical submersion, the composition of the lower horizontal map is zero.
Since 𝑇𝑥 (𝑍) is defined as the image of 𝑑𝜙′0 and since the diagram commutes, this shows that
𝑑𝑓𝑥 sends all of 𝑇𝑥 (𝑍) to 0. This proves 𝑇𝑥 (𝑍) ⊆ Ker 𝑑𝑓𝑥 .
Chapter 4. Submersions and regular values 81
On the other hand, since 𝑦 is a regular value, 𝑑𝑓𝑥 is surjective. Hence the dimension of the
kernel of 𝑑𝑓𝑥 is dim 𝑇𝑥 (𝑋) − dim 𝑇𝑦 (𝑌 ) = dim 𝑋 − dim 𝑌 = dim 𝑍. This shows that 𝑇𝑥 (𝑍) is
a subspace of the kernel of 𝑑𝑓𝑥 of the same dimension as Ker 𝑑𝑓𝑥 . Thus 𝑇𝑥 (𝑍) = Ker 𝑑𝑓𝑥 .
The derivative 𝑑𝑓𝑥 at the point 𝑥 = (𝑥1 , … , 𝑥𝑛+1 ) is the linear map given by the matrix
(2𝑥1 … 2𝑥𝑛+1 ) expressed in the standard basis. Thus 𝑑𝑓𝑥 ∶ ℝ𝑛+1 → ℝ is surjective un-
less 𝑓 (𝑥) = 0, so every nonzero real number is a regular value of 𝑓 . In particular, we get
again that the sphere 𝕊𝑛 = 𝑓 −1 (1) is an 𝑛-dimensional manifold.
∙ We will show in the exercises that 1 ∈ ℝ is a regular value of the determinant function
det ∶ 𝐺𝐿(𝑛) → ℝ. This will show that the subgroup 𝑆𝐿(𝑛) of matrices with determinant
one is a smooth manifold. It is is called the special linear group. More on such groups
later.
𝑓 ∶ ℝ4 → ℝ,
(𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ) → 𝑥1 + 𝑥22 + 𝑥33 + 𝑥44 .
Remark 4.9 (Algebraic Geometry in a nutshell) The study of the zeroes of poly-
nomials is the central theme in Algebraic Geometry. This is a classical and fascinating
82 4.2. Regular values
part of pure mathematics. In the past three decades, deep and fascinating connections
between algebraic geometry and homotopy theory have been developed. This is the field
of Motivic Homotopy Theory.
Since 𝑓 −1 (𝑦) can be very complicated if 𝑦 is not regular, the values which are not regular
get the following name:
∙ For example, we will show in Exercise 4.9 that the only critical value of the determinant
map det ∶ 𝑀(𝑛) → ℝ is 0.
∙ Note: All values 𝑦 which are not in the image of 𝑓 also are regular values for 𝑓 .
For, if 𝑓 −1 (𝑦) is the empty set, then there is no condition to be satisfied.
Depending on the relative dimensions of 𝑋 and 𝑌 we can now say how regular values can
arise.
∙ if dim 𝑋 < dim 𝑌 : 𝑦 is a regular value if and only if 𝑦 is not in the image of 𝑓 ; for,
all values in the image are critical (𝑑𝑓𝑥 cannot be surjective when dim 𝑇𝑥 (𝑋) <
dim 𝑇𝑓 (𝑥) (𝑌 )).
A very important application of the Preimage Theorem is that we can use it to show that various
matrix groups are smooth manifolds. Let 𝑀(𝑛) again denote the space of real 𝑛 × 𝑛-matrices
and 𝐺𝐿(𝑛) denote the group of invertible matrices in 𝑀(𝑛).
Let 𝑂(𝑛) be the subgroup of matrices 𝐴 of 𝐺𝐿(𝑛) which satisfy 𝐴𝐴𝑡 = 𝐼 where 𝐴𝑡 denotes
the transpose of 𝐴 and 𝐼 is the 𝑛×𝑛-identity matrix. Note that 𝑂(𝑛) is the subgroup of invertible
matrices which preserve the scalar product of vectors. In particular, matrices in 𝑂(𝑛) preserve
distances and angles in ℝ𝑛 .
Theorem 4.12 (Orthogonal matrices form a smooth manifold) The group 𝑂(𝑛) of
orthogonal matrices is a smooth manifold of dimension 𝑛(𝑛 − 1)∕2.
This map is smooth, since multiplication of matrices is smooth and taking transposes is smooth
as well as we just reshuffle the entries in the matrix. Now we observe 𝑂(𝑛) = 𝑓 −1 (𝐼). Hence,
in order to show that 𝑂(𝑛) is a smooth manifold, we just need to show that 𝐼 is a regular value
for 𝑓 . So let us compute the derivative of 𝑓 at a matrix 𝐴:
𝑓 (𝐴 + 𝑠𝐵) − 𝑓 (𝐴) (𝐴 + 𝑠𝐵)(𝐴 + 𝑠𝐵)𝑡 − 𝐴𝐴𝑡
𝑑𝑓𝐴 (𝐵) = lim = lim
𝑠→0 𝑠 𝑠→0 𝑠
(𝐴 + 𝑠𝐵)(𝐴𝑡 + 𝑠𝐵 𝑡 ) − 𝐴𝐴𝑡 𝐴𝐴𝑡 + 𝑠𝐵𝐴𝑡 + 𝑠𝐴𝐵 𝑡 + 𝑠2 𝐵𝐵 𝑡 − 𝐴𝐴𝑡
= lim = lim
𝑠→0 𝑠 𝑠→0 𝑠
𝑠𝐵𝐴𝑡 + 𝑠𝐴𝐵 𝑡 + 𝑠2 𝐵𝐵 𝑡
= lim = lim 𝐵𝐴𝑡 + 𝐴𝐵 𝑡 + 𝑠𝐵𝐵 𝑡
𝑠→0 𝑠 𝑠→0
= 𝐴𝐵 𝑡 + 𝐵𝐴𝑡 .
Thus 𝐼 is a regular value, and 𝑂(𝑛) is a submanifold of 𝑀(𝑛). We can also calculate the
dimension of 𝑂(𝑛):
𝑛(𝑛 + 1) 𝑛(𝑛 − 1)
dim 𝑂(𝑛) = dim 𝑀(𝑛) − dim 𝑆(𝑛) = 𝑛2 − = .
2 2
We end this section with an outlook to an exciting field which has ramifications to almost
all areas of mathematics:
Remark 4.13 (Lie groups) The manifold 𝑂(𝑛) is an example of a very important
class of smooth manifolds. For, 𝑂(𝑛) is both a smooth manifold and a group such that
the group operations are smooth. For both the multiplication map
are smooth. For the latter note 𝐴−1 = 𝐴𝑡 for 𝐴 ∈ 𝑂(𝑛), though taking inverses is also
smooth for other matrix groups.
In general, a group which is also a manifold such that the group operations are
smooth is called a Lie group. Lie groups are extremely interesting and important
and have a rich and exciting theory. For example, the tangent space at a Lie group at the
identity element is a Lie algebra, a vector space with a certain additional operation. Such
Lie algebras can be classified completely. Lie groups and Lie algebras play an important
role in many different areas of mathematics and physics. We will take a closer look,
though by now way sufficient for their very rich theory, at Lie groups in chapter 5.
We will now take a closer look at a specific and important situation for regular values. This will
be useful later at several occasions.
Chapter 4. Submersions and regular values 85
Suppose 𝑓 ∶ 𝑋 → 𝑌 is a smooth map with dim 𝑋 = dim 𝑌 and 𝑋 compact. Our goal is to
understand the fibers 𝑓 −1 (𝑦) of regular values. We do this in a series of observations:
Proof of the claim: Let 𝑥0 ∈ 𝑋 be a regular point for 𝑓 , i.e., 𝑑𝑓𝑥0 is surjective. Since
we assume dim 𝑋 = dim 𝑌 , this actually implies that 𝑑𝑓𝑥0 is an isomorphism. By the Inverse
Function Theorem 3.4, there is an open neighborhood 𝑊 around 𝑥0 such that 𝑑𝑓𝑥 is an iso-
morphism for all 𝑥 ∈ 𝑊 . Hence the set of regular points is open in 𝑋. Consequently, its
complement in 𝑋, the set of critical points 𝐶 is a closed subset in 𝑋. Since 𝑋 is compact,
this implies that 𝐶 is a compact subset. Since 𝑓 is continuous, 𝑓 (𝐶) is a compact subset in
𝑌 . In particular, 𝑓 (𝐶) is closed in 𝑌 .1 The set 𝑓 (𝐶) is the set of critical values of 𝑓 and its
complement is the set 𝑅 of regular values. Hence 𝑅 is open in 𝑌 .
∙ Now let 𝑦 ∈ 𝑌 be a regular value for 𝑓 such that 𝑓 −1 (𝑦) is not empty. Let 𝑥 be a point in
𝑓 −1 (𝑦). Since 𝑦 is a regular value, 𝑥 is a regular point, i.e., 𝑑𝑓𝑥 is surjective and hence an
isomorphism as dim 𝑋 = dim 𝑌 . Hence 𝑓 is a local diffeomorphism at 𝑥. Let 𝑉 ⊂ 𝑋
and 𝑈 ⊂ 𝑌 be open neighborhoods around 𝑥 and 𝑦, respectively, such that 𝐟|𝐕 ∶ 𝑉 → 𝑈
is a diffeomorphism.
∙ Hence all the points in 𝑓 −1 (𝑦) lie in pairwise disjoint open subsets of 𝑋. We conclude
that 𝑓 −1 (𝑦) is discrete, i.e., it is a space in which every subset is open. Since the subset
{𝑦} is closed in 𝑌 , the fiber 𝑓 −1 (𝑦) is a closed subset of 𝑋. Since 𝑋 is compact and
since closed subsets in compact spaces are compact, this implies that 𝑓 −1 (𝑦) is compact
as well. Hence it is a compact and discrete subset of Euclidean space. We have shown
previously that this implies that 𝑓 −1 (𝑦) is a finite set.
∙ Let 𝑓 −1 (𝑦) = {𝑥1 , … , 𝑥𝑛 }. We can thus pick pairwise disjoint open subsets 𝑊1 , … , 𝑊𝑛
in 𝑋 with 𝑥𝑖 ∈ 𝑊𝑖 and open subsets 𝑈1 , … , 𝑈𝑛 in 𝑌 , each containing 𝑦, such that 𝑓 maps
𝑊𝑖 diffeomorphically onto open 𝑈𝑖 . We define the set
Figure 4.2: Every point 𝑦 in 𝑌 has an open neighborhood 𝑈 such that 𝑓 −1 (𝑈 ) consists of disjoint
copies of open subsets 𝑉𝑖 in 𝑋 which are diffeomorphic to 𝑈 . In three-dimensional space this
may be drawn as several old school records stacked on top of each other. One key consequence
is that the fiber over 𝑦 is a finite and discrete set.
∙ If in addition to the assumptions of the theorem all values in 𝑌 are regular, then 𝑋 → 𝑌 is
an example of a covering. In Topology, a continuous map 𝑓 ∶ 𝑋 → 𝑌 is an unramified
covering if every point in 𝑌 has an open neighborhood 𝑈 such that 𝑓 −1 (𝑈 ) is the disjoint
union of open sets 𝑉𝑖 such that 𝑓 maps each 𝑉𝑖 homeomorphically onto 𝑈 . Coverings
play an important role in topology and homotopy theory.
Since 𝑓 −1 (𝑦) is either empty or finite, it makes sense to talk about the number of elements
in the set 𝑓 −1 (𝑦). We denote this number by #𝑓 −1 (𝑦).
Lemma 4.15 (Locally constant fiber) The function 𝑦 → #𝑓 −1 (𝑦) defined on the set
of regular points for 𝑓 is locally constant, i.e., for every regular value 𝑦0 there is an
open neighborhood 𝑈 ⊂ 𝑌 of 𝑦0 such that #𝑓 −1 (𝑦) = #𝑓 −1 (𝑦0 ) for all 𝑦 ∈ 𝑈 .
Proof: We split the proof into considering the cases 𝑓 −1 (𝑦0 ) ≠ ∅ and 𝑓 −1 (𝑦0 ) = ∅ sepa-
rately.
∙ Now assume 𝑓 −1 (𝑦0 ) = ∅. Since 𝑋 is compact, 𝑓 (𝑋) is compact and hence closed in 𝑌 .
Thus 𝑌 − 𝑓 (𝑋) is open in 𝑌 . Since by our assumption 𝑦0 ∈ 𝑌 − 𝑓 (𝑋), there is an open
neighborhood 𝑈 ∶ 𝑌 − 𝑓 (𝑋) around 𝑦0 such that
As an application of regular values we will now study Milnor’s proof of the Fundamental The-
orem of Algebra:
88 4.4. Milnor’s proof of the Fundamental Theorem of Algebra
As a consequence, 𝑃 (𝑧) must have exactly 𝑛 zeroes when we count them with multiplici-
ties.
∙ (Outline of the proof) Let us first summarize the idea of the proof:
∙ We show that the assumption that 𝑓 is not onto leads to a contradiction as fol-
lows:
If you wonder why this does not work for real polynomials, then note:
Remark 4.17 (ℝ is too one-dimensional) If we tried the above strategy for a real
polynomial 𝑃 ∶ ℝ → ℝ, we would replace 𝑃 with a smooth map 𝑓 ∶ 𝕊1 → 𝕊1 . Then 𝑓
is smooth defined on a compact domain and 𝑓 has only finitely many critical values. But
the conclusion that the function 𝑦 → #𝑓 −1 (𝑦) is constant on the set of regular values on
𝕊1 would not work anymore. For, after removing at least two different points from 𝕊1
we get a space which is not connected.
The proof:
We are going to identify the complex numbers ℂ with the points in real plane ℝ2 , but we
keep in mind how that we can multiply and form inverses for points in ℂ. To prove the theorem
we need to extend the map 𝑃 ∶ ℂ → ℂ to a map on a compact space. Recall that 𝕊2 is a compact
subspace of ℝ3 and that we can relate 𝕊2 and the real plane ℝ2 via stereographic projection,
recall Figure 2.12.
Chapter 4. Submersions and regular values 89
The formulae for the projection from the north pole 𝑁 = (0, 0, 1) ∈ 𝕊2 are
1
𝜙−1 2 2
𝑁 ∶ 𝕊 ⧵ {𝑁} → ℝ , (𝑥1 , 𝑥2 , 𝑥3 ) → (𝑥 , 𝑥 ) and
1 − 𝑥3 1 2
1 ( )
𝜙𝑁 ∶ ℝ2 → 𝕊2 ⧵ {𝑁}, (𝑥1 , 𝑥2 ) → 2𝑥1 , 2𝑥2 , |𝑥|2 − 1 .
1 + |𝑥| 2
The formulae for the projection from the south pole 𝑆 = (0, 0, −1) ∈ 𝕊2 :
1
𝜙−1 2 2
𝑆 ∶ 𝕊 ⧵ {𝑆} → ℝ , (𝑥1 , 𝑥2 , 𝑥3 ) → (𝑥 , 𝑥 ) and
1 + 𝑥3 1 2
1 ( )
𝜙𝑆 ∶ ℝ2 → 𝕊2 ⧵ {𝑆}, (𝑥1 , 𝑥2 ) → 2𝑥1 , 2𝑥2 , 1 − |𝑥|2 .
1 + |𝑥| 2
In order to do this we use the projection from the south pole and define a map
𝑄 ∶ ℂ → ℂ by 𝑄 ∶= 𝜙−1
𝑆 ◦𝑓 ◦𝜙𝑆 .
Similarly, we get
𝑧
𝜙−1
𝑆 ◦𝜙𝑁 (𝑧) = = 1∕𝑧̄ for all 𝑧 ∈ ℂ ⧵ {0}.
|𝑧|2
𝑄(𝑧) = 𝜙−1 −1
𝑆 ◦𝜙𝑁 ◦𝑃 ◦𝜙𝑁 ◦𝜙𝑆 (𝑧)
= 𝜙−1
𝑆 ◦𝜙𝑁 (𝑃 (1∕𝑧))
̄
= 𝜙−1
𝑆 ◦𝜙𝑁 (𝑎𝑛 𝑧̄
−𝑛
+ 𝑎𝑛−1 𝑧̄ −(𝑛−1) + ⋯ + 𝑎1 𝑧̄ −1 + 𝑎0 )
( −𝑛 )
= 1∕ 𝑎̄𝑛 𝑧 + 𝑎̄𝑛−1 𝑧−(𝑛−1) + ⋯ + 𝑎̄1 𝑧−1 + 𝑎̄0
( )
= 𝑧𝑛 ∕ 𝑎̄𝑛 + 𝑎̄𝑛−1 𝑧 + ⋯ + 𝑎̄1 𝑧𝑛−1 + 𝑎̄0 𝑧𝑛 .
Note that 𝑧 = 0 is not a zero of 𝐷(𝑧) ∶= 𝑎̄𝑛 + 𝑎̄𝑛−1 𝑧 + ⋯ + 𝑎̄1 𝑧𝑛−1 + 𝑎̄0 𝑧𝑛 , since 𝑎̄𝑛 ≠ 0. Thus
in small a neighborhood of 𝑧 = 0 in ℂ, the denominator in the expression for 𝑄(𝑧) is bounded
below, i.e., there is a small 𝜀 > 0 and a 𝑐 > 0 such that |𝐷(𝑧)| ≥ 𝑐 for all 𝑧 ∈ 𝐵𝜀2 (0) ⊂ ℂ. Since
𝑄(𝑧) = 𝑧𝑛 ∕𝐷(𝑧), this shows that 𝑄 is smooth at 𝑧 = 0.
𝑄(0) = 𝜙−1 −1 −1
𝑆 (𝑓 (𝜙𝑆 (0)) = 𝜙𝑆 (𝑓 (𝑁)) = 𝜙𝑆 (𝑁) = 0
and hence
𝑄(ℎ) − 𝑄(0)
lim
(ℎ
ℎ→0
)
ℎ𝑛 ∕ 𝑎̄𝑛 + 𝑎̄𝑛−1 ℎ + ⋯ + 𝑎̄0 ℎ𝑛 − 0
= lim
ℎ→0 ℎ
ℎ 𝑛−1
= lim
ℎ→0 𝑎̄ 𝑛 + ℎ(𝑎̄ 𝑛−1 + ⋯ + 𝑎̄ 0 ℎ𝑛−1 )
{
1∕𝑎̄𝑛 if 𝑛 = 1
=
0 if 𝑛 ≥ 2.
𝑓 = 𝜙−1
𝑆 ◦𝑄◦𝜙𝑆
∙ Now we use the fact that 𝑃 is a complex polynomial. This will give us a simple way to
calculate its derivative. For the derivative of the real function 𝑃 ∶ ℝ2 → ℝ2 is the same
Chapter 4. Submersions and regular values 91
as the underlying derivative of 𝑃 ∶ ℂ → ℂ as a complex function.2 More importantly,
the derivative of a polynomial has at most finitely many zeros.
Lemma 4.19 (Second claim: Finitely many critical points) The map 𝑓 ∶ 𝕊2 → 𝕊2
has only finitely many critical points, i.e., points 𝑥 where 𝑑𝑓𝑥 fails to be surjective.
Hence 𝑑𝑃𝑧 fails to be an isomorphism, only if it is the zero map, i.e., 𝑑𝑃𝑧 = 0. However, as a
function of 𝑧, 𝑑𝑃𝑧 is a polynomial of degree 𝑛 − 1 and therefore has at most 𝑛 − 1 zeros. Hence
there are only finitely many 𝑧 where 𝑑𝑃𝑧 is not an isomorphism.
∙ Thus the set 𝑅 of regular values for 𝑓 is 𝕊2 with finitely many points removed and is
therefore connected. Since 𝕊2 is compact, we can apply the Stack of Records Theo-
rem 4.14 and see that the function
∙ (Third claim: Infinitely many values) The map 𝑓 has infinitely many different
values.
open and closed, since 𝑓 is continuous. This is impossible, since 𝕊2 is connected, unless
𝑘 = 1, This would imply that 𝑓 was constant. However, 𝑃 is not constant, and 𝜙𝑁 and 𝜙−1
𝑁
are
diffeomorphisms. Thus 𝑓 is not constant, and our initial assumption had to be wrong.
∙ Conclusion: In particular, 𝑓 −1 (𝑆) ≠ ∅ for the south pole 𝑆 on 𝕊2 . Hence there must be
at least one point 𝑥 ∈ 𝕊2 with 𝑓 (𝑥) = 𝑆. Since 𝜙𝑁 is a diffeomorphism and 𝜙𝑁 (0) = 𝑆,
𝑥 must satisfy 𝑃 (𝜙−1
𝑁
(𝑥)) = 0. Hence 𝑧 ∶= 𝜙−1𝑁
(𝑥) ∈ ℂ is a zero of 𝑃 .
Chapter 4. Submersions and regular values 93
4.5 Exercises and more examples
𝑔 ∶ ℝ2 → ℝ, (𝑥, 𝑦) → 𝑥2 − 𝑦2 .
Determine the set of regular values of 𝑔, and determine the set of critical values of 𝑔. Is
𝑔 a submersion?
Exercise 4.3 Recall that a space 𝑌 is called connected if 𝑌 cannot be written as the
union of two nonempty disjoint open subsets; or equivalently, if 𝑌 and ∅ are the only
subsets which are both open and closed in 𝑌 .
(a) Show that if 𝑋 is compact and 𝑌 is connected, then every nontrivial submersion
𝑓 ∶ 𝑋 → 𝑌 is surjective.
(b) Show that there exist no submersions from compact manifolds to ℝ𝑛 for any 𝑛.
Exercise 4.5 Show that the tangent space to 𝑂(𝑛) at the identity matrix 𝐼 is the vector
space of skew symmetric 𝑛 × 𝑛-matrices, i.e., matrices 𝐵 satisfying 𝐵 𝑇 = −𝐵.
Exercise 4.6 Let 𝑀(2) denote the set of all real 2 × 2-matrices.
(a) Show that the determinant function is a submersion on the open submanifold of
nonzero 2 × 2-matrices 𝑀(2) ⧵ {0}.
Exercise 4.7 In this exercise we prove Euler’s identity for homogeneous polynomials:
Let 𝑃 (𝑥1 , … , 𝑥𝑘 ) be a homogeneous polynomial of degree 𝑚 in 𝑘 variables, i.e.,
Exercise 4.8 In this exercise we show that the fibers of homogeneous polynomials form
manifolds.
Let 𝑃 (𝑥1 , … , 𝑥𝑘 ) be a homogeneous polynomial of degree 𝑚 in 𝑘 variables We con-
sider 𝑃 as a map
ℝ𝑘 → ℝ, (𝑥1 , … , 𝑥𝑘 ) → 𝑃 (𝑥1 , … , 𝑥𝑘 ).
(a) Show that 0 is the only critical value of 𝑃 . Conclude that 𝑃 −1 (𝑎) is a 𝑘 − 1-
dimensional submanifold of ℝ𝑘 for all 𝑎 ≠ 0.
Hint: You may want to use Euler’s identity for homogeneous polynomials.
(b) For two positive real numbers 𝑎, 𝑏 > 0, show that 𝑃 −1 (𝑎) is diffeomorphic to
𝑃 −1 (𝑏). Similarly, For two negative real numbers 𝑎, 𝑏 < 0, show that 𝑃 −1 (𝑎) is
diffeomorphic to 𝑃 −1 (𝑏).
Exercise 4.9 The set 𝑆𝐿(𝑛) of 𝑛 × 𝑛- matrices with determinant +1 form a subgroup
of 𝐺𝐿(𝑛). In this exercise we show that 𝑆𝐿(𝑛) is a smooth manifold of dimension 𝑛2 − 1
and compute its tangent space.
where the sum runs over all permutations of the set {1, … , 𝑛} and sgn(𝜎) denotes
the sign of the permutation 𝜎. Then use a previous exercise.
∑
Or you use the formula det 𝐴 = 𝑛𝑖=1 (−1)𝑖+𝑗 𝑎𝑖𝑗 det 𝐴𝑖𝑗 where 𝐴𝑖𝑗 denotes the
(𝑛 − 1) × (𝑛 − 1)-matrix defined by removing the 𝑖th row and 𝑗th column from 𝐴.
(c) Show that the tangent space to 𝑆𝐿(𝑛) at the identity matrix consists of all matrices
with trace equal to zero.
Hint: Recall that we proved: If 𝑍 = 𝑓 −1 (𝑦) ⊆ 𝑋 is a submanifold defined by a
regular value 𝑦 of a smooth map 𝑓 ∶ 𝑋 → 𝑌 , then 𝑇𝑥 (𝑍) = Ker (𝑑𝑓𝑥 ) ⊆ 𝑇𝑥 (𝑋).
You may also want to use Leibniz’ formula for det.
Chapter 4. Submersions and regular values 95
Exercise 4.10 Recall the Hopf fibration 𝜋 that we have seen previously: We consider
𝕊3 as a subset of ℂ2 , i.e., 𝕊3 = {(𝑧0 , 𝑧1 ) ∈ ℂ2 ∶ |𝑧0 |2 + |𝑧1 |2 = 1}, and 𝕊2 as a subset
of ℂ × ℝ, i.e., 𝕊2 = {(𝑧, 𝑥) ∈ ℂ × ℝ ∶ |𝑧|2 + 𝑥2 = 1}. Then the Hopf fibration 𝜋 is the
map 𝕊3 → 𝕊2 given by
( )
𝜋(𝑧0 , 𝑧1 ) = 2𝑧0 𝑧̄ 1 , |𝑧0 |2 − |𝑧1 |2 .
(b) Let 𝑞 ∈ 𝕊3 . Explain why the restriction of 𝑑 𝜋̃𝑞 to 𝑇𝑞 𝕊3 has image contained in
𝑇𝜋(𝑞) 𝕊2 .
(d) Now show that actually each point in 𝕊2 is a regular value for 𝜋.
5. A brief excursion to Lie groups
Very important examples of smooth manifolds are given by Lie groups. We have seen first
examples in a previous section and will now discuss them further taking the group structure
into account. The study of Lie groups and their associated Lie algebras is a fascinating subject
in mathematics and we highly recommend to read more about them in other books. Here we
will only give a brief introduction considering Lie groups as examples of smooth manifolds.
Definition 5.1 (Lie groups) A Lie group is a group 𝐺 which is also a smooth manifold
such that the two maps
𝜇 ∶ 𝐺 × 𝐺 → 𝐺, (𝑔, ℎ) → 𝑔 ⋅ ℎ =∶ 𝑔ℎ
and
𝜄 ∶ 𝐺 → 𝐺, 𝑔 → 𝑔 −1
corresponding to the two group operations of multiplication and taking inverses, respec-
tively, are both smooth.
In fact, we can summarize the condition that 𝜇 and 𝜄 are smooth by requiring that
𝐺 × 𝐺 → 𝐺, (𝑔, ℎ) → 𝑔ℎ−1
is smooth.
𝐿𝑔 and 𝑅𝑔 ∶ 𝐺 → 𝐺,
𝑖𝑔 𝜇
← 𝐺 × 𝐺 ←←→
𝐺 ←←←→ ← 𝐺,
96
Chapter 5. A brief excursion to Lie groups 97
with 𝑖𝑔 (ℎ) = (𝑔, ℎ), it follows that 𝐿𝑔 is smooth. It is actually a diffeomorphism of 𝐺,
because 𝐿𝑔−1 is a smooth inverse for it. Similarly, 𝑅𝑔 ∶ 𝐺 → 𝐺 is a diffeomorphism.
∙ In fact, many of the important properties of Lie groups follow from the fact that we can
systematically map any point to any other via a canonical global diffeomorphism given by
translation by a suitable element in 𝐺. This translation makes the study of Lie groups
much more accessible compared to arbitrary smooth manifolds. In particular, we can
move an open neighborhood around any point in 𝐺 to make it an open neighborhood of
the identity element. Hence, in a Lie group, we basically only need to study neighbor-
hoods of the identity element.
∙ This observation has important consequence for the tangent spaces of Lie groups. In
fact, the translation property of Lie groups implies that the tangent space to a Lie group
𝐺 at any matrix in 𝐺 is isomorphic to tangent space to 𝐺 at the identity element. It is
a vector space with an additional structure, a Lie bracket, and is an example of a Lie
algebra. The classification of Lie algebras and thereby Lie groups is a highlight in the
history of mathematics.
∙ The real numbers ℝ and Euclidean space ℝ𝑛 are Lie groups under addition, because the
coordinates of 𝑥 − 𝑦 are linear and therefore smooth functions of (𝑥, 𝑦).
∙ Any finite group with the discrete topology is a (compact) Lie group.
∙ Suppose 𝐺 is a Lie group and 𝐻 ⊆ 𝐺 is an open subgroup, i.e., a subgroup which is also
an open subspace. Then 𝐻 is a Lie group as well.
∙ The set ℝ∗ = ℝ ⧵ {0} of nonzero real numbers is a 1-dimensional Lie group under multi-
plication. The subset ℝ>0 of positive real numbers is an open subgroup, and is thus itself
a 1-dimensional Lie group — still with multiplication as the group operation.
∙ The set ℂ∗ of nonzero complex numbers is a 2-dimensional Lie group under complex
multiplication.
∙ The unit circle 𝕊1 ⊂ ℂ∗ is a Lie group under the operations induced by multiplication of
complex numbers.
98 5.1. Lie groups - the definition
∙ A finite product of 𝑘 copies of 𝕊1 is a Lie group. We denote it by 𝕋 𝑘 . In particular, the
2-dimensional torus 𝕋 2 = 𝕊1 × 𝕊1 is a Lie group.
We will see more examples below. But before, we introduce the notion of maps between
Lie groups which respect the Lie group structure.
Definition 5.2 (Lie group homomorphisms) If 𝐺 and 𝐻 are Lie groups, a Lie group
homomorphism from 𝐺 to 𝐻 is a smooth map 𝐹 ∶ 𝐺 → 𝐻 that is also a group homo-
morphism. It is called a Lie group isomorphism if it is also a diffeomorphism, which
implies that it has an inverse that is also a Lie group homomorphism. In this case, we
say that 𝐺 and 𝐻 are isomorphic Lie groups.
∙ Considering ℝ as a Lie group under addition, and ℝ∗ as a Lie group under multiplication,
the map
exp ∶ ℝ → ℝ∗ , 𝑡 → 𝑒𝑡
is smooth, and is a Lie group homomorphism, since 𝑒𝑠+𝑡 = 𝑒𝑠 𝑒𝑡 . The image of exp is the
open subgroup ℝ>0 consisting of positive real numbers. In fact, exp ∶ ℝ → ℝ>0 is a Lie
group isomorphism with inverse log ∶ ℝ>0 → ℝ.
∙ The map
𝜖 ∶ ℝ → 𝕊1 , 𝑡 → 𝑒2𝜋𝑖𝑡
Lie group homomorphisms behave much nicer in many respects than arbitrary smooth maps
between manifolds. For example, the rank of the derivative is constant:
Proof: Let 𝑒𝐺 and 𝑒𝐻 denote the identity elements in 𝐺 and 𝐻, respectively. Suppose 𝑔0
is an arbitrary element of 𝐺. We will show that 𝑑𝑓𝑔0 has the same rank as 𝑑𝑓𝑒 . The fact that 𝑓
is a homomorphism means that for all 𝑔 ∈ 𝐺,
or in other words, 𝑓 ◦𝐿𝑔0 = 𝐿𝑓 (𝑔0 ) ◦𝑓 . Taking differentials of both sides at the identity and
using the chain rule yields
Recall that left multiplication by any element of a Lie group is a diffeomorphism, so both
𝑑(𝐿𝑔0 )𝑒𝐺 and 𝑑(𝐿𝑓 (𝑔0 ) )𝑒𝐻 are isomorphisms. Because composing with an isomorphism does
not change the rank of a linear map, it follows that 𝑑𝑓𝑔0 and 𝑑𝑓𝑒𝐺 have the same rank.
Remark 5.4 (Lie group isomorphisms revisited) Using the constant rank theorem,
one can now show that every bijective Lie group homomorphism 𝑓 ∶ 𝐺 → 𝐻 is auto-
matically a Lie group isomorphism. This is yet another point which makes Lie groups
special among all smooth manifolds. Here is a hint why this could be true: We will learn
soon about Sard’s theorem which will tell us that the subspace of regular values of a
smooth map is dense in the codomain. In particular, there is at least one regular value,
say ℎ0 ∈H, for our Lie group homomorphism 𝑓 . Since 𝑓 is bijective, ℎ0 must be in the
image of 𝑓 . Hence, at the unique point 𝑔0 ∈ 𝐺 with 𝑓 (𝑔0 ) = ℎ0 , we know that 𝑑𝑓𝑔0 is
surjective. But then 𝑑𝑓𝑔 must be an isomorphism, since otherwise the Local Submer-
sion Theorem would imply that 𝑓 looked like the canonical submersion and would have
nontrivial kernel. Hence 𝑓 would not be bijective. According to the previous theorem,
this implies that 𝑑𝑓𝑔 is an isomorphism for all 𝑔 ∈ 𝐺. Hence 𝑓 is a bijective local dif-
feomorphism everywhere. Bijective local diffeomorphisms are global diffeomorphisms.
Since the map is a Lie group homomorphism, it is a Lie group isomorphism.
100 5.2. Examples of Lie groups
5.2 Examples of Lie groups
Since det ∶ 𝑀(𝑛) → ℝ is continuous, being a polynomial in the entries of the matrix, and since
{0} is a closed subset of ℝ, det −1 (0) is closed in 𝑀(𝑛).
We claim that 𝐺𝐿(𝑛) is a Lie group: To show this we need to check that multiplication
and taking inverses are smooth operations. Given two matrices 𝐴 and 𝐵 in 𝐺𝐿(𝑛), the entry in
position (𝑖, 𝑗) in 𝐴𝐵 is given by
∑
𝑛
(𝐴𝐵)𝑖𝑗 = 𝑎𝑖𝑘 𝑏𝑘𝑗 .
𝑘=1
is a smooth map.
Recall that the (𝑖, 𝑗)-minor of a matrix 𝐴 is the determinant of the submatrix 𝐴𝑖𝑗 of 𝐴 ob-
tained by deleting the 𝑖th row and the 𝑗th column of 𝐴. By Cramer’s rule from linear algebra,
the (𝑖, 𝑗)-entry of 𝐴−1 is
1
(𝐴−1 )𝑖𝑗 = ⋅ (−1)𝑖+𝑗 ((𝑗, 𝑖)-minor of 𝐴),
det 𝐴
which is a smooth function of the 𝑎𝑖𝑗 ’s provided det 𝐴 ≠ 0, i.e., the map
is smooth because it depends smoothly on the entries of 𝐴. Therefore, the map of taking inverses
𝜄 ∶ 𝐺𝐿(𝑛) → 𝐺𝐿(𝑛)
is also smooth.
Remark 5.5 (𝐺𝐿(𝑛) exists over many bases) In fact, we can matrices with entries
in any ring 𝐾. We denote the corresponding matrix groups by 𝑀(𝑛, 𝐾), 𝐺𝐿(𝑛, 𝐾), ….
Chapter 5. A brief excursion to Lie groups 101
Since 𝐾 = ℝ is the most important case for us, we omit mentioning the base when it is
clear that we work over ℝ.
Another very important case is 𝐾 = ℂ. The complex general linear group 𝐺𝐿(𝑛, ℂ)
is also a Lie group. It is a group under matrix multiplication, and it is an open subman-
ifold of 𝑀(𝑛, ℂ) and thus a 2𝑛2 -dimensional smooth manifold. It is a Lie group, since
matrix products and inverses are smooth functions of the real and imaginary parts of the
matrix entries.
Note that the determinant is a Lie group homomorphism for both ℝ and ℂ:
In terms of geometry, note that 𝑆𝐿(𝑛) consists of all transformations of ℝ𝑛 into itself which
preserve volumes and orientations.We have shown in the exercises that 𝑆𝐿(𝑛) = det −1 (1) is
a smooth manifold of dimension 𝑛2 − 1. Since it is a subset of the Lie group 𝐺𝐿(𝑛) with the
operation inherited from the one of 𝐺𝐿(𝑛), 𝑆𝐿(𝑛) is also a Lie group. In the exercises we cal-
culate the tangent space of 𝑆𝐿(𝑛) at the identity to be the subspace in 𝑀(𝑛) of all matrices with
trace zero.
Recall that the orthogonal group 𝑂(𝑛) is defined as the subset of matrices 𝐴 in 𝑀(𝑛) such
𝐴𝐴𝑇 = 𝐼. This equation implies, in particular, that every 𝐴 ∈ 𝑂(𝑛) is invertible with 𝐴−1 =
𝐴𝑇 . Hence the determinant of an 𝐴 ∈ 𝑂(𝑛) must satisfy (det 𝐴)2 = 1, i.e., det 𝐴 = ±1. Thus,
𝑂(𝑛) splits into two disjoint parts, the subset of matrices with determinant +1 and the subset of
matrices with determinant −1.
If 𝐴 and 𝐵 have determinant −1, then their product 𝐴𝐵 has determinant +1. Hence the
subset of matrices with determinant −1 is not closed under multiplication and therefore not
a subgroup of 𝑂(𝑛). But the other part is a Lie subgroup of 𝑂(𝑛) and is called the Special
Orthogonal Group denoted by 𝑆𝑂(𝑛)
𝑆𝑂(𝑛) = {𝐴 ∈ 𝑂(𝑛) ∶ det 𝐴 = 1} ⊂ 𝑂(𝑛).
The subgroup 𝑆𝑂(𝑛) is a Lie group
Remark 5.6 (Spin groups) There are other important examples of Lie groups which, in
general, do not arise as closed subgroups of 𝐺𝐿(𝑛, ℝ) or 𝐺𝐿(𝑛, ℂ). For example, the 𝑛th
Spin group Spin(𝑛) is the 𝑛-dimensional Lie group which is a double cover of 𝑆𝑂(𝑛).
The latter means that Spin(𝑛) is equipped with a smooth surjective map 𝜋 ∶ Spin(𝑛) →
𝑆𝑂(𝑛) such that each point in 𝑆𝑂(𝑛) has an open neighborhood 𝑈 such that 𝜋 −1 (𝑈 ) is
a disjoint union of open subsets in Spin(𝑛) each of which is mapped diffeomorphically
onto 𝑈 by 𝜋. The map 𝜋 is part of a short exact sequence of groups
Spin groups can be constructed for example via Clifford algebras. However, there are
some exceptional isomorphisms in low dimensions which we can write down:
Spin(1) ≅ 𝑂(1),
Spin(2) ≅ 𝑆𝑂(2),
Spin(3) ≅ 𝑆𝑈 (2),
Spin(4) ≅ 𝑆𝑈 (2) × 𝑆𝑈 (2),
Spin(6) ≅ 𝑆𝑈 (4).
For some important Lie groups, we will now study the topological properties we singled out in
the beginning: compactness, connectedness and path-connectedness.
We showed in the exercises that 𝑂(𝑛) is compact. As a closed subset , 𝑆𝑂(𝑛) is compact
we well. Similarly, 𝑈 (𝑛) and 𝑆𝑈 (𝑛) are compact. The general linear group 𝐺𝐿(𝑛), however, is
not compact as an open subset of 𝑀(𝑛).
Moreover, note that both 𝑆𝑂(𝑛) and its complement are both open and closed in 𝑂(𝑛). They
are the two connected components of 𝑂(𝑛). In particular, there is no continuous path in 𝑂(𝑛)
from a matrix with determinant +1 to one with determinant −1. In fact, there is no such path
in 𝐺𝐿(𝑛):
Theorem 5.7 The real general linear group is not connected Let 𝛾 be a path in
𝐺𝐿(𝑛), i.e. a continuous map
𝛾 ∶ [0, 1] → 𝐺𝐿(𝑛).
Thus also 𝐺𝐿(𝑛) has two connected components, one of which is an open subgroup con-
sisting to all matrices 𝐴 with det 𝐴 > 0. The other one is just an open subset consisting to all
matrices 𝐴 with det 𝐴 < 0.
Theorem 5.8 The complex general linear group is connected However, 𝐺𝐿(𝑛, ℂ)
is path-connected. We see the difference between 𝐺𝐿(𝑛, ℝ) and 𝐺𝐿(𝑛, ℂ) most clearly
for the case 𝑛 = 1: 𝐺𝐿(1, ℝ) = ℝ∗ is not path-connected, since we cannot cross 0;
whereas 𝐺𝐿(1, ℂ) = ℂ∗ is path-connected, since we can just walk around 0 in the plane.
More generally, to show that 𝐺𝐿(𝑛, ℂ) is path-connected, it suffices to show that
there is a path from any matrix 𝐴 ∈ 𝐺𝐿(𝑛, ℂ) to the identity matrix 𝐼 ∈ 𝐺𝐿(𝑛, ℂ).
Therefore, we define first the function
Figure 5.1: Two points in 𝐺𝐿(𝑛, ℝ) may not be connected. In 𝐺𝐿(𝑛, ℂ), however, we can
always find a path between two matrices. This reduces via the determinant function to the fact
that the plane remains path-connected after removing finitely many points.
104 5.4. Lie subgroups
Remark 5.9 The fact that GL(𝑛, ℂ) is connected while 𝐺𝐿(𝑛, ℝ) is not plays a crucial
role for orientations of vector spaces, vector bundles, manifolds etc. For, every complex
vector space, complex vector bundle, complex manifold, etc has a natural orientation.
We will get back to this later.
Proof:
∙ Let 𝑊 ⊂ 𝐺 be any neighborhood of the identity, and let 𝐻 be the subgroup generated by
𝑊 . To simplify notation, if 𝐴 and 𝐵 are subsets of 𝐺, we write
For each positive integer 𝑘, let 𝑊𝑘 denote the set of all elements of 𝐺 that can be expressed
as products of 𝑘 or fewer elements of 𝑊 ∪ 𝑊 −1 . As mentioned above, 𝐻 is the union of
all the sets 𝑊𝑘 as 𝑘 ranges over the positive integers.
Now, 𝑊 −1 is open because it is the image of 𝑊 under the inversion map, which is a
diffeomorphism. Thus, 𝑊1 = 𝑊 ∪ 𝑊 −1 is open, and, for each 𝑘 > 1, we have
In the previous paragraph we talked about subgroups of a Lie group. But we did not discuss how
the subgroup structure relates to the structure as a smooth manifold. Actually, this is a subtle
Chapter 5. A brief excursion to Lie groups 105
and interesting point that illustrates the importance of the distinction between immersions and
embeddings once again. So here is the definition of a Lie subgroup:
∙ the group operations on 𝐻 are smooth, in the sense that the compositions
𝑓 ×𝑓 𝜇
𝑋 × 𝑋 ←←←←←←←←→ ← 𝐺, and
← 𝐺 × 𝐺 ←←→
𝑓 𝜄
𝑋 ←←←→
← 𝐺 ←→
← 𝐺
are smooth.
∙ Note: It is important to note that, in the above definition, we do not require 𝐻 to have
the subspace topology induced by being a subset in 𝐺. Instead we can think of the map 𝑓
to define a topology on 𝐻 in the sense that a subset 𝑈 ⊂ 𝐻 is open if and only if 𝑓 −1 (𝑈 )
is open in 𝑋.
∙ An abstract subgroup simply means a subgroup in the algebraic sense. The group oper-
ations on the subgroup 𝐻 are the restrictions of the multiplication map 𝜇 and the inverse
map 𝜄 from 𝐺 to 𝐻.
∙ If 𝐻 is in fact also a submanifold, then 𝐻 is a Lie subgroup as the following result shows.
Proof: Since 𝐻 is a subgroup, multiplication and taking inverses in 𝐻 are just the re-
strictions of multiplication and taking inverses in 𝐺 and both have image in 𝐻. Since 𝐻 is a
submanifold we can take 𝑋 = 𝐻 in the above definition, the restrictions of smooth maps to 𝐻
are again smooth.
106 5.4. Lie subgroups
Here are some examples:
∙ The subgroups 𝑆𝐿(𝑛) and 𝑂(𝑛) of 𝐺𝐿(𝑛) are both submanifolds, and therefore embedded
Lie subgroups.
𝐺𝐿(𝑛, ℂ) → 𝐺𝐿(2𝑛, ℝ)
) (
𝑥 𝑦
by replacing each entry 𝑧 = 𝑥 + 𝑖𝑦 in 𝐴 ∈ 𝐺𝐿(𝑛, ℂ) by the block :
−𝑦 𝑥
Now let us get back to understanding the definition of a Lie subgroup. The subtle differ-
ences of immersed and embedded subgroups can be illustrated by a familiar example:
Example 5.13 (An immersed but not embedded Lie subgroup) Recall the maps
𝑔∶ ℝ → 𝕊1 ⊂ ℂ, 𝑡 → 𝑒𝑖𝑡 , and
The map 𝐺 is a local diffeomorphism from the plane onto the torus 𝕋 2 .
Given a real number 𝛼, we defined the map 𝛾𝛼 by
For an explanation of why a Lie subgroup is defined in such a complicated way, we refer to
a fact we will only be able to appreciate when we learn more about Lie theory:
Chapter 5. A brief excursion to Lie groups 107
In the previous example, the Lie algebra of 𝕋 2 has ℝ2 as the underlying vector space,
and the one-dimensional Lie subalgebras are all the lines through the origin with addition
as group operation. Such a line is determined by its slope 𝛼. Hence every 𝛼 should
correspond to a Lie subgroup 𝛾𝛼 (ℝ) in 𝕋 2 .
However, if a Lie subgroup had been defined as a subgroup that is also a submani-
fold, then one would have to exclude all the lines with irrational slopes as Lie subgroups
of the torus. In this case it would not be possible to have a one-to-one correspondence
between the connected subgroups of a Lie group and the Lie subalgebras of its Lie al-
gebra. But this correspondence is extremely useful in Lie theory.
The following theorem is a very useful fact which we state here without proof. See for
example [11, Theorem 7.21].
(d) Show that a bijective Lie group homomorphism is a Lie group isomorphism.
108 5.4. Lie subgroups
Exercise 5.3 Show that an open subgroup 𝐻, i.e., a subgroup which is also an open
subset, of a connected Lie group 𝐺 is equal to 𝐺.
Exercise 5.4 Let 𝐺 be a Lie group and let 𝑒 ∈ 𝐺 be the identity element.
(a) Let 𝜇 ∶ 𝐺 × 𝐺 → 𝐺 denote the multiplication map, and let 𝑔, ℎ ∈ 𝐺. Recall that
we denote by 𝐿𝑔 the left translation in 𝐺 by 𝑔, and by 𝑅ℎ the right translation by ℎ.
Using the identification 𝑇(𝑔,ℎ) (𝐺 × 𝐺) = 𝑇𝑔 (𝐺) × 𝑇ℎ (𝐺), show that the differential
of 𝜇 at (𝑔, ℎ)
𝑑𝜇(𝑔,ℎ) ∶ 𝑇𝑔 (𝐺) × 𝑇ℎ (𝐺) → 𝑇𝑔ℎ (𝐺)
is given by
(c) Use the previous point to show that, for any 𝑔 ∈ 𝐺, the derivative of 𝜄 at 𝑔 is given
by
Exercise 5.5 Show that for any Lie group 𝐺, the multiplication map 𝜇 ∶ 𝐺 × 𝐺 → 𝐺
is a submersion.
Exercise 5.6 Show that the differential of the determinant map det ∶ 𝐺𝐿(𝑛, ℝ) → ℝ at
𝐴 ∈ 𝐺𝐿(𝑛, ℝ) is given by
Now we would like to understand what happens when we do not take the preimage of just a
single point, but the preimage of a whole submanifold.
Let 𝑋 and 𝑌 be smooth manifolds and let 𝑓 ∶ 𝑋 → 𝑌 be a smooth map. Assume that
𝑍 ⊂ 𝑌 is a submanifold of 𝑌 . We would like to understand:
commutes.
∙ Since the lower horizontal map in (6.1) is the canonical immersion, the points in 𝑍 ∩ 𝑉
are exactly those on which the coordinate functions 𝑢𝑘+1 , … , 𝑢𝑚 vanish. Hence we have
𝑍 ∩ 𝑉 = {𝑦 ∈ 𝑉 ∶ 𝑢𝑘+1 (𝑦) = ⋯ = 𝑢𝑚 (𝑦) = 0}.
109
110 6.1. Transversality and preimages
( )
∙ Let us write 𝑔 ∶ 𝑉 → ℝ𝑚−𝑘 for the map given by 𝑦 → 𝑢𝑘+1 (𝑦), … , 𝑢𝑚 (𝑦) . The above
relation then reads
𝑍 ∩ 𝑉 = 𝑔 −1 (0).
𝑓 −1 (𝑍 ∩ 𝑉 ) = 𝑓 −1 (𝑍) ∩ 𝑓 −1 (𝑉 )
∙ And we do have a criterion that guarantees that this is the case. For, according to (6.2),
𝑓 −1 (𝑍 ∩ 𝑉 ) is a manifold if 0 is a regular value for the smooth map 𝑔◦𝑓 .
∙ One way to check this is to show that 0 is a regular value of the composite 𝑔◦𝑓 . The
chain rule tells us
𝑑(𝑔◦𝑓 )𝑥 = 𝑑𝑔𝑓 (𝑥) ◦𝑑𝑓𝑥 .
∙ By Lemma 4.8, which was a consequence of the Preimage Theorem 4.6, the kernel of
𝑑𝑔𝑓 (𝑥) is the subspace 𝑇𝑧 (𝑍). Thus 𝑑𝑔𝑓 (𝑥) induces an isomorphism
≅
← ℝ𝑚−𝑘 .
𝑑 𝑔̄𝑓 (𝑥) ∶ 𝑇𝑓 (𝑥) (𝑌 )∕𝑇𝑓 (𝑥) (𝑍) ←←←→
∙ This means that (𝑑𝑔𝑓 (𝑥) )|Im (𝑑𝑓𝑥 ) can only be surjective if Im (𝑑𝑓𝑥 ) generates the quo-
tient space 𝑇𝑓 (𝑥) (𝑌 )∕𝑇𝑓 (𝑥) (𝑍). In other words, (𝑑𝑔𝑓 (𝑥) )|Im (𝑑𝑓𝑥 ) can only be surjective if
Im (𝑑𝑓𝑥 ) and 𝑇𝑓 (𝑥) (𝑍) together span all of 𝑇𝑓 (𝑥) (𝑌 ).
Chapter 6. Transversality 111
∙ We conclude that 𝑔◦𝑓 is a submersion at 𝑥 ∈ 𝑓 −1 (𝑍 ∩ 𝑉 ) if and only if
The above discussion then provides the proof for the following fundamental result:
∙ Note that transversality tells us something about how the images of 𝑓 and 𝑍 meet in
𝑌 . We will give further geometric intuition for transversality soon.
For now, let us look at some simple examples of transversality and non-transversality. We
consider 𝑌 = ℝ2 with the submanifold 𝑍 being the 𝑥-axis. Then
∙ More generally, let 𝑝(𝑡) = 𝑡𝑛 + 𝑎𝑛−1 𝑡𝑛−1 + ⋯ + 𝑎0 be a polynomial with real coefficients.
We can consider 𝑝(𝑡) as a smooth map ℝ → ℝ. The map 𝑓 ∶ ℝ1 → ℝ2 defined by
112 6.1. Transversality and preimages
Figure 6.1: On the left, 𝑓 is transverse to the 𝑥-axis, since Im (𝑑𝑓0 ) and 𝑇0 𝑍 span all of 𝑇0 𝑌 =
ℝ2 . On the right, however, 𝑓 is not transverse to 𝑍, since both Im (𝑑𝑓0 ) and 𝑇0 𝑍 span the same
one-dimensional subspace.
𝑓 (𝑡) = (𝑡, 𝑝(𝑡)) is transverse to the 𝑥-axis 𝑍 if and only if all zeros of 𝑝 are simple, i.e.,
if and only if we can write 𝑝(𝑡) as a product
𝑝(𝑡) = (𝑡 − 𝑟1 ) ⋅ (𝑡 − 𝑟2 ) ⋅ … ⋅ (𝑡 − 𝑟𝑛 )
with 𝑟𝑖 ≠ 𝑟𝑗 if 𝑖 ≠ 𝑗.
The reason is that the derivative of 𝑝 ∶ ℝ → ℝ at 𝑡0 is given by multiplication by 𝑝′ (𝑡0 ),
i.e.,
𝑑𝑝𝑡0 ∶ ℝ1 → ℝ1 , 𝑠 → 𝑝′ (𝑡0 ) ⋅ 𝑠.
This map is nontrivial and hence surjective if and only if 𝑝′ (𝑡0 ) ≠ 0, i.e., if and only if 𝑡0
is not a zero of 𝑝′ . In other words, 𝑑𝑝′ (𝑡0 ) is trivial only if 𝑡0 is a multiple zero of 𝑝.
Now the tangent space of 𝑍 at a point 𝑓 (𝑡0 ) ∈ ℝ2 = 𝑌 is the 𝑥-axis 𝑇𝑓 (𝑡0 ) 𝑍 = ℝ × {0} ⊂
ℝ2 , and the derivative 𝑑𝑓𝑡0 of 𝑓 at 𝑡0 is the map
( )
2 1
𝑑𝑓𝑡0 ∶ ℝ → ℝ , 𝑠 → 𝑠 ⋅ ′ .
𝑝 (𝑡0 )
( ) ( )
1 1
Now we observe that the vectors and ′ are linearly independent in ℝ2 if and
0 𝑝 (𝑡0 )
only if 𝑝′ (𝑡0 ) ≠ 0. Hence 𝑇𝑓 (𝑡0 ) 𝑍 and Im 𝑑𝑓 (𝑡0 ) span 𝑇𝑓 (𝑡0 ) 𝑌 if and only if 𝑝′ (𝑡0 ) ≠ 0.
For transversality of 𝑓 and 𝑍, we need to check that 𝑇𝑓 (𝑡0 ) 𝑍 + Im 𝑑𝑓 (𝑡0 ) = 𝑇𝑓 (𝑡0 ) 𝑌 for all
𝑡 ∈ 𝑓 −1 (𝑍), i.e., for all zeros of 𝑝.
Figure 6.2: We begin to see a pattern in the plane: 𝑓 is transverse to the 𝑥-axis unless the 𝑥-
axis is tangent to the graph of 𝑓 . In the latter case, the tangent spaces do not provide enough
information.
So let 𝑥 ∈ 𝑓 −1 (𝑍) and 𝑧 ∶= 𝑓 (𝑥). As above we can choose suitable local parametrizations
and find an open subset 𝑉 ⊂ 𝑌 and a smooth map 𝑔 ∶ 𝑉 → ℝ𝑚−𝑘 such that 𝑍 ∩ 𝑉 = 𝑔 −1 (0).
Since tangent spaces are determined locally, the tangent spaces 𝑇𝑧 (𝑍 ∩ 𝑉 ) and 𝑇𝑧 (𝑍) are equal
as subspaces of 𝑇𝑧 (𝑌 ). Since 0 is a regular value for 𝑔, we know that the tangent space of 𝑍 at
𝑧 is given by
Now we use that 0 is also a regular value for the composite 𝑔◦𝑓 . Again, since tangent spaces
are determined locally, we get
As a formula:
The derivative 𝑑𝑖𝑥 ∶ 𝑇𝑥 (𝑋) → 𝑇𝑥 (𝑌 ) is the inclusion map of 𝑇𝑥 (𝑋) into 𝑇𝑥 (𝑌 ) as a subspace.
Hence we get 𝑖 −
⋔ 𝑍 if and only if, for every 𝑥 ∈ 𝑋 ∩ 𝑍,
𝑇𝑥 (𝑋) + 𝑇𝑥 (𝑍) = 𝑇𝑥 (𝑌 ).
∙ Notice that this equation is symmetric in 𝑋 and 𝑍. When it holds, we shall say that the
two submanifolds 𝑋 and 𝑍 are transverse, and write 𝑋 −⋔ 𝑍.
∙ However, we will see later a way to make self-intersections interesting. This will be
made possible by Thom’s transversality theorem and invariance under homotopy. More
on this later in the chapter on intersection theory.
∙ The additivity of codimensions follows from the codimension formula of the previous
theorem:
Figure 6.3: The intersection of a sphere and a circle is transverse unless the there is only one
intersection point.
Remark 6.5 (Intersect as little as possible) We have just learned that two manifolds
intersect transversally if their tangent spaces together span the whole ambient space. A
different way to think of transversality is: Two manifolds intersect transversally if they
intersect as little as possible at every point. And we measure the degree of intersection
in terms of tangent spaces: If two submanifolds intersect, then they intersect transver-
sally if the intersection of their tangent spaces in the ambient tangent space is minimal.
∙ Note that the converse of the theorem is not true. We have seen a simple example last
time: the submanifolds 𝑋 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑦 = 𝑥2 } and 𝑍 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑦 = 0}
do not intersect transversally at 0 in 𝑌 = ℝ2 , but their intersection 𝑋 ∩ 𝑍 = {0} is a
zero-dimensional manifold. However, there do, of course, exist intersections which are
not transversal and where the intersection is not a manifold. We will look at an example
below!
∙ It is useful to note that any smooth map 𝑓 ∶ 𝑋 → 𝑌 whose image does not meet a
submanifold 𝑍 of 𝑌 , i.e., 𝑓 −1 (𝑍) = ∅, is transverse to 𝑍 for trivial reasons. For in this
case there is no condition to be satisfied. In particular, two submanifolds which do not
intersect at all are transverse.
Remark 6.6 (The ambient space matters!) Transversality of 𝑋 and 𝑍 also depends
on the ambient space 𝑌 . For example, the two coordinate axes intersect transversally
in ℝ2 , but not when considered to be submanifolds of ℝ3 . In general, if the dimensions
of 𝑋 and 𝑍 do not add up to at least the dimension of 𝑌 , then they can only intersect
transversally by not intersecting at all. For example, if 𝑋 and 𝑍 are curves in ℝ3 ,
then 𝑋 − ⋔ 𝑌 if and only if 𝑋 ∩ 𝑌 = ∅.
By applying the formula we got for the tangent spaces of 𝑓 −1 (𝑍) to 𝑓 being the inclusion
map 𝑋 ⊂ 𝑌 we get the following useful result:
116 6.2. Transverse intersections
Figure 6.4: The intersection of two circles in the plane is transverse unless the there is exactly
one intersection point.
6.2.2 Examples
𝑓 ∶ ℝ4 → ℝ,
(𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ) → 𝑥1 + 𝑥22 + 𝑥33 + 𝑥44 .
is a manifold in ℝ4 .
Now we show 𝑍 and 𝕊3 intersect transversally in ℝ4 where 𝕊3 ⊂ ℝ4 denotes the
three-dimensional sphere. This will imply that 𝑍 ∩ 𝕊3 is a smooth manifold.
To do this we need to check that 𝑇𝑧 (𝑍) + 𝑇𝑧 (𝕊3 ) = 𝑇𝑧 (ℝ4 ) = ℝ4 for all 𝑧 ∈ 𝑍 ∩ 𝕊3 .
Since 𝑇𝑧 (𝑍) and 𝑇𝑧 (𝕊3 ) are both three-dimensional subspaces of ℝ4 , it suffices to show
that, for every 𝑧 ∈ 𝑍 ∩ 𝕊3 , there is at least one vector 𝐯 in 𝑇𝑧 (𝑍) which is not contained
in 𝑇𝑧 (𝕊3 ).
The tangent space to 𝑍 in a point 𝑧 ∈ 𝑍 is the subspace in ℝ4 given by the kernel of
the derivative 𝑑𝑓𝑧 which in the standard bases is given by the 1 × 4-matrix
( )
𝑑𝑓𝑧 = 1 2𝑥2 3𝑥23 4𝑥34 .
Chapter 6. Transversality 117
⎛12𝑥1 ⎞
⎜ 6𝑥 ⎟
𝐯 ∶= ⎜ 2 ⎟
⎜ 4𝑥3 ⎟
⎝ 2𝑥4 ⎠
⎛12𝑥1 ⎞
( ) ⎜ 6𝑥 ⎟
𝑑𝑓𝑧 (𝐯) = 1 2𝑥2 3𝑥23 4𝑥34 ⋅ ⎜ 2 ⎟
⎜ 4𝑥3 ⎟
⎝ 3𝑥4 ⎠
= 12𝑥1 + 12𝑥22 + 12𝑥33 + 12𝑥34
= 12𝑓 (𝑧)
= 0.
But 𝐯 is not an element in 𝑇𝑧 (𝕊3 ). For, recall that 𝑇𝑧 (𝕊3 ) is the subspace in ℝ4 which
is orthogonal to the vector 𝑧, i.e.,
𝑇𝑧 (𝕊3 ) = {𝑤 ∈ ℝ4 ∶ 𝑧 ⟂ 𝑤 = 0}.
𝑧 ⟂ 𝑤 ⇐⇒ 𝑧 ⋅ 𝑤 = 0.
For 𝐯 we calculate
⎛12𝑥1 ⎞
( ) ⎜ 6𝑥2 ⎟ 2 2 2 2
𝑧 ⋅ 𝐯 = 𝑥1 𝑥2 𝑥3 𝑥4 ⋅ ⎜ ⎟ = 12𝑥1 + 6𝑥2 + 4𝑥3 + 3𝑥4 > 0.
4𝑥
⎜ 3⎟
⎝ 3𝑥4 ⎠
𝑋 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥2 + 𝑦2 − 𝑧2 = 1}
𝑍𝑎 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥2 + 𝑦2 + 𝑧2 = 𝑎}
with 𝑎 > 0. See Figure 6.5. We would like to understand for which 𝑎 do these two
submanifolds intersect transversally in 𝑌 .
Therefore, we need to determine the tangent spaces of 𝑋 and 𝑍𝑎 at points where
118 6.2. Transverse intersections
𝑓 ∶ ℝ3 → ℝ, (𝑥, 𝑦, 𝑧) → 𝑥2 + 𝑦2 − 𝑧2 .
Expressed as a matrix in the standard bases the derivative 𝑑𝑓𝑝 at a point 𝑝 = (𝑥𝑝 , 𝑦𝑝 , 𝑧𝑝 )
has the form
This map is surjective for all 𝑝 ∈ ℝ3 ⧵ {0}. Hence 1 is a regular value of 𝑓 and the
tangent space to 𝑋 at 𝑝 is the kernel of 𝑑𝑓𝑝 . For 𝑝 = (𝑥𝑝 , 𝑦𝑝 , 𝑧𝑝 ) ∈ 𝑋 with 𝑧𝑝 ≠ 0, this
is
𝑔 ∶ ℝ3 → ℝ, (𝑥, 𝑦, 𝑧) → 𝑥2 + 𝑦2 + 𝑧2 .
Expressed as a matrix in the standard bases the derivative 𝑑𝑔𝑝 at a point 𝑝 = (𝑥𝑝 , 𝑦𝑝 , 𝑧𝑝 )
has the form
This map is surjective for all 𝑝 ∈ ℝ3 ⧵ {0}. Hence 𝑎 > 0 is a regular value of 𝑔 and the
tangent space to 𝑍 at 𝑝 is the kernel of 𝑑𝑔𝑝 . For 𝑝 = (𝑥𝑝 , 𝑦𝑝 , 𝑧𝑝 ) ∈ 𝑍𝑎 with 𝑧𝑝 ≠ 0, this
is
𝑥2 + 𝑦2 − 𝑧2 − 1 = 0 = 𝑥2 + 𝑦2 + 𝑧2 − 𝑎.
2𝑧2 = 𝑎 − 1. (6.4)
∙ if 𝑎 = 1, then we see 𝑇𝑝 (𝑋) = 𝑇𝑝 (𝑍1 ) from the above descriptions of the tangent
spaces, since 𝑧𝑝 = 0 for all points in 𝑋 ∩ 𝑍1 . In particular, the tangent spaces
span the same plane in ℝ3 , and not all of ℝ3 , at every 𝑝 ∈ 𝑋 ∩ 𝑍1 . Thus the
intersection is not transversal.
Figure 6.5: The intersection is transverse if 𝑎 > 1, and not transverse if 𝑎 = 1. In both cases,
however, the intersection is a smooth manifold.
Here is an example of an intersection which is not transverse and where the intersection
is not a manifold:
120 6.2. Transverse intersections
𝑍 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥 = 1}
𝑋 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥2 + 𝑦2 − 𝑧2 = 1}.
𝑋 ∩ 𝑍 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥 = 1, 𝑦 = ±𝑧}.
We have seen in Section 2.3.5 that a space consisting of two lines crossing each other
is not a manifold. The intersection point, i.e., the point 𝑝 = (1, 0, 0), does not have a
neighborhood in 𝑋 ∩ 𝑍 which is diffeomorphic to an open subset in Euclidean space.
Thus 𝑋 ∩ 𝑍 is not a manifold. See Figure 6.6.
As a reality check, let us look at the tangent spaces to 𝑋 and 𝑍 at 𝑝: Since 𝑍 is
a parallel translate of a vector subspace of ℝ3 , we see that 𝑇𝑝 (𝑍) is the 𝑦𝑧-plane in ℝ3
(all points with 𝑥 = 0). The tangent space to 𝑋 was calculated in the previous example
(and in an exercise). At 𝑝 = (1, 0, 0), 𝑇𝑝 (𝑋) is the vector subspace in ℝ3 spanned by the
vectors (0, 1, 0) and (0, 0, 1). In other words, 𝑇𝑝 (𝑋) is the 𝑥𝑦-plane in ℝ3 . Thus 𝑇𝑝 (𝑍)
and 𝑇𝑝 (𝑋) do not span 𝑇𝑝 (𝑌 ) = ℝ3 . The problem here is that 𝑍 is’the tangent plane to
𝑋 at 𝑝.
Figure 6.6: The intersection is not transverse and 𝑋 ∩ 𝑍 is not a manifold, since we get two
axes that cross each other.
6.3.1 Transversality
from previous exercise sets. Is the set Im (𝑓 ), the image of 𝑓 in ℝ2 , a manifold? Is the
set (𝑔◦𝑓 )−1 (1) a manifold?
𝑓 𝑔
Exercise 6.3 Let 𝑋 ←←←→ ← 𝑍 be a sequence of smooth maps between manifolds, and
← 𝑌 ←←→
let 𝑊 ⊂ 𝑍 be a submanifold. Assume that 𝑔 is transversal to 𝑊 . Show:
𝑓−
⋔ 𝑔 −1 (𝑊 ) if and only if (𝑔◦𝑓 ) −
⋔𝑊.
Exercise 6.4 Let 𝑉 be a vector space, and let Δ be the diagonal of 𝑉 × 𝑉 . For a linear
map 𝐴 ∶ 𝑉 → 𝑉 , consider the graph Γ(𝐴) = {(𝑣, 𝐴𝑣) ∶ 𝑣 ∈ 𝑉 }. Show that Γ(𝐴) − ⋔ Δ if
and only if +1 is not an eigenvalue of 𝐴.
Exercise 6.5 Let 𝑓 ∶ 𝑋 → 𝑋 be a map, and let 𝑥 be a fixed point of 𝑓 , i.e., 𝑓 (𝑥) = 𝑥. If
+1 is not an eigenvalue of 𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝑋) → 𝑇𝑥 (𝑋), then 𝑥 is called a Lefschetz fixed point
of 𝑓 . The map 𝑓 is called a Lefschetz map if all its fixed points are Lefschetz. Prove that
if 𝑋 is compact and 𝑓 is Lefschetz, then 𝑓 has only finitely many fixed points.
Hint: Show that the intersection of the graph of 𝑓 and the diagonal of 𝑋 is a 0-
Chapter 6. Transversality 123
dimensional submanifold of 𝑋 × 𝑋.
Prove that 𝑆𝑘7 is a 7-dimensional manifold by showing that the intersection is transver-
sal in ℂ5 ⧵ {0}.
Hint:
( At some point you may want ) to show that, at a point 𝑧 = (𝑧1 , … , 𝑧5 ), the vector
𝑤 ≔ 𝑚2 𝑧1 , 𝑚2 𝑧2 , 𝑚2 𝑧3 , 𝑚3 𝑧4 , 6𝑘−1
𝑚
𝑧5 , with 𝑚 ≔ 2 ⋅ 3 ⋅ (6𝑘 − 1), lies in one of the tangent
spaces but not in the other.
7. Sard’s theorem and Morse functions
In the previous sections we have seen how useful regular values are. This motivates to ask:
Question Given a smooth map 𝑓 ∶ 𝑋 → 𝑌 . How many regular values are there in
𝑌?
Recall that a 𝑦 ∈ 𝑌 which is not a regular value for 𝑓 is called a critical value. Hence we
may ask the equivalent question: How many critical values are there?
∙ In Milnor’s proof of the Fundamental Theorem of Algebra, we showed that the smooth
map in question had only finitely many critical values. Actually, we showed that the set
of critical points of the map 𝕊2 → 𝕊2 we defined using the given polynomial was finite.
∙ We might hope that set of critical values is always finite.
∙ However, finiteness is too good to be true when we allow 𝑋 and 𝑌 to be arbitrary smooth
manifolds.
So what is the correct analog of finiteness for our situation? Here is our first answer:
Theorem 7.1 (Sard’s Theorem) Let 𝑋 and 𝑌 be smooth manifolds and let 𝑓 ∶ 𝑋 → 𝑌
be a smooth map. Then the set of regular values of 𝑓 is a dense subset of 𝑌 , i.e., every
open subset of 𝑌 contains a regular value.
∙ This theorem is a key result in differential topology. We will see later that it will be
crucial for Thom’s Transversality Theorem which will make intersection theory work.
We will now reformulate and simplify the theorem. To do this and for the proof we are
going to use the following terminology:
Definition 7.2 (The interior of a set) Let 𝑋 be a topological space, and 𝑆 a subset
of 𝑋. Then the interior of 𝑆, denoted int(𝑆), is the union of all open subsets of 𝑋
contained in 𝑆. By definition, the interior of any 𝑆 is an open subset of 𝑋. In fact, it
is the largest open subset of 𝑋 which is contained in 𝑆. See Figure 7.1.
If 𝑆 ⊂ ℝ𝑁 then int(𝑆) is the set of all points 𝑠 ∈ 𝑆 such that there is a small open
124
Chapter 7. Sard’s theorem and Morse functions 125
ball centred at 𝑥 which is contained in 𝑆.
If 𝑈 is an open subset of 𝑋 then int(𝑈 ) = 𝑈 . In particular, if 𝑋 ⊂ ℝ𝑁 is open then
int(𝑋) = 𝑋. In general, however, int(𝑆) is a proper subset of 𝑋.
Figure 7.1: The interior of a subset 𝑆 in the topological space 𝑋 is the union of all open subsets
of 𝑋 which are contained in 𝑆.
∙ Equivalently, the countable union of compact subsets with empty interior in ℝ𝑝 is a subset
with empty interior.
Theorem 7.3 (Euclidean case) Let 𝑓 ∶ ℝ𝑛 → ℝ𝑝 be a smooth map. Then the set of
regular values of 𝑓 is a dense subset of ℝ𝑝 . Equivalently, the set of critical values of 𝑓
is a set with empty interior.
∙ Note that it follows from the Local Submersion Theorem 4.2 that 𝐶𝑓 is a closed subset
of 𝑋.
1
This means that ℝ𝑝 and smooth manifolds in general are Baire spaces.
126 7.1. The Theorem of Brown and Sard
Proof that Theorem 7.3 implies Sard’s Theorem 7.1: First assume 𝑌 = ℝ𝑝 . Let 𝑥 ∈ 𝑋
be a point and let 𝜙 ∶ ℝ𝑛 → 𝑋 be a local parametrization.2 We set 𝑊 ∶= 𝜙(ℝ𝑛 ) and 𝐾 =
𝜙(𝔹̄ 𝑛1 (0)), where 𝔹̄ 𝑛1 (0) is the closed unit ball in ℝ𝑛 . In particular, 𝐾 is a compact subspace in
𝑋. By Theorem 7.3, the set of critical values of 𝑓 ◦𝜙
𝐷𝑓 ◦𝜙 = 𝑓 (𝐶𝑓 ∩ 𝑊 )
has empty interior. Thus, since 𝐶𝑓 is closed and hence 𝐶𝑓 ∩ 𝐾 is compact, 𝑓 (𝐶𝑓 ∩ 𝐾) is
a compact subset of ℝ𝑝 with empty interior. Since 𝑋 can be covered by countably many
neighborhoods of the form 𝐾, 𝐷𝑓 = 𝑓 (𝐶𝑓 ) is a countable union union of compact subsets
with empty interior.
Second, for an arbitrary smooth 𝑝-dimensional manifold 𝑌 , we can find countably many
local parametrizations 𝜓𝑖 of 𝑌 such that the union of the compact sets 𝐾𝑖 ∶= 𝜓𝑖 (𝔹̄ 𝑝1 (0)) covers
𝑌 . By the first case, the intersection of the 𝐷𝑓 with any of these sets 𝐾𝑖 , is a compact subset
with empty interior. Hence 𝐷𝑓 is a countable union of compact subsets with empty interior
and has itself empty interior.
Proof of Theorem 7.3: The proof will proceed in two steps: First we prove the result for
function ℝ𝑛 → ℝ. Second we use this to reduce the case ℝ𝑝 to ℝ𝑝−1 and will conclude by
induction.
∙ The case 𝑝 = 1:
If 𝑛 = 0, there is nothing to prove. Now assume that the assertion is true for 𝑛 − 1. Let
𝐶𝑖 ⊂ ℝ𝑛 be the closed set of points where all partial derivatives of 𝑓 of order ≤ 𝑖 vanish.
Then 𝐶𝑓 = 𝐶1 and 𝐶1 ⊃ 𝐶2 ⊃ 𝐶3 ⊃ ⋯. Hence
𝐶𝑓 = (𝐶1 − 𝐶2 ) ∪ ⋯ ∪ (𝐶𝑛−1 − 𝐶𝑛 ) ∪ 𝐶𝑛 ,
and thus
𝐷𝑓 = 𝑓 (𝐶1 − 𝐶2 ) ∪ ⋯ ∪ 𝑓 (𝐶𝑛−1 − 𝐶𝑛 ) ∪ 𝑓 (𝐶𝑛 ).
Hence the theorem follows for 𝑝 if we can show that each of the sets 𝑓 (𝐶𝑖 − 𝐶𝑖+1 ) for all
𝑖 ≥ 1 and 𝑓 (𝐶𝑖 ) for 𝑖 ≥ 𝑛 are countable union of closed subsets with empty interior. This will
be shown in the following two lemmas:
Lemma 7.4 The set 𝑓 (𝐶𝑖 − 𝐶𝑖+1 ) is a countable union of closed subsets with empty
interior for all 𝑖 ≥ 1.
Proof: We claim the following: For each 𝑢 ∈ 𝐶𝑖 − 𝐶𝑖+1 there is a compact neighborhood 𝐾
disjoint with 𝐶𝑖+1 and an (𝑛 − 1)-dimensional submanifold 𝑍 ⊂ ℝ𝑛 such that 𝐶𝑖 ∩ 𝐾 ⊂ 𝑍. Then
every point of 𝐶𝑖 ∩ 𝐾 is critical for 𝑓|𝑍 , since it is critical for 𝑓 . By our induction hypothesis,
𝑓 (𝐶𝑖 ∩ 𝐾) is then a closed subset in ℝ without interior points. Since 𝐶𝑖 − 𝐶𝑖+1 can be covered
by countably many arbitrarily small compact neighborhoods 𝐾, this will prove the lemma.
2
Note that we can choose the domain of 𝜙 to be all of ℝ𝑛 , by stretching an open ball 𝔹𝑛𝑟 (0) to all of ℝ𝑛 via a
diffeomorphism.
Chapter 7. Sard’s theorem and Morse functions 127
Now we prove the claim. Since 𝑢 ∈ 𝐶𝑖 − 𝐶𝑖+1 , there is some 𝑖-th order partial derivative of
𝑓 whose first order partial derivatives do not all vanish at 𝑢. We denote this partial derivative
by 𝑔. Then 𝑢 is a regular point for 𝑔. By definition of 𝐶𝑖 , we know 𝑔(𝑢) = 0. By the Local
Submersion Theorem, we can then find an open neighborhood 𝑈 of 𝑢 such that 0 is a regular
value for 𝑔|𝑈 . Hence, by the Preimage Theorem, 𝑔 −1 (0)∩𝑈 is an (𝑛−1)-dimensional subman-
ifold in 𝑈 . Moreover, since 𝐶𝑖 ⊆ 𝑔 −1 (0) by definition of 𝑔 and 𝐶𝑖 , we can set 𝑍 ∶= 𝑔 −1 (0) ∩ 𝑈
and choose a sufficiently small compact neighborhood 𝐾 of 𝑢 in 𝑈 . By definition of 𝑔, we also
know that 𝐾 is disjoint with 𝐶𝑖+1 .
Lemma 7.5 The set 𝑓 (𝐶𝑖 ) is a countable union of closed subsets with empty interior
for 𝑖 ≥ 𝑛.
1 𝑖 1
𝑓 (𝑥 + 𝑢) = 𝑓 (𝑥) + 𝜕𝑢 𝑓 (𝑥) + ⋯ + 𝜕𝑢 𝑓 (𝑥) + 𝜕 𝑖+1 𝑓 (𝑥 + 𝜆𝑢)
𝑖! (𝑖 + 1)! 𝑢
for any two points 𝑥 and 𝑢 in ℝ𝑛 , where 0 ≤ 𝜆 ≤ 1 is a real number and 𝜕𝑢 is the differential
operator
𝜕𝑢 = 𝑢1 𝜕∕𝜕𝑥1 + ⋯ + 𝑢𝑛 𝜕∕𝜕𝑥𝑛 .
Thus, if 𝑥 ∈ 𝐶𝑖 , then by definition of 𝐶𝑖 :
1
𝑓 (𝑥 + 𝑢) − 𝑓 (𝑥) = 𝜕 𝑖+1 𝑓 (𝑥 + 𝜆𝑢).
(𝑖 + 1)! 𝑢
If in addition 𝑥 and 𝑦 ∶= 𝑥 + 𝑢 lie in a convex set 𝐾, then 𝑥 + 𝜆𝑢 is also in 𝐾, and we get the
inequality
|𝑓 (𝑦) − 𝑓 (𝑥)|max ≤ 𝑐|𝑦 − 𝑥|𝑖+1
max
Now we let 𝐾 be the unit cube in ℝ𝑛 and consider the subdivision of 𝐾 into 𝑘𝑛 subcubes
with sides of length 1∕𝑘. Let 𝐾 ′ be one of these subcubes and suppose 𝑥 ∈ 𝐶𝑖 ∩ 𝐾 and 𝑦 ∈ 𝐾 ′ .
Then
|𝑦 − 𝑥|max ≤ 1∕𝑘.
This implies that 𝑓 (𝐶𝑖 ∩ 𝐾 ′ ) is contained in an interval of length 𝑐∕𝑘𝑖+1 .
𝑘𝑛 𝑐∕𝑘𝑖+1 ≤ 𝑐∕𝑘
where we use that 𝑖 ≥ 𝑛 by our assumption. Since 𝑘 is any positive integer, this length can be
arbitrarily small. Hence the set 𝑓 (𝐶𝑖 ∩ 𝐾) must have empty interior. Finally, ℝ𝑛 and therefore
𝐶𝑖 is contained in a countable union of unit cubes 𝐾. This proves the lemma.
∙ The case 𝑝 ≥ 2:
If 𝑂 contains points which are not in 𝑓 (ℝ𝑛 ), then these points are regular values for 𝑓 , and
we are done.
Hence assume 𝑂 ⊆ 𝑓 (ℝ𝑛 ). Let 𝜋 ∶ ℝ𝑝 → ℝ𝑝−1 be the projection onto the first 𝑝 − 1
coordinates. Then 𝜋(𝑂) is open in ℝ𝑝−1 , since 𝜋 is a submersion and hence an open map.3 By
our induction hypothesis, the map 𝜋◦𝑓 ∶ ℝ𝑛 → ℝ𝑝−1 has a regular value 𝑦′ ∈ 𝜋(𝑂). In other
words, 𝑓 is transverse to the line 𝑌 ′ = 𝜋 −1 (𝑦′ ) in ℝ𝑝 .
Now let 𝑓 ′ ∶ 𝑋 ′ → 𝑌 ′ be the induced map. Since 𝑂 ⊆ 𝑓 (ℝ𝑛 ), the open set 𝑂 meets 𝑌 ′ ,
i.e., 𝑌 ′ ∩ 𝑂 ≠ ∅. Since 𝑌 ′ is diffeomorphic to ℝ, 𝑓 ′ has a regular value 𝑦′′ ∈ 𝑌 ′ ∩ 𝑂 by the
case 𝑝 = 1.
∙ 𝑓 is transverse to 𝑌 ′ in ℝ𝑝 , and
By the chain rule and what we learned about transversality, this implies that 𝑦′′ ∈ 𝑌 ′ ∩ 𝑂
is a regular value for 𝑓 . This finishes the proof of Theorem 7.3.
To conclude this section, we remark that Sard’s Theorem is often formulated as follows:
Before we recall some basic measure theory, we observe how the different versions of the
theorem are related:
∙ This result is stronger than the previous version, and we invite the reader to investigate
the relationship between measure zero sets and sets with empty interior. However, the
version we proved suffices for our purposes. For, we will apply the theorem for knowing
that any small open neighborhood of a point contains a regular value.
3
We proved this fact in the exercises.
Chapter 7. Sard’s theorem and Morse functions 129
∑
∞
vol (𝑆𝑖 ) < 𝜀.
𝑖=1
Then in a manifold 𝑋, an arbitrary subset 𝐶 ⊂ 𝑋 has measure zero if, for every local
parametrization 𝜙 of 𝑋, the preimage 𝜙−1 (𝐶) has measure zero in Euclidean space.
Note that measures and volumes depend on the ambient space!
∙ Hence for measure theorists, almost every real number is irrational. This example il-
lustrates that something that happens almost never, can still happen often enough to be
noticed.
∙ Now, every nonempty open subset of ℝ𝑛 contains some nonempty rectangular solid.
Thus, no nonempty open subset of ℝ𝑛 has measure zero.
∙ Hence, no nonempty open subset of a manifold 𝑌 has measure zero. In other words, no
set of measure zero in a manifold 𝑌 can contain a nonempty open subset of 𝑌 .
We will now study a very interesting application of Sard’s Theorem. We understand the
local behaviour of smooth maps at regular points by the Local Submersion Theorem 4.2. But
what about the local behaviour at critical points? In fact, it is often at critical points that
the interesting stuff happens. For example, it is often at critical points that the topology of a
manifold can change.
A standard example is given by the height function on a torus, see Figure 7.2:
Change of homotopy types: We observe in this example that the homotopy type of the
fiber can change at critical points. You may have noticed that have not defined what the term
‘homotopy type’ means. Roughly speaking it is equivalence class of a space under the relation
130 7.2. Morse Functions
Figure 7.2: The critical points of the height function yield a decomposition of the torus.
of homotopy equivalence (which we have not defined either). For example, all contractible
spaces have the same homotopy type (with some assumptions in place). Anyway, we are not
going to fix this lack of precise definitions now. Instead we look at what happens with the
fibers on the torus in this concrete example: For 𝑠 < 0, the preimage ℎ−1 ([0, 𝑠)) under the
height functions is empty. The first critical value is 𝑠 = 0. For sufficiently small 𝑠 > 0, the
preimage ℎ−1 ([0, 𝑠)) has changed and looks like a two-dimensional disk, just a bit punched
in at the center. In particular, the preimage is contractible in this range. But when we pass the
next critical value, the light green dot on the vertical number line, another significant change
happens. For after it, the preimage ℎ−1 ([0, 𝑠)) looks like a bent cylinder. On this cylinder, there
are loops which are not homotopic to a constant map, e.g., the dark green circle. Thus, above
the critical value, the preimage ℎ−1 ([0, 𝑠)) is not contractible anymore. Hence the homotopy
type of the preimage has changed at a critical value. The next change of the homotopy type
happens when we pass the next critical value. The preimage ℎ−1 ([0, 𝑠]) (of the closed interval
[0, 𝑠]) becomes homeomorphic to a compact surface of genus one, i.e., it has one hole, with a
circle as boundary. Another list of things we need to and are going to define towards the end
of the semester. Finally, after passing the last critical value, the preimage is the whole torus.
The torus has the homotopy type of a compact surface with genus one, i.e., still one hole,
but without boundary. In total, we see that a lot of interesting stuff happens at the critical
values.
So let us stick to smooth functions, i.e., smooth maps to ℝ. We want to understand how
critical points look like locally.
∑𝑘
𝜕𝑓 1 ∑ 𝜕2𝑓
𝑘
𝑓 (𝑥) = 𝑓 (𝑐) + (𝑐) ⋅ (𝑥𝑖 − 𝑐𝑖 ) + (𝑐) ⋅ (𝑥𝑖 − 𝑐𝑖 )(𝑥𝑗 − 𝑐𝑗 ) + 𝑜(|𝑥|3 ).
𝑖=1
𝜕𝑥 𝑖 2 𝑖,𝑗=1
𝜕𝑥𝑖 𝜕𝑥 𝑗
Chapter 7. Sard’s theorem and Morse functions 131
If 𝑐 is a critical point, then by definition
( )
𝜕𝑓 𝜕𝑓
𝑑𝑓𝑐 = (𝑐), … , (𝑐) =0
𝜕𝑥1 𝜕𝑥𝑘
(otherwise 𝑑𝑓𝑐 was surjective as a linear map ℝ𝑘 → ℝ). Hence the best possible approximation
for the local behavior of 𝑓 at 𝑐 is the Hessian matrix of the second partial derivatives. Critical
points where the Hessian matrix is invertible is the best we can hope for.
Definition 7.7 (Non-degenerate critical points and Morse functions) For a smooth
function 𝑓(∶ ℝ𝑘 →)ℝ, a point 𝑐 ∈ ℝ𝑘 where 𝑑𝑓𝑐 vanishes, but the Hessian matrix
2𝑓
𝐻(𝑓 )𝑐 = 𝜕𝑥𝜕 𝜕𝑥 (𝑐) is invertible at 𝑐, is called a non-degenerate critical point.
𝑖 𝑗
A smooth function 𝑓 ∶ ℝ𝑘 → ℝ for which all critical points are non-degenerate is
called a Morse function.
Non-degenerate critical points are much easier to study than arbitrary critical points, since
they are isolated from the other critical points, i.e., there is an open neighborhood which does
not contain any other critical points. Hence Morse functions are easier to understand than
arbitrary smooth functions.
Moreover, the matrix representing the derivative 𝑑𝑔𝑥 is the Hessian of 𝑓 at 𝑥. So if 𝑥 is non-
degenerate, then not only is 𝑔(𝑥) = 0, but 𝑔 maps a neighborhood of 𝑥 diffeomorphically
onto a neighborhood of 0 as well. In particular, 𝑔 is injective in that neighborhood of 𝑥. Thus
𝑔 can be zero at no other points in this neighborhood, and 𝑓 has no other critical point in this
neighborhood.
Another reason to be interested in Morse functions is the fact that there are a lot of them.
𝑓𝑎 (𝑥) = 𝑓 (𝑥) + 𝑎 ⋅ 𝑥.
Hence
By Sard’s Theorem, −𝑎 is a regular value of 𝑔 for almost all 𝑎 ∈ ℝ𝑘 . Therefore almost every
𝑓𝑎 is a Morse function.
Now we would like to transport the concept of non-degenerate critical points to manifolds.
and hence 0 is a critical point for the function 𝑓 ◦𝜙. We call 𝑥 a non-degenerate critical
point for 𝑓 if 0 is a non-degenerate critical point for 𝑓 ◦𝜙.
Independence of choice:
Since we made a choice of a local parametrization for this definition, we need to make sure
that the criterion is independent of the choice.
Assuming that 𝑥 is a critical point of 𝑓 , i.e., 𝑑𝑓𝑥 = 0, the chain rule implies for the two
Hessian matrices at 0:
𝐻(𝑓 ◦𝜙)0 = (𝑑𝜃0 )𝑡 𝐻(𝑓 ◦𝜓)0 𝑑𝜃0 .
Since 𝑑𝜃0 is invertible, we see
An important result on Morse functions is that they can be described in some sort of canon-
ical form. It extends our understanding of the local behavior of smooth maps and is a key
result in this section:
for all 𝑥 ∈ 𝜙(𝑈 ) where 𝑠 is the number of negative eigenvalues of the Hessian of 𝑓 at
𝑎.
For the proof of Lemma 7.10 we follow [12, Part I, §2.1]. We are going to use the following
general observation:
∑
𝑘
𝑓 (𝑥1 , … , 𝑥𝑘 ) = 𝑥𝑖 𝑔𝑖 (𝑥1 , … 𝑥𝑘 )
𝑖=1
𝜕𝑓
for some suitable smooth functions 𝑔𝑖 ∶ 𝔹𝑘 → ℝ with 𝑔𝑖 (0) = 𝜕𝑥𝑖
(0).
Proof: We have
1
𝑑𝑓 (𝑡𝑥1 , … , 𝑡𝑥𝑘 )
𝑓 (𝑥1 , … , 𝑥𝑘 ) = 𝑑𝑡
∫0 𝑑𝑡
1∑
𝑘
𝜕𝑓
= (𝑡𝑥 , … , 𝑡𝑥𝑘 ) ⋅ 𝑥𝑖 𝑑𝑡.
∫0
𝑖=1
𝜕𝑥𝑖 1
1 𝜕𝑓
Hence we may set 𝑔𝑖 (𝑥1 , … 𝑥𝑘 ) ∶= ∫0 𝜕𝑥𝑖
(𝑡𝑥1 , … , 𝑡𝑥𝑘 ) ⋅ 𝑥𝑖 𝑑𝑡.
Proof of Lemma 7.10: Without loss of generality, we can assume that 𝑎 is the origin and
that 𝑓 (𝑎) = 𝑓 (0) = 0. By Lemma 7.11 we can write
∑
𝑘
𝑓 (𝑥1 , … , 𝑥𝑘 ) = 𝑥𝑗 𝑔𝑗 (𝑥1 , … 𝑥𝑘 )
𝑗=1
∑
𝑘
𝑔𝑗 (𝑥1 , … , 𝑥𝑘 ) = 𝑥𝑗 ℎ𝑖𝑗 (𝑥1 , … 𝑥𝑘 )
𝑗=1
for suitable smooth functions ℎ𝑖𝑗 defined on the small neighborhood around the origin. Com-
bining these expressions we get
∑
𝑘
𝑓 (𝑥1 , … , 𝑥𝑘 ) = 𝑥𝑖 𝑥𝑗 ℎ𝑖𝑗 (𝑥1 , … 𝑥𝑘 )
𝑖,𝑗=1
134 7.2. Morse Functions
We can assume that ℎ𝑖𝑗 = ℎ𝑗𝑖 since we can replace ℎ𝑖𝑗 with ℎ̄ 𝑖𝑗 = 21 (ℎ𝑖𝑗 + ℎ𝑗𝑖 ) and then get
∑
ℎ̄ 𝑖𝑗 = ℎ̄ 𝑗𝑖 and 𝑓 = 𝑥𝑖 𝑥𝑗 ℎ̄ 𝑖𝑗 . By construction we then get
𝜕2𝑓
ℎ𝑖𝑗 (0) = (0).
𝜕𝑥𝑖 𝜕𝑥𝑗
Thus the matrix 𝐻 with entry ℎ𝑖𝑗 (0) in position (𝑖, 𝑗) is the Hessian matrix of 𝑓 , computed in
the local coordinate system defined by 𝜙. Since 0 is a nondegenerate critical point, the Hessian
is invertible.
Now we have to show that we can choose a coordinate system such that 𝑓 has the simple
form of the lemma. To do so, we write 𝐻 ∶= 𝐻(𝑓 ◦𝜙))0 for the Hessian matrix and 𝐱 =
(𝑥1 , … , 𝑥𝑘 ). Then we have locally
𝑓 = 𝐱𝑡 𝐻𝐱.
Since the Hessian matrix 𝐻 is real symmetric, there is an orthonormal matrix 𝑃 such that
𝑃 𝑡 𝐻𝑃 is a diagonal matrix. Note that multiplying with 𝑃 𝑡 , or 𝑃 , corresponds to a change of
basis ℝ𝑘 → ℝ𝑘 . We write 𝐲 = 𝑃 𝑡 𝐱 for the coordinates with respect to the new basis. Let
𝜆1 , … , 𝜆𝑘 be the eigenvalues of 𝐻, where we order them, and hence the columns in 𝑃 , such
that the first 𝑠 𝜆𝑖 are negative and the remaining 𝑘 − 𝑠 are positive. Note that 𝐻 does not have
eigenvalue 0, since it is invertible by the assumption on 𝑎 being a nondegenerate critical point.
Then we get
∑𝑘
𝑡 𝑡 𝑡 𝑡 𝑡
𝑓 = 𝐱 𝑃 𝑃 𝐻𝑃 𝑃 𝐱 = 𝐲 𝑃 𝐻𝑃 𝐲 = 𝜆𝑖 𝑦2𝑖 .
𝑖=1
Example 7.12 Let us look at the example of the height function on the torus. To make
the notation compatible with the above theorem, we denote the hight function by 𝑓 .
This is just a projection onto the vertical coordinate of the points on the torus. At the
point 𝑝 on the torus with 𝑓 (𝑝) = 0, we can choose local coordinates 𝑥, 𝑦 and write
𝑓 (𝑥, 𝑦) = 𝑓 (𝑝) + 𝑥2 + 𝑦2 = 𝑥2 + 𝑦2 .
At the next two critical values, we can choose local coordinates 𝑥, 𝑦 write 𝑓 as
𝑓 (𝑥, 𝑦) = constant + 𝑥2 − 𝑦2 .
At the final critical value, we can choose local coordinates 𝑥, 𝑦 such that
𝑓 (𝑥, 𝑦) = constant − 𝑥2 − 𝑦2 .
We are now going to discuss some situations where the Morse Lemma 7.10 is useful. We
will see another important application in Section 17.6 when we discuss the Poincaré–Hopf Index
Chapter 7. Sard’s theorem and Morse functions 135
Theorem 17.16. First, we can generalize the fact that almost all functions are Morse to the level
of manifolds: Suppose 𝑋 ⊂ ℝ𝑁 , and let 𝑥1 , … , 𝑥𝑁 ∈ ℝ𝑁 be the usual coordinate functions on
ℝ𝑁 . If 𝑓 ∶ 𝑋 → ℝ is a smooth function on 𝑋 and 𝑎 = (𝑎1 , … , 𝑎𝑁 ) is an 𝑁-tuple of numbers,
we define again a new function 𝑓𝑎 ∶ 𝑋 → ℝ by
𝑓𝑎 ∶= 𝑓 + 𝑎1 𝑥1 + ⋯ + 𝑎𝑁 𝑥𝑁 .
Theorem 7.13 (Morse functions on manifolds are generic) For every smooth function
𝑓 ∶ 𝑋 → ℝ and for almost every 𝑎 ∈ ℝ𝑁 , 𝑓𝑎 is a Morse function on 𝑋, i.e., all its
critical points are nondegenerate.
Proof: We would like to use the above result for 𝑈 ⊂ ℝ𝑘 open. Since 𝑋 ⊂ ℝ𝑁 is in general
not open (in fact, it is never open if dim 𝑋 < 𝑁), the strategy is to cover 𝑋 by open subsets
and then try to lift the 𝑘-dimensional result to open sets in ℝ𝑁 .
So let 𝑥 be any point in 𝑋. First we are going to choose a suitable local coordinate system
around 𝑥. Let 𝑣1 , … , 𝑣𝑘 ∈ ℝ𝑁 be a basis of 𝑇𝑥 (𝑋) (for 𝑘 = dim 𝑋). Then the matrix [𝑣1 ⋯ 𝑣𝑘 ],
having the 𝑣𝑖 ’s as columns, has rank 𝑘. Hence it has 𝑘 linearly independent rows, say 𝑖1 , … , 𝑖𝑘 .
Let 𝜋 ∶ ℝ𝑁 → ℝ𝑘 be projection defined by (𝑥1 , … , 𝑥𝑁 ) → (𝑥𝑖1 , … , 𝑥𝑖𝑘 ) where the 𝑥1 , … , 𝑥𝑁
denote the standard coordinates on ℝ𝑁 . Then
Hence we can take the 𝑘-tuple of functions (𝑥𝑖1 , … , 𝑥𝑖𝑘 ) ∶ 𝑋 → ℝ𝑘 to define a local coordinate
system around 𝑥.
Therefore we can cover 𝑋 with open subsets 𝑈𝛼 ⊆ ℝ𝑁 such that on each 𝑈𝛼 some 𝑘-tuple of
the functions 𝑥1 , … , 𝑥𝑁 on ℝ𝑁 form a coordinate system. Moreover, by some general nonsense
on the topology of Euclidean space, it is always possible to choose a countable subfamily of
the 𝑈𝛼 ’s. Hence we may assume there are only countably many 𝑈𝛼 .
Let 𝑆 ⊂ ℝ𝑁 be the subset of 𝑎 such that 𝑓𝑎 is not Morse. Since the countable union of
sets with measure zero has measure zero, it suffices to show that for each 𝑈𝛼 the set 𝑆𝛼 of 𝑎’s
such that 𝑓𝑎 ∶ 𝑈𝛼 → ℝ is n̊ot Morse, has measure zero.
So let us look at one of the 𝑈𝛼 ’s. We want to show that 𝑆𝛼 has measure zero in ℝ𝑁 .
The function 𝑥 → 𝑓 (𝑥) + 𝑐 ⋅ (𝑥𝑘+1 , … , 𝑥𝑁 ) is smooth. Hence we can apply our previous result
on genericity of Morse functions on open subsets in ℝ𝑘 to this function and get that 𝑓𝑎 is a
Morse function for almost every 𝑏 ∈ ℝ𝑘 .
136 7.2. Morse Functions
Thus, for a fixed 𝑐, the subset of all 𝑏 ∈ ℝ𝑘 where 𝑓𝑎 is not Morse, has measure zero in
ℝ𝑘 . Hence 𝑆𝛼 ∩ (ℝ𝑘 × {0}) has measure zero in ℝ𝑁 . It is a classical result in Measure Theory,
called Fubini’s Theorem, which then implies that the set 𝑆𝛼 of all 𝑎 = (𝑏, 𝑐) where 𝑎 does not
yield a Morse function has measure zero in ℝ𝑁 . Hence 𝑓𝑎 is a Morse function for almost every
𝑎.
Finally, we can also show that being a Morse function is a stable property. In order to prove
stability, we start with a little lemma:
Lemma 7.14 Let 𝑓 be a smooth function on an open set 𝑈 ⊂ ℝ𝑘 . For each 𝑥 ∈ 𝑈 , let
𝐻(𝑓 )𝑥 be the Hessian matrix of 𝑓 at 𝑥. Then 𝑓 is a Morse function if and only if
( )2
∑
𝑘
𝜕𝑓
2
(det(𝐻(𝑓 )𝑥 )) + (𝑥) > 0 for all 𝑥 ∈ 𝑈 . (7.2)
𝑖=1
𝜕𝑥𝑖
𝜕𝑓 𝜕𝑓
Proof: A point 𝑥 is regular if 𝑑𝑓𝑥 = ( 𝜕𝑥 (𝑥), … , 𝜕𝑥 (𝑥)) ≠ 0, and 𝑥 is a nondegenerate
1 𝑘
critical point if 𝑑𝑓𝑥 = 0 and det(𝐻(𝑓 )𝑥 ) ≠ 0. Hence 𝑓 is Morse if and only if (7.2) is satisfied.
Then we can set 𝜀 ∶= minimum of 𝜀(𝑥1 ), … , 𝜀(𝑥𝑛 ). Then 𝑓𝑡 ∶ 𝑋 → ℝ is a Morse function for
all 𝑡 ∈ [0, 𝜀).
8. Smooth Homotopy
In this chapter we are going to introduce one of the most important concepts in topology: homo-
topies between maps. The idea of studying objects up to homotopy has turned out be extremely
successful in many areas in mathematics.
8.1 Homotopies
Definition 8.1 (Homotopy) Let 𝑋 and 𝑌 be two topological spaces and let 𝐼 = [0, 1]
denote the unit interval in ℝ. We say that two continuous maps 𝑓0 and 𝑓1 from 𝑋 to 𝑌
are homotopic, denoted 𝑓0 ∼ 𝑓1 , if there exists a continuous map
𝐹 ∶ 𝑋 × [0, 1] → 𝑌
such that 𝐹 (𝑥, 0) = 𝑓0 (𝑥) and 𝐹 (𝑥, 1) = 𝑓1 (𝑥). 𝐹 is called a homotopy between 𝑓0 and
𝑓1 . We also write 𝑓𝑡 (𝑥) for 𝐹 (𝑥, 𝑡). In other words, a homotopy is a family of continuous
functions 𝑓𝑡 which continuously interpolates between 𝑓0 and 𝑓1 .
Figure 8.1: A family of maps which interpolates between a constant map and the sine function.
Example 8.3 Let 𝛾 ∶ [0, 1] → ℝ2 be a loop, i.e., a continuous map where start and end
points agree: 𝛾(0) = 𝛾(1). Then 𝛾 is homotopic to the constant map [0, 1] → {0} ⊂
ℝ2 . See Figure 8.2. In fact, this is true when we replace ℝ2 with any ℝ𝑘 , since ℝ𝑘 is
138
Chapter 8. Smooth Homotopy 139
contractible as we will show in the exercises.
Example 8.4 In Exercise 8.4 we will show that the antipodal map on the 𝑘-sphere
𝕊𝑘→ 𝕊𝑘 ,
𝑥 → −𝑥 (which sends a point to the point on the other side of the sphere) is
homotopic to the identity on 𝕊𝑘 if 𝑘 is odd. See Figure 8.3.
Figure 8.3: The antipodal map on the sphere may or may not be homotopic to the identity map.
This is a subtle phenomenon which we will study in more detail later.
Example 8.5 (Two maps which are not homotopic) The constant map 𝑓 ∶ 𝕊1 →
ℝ2 ⧵ {0}, 𝑝 → (1, 0) is not homotopic to the map 𝑔 ∶ 𝕊1 → ℝ2 ⧵ {0}, 𝑝 → 𝑝. We are
not yet able to prove this claim. It often turns out that showing that a homotopy cannot
exist is much harder to find a homotopy. We will later develop techniques and invariants
that will allow us to handle such problems. In particular, degrees and winding numbers
will help us.a
There are also other ways to prove the claim, for example using Brouwer’s fixed point theorem which
a
Remark 8.6 (The homotopy category) Homotopy is one of the most crucial concepts
in topology. In fact, a lot of properties in topology are invariant under homotopy.
Therefore, they can be studied by considering maps only ‘up to homotopy’. This leads
to the construction of the homotopy category of spaces in which morphisms are ho-
motopy classes of maps, i.e., continuous maps 𝑓 and 𝑔 represent the same morphism if
and only if 𝑓 and 𝑔 are homotopic. Passing to the homotopy category is a very power-
ful idea which has had tremendous influence in many areas of mathematics. We will,
however, not be able to fully appreciate the homotopy category this semester. You will
meet homotopies also in algebraic topology, homological algebra, model categories, ∞-
categories, the theory of motives in algebraic geometry, new foundations of logic and
set theory and in many other areas of mathematics.
For the study of smooth manifolds, it is desirable to strengthen our assumptions on what a
homotopy is and to require it to be a smooth family of maps. We first present the necessary
definition and will then see that it actually is not such a strong restriction after all.
Definition 8.7 (Smooth homotopy) Let 𝑋 and 𝑌 be smooth manifolds. We say that
two smooth maps 𝑓0 and 𝑓1 from 𝑋 to 𝑌 are smoothly homotopic, denoted 𝑓0 ∼ 𝑓1 ,
if there exists a smooth map 𝐹 ∶ 𝑋 × [0, 1] → 𝑌 such that
𝐹 is called a smooth homotopy between 𝑓0 and 𝑓1 . We also write 𝑓𝑡 (𝑥) for 𝐹 (𝑥, 𝑡). In
other words, a homotopy is a family of smooth functions 𝑓𝑡 which smoothly interpo-
lates between 𝑓0 and 𝑓1 .
∙ Recall that smoothness of 𝐹 in the second variable means that we can extend 𝐹 to a
smooth map 𝑋 × [−𝜀, 1 + 𝜀] → 𝑌 for some small 𝜀 > 0.
To allow only smooth homotopies might seem like a strong restriction. However, it turns
out that we can always approximate a continuous map by a smooth map within its homotopy
class:
∙ Let 𝑔0 , 𝑔1 ∶ 𝑋 → 𝑌 be two smooth maps which are homotopic. Then they are
smoothly homotopic.
Chapter 8. Smooth Homotopy 141
We will prove this important result in Section 12.2. See Theorem 12.20 in particular. The
key tool we will use for the proof are tubular neighborhoods that we will introduce and study in
Section 12.1. For the moment, however, we rather move on and show that smooth homotopy is
an equivalence relation and will then see an application of homotopies.
In order to show that being smoothly homotopic defines an equivalence relation we need to
introduce a new tool: smooth bump functions. It will turn out to be extremely useful in many
applications later. It is a special feature of differential topology that we have the following type
of functions at our disposal:
2
since 𝑒−1∕𝑥 goes faster to 0 than any rational polynomial can go to ±∞.
⎧𝑔(𝑥) = 0 𝑥 ≤ 𝑎 since 𝑓 (𝑥 − 𝑎) = 0
⎪
⎨𝑔(𝑥) > 0 𝑎 < 𝑥 < 𝑏
⎪𝑔(𝑥) = 0 𝑥 ≥ 𝑏 since 𝑓 (𝑏 − 𝑥) = 0.
⎩
⎧ℎ(𝑥) = 0 𝑥≤𝑎
⎪
⎨0 < ℎ(𝑥) < 1 𝑎 < 𝑥 < 𝑏
⎪ℎ(𝑥) = 1 𝑥 ≥ 𝑏.
⎩
⎧𝐻(𝑥) = 1 𝑥 ∈ 𝔹̄ 𝑎 (0)
⎪
⎨0 < 𝐻(𝑥) < 1 𝑎 < |𝑥| < 𝑏
⎪𝐻(𝑥) = 0 𝑥 ∈ ℝ𝑘 ⧵ 𝔹𝑏 (0).
⎩
Figure 8.4: A smooth bump function ℝ → ℝ. Its higher dimensional analogs are based on
this one. The function ℎ has value zero up to 𝑎 and value one from 𝑏 onwards. in between ℎ
increases smoothly.
∙ Reflexivity: This is clear as every map is homotopic to itself via the homotopy 𝑓𝑡 = 𝑓
for all 𝑡.
∙ Symmetry: Suppose 𝑓 ∼ 𝑔 and let 𝐹 be a homotopy. Then the map defined by (𝑥, 𝑡) →
𝐹 (𝑥, 1 − 𝑡) is a homotopy from 𝑔 to 𝑓 . Hence 𝑔 ∼ 𝑓 as well.
Chapter 8. Smooth Homotopy 143
∙ Transitivity: Suppose 𝑓 ∼ 𝑔 and 𝑔 ∼ ℎ, and let 𝐹 be a homotopy from 𝑓 to 𝑔 and 𝐺
be a homotopy from 𝑔 to ℎ. We would like to compose 𝐹 and 𝐺 to get a homotopy from
𝑓 to ℎ. Since we require our homotopies to be smooth, we need to make sure that the
transition from 𝐹 to 𝐺 is smooth.
In order to this, we need to manipulate 𝐹 and 𝐺 a bit. And here we are lucky that we have
our smooth bump functions at our disposal. So let 𝜑 ∶ ℝ → ℝ be a smooth function
such that
{
0 𝑥 ≤ 1∕4
𝜑(𝑡) =
1 𝑥 ≥ 3∕4
We will now study the following question: Assume that 𝑓0 has a given property, for example let
us say that 𝑓0 is a submersion. If there is a smooth homotopy between 𝑓0 and 𝑓1 , do we know
that 𝑓1 also is a submersion?
In other words, we would like to know which properties of maps are or are not invariant
under homotopy.
In fact, many of the properties we have studied so far are not invariant, i.e., if 𝑓0 has a
property 𝑃 and 𝑓𝑡 is a homotopy from 𝑓0 to 𝑓1 , then it is often not true that 𝑓1 has property 𝑃 .
For example, we could start with an embedding 𝑓0 and end up with a constant map 𝑓1 .
So let us ask a more modest question: given 𝑓0 has property 𝑃 , is there always a small
𝜀 > 0 such that 𝑓𝑡 has property 𝑃 for all 𝑡 ∈ [0, 𝜖)? For example, if 𝑓0 is an embedding there
is always a small 𝜀 > 0 such that 𝑓𝑡 remains an embedding for 0 ≤ 𝑡 < 𝜀. We will say that
being an embedding is stable:
∙ We also call the maps which have a stable property, a stable class. Examples are the
classes of embeddings, local diffeomorphisms, submersions,... as we will learn soon.
144 8.3. Stable properties
∙ Note that stability is a very natural condition to ask for: For real-world measurements,
only stable properties are interesting, since any tiny perturbation of the data would make
an unstable property appear or disappear.
In order to get a better idea of stability, let us look at the difference between requiring that
things merely intersect or that they intersect transversally:
∙ That a smooth map 𝑓0 ∶ ℝ → ℝ2 merely intersects the 𝑥-axis is not a stable property.
It disappears immediately. See Figure 8.6.
∙ That two smooth curves (connected 1-dimensional manifolds) meet in ℝ3 is not a stable
property. It disappears immediately. See Figure 8.8.
Chapter 8. Smooth Homotopy 145
∙ That a smooth curve and a smooth surface (2-dimensional manifold) intersect transver-
sally in ℝ3 is a stable property. It persists after a small perturbation. See Figure 8.9.
Figure 8.9: Intersection of a surface and a curve in three-dimensional space is not stable. The
dimensions have to add up.
The following theorem tells us that the properties which turned out to be useful for us so far
are all stable.
(b) immersions,
(c) submersions,
(e) embeddings,
(f) diffeomorphisms.
∙ Note that the assumption that 𝑋 is compact is crucial and not just made for convenience.
146 8.3. Stable properties
The next example will show that we cannot drop this assumption for any of the properties
in theorem:
Example 8.12 (Compactness matters) The Stability Theorem fails when 𝑋 is not
compact. For a simple example, let 𝜌 ∶ ℝ → ℝ be a smooth function with 𝜌(𝑠) = 1 for
|𝑠| < 1 and 𝜌(𝑠) = 0 for |𝑠| > 2. Then we define
𝑓𝑡 ∶ ℝ → ℝ, 𝑓𝑡 (𝑥) = 𝑥𝜌(𝑡𝑥).
To emphasise what is going wrong, let us replace the domain with a closed interval,
say 𝑋 = [𝑎, 𝑏] with 𝑏 > 0, which is a compact subspace of ℝ. Then we can choose
𝜀 > 0 which is small enough such that 1∕𝜀 > max(|𝑎|, |𝑏|). This implies that 𝑥 is not
bigger than 1∕|𝑡|. Then we have 𝑓𝑡 (𝑥) = 𝑥 for all 𝑥 and all 𝑡 < 𝜀.
(a) First we note that local diffeomorphisms are just immersions in the special case when
dim 𝑋 = dim 𝑌 , so (a) follows from (𝑏).
Given a point 𝑥0 ∈ 𝑋, that 𝑑(𝑓0 )𝑥0 is injective implies that the matrix representing 𝑑(𝑓0 )𝑥0
(in local coordinates) has an 𝑚×𝑚-submatrix 𝐴(𝑥0 , 0) with nonvanishing determinant. Since
the determinant is continuous, this submatrix will have nonvanishing determinant in an
open neighborhood of (𝑥0 , 0) in 𝑋 × [0, 1]. Since 𝑋 is compact, finitely many such neighbor-
hoods suffice to cover all of 𝑋 × {0}. Hence there is a small 𝜀 > 0 (it is the minimum for the
open intervals [0, 𝜀𝑖 ) covering {0}) such that the intersection of these finitely many neighbor-
hoods contains 𝑋 × [0, 𝜀). Thus 𝑑(𝑓𝑡 )𝑥 is injective for all (𝑥, 𝑡) ∈ 𝑋 × [0, 𝜀). This is what we
needed.
(c) If 𝑓0 is a submersion, almost the same argument works. We just need to choose an
𝑛 × 𝑛-submatrix of the surjective map 𝑑(𝑓0 )𝑥 with 𝑛 = dim 𝑌 .
(d) Let 𝑍 ⊂ 𝑌 be a closed submanifold, and assume that 𝑓0 is a map which is transversal
to 𝑍. Then we have shown that, for every point 𝑥 ∈ 𝑋, there is a smooth function 𝑔 which
sends a neighborhood of 𝑓 (𝑥) to 0 ∈ ℝcodim 𝑍 and such that 𝑔◦𝑓0 is a submersion. Since 𝑍 is
closed in 𝑌 , 𝑓 −1 (𝑍) is closed in 𝑋 and therefore also compact. Therefore, by (c), there is an
𝜀 > 0 such that 𝑔◦𝑓𝑡 is still a submersion for all 𝑡 < 𝜀. This is means that 𝑓𝑡 is still transversal
to 𝑍 for all 𝑡 < 𝜀.
Chapter 8. Smooth Homotopy 147
(e) Assume that 𝑓0 is an embedding, and let 𝑓𝑡 be a homotopy of 𝑓0 . Since 𝑋 is compact,
𝑓0 and each 𝑓𝑡 are automatically proper maps. Hence we need to show that when 𝑓0 is a
one-to-one immersion, then so is 𝑓𝑡 in a small neighborhood. We just checked that being an
immersion is stable. Hence it remains to show that 𝑓𝑡 is still one-to-one if 𝑡 is small enough.
Then if (e) was false, i.e., if 𝑓𝑡 was not one-to-one in some small neighborhood of 0, then, for
every 𝜀 > 0, we can find a 𝑡 with 0 < 𝑡 < 𝜀 and 𝑥, 𝑦 ∈ 𝑋 such that 𝑓𝑡 (𝑥) = 𝑓𝑡 (𝑦). For example,
for every 𝜀𝑖 = 1∕𝑖, we could find such a 𝑡𝑖 , 𝑥𝑖 and 𝑦𝑖 . Thus there is an infinite sequence 𝑡𝑖 → 0,
and an infinite sequence of points 𝑥𝑖 ≠ 𝑦𝑖 ∈ 𝑋 where 𝑓𝑡𝑖 fails to be injective, i.e., such that
But 𝐺(𝑥0 , 0) = 𝑓0 (𝑥0 ) and 𝐺(𝑦0 , 0) = 𝑓0 (𝑥0 ). By assumption, 𝑓0 is injective, and hence
𝑥0 = 𝑦0 .
Now, after choosing local coordinates, we can express the derivative of 𝐺 at (𝑥0 , 0) by
the matrix
⎛ ∗⎞
⎜𝑑(𝑓 ) ⋮⎟
= ⎜ 0 𝑥0
∗⎟⎟
𝑑𝐺(𝑥0 ,0)
⎜
⎝ 0⋯0 1⎠
where the 0s in the lowest row arise from the fact that the first coordinates do not depend on
𝑡, and the 1 in the bottom right hand corner is the derivative of the function 𝑡 → 𝑡.
Since 𝑓0 is an immersion, 𝑑(𝑓0 )𝑥0 has 𝑘 = dim 𝑋 many independent rows. Thus the
matrix of 𝑑𝐺(𝑥0 ,0) has 𝑘 + 1 independent rows, and hence 𝑑𝐺(𝑥0 ,0) is an injective linear map.
In other words, 𝐺 is an immersion around (𝑥0 , 0) and hence 𝐺 must be one-to-one on some
neighborhood of (𝑥0 , 0).
But we have shown above that the sequences (𝑥𝑖 , 𝑡𝑖 ) and (𝑦𝑖 , 𝑡𝑖 ) both converge to (𝑥0 , 0).
This means that for large 𝑖 both (𝑥𝑖 , 𝑡𝑖 ) and (𝑦𝑖 , 𝑡𝑖 ) belong to this neighborhood. This contra-
dicts the injectivity of 𝐺.
Since 𝑋 is compact, 𝑋 has only finitely many connected components, and so does 𝑌 .
Hence we can check the statement for each of these connected components separately. For, this
gives us an 𝜀𝑖 for each component. Since there are finitely many components, we can just take
the minimum of the 𝜖𝑖 ’s as the 𝜖 for all of 𝑋 and 𝑌 .
148 8.3. Stable properties
Thus we may assume that 𝑋 and 𝑌 are connected. By (a) and (e), we know that being a
local diffeomorphism and being an embedding is a stable property. Thus there is a 𝜀 > 0 such
that 𝑓𝑡 is a local diffeomorphism and an embedding. For 𝑓𝑡 being a diffeomorphism, it remains
to show that 𝑓𝑡 is surjective.
We fix a 0 < 𝑡 < 𝜀. Since 𝑓𝑡 is a local diffeomorphism, 𝑓𝑡 is an open map and hence
𝑓𝑡 (𝑋) is open in 𝑌 . But 𝑓𝑡 (𝑋) is also closed, since it is compact being the image of a compact
space. Since 𝑌 is connected, this implies 𝑓𝑡 (𝑋) = 𝑌 .
Chapter 8. Smooth Homotopy 149
8.4 Exercises and more examples
(a) Show that if 𝑋 is contractible, then all smooth maps 𝑌 → 𝑋 from an arbitrary
manifold 𝑌 into 𝑋 are homotopic.
(b) Conversely, show that if all maps of an arbitrary manifold 𝑌 into 𝑋 are homotopic,
then 𝑋 is contractible.
Exercise 8.2 Let 𝑋 be a smooth manifold of dimension 𝑘. Show that every smooth
map 𝑓 ∶ 𝑋 → 𝕊𝑛 is homotopic to a constant map if 𝑘 < 𝑛.
Hint: Use Sard’s Theorem.
Exercise 8.3 Show that the antipodal map 𝕊𝑘 → 𝕊𝑘 , 𝑥 → −𝑥, is homotopic to the
identity if 𝑘 is odd. (We will see later that this is not true if 𝑛 is even.)
Hint: Start off with 𝑘 = 1 by using the linear maps defined by
( )
cos(𝜋𝑡) − sin(𝜋𝑡)
[0, 1] → 𝑀(2), 𝑡 → .
sin(𝜋𝑡) cos(𝜋𝑡)
Exercise 8.5 Show that every connected smooth manifold 𝑋 is path-connected, i.e.,
given any two points 𝑥0 , 𝑥1 ∈ 𝑋, there exists a smooth map 𝑓 ∶ [0, 1] → 𝑋 with 𝑓 (0) =
𝑥0 and 𝑓 (1) = 𝑥1 .
Hint: Use the fact that homotopy is an equivalence relation to show that the relation
𝑥0 and 𝑥1 can be joined by a smooth curve is an equivalence relation on 𝑋. Then show
that the equivalence classes are both open and closed subsets of 𝑋.
(b) Show that 𝑓 and 𝑔 are smooth, then they are smoothly homotopic.
(a) Show that if 𝑘 is odd, there exists a vector field on 𝕊𝑘 having no zeros.
Hint: For 𝑘 = 1, use (𝑥1 , 𝑥2 ) → (−𝑥2 , 𝑥1 ).
(b) Prove that if 𝕊𝑘 has a vector field which has no zeros, then its antipodal map 𝑥 →
−𝑥 is homotopic to the identity.
Hint: Show that you may assume |𝑠(𝑥)| = 1 everywhere. Now contemplate about
(cos(𝜋𝑡))𝑥 + (sin(𝜋𝑡))𝑠(𝑥) when 𝑡 varies from 0 to 1.
(c) Show that if 𝑘 is even, then the antipodal map on 𝕊𝑘 is homotopic to the reflection
map
𝑟 ∶ 𝕊𝑘 → 𝕊𝑘 , (𝑥1 , … , 𝑥𝑘+1 ) → (−𝑥1 , 𝑥2 , … , 𝑥𝑘+1 ).
Hint: Consider also the reflections
We would like to define manifolds without referring to a given embedding into some ℝ𝑁 . The
key idea that should be preserved in any new definition is that a manifold is a space which
locally looks like Euclidean space. First, we recall an important concept from topology:
Definition 9.1 (Hausdorff spaces) A topological space 𝑋 is called Hausdorff if, for
any two distinct points 𝑥, 𝑦 ∈ 𝑋, there are two disjoint open subsets 𝑈 , 𝑉 ⊂ 𝑋 such
that 𝑥 ∈ 𝑈 and 𝑦 ∈ 𝑉 . In other words, in a Hausdorff space we can separate points by
open neighborhoods.
∙ Every subspace of ℝ𝑁 with the relative topology is a Hausdorff space. However, there
are spaces which are not Hausdorff.
∙ For a typical example of a space which is not Hausdorff, consider two copies of the real
line 𝑌1 ∶= ℝ × {1} and 𝑌2 ∶= ℝ × {2} as subspaces of ℝ2 . On 𝑌1 ∪ 𝑌2 , we define the
equivalence relation (𝑥, 1) ∼ (𝑥, 2) for all 𝑥 ≠ 0. Let 𝑋 be the set of equivalence classes.
In fact, 𝑋 looks like the real line except that the origin is replaced with two different
copies of the origin:
Figure 9.1: A space that looks like a line with a double-point. The topology is such that we
cannot separate the two points by open subsets.
151
152 9.1. Abstract manifolds - the definition
In the definition of an abstract manifold, we want to avoid such pathological spaces. There
are several reasons for this choice. First of all, manifolds are characterised by how open neigh-
borhoods of points look like, and we would like to be able to find separate neighborhoods for
distinct points. That is exactly the Hausdorff property. But there are also slightly deeper rea-
sons. For example, we would like to use the fact that a compact subset 𝑍 of a closed subset
𝑌 ⊂ 𝑋 is itself closed in 𝑋. This general conclusion requires that 𝑋 is a Hausdorff space.
𝑉𝛼𝛽 ∶= 𝑉𝛼 ∩ 𝑉𝛽 ≠ ∅,
is smooth as a map between open subsets of ℝ𝑘 . In fact, this means that the
change-of-coordinates maps are diffeomorphisms, since they are mutual smooth
inverses to each other.
Figure 9.2: On a manifold every point is contained in the domain of a chart. If two charts
overlap we can look at the composition of the chart maps and get a map between open subsets
in ℝ𝑘 . We call this the change-of-coordinate map and require this map to be smooth in the
usual sense.
Note that the smooth 𝑘-dimensional manifolds 𝑋 ⊂ ℝ𝑁 we have been studying so far are
examples of abstract smooth 𝑘-manifolds:
∙ The Hausdorff property is satisfied in ℝ𝑁 and therefore also for every subspace of ℝ𝑁
with the relative subspace topology.
∙ Moreover, every open cover {𝑈𝛼 } of ℝ𝑁 has a countable refinement. For, we can take
the collection of all open balls which are contained in some 𝑈𝛼 , which have rational
radii, and which are centred at points having only rational coordinates.
∙ For an open cover {𝑉𝛼 } of a subset 𝑋 ⊂ ℝ𝑁 , we can write 𝑉𝛼 = 𝑈𝛼 ∩ 𝑋 for some open
subsets 𝑈𝛼 of ℝ𝑁 . Then let {𝑈̃ 𝑖 } be a countable refinement of {𝑈𝛼 } in 𝑅𝑁 , and define
𝑉̃𝑖 = 𝑈̃ 𝑖 ∩ 𝑋.
∙ The charts are just what we called local coordinates and the inverses of charts are what
we called local parametrizations. One difference is that we required local parametriza-
tions to be diffeomorphisms. For an abstract manifold 𝑋, we use the charts to define
what smoothness means for a map on 𝑋. Hence a priori it makes only sense to talk
about the smoothness of the change of coordinate maps. A posteriori we can then check
that charts are in fact diffeomorphisms.
∙ Similarly for smooth maps between manifolds. We only know what smoothness of maps
between Euclidean spaces means. Hence we need to use the charts to first translate the
maps into maps between Euclidean spaces.
154 9.1. Abstract manifolds - the definition
∙ In the abstract definition, we take care of the fact that the images of the charts/local
parametrizations overlap. In fact, we use the overlap to define the smooth structure.
∙ Finally, a chosen collection of charts is called an atlas on the manifold. One can show that
every manifold has a maximal atlas, i.e., the images of the charts are as big as possible.
Lemma 9.5 (Our smooth manifolds are also abstract manifolds) Let 𝑋 be a smooth
manifold according to our initial definition. Then 𝑋 is also an abstract smooth mani-
fold.
Remark 9.6 (It all fits together) It is nice and important to have such an intrinsic
definition of a manifold. However, the definition is quite abstract indeed. And, in
fact, we are going to show that every abstract smooth manifold can be embedded
into Euclidean space and is therefore a manifold for our previous definition. Hence all
the machinery we have developed can be applied to abstract manifolds.
There are several different, though equivalent, ways to define the tangent space of an abstract
manifold. We will look at just one way which is closest to our intuitive and concrete approach.
In addition, we have seen it in earlier exercises.
∙ Actually, it is not necessary that the curves 𝛾 are defined on all of ℝ. It suffices that
there is an open subset 𝑊 ⊂ ℝ containing 0 and 𝛾 ∶ 𝑊 → 𝑋 is a smooth map de-
fined on 𝑊 with 𝛾(0) = 𝑥. In a more sophisticated terminology, it suffices to consider
the set of germs of curves through 𝑥. Then we consider the set of all such curves and
the equivalence relation only requires that the derivatives of the composite with any chart
are equal. Anyway, we see that 𝑇𝑥 (𝑋) only depends on the local structure of 𝑋 at 𝑥.
This is an important example which we can easily be described with the new definition of an
abstract manifold, but for which it is not obvious at all how we can embed it into ℝ𝑁 .
Actually, it is a difficult question how to embed these guys into ℝ𝑁 with 𝑁 as small as
𝑟 𝑟
possible. In fact, if 𝑛 = 2𝑟 for some 𝑚 and if there is an immersion ℝP2 → ℝ2 +𝑘 , then 𝑘
must be at least 2𝑟 − 1. You will learn about the techniques to show this in the more advanced
algebraic topology course and can read about it in [14].
Definition 9.8 (Real Projective Space) The real projective 𝑛-space ℝP𝑛 is the set of
all straight lines through the origin in ℝ𝑛+1 . As a topological space, ℝP𝑛 is the quotient
space
ℝP𝑛 = (ℝ𝑛+1 ⧵ {0})∕ ∼
where the equivalence relation is given by 𝑥 ∼ 𝑦 if there is a nonzero real number 𝜆
such that 𝑥 = 𝜆𝑦. This means that a subset 𝑉 is open in ℝP𝑛 if and only if its preimage
𝑈 = {𝑥 ∈ ℝ𝑛+1 ⧵ {0} ∶ [𝑥] ∈ 𝑉 } is open in ℝ𝑛+1 ⧵ {0}.
∙ Note that since each line through the origin intersects the unit sphere in two (antipodal)
points, ℝP𝑛 can alternatively be described as
𝕊𝑛 ∕ ∼
where the equivalence relation is 𝑥 ∼ −𝑥. As a quotient of 𝕊𝑛 , we see that ℝP𝑛 is actually
compact. As we have seen, this is always very good to know about a space.
156 9.2. Real projective space
Theorem 9.9 ℝP𝑛 is a smooth manifold] Real projective 𝑛-space ℝP𝑛 is an abstract
𝑛-dimensional smooth manifold.
Proof: If 𝑥 = (𝑥0 , … , 𝑥𝑛 ) ∈ ℝ𝑛+1 ⧵ {0}, we write [𝑥] for its equivalence class considered
as a point in ℝP𝑛 . One also often writes [𝑥] = [𝑥0 ∶ … ∶ 𝑥𝑛 ].
For 0 ≤ 𝑖 ≤ 𝑛, let
𝑉𝑖 ∶= {[𝑥] ∈ ℝP𝑛 ∶ 𝑥𝑖 ≠ 0}.
The preimage of 𝑉𝑖 in ℝ𝑛+1 is the open subset {𝑥 ∈ ℝ𝑛+1 ∶ 𝑥𝑖 ≠ 0}. Hence each 𝑉𝑖 is open in
ℝP𝑛 . By varying 𝑖, this gives an open cover of ℝP𝑛 because every representative (𝑥0 , … , 𝑥𝑛 )
of a point [𝑥] ∈ ℝP𝑛 must have at least one coordinate ≠ 0 (otherwise it would be the origin
which is excluded).
and 𝜙−1
𝑖 ∶ 𝑉𝑖 → ℝ
𝑛
1
[𝑥0 ∶ … ∶ 𝑥𝑖 ∶ … ∶ 𝑥𝑛 ] → (𝑥 , … , 𝑥̂𝑖 , … , 𝑥𝑛 )
𝑥𝑖 0
1
where 𝑥̂𝑖 means that 𝑥𝑖 is omitted. Note that the quotient 𝑥𝑖
is well-defined on all of 𝑉𝑖 .
𝑖 , we need to
Since we use a representative of an equivalence class for the definition of 𝜙−1
check that the definition is independent of the chosen representative: If [𝑥0 ∶ … ∶ 𝑥𝑖 ∶ … ∶
𝑥𝑛 ] = [𝜆𝑥0 ∶ … ∶ 𝜆𝑥𝑖 ∶ … ∶ 𝜆𝑥𝑛 ] for some 𝜆 ≠ 0, then
1
𝜙−1
𝑖 ([𝜆𝑥]) = (𝜆𝑥0 , … , 𝜆𝑥𝑖−1 , 𝜆𝑥𝑖+1 , … , 𝜆𝑥𝑛 )
𝜆𝑥𝑖
1
= (𝑥0 , … , 𝑥𝑖−1 , 𝑥𝑖+1 , … , 𝑥𝑛 ) = 𝜙−1
𝑘 ([𝑥]).
𝑥𝑖
𝜙𝑖 𝜙−1
𝑗
𝜙−1
𝑖 (𝑉𝑖 ∩ 𝑉𝑗 ) ←←←←→ ← 𝜙−1
← 𝑉𝑖 ∩ 𝑉𝑗 ←←←←←←→ 𝑗 (𝑉𝑖 ∩ 𝑉𝑗 )
is just
1
(𝑥0 , … , 𝑥̂𝑖 , … , 𝑥𝑛 ) → (𝑥 , … , 𝑥𝑖−1 , 1, 𝑥𝑖+1 , … , 𝑥̂𝑗 , … , 𝑥𝑛 )
𝑥𝑗 0
Remark 9.10 (Tangent space of ℝP𝑛 ) Intuitively, we can think of the tangent space of
ℝP𝑛 at a point [𝑥] ∈ ℝP𝑛 as follows: We can consider the point [𝑥] as a pair of antipodal
points (𝑥, −𝑥) in 𝕊𝑛 . Then a tangent vector at [𝑥] may be viewed as a pair of ’antipodal
vectors’ (𝑣, −𝑣) where 𝑣 ∈ 𝑇𝑥 𝕊𝑛 and −𝑣 ∈ 𝑇−𝑥 𝕊𝑛 . To write −𝑣 may be justified by
identifying both 𝑇𝑥 𝕊𝑛 and 𝑇−𝑥 𝕊𝑛 with the subspace in ℝ𝑛+1 which is orthogonal to the
line 𝐿 through 𝑥 and −𝑥:
In Remark 9.20, and in the arguments leading up to it, we will see that there is a more
precise and canonical description of the tangent space of ℝP𝑛 at a point [𝑥] ∈ ℝP𝑛 as
where 𝐿 ⊂ ℝ𝑛+1 is the line through the origin determined by [𝑥] and 𝐿⟂ denotes the
orthogonal complement of 𝐿 in ℝ𝑛+1 .
There are many reasons why real projective space is important. One is that it comes equipped
with a very useful additional structure:
Remark 9.11 (Canonical line bundle) We discussed vector bundles briefly in Sec-
tion 2.5.3. One class of vector bundles are line bundles, i.e., each vector space at a point
is one-dimensional. Real projective 𝑛-space has a canonical line bundle, often also
called the tautological line bundle. A point in ℝP𝑛 consists of a line 𝐿 in ℝ𝑛+1 through
the origin. Considering 𝐿 both as a point in ℝP𝑛 and as a one-dimensional vector space
defines a line bundle on ℝP𝑛 .
It turns out that for every sufficiently nice topological space 𝑌 with a line bundle
→ 𝑌 , there is a continuous map 𝑌 → ℝP𝑛 (for 𝑛 large enough) such that is the
pullback of the canonical line bundle along this map. Up to homotopy, this map is in
fact unique. We refer to this phenomenon as that ℝP𝑛 is a classifying space for line
bundles. The idea is classifying spaces is very powerful and important.
Another, more geometric reason for considering projective spaces is the following:
∙ Intersection at ∞:
𝑉 = {(𝑥0 , 𝑥1 , 𝑥2 ) ∈ ℝ3 ∶ 𝑥0 𝑣0 + 𝑥1 𝑣1 + 𝑥2 𝑣2 = 0}.
Now assume we are given two distinct lines 1 and 2 in ℝP2 determined by two distinct
vectors 𝑣, 𝑤 ≠ 0 in ℝ3 , i.e.,
1 = {[𝑥0 ∶ 𝑥1 ∶ 𝑥2 ] ∈ ℝP2 ∶ 𝑥0 𝑣0 + 𝑥1 𝑣1 + 𝑥2 𝑣2 = 0}
2 = {[𝑥0 ∶ 𝑥1 ∶ 𝑥2 ] ∈ ℝP2 ∶ 𝑥0 𝑤0 + 𝑥1 𝑤1 + 𝑥2 𝑤2 = 0}.
The orthogonal complements of 𝑣 and 𝑤, respectively, are two planes through the origin.
Hence they meet in a line through the origin in ℝ3 which is the set of solutions of the two
linear equations defining 1 and 2 above. This is a one-dimensional vector subspace of ℝ3
(the kernel of a 2 × 3-matrix). By definition of ℝP2 , this subspace corresponds to a point in
ℝP2 . This is the intersection point of 1 and 2 in ℝP2 .
If this intersection line happens to be the 𝑧-axis, i.e., when 1 and 2 are represented by the
planes given by the 𝑥𝑧-plane and the 𝑦𝑧-plane, then the intersection point is [0 ∶ 0 ∶ 1] ∈ ℝP2 .
We can think of it as the point at infinity in ℝP2 .
However, in the Euclidean plane ℝ2 it may very well happen that two lines are paral-
lel and hence do not intersect. The idea for ℝP2 is to add a point at infinity which is the
intersection point for all parallel lines.
We readily verify that this is an equivalence relation. W equip the quotient space with the
quotient topology and can check that it is just the torus
ℝ×ℝ
≅ 𝕊1 × 𝕊1 = 𝕋 2 .
∼
However, let us have a closer look because we would like to use this picture to equip 𝕋 with
the structure of a manifold:
Figure 9.3: The torus 𝕋 2 can be constructed by gluing together the opposite edges of a square.
Let (𝑥, 𝑦) ∈ ℝ × ℝ and let [𝑥, 𝑦] be its equivalence class in 𝕋 . Let 𝑈𝑟 = 𝐵𝑟2 (𝑥, 𝑦) ⊂ ℝ × ℝ
be the open ball of radius 𝑟 > 0 around (𝑥, 𝑦). Then the map
ℝ×ℝ
𝑞|𝑈𝑟 ∶ 𝑈𝑟 →
∼
is a homeomorphism onto its image for all 𝑟 < 12 . For it is surjective and it is injective, since
for any two points in 𝑈𝑟 the difference of the 𝑥- or 𝑦-coordinates, respectively, is strictly less
than 2𝑟 < 1 by the choice of 𝑟. Moreover, the inverse is continuous, since we checked that 𝑞 is
an open map. We use 𝑞|𝑈𝑟 as a chart around the point [𝑥, 𝑦]. The change-of-coordinate maps
are of the form (or almost, we need to restrict the maps accordingly, but the notation would
become too annoying, so we simplify):
Another famous and slightly more complicated space is the Klein bottle defined as follows:
This time we consider the relation on ℝ × ℝ defined by
Figure 9.4: The Klein bottle can be constructed by gluing together the opposite edges of a
square, but we twist one pair of the two edges.
𝑞|𝑈𝑟 ∶ 𝑈𝑟 → 𝐾
is a homeomorphism onto its image for all 𝑟 < 21 for the same reason as for the torus. We use
𝑞|𝑈𝑟 as a chart around the point [𝑥, 𝑦]. The change-of-coordinate maps are of the form (again
we cheat a bit here):
Remark 9.12 In the exercises we study Hopf manifolds as another important and
interesting class of examples. They play a crucial role in complex geometry, since they
provide the simplest examples of compact complex manifolds which do not admit a
Kähler metric and therefore cannot be embedded into complex projective space.
Theorem 9.14 (Stiefel manifolds are manifolds) The space 𝑉𝑘 (ℝ𝑛+𝑘 ) of 𝑘-frames is
𝑘(𝑘−1)
a compact smooth manifold of dimension 𝑛𝑘 + 2
.
Remark 9.15 We observe that 𝑉𝑘 (ℝ𝑛+𝑘 ) can be identified with the quotient 𝑂(𝑛 +
𝑘)∕𝑂(𝑛). For, if [𝑣1 , … , 𝑣𝑘 ] and 𝑇 ∈ 𝑂(𝑛 + 𝑘), then multiplying each 𝑣𝑖 with 𝑇 yields
another 𝑘-frame [𝑇 𝑣1 , … , 𝑇 𝑣𝑘 ]. In fact, any two 𝑘-frames are connected in this way,
i.e., if [𝑣1 , … , 𝑣𝑘 ] and [𝑣′1 , … , 𝑣′𝑘 ] are two 𝑘-frames then there is an 𝑇 ∈ 𝑂(𝑛 + 𝑘) such
that
[𝑣′1 , … , 𝑣′𝑘 ] = [𝑇 𝑣1 , … , 𝑇 𝑣𝑘 ].
In other words, 𝑂(𝑛+𝑘) acts transitively on the set of 𝑘-frames. Moreover, the stabilizer
subgroup of a given frame is the subgroup isomorphic to 𝑂(𝑛) which acts nontrivially
on the orthogonal complement of the space spanned by that frame.
Proof of Theorem 9.14: We have already proven that 𝑉𝑘 (ℝ𝑛+𝑘 ) is compact. It remains
to show the manifold part. Note that a 𝑘-frame [𝑣1 , … , 𝑣𝑘 ] corresponds to an (𝑛 + 𝑘) × 𝑘-
matrix 𝐴 with 𝑣𝑖 as 𝑖th column. That the column vectors are orthonormal is equivalent to
that the product of the transpose of the 𝑖th column with the 𝑗th column is 1 if 𝑖 = 𝑗 and 0
otherwise. In a formula: {
1 if 𝑖 = 𝑗
𝑣𝑡𝑖 ⋅ 𝑣𝑗 =
0 if 𝑖 ≠ 𝑗.
In other words,
𝐴 represents a 𝑘-frame in ℝ𝑛+𝑘 ⇐⇒ 𝐴𝑡 𝐴 = 𝐼𝑘 ,
where 𝐼𝑘 denotes the 𝑘 × 𝑘-identity matrix.
Let 𝑀(𝑛 + 𝑘, 𝑘) be the space of (𝑛 + 𝑘) × 𝑘-matrices and 𝑆(𝑘) denote the space of symmetric
𝑘 × 𝑘-matrices. We define the map
𝑓 ∶ 𝑀(𝑛 + 𝑘, 𝑘) → 𝑆(𝑘), 𝐴 → 𝐴𝑡 𝐴
and we observe 𝑉𝑘 (ℝ𝑛+𝑘 ) = 𝑓 −1 (𝐼𝑘 ).
Hence, in order to show that 𝑉𝑘 (ℝ𝑛+𝑘 ) is a smooth manifold, we need to show that 𝐼𝑘 is a
regular value for 𝑓 . We have computed the derivative of 𝑓 at a matrix 𝐴 ∈ 𝑉𝑘 (ℝ𝑛+𝑘 ) in the
proof of Theorem 4.12. There we showed
𝑑𝑓𝐴 (𝐵) = 𝐴𝑡 𝐵 + 𝐵 𝑡 𝐴.
162 9.4. Grassmannian
In order to check that 𝐼𝑘 is a regular value, we need to show that
is surjective for all 𝐴 ∈ 𝑉𝑘 (ℝ𝑛+𝑘 ). Recall for the above computation that 𝑀(𝑛 + 𝑘, 𝑘) ≅ 𝑅(𝑛+𝑘)𝑘
and 𝑆(𝑘) ≅ ℝ𝑘(𝑘+1)∕2 are vector spaces and that the tangent space of a vector space equals the
vector space.
( ) ( )𝑡
1 1 1 1 1 1
𝑑𝑓𝐴 (𝐵) = 𝐴𝑡 𝐴𝐶 + 𝐴𝐶 𝐴 = 𝐴𝑡 𝐴𝐶 + 𝐶 𝑡 𝐴𝑡 𝐴 = 𝐶 + 𝐶 𝑡 = 𝐶.
2 2 2 2 2 2
By the Preimage Theorem 4.6, this shows that 𝑉𝑘 (ℝ𝑛+𝑘 ) is a smooth manifold of dimen-
sion
9.4 Grassmannian
There is another very important space that arises from Stiefel manifolds. Any 𝑘-frame in ℝ𝑛+𝑘
spans a 𝑘-dimensional linear subspace in ℝ𝑛+𝑘 .
Definition 9.16 (Grassmannian) The set of all 𝑘-dimensional linear subspaces in ℝ𝑛+𝑘
is called the Grassmann manifold, or short the Grassmannian,
The Grassmannian Gr 𝑘 (ℝ𝑛+𝑘 ) can be identified with the quotient of the Stiefel manifold
𝑉𝑘 (ℝ𝑛+𝑘 ) of orthonormal sequences
[𝑣1 , … , 𝑣𝑘 ]
In other words, Gr 𝑘 (ℝ𝑛+𝑘 ) is the quotient of 𝑉𝑘 (ℝ𝑛+𝑘 ) that we get by identifying 𝑘-frames
which span the same subspace in ℝ𝑛+𝑘 :
Gr 1 (ℝ𝑛+1 ) = ℝP𝑛
It is actually not so easy to embed projective spaces and Grassmannians into Euclidean
spaces. For the moment, we content ourselves with noting that Gr 𝑘 (ℝ𝑛+𝑘 ) can be embedded
at least in ℝ2𝑛𝑘+1 . However, it is an interesting and difficult question what the minimal
dimension 𝑁 such that we can embed them into ℝ𝑁 . We will get back to this point in a general
context in Section 9.5.
There are many reasons why Grassmannians are important. We will now sketch one
of them. If it does not make complete sense to you yet, consider it as a advert for a future
adventure.
We mentioned vector bundles briefly in Section 2.5. The tangent bundle on a smooth manifold is
an important example. The Grassmannian is equipped with a canonical or tautological vector
bundle, denoted 𝛾𝑘𝑛+𝑘 . It is defined as follows: A point in Gr 𝑘 (ℝ𝑛+𝑘 ) consists of a 𝑘-plane 𝑉 in
ℝ𝑛+1 . We can consider 𝑉 both as a point in Gr 𝑘 (ℝ𝑛+𝑘 ) and as a 𝑘-dimensional vector space,
and we can pick a vector 𝑣 ∈ 𝑉 . The space 𝛾𝑘𝑛+𝑘 consists of the collection of such pairs, i.e.,
Together with the projection 𝜋 ∶ 𝛾𝑘𝑛+𝑘 → Gr 𝑘 (ℝ𝑛+𝑘 ) defined by sending (𝑉 , 𝑣) to 𝑉 turns 𝛾𝑘𝑛+𝑘
into a a vector bundle on Gr 𝑘 (ℝ𝑛+𝑘 ).
𝑋 / Gr (ℝ𝑛+𝑘 ).
𝑓 𝑘
We can show that this actually defines a morphism of vector bundles. This diagram is often
called a generalized Gauss map. The maps 𝑓 and 𝑓̄ are very useful for studying manifolds.
More importantly, this picture actually has a fascinating generalization:
164 9.4. Grassmannian
𝑓̄
𝐸 ≅ 𝑓 ∗𝛾 /𝛾
𝑌 / Gr (ℝ𝑛+𝑘 ).
𝑓 𝑘
∙ Note that we will be cheating a bit for the moment. For, we define a map
𝑓 ∶ Gr 𝑘 (ℝ𝑛+𝑘 ) → ℝ𝑁
Proof: Since the quotient map 𝑞 ∶ 𝑉𝑘 (ℝ𝑛+𝑘 ) → Gr 𝑘 (ℝ𝑛+𝑘 ) is continuous and surjective
and since 𝑉𝑘 (ℝ𝑛+𝑘 ) is compact, we see that Gr 𝑘 (ℝ𝑛+𝑘 ) is compact. It remains to show that it is
a smooth manifold.
Let 𝑉 ⊂ ℝ𝑛+𝑘 be a 𝑘-dimensional linear subspace in ℝ𝑛+𝑘 , and let 𝑉 ⟂ be the orthogonal
complement of 𝑉 in ℝ𝑛+𝑘 (with respect to the standard inner product). Define the subspace
𝑉 ⊂ Gr 𝑘 (ℝ𝑛+𝑘 ) consisting of 𝑘-dimensional linear subspaces 𝑉 ′ of ℝ𝑛+𝑘 with the property
that 𝑉 ′ ∩ 𝑉 ⟂ = {0}:
𝑉 = {𝑉 ′ ∈ Gr 𝑘 (ℝ𝑛+𝑘 ) ∶ 𝑉 ′ ∩ 𝑉 ⟂ = {0}}.
Equivalently, 𝑉 is the set of all 𝑘-dimensional subspaces 𝑉 ′ ⊂ ℝ𝑛+𝑘 which are mapped sur-
jectively onto 𝑉 by the projection 𝑝 ∶ ℝ𝑛+𝑘 = 𝑉 ⊕ 𝑉 ⟂ → 𝑉 .
Chapter 9. Abstract Smooth Manifolds 165
We will use subsets of the form 𝑉 to construct local parametrizations. To do this we need
to check several things:
To see that 𝑉 is open Gr 𝑘 (ℝ𝑛+𝑘 ) it suffices to show that its preimage ̃ 𝑉 under 𝑞 is open in
𝑉𝑘 (ℝ𝑛+𝑘 ), since we defined the topology on the Grassmannian as the quotient topology induced
by
𝑞 ∶ 𝑉𝑘 (ℝ𝑛+𝑘 ) → Gr 𝑘 (ℝ𝑛+𝑘 ), [𝑣1 , … , 𝑣𝑘 ] → span(𝑣1 , … , 𝑣𝑘 ) in ℝ𝑛+𝑘 .
By definition of 𝑞 and 𝑉 , the set ̃ 𝑉 consists of all orthonormal 𝑘-frames [𝑣′1 , … , 𝑣′𝑘 ] such
that
′
𝜓𝑉 ∶ 𝑉 → Homℝ (𝑉 , 𝑊 ), 𝑉 ′ → 𝜓𝑉 (𝑉 ′ ) = pr 𝑊
𝑉′
◦(pr 𝑉𝑉 )−1
from 𝑉 to Homℝ (𝑉 , 𝑊 ). See Figure 9.5.
Γ(𝑓 ) ∩ 𝑊 = 𝑉 ∩ 𝑊 = {0}.
To prove the claim it remains to show that 𝜓𝑉 and 𝜓𝑉−1 are smooth. Note that we consider
Homℝ (𝑉 , 𝑊 ) as a topological space by identifying it with ℝ𝑛𝑘 . That is we fix an orthonormal
basis 𝑣1 , … , 𝑣𝑘 for 𝑉 and an orthonormal basis 𝑤1 , … , 𝑤𝑛 for 𝑊 = 𝑉 ⟂ . Then we identify
linear maps 𝑓 ∶ 𝑉 → 𝑊 with their associated matrix 𝐴𝑓 with respect to these bases. The
entries in 𝐴𝑓 are real numbers and we can collect them to 𝑛 ⋅ 𝑘-tuples, i.e., elements in ℝ𝑛⋅𝑘 .
We show first that 𝑇 is smooth: By definition of the topology on the Grassmannian and our
definition of smoothness on Gr 𝑘 (ℝ𝑛+𝑘 ), 𝑇 is smooth if and only if the composite
𝑞
̃ 𝑉 = 𝑞 −1 (𝑉 ) /
𝑉
𝜓𝑉
𝜓𝑉 ◦𝑞
(
Homℝ (𝑉 , 𝑊 )
is smooth. Let [𝑣′1 , … , 𝑣′𝑛 ] be an orthonormal 𝑘-frame which spans 𝑉 ′ . The entries in the matri-
′
ces of the maps pr 𝑊𝑉′
and (pr 𝑉𝑉 )−1 , respectively, expressed in the bases of 𝑉 , 𝑊 and 𝑉 ′ , depend
′
smoothly on the coordinates of the 𝑣′𝑖 . Hence matrix of the map 𝜓𝑉 (𝑉 ′ ) = pr 𝑊 𝑉′
◦(pr 𝑉𝑉 )−1
depends smoothly on the 𝑘-frame [𝑣′1 , … , 𝑣′𝑛 ] and 𝜓𝑉 ◦𝑞 is smooth.
Chapter 9. Abstract Smooth Manifolds 167
Now let 𝑓 ∶ 𝑉 → 𝑊 be a linear map and let 𝐴𝑓 be its representing matrix (in the bases we
chose). Then there is a unique orthonormal basis 𝑣′1 , … , 𝑣′𝑛 of Γ(𝑓 ) with
𝑣′𝑖 = 𝑣𝑖 + 𝑓 (𝑣𝑖 ) = 𝑣𝑖 + 𝐴𝑓 (𝑣𝑖 )
for all 𝑖. Hence the coordinates of each 𝑣′𝑖 depend smoothly on the entries in 𝐴𝑓 . Hence the
𝑘-frame [𝑣′1 , … , 𝑣′𝑛 ] in 𝑉𝑘 (ℝ𝑛+𝑘 ) depends smoothly on 𝑓 . Since the 𝑘-frame [𝑣′1 , … , 𝑣′𝑛 ] spans
Γ(𝑓 ) = 𝜓𝑉−1 (𝑓 ), we can interpret this as Γ(𝑓 ) depends smoothly on 𝑓 and 𝜓𝑉−1 is also smooth.
This proves the third claim.
In total we have shown that the point 𝑉 ∈ Gr 𝑘 (ℝ𝑛+𝑘 ) has an open neighborhood 𝑉 which
is diffeomorphic to ℝ𝑛𝑘 . In other words, Gr 𝑘 (ℝ𝑛+𝑘 ) is indeed a smooth manifold of dimension
𝑛 ⋅ 𝑘.
Remark 9.20 (Tangent space of the Grassmannian) The local coordinate chart 𝜓𝑉
gives us also a way to understand the tangent space of Gr 𝑘 (ℝ𝑛+𝑘 ) at 𝑉 . The 𝜓𝑉 behaves
like a linear map (if we keep vectors small enough) and the image Homℝ (𝑉 , 𝑉 ⟂ ) of 𝜓𝑉
is a vector space. Hence the tangent space of Homℝ (𝑉 , 𝑉 ⟂ ) at 0 is Homℝ (𝑉 , 𝑉 ⟂ ) itself.
Therefore, we can use 𝑑𝜓𝑉 to get a linear isomorphism
It is useful to also introduce a slightly different argument for proving the theorem:
Any 𝑘-dimensional linear subspace in ℝ𝑛+𝑘 is spanned by 𝑘 row vectors in ℝ𝑛+𝑘 . Hence
any such 𝑘-dimensional linear subspace 𝑉 can be represented by a 𝑘 × (𝑛 + 𝑘)-matrix
⎛𝑣11 ⋯ 𝑣1𝑛+𝑘 ⎞
𝐴=⎜ ⋮ ⋱ ⋮ ⎟
⎜ ⎟
⎝𝑣𝑘1 ⋯ 𝑣𝑘𝑛+𝑘 ⎠
where any two such matrices 𝐴 and 𝐴′ represent the same element in Gr 𝑘 (ℝ𝑛+𝑘 ) if and only if
there is an invertible matrix 𝑘 × 𝑘-matrix 𝑇 such that
𝐴′ = 𝑇 ⋅ 𝐴.
Now let 𝐼 = {𝑖1 , … , 𝑖𝑘 } ⊂ {1, … , 𝑛 + 𝑘} be a 𝑘-tuple with 𝑖1 < 𝑖2 < ⋯ < 𝑖𝑘 . We define
the subset 𝐼 of Gr 𝑘 (ℝ𝑛+𝑘 ) to be the set of all 𝑘-planes which are represented by a 𝑘 × (𝑛 + 𝑘)-
matrix 𝐴 such that the 𝑘 × 𝑘-submatrix consisting of the columns 𝑖1 , … , 𝑖𝑘 is invertible. We
call this submatrix the I-th 𝑘 × 𝑘-minor of 𝐴. Recall that elementary row operations that
we learned about in our very first encounter with linear algebra do not change the row space,
i.e., the space spanned by the row vectors. Hence, after performing suitable row operations,
we see that every 𝑉 ∈ 𝐼 is represented by a unique 𝑘 × (𝑛 + 𝑘)-matrix in which this I-th
𝑘 × 𝑘-minor is the 𝑘 × 𝑘-identity matrix. In other words, every 𝑉 ∈ 𝐼 is represented by a
unique 𝑘 × (𝑛 + 𝑘)-matrix 𝐴𝐼 (𝑉 ) in which the 𝑗th column is the standard basis vector 𝑒𝑗 of
length 𝑘 if 𝑗 ∈ 𝐼 and an arbitrary vector of length 𝑘 if 𝑗 ∉ 𝐼.
168 9.4. Grassmannian
Example 9.21 (Representing planes by minors) Let us look at how this works:
∙ More concretely, let 𝑘 = 3 and 𝑛 + 𝑘 = 5 and 𝐼 = {1, 3, 4}. Let 𝑉 be the 3-plane,
i.e., 3-dimensional linear subspace, spanned by the rows of the matrix
⎛3 6 0 1 −18⎞
𝐴 = ⎜3 1 1 0 −7 ⎟ .
⎜ ⎟
⎝5 6 1 1 −21⎠
⎛3 0 1⎞
𝑀 = ⎜3 1 0⎟
⎜ ⎟
⎝5 1 1⎠
consisting of the first, third and fourth column of 𝐴. This minor is an invertible
matrix with inverse
⎛1 1 −1⎞
𝑀 −1 = ⎜−3 −2 3 ⎟ .
⎜ ⎟
⎝−2 −3 3 ⎠
Then 𝑉 ∈ 𝐼 is uniquely represented by the matrix
⎛1 1 0 0 −4⎞
𝐴𝐼 = 𝑀 −1
⋅ 𝐴 = ⎜0 −2 1 0 5 ⎟ .
⎜ ⎟
⎝0 3 0 1 −6⎠
Conversely, any 𝑘×(𝑛+𝑘)-matrix with invertible I-th 𝑘×𝑘-minor defines a unique 𝑘-plane
in 𝐼 .
Thus, for each 𝐼, the 𝑘 ⋅ 𝑛 entries in the remaining columns of any 𝑘 × (𝑛 + 𝑘)-matrix of this
form define a bijection of sets
≅
← ℝ𝑘⋅𝑛 .
𝜓𝐼 ∶ 𝐼 ←←←→
Example 9.22 (Real projective space) For 𝑘 = 1, we have seen that Gr 1 (ℝ𝑛+1 ) = ℝP𝑛 .
A point in ℝP𝑛 is represented by an (𝑛 + 1)-tuple 𝑥1 , … , 𝑥𝑛+1 . Recall that we denote the
equivalence class by [𝑥] = [𝑥1 ∶ … ∶ 𝑥𝑛+1 ]. The sets 𝐼 then consist each of just one
number 𝑖 ∈ {1, … , 𝑛 + 1}. The set 𝑖 is then equal to the subset 𝑉𝑖 we have defined
previously:
𝑖 = 𝑉𝑖 = {[𝑥] ∈ ℝP𝑛 ∶ 𝑥𝑖 ≠ 0}.
The representing matrix 𝐴𝑖 of [𝑥] is the matrix with just one row
Chapter 9. Abstract Smooth Manifolds 169
(𝑥1 ∕𝑥𝑖 , … , 1, … , 𝑥𝑛+1 ∕𝑥𝑖 ) and a 𝑒1 = 1 in column 𝑖.
We also note that the subset 𝜓𝐼 (𝐼 ∩ 𝐼 ′ ) is open in ℝ𝑘⋅𝑛 for every pair 𝐼, 𝐼 ′ . Now we
can give a second proof of Theorem 9.19:
Proof: We already know that Grassmannians are compact. To show it once again anyway
we could argue as follows: Let 𝑉𝑘 be the 𝑘-plane in ℝ𝑛+𝑘 spanned by the first 𝑘 standard basis
𝑒1 , … , 𝑒𝑘 ∈ ℝ𝑛+𝑘 of length 𝑛 + 𝑘. We deduce from what we learned above that we have a
surjective map
𝑞 ∶ 𝑂(𝑛 + 𝑘) → Gr 𝑘 (ℝ𝑛+𝑘 ), 𝑇 → 𝑇 ⋅ 𝑉𝑘 .
Since 𝑂(𝑛 + 𝑘) is compact and the continuous image of compact sets is compact, this shows
that Gr 𝑘 (ℝ𝑛+𝑘 ) is compact.
The answer to this question is yes which may justify our initial approach to smooth man-
ifolds as subsets of Euclidean space. We will discuss the proof only for compact manifolds
170 9.5. Embedding Manifolds in Euclidean Space
in Section 9.5.2. The non-compact requires much more familiarity with arguments in general
topology and we leave this as an extra homework.
Once we know that every smooth abstract manifold can be considered as subspace of some
Euclidean space we will try to answer the following question:
We will approach an answer to this question in two steps: In Section 9.5.3 we show that
there always is an one-to-one immersion of 𝑋 into ℝ2𝑘+1 . If 𝑋 is compact, this immersion
will automatically be an embedding as we learned in Section 3.3. In Section 9.5.2 we will then
show that also for non-compact smooth 𝑘-manifolds there is always an embedding into ℝ2𝑘+1 .
In fact, Whitney showed that 𝑁 = 2𝑘 always works. The proof is much harder, and we will not
discuss it in these notes.
In several proofs in this section we will employ an important and very useful tool. It will
also turn out handy in several occasions later on. This will be the content of the next subsection.
We are familiar with the closure of subsets in many cases. For example, the closure of an
open ball 𝔹𝜀 (0) in ℝ𝑁 is just the closed ball
We need the closure of a subset for example when we want to talk about the support of a
function:
supp(𝑓 ) ∶= {𝑥 ∈ 𝑋 ∶ 𝑓 (𝑥) ≠ 0}
Definition 9.25 (Partition of unity) Let 𝑋 be a smooth manifold and let {𝑈𝛼 } be an
⋃
open cover, i.e., a collection of open subsets in 𝑋 such that 𝛼 𝑈𝛼 = 𝑋. A sequence
Chapter 9. Abstract Smooth Manifolds 171
of smooth functions {𝜌𝑖 ∶ 𝑋 → ℝ} is called a partition of unity subordinate to the
open cover {𝑈𝛼 } if it has the following properties:
(b) Each 𝑥 ∈ 𝑋 has a neighborhood on which all but finitely many functions 𝜌𝑖 are
identically zero.
The most general existence result for partitions of unity (only requiring that each 𝜌𝑖 is merely
continuous and not smooth) is that they exist on every paracompact space, i.e., spaces on
which every open cover has a locally finite refinement. The latter means that every point has
a neighborhood that intersects only finitely many sets in the cover.
We will postpone the proof of the existence of partitions of unity on smooth manifolds to
the end of this section. See Section 9.5.6. It is an interesting proof and we encourage everybody
to go through it at some point. However, it is also rather technical and diverts our attention too
far from our main goal. We will therefore first look at some applications of partitions of unity
and will then show that they actually exist.
Proof:
The collection of all 𝑉𝛼 for all charts (𝑉𝛼 , 𝜙𝛼 ) is an open cover of 𝑋. Since 𝑋 is compact,
we can cover 𝑋 by the images of a finite number of charts 𝑉1 , … , 𝑉𝑛 .
Let {𝜌𝑖 } be a partition of unity subordinate to the open cover defined by the 𝑉𝑖 ’s.
Remark 9.27 At this point we see why we do not use 𝑋 ⧵ 𝑉𝑖 in the definition of 𝑔𝑖
because that would be a closed subset. We also observe that it would not work to drop
the 𝜌𝑖 and just use the 𝜙𝑖 , since there is no reason why 𝜙𝑖 would get closer to zero the
closer we get to the boundary of 𝑉𝑖 . Hence there are several reasons why continuity and
smoothness would not be guaranteed without the 𝜌𝑖 .
Thus 𝑥 and 𝑦 must lie in the same 𝑉𝑖 , since 𝜌𝑖 is supported on 𝑉𝑖 , i.e., 𝜌𝑖 (𝑥) ≠ 0 implies
𝑥 ∈ 𝑉𝑖 and similarly for 𝑦. Hence, since we have 𝑔𝑖 (𝑥) = 𝑔𝑖 (𝑦) and 𝜌𝑖 (𝑥) = 𝜌𝑖 (𝑦) ≠ 𝟎, we must
have
𝜙𝑖 (𝑥) = 𝜙𝑖 (𝑦).
Actually, 𝑔 is also an immersion, but we have not defined what that means for an abstract
manifold. However, this is just an exercise in translating the definitions, and we omit this point
and rather move on.
Remark 9.28 (All manifolds can be embedded in Euclidean space) In fact, every ab-
stract 𝑘-manifold 𝑋 can be embedded in Euclidean space. One can just keep on going
with the above argument in the non-compact case and use local charts to map pieces
of 𝑋 into ℝ𝑘 . Though when using only finitely many copies of ℝ𝑘 to accommodate
infinitely many neighborhoods of 𝑋, we loose injectivity. The key tool that restores
injectivity is a partition of unity which evens out the troubles occurring because of
overlapping neighborhoods.
For this to work, it is crucial that the topology on 𝑋 has a countable basis. This is
a technical point which we do not discuss any further because it would divert us too far
from the main story.
We just remark that it is possible to construct topological spaces without a count-
able basis which are locally homeomorphic to Euclidean space, but which cannot be
embedded into Euclidean space.
ℝ𝑁 → ℝ𝑁−1 → ⋯ → ℝ2𝑘+1
𝑓 ∶ 𝑋 → ℝ𝑀 with 𝑀 > 2𝑘 + 1.
𝑋×𝑋×ℝ
ℎ
𝑇 (𝑋) / ℝ𝑀
𝑔
by
and, using 𝑇𝑦 (ℝ𝑀 ) = ℝ𝑀 at any 𝑦 ∈ ℝ𝑀 and hence 𝑑𝑓𝑥 is a map 𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝑋) → ℝ𝑀 ,
By Sard’s Theorem 7.1 the sets of regular values 𝑅𝑔 and 𝑅ℎ for 𝑔 and ℎ, respectively, are
dense subsets in ℝ𝑀 . Thus their intersection is non-empty and there exists a point in ℝ𝑀 which
is a regular value for both 𝑔 and ℎ simultaneously.
Since dim 𝑇 (𝑋) = 2𝑘, dim 𝑋 × 𝑋 × ℝ = 2𝑘 + 1, but 𝑀 > 2𝑘 + 1, the only regular values
of 𝑔 and ℎ are the points in ℝ𝑀 which are not in the image of 𝑔 or ℎ. Hence there exists a
point 𝑎 ∈ ℝ𝑀 which is neither in the image of 𝑔 nor in the image of ℎ. Note that, since 0
belongs to both images, we must have 𝑎 ≠ 0.
174 9.5. Embedding Manifolds in Euclidean Space
Remark 9.30 (Cannot reduce beyond 2𝑘 + 1) Before we move on, note that we could
only make this argument for 𝑀 > 2𝑘 + 1. Hence this induction argument of shrinking
𝑀 further and further cannot be extended beyond 2𝑘 + 1.
For suppose that 𝜋◦𝑓 (𝑥) = 𝜋◦𝑓 (𝑦). Then, since 𝜋 is linear, we have
𝜋(𝑓 (𝑥) − 𝑓 (𝑦)) = 0,
i.e.,
𝑓 (𝑥) − 𝑓 (𝑦) ∈ Ker (𝜋) = span(𝑎) in ℝ𝑀
= {𝑤 ∈ ℝ𝑀 ∶ 𝑤 = 𝑡 ⋅ 𝑎 for some 𝑡 ∈ ℝ}.
Thus there is a 𝑡 ∈ ℝ with 𝑓 (𝑥) − 𝑓 (𝑦) = 𝑡𝑎. If 𝑥 ≠ 𝑦 then 𝑡 ≠ 0, since 𝑓 is injective. But then
𝑎 = 1∕𝑡(𝑓 (𝑥) − 𝑓 (𝑦)) = ℎ(𝑥, 𝑦, 1∕𝑡)
which contradicts the choice of 𝑎 not being in the image of ℎ.
For suppose there was a nonzero vector 𝑣 in 𝑇𝑥 (𝑋) for which 𝑑(𝜋◦𝑓 )𝑥 = 0. Because 𝜋 is
linear, we have 𝑑𝜋𝑓 (𝑥) = 𝜋 and the chain rule yields
𝑑(𝜋◦𝑓 )𝑥 = 𝜋◦𝑑𝑓𝑥 .
Thus 𝜋(𝑑𝑓𝑥 (𝑣)) = 0, so 𝑑𝑓𝑥 (𝑣) = 𝑡𝑎 for some 𝑡 ∈ ℝ. Because 𝑓 is an immersion, we must
have 𝑡𝑎 ≠ 0. Since we know 𝑎 ≠ 0, this implies 𝑡 ≠ 0. Thus, since 𝑑𝑓𝑥 is linear,
( ) ( )
1 1 1
𝑎 = 𝑑𝑓𝑥 (𝑣) = 𝑑𝑓𝑥 𝑣 = 𝑔 𝑥, 𝑣
𝑡 𝑡 𝑡
which again contradicts the choice of 𝑎 not being in the image of 𝑔.
For compact manifolds, one-to-one immersions are the same as embeddings. So we have just
proved the embedding theorem in the compact case.
Note that Whitney’s result does not give us the minimal 𝑁 for an individual manifold. For
example, we know that 𝕊𝑛 is embedded in ℝ𝑛+1 for every 𝑛. The result tells us that, in general,
𝑁 = 2𝑘 + 1 will always work.
Proof: Let {𝑈𝛼 } be the collection of open subsets of 𝑋 that have compact closure, and let
{𝜌𝑖 } be a subordinate partition of unity. Then
∑
∞
𝑝(𝑥) = 𝑖𝜌𝑖 (𝑥)
𝑖=1
is a well-defined smooth function, since, in a neighborhood of every point, it is a finite sum of
smooth functions.
In order to show that 𝑝 is proper, we need to show that the preimage of any compact subset
of ℝ is again compact. Every compact subset 𝐾 ⊂ ℝ is contained in a closed interval of the
form [−𝑗, 𝑗] for some large enough natural number 𝑗. Hence if we can show that 𝑝−1 ([−𝑗, 𝑗])
is compact, then 𝑝−1 (𝐾) is a closed subset of a compact set and therefore also compact.
For a given natural number 𝑗, assume we have 𝜌1 (𝑥) = ⋯ = 𝜌𝑗 (𝑥) = 0 for any 𝑥 ∈ 𝑋.
Then, by definition of a partition of unity, we have
∑
∞
𝜌𝑖 (𝑥) = 1
𝑖=𝑗+1
and therefore
∑
∞
𝑝(𝑥) ≥ (𝑗 + 1) 𝜌𝑖 (𝑥) = 𝑗 + 1 >𝑗.
𝑖=𝑗+1
This shows
⋃
𝑗
−1
𝑝 ([−𝑗, 𝑗]) ⊂ {𝑥 ∈ 𝑋 ∶ 𝜌𝑖 (𝑥) ≠ 0}.
𝑖=1
Since {𝜌𝑖 } is a partition of unity subordinate to {𝑈𝛼 }, we have supp(𝜌𝑖 ) ⊂ 𝑈𝑖 . Since 𝑈𝑖 has
compact closure and the finite union of compact sets is compact, this shows that 𝑝−1 ([−𝑗, 𝑗])
is a closed subset in a compact set and therefore it is also compact.
Recall that the strongest result is that 𝑁 = 2𝑘 suffices. But this is much harder to prove.
Note also that, for many manifolds, an even lower dimension suffices, e.g., 𝕊𝑛 ⊂ ℝ𝑛+1 the
𝑛-sphere is embedded in ℝ𝑛+1 .
Since 2𝑘 + 2 > 2𝑘 + 1, we can apply the argument from the proof of the previous theorem
by Whitney and find a nonzero vector 𝑎 ∈ ℝ2𝑘+2 such that
𝜋◦𝐹 ∶ 𝑋 → 𝐻
is still an injective immersion, where 𝜋 is the projection onto the orthogonal complement
𝐻 = {𝑏 ∈ ℝ2𝑘+2 ∶ 𝑏 ⟂ 𝑎} of 𝑎 in ℝ2𝑘+2 . By rescaling we can assume |𝑎| = 1. In other words,
we can assume 𝑎 ∈ 𝕊2𝑘+1 .
Since 𝜋◦𝐹 is an injective immersion for almost every 𝑎 ∈ 𝕊2𝑘+1 , we can assume that 𝑎 is
neither the north nor the south pole on 𝕊2𝑘+1 . For if 𝜋◦𝐹 failed to be an injective immersion
on these two points, it would suffice to rotate 𝕊2𝑘+1 a bit (which is a diffeomorphism of 𝕊2𝑘+1 )
in order to avoid north and south pole.
Claim: Given any bound 𝑐, there exists another number 𝑑 such that
is a compact subset of 𝑋. Thus the preimage under 𝜋◦𝐹 of every closed ball in 𝐻 is a compact
subset of 𝑋. Since every compact subset 𝐾 of 𝐻 is a closed subset of some closed ball in 𝑋,
this shows that (𝜋◦𝐹 )−1 (𝐾) is a closed subset of a compact subset in 𝑋 and therefore also
compact.
Proof of the claim: If the claim was false, then there exists a 𝑐 and a sequence of points
{𝑥𝑖 } in 𝑋 for which
But each 𝑤𝑖 is a multiple of 𝑎. Hence the limit of the 𝑤𝑖 must be a multiple of 𝑎 as well.
We conclude that 𝑎 must be either the north or south pole of 𝕊𝑘+1 which contradicts our
assumption on 𝑎. This proves the claim and finishes the proof of the theorem.
Now we return to partitions of unity and prove that they exist on smooth manifolds. Before we
can start the proof, we need some further preparation:
Lemma 9.34 (Separating closed subsets) Let 𝐴 and 𝐶 be disjoint closed subsets in
ℝ𝑁 . Then there are disjoint open subsets 𝑈 and 𝑉 such that 𝐴 ⊂ 𝑈 and 𝐶 ⊂ 𝑉 .
Proof: For each 𝑎 ∈ 𝐴, choose an 𝜀𝑎 > 0 such that 𝐵2𝜀𝑎 (𝑎) ∩ 𝐶 = ∅. This is possible since
𝐶 is closed. Similarly, for each 𝑐 ∈ 𝐶, choose an 𝜀𝑐 > 0 such that 𝐵2𝜀𝑐 (𝑐) ∩ 𝐴 = ∅. We define
For, if 𝑥 ∈ 𝑈 ∩ 𝑉 , then
𝑥 ∈ 𝐵𝜀𝑎 (𝑎) ∩ 𝐵𝜀𝑐 (𝑐)
for some 𝑎 ∈ 𝐴 and 𝑐 ∈ 𝐶. By the triangle inequality, this implies
|𝑎 − 𝑐| < 𝜀𝑎 + 𝜀𝑐 .
But, if 𝜀𝑎 ≤ 𝜀𝑐 , then |𝑎 − 𝑐| < 2𝜀𝑐 and 𝑎 ∈ 𝐵2𝜀𝑐 (𝑐). And, if 𝜀𝑐 ≤ 𝜀𝑎 , then |𝑎 − 𝑐| < 2𝜀𝑎 and
𝑐 ∈ 𝐵2𝜀𝑎 (𝑎). Both cases are impossible.
Another important tool that we will need are smooth bump functions which we introduced
in Section 8.2. Now we will need them in a slightly more interesting form:
178 9.5. Embedding Manifolds in Euclidean Space
Finally, for any fixed point 𝑎 ∈ ℝ𝑁 and for any given 𝑟 > 0, we define
ℎ𝑟𝜀 ∶ ℝ𝑁 → ℝ, ℎ𝑟𝜀 (𝑥) = 1 − 𝑔𝜀 (|𝑥 − 𝑎| − 𝑟).
Now let 𝑈 ⊂ ℝ𝑁 be open and 𝐾 ⊂ 𝑈 be compact. For this general situation we need to
work a bit harder and rearrange the argument as follows:
This is a smooth map with ∫ℝ𝑁 𝜓𝑑𝑥 = 1 where 𝑑𝑥 denotes the standard Lebesgue measure on
ℝ𝑁 .
For a given 𝜀 > 0, define 𝜓𝜀 ∶ 𝑅𝑁 → ℝ by 𝜓𝜀 (𝑥) ∶= 𝜀−𝑁 𝜓(𝑥∕𝜀). This is still a smooth
function with ∫ℝ𝑁 𝜓𝜀 𝑑𝑥 = 1.
Since ℝ𝑁 ⧵ 𝑈 is closed and 𝐾 is compact, we can choose a small 𝜀 > 0 such that, for each
point 𝑥 ∈ 𝐾, we have 𝐵2𝜀 (𝑥) ∩ 𝑈 = ∅. Then the 𝑉 ∶= ∪𝑥∈𝐾 𝐵𝜀 (𝑥) is an open set containing 𝐾
with compact closure 𝑉̄ ⊂ 𝑈 contained in 𝑈 .
𝜑 ∶ ℝ𝑁 → ℝ, 𝑥 → 𝜓𝜀 (𝑥 − 𝑦)𝜒𝑉 (𝑦)𝑑𝑦.
∫ℝ 𝑁
Note that the integral is well-defined, since the closure of 𝑉 , which is the support of 𝜒𝑉 , is
compact.
We are going to show that partitions of unity exist on manifolds step by step with increasing
difficulty. We start with the case of compact subspaces in ℝ𝑁 . Then we are going to transport
this result to compact smooth manifolds. Finally, we discuss arbitrary compact smooth 𝑘-
manifolds. There is no need to restrict to compact manifolds. In fact, partitions of unity exist
on every paracompact topological space (every open cover has a locally finite refinement), a
class of spaces much larger than abstract manifolds.
Let {𝑈𝛼 } be an open cover of 𝑋. Since 𝑋 is compact, {𝑈𝛼 } has a finite subcover {𝑈1 , … , 𝑈𝑛 }.
A partition of unity subordinate to the finite subcover is also a partition of unity subordinate
to the original cover.
180 9.5. Embedding Manifolds in Euclidean Space
Step 1: We are going to show that we can shrink the covering to an open covering {𝑉1 … , 𝑉𝑛 }
such that 𝑉̄𝑖 ⊂ 𝑈𝑖 for each 𝑖.
𝐴 ∶= 𝑋 ⧵ (𝑈2 ∪ ⋯ ∪ 𝑈𝑛 )
Now we proceed by induction as follows: Given open subsets 𝑉1 , … , 𝑉𝑘−1 such that
Then 𝐴𝑘 is a closed subset of 𝑋 which is contained in the open set 𝑈𝑘 . Choose an open subset
𝑉𝑘 containing 𝐴𝑘 such that 𝑉̄𝑘 ⊂ 𝑈𝑘 . Then {𝑉1 , … , 𝑉𝑘−1 , 𝑉𝑘 , 𝑈𝑘+1 , … , 𝑈𝑛 } covers 𝑋. At the
𝑛th step of the induction we are done.
Step 2: Given the open covering {𝑈1 , … , 𝑈𝑛 } of 𝑋, we use Step 1 to choose an open cover
{𝑉1 , … , 𝑉𝑛 } of 𝑋 such that 𝑉̄𝑖 ⊂ 𝑈𝑖 for each 𝑖. Then we repeat this process and choose an
open cover {𝑊1 , … , 𝑊𝑛 } of 𝑋 such that 𝑊̄ 𝑖 ⊂ 𝑉𝑖 for each 𝑖.
𝑖 (ℝ ⧵ {0}) ⊂ 𝑉𝑖 , we have
Since 𝜑−1
supp(𝜑𝑖 ) ⊂ 𝑉̄𝑖 ⊂ 𝑈𝑖 .
Note: Here is the point where we see why we need to apply Step 1 twice: If we were working
with the 𝑉𝑖 ’s instead of 𝑊𝑖 ’s, then we would have supp(𝜑) ⊂ 𝑈̄ 𝑖 instead of supp(𝜑) ⊂ 𝑈𝑖 as
required for a partition subordinate to the cover {𝑈𝑖 }.
∑
𝑛
𝜑(𝑥) ∶= 𝜑𝑖 (𝑥) > 0 for all 𝑥 ∈ 𝑋.
𝑖=1
For each 𝑥 ∈ 𝑋, there is an 𝑖 such that 𝑥 ∈ 𝑋𝑖 and hence 𝜑𝑗𝛼 (𝑥) = 0 for all 𝑗 ≥ 𝑖 + 2. Hence,
for each 𝑥 ∈ 𝑋, the sum
∑
𝜑(𝑥) ∶= 𝜑𝑖𝛼 (𝑥)
𝛼,𝑖
Define subsets
∙ each 𝑋𝑖 is compact, since it is the intersection 𝑋 ∩ 𝐵𝑖 (0) ∩ (𝑋 ⧵ (∪𝑝∈ℝ𝑁 ⧵𝑋 𝐵1∕𝑖 (𝑝)) and
therefore closed and bounded in ℝ𝑁 ;
∙ 𝑋 = 𝑋1 ∪ 𝑋2 ∪ ⋯.
Let {𝑈𝛼 } be an open cover of 𝑋. By the definition of the subspace topology on 𝑋, for each
𝛼, there is a subset 𝑉𝛼 ⊂ ℝ𝑁 open in ℝ𝑁 such that 𝑈𝛼 = 𝑋 ∩ 𝑉𝛼 . Let 𝑌 be the union of all the
𝑉𝛼 in ℝ𝑁 . By the third case, there is a partition of unity on 𝑌 subordinate to the open cover
{𝑉𝛼 }. This is also a partition of unity on 𝑋 subordinate to the open cover {𝑈𝛼 }.
Let {𝑉𝛼 } be an open over of 𝑋. By intersecting with the domains of charts on 𝑋, we get a
refinement of the cover. Hence we can assume that 𝑉𝛼 are the domains of charts on 𝑋. Since
182 9.5. Embedding Manifolds in Euclidean Space
𝑋 is compact, the domains of finitely many charts on 𝑋 suffice to cover 𝑋. Let us label them
(𝑉1 , 𝜙1 ), … , (𝑉𝑛 , 𝜙𝑛 ). Then each 𝑈𝑖 = 𝜙𝑖 (𝑉𝑖 ) is an open subset in ℝ𝑘 .
Now we can proceed exactly as in the case of a compact subspace in ℝ𝑁 for the finite
cover {𝑈1 , … , 𝑈𝑛 } of the space 𝑌 ∶= 𝑈1 ∪ ⋯ ∪ 𝑈𝑛 ⊂ ℝ𝑘 . This yields a partition of unity
{𝜌𝑖 } subordinate to the cover {𝑈1 , … , 𝑈𝑛 }. Composition of each 𝜌𝑖 with 𝜙𝑖 yields a partition
of unity {𝜌𝑖 ◦𝜙𝑖 } on 𝑋 subordinate to the cover {𝑉1 , … , 𝑉𝑛 }.
Chapter 9. Abstract Smooth Manifolds 183
9.6 Exercises and more examples
Exercise 9.1 Let 𝑋 be the set of all straight lines in ℝ2 (not just lines through the
origin).
(a) Show that 𝑋 is an abstract smooth 2-manifold by showing that we can identify 𝑋
with an open subset of the real projective plane ℝP2 .a
(b) Show that there is a bijection between 𝑋 and the set of equivalence classes
(𝕊1 × ℝ)∕ ∼
Exercise 9.2 Recall the Hopf map 𝜋 that we have seen previously: We consider 𝕊3 as
a subset of ℂ2 , i.e., 𝕊3 = {(𝑧, 𝑤) ∈ ℂ2 ∶ |𝑧|2 + |𝑤|2 = 1}, and 𝕊2 as a subset of ℂ × ℝ,
i.e., 𝕊2 = {(𝑧, 𝑥) ∈ ℂ × ℝ ∶ |𝑧|2 + 𝑥2 = 1}. Then the Hopf map 𝜋 is the map 𝕊3 → 𝕊2
given by ( )
𝜋(𝑧, 𝑤) = 2𝑧𝑤,̄ |𝑧|2 − |𝑤|2 .
In this exercise we study another way to define the Hopf map:
Let ℂP1 denote the complex projective space consisting of all complex-one-
dimensional linear subspaces in ℂ2 . We can consider ℂP1 as the set of pairs (𝑧, 𝑤) ≠
(0, 0) of complex numbers modulo the equivalence relation
(𝑧0 , 𝑤0 ) ∼ (𝑧1 , 𝑤1 ) ⇐⇒ 𝑧1 = 𝜆𝑧0 and 𝑤1 = 𝜆𝑤0 for some 𝜆 ∈ ℂ ⧵ {0}.
We consider ℂP1 as a topological space as the quotient (ℂ2 ⧵ {(0, 0)})∕ ∼ and write
[𝑧 ∶ 𝑤] for the equivalence class of (𝑧, 𝑤).
(a) Show that ℂP1 is a two-dimensional abstract smooth manifold.
Hint: Follow the outline for ℝP𝑛 .
(b) We consider 𝕊3 as a subset in ℂ2 . Show that ℂP1 also can be defined as a quotient
ℂP1 = 𝕊3 ∕ ∼𝑠 by an appropriate equivalence relation ∼𝑠 .
(c) We consider 𝕊3 as a subset in ℂ2 and define the map
𝜑 ∶ 𝕊3 → ℂP1 , (𝑧, 𝑤) → [𝑧 ∶ 𝑤].
Find a map ℎ ∶ ℂP1 → 𝕊2 such that the composition
𝜑 ℎ
𝕊3 ←←←→
← ℂP1 ←←→
← 𝕊2
equals 𝜋.
Hint: Think of the stereographic projection.
(c) Let [𝑧0 , 𝑤0 ] be a point in ℂP1 . Describe the fiber 𝜑−1 ([𝑧0 ∶ 𝑤0 ]).
184 9.6. Exercises and more examples
In the next exercise we let a group act on a space. If you are not familiar with group actions
yet, you may want to skip this exercise or first read about group actions in a textbook of your
choice.
(𝑧1 , … , 𝑧𝑛 ) → (𝜆𝑘 𝑧1 , … , 𝜆𝑘 𝑧𝑛 ).
(d) Show that there is a homeomorphism between 𝐻 4 and 𝕊3 × 𝕊1 . Then show that
this actually a diffeomorphism by showing that the compositions with coordinate
charts are smooth.d
Note: The Hopf surface 𝐻𝜆4 is the simplest example of a compact complex manifold
which cannot be equipped with a Kähler metric and cannot be holomorphically embedded
into complex projective space. This follows from that fact that 𝐻𝜆4 ≅ 𝕊3 × 𝕊1 implies
that the first cohomology of a Hopf surface is one-dimensional, whereas every Kähler
manifold must have an even-dimensional first cohomology. This is a first glimpse at the
exciting theory of complex geometry.
a
Once we have learned more general theory, we can say that this follows from the fact that the action of
ℤ on ℂ𝑛 ⧵ {0} is free and discrete.
b
Note that 𝐻 2 is called the Hopf curve, since it is usually considered as a complex manifold of complex
dimension one.
c
Note that 𝐻 4 is called the Hopf surface, since it is usually considered as a complex manifold of complex
dimension two.
d
In general, 𝐻 2𝑛 is diffeomorphic to 𝕊2𝑛−1 × 𝕊1 . The previous point for 𝑛 = 1 is a special case of this
fact, since 𝐴∕ℤ ≅ 𝕊1 × 𝕊1 .
Chapter 9. Abstract Smooth Manifolds 185
Exercise 9.4 For 0 < 𝑚 ≤ 𝑛, the real Milnor hypersurface 𝐻(𝑚, 𝑛) is defined by
{ }
∑
𝑚
𝐻(𝑚, 𝑛) = ([𝑥0 ∶ … ∶ 𝑥𝑚 ], [𝑦0 ∶ … ∶ 𝑦𝑛 ]) ∈ ℝP𝑚 × ℝP𝑛 ∶ 𝑥𝑖 𝑦𝑖 = 0 .
𝑖=0
Before we introduce the main new definition of this chapter, we look at an interesting and prob-
lem we like to solve.
Consider the two smooth manifolds 𝕊2 and ℝP2 , real projective 2-space. We have seen that
both are smooth manifolds of dimension 2, both are connected and compact. So one might
wonder if there is any topological feature that distinguishes these two spaces. In other words,
one might even wonder: are they homeomorphic or even diffeomorphic?
∙ (A new challenge) The answer is no: 𝕊2 and ℝP2 are not homeomorphic and
therefore not diffeomorphic. But how can we prove this?
To attack this problem, we need a new idea. For example, we could study the homotopy
classes of loops on 𝕊2 and ℝP2 , i.e., of continuous maps 𝕊1 → 𝑋 for 𝑋 being either 𝕊2 or ℝP2 .
If these two spaces were homeomorphic, the sets of of such homotopy classes of loops would
be the same. But we can show that the sets we get for 𝕊2 and ℝP2 , respectively, are different
and hence 𝕊2 ≉ ℝP2 :
Let us see how such a map 𝑓̄ ∶ 𝕊1 → ℝP2 could look like: We define 𝑓̄ via the commutative
diagram
𝑓
[0, 2𝜋] / 𝕊2
𝜋
𝕊1 / ℝP2 = 𝕊2 ∕ ∼
𝑓̄
where 𝜋 is the quotient map, the left vertical map sends 𝑡 to (cos 𝑡, sin 𝑡) ∈ 𝕊1 and 𝑓 is defined
by
𝑓 ∶ [0, 2𝜋] → 𝕊2 , 𝑡 → (cos(𝑡∕2), sin(𝑡∕2), 0).
Since (cos 0, sin 0, 0) = (−1) ⋅ (cos 𝜋, sin 𝜋, 0), the map 𝜋◦𝑓 induces a continuous map
𝑓̄ ∶ [0, 2𝜋]∕(0 ∼ 2𝜋) = 𝕊1 → ℝP2 .
186
Chapter 10. Manifolds with Boundary 187
We can even show that 𝑓̄ is smooth: We show this locally for the induced map on local
coordinate charts. On ℝP2 we use the charts we defined previously. Then we can check:
∙ For the charts (0, 𝜋) → 𝕊1 and (𝜋, 2𝜋) → 𝕊1 defined by 𝑡 → (cos 𝑡, sin 𝑡), 𝑓̄ induces the
map ( )
cos(𝑡∕2)
𝑡 → ,0 .
sin(𝑡∕2)
∙ For the charts (𝜋∕2, 3𝜋∕2) → 𝕊1 and (3𝜋∕2, 5𝜋∕2) → 𝕊1 defined by 𝑡 → (cos 𝑡, sin 𝑡), 𝑓̄
induces the map ( )
sin(𝑡∕2)
𝑡 → ,0 .
cos(𝑡∕2)
All of these maps are smooth and hence 𝑓̄ is smooth. We admit that we cheat here a bit and
use a bit of the abstract manifold language. We hope this is OK for this motivating story.
But now we need to answer the question: How do we show that 𝑓̄ is not homotopic to a
constant map?
In order to obtain such a theory of intersections, we already have many tools available, for
example, homotopy and transversality. But we still lack one important piece of the puzzle:
manifolds with boundary. For allowing manifolds to have a boundary will make it possible to
include, in particular, the closed interval [0, 1] that we use in the definition of homotopies in
the category of manifolds.
In order to be able to analyse a wider class of phenomena we would like to enlarge the class of
manifolds. A typical example which we would like to include is the domain of a homotopy
𝑋 × [0, 1] for a smooth 𝑘-dimensional manifold 𝑋. The points on 𝑋 × {0} and 𝑋 × {1} do not
have an open neighborhood which is diffeomorphic to ℝ𝑘 . Another example is the closed unit
ball in ℝ𝑘 .
So far such guys do not qualify as a manifold. From now on, we would like to allow such
subsets. This sounds like an laborious endeavour, but we will see that most of the theorems we
have proved so far are also valid for manifolds with boundary.
188 10.2. Manifolds with Boundary
The idea for what a manifold with boundary should be is the same as before: it is a space
which locally looks like some model space with boundary which we understand well. Hence
we need to choose a suitable new model space:
Definition 10.1 (New Euclidean model) The standard model of a Euclidean space
with boundary is the half-plane
ℍ𝑘 = {(𝑥1 , … , 𝑥𝑘 ) ∈ ℝ𝑘 ∶ 𝑥𝑘 ≥ 0}
Example 10.3 (The closed unit interval) The unit interval [0, 1] is a one-dimensional
smooth manifold with boundary and the boundary consists of the two endpoints {0, 1}.
We can choose local parametrizations
defined on [0, 1) which is an open subset on ℍ1 = [0, ∞) ⊂ ℝ1 . The interior is the open
interval (0, 1).
Let 𝑋 be a manifold with boundary. We need to check that our definition of points in the
interior and on the boundary is independent of the choice of a local parametrization.
Figure 10.1: There are two types of open balls in ℍ𝑘 . The ones where all points satisfy 𝑥𝑘 > 0
are well-known. The ones which allow 𝑥𝑘 = 0 are new.
an open subset of ℝ𝑘 . Now assume 𝑥 is also in the image of another local parametrization
𝜙′ ∶ 𝑈 ′ → 𝑉 ′ ⊂ 𝑋. Then 𝑥 ∈ 𝑊 ∶= 𝑉 ∩ 𝑉 ′ ⊂ 𝑋, and the composition 𝜙′ ◦𝜙−1 ∶ 𝜙−1 (𝑊 ) →
(𝜙′ )−1 (𝑊 ) is a diffeomorphism. Hence, after possibly shrinking 𝑈 ′ , we see that 𝑈 ′ is also an
open subset in ℝ𝑘 . Thus 𝑥 is being an interior point is well-defined.
This shows in particular: if 𝑋 is a manifold with boundary, then the interior of 𝑋, Int(𝑋),
is a boundaryless manifold of the same dimension as 𝑋.
It remains to show that being a boundary point is also well-defined. We show this by
proving the following interest result:
Proof: Let 𝑥 ∈ 𝑋 and let 𝜙 and 𝜓 be two local parametrizations around 𝑥. After possibly
shrinking the domains and codomains, we can assume that 𝜙 ∶ 𝑈 → 𝑉 and 𝜓 ∶ 𝑊 → 𝑉 are
both diffeomorphisms from open sets 𝑈 ⊂ ℍ𝑘 , 𝑊 ⊂ ℍ𝑘 to the same open subset 𝑉 ⊂ 𝑋.
We would like to show 𝜙(𝜕𝑈 ) = 𝜓(𝜕𝑊 ). For then 𝜕𝑉 = 𝜙(𝜕𝑈 ) is independent of our
choice of local parametrization and therefore well-defined. Moreover, since 𝜕𝑈 = 𝑈 ∩𝜕ℍ𝑘 is an
open subset of ℝ𝑘−1 , we would get that every point 𝑦 ∈ 𝜕𝑋 is contained in a local parametriza-
tion 𝜙|𝜕𝑈 ∶ 𝑈 ∩ 𝜕ℍ𝑘 → 𝜕𝑋. This will show that 𝜕𝑋 is a manifold of dimension 𝑘 − 1.
190 10.2. Manifolds with Boundary
By our assumption on 𝜙 and 𝜓, it suffices to show 𝜓(𝜕𝑊 ) ⊂ 𝜙(𝜕𝑈 ). The other inclusion
will follow by symmetry. Hence we would like to show:
Suppose that the claim is false and there is a point 𝑤 ∈ 𝜕𝑊 which is mapped to an interior
point 𝑢 = 𝑔(𝑤) of 𝑈 by 𝑔. Since both 𝜙 and 𝜓 are diffeomorphisms, 𝑔 is a diffeomorphism
of 𝑊 onto an open subset 𝑔(𝑊 ) of 𝑈 . The chain rule implies that the derivative 𝑑(𝑔 −1 )𝑢 of its
inverse is bijective. But, since 𝑢 ∈ Int(𝑈 ), 𝑔(𝑊 ) contains a neighborhood of 𝑢 that is open in
ℝ𝑘 . Thus the Inverse Function Theorem, applied to the map 𝑔 −1 defined on this open subset
of ℝ𝑘 , implies that the image of 𝑔 −1 contains a neighborhood of 𝑤 that is open in ℝ𝑘 . This
contradicts the assumption 𝑤 ∈ 𝜕𝑊 .
Tangent spaces and derivatives are still defined in the setting of manifolds with boundary.
Derivatives of smooth maps can be defined as before: Since smoothness at a point requires
a functions to be defined on open neighborhhod around that point, we need to be a bit more
careful at boundary points:
∙ Derivatives on ℍ𝑘
So let 𝐺′ be another local extension of 𝑔. We need to show 𝑑𝐺𝑢′ = 𝑑𝐺𝑢 . The equality of
the two derivatives is no problem at points in the interior int(𝑈 ) of 𝑈 , because then we have
a small open neighborhood which is still in int(𝑈 ). We are going to use this and approximate
𝑢 be a sequence {𝑢𝑖 } of interior points 𝑢𝑖 ∈ int(𝑈 ) which converge to 𝑢.
Since the derivative of a smooth map at a point depends continuously on changing the point,
this implies that 𝑑𝐺𝑢𝑖 → 𝑑𝐺𝑢 and 𝑑𝐺𝑢′ → 𝑑𝐺𝑢′ when 𝑢𝑖 → 𝑢 and both limits agree. Hence
𝑖
𝑑𝑔𝑢 is well-defined at boundary points too.
Note: It is important to observe that, at all points, 𝑑𝑔𝑢 is still a linear map of all of ℝ𝑘 to
ℝ𝑙 . For we have defined 𝑑𝑔𝑢 as the derivative 𝑑𝐺𝑢 of an extension 𝐺 to an open subset of ℝ𝑘 .
∙ Tangent spaces
Chapter 10. Manifolds with Boundary 191
Let 𝑋 ⊂ ℝ𝑁 be a smooth manifold with boundary, and 𝑥 ∈ 𝑋. Let 𝜙 ∶ 𝑈 → 𝑋 be a local
parametrization with 𝑈 ⊂ ℍ𝑘 open. Let 𝑢 ∈ 𝑈 be the point with 𝜙(𝑢) = 𝑥. Note that we cannot
assume 𝑢 = 0 when 𝑥 is an interior point. Then we have just learned that we can form the
derivative
𝑑𝜙𝑢 ∶ ℝ𝑘 → ℝ𝑁
no matter what kind of point 𝑥 is.
It follows again from the Chain Rule that 𝑇𝑥 (𝑋) does not depend as a subspace of ℝ𝑁 on
the choice of 𝜙.
as the unique linear map which makes the following diagram commutative
𝑑𝑓𝑥
𝑇𝑥 (𝑋) / 𝑇 (𝑌 )
O 𝑦
O
𝑑𝜙𝑢 𝑑𝜓𝑣
ℝ𝑘 / ℝ𝑙 .
𝑑𝜃𝑢
We see that things work out nicely so far for manifolds with boundaries. But:
Example 10.6 (Square is not a manifold) The square 𝑆 = [0, 1] × [0, 1] is not a
smooth manifold with boundary.
For, suppose 𝑆 was a smooth manifold with boundary. Then the corner 𝑠 = (0, 0)
has an open neighborhood 𝑉 ⊂ 𝑆 and there is a diffeomorphism 𝑓 ∶ 𝑈 → 𝑉 to an open
𝑈 ⊂ ℍ2 such that 𝑓 (𝜕𝑉 ) ⊂ 𝜕ℍ2 .
After shrinking 𝑉 if necessary, let 𝐹 ∶ 𝑉̃ → ℝ2 be a smooth extension of 𝑓 on a
subset 𝑉 ⊂ 𝑉̃ ⊂ ℝ2 open in ℝ2 . Then the derivative 𝑑𝐹𝑠 is an isomorphism, since 𝑓 is
a diffeomorphism. Writing 𝐹 = (𝐹1 , 𝐹2 ) we have 𝐹2 (𝑥, 0) = 0 = 𝐹2 (0, 𝑦) for any (𝑥, 0)
and (0, 𝑦) in 𝑈 , since 𝜕𝑉 is mapped to 𝜕ℍ2 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑦 = 0}. Thus taking partial
192 10.2. Manifolds with Boundary
derivatives yields
𝜕𝐹2 𝜕𝐹
(𝑠) = 0, and 2 (𝑠) = 0.
𝜕𝑥 𝜕𝑦
But this implies that 𝑑𝐹𝑠 (𝑒1 ) and 𝑑𝐹𝑠 (𝑒2 ) both lie in ℝ × {0} ⊂ ℝ2 , where 𝑒1 and 𝑒2
denote the first and second standard basis vector in ℝ2 , respectively. In particular, 𝑑𝐹𝑠 (𝑒1 )
and 𝑑𝐹𝑠 (𝑒2 ) are linearly dependent. This contradicts that 𝑑𝐹2 is an isomorphism and
therefore the existence of the diffeomorphism 𝑓 .
Lemma 10.7 (Products and Boundaries) The product of a manifold without bound-
ary 𝑋 and a manifold with boundary 𝑌 is a manifold with boundary. Furthermore,
𝜕(𝑋 × 𝑌 ) = 𝑋 × 𝜕𝑌 ,
and
𝑈 × 𝑉 ⊂ ℝ𝑘 × ℍ𝑙 = ℍ𝑘+𝑙
Example 10.8 (Closed cylinder) The closed cylinder 𝕊1 × [0, 1] is a manifold with
boundary, see Figure 10.2, and the boundary consists of
𝕊1 × {0} ∪ 𝕊1 × {1}.
One of the most important concepts we have studied is transversality of smooth maps to
submanifolds. We would like to extend this to manifolds with boundary. This is possible, but
requires some care.
We start with the special case of regular values for functions on manifolds without boundary.
This is a well-known case, but it turns out that it actually produces manifolds with boundary as
follows:
Chapter 10. Manifolds with Boundary 193
Figure 10.2: A cylinder has the two outer circles as its boundary.
Lemma 10.10 (Regular values for real-valued functions) Suppose that 𝑆 is a mani-
fold without boundary and that 𝜋 ∶ 𝑆 → ℝ is a smooth function with regular value
0. Then the subset {𝑠 ∈ 𝑆 ∶ 𝜋(𝑠) ≥ 0} is a manifold with boundary, and the boundary
is 𝜋 −1 (0).
Proof: The set {𝑥 ∈ 𝑆 ∶ 𝜋(𝑥) > 0} is open in 𝑆, since it is the preimage of the open
subset (0, ∞) ⊂ ℝ under the continuous map 𝜋. It is therefore a submanifold of the same
dimension as 𝑆. Hence every point in {𝑥 ∈ 𝑆 ∶ 𝜋(𝑥) > 0} has an open neighborhood which
is diffeomorphic to an open subset of ℝ𝑘 , 𝑘 = dim 𝑆. It remains to study the preimage of 0.
Example 10.11 (Spheres are boundaries) Let 𝜋 be the smooth function defined by
∑
𝜋 ∶ ℝ𝑘 → ℝ, (𝑥1 , … , 𝑥𝑘 ) → 1 − 𝑥2𝑖 .
𝑖
Then 0 is a regular value of 𝜋, and the closed unit ball 𝔻𝑘 in ℝ𝑘 can be described as
𝔻𝑘 = {𝑥 ∈ ℝ𝑘 ∶ 𝜋(𝑥) ≥ 0}.
The boundary of 𝔻𝑘 is the (𝑘 − 1)-sphere 𝕊𝑘−1 = 𝜋 −1 (0).
194 10.2. Manifolds with Boundary
10.2.4 Towards transversality
Now we try to enter the next level: Recall that transversality is formulated as a criterion on tan-
gent spaces and derivatives. We would like to formulate a similar criterion for maps between
manifolds with boundary.
It turns out that it is not enough to ask that 𝑓 is transversal to 𝑍 in the previous sense, i.e.,
Im (𝑑𝑓𝑥 ) + 𝑇𝑓 (𝑥) (𝑍) = 𝑇𝑓 (𝑥) (𝑌 ).
𝜋 ∶ ℍ2 → ℝ, (𝑥1 , 𝑥2 ) → 𝑥2
our previous condition for transversality is not sufficient to guarantee that equation (10.1)
is satisfied:
For, the derivative 𝑑𝜋(𝑥1 ,𝑥2 ) ∶ ℝ2 → ℝ is just the projection onto the second factor.
Hence it is surjective at every point (𝑥1 , 𝑥2 ). In particular, 0 is a regular value fo 𝜋.
Let 𝑍 ∶= {0}. Then
𝜋 −1 (𝑍) = {(𝑥1 , 𝑥2 ) ∈ ℝ2 ∶ 𝑥2 = 0} = 𝜕ℍ2 .
Since 0 is regular value, we know that 𝜋 −1 (𝑍) is a submanifold. The problem is that the
boundary does not satisfy condition (10.1):
𝜕𝜋 −1 (𝑍) = ∅, whereas 𝜋 −1 (𝑍) ∩ 𝜕𝑋 = 𝜕ℍ2 ≠ ∅.
Chapter 10. Manifolds with Boundary 195
In order to make sure that the boundary behaves well, we need to impose an additional
transversality condition on 𝜕𝑓 .
Theorem 10.14 (With boundary: fibers of regular values) Let 𝑔 be a smooth map of
a 𝑘-manifold 𝑋 with boundary onto a boundaryless 𝑛-manifold 𝑌 , and suppose that
𝑦 ∈ 𝑌 is a regular value for both 𝑔 ∶ 𝑋 → 𝑌 and 𝜕𝑔 ∶ 𝜕𝑋 → 𝑌 . Then the preimage
𝑔 −1 (𝑦) is a (𝑘 − 𝑛)-dimensional manifold with boundary and
Proof: To show that 𝑔 −1 (𝑦) is a manifold with boundary is a local question, i.e., it suffices
that each point in 𝑔 −1 (𝑦) has an open neighborhood which is a manifold with boundary. So
let 𝑥 ∈ 𝑋 be a point with 𝑔(𝑥) = 𝑦. After choosing local coordinates, we can assume that 𝑔 is
a map of the form
𝑔 ∶ ℍ𝑘 → ℝ𝑛 .
So let us look at what happens if 𝑥 ∈ 𝜕𝑋. That 𝑔 is smooth at 𝑥 means by definition that
there is an open subset 𝑈 ⊂ ℝ𝑘 and a smooth map
After possibly replacing 𝑈 with a smaller subset, we can assume that 𝐺 has no critical
points in 𝑈 . Then 𝐺−1 (𝑦) is a smooth manifold by the Preimage Theorem 4.6 for boundaryless
manifolds. We need to show that
In order to show this, we define a new smooth function 𝜋 on the manifold 𝑆 ∶= 𝐺−1 (𝑦)
𝜋 ∶ 𝑆 → ℝ, (𝑥1 , … , 𝑥𝑘 ) → 𝑥𝑘
𝑆 ∩ ℍ𝑘 = {𝑠 ∈ 𝑆 ∶ 𝜋(𝑠) ≥ 0}.
If we can show the claim, then our previous lemma shows that 𝑆 ∩ ℍ𝑘 is a manifold with
boundary and the boundary is 𝜋 −1 (0).
To show the claim, assume there was an 𝑠 ∈ 𝑆 with both 𝜋(𝑠) = 0, i.e. 𝑠 ∈ 𝑆 ∩ 𝜕ℍ𝑘 , and
𝑑𝜋𝑠 = 0. We want to show that the assumption 𝑑𝜋𝑠 = 0 leads to a contradiction.
To do so, first note that 𝜋 is a linear map, and therefore 𝑑𝜋𝑠 = 𝜋. Thus,
𝑑𝜋𝑠 = 𝜋 ∶ 𝑇𝑠 (𝑆) → ℝ
196 10.2. Manifolds with Boundary
being trivial, just means that the last coordinate of every vector in 𝑇𝑠 (𝑋) is 0, i.e.
Now, finally, we apply the assumption of regularity of 𝑦. Since 𝑦 is a regular value of both
𝑔 and 𝜕𝑔, we know that both 𝑑𝑔𝑠 and 𝑑(𝜕𝑔)𝑠 are surjective. This implies
This contradicts assertion (10.2) about the kernels when Ker (𝑑𝑔𝑠 ) ⊂ ℝ𝑘−1 . Thus this
assumption must be false, i.e.,
and hence 𝑑𝜋𝑠 ≠ 0 and therefore 𝑑𝜋𝑠 is surjective. In other words, 0 is a regular value.
But the transversality assumptions of 𝑓 and 𝜕𝑓 with respect to 𝑍 imply the 0 is a regular
value of 𝑔. Hence we can apply the previous theorem and we are done.
Theorem 10.16 (Sard’s Theorem with boundary) For any smooth map 𝑓 ∶ 𝑋 → 𝑌
of a manifold 𝑋 with boundary to a manifold 𝑌 without a boundary, almost every
point of 𝑌 is a regular value of both 𝑓 and 𝜕𝑓 .
10.3 One-Manifolds
The following theorem gives us a complete list of smooth one-dimensional manifolds. Note that
in general, since every manifold is the disjoint union of its connected components, it suffices to
classify connected manifold.
Before we prove the theorem, we begin with the rough idea why it should be true.
(a): Assume 𝑋 is a nonempty, compact, connected 1-manifold. Each point has a neighbor-
hood diffeomorphic to (−1, 1). By compactness, finitely many such neighborhoods 𝑈1 , … , 𝑈𝑛
cover 𝑋. If 𝑛 was equal 1, then 𝑋 ≅ (−1, 1). But an open interval is not compact. Thus, there
must be at least two neighborhoods. Since 𝑋 is connected, these two charts must overlap.
The union of these two intervals has to be either an open interval (if they overlap on one side of
each) or a circle (if they overlap on both sides). But if their union is an open interval, there has
to be another chart, by the compactness of 𝑋. Since there are only finitely many 𝑈𝑖 ’s, we must
eventually arrive at the situation where the neighborhoods intersect on both sides and form a
circle. Then one has to use this to construct a diffeomorphism to 𝕊1 .
(c) and (d): When 𝑋 is not compact, we repeat the above processes. The difference is that
the process may not terminate and we end up with open or half-open intervals.
Figure 10.3: We can cover a compact one-manifold by finitely many open subsets looking like
an open interval. Either we get back to the beginning and get a circle or we stop with another
boundary point. This is the heuristic idea for the classification. There are some details to be
straightened out though.
Chapter 10. Manifolds with Boundary 199
10.3.2 Proof of the classification theorem
We will follow Milnor’s proof using arc-lengths given in the appendix of [13]. In this subsection
𝑋 will aways denote a smooth one-manifold.
Note that the definition forces that 𝐼 can have a boundary only if 𝑋 has a boundary. The
proof of the classification theorem relies on the folllowing lemma:
Proof: We write 𝑈 ∶= 𝑓 (𝐼) ∩ 𝑔(𝐽 ) ⊂ 𝑋. Note that 𝑈 is open in 𝑋, since both 𝑓 (𝐼) and
𝑔(𝐽 ) are open. In order to determine the number of components of 𝑈 we define the map
𝑓| (𝑔| )−1
𝜅 ∶ 𝑓 −1 (𝑈 ) ←←←→ ← 𝑔 −1 (𝑈 )
← 𝑈 ←←←←←←←←←→
where 𝑓| and 𝑔| denote the restrictions of 𝑓 and 𝑔, respectively, such that the maps are defined.
By definition of parametrizations by arc-length, 𝜅 is a local diffeomorphism with derivative
equal to ±1 and maps open subset of 𝐼 diffeomorphically onto open subsets of 𝐽 . In particular,
it is an open map. We consider the graph of 𝜅, denoted by Γ ⊂ 𝐼 × 𝐽 , given by
Γ = {(𝑠, 𝑡) ∈ 𝐼 × 𝐽 ∶ 𝑓 (𝑠) = 𝑔(𝑡)}.
Since Γ is the preimage of the diagonal Δ in 𝑋 × 𝑋 under the map (𝑓 , 𝑔) ∶ 𝐼 × 𝐽 → 𝑋 × 𝑋
and since Δ is closed in 𝑋 × 𝑋, Γ is a closed subset of 𝐼 × 𝐽 . Since 𝑑𝜅(𝑠,𝑡) = ±1, 𝑑𝜅(𝑠,𝑡) takes
values in the discrete set {±1}. Hence 𝑑𝜅(𝑠,𝑡) is locally constant and is therefore constant on
each connected component of Γ. Thus, thinking of 𝐼 × 𝐽 as a rectangle with sides 𝐼 and 𝐽 , Γ
consists of line segments of slope either +1 or −1. We need to understand how many such line
segmens there are. Note that the image of 𝜅 equals the projection of the components of Γ to 𝐽 .
Now let 𝐾 be a connected component of Γ.
Proof: Since Γ is closed, 𝐾 is closed in 𝐼 × 𝐽 as well. Assume that 𝐾 ends in the interior
of 𝐼 × 𝐽 with endpoint 𝑎. Then the projection of 𝐾 onto 𝐽 has endpoint 𝜅(𝑎). But since 𝜅 is a
local diffeomorphism, the image of 𝜅 has to contain an open neighborhood around 𝜅(𝑎). Hence
𝜅(𝑎) cannot be an endpoint of a subinterval of 𝐽 .
Moreover, since 𝜅 is one-to-one, each of the four edges of the rectangle 𝐼 × 𝐽 can contain
at most one endpoint of a component of Γ. Since each component has two endpoints, Γ can
have at most two components. This proves the first assertion in the lemma.
200 10.3. One-Manifolds
Now we show the second by considering the two cases:
∙ Assume Γ has one component: Then 𝜅 has constant slope and Γ is just a linesegment in
the rectangle 𝐼 × 𝐽 . Hence we can extend 𝜅 to a linear map 𝐿 ∶ ℝ𝑡𝑜ℝ. Since both 𝑓 , 𝑔
and 𝐿 are smooth and have equal derivatives, we can glue 𝑓 and 𝑔◦𝐿𝐿−1 (𝐽 ) together to
get an extension
𝐹 ∶ 𝐼 ∪ 𝐿−1 (𝐽 ) → 𝑓 (𝐼) ∪ 𝑔(𝐽 )
which is a parametrization by arc-length of the union 𝑓 (𝐼) ∪ 𝑔(𝐽 ).
∙ Assume Γ has two components: Then all four edges are hit by the components. This
implies that the slope of both components must be the same. Let us assume the slope is
+1. The case with slope −1 is analogous. Let 𝑎 < 𝑑 be the endpoints of 𝐼 and 𝛾 < 𝛽
be the endpoints of 𝐽 . Let 𝛼 ∈ 𝐽 be the starting point and 𝑏 ∈ 𝐼 be the endpoint of the
first component of Γ, and let 𝑐 ∈ 𝐼 be the starting point and 𝛿 ∈ 𝐽 be the endpoint of
the second component. After possibly translating the interval 𝐽 in ℝ we can assume that
𝛾 = 𝑐 and 𝛿 = 𝑑. By the previous arguments, we then have
Figure 10.4: The case when Γ has two components. The slope is the same for both components,
here +1. All four edges of the rectangle 𝐼 × 𝐽 are hit by Γ.
If the open set 𝑓 (𝐼) ⊂ 𝑋 was not all of 𝑋, then there wouldbe a limit point 𝑥 of 𝑓 (𝐼) in
𝑋 ⧵ 𝑓 (𝐼). Then we can parametrize a neighborhood of 𝑥 by arc-length. The previous lemma
then implies that 𝑓 can be extended over a larger interval. This contradicts the assumption
that 𝑓 is maximal. The assertion of the theorem now follows from the comparison with the
different types of intervals in ℝ.
At least as interesting as the theorem are its consequences which are surprisingly rich. We
will begin to study them next.
Proof: Every compact one-manifold with boundary 𝑋 is the disjoint union of finitely many
connected components. Each component is diffeomorphic to a copy of [0, 1]. Hence the
boundary of each component consists of two points. The boundary of 𝑋 consists of these
finitely many pairs of points.
𝑍 _
id /𝑍
>
𝑓
𝑋
There is an important restriction for the existence of such retractions for manifolds with
boundary:
Lemma 10.22 (No retraction onto boundary) If 𝑋 is any compact manifold with
boundary, then there is no retraction of 𝑋 onto its boundary.
Moreover, the codimension of 𝑔 −1 (𝑧) in 𝑋 equals the codimension of {𝑧} in 𝜕𝑋, namely
dim 𝑋 −1 as {𝑧} has dimension 0. Hence 𝑔 −1 (𝑧) is one-dimensional. Since it is a closed subset
in the compact manifold 𝑋, it is also compact.
(𝜕𝑔)−1 (𝑧) = (𝑔|𝜕𝑋 )−1 (𝑧) = (𝑔◦𝑖)−1 (𝑧) = 𝑖−1 (𝑔(𝑧)) = 𝑔 −1 (𝑧) ∩ 𝜕𝑋
Since 𝜕𝑔 = Id𝜕𝑋 ,
This contradicts Lemma 10.20 that the boundary 𝜕(𝑔 −1 (𝑧)) of the compact one-dimensional
manifold 𝑔 −1 (𝑧) consists of an even number of points.
Theorem 10.23 (Brouwer Fixed Point Theorem for smooth maps) Let 𝑓 ∶ 𝔻𝑛 → 𝔻𝑛
be a smooth map of the closed unit ball 𝔻𝑛 = {𝑥 ∈ ℝ𝑛 ∶ |𝑥| ≤ 1} ⊂ ℝ𝑛 into itself.
Then 𝑓 has a fixed point, i.e., there is an 𝑥 ∈ 𝐷𝑛 with 𝑓 (𝑥) = 𝑥.
Before we prove the theorem, let us have a look at dimension one, where the result is very
familiar:
Remark 10.24 (Familiar in dimension one) Note that we have seen this theorem for
𝑛 = 1 in Calculus 1: Every continuous map 𝑓 ∶ [0, 1] → [0, 1] has a fixed point. For
define the function 𝑔(𝑥) = 𝑓 (𝑥) − 𝑥 which is a continuous map from [0, 1] to itself. We
have 𝑔(0) ≥ 0 and 𝑔(1) ≤ 0, since 𝑓 (0) ≥ 0 and 𝑓 (1) ≤ 1. If 𝑔(0) = 0 or 𝑔(1) = 1, we
are done. And if 𝑔(0) > 0 and 𝑔(1) < 1, then the Intermediate Value Theorem implies
that there is an 𝑥0 ∈ (0, 1) with 𝑔(𝑥0 ) = 0, i.e., 𝑓 (𝑥0 ) = 𝑥0 .
Figure 10.5: In dimension one this is the Intermediate Value Theorem. The graph has to cross
the diagonal which consists of fixed points of 𝑓 .
Suppose that there exists an 𝑓 without fixed points. We will show that such an 𝑓 would
allow us to construct a retraction 𝑔 ∶ 𝔻𝑛 → 𝜕𝔻𝑛 . But, since 𝔻𝑛 is compact, we have just
proved in Lemma 10.22 that such a retraction cannot exist.
So suppose 𝑓 (𝑥) ≠ 𝑥 for all 𝑥 ∈ 𝔻𝑛 . Then, for every 𝑥 ∈ 𝔻𝑛 , the two different points 𝑥
and 𝑓 (𝑥) determine a line. Let 𝑔(𝑥) be the point where the line segment starting at 𝑓 (𝑥) and
passing through 𝑥 hits the boundary 𝜕𝔻𝑛 . This defines a map 𝑔 ∶ 𝔻𝑛 → 𝜕𝔻𝑛 . See Figure 10.6.
So let us describe 𝑔(𝑥) explicitly. As a point on the line from 𝑓 (𝑥) to 𝑥, 𝑔(𝑥) can be written
in the form
𝑥 − 𝑓 (𝑥)
𝑔(𝑥) = 𝑥 + 𝑡𝑣, where 𝑣 ∶=
|𝑥 − 𝑓 (𝑥)|
for some real number 𝑡. Note that, since we assume 𝑥 ≠ 𝑓 (𝑥), the vector 𝑣 is always defined.
In fact, it is the unit vector pointing from 𝑓 (𝑥) to 𝑥. Moreover, since 𝑓 is smooth, 𝑣 depends
smoothly on 𝑥.
204 10.4. Brouwer Fixed Point Theorem
Figure 10.6: We construct 𝑔 by extending the line from 𝑓 (𝑥) to 𝑥 until we hit the boundary.
If this map existed for all 𝑥, then we would have found a retraction. That is a contradiction to
Lemma 10.22.
We need to calculate 𝑡 and show that 𝑡 depends smoothly on 𝑥. Since 𝑔(𝑥) is a point on
boundary of 𝔻𝑛 , we know |𝑔(𝑥)| = 1, and 𝑡 is determined by the equation
or, equivalently,
where (𝑥 ⋅ 𝑣)2 − 𝑥 ⋅ 𝑥 + 1 is positive, since 𝑥 ⋅ 𝑥 = |𝑥|2 ≤ 1 and (𝑥 ⋅ 𝑣)2 > 0. Since the scalar
products and square roots involved depend smoothly on 𝑥, we see that 𝑡 depends smoothly on
𝑥. Hence 𝑔 is smooth.
∙ Note that, for 𝑛 = 1, in the above proof we would have construct a map 𝑔 ∶ [0, 1] →
{0, 1} which would send 0 to 0 and 1 to 1. Such a map cannot be continuous by the
Intermediate Value Theorem.
Actually, just as in the one-dimensional case, the theorem also holds for continuous maps:
Chapter 10. Manifolds with Boundary 205
Theorem 10.25 (Brouwer Fixed-Point Theorem for continuous maps) Every con-
tinuous map 𝐹 ∶ 𝔻𝑛 → 𝔻𝑛 has a fixed point.
Proof: The idea is to reduce this theorem to the statement on smooth maps by approxi-
mating 𝐹 by a smooth mapping. This is possible by Weierstrass’ Approximation Theorem,
an important result from Calculus. It applies as 𝔻𝑛 is compact and says:
where we apply the triangle inequality. Hence |𝑃 (𝑥)| ≤ 1 and 𝑃 is a map 𝔻𝑛 → 𝔻𝑛 . More-
over,
Now suppose that 𝐹 (𝑥) ≠ 𝑥 for all 𝑥 ∈ 𝔻𝑛 . Then the continuous function
𝔻𝑛 → 𝔻𝑛 , 𝑥 → |𝐹 (𝑥) − 𝑥|
must have a minimum 𝜇, since 𝔻𝑛 is compact. Since 𝐹 (𝑥) ≠ 𝑥 for all 𝑥, we must have 𝜇 > 0.
Now, for 𝜀 = 𝜇∕2, we choose polynomials 𝑄 and then 𝑃 as above. Since |𝐹 (𝑥) − 𝑥| ≥ 𝜇
for all 𝑥 ∈ 𝔻𝑛 , the triangle inequality yields
The assertion of Theorem 10.23 is not true for the open ball:
Let 𝔹𝑘1 (0) = {𝑥 ∈ ℝ𝑘 ∶ |𝑥| < 1} be the open ball in ℝ𝑘 . We define the map
𝑥
𝜑 ∶ 𝔹𝑘1 (0) → ℝ𝑘 , 𝑥 → √ .
1 − |𝑥|2
This is a smooth map with smooth inverse
𝑦
𝜑−1 ∶ ℝ𝑘 → 𝔹𝑘1 (0), 𝑦 → √
1 + |𝑦|2
Brouwer’s Fixed-Point Theorem has many important applications. Here is one of them:
Instead of studying the proof of this theorem, let us note a consequence of this result. You
can read more about this story and the proof on Terence Tao’s blog.
Even though it sounds like an obvious fact, this is a rather deep theorem. Note that there
exist weird things like a continuous surjection from ℝ𝑚 to ℝ𝑛 for 𝑚 < 𝑛 due to variants of the
Peano curve construction. Hence often we have to be careful with our topological intuition.
Proof of Theorem 10.27: If there was such a continuous injective map from 𝑈 to ℝ𝑚 , then
≅
we could compose it with the embedding ℝ𝑚 ←←←→
← (ℝ𝑚 × {0}) ⊂ ℝ𝑛 . Hence the composite would
yield a continuous injective map from 𝑈 into ℝ𝑛 . By the theorem, the image would be both
open in ℝ𝑛 and lie in the subspace ℝ𝑚 × {0}. But no open subset of ℝ𝑛 can be contained in
Chapter 10. Manifolds with Boundary 207
ℝ𝑚 × {0}, since we must be able to fit at least a tiny open ball of
ℝ𝑛 into that subset and there
is no room for such a ball in the direction of the remaining 𝑛 − 𝑚 coordinates.
Note that invariance of domain and dimension for smooth injective maps is just a con-
sequence of the Inverse Function Theorem. But for maps which are just continuous and
injective, it is much harder to achieve.
208 10.5. Exercises and more examples
10.5 Exercises and more examples
and
𝐺 ∶ ℝ × [−1∕2, 1∕2] → ℝ3 ,
(𝑡, 𝑠) → ((1 + 𝑠 cos(𝑡∕2)) cos 𝑡, (1 + 𝑠 cos(𝑡∕2)) sin 𝑡, 𝑠 sin(𝑡∕2)).
(a) Show that 𝑋 is a 2-dimensional manifold with boundary whose boundary is diffeo-
morphic to the disjoint union of two copies of the unit circle. (Convince yourself
that 𝑋 is a cylinder obtained by starting with a rectangular surface and then glueing
two opposite edges together.)
(b) Show that 𝑌 is a 2-dimensional manifold with boundary whose boundary is diffeo-
morphic to just one copy of the unit circle. (Convince yourself that 𝑌 is a Möbius
band obtained by starting with a rectangular surface and then glueing two opposite
edges after twisting one edge once. If you do not get through all the formulae,
make sure you understand the answer visually at least.)
Exercise 10.4 Suppose that 𝑋 is a manifold with boundary and 𝑥 ∈ 𝜕𝑋. Let 𝜙 ∶ 𝑈 →
𝑋 be a local parametrization with 𝜙(0) = 𝑥, where 𝑈 is an open subset of ℍ𝑘 . Then
𝑑𝜙0 ∶ ℝ𝑘 → 𝑇𝑥 (𝑋) is an isomorphism. Define the upper halfspace 𝐻𝑥 (𝑋) in 𝑇𝑥 (𝑋) to
be the image of ℍ𝑘 under 𝑑𝜙0 , 𝐻𝑥 (𝑋) ∶= 𝑑𝜙0 (ℍ𝑘 ).
(a) Prove that 𝐻𝑥 (𝑋) does not depend on the choice of local parametrization.
(b) Show that there are precisely two unit vectors in 𝑇𝑥 (𝑋) that are perpendicular to
𝑇𝑥 (𝜕𝑋) and that one lies inside 𝐻𝑥 (𝑋), the other outside. The one in 𝐻𝑥 (𝑋) is
called the inward unit normal vector to the boundary, and the other is the outward
unit normal vector to the boundary. Denote the outward unit normal vector by 𝑛(𝑥).
(c) If 𝑋 ⊂ ℝ𝑁 , we consider 𝑛(𝑥) as an element in ℝ𝑁 and get a map 𝑛 ∶ 𝜕𝑋 → ℝ𝑁 .
Chapter 10. Manifolds with Boundary 209
Show that 𝑛 is smooth.
𝑓 ∶ 𝑋 → 𝑌 , (𝑥, 𝑦) → 𝑥2 + 𝑦2 .
(b) Determine 𝑓 −1 (1), 𝜕(𝑓 −1 (1)) and 𝑓 −1 (1)∩𝜕𝑋. Why does the answer not contradict
the assertion of the Preimage Theorem for manifolds with boundary?
into itself. Now use the fact that there is a homeomorphism 𝔹𝑛−1 → 𝑄 to get a continuous
map 𝔹𝑛−1 → 𝔹𝑛−1 .
Exercise 10.7 Here is an alternative and equivalent definition to our previous one: A
space 𝑋 is simply-connected if it is path-connected and every continuous map 𝑓 ∶ 𝕊1 →
𝑋 can be extended to a continuous map 𝐹 ∶ 𝔹2 → 𝑋 such that 𝐹|𝕊1 = 𝑓 . In this exercise
we are going to show that 𝑋 = ℝ2 ⧵ {(0, 0)} is not simply-connected using Brouwer’s
fixed point theorem.
(a) Consider the antipodal map 𝑎 ∶ 𝕊1 → 𝕊1 which sends 𝑝 to −𝑝. Composing with
the inclusion 𝑆 1 ⊂ 𝑋, we consider 𝑎 as a map 𝑓 ∶ 𝕊1 → 𝑋. Show that, if 𝑋 =
ℝ2 ⧵ {(0, 0)} was simply-connected, then this would induce a map
𝑔 ∶ 𝔹2 → 𝑋 → 𝕊1 → 𝔹2 .
(b) Still assuming 𝑋 = ℝ2 ⧵ {(0, 0)} was simply-connected, show that 𝑔 does not have
a fixed point.
(d) Deduce from the previous point that the composed map
id
𝜄 ∶ 𝕊1 ←←←→
← 𝕊1 ⊂ ℝ2 ⧵ {(0, 0)}, 𝑝 → 𝑝,
210 10.5. Exercises and more examples
is not homotopic to the constant map 𝑐 ∶ 𝕊1 → ℝ2 ⧵ {(0, 0)} sending all points to
(1, 0).
Hint: Use that ℝ2 ⧵ {(0, 0)} is not contractible, since it is not-simply connected.
11. The Brouwer Degree modulo 2
We are now ready to define a very important and powerful invariant for smooth maps, the degree
modulo 2. We will see that despite its rather simple definition, the degree determines a lot of
the interesting properties of a map.
Lemma 11.1 (Homotopy Lemma mod 2) Let 𝑋 and 𝑌 be smooth manifolds of the
same dimension, where 𝑋 is compact and without boundary. Let 𝑓 , 𝑔 ∶ 𝑋 → 𝑌 be two
smooth maps. Assume 𝑓 and 𝑔 are smoothly homotopic. If 𝑦 ∈ 𝑌 is a regular value
for both 𝑓 and 𝑔, then
𝜕𝐹 −1 (𝑦) = 𝐹 −1 (𝑦) ∩ 𝜕(𝑋 × [0, 1]) = 𝐹0−1 (𝑦) × {0} ∪ 𝐹1−1 (𝑦) × {1}
= 𝑓 −1 (𝑦) × {0} ∪ 𝑔 −1 (𝑦) × {1}.
Since 𝑋 is compact, 𝑋 × [0, 1] is compact. Since {𝑦} is a closed subset of 𝑌 , 𝐹 −1 (𝑦) is closed
in 𝑋 × [0, 1]. This implies that 𝐹 −1 (𝑦) is compact. Hence Lemma 10.20, which follows from
the classification of compact one-manifolds, implies that 𝜕𝐹 −1 (𝑦) must have an even number
of points. Thus, computing mod 2 we get
#𝑓 −1 (𝑦) + #𝑔 −1 (𝑦) ≡ 0,
and hence
Now suppose that 𝑦 is not a regular value of 𝐹 . By Lemma 4.15, the functions 𝑦′ → #𝑓 −1 (𝑦′ )
and 𝑦′ → #𝑔 −1 (𝑦′ ) are both locally constant on the set of regular values 𝑦′ ∈ 𝑌 of 𝑓 and 𝑔,
211
212 11.1. The Brouwer Degree of maps modulo 2
respectively. Thus there is an open neighborhood 𝑈1 ⊂ 𝑌 of 𝑦 consisting only of regular values
of 𝑓 so that
We will need an important special type of homotopy which preserves more information than
homotopies in general:
The following result will allow us to move points on connected manifolds via a family of
diffeomorphisms. The fact that every map in the homotopy family is a diffeomorphism makes
it much easier to keep track of the orientation numbers at preimages.
Lemma 11.3 (Isotopy Lemma) Given any two points 𝑦 and 𝑧 in the connected man-
ifold 𝑌 , there exists a diffeomorphism ℎ ∶ 𝑌 → 𝑌 such that
∙ ℎ(𝑦) = 𝑧 and
We proof Lemma 11.3 and an extension to finitely many points in Section 11.1.3.
Using the Homotopy Lemma 11.1 and the Isotopy Lemma 11.3 we can now prove the fol-
lowing key result:
This common residue class #𝑓 −1 (𝑦) modulo 2 only depends on the homotopy class of
𝑓.
Chapter 11. The Brouwer Degree modulo 2 213
Proof: Let 𝑦 and 𝑧 be two regular values of 𝑓 . Since 𝑌 is connected, we can apply the
Isotopy Lemma 11.3 and find a diffeomorphism ℎ ∶ 𝑌 → 𝑌 which is isotopic to the identity and
with ℎ(𝑦) = 𝑧. Since ℎ is a diffeomorphism, 𝑧 is a regular value of the composite ℎ◦𝑓 . Since
ℎ is homotopic to the identity, the composite ℎ◦𝑓 is homotopic to 𝑓 . Hence the Homotopy
Lemma 11.1 implies that
Hence we have
To prove the second assertion, assume that 𝑔 ∶ 𝑋 → 𝑌 is a smooth map which is smoothly
homotopic to 𝑓 . By Sard’s Theorem 7.1, there exists an element 𝑦 ∈ 𝑌 which is a regular
value for both 𝑓 and 𝑔. By the Homotopy Lemma 11.1, this implies
Thus the number #𝑓 −1 (𝑦) is independent of 𝑦 and only depends on the smooth homotopy
class of 𝑓 . Hence we make the following definition:
Note: The degree mod 2 is defined only when the range manifold 𝑌 is connected, the
domain 𝑋 is compact, and dim 𝑋 = dim 𝑌 . Whenever we write deg2 , we assume that these
assumptions are satisfied.
Now that we have the invariant deg2 , let us contemplate on what it is good for. First of all,
we note that there are upsides and downsides equipped to deg2 :
∙ The first good news is that deg2 (𝑓 ) is a powerful invariant as we will see soon.
∙ The second good news is that deg2 (𝑓 ) is easy to calculate: just pick any regular value 𝑦
for 𝑓 and count preimage points
ℂ → ℂ, 𝑧 → 𝑧𝑛 ,
which wraps the circle 𝕊1 smoothly around 𝕊1 𝑛 times, has mod 2 degree zero if 𝑛 is
even, and one if 𝑛 is odd. For example, deg2 of the constant map 𝕊1 → 𝕊1 is equal to
deg2 of the squaring map 𝕊1 → 𝕊1 sending 𝑧 → 𝑧2 . Hence deg2 cannot distinguish
between many different maps. This is rather unfortunate.
∙ One may wonder why Theorem 11.4 does not contradict our assertion in Remark 4.17
when we discussed Milnor’s proof of the FTA that it is not sufficient for 𝑌 to be connected
that the number #𝑓 −1 (𝑦) is constant for all regular values. In the proof of Theorem 11.4
we compose 𝑓 with the diffeomorphism ℎ and then use that deg2 is homotopy-invariant.
However, the Homotopy Lemma 11.1 only shows that #𝑓 −1 (𝑦) modulo 2 is constant
within the homotopy class of 𝑓 . Hence we the fiber 𝑓 −1 (𝑦) can be empty for some regular
value, while it is non-empty for others. All we know is that the parity of #𝑓 −1 (𝑦) does
not change.
Lemma 11.6 (deg2 is additive) Let 𝑋 and 𝑍 be compact smooth manifolds without
boundary and let 𝑌 a connected smooth manifold with dim 𝑋 = dim 𝑍 = dim 𝑌 . Let
𝑓 ∶ 𝑋 ⊔ 𝑍 → 𝑌 be a smooth map. Then we have
Proof: Let 𝑦 be a regular value for 𝑓 . Then deg2 (𝑓 ) = #𝑓 −1 (𝑦), and the number of elements
in #𝑓 −1 (𝑦) is the sum of the number of elements lying in 𝑋 and 𝑍, respectively. These two
numbers are deg2 (𝑓|𝑋 ) and deg2 (𝑓|𝑍 ), respectively.
There are many important and powerful applications of deg2 . Here we just have a glimpse at a
few of them.
Let 𝑋 be a compact smooth manifold without boundary of dimension at least one. The
identity map on 𝑋 has degree one, since #(id−1 (𝑦)) = 1 for every 𝑦 ∈ 𝑋. Thus deg2 (id) = 1.
Now let 𝑐 ∶ 𝑋 → 𝑋 be a constant map with value 𝑥0 ∈ 𝑋. Then every 𝑥 ≠ 𝑥0 is a regular
value for 𝑐, since every point not in the image is regular. Thus deg2 (𝑐) = 0. Hence we can
conclude deg2 (id) ≠ deg2 (𝑐). Since deg2 is invariant under smooth homotopy, we have proven
the following interesting fact:
This map would be a smooth homotopy between a constant map and the identity map which
contradicts Theorem 11.7.
Theorem 11.8 (Boundary Theorem for deg2 ) Let 𝑌 be a connected smooth manifold.
Assume 𝑋 = 𝜕𝑊 is the boundary for some compact manifold 𝑊 and that 𝑓 ∶ 𝑋 →
𝑌 is a smooth map which can be extended to all of 𝑊 , i.e., there is a smooth map
𝐹 ∶ 𝑊 → 𝑌 with 𝐹|𝜕𝑊 = 𝑓 . Then deg2 (𝑓 ) = 0.
∙ Note that when 𝑊 is compact, then the closed subset 𝑋 = 𝜕𝑊 is also compact. Hence
deg2 (𝑓 ) is defined.
𝑝
Theorem 11.9 (Existence of zeros via deg2 ) If the mod 2 degree of ∶ 𝜕𝑊 → 𝕊1
|𝑝|
is nonzero, then the function 𝑝 has a zero inside 𝑊 .
( )
𝑝
∙ Note that calculating deg2 |𝑝| simply consists of picking a point 𝑧 ∈ 𝕊1 , we could
think of it as a direction, and just counting the number of times we find a 𝑤 ∈ 𝜕𝑊 with
𝑝(𝑤) = 𝑧, i.e., how often 𝑝(𝑤) points in the chosen direction.
∙ Theorem 11.9 says that this simple procedure tells us whether 𝑝 has a zero inside 𝑊 .
∙ Finally, if you have learned about Complex Analysis, then this should remind you of the
Residue Theorem and Cauchy’s formula.
Proof of Lemma 11.3: For the proof, we call two points 𝑦 and 𝑧 isotopic if the statement of the
theorem holds. This defines an equivalence relation on the set of points in 𝑌 . The proof of the
lemma will consist in showing that each equivalence class is open. Since equivalence classes
are disjoint, this will show that 𝑌 is the disjoint union of open subsets. Since 𝑌 is connected,
this implies there can only be one equivalence class, i.e., the lemma holds for all points in 𝑌 .
To prove that the equivalence classes are open, we will first construct an isotopy ℎ𝑡 on ℝ𝑘
such that
∙ ℎ0 is the identity,
∙ each ℎ𝑡 is the identity outside some specified small ball around 0, and
Figure 11.1: The degree is able to detect a zero of the polynomial in the region 𝑊 .
∙ We begin with the case 𝑘 = 1. Given 𝜀 > 0, let 𝜌 be smooth function that vanishes
outside (−𝜀, 𝜀) and equals 1 at 0. If 𝑧 ∈ ℝ1 , we define
ℎ𝑡 (𝑥) = 𝑥 + 𝑡 ⋅ 𝜌(𝑥) ⋅ 𝑧.
Since |𝜌′ (𝑥)| vanishes outside a compact set, it must be bounded. Hence as long as |𝑧|
is small enough, |𝑡 ⋅ 𝜌′ (𝑥) ⋅ 𝑧| < 1 for all 𝑡 ∈ [0, 1] and 𝑥 ∈ ℝ. Thus ℎ′𝑡 (𝑥) > 0 for all 𝑡
and 𝑥 and ℎ𝑡 is strictly increasing. By the Inverse Function Theorem 3.2 this implies
that its inverse function is also smooth. Hence ℎ𝑡 is a diffeomorphism of ℝ1 for all 𝑧
with |𝑧| small enough.
∙ Now let 𝑘 be arbitrary. We are going to use what we just learned from the case 𝑘 = 1.
Given any point 𝑝 in ℝ𝑘 near the origin, we may rotate the coordinate axes so that the
point lies on the first axis. Hence, writing ℝ𝑘 = ℝ1 × ℝ𝑘−1 , we can assume that 𝑝 has
coordinates of the form 𝑝 = (𝑧, 0) with 𝑧 ∈ ℝ1 .
First we choose a function 𝜌 as for 𝑘 = 1. Given 𝜀 > 0, let 𝜌 be smooth function that
vanishes outside (−𝜀, 𝜀) and equals 1 at 0. Now we choose a function 𝜎 ∶ ℝ𝑘−1 → ℝ
218 11.1. The Brouwer Degree of maps modulo 2
such that 𝜎(0) = 1 and 𝜎 is zero outside some small ball of radius 𝛿. We then define
ℎ𝑡 ∶ ℝ𝑘−1 → ℝ𝑘−1 as follows: for (𝑥, 𝑦) ∈ ℝ1 × ℝ𝑘−1 we set
∙ We have ℎ𝑡 (𝑥, 𝑦) = (𝑥, 𝑦) unless |𝑥| < 𝜀, |𝑦| < 𝛿, and 𝑡 > 0.
We can actually extend the assertion to any finite number of points. We will use this result
later.
ℎ(𝑦1 ) = 𝑧1 , … , ℎ(𝑦𝑛 ) = 𝑧𝑛 .
Proof of Theorem 11.10: The proof is by induction. The Isotopy Lemma 11.3 is the case
𝑛 = 1. Now we assume the assertion being true for 𝑛−1. Then we have a compactly supported
isotopy ℎ′𝑡 ∶ 𝑌 ⧵ {𝑦𝑛 , 𝑧𝑛 } → 𝑌 ⧵ {𝑦𝑛 , 𝑧𝑛 } such that ℎ′1 (𝑦𝑖 ) = 𝑧𝑖 for all 𝑖 < 𝑛 and ℎ′0 = Id.
Since dim 𝑌 > 1, the punctured manifold 𝑌 ⧵ {𝑦𝑛 , 𝑧𝑛 } is connected. Since the isotopy ℎ′𝑡
has compact support, there are open neighborhoods around 𝑦𝑛 and 𝑧𝑛 in 𝑌 on which the ℎ′𝑡 are
all equal to the identity. Hence we can extend the family ℎ′𝑡 to a family of diffeomorphisms of
𝑌 that fix those two points.
Chapter 11. The Brouwer Degree modulo 2 219
Now we apply the induction hypothesis again to the punctured manifold
𝑌 ⧵ {𝑦1 , … , 𝑦𝑛−1 , 𝑧1 , … , 𝑧𝑛−1 } and the points 𝑦𝑛 , 𝑧𝑛 .
Then we get a compactly supported isotopy ℎ′′𝑡 with ℎ′′1 (𝑦𝑛 ) = 𝑧𝑛 and ℎ′′0 = Id. By the same
argument as for ℎ′𝑡 , we can extend ℎ′′𝑡 to an isotopy on all of 𝑌 such that all ℎ′′𝑡 satisfy ℎ′′𝑡 (𝑦𝑖 ) = 𝑧𝑖
for all 𝑖 < 𝑛. Then
ℎ𝑡 ∶= ℎ′′𝑡 ◦ℎ′𝑡
is the desired isotopy.
Now we study an important application of the degree modulo 2 and prove a famous theorem.
First we introduce a useful new invariant.
Let 𝑧 be a point of ℝ𝑛 not lying in the image 𝑓 (𝑋). We would like to understand how
𝑓 (𝑥) winds around 𝑧. To do this, we look at the unit vector
𝑓 (𝑥) − 𝑧
𝑢(𝑥) = .
|𝑓 (𝑥) − 𝑧|
It points in the direction from 𝑧 to 𝑓 (𝑥) and has length one.
We would like to know how often this vector points in a given direction, i.e., how often
𝑢(𝑥) has a given value. We learned in Section 11.1.1 that the degree of 𝑢 is an invariant that
encodes this information:
The number #𝑢−1 (𝑦) modulo 2 is by definition deg2 (𝑢). This number is constant for regular
values 𝑦 of the map 𝑢. To be a regular value means that 𝑦 − 𝑧 hits 𝑓 (𝑋) transversally, or in
other words, the line through 𝑧 and 𝑦 must hit 𝑓 (𝑋) transversally. See Figure 11.2.
Definition11.11 (Winding number) We give this number a name and call it the
winding number of 𝑓 around 𝑧. We denote it by
𝑊2 (𝑓 , 𝑧) ∶= deg2 (𝑢).
Figure 11.2: The winding number of 𝑓 around 𝑧: we count the number of times the vector
𝑓 (𝑥) − 𝑧 points in a given direction where we neglect tangential points. Note also that some
points on the graph may contribute multiple times. In the end we take our count modulo 2.
In other words, any map that is anti-symmetric around the origin must wind around the
origin an odd number of times.
Aside: A we will see below, there is a nice interpretation of this result for the meteorologists
among us: At any given time, there are two antipodal points on the Earth that have the same
temperature and pressure. Assuming temperature and pressure vary smoothly on the surface
of the Earth.
Before we approach the proof, we observe that the Borsuk–Ulam theorem is equivalent to
the following assertion:
Proof: First assume Borsuk–Ulam Theorem 11.12 is true: Given a smooth map 𝑓 ∶ 𝕊𝑘 →
𝕊𝑘 with 𝑓 (−𝑥) = −𝑓 (𝑥), we can consider it as a map 𝑓 ∶ 𝕊𝑘 → 𝕊𝑘 ⊂ ℝ𝑘+1 . Then we have
𝑓 = 𝑓 ∕|𝑓 | and therefore
1 = deg2 (𝑓 ∕|𝑓 |) = 𝑊2 (𝑓 , 0)
Recall that we called real functions 𝑓 with 𝑓 (−𝑥) = −𝑓 (𝑥) odd. Hence, as a slogan, we
can remember the Borsuk–Ulam Theorem for a smooth map 𝑓 ∶ 𝕊𝑘 → 𝕊𝑘 as follows:
In order to prove the theorem, we first need to investigate the relationship of winding numbers
and boundaries:
𝑊2 (𝑓 , 𝑧) = #𝐹 −1 (𝑧) mod 2.
Proof:
𝑓 (𝑥) − 𝑧
𝑢 ∶ 𝑋 = 𝜕𝐷 → 𝕊𝑛−1 , 𝑥 →
|𝑓 (𝑥) − 𝑧|
𝐹 −1 (𝑧) = {𝑦1 , … , 𝑦𝑚 }.
Then we can choose local parametrizations around each 𝑦𝑖 in 𝐷 and let 𝐵𝑖 be the image of a
closed ball in ℝ𝑛 around 𝑦𝑖 . See Figure 11.3. Since 𝑧 is a regular value, the Stack of Records
Theorem 4.14 shows that 𝐹 −1 (𝑧) is discrete and disjoint to 𝑋 = 𝜕𝐷. Thus we can choose the
radii small enough such that these balls satisfy
We define
𝑓𝑖 ∶= 𝐹|𝜕𝔹𝑖 ∶ 𝜕𝔹𝑖 → ℝ𝑛 .
to be the restriction of 𝐹 to 𝜕𝔹𝑖 .
Figure 11.3: We split the contributions of the 𝑓𝑖 and 𝑓 by considering them as restrictions of
𝐹.
𝐷̃ ∶= 𝐷 ⧵ (∪𝑖 Int(𝔹𝑖 ))
𝜕 𝐷̃ = 𝜕𝐷 ∪̇ 𝜕𝔹1 ∪̇ ⋯ ∪̇ 𝜕𝔹𝑚
Since degrees and hence winding numbers are additive with respect to connected com-
ponents this yields
#𝐹 −1 (0) = 𝑊2 (𝑓 , 0) = 1 mod 2.
Now we are almost ready to attack the proof of the Borsuk–Ulam Theorem. The proof will
proceed by induction on the dimension. To treat the case 𝑘 = 1, we need one more ingredient
which we will discuss next. It will turn out, however, that the effort we put in proving the
following lemma will pay off later on.
∙ There exists a smooth map 𝑔 ∶ ℝ → ℝ such that the following diagram commutes
𝑔
ℝ /ℝ
𝑝 𝑝
𝕊1 / 𝕊1 .
𝑓
Figure 11.4: We think of ℝ lying as a spiral above 𝕊1 . That makes it easier to imagine how we
lift 𝑓 to a map 𝑔, first by local lifts which then are patched together.
Chapter 11. The Brouwer Degree modulo 2 225
The idea is that, given any smooth map 𝑓 ∶ 𝕊1 → 𝕊1 , we can lift 𝑓 locally and then patch
the pieces together to get a smooth map, see Figure 11.4,
𝑝 ∶ ℝ → 𝕊1 , 𝑡 → exp(2𝜋𝑖𝑡) = 𝑒2𝜋𝑖𝑡 .
𝑔(𝑡 + 1) − 𝑔(𝑡) ∈ ℤ.
Since the function 𝑡 → 𝑔(𝑡 + 1) − 𝑔(𝑡) is continuous and takes only values in the discrete space
ℤ, it is locally constant. Since ℝ is connected, the function must be constant. In other words,
for all 𝑡 ∈ ℝ, we have
This 𝑞 is a fixed integer depending only on 𝑓 . We can actually think of 𝑞 as measuring how
fast 𝑓 wraps 𝕊1 around itself.
[0, 1] ℝ
𝑝 ℎ 𝑝
𝕊1 /' 𝕊1
𝑓
𝑝−1
𝑠 ◦ℎ
𝑈0
0
/ℝ
]
ℎ 𝑝
𝑝 𝑝−1
𝑠 0
&
𝕊1 / 𝕊1 .
𝑓 −
226 11.2. Winding Numbers and the Borsuk–Ulam Theorem
Now we continue this procedure to construct enough local lifts to cover all of [0, 1]. That
this works can be shown as follows:1 We have just constructed a lift of ℎ on the interval [0, 𝑠]
for some 𝑠 with 0 < 𝑠 ≤ 1. Thus the set
is not empty. Since 𝐷 is bounded by 1, it has a least upper bound 𝑑 ∈ [0, 1].
First claim: 𝑑 ∈ 𝐷.
To prove the claim, let 𝑈 be an open subset of 𝕊1 containing ℎ(𝑑) such that 𝑝 maps each
component of 𝑝−1 (𝑈 ) diffeomorphically onto 𝑈 . Since ℎ is continuous, there is an open
subset 𝑊 ⊂ [0, 1] containing 𝑑 with ℎ(𝑊 ) ⊂ 𝑈 . By definition of 𝐷 and 𝑑, there is a point
𝑠′ ∈ 𝑊 such that 0 < 𝑠′ < 𝑑 and 𝑠′ ∈ 𝐷. Since we can lift ℎ to 𝑔̃ ∶ [0, 𝑠′ ] → ℝ, we have a
̃ ′ ) ∈ ℝ. We also have 𝑔(𝑠
unique point 𝑔(𝑠 ̃ ′ ) ∈ 𝑝−1 (ℎ(𝑠′ )).
Let 𝑉 ⊂ ℝ denote the component of 𝑝−1 (𝑈 ) with 𝑔(𝑠 ̃ ′ ) ∈ 𝑉 . Since ℎ(𝑠′ ) and ℎ(𝑑) are both
in 𝑈 , we can use (𝑝|𝑉 )−1 to define a local lift (𝑝|𝑉 )−1 ◦ℎ ∶ [0, 𝑠′ ] → ℝ of ℎ on [𝑠′ , 𝑑]. Now the
previous lift of ℎ on [0, 𝑠′ ] and the new one on [𝑠′ , 𝑑] agree at 𝑠′ by construction. Hence we
can glue them together to get a lift 𝑔̃ ∶ [0, 𝑑] → ℝ of ℎ on the entire interval [0, 𝑑].2 Thus
𝑑 ∈ 𝐷 as claimed.
Second claim: 𝑑 = 1.
Suppose 𝑑 < 1. For the open set 𝑊 ⊂ [0, 1] as above, we have 𝑑 ∈ 𝑊 . Since 𝑊 is open,
it still contains the open interval (𝑑 − 2𝜀, 𝑑 + 2𝜀) for some small 𝜖 > 0. Then we could use the
above argument to extend the lift of ℎ to [0, 𝑑 + 𝜀]. Thus we would have 𝑑 + 𝜀 ∈ 𝐷 which
contradicts that 𝑑 is the least upper bound of 𝐷. Hence we must have 𝑑 = 1 as claimed.
Hence we have shown that there is a smooth map 𝑔̃ ∶ [0, 1] → ℝ such that
𝑔̃
[0, 1] /ℝ
𝑝 𝑝
𝕊1 / 𝕊1
𝑓
𝑝(𝑔(1))
̃ = 𝑓 (𝑝(1)) = 𝑓 (𝑝(0)) = 𝑝(𝑔(0))
̃
and hence
𝑔(1) = 𝑔(0) + 𝑞 for some fixed 𝑞 ∈ ℤ.
1
You find a proof of this fact in almost every textbook on Algebraic Topology in the section on covering spaces
and lifting of paths. Here we follow Vick’s argument in his book Homology Theory, page 88. You may want to have
a look at other books if you do not like this one.
2
Actually, this gives us only a continuous lift a priori. But we can compose with appropriate smooth bump
functions to smoothen out any possible singular points. Since the notation would get extremely annoying, we skip
this step.
Chapter 11. The Brouwer Degree modulo 2 227
Finally, we define 𝑔 ∶ ℝ → ℝ by setting
{
̃ for all 𝑡 ∈ [0, 1], and
𝑔(𝑡) = 𝑔(𝑡)
𝑔(𝑡 + 1) = 𝑔(𝑡) + 𝑞 for all 𝑡 ∈ ℝ.
First, note that if 𝑓 is not surjective, then we can pick a point 𝑦 ∉ 𝑓 (𝕊1 ). This 𝑦 is auto-
matically a regular value. Since #𝑓 −1 (𝑦) = 0, we must have deg(𝑓 ) = 0. In this case, we need
to have 𝑞 = 0, i.e., 𝑔(𝑡 + 1) = 𝑔(𝑡). For otherwise 𝑝◦𝑔 was surjective and hence 𝑓 would be
surjective.
Note that, since the stereographic projection map 𝕊1 ⧵ {𝑦} → ℝ is a diffeomorphism and
ℝ is contractible, this shows that 𝕊1 ⧵ {𝑦} is contractible. Hence 𝑓 is a map to a contractible
space and therefore homotopic to a constant map and has degree 0.
And since 𝑝(𝑢) = 𝑝(𝑢′ ) if and only if 𝑢 − 𝑢′ ∈ ℤ, the points of the form 𝑔(𝑠) + 𝑘 are the
only points with 𝑝(𝑢) = 𝑦. Hence we need to count how often we have 𝑔(𝑡) = 𝑔(𝑠) + 𝑘 for
𝑡 ∈ [𝑠, 𝑠 + 1]. See Figure 11.5.
We assume that 𝑞 ≥ 0. If 𝑞 < 0, then we use a similar argument with −𝑞. By the Interme-
diate Value Theorem in Calculus, we know that the smooth function 𝑔|[𝑠,𝑠+1] ∶ [𝑠, 𝑠 + 1] → ℝ
takes the value 𝑔(𝑠) + 𝑘 for each integer 𝑘 ∈ {0, 1, … , 𝑞 − 1} an odd number of times and the
values 𝑔(𝑠) + 𝑘 for any other integer 𝑘 an even number of times.3 Hence, modulo 2, there are
𝑞 many points 𝑡 in the interval4 [𝑠, 𝑠 + 1) such that
Thus, modulo 2, there are 𝑞 many points in 𝑓 −1 (𝑦), and we have shown deg2 (𝑓 ) = 𝑞.
Now we are ready to prove Theorem 11.12. The proof will proceed by induction.
The case 𝑘 = 1:
3
Here we use that 𝑦 being regular implies det(𝑑𝑓𝑥 ) ≠ 0 and hence 𝑑𝑔(𝑡𝑘 ) ≠ 0 for all 𝑡𝑘 with 𝑔(𝑡𝑘 ) = 𝑔(𝑠) + 𝑘.
4
Remember that 𝑝(𝑔(𝑠 + 1)) = 𝑝(𝑔(𝑠)). Hence we only need to take one of 𝑠 and 𝑠 + 1 into account.
228 11.2. Winding Numbers and the Borsuk–Ulam Theorem
Figure 11.5: An application of important theorems of Calculus. We count how many times the
graph of 𝑔 hits integer values, marked by the horizontal lines. We know that if the graph drops
below a given value it has to pass it again to reach the value 𝑔(𝑠) + 𝑞 eventually. Counting
modulo 2 we get 𝑞 crossings of the horizontal lines.
By the previous observation, to show the theorem is equivalent to showing that a map
𝑓 ∶ 𝕊1 → 𝕊1 with 𝑓 (−𝑥) = −𝑓 (𝑥) has deg2 (𝑓 ) = 1. By the previous lemma, we can find a
smooth map 𝑔 ∶ ℝ → ℝ with 𝑓 ◦𝑝 = 𝑝◦𝑔 and 𝑔(𝑡 + 1) = 𝑔(𝑡) for some 𝑞 ∈ ℤ.
If 𝑓 is odd, then
𝑓 (𝑝(𝑡 + 1∕2)) = 𝑓 (−𝑝(𝑡)) = −𝑓 (𝑝(𝑡))
where we use 𝑝(𝑡 + 1∕2) = exp(2𝜋𝑖(𝑡 + 1∕2)) = − exp(2𝜋𝑖𝑡) = −𝑝(𝑡). Hence
𝑝(𝑔(𝑡 + 1∕2)) = 𝑓 (𝑝(𝑡 + 1∕2)) = −𝑓 (𝑝(𝑡)) = −𝑝(𝑔(𝑡)).
We also have
𝑝(𝑠1 ) = −𝑝(𝑠2 )
⇐⇒ exp(2𝜋𝑖𝑠1 ) = − exp(2𝜋𝑖𝑠2 )
⇐⇒ exp(2𝜋𝑖𝑠1 ) = exp(2𝜋𝑖𝑠2 + 𝜋𝑖)
⇐⇒ 𝑠1 = 𝑠2 + 𝑚∕2 for some odd 𝑚 ∈ ℤ.
This implies
𝑔(𝑡 + 1∕2) = 𝑔(𝑡) + 𝑚∕2 for some odd 𝑚 ∈ ℤ.
Applied to 𝑡 = 1∕2, this yields
𝑔(1) = 𝑔(1∕2 + 1∕2) = 𝑔(1∕2) + 𝑚∕2 = 𝑔(0) + 𝑚∕2 + 𝑚∕2 = 𝑔(0) + 𝑚.
Thus 𝑞 = 𝑚 is odd. Hence, by the previous lemma, we have deg2 (𝑓 ) = 𝑞 = 1 mod 2. This
finishes the case 𝑘 = 1.
Chapter 11. The Brouwer Degree modulo 2 229
Induction step: Assume the theorem is true for 𝑘 − 1 and 𝑘 ≥ 2. Let 𝑓 ∶ 𝕊𝑘 → ℝ𝑘+1 ⧵ {0}
satisfy the symmetry condition (11.1). We consider 𝕊𝑘−1 to be the equator of 𝕊𝑘 , embedded
by
Let 𝑔 ∶ 𝕊𝑘−1 → ℝ𝑘+1 ⧵ {0} be the restriction of 𝑓 to the equator. By Sard’s Theorem, we
can choose a value 𝑦 ∈ 𝕊𝑘 which is regular for both smooth maps
𝑔 𝑓
∶ 𝕊𝑘−1 → 𝕊𝑘 , and ∶ 𝕊𝑘 → 𝕊𝑘 .
|𝑔| |𝑓 |
The symmetry condition implies that 𝑦 is regular for both these maps if and only if −𝑦 is
regular for both maps, since the derivatives at preimages of 𝑦 and −𝑦 just differ by multiplying
with (−1).
𝑔
Since dim 𝕊𝑘−1 < dim 𝕊𝑘 , the only way 𝑦 can be a regular value of |𝑔|
is when 𝑦 is not in
𝑔
the image. Hence neither 𝑦 nor −𝑦 are in the image of |𝑔|
.
𝑓
That 𝑦 is regular for |𝑓 |
means by definition
( ( ) )
𝑓
Im 𝑑 = 𝑇𝑦 (𝕊𝑘 ).
|𝑓 | 𝑥
The tangent space to 𝕊𝑘 at 𝑦 is the orthogonal complement of the line pointing in direction
of 𝑦. The map 𝑥 → |𝑓𝑓 (𝑥)
(𝑥)|
is the composite of 𝑓 and the map 𝑥 → 𝑥∕|𝑥| which is smooth in
dimensions 𝑘 ≥ 2.
𝑓
𝑦 is a regular value of ⇐⇒ 𝑓 −
⋔ 𝐿 = span(𝑦). (11.2)
|𝑓 |
Now we are going to exploit these two observations for calculating 𝑊2 (𝑓 , 0). By defini-
tion, we have
( ) ( ) ( )−1
𝑓 −0 𝑓 𝑓
𝑊2 (𝑓 , 0) = deg2 = deg2 =# (𝑦) mod 2.
|𝑓 − 0| |𝑓 | |𝑓 |
By symmetry, we have
( )−1 ( )−1
𝑓 𝑓
# (𝑦) = # (−𝑦).
|𝑓 | |𝑓 |
𝑓 −1 (𝐿) = {𝑥 ∈ 𝕊𝑘 ∶ 𝑓 (𝑥) ∈ 𝐿}
𝑓 (𝑥)
= {𝑥 ∈ 𝕊𝑘 ∶ = ±𝑦}
|𝑓 (𝑥)|
( )−1 ( )−1
𝑓 𝑓
= (𝑦) ∪ (−𝑦).
|𝑓 | |𝑓 |
Thus
( )−1
𝑓 1
# (𝑦) = #𝑓 −1 (𝐿).
|𝑓 | 2
By symmetry, we can do this on the upper hemisphere 𝕊𝑘+ of 𝕊𝑘 , i.e., the points on 𝕊𝑘
with 𝑥𝑘+1 ≥ 0. Let 𝑓+ be the restriction of 𝑓 to 𝕊𝑘+ . By the choice of 𝑦, 𝐿 does not meet the
equator, and hence no point on the equator is in 𝑓 −1 (𝐿). This implies
1 −1
#𝑓 (𝐿) = #𝑓+−1 (𝐿).
2
Chapter 11. The Brouwer Degree modulo 2 231
The upper hemisphere is a manifold with boundary
∑
𝜕𝕊𝑘+ = {𝑥 = (𝑥1 , … , 𝑥𝑘+1 ) ∶ 𝑥2𝑖 = 1 and 𝑥𝑘+1 = 0} = 𝕊𝑘−1
𝑖
Now we would like to apply the previous theorem to the 𝑓+ and 𝑔 = 𝜕𝑓+ and use the
induction hypothesis. However, the target of 𝑓+ has dimension 𝑘 + 1, whereas for both the
theorem and the induction hypothesis we need as target a Euclidean space of dimension 𝑘. So
we need to fix this.
The key is that the orthogonal complement of 𝐿 in ℝ𝑘+1 , denoted by 𝑉 , is a vector space
of dimension 𝑘. By choosing a basis of 𝑉 , we can identify it with ℝ𝑘 .
To complete the argument, let 𝜋 ∶ ℝ𝑘+1 → 𝑉 be the orthogonal projection onto 𝑉 . Since
𝑔 is symmetric and 𝜋 is linear,
Moreover, we have
To finish, recall 𝑓+ −
⋔ 𝐿 and hence for
𝜋◦𝑓+ ∶ 𝕊𝑘 → 𝑉 , (𝜋◦𝑓+ ) −
⋔ {0}.
Hence we can apply the previous theorem to 𝜋◦𝑓+ and its boundary map 𝜕(𝜋◦𝑓+ ) = 𝜋◦𝑔
to get
𝜋(𝑓+ (𝑥)) = 0 ⇐⇒ 𝑓+ (𝑥) ∈ 𝐿, and hence (𝜋◦𝑓+ )−1 (0) = 𝑓+−1 (𝐿).
Thus
Remark 11.17 Going back to the definition of 𝑊2 (𝑓 , 𝑧) and the picture at the be-
ginning, we learn from the proof, in particular, that lines tangential to 𝑓 (𝑋) are not
allowed for calculating 𝑊2 (𝑓 , 𝑧).
Proof: Let 𝐿 be a line in ℝ𝑘+1 through the origin. If 𝑓 never hits 𝐿, then #𝑓 −1 (𝐿) = 0 and
−
𝑓 ⋔ 𝐿. By repeating the above proof using this 𝑓 and 𝐿 for calculating 𝑊2 (𝑓 , 0), we would get
the contradiction to the Borsuk–Ulam Theorem
𝑊2 (𝑓 , 0) = #𝑓 −1 (𝐿) = 0.
In this section we will have a first glimpse at the Hopf invariant in a modulo 2 version. We will
discuss the actual ℤ-valued Hopf invariant in the exercises later.
There are many different ways to define this tremendously influential invariant. Here we
follow Milnor’s outline [13, Problems 13-15] of Hopf’s original approach using linking num-
bers.
∙ Note that, since 𝑋 and 𝑌 are disjoint, the map 𝜆 is well-defined and smooth.
∙ The order of 𝑋 and 𝑌 does not matter modulo 2, i.e., 𝐿2 (𝑋, 𝑌 ) = 𝐿2 (𝑌 , 𝑋). For the
oriented linking number that we will study later this will be different. Then we have to
Chapter 11. The Brouwer Degree modulo 2 233
consider a sign and have 𝐿(𝑌 , 𝑋) = (−1)(𝑚+1)(𝑛+1) 𝐿(𝑋, 𝑌 ). This will imply that the Hopf
invariant vanishes when𝑛 is odd.
Figure 11.6: The red circle is the boundary of a compact manifold 𝐷. In both cases the green
curve intersects the manifold 𝐷. The linking number detects the intersection. However, multiple
intersections may occur. The circles on the left-hand side are linked and cannot be moved apart.
The curves on the right-hand side are not linked and can be moved.
Lemma 11.20 (Linking and boundary) Assume that 𝑋 is the boundary of a smooth
(𝑚 + 1)-manifold 𝑊 which is disjoint from 𝑌 . Then we have 𝐿2 (𝑋, 𝑌 ) = 0.
Proof: Since 𝑌 does not have a boundary, the product 𝑊 × 𝑌 is a smooth manifold with
boundary 𝜕(𝑊 × 𝑌 ) = 𝜕𝑊 × 𝑌 = 𝑋 × 𝑌 . Since 𝑊 and 𝑌 are disjoint, 𝜆 extends to a smooth
map on 𝑊 × 𝑌 . Then the Boundary Theorem 11.8 for degrees implies that deg2 (𝜆) = 0.
Now we extend the definition of the linking number to submanifolds 𝑋, 𝑌 in 𝕊𝑘+1 as follows:
Our next goal is to show that this number does not depend on the choice of 𝑤 and 𝑧 and
only depends on the homotopy class of 𝑓 .
Proof: Since 𝑤 is a regular value for 𝐹 , the Boundary Theorem 10.14 implies that the
subset 𝐹 −1 (𝑤) ⊂ 𝕊2𝑛−1 × [0, 1] ⊂ ℝ2𝑛+1 is a compact submanifold with boundary 𝜕𝐹 −1 (𝑤)
given by
Since 𝑓0−1 (𝑧) is a compact manifold without boundary, the product 𝐹 −1 (𝑤)×𝑓0−1 (𝑧) is a compact
manifold of with boundary given by
𝜕(𝐹 −1 (𝑤) × 𝑓0−1 (𝑧)) = (𝑓1−1 (𝑤) × {1} × 𝑓0−1 (𝑧)) ⊔ (𝑓0−1 (𝑤) × {0} × 𝑓0−1 (𝑧)).
Thus the maps 𝜆0 ∶ 𝑓0−1 (𝑤) × 𝑓0−1 (𝑧) → 𝕊2𝑛−2 and 𝜆1 ∶ 𝑓1−1 (𝑤) × 𝑓0−1 (𝑧) → 𝕊2𝑛−2 are the
restrictions of the map
𝑥−𝑦
𝜆𝐹 ∶ 𝐹 −1 (𝑤) × 𝑓0−1 (𝑧) → 𝕊2𝑛−2 , (𝑥, 𝑦) →
|𝑥 − 𝑦|
to the two boundary components. By the Boundary Theorem 11.8 for deg2 , this implies
deg2 (𝜆0 ⊔ 𝜆1 ) = 0
where 𝜆0 ⊔ 𝜆1 denotes the induced map (𝑓0−1 (𝑤) × 𝑓0−1 (𝑧)) ⊔ (𝑓1−1 (𝑤) × 𝑓0−1 (𝑧)) → 𝕊2𝑛−2 . Since
deg2 is by construction additive on connected components, this implies that
( ) ( )
𝐿2 𝑓0−1 (𝑤), 𝑓0−1 (𝑧) + 𝐿2 𝑓1−1 (𝑤), 𝑓0−1 (𝑧) ≡ 0 mod 2.
Thus we have
( ) ( )
𝐿2 𝑓0−1 (𝑤), 𝑓0−1 (𝑧) ≡ 𝐿2 𝑓1−1 (𝑤), 𝑓0−1 (𝑧) mod 2.
This proves the first equality in (11.3). Applying the same argument for the second factor proves
the second equality.
( )
Lemma 11.24 The linking number 𝐿2 𝑓 −1 (𝑤), 𝑓 −1 (𝑧) is locally constant as a func-
tion of 𝑤 and of 𝑧.
Chapter 11. The Brouwer Degree modulo 2 235
Proof: Because of the symmetry we only prove the assertion for 𝑤. The Local Submersion
Theorem 4.2 implies that the set of regular values of is an open subset of 𝕊𝑛 . Hence we can
choose an 𝜀 so that the open 𝜀-neighborhood of 𝑤 contains only regular values. Let 𝑣 be a
point in 𝕊𝑛 with |𝑣 − 𝑤| < 𝜀. We can choose a family of smooth rotations 𝑟𝑡 ∶ 𝕊𝑛 → 𝕊𝑛 so that
𝑟1 (𝑤) = 𝑣, and so that
By our choice of 𝑟𝑡 and 𝑣, note that, for each fixed 𝑡, 𝑣 is a regular value for the composition
𝑟𝑡 ◦𝑓 ∶ 𝕊2𝑛−1 → 𝕊𝑛 . This implies that 𝑣 also is a regular value for the map 𝐹 . Now we can
apply Lemma 11.23 to deduce the assertion.
the number 𝐿2 𝑓 −1 (𝑤), 𝑓 −1 (𝑧) does not depend on the choice of regular values 𝑤 and
𝑧 and only depends on the homotopy class of 𝑓 .
( )
Proof: By Lemma 11.24, the assignment (𝑤, 𝑧) → 𝐿2 𝑓 −1 (𝑤), 𝑓 −1 (𝑧) is a locally con-
stant function. Since 𝑌 is connected, the assignment is constant. Now if 𝑔 ∶ 𝕊2𝑛−1 → 𝕊𝑛
is a smooth map homotopic to 𝑓 , then there is a homotopy 𝐹 between 𝑓 and 𝑔. By Sard’s
Theorem 7.1, the set of regular values for 𝑓 , 𝑔 and 𝐹 are dense in 𝑌 . Hence we can find ele-
ments 𝑤 and 𝑧 in 𝑌 which are regular values for 𝑓 , 𝑔 and 𝐹 simultaneously. Now we can apply
Lemma 11.23 to deduce that 𝐻2 (𝑓 ) = 𝐻2 (𝑔).
Remark 11.26 (𝐻2 is a map on homotopy groups) Denoting by 𝜋2𝑛−1 (𝕊𝑛 ) the (2𝑛−1)-
homotopy group of 𝑆 𝑛 , we can view the mod 2 Hopf invariant as a map
Now we are going to compute the Hopf invariant for the Hopf fibration 𝜋 ∶ 𝕊3 → 𝕊2 , an
extremely important example of a smooth map. We recall the definition of 𝜋: We consider 𝕊3
as a subset of ℂ2 , i.e., 𝕊3 = {(𝑧0 , 𝑧1 ) ∈ ℂ2 ∶ |𝑧0 |2 + |𝑧1 |2 = 1}, and 𝕊2 as a subset of ℂ × ℝ,
i.e., 𝕊2 = {(𝑧, 𝑥) ∈ ℂ × ℝ ∶ |𝑧|2 + 𝑥2 = 1}. Then the Hopf fibration 𝜋 is the map 𝕊3 → 𝕊2
given by ( )
𝜋(𝑧0 , 𝑧1 ) = 2𝑧0 𝑧̄ 1 , |𝑧0 |2 − |𝑧1 |2 .
∙ We can conclude that 𝜋 is not homotopic to a constant map. This is a famous result
of Heinz Hopf providing examples of non-contractible maps between spheres where the
dimension of the domain is bigger than the codomain.
∙ We have shown in Exercise 2.8 that the Hopf map realizes 𝕊3 as a disjoint union of
fibers which each look like 𝕊1 . Since the Hopf invariant is the linking number of any two
distinct circles on 𝑆 3 , this shows that all these disjoint circles are linked in each other
and cannot be pulled apart.
∙ Hence 𝜋 exhibits a very special behavior of maps between spheres that only exists in a
handful of dimensions. The latter is a famous result of Frank Adams, known as the Hopf
invariant one problem, and is actually concerned with the integral version of the Hopf
invariant that we will study later. Adams’ theorem had enormous consequences on the
development of mathematics.
By definition of 𝐻2 (𝜋), we need to choose two distinct regular values 𝑤 and 𝑧 of 𝜋 and
calculate the mod 2 linking number of 𝜋 −1 (𝑤) and 𝜋 −1 (𝑧). Since we showed that each value is
regular, we can for example choose 𝑤 = 𝑎 = (0, 0, 1) and 𝑧 = 𝑏 = (0, 1, 0) on 𝕊2 ⊂ ℝ3 ≅ ℂ×ℝ.
To calculate the linking number of 𝜋 −1 (𝑎) and 𝜋 −1 (𝑏) we need to choose a point on 𝕊3
disjoint from these two subsets and stereographically project 𝕊3 from this point onto ℝ3 . By
our choice of 𝑎 and 𝑏, we get that the north pole 𝑁 = (0, 0, 0, 1) is neither on 𝜋 −1 (𝑎) nor on
𝜋 −1 (𝑏). Recall that the formula for the stereographic projection 𝜙−1
𝑁
∶ 𝕊3 ⧵ {𝑁} → ℝ3 is, with
the notation we use here, given by
1
(𝑥0 , 𝑦0 , 𝑥1 , 𝑦1 ) → (𝑥 , 𝑦 , 𝑥 ).
1 − 𝑦1 0 0 1
Chapter 11. The Brouwer Degree modulo 2 237
Hence we get
and
{ }
(1 − 𝑦0 )2
𝕊𝑏 ∶= 𝜙−1 −1
𝑁 (𝜋 (𝑏)) = 𝐲 = (𝑦0 , 𝑦1 , 𝑦2 ) ∈ ℝ ∶ 𝑦1 = 𝑦2 and
3
𝑦20 + 𝑦21 = .
2
To compute the degree of 𝜆 we pick a convenient point of 𝕊2 and determine the fiber over
this point. Then we check that we actually picked a regular value.
𝑥1 = 𝑦1 = 0 and 𝑥0 − 𝑦0 = |𝑥0 − 𝑦0 |.
The latter condition implies that 𝑣0 − 𝑤0 is positive. This does not look very helpful at first
glance, but we also know
1 = 𝑥20 + 𝑥21 = 𝑥20 , i.e., 𝑥0 = ±1,
and
(1 − 𝑦0 )2 √
𝑦20 = ⇐⇒ 𝑦0 = ± 2 − 1.
2
Hence we can check for the four possible permutations
√ of the signs whether they yield
√ 𝑥0 −𝑦0 ≥
0 and get three points: one√with 𝑥0 = 1, 𝑦0 = 2 − 1, one with 𝑥0 = 1, 𝑦0 = − 2 − 1, and
one with 𝑥0 = −1, 𝑦0 = − 2 − 1. Hence we get three points (𝐱, 𝐲) in 𝕊𝑎 × 𝕊𝑏 with 𝜆(𝐱, 𝐲) = 𝑝.
Hence, once we have shown that 𝑝 is a regular value, we will have proved 𝐻2 (𝜋) = deg2 (𝜆) ≡ 1
mod 2.
It remains to check the derivatives of 𝜆 at these points and to show that 𝑝 is a regular value.
Hence we have to show that the determinants at each point are nonzero.
Since 𝕊𝑎 is the unit circle in the 𝑥𝑦-plane, the tangent space of 𝕊𝑎 at a point 𝐱 is given by
The derivative of 𝑔𝑏 as a map from 𝑃 → ℝ is given by the (2×1)-matrix (we could also consider
it as a map ℝ3 → ℝ2 )
𝑑(𝑔𝑏 )𝐲 = (2𝑦0 + 2, 2𝑦1 ).
238 11.3. Linking numbers and the Hopf invariant modulo 2
Hence we get
This will help us, since 𝑔3 ◦𝜆̃ is constant on 𝕊𝑎 × 𝕊𝑏 . Hence 𝑑 𝜆̃(𝐱,𝐲) sends the subspace 𝑇𝐱 𝕊𝑎 ×
𝑇𝐲 𝕊𝑏 ∈ ℝ3 × ℝ3 to 𝑇𝑝 𝑆 2 ⊂ ℝ3 .
𝜕 𝜆̃ 𝜕 𝜆̃
We determine 𝑑 𝜆̃(𝐱,𝐲) by computing its partial derivatives 𝜕𝑥 𝑖 (𝐱, 𝐲) and 𝜕𝑦 𝑖 (𝐱, 𝐲) with respect
𝑗 𝑗
to the variables 𝑥0 , 𝑥1 , 𝑥2 and 𝑦0 , 𝑦1 , 𝑦2 :
For 𝑖 ≠ 𝑗, we have
For 𝑖 = 𝑗, we get
⎧(𝑥1 − 𝑦1 )2 + (𝑥2 − 𝑦2 )2 if 𝑖 = 0
𝜕 𝜆̃𝑖 1 ⎪
(𝐱, 𝐲) = ⋅ ⎨(𝑥0 − 𝑦0 )2 + (𝑥2 − 𝑦2 )2 if 𝑖 = 1
𝜕𝑥𝑖 |𝐱 − 𝐲| ⎪
3
2
⎩(𝑥0 − 𝑦0 ) + (𝑥1 − 𝑦1 )
2 if 𝑖 = 2
and
𝜕 𝜆̃𝑖 𝜕 𝜆̃
(𝐱, 𝐲) = − 𝑖 (𝐱, 𝐲).
𝜕𝑦𝑖 𝜕𝑥𝑖
Now we evaluate these formulae at the points (𝐱, 𝐲) with 𝜆(𝐱, 𝐲) = 𝑝. For each such point
we found 𝑥1 = 𝑥2 = 𝑦1 = 𝑦2 = 0. Hence we get
𝜕 𝜆̃𝑖 𝜕 𝜆̃
(𝐱, 𝐲) = 0 = 𝑖 (𝐱, 𝐲)
𝜕𝑥𝑗 𝜕𝑦𝑗
for 𝑖 ≠ 𝑗,
{
𝜕 𝜆̃𝑖 1 0 if 𝑖 = 0
(𝐱, 𝐲) = ⋅
𝜕𝑥𝑖 |𝑥0 − 𝑦0 | 1 if 𝑖 = 1, 2
and
{
𝜕 𝜆̃𝑖 1 0 if 𝑖 = 0
(𝐱, 𝐲) = ⋅
𝜕𝑦𝑖 |𝑥0 − 𝑦0 | −1 if 𝑖 = 1, 2.
⎛0⎞
At each of the points (𝐱, 𝐲) we found with 𝜆(𝐱, 𝐲) = 𝐩, the vector 𝐚 = ⎜1⎟ is a basis of 𝑇𝐱 𝕊𝑎
⎜ ⎟
⎝0⎠
⎛0⎞ ⎛0⎞
and the vector 𝐛 = ⎜1⎟ provides a basis of 𝑇𝐲 𝕊𝑏 . The map 𝑑𝜆(𝐱,𝐲) sends 𝐚 to |𝑥 −𝑦
1
⋅ ⎜1⎟ and 𝐛
⎜ ⎟ 0 0| ⎜ ⎟
⎝1⎠ ⎝0⎠
⎛0⎞
1
to |𝑥 −𝑦 ⋅ ⎜−1⎟. Hence we have
0 0| ⎜ ⎟
⎝−1⎠
1 1 1
𝑑𝜆(𝐱,𝐲) (𝐚) = ⋅ 𝐞 and 𝑑𝜆(𝐱,𝐲) (𝐛) = − ⋅𝐞 − ⋅𝐞 .
|𝑥0 − 𝑦0 | 2 |𝑥0 − 𝑦0 | 2 |𝑥0 − 𝑦0 | 3
These two vectors form a basis of 𝑇𝑝 𝕊2 and we see that (𝐱, 𝐲) is a regular point. Since this is
true for all points in the fiber of 𝑝 ∈ 𝕊2 , we conclude that 𝑝 actually is a regular value.
240 11.4. Exercises and more examples
11.4 Exercises and more examples
𝑓 𝑔
Exercise 11.1 Let 𝑋 ←←←→ ← 𝑍 be a sequence of smooth maps between manifolds with
← 𝑌 ←←→
𝑋 and 𝑌 compact, 𝑌 and 𝑍 connected. Assume that all three manifolds are boundaryless
and dim 𝑋 = dim 𝑌 = dim 𝑍. Show that
Exercise 11.3 Show that there exists a complex number 𝑧 such that
𝑝(𝑧) = 𝑧𝑚 + 𝑎1 𝑧𝑚−1 + ⋯ + 𝑎𝑚
be a monic complex polynomial. Show that there exists a 𝑤 ∈ ℂ such that 𝑝(𝑤) = 0.
Show that 𝑔 satisfies 𝑔(−𝑞) = −𝑔(𝑞) for all 𝑞 ∈ 𝕊1 = 𝜕𝔹2 . What is the degree modulo 2
of 𝑔? Conclude that 𝑓 cannot exist.
Exercise 11.9 Use the previous exercise to prove the following special case of our
previous observations: For every smooth map 𝕊2 → ℝ2 , there is a point 𝑝 on 𝕊2 such
that 𝑓 (𝑝) = 𝑓 (−𝑝).
Exercise 11.10 Assume the following continuous version of the assertion of Exer-
cise 11.9: For every continuous map 𝕊2 → ℝ2 , there is a point 𝑝 on 𝕊2 such that
𝑓 (𝑝) = 𝑓 (−𝑝).
Deduce from this fact the following version of Invariance of Dimension:
An open subset in ℝ2 cannot be homeomorphic to an open subset in ℝ𝑛 for 𝑛 ≥ 3.
12. Tubular Neighborhoods and Transversality
In this section we study two key tools in differential topology. We will see several important
application in the next sections. We are going to use a generalization of the Inverse Function
Theorem that we will prove first.
Proof: We know that 𝑓 maps 𝑍 diffeomorphically onto its image 𝑓 (𝑍), since 𝑓 ∶ 𝑍 →
𝑓 (𝑍) is a bijective local diffeomorphism and therefore a diffeomorphism. We would like to
show that we can extend this to an open neighborhood around 𝑍.
Since 𝑑𝑓𝑥 is an isomorphism for all 𝑥 ∈ 𝑍, for each 𝑥 ∈ 𝑍, there exists an open neighbor-
hood 𝑈𝑥 in 𝑋 around 𝑥 on which 𝑓|𝑈𝑥 is a diffeomorphism. The collection {𝑈𝑥 } is an open cover
of 𝑍. Since 𝑍 is compact, we can choose a finite subcover {𝑈1 , … , 𝑈𝑛 }. We set 𝑈 ∶= ∪𝑖 𝑈𝑖 .
Restricted to the open subset 𝑈 , 𝑓|𝑈 is a local diffeomorphism. It remains to shrink 𝑈 if nec-
essary to ensure that 𝑓|𝑈 is also injective, i.e., we need to show that there is some open subset
𝑉 in 𝑋 which contains 𝑍 such that 𝑓|𝑉 is injective. Then 𝑓|𝑈 ∩𝑉 is an injective local diffeo-
morphism. We have shown previously that this implies that 𝑓|𝑈 ∩𝑉 is a diffeomorphism onto
an open subset of 𝑌 . Since 𝑍 ⊂ 𝑈 and 𝑍 ⊂ 𝑉 , we also have 𝑍 ⊂ 𝑈 ∩ 𝑉 and the assertion is
proven.
We are going to show that 𝑉 exists by assuming the contrary, i.e., we assume that there
exists at least one point 𝑧 ∈ 𝑍 such that in any small open neighborhood 𝑊 of 𝑧 there are points
𝑎 ≠ 𝑏 with 𝑓 (𝑎) = 𝑓 (𝑏). By choosing open neighborhoods 𝑊𝑖 smaller and smaller around 𝑧
and by choosing subsequences 𝑎𝑖 ≠ 𝑏𝑖 with 𝑓 (𝑎𝑖 ) = 𝑓 (𝑏𝑖 ), we can assume that both the 𝑎𝑖 and
𝑏𝑖 converge to 𝑧. Since 𝑓 (𝑎𝑖 ) = 𝑓 (𝑏𝑖 ) for all 𝑖 and 𝑓 is continuous, we have 𝑓 (𝑎𝑖 ) → 𝑓 (𝑧) and
242
Chapter 12. Tubular Neighborhoods and Transversality 243
𝑓 (𝑏𝑖 ) → 𝑓 (𝑧). But since 𝑑𝑓𝑧 is an isomorphism, the usual Inverse Function Theorem implies
that there is a small open neighborhood 𝑊𝑧 in 𝑋 around 𝑧 such that 𝑓|𝑊𝑧 is a diffeomorphism.
Since 𝑎𝑖 → 𝑧 and 𝑏𝑖 → 𝑧, for 𝑁 large enough, we have 𝑎𝑖 , 𝑏𝑖 ∈ 𝑊𝑧 and hence 𝑓 (𝑎𝑖 ) = 𝑓 (𝑏𝑖 ) ∈
𝑓 (𝑊𝑧 ) for all 𝑖 ≥ 𝑁. But since 𝑓|𝑊𝑧 is injective, this implies 𝑎𝑖 = 𝑏𝑖 for all 𝑖 ≥ 𝑁. This
contradicts the choice of the 𝑎𝑖 and 𝑏𝑖 .
The existence of partitions of unity allows us to move from the compact to the general
case. First we recall the following notion from general topology:
Lemma 12.3 (Local finiteness lemma) Every open cover {𝑈𝛼 } on a manifold admits
a locally finite refinement {𝑉𝛼 }.
Since the above lemma is rather a statement in general topology, we omit its proof here. So
we move on and use it to generalize the Inverse Function Theorem:
Proof: For each 𝑥 ∈ 𝑍, there exists an open neighborhood 𝑉𝑥 in 𝑋 around 𝑥 on which 𝑓|𝑉𝑥
is a diffeomorphism, since 𝑑𝑓𝑥 is an isomorphism for all 𝑥 ∈ 𝑍. Let 𝑈𝑥 = 𝑓 (𝑉𝑥 ) be the open
image in 𝑌 . The collection of all 𝑈𝑥 is an open cover of 𝑓 (𝑍), since each 𝑓 (𝑥) ∈ 𝑓 (𝑍) lies
in some 𝑈𝑥 = 𝑓 (𝑉𝑥 ). By the previous lemma, we can choose a locally finite subcover {𝑈𝑖 } of
𝑓 (𝑍) in 𝑌 . For each 𝑈𝑖 , there is a local inverse 𝑔𝑖 ∶ 𝑈𝑖 → 𝑉𝑖 ⊂ 𝑋 of 𝑓|𝑉𝑖 . We define
It remains to show that 𝑊 contains an open subset which still contains 𝑓 (𝑍). Let 𝑥 ∈ 𝑍.
Since 𝑓 (𝑥) ∈ 𝑓 (𝑍), we can find a 𝑘 such that 𝑓 (𝑥) ∈ 𝑈𝑘 . If 𝑈𝑘 ⊂ 𝑊 , we are done, since then
every point in 𝑓 (𝑍) has an open neighborhood which is contained in 𝑊 . So assume 𝑈𝑘 is not
contained in 𝑊 . After shrinking 𝑈𝑗 if necessary, we can assume by the local finiteness of the
cover {𝑈𝑖 }, that there are only finitely many of the 𝑈𝑖 ’s which intersect 𝑈𝑘 , say 𝑈1 , … , 𝑈𝑛 .
244 12.1. Normal bundles and tubular neighborhoods
Then, for 𝑖 = 1, … , 𝑛, we set 𝐶𝑖𝑘 to be the closure of the set {𝑦 ∈ 𝑈𝑖 ∩ 𝑈𝑘 ∶ 𝑔𝑖 (𝑦) ≠ 𝑔𝑘 (𝑦)},
i.e.,
𝐶𝑖𝑘 = {𝑦 ∈ 𝑈𝑖 ∩ 𝑈𝑘 ∶ 𝑔𝑖 (𝑦) ≠ 𝑔𝑘 (𝑦)}.
Since the union of a finite collection of closed subsets is closed, 𝐶𝑘 ∶= 𝐶1𝑘 ∪ ⋯ ∪ 𝐶𝑛𝑘 is closed
in 𝑌 . Hence
𝑈 ∶= 𝑈𝑘 ⧵ 𝐶𝑘
Note that 𝑓 (𝑥) satisfies 𝑔𝑖 (𝑓 (𝑥)) = 𝑥 = 𝑔𝑘 (𝑓 (𝑥)) for all 𝑖 = 1, … , 𝑛. Since 𝑑𝑓𝑥 is an
isomorphism, the usual Inverse Function Theorem implies that there is a small open neigh-
borhood 𝑉𝜀 ⊂ 𝑈 around 𝑥 such that 𝑓|𝑉𝜀 is a diffeomorphism. Hence, for each 𝑖 = 1, … , 𝑛, we
have
Hence the finite intersection 𝑓 (𝑉𝜀 ) ∩ 𝑈𝑘 ∩ 𝑈1 ∩ ⋯ ∩ 𝑈𝑛 is an open subset which is not contained
in any of the sets {𝑦 ∈ 𝑈𝑖 ∩ 𝑈𝑘 ∶ 𝑔𝑖 (𝑦) ≠ 𝑔𝑘 (𝑦)}. Thus 𝑓 (𝑥) is not contained in 𝐶𝑘 . Hence we
have shown that 𝑈 ⊂ 𝑊 is an open subset containing 𝑓 (𝑥).
Let 𝑋 ⊂ ℝ𝑚 be a smooth manifold without boundary. We would like to understand a little bit
of the geometry of 𝑋 with respect to its environment. In order to prove the 𝜀-Neighborhood
Theorem we introduce an important geometric tool similar to the tangent bundle.
Definition 12.5 (The Normal Bundle) For each 𝑥 ∈ 𝑋, define 𝑁𝑥 (𝑋), the normal
space of 𝑋 at 𝑥, to be the orthogonal complement of 𝑇𝑥 (𝑋) in ℝ𝑚 . The normal bundle
𝑁(𝑌 ) is then defined to be the set
∙ Warning: Note that, unlike 𝑇 (𝑋), 𝑁(𝑋) is not intrinsic to the manifold 𝑌 but depends
on the specific relationship between 𝑌 and the surrounding ℝ𝑚 .
The normal bundle 𝑁(𝑋) is actually a smooth manifold itself. In order to show this, we
must recall a fact from linear algebra: Suppose that 𝐴 ∶ ℝ𝑚 → ℝ𝑘 is a linear map. Its trans-
pose is a linear map 𝐴𝑇 ∶ ℝ𝑘 → ℝ𝑚 characterised by the dot product equation
𝐴𝑣 ⋅ 𝑤 = 𝑣 ⋅ 𝐴𝑇 𝑤 for all 𝑣 ∈ ℝ𝑚 , 𝑤 ∈ ℝ𝑘 .
Chapter 12. Tubular Neighborhoods and Transversality 245
Lemma 12.6 (Help from Linear Algebra) If 𝐴 is surjective, then the transpose 𝐴𝑇
maps ℝ𝑘 isomorphically onto the orthogonal complement of the kernel of 𝐴.
Proof: We need to find local parametrizations for 𝑁(𝑋). Let 𝑥 be any point in 𝑋. We
learned in the discussion of the Local Immersion Theorem that we can find an open subset
𝑉 ⊂ ℝ𝑚 containing 𝑥 and local coordinate functions (𝑢1 , … , 𝑢𝑚 ) ∶ 𝑉 → ℝ𝑚 such that
We set 𝑈 = 𝜑−1 (0) = 𝑋 ∩ 𝑉 . Note that, since 𝑉 is open in ℝ𝑚 , we deduce that 𝑈 is open in
𝑋. Let 𝑁(𝑈 ) be the normal bundle of 𝑈 considered as a smooth manifold in ℝ𝑚 . We observe
that 𝑁(𝑈 ) equals 𝑁(𝑋) ∩ (𝑈 × ℝ𝑚 ), thus it is open in 𝑁(𝑋).
For each 𝑥 ∈ 𝑈 , 𝑑𝜑𝑥 ∶ ℝ𝑚 → ℝ𝑚−𝑛 is surjective and has kernel 𝑇𝑥 (𝑈 ) = 𝑇𝑥 (𝑋) by the
Preimage Theorem 6.3. Therefore the transpose of 𝑑𝜑𝑥 maps ℝ𝑚−𝑛 isomorphically onto the
orthogonal complement of Ker (𝑑𝜑𝑥 ) = 𝑇𝑥 (𝑋). But this is 𝑁𝑥 (𝑋) by definition. Hence we
get an isomorphism
≅
← (𝑇𝑥 (𝑋))⟂ = 𝑁𝑥 (𝑋).
(𝑑𝜑𝑥 )𝑇 ∶ ℝ𝑘 ←←←→
is bijective. It is also an embedding of 𝑈 ×ℝ𝑚−𝑛 into 𝑈 ×ℝ𝑚 , since it is the identity on the first
factor and an injective linear map on the second factor. Hence 𝜓 is a diffeomorphism. Thus
we have shown that 𝑁(𝑈 ) is a smooth manifold with local parametrization 𝜓. The dimension
of 𝑁(𝑈 ) is
dim 𝑁(𝑈 ) = dim 𝑈 + 𝑚 − 𝑛 = 𝑛 + 𝑚 − 𝑛 = 𝑚.
Since every point of 𝑁(𝑋) has such a neighborhood 𝑁(𝑈 ), 𝑁(𝑋) is a smooth manifold.
Note that 𝜎◦𝜓 ∶ 𝑈 ×ℝ𝑘 → 𝑈 is just the projection onto the first factor, which is a submersion.
Thus 𝑑(𝜎◦𝜓)(𝑢,𝑤) , is surjective at every point (𝑢, 𝑤). Hence 𝑑𝜎𝑢 is surjective at every 𝑢, and 𝜎
is a submersion.
246 12.1. Normal bundles and tubular neighborhoods
12.1.3 The Tubular Neighborhood Theorem
Proof: For each point 𝑥 ∈ 𝑋, we can find a small radius 𝜀𝑥 such that the open ball 𝐵2𝜀𝑥 (𝑥) ⊂
𝑈 is contained in 𝑈 . We set
𝑈𝑥 ∶= 𝑋 ∩ 𝐵𝜀𝑥 (𝑥).
𝜀
∙ Claim: 𝑈𝑥 𝑥 = {𝑦 ∈ ℝ𝑛 ∶ |𝑦 − 𝑥′ | < 𝜀𝑥 for some 𝑥′ ∈ 𝑈𝑥 } ⊂ 𝑈 .
𝜀
For, 𝑦 ∈ 𝑈𝑥 𝑥 means there is an 𝑥′ ∈ 𝑈𝑥 with |𝑦−𝑥′ | < 𝜀𝑥 . But 𝑥′ ∈ 𝑈𝑥 means |𝑥′ −𝑥| < 𝜀𝑥 .
Thus the triangle inequality implies
Thus 𝑣 ∈ 𝐵2𝜀𝑥 (𝑥) ⊂ 𝑈 by the choice of 𝜀𝑥 . The collection of all 𝑈𝑥 forms an open cover {𝑈𝑥 }
of 𝑋 ⊂ ℝ𝑛 . By the existence of partitions of unity for subsets in ℝ𝑛 , we can choose a partition
of unity {𝜌𝑖 } subordinate to the cover {𝑈𝑥 }. Now we define the function
∑
𝜀 ∶ 𝑋 → ℝ>0 , 𝑥 → 𝜌𝑖 (𝑥)𝜀𝑥
𝑖
∙ Claim: 𝑋 𝜀 ⊂ 𝑈 .
Let 𝑦 ∈ 𝑋 𝜀 . Then there is a 𝑥 ∈ 𝑋 such that |𝑦 − 𝑥| < 𝜀(𝑥). For this 𝑥, only finitely many
of the numbers 𝜌𝑖 (𝑥) are nonzero, say 𝜌𝑖1 (𝑥), … , 𝜌𝑖𝑛 (𝑥). This implies 𝑦 ∈ 𝑈𝑖1 ∩ ⋯ ∩ 𝑈𝑖𝑛 . Let
∑
𝜀𝑖𝑚 be the maximum of the finitely many numbers 𝜀𝑖1 , … , 𝜀𝑖𝑛 . Then, since 𝑖 𝜌𝑖 (𝑥) = 1, we
have 𝜀(𝑥) ≤ 𝜀𝑖𝑚 . Hence
𝜀𝑖𝑚
|𝑦 − 𝑥| < 𝜀(𝑥) ≤ 𝜀𝑖𝑚 implies 𝑣 ∈ 𝑈𝑖 ⊂ 𝑈.
𝑚
Thus 𝑋 𝜀 ⊂ 𝑈 .
If 𝑋 is compact, we can reduce {𝑈𝑥 } to a finite cover 𝑈𝑥1 , … , 𝑈𝑥𝑛 and let 𝜀 be the maxi-
mum of the 𝜀𝑥𝑗 .
Chapter 12. Tubular Neighborhoods and Transversality 247
Now we would like to realize such neighborhoods 𝑋𝜀 of 𝑋 using the normal bundle 𝑁(𝑋)
of 𝑋 in ℝ𝑛 . We define the map
ℎ ∶ 𝑁(𝑋) → ℝ𝑛 , (𝑥, 𝑣) → 𝑥 + 𝑣.
Since 𝑁(𝑋) is a smooth manifold in ℝ𝑛 × ℝ𝑛 and ℎ is just the restriction of the addition map
on ℝ𝑛 , ℎ is smooth. Geometrically, ℎ maps each normal space 𝑁𝑥 (𝑋) to the affine subspace
containing 𝑥 which is orthogonal to 𝑇𝑥 𝑋. The map ℎ provides us with the connection of the
neighborhood 𝑋 𝜀 with the normal bundle in the following way:
A key feature of smooth manifolds embedded in some Euclidean space is that they always
have a tubular neighborhood. We will first prove this important fact and then study some of its
consequences.
Proof: Let again 𝑋0 = 𝑁 0 (𝑋) ⊂ 𝑁(𝑋) denote the zero-section of 𝑁(𝑋), i.e., the subspace
𝑋0 = {(𝑥, 0) ∶ 𝑥 ∈ 𝑋}. The map ℎ maps 𝑋0 diffeomorphically onto 𝑋. Our goal is to show
that this extends to an open neighborhood around 𝑋0 . To do this we show that 𝑑ℎ(𝑥,0) is an
isomorphism for every point (𝑥, 0) in 𝑋0 as follows:
∙ First, we observe that, since ℎ|𝑋0 ∶ 𝑋0 → 𝑋 is a diffeomorphism, 𝑑ℎ(𝑥,0) maps 𝑇(𝑥,0) (𝑋0 ) ⊂
𝑇(𝑥,0) 𝑁(𝑋) isomorphically onto 𝑇𝑥 (𝑋) ⊂ ℝ𝑛 .
∙ Second, since the restriction of ℎ to the normal space 𝑁𝑥 (𝑋) ⊂ 𝑁(𝑋) is given by 𝑣 →
𝑥 + 𝑣, we see that 𝑑ℎ(𝑥,0) maps 𝑇(𝑥,0) ({𝑥} × 𝑁𝑥 (𝑋)) ⊂ 𝑇(𝑥,0) (𝑁(𝑋)) isomorphically onto
𝑁𝑥 (𝑋) ⊂ ℝ𝑛 .
∙ Since dim 𝑇(𝑥,0) (𝑁(𝑋)) = 𝑛, this implies that 𝑑ℎ(𝑥,0) is an isomorphism for every 𝑥.
Hence the assumptions of the generalized Inverse Function Theorem 12.4 are satisfied
and we can conclude that ℎ maps an open neighborhood of 𝑋0 in 𝑁(𝑋) diffeomorphically
onto an open neighborhood of 𝑋 in ℝ𝑛 . By the 𝜀-Neighborhood Lemma 12.8, every open
neighborhood of 𝑋 contains an 𝑋 𝜀 for a smooth function 𝜀 ∶ 𝑋 → ℝ>0 . It is clear from the
definition of ℎ that 𝑋 𝜀 is the image the open neighborhood 𝑁 𝜀 (𝑋) of 𝑋0 in 𝑁(𝑋) for the same
function 𝜀.
248 12.1. Normal bundles and tubular neighborhoods
Finally, we have shown that there is a smooth map ℎ−1 ∶ 𝑋 𝜀 → 𝑁 𝜀 (𝑋) ⊂ 𝑁(𝑋). We define
the map 𝜋 by
∙ We emphasize that a key point for the existence of 𝜋 is that we do not only have 𝑋 𝜀 but
also that we know that it is the diffeomorphic image of 𝑁 𝜀 (𝑋) under ℎ. Hence we really
need to use the normal bundle.
∙ For some applications, the important point of the theorem is not so much the existence
of the 𝑋 𝜀 , but rather that they come equipped with the submersion 𝜋.
∙ Recall from Lemma 12.8 that, if 𝑋 is compact, then 𝜀 > 0 can be chosen constant, and
𝑋 𝜀 is the open set of points in ℝ𝑛 with distance less than 𝜀 from 𝑋. If 𝜀 is sufficiently
small, then each point 𝑤 ∈ 𝑋 𝜀 possesses a unique closest point in 𝑋. Writing 𝜋(𝑤) for
this unique point, defines the map 𝜋 ∶ 𝑋 𝜀 → 𝑋 in this case. After studying the proof of
the theorem, it is a good exercise to check that this actually yields the desired map 𝜋.
∙ Tubular neighborhoods are very useful, for example for the Pontryagin–Thom construc-
tion which is key for proving more advanced results in differential topology. We will
see some applications in the next section.
We can actually consider normal bundles more generally whenever we have a submanifold
𝑍 ⊂ 𝑋 in order to understand the geometry of 𝑍 in 𝑋:
∙ We think of this as the relative normal bundle, since we take the normal space within the
tangent space of 𝑋, not in the ambient Euclidean space.
Theorem 12.12 (Normal bundles are manifolds revisited) The normal bundle
𝑁(𝑍, 𝑋) is a smooth manifold of dimension equal to dim 𝑋. The canonical map
𝜎 ∶ 𝑁(𝑍, 𝑋) → 𝑍, 𝜎(𝑧, 𝑣) = 𝑧,
is a submersion.
Proof: Let 𝑧 be a point in 𝑍. We can find an open subset 𝑉 ⊂ ℝ𝑚 containing 𝑧 and local
coordinate functions (𝑢1 , … , 𝑢𝑚 ) ∶ 𝑉 → ℝ𝑚 such that
𝑍 ∩ 𝑉 = {𝑣 ∈ ℝ𝑚 ∶ 𝑢1 (𝑣) = ⋯ = 𝑢𝑛 (𝑣) = 0}
and 𝑋 ∩ 𝑉 = {𝑢 ∈ ℝ𝑚 ∶ 𝑢𝑘+1 (𝑣) = ⋯ = 𝑢𝑛 (𝑣) = 0}
𝜓 ∶ 𝑈 × ℝ𝑛 → 𝑁𝑈 (𝑍, ℝ𝑚 ) ∶= (𝑈 × ℝ𝑚 ) ∩ 𝑁(𝑍, ℝ𝑚 ),
(𝑢, 𝑣) → (𝑢, 𝑑𝜑𝑇𝑢 (𝑣))
id×𝑝
𝜙
&
𝑁𝑈 (𝑍, 𝑋)
where 𝑁𝑈 (𝑍, 𝑋) ∶= (𝑈 × ℝ𝑚 ) ∩ 𝑁(𝑍, 𝑋) and 𝑝 is the map induced by the orthogonal pro-
jection 𝑝𝑧 ∶ ℝ𝑚 → 𝑇𝑧 (𝑋) at each 𝑧. Note that, for a vector 𝑤 ∈ ℝ𝑚 which satisfies 𝑤 ⟂ 𝑇𝑧 (𝑍),
we have 𝑝(𝑤) ∈ 𝑇𝑧 (𝑋) and 𝑝(𝑤) ⟂ 𝑇𝑧 (𝑍). Let 𝜑̃ = (𝑢𝑘+1 , … , 𝑢𝑛 ) ∶ 𝑉 → ℝ𝑛−𝑘 . We observe
that, by our choice of 𝜑 and 𝜑,
̃ we know
and the orthogonal projection 𝑝𝑧 varies smoothly with 𝑧. At each 𝑧 ∈ 𝑈 , the dimension of the
kernel of the composite
𝑑𝜑𝑇𝑧 𝑝𝑧
ℝ𝑛 ←←←←←←←→
← 𝑁𝑧 (𝑍, ℝ𝑚 ) ←←←←→
← 𝑁𝑧 (𝑍, 𝑋)
is
dim Ker (𝑝𝑧 ) = dim 𝑁𝑧 (𝑍, ℝ𝑚 ) − dim 𝑁𝑧 (𝑍, 𝑋),
since 𝑑𝜑𝑇𝑧 is an isomorphism. We can calculate this dimension by
We note again that 𝜎◦𝜙 ∶ 𝑈 × ℝ𝑘 → 𝑈 is just the projection onto the first factor, which is a
submersion. Thus 𝑑(𝜎◦𝜙)(𝑢,𝑣) , is surjective at every point (𝑢, 𝑣). Hence 𝑑𝜎𝑢 is surjective at
every 𝑢, and 𝜎 is a submersion.
∙ Note that an 𝑛-dimensional vector bundle which is diffeomorphic to the product of the
base space with ℝ𝑛 is called trivial. Hence we just showed that 𝑁(𝕊𝑘−1 , 𝕊𝑘 ) is a trivial
one-dimensional bundle.
∙ We get a similar result when we consider 𝕊𝑘−1 ⊂ ℝ𝑘 for 𝑘 ≥ 2. Then, at any 𝑝 ∈ 𝕊𝑘−1 ,
the unit vector 𝑝∕|𝑝| spans the normal complement to 𝑇𝑝 (𝕊𝑘−1 ) in ℝ𝑘 . Hence there is a
diffeomorphism
∙ However, there are a lot of nontrivial vector bundles as well. Important examples
are the tangent bundle 𝑇 (𝕊2 ) over 𝕊2 and the universal bundle over the Grassmannian
Gr 𝑘 (ℝ𝑛+𝑘 ).
∙ Note that for any 𝑧 ∈ 𝑍, the preimage 𝜎 −1 (𝑧) =∶ 𝑁𝑧 (𝑍, 𝑋) is the space of normal
vectors to 𝑍 at 𝑧 in 𝑇𝑧 (𝑋) that we have met before.
∙ Warning: As for 𝑁(𝑋), 𝑁(𝑍, 𝑋) is not intrinsic to the manifold 𝑍 but depends on the
specific relationship between 𝑍 and the surrounding space 𝑋.
𝜋𝑋 ∶ 𝑋 𝜀𝑋 → 𝑋.
ℎ ∶ 𝑁(𝑍, 𝑋) → ℝ𝑛 , (𝑧, 𝑣) → 𝑧 + 𝑣.
𝑊 ≔ ℎ−1 (𝑋 𝜀𝑋 ) ⊂ 𝑁(𝑍, 𝑋)
is an open neighborhood of 𝑍0 in 𝑁(𝑍, 𝑋). Since ℎ(𝑧, 0) = 𝑧 for all 𝑧 ∈ 𝑍, the composition
ℎ 𝜋
← 𝑋 𝜀𝑋 ←←→
𝑓 ∶ 𝑊 ←←→ ← 𝑋
is the identity when we restrict it to 𝑍0 . By the same argument as above, we can show that
𝑑ℎ(𝑧,0) is an isomorphism at every point of 𝑍0 in 𝑁(𝑍, 𝑋). Since 𝑑𝜋𝑧 is the identity for all 𝑧 ∈
𝑍 ⊂ 𝑋 𝜀𝑋 , 𝑑𝑓(𝑧,0) is an isomorphism for every (𝑧, 0) ∈ 𝑍0 ⊂ 𝑁(𝑍, 𝑋). Hence the assumptions
of the generalized Inverse Function Theorem 12.4, are satisfied, and we can conclude that
there is an open neighborhood 𝑉 of 𝑍0 in 𝑁(𝑍, 𝑋) which is mapped diffeomorphically onto
an open neighborhood 𝑈 of 𝑍 in 𝑋 by 𝑓 = 𝜋◦ℎ. Then we can find a positive smooth function
𝜀 ∶ 𝑍 → ℝ>0 such that 𝑍 ⊂ 𝑍 𝜀 ⊂ 𝑈 and 𝑍 𝜀 is diffeomorphic to an open neighborhood
𝑁 𝜀 (𝑍, 𝑋) of 𝑍0 in 𝑁(𝑍, 𝑋). The composition
≅ 𝜎𝑁 𝜀 (𝑍,𝑋)
𝜋 ∶ 𝑍 𝜀 ←←←→
← 𝑁 𝜀 (𝑍, 𝑋) ←←←←←←←←←←←←←←←→
← 𝑍
is a diffeomorphism.
In this section we show that every continuous map between smooth manifolds can be approx-
imated by a smooth map. In particular, we will show that every continuous map is homotopic
to a smooth one. The key tool that makes this work are tubular neighborhoods.
We begin with the approximation of maps to Euclidean space. We will show that such maps
can be approximated in the following sense:
Proof: Let 𝐴 ⊂ 𝑋 be a closed subset and assume that 𝑓|𝐴 is smooth. By definition, this
means that there is an open subset 𝑈 ⊂ 𝑋 with 𝐴 ⊂ 𝑈 and a smooth map 𝐹 ∶ 𝑈 → ℝ𝑛 such
that 𝑓|𝐴 = 𝐹|𝐴 . We define the set 𝑈 as
This is an open subset in 𝑋, since the function 𝑥 → 𝜀(𝑥) − |𝐹 (𝑥) − 𝑓 (𝑥)| is continuous and 𝑈
is the inverse image of the open subset ℝ>0 under this function. Note that 𝑋 ⧵ 𝐴 ⊂ 𝑈0 , since
𝐹 (𝑥) − 𝑓 (𝑥) = 0 for all 𝑥 ∈ 𝐴. If 𝐴 = ∅, we set 𝑈0 ∶= ∅.
1
|𝐹 (𝑦) − 𝑓 (𝑥)| < 𝜀(𝑥) < 𝜀(𝑦) for all 𝑦 ∈ 𝑈𝑥 .
2
Chapter 12. Tubular Neighborhoods and Transversality 253
We can find
{ such a neighborhood,
} since 𝑥 is fixed and 𝜀 and 𝐹 are continuous maps. The
collection 𝑈𝑥 ∶ 𝑥 ∈ 𝑋 ⧵ 𝐴 of all such neighborhoods for all 𝑥 is an open cover of 𝑋 ⧵ 𝐴.
{ }∞
Now we can choose a countable subcover 𝑈𝑥𝑖 and we set 𝑈𝑖 ∶= 𝑈𝑥𝑖 to simplify the
𝑖=1
notation.
Let {𝜌0 , 𝜌𝑖 } be a smooth partition of unity subordinate to the open cover {𝑈0 , 𝑈𝑖 } of 𝑋. We
define the map 𝑔 ∶ 𝑋 → ℝ𝑛 by
∑
∞
𝑔(𝑥) ∶= 𝜌0 (𝑥)𝐹 (𝑥) + 𝜌𝑖 (𝑥)𝑓 (𝑥𝑖 ).
𝑖=1
Since 𝐹 , 𝜌 and the 𝜌𝑖 are smooth and the 𝑓 (𝑥𝑖 ) are fixed values, 𝑔 is smooth. Moreover, we
∑
have 𝑔|𝐴 = 𝐹|𝐴 = 𝑓|𝐴 . It remains to show that 𝑔 and 𝑓 are 𝜀-close: Since 𝑖≥0 𝜌𝑖 (𝑥) = 1 and
|𝐹 (𝑥) − 𝑓 (𝑥𝑖 )| < 21 𝜀(𝑥) for all 𝑥 ∈ 𝑋, we see
| ( ) |
| ∑∞
∑∞
|
|𝑔(𝑥) − 𝑓 (𝑥)| = ||𝜌0 (𝑥)𝐹 (𝑥) + 𝜌𝑖 (𝑥)𝑓 (𝑥𝑖 ) − 𝜌0 (𝑥) + 𝜌𝑖 (𝑥) 𝑓 (𝑥)||
| |
| 𝑖=1 𝑖=1 |
∑
∞
≤ 𝜌0 (𝑥)|𝐹 (𝑥) − 𝑓 (𝑥)| + 𝜌𝑖 (𝑥)|𝑓 (𝑥𝑖 ) − 𝑓 (𝑥)|
𝑖=1
∑
∞
< 𝜌0 (𝑥)𝜀(𝑥) + 𝜌𝑖 (𝑥)𝜀(𝑥) = 𝜀(𝑥).
𝑖=1
Now we can extend this result to smooth manifolds embedded in ℝ𝑛 . Recall that two maps
𝑓 , 𝑔 ∶ 𝑋 → 𝑌 are said to be homotopic relative to a subset 𝐴 ⊂ 𝑋 if there is a homotopy
𝐻 ∶ 𝑋 × [0, 1] → 𝑌 such that 𝑓 (𝑥) = 𝐻(𝑥, 𝑡) = 𝑔(𝑥) for all 𝑥 ∈ 𝐴 and all 𝑡 ∈ [0, 1].
𝜀̃ = 𝜀◦𝑓 ∶ 𝑋 → ℝ>0 .
By the previous theorem, there is a smooth map 𝑔 ∶ 𝑋 → ℝ𝑛 with 𝑔|𝐴 = 𝑓|𝐴 and which is
𝜀-close
̃ to 𝑓 . Now we define the map 𝐻 ∶ 𝑋 × [0, 1] → 𝑌 as the composition of 𝜋 with a
straight-line homotopy between 𝑓 and 𝑔:
Hence we can conclude that 𝐻 is a homotopy between 𝐻(𝑥, 0) = 𝜋(𝑓 (𝑥)) = 𝑓 (𝑥) and
𝐻(𝑥, 1) = 𝜋(𝑔(𝑥)). Moreover, we have 𝐻(𝑥, 𝑡) = 𝑓 (𝑥) for all 𝑥 ∈ 𝐴. The map 𝜋◦𝑔 is smooth
and the desired map homotopic to 𝑓 .
The restriction of ℎ̃ to 𝑋 × {0} ∪ 𝑋 × {1} is smooth, since it is equal to 𝑓 ◦𝜋1 on 𝑋 × {1} and
𝑔◦𝜋1 where 𝜋1 ∶ 𝑋 × ℝ → 𝑋 denotes the projection. If ℎ is a homotopy relative to 𝐴, then ℎ̃
is also smooth on 𝐴 × [0, 1].
The ideas we used to prove Brouwer’s fixed point Theorem 10.23 together with the approxima-
tion theorems of this section lead to the following stronger form of Theorem 11.7:
∙ The theorem says that no compact manifold of dimension 𝑛 ≥ 1 has the homotopy type
of a point.
∙ In Algebraic Topology, we learn about this fact when we show that the 𝑛th singular ho-
mology group of a connected compact manifold is nontrivial, more specifically we have
𝐻𝑛 (𝑋; ℤ∕2) ≅ ℤ∕2. Since homology is invariant under homotopy and 𝐻𝑛 ({𝑥0 }; ℤ∕2) =
0 for 𝑛 ≥ 1, there cannot be a homotopy equivalence between 𝑋 and a one-point space.
∙ Here, however, we give a more geometric explanation. For the theorem follows again
from the classification of one-manifolds in Theorem 10.17 together with Whitney’s Ap-
proximation Theorem 12.18.
Proof: Assume there was a smooth homotopy 𝐻 ∶ 𝑋 × [0, 1] → 𝑋 between the identity on
𝑋 and a constant map with image 𝑥0 ∈ 𝑋, i.e., 𝐻(𝑥, 0) = 𝑥 and 𝐻(𝑥, 1) = 𝑥0 for all 𝑥 ∈ 𝑋. By
Sard’s Theorem 10.16, the map 𝐻 has at least one regular value 𝑦 ∈ 𝑋. We note that, since
dim 𝑋 ≥ 1, we must have 𝑦 ≠ 𝑥0 . This follows from the fact that 𝑑𝐻(𝑥,1) fails to be surjective,
since 𝐻(−, 1) ∶ 𝑋 → 𝑋 is constant.
The fiber 𝐻 −1 (𝑦) is a smooth manifold by the Preimage Theorem 10.14. Since 𝑋 is com-
pact, so is 𝑋 × [0, 1]. Since 𝐻 −1 (𝑦) is closed in 𝑋 × [0, 1], 𝐻 −1 (𝑦) is compact as well. Since
𝐻 −1 (𝑦) has codimension dim 𝑋 in 𝑋 × [0, 1], we have dim 𝐻 −1 (𝑦) = 1.
Moreover, 𝐻 −1 (𝑦) may have a boundary, since 𝑋 has no boundary and hence 𝑋 × [0, 1] is
a manifold with boundary. The Preimage Theorem 10.14 for manifolds with boundary implies
that this boundary has the form
For 𝑡 = 1, however, 𝐻(−, 1) is constant with 𝐻(𝑥, 1) = 𝑥0 for all 𝑥 ∈ 𝑋. Since we know
𝑥0 ≠ 𝑦, we have
(𝑋 × {1}) ∩ 𝐻 −1 (𝑦) = ∅.
Thus we have 𝜕𝐻 −1 (𝑦) = (𝑦, 0). Hence the boundary of 𝐻 −1 (𝑦) consists of a single point. This
contradicts Lemma 10.20 which says that a compact one-manifold with boundary always has
an even number of boundary points.
256 12.3. Ehresmann’s Fibration Theorem
By Whitney’s Approximation Theorem 12.18, we know that if there was a smooth homo-
topy between the identity and a constant map, then there was a continuous homotopy between
the identity and a constant map. Since such a smooth homotopy cannot exist, we get the theo-
rem.
𝑓|𝑓 −1 (𝑈 ) 𝑝𝑈
# z
𝑈
The following famous theorem gives us a sufficient criterion for a map to be locally trivial
based on the notions we have studied before:
∙ Ehresmann’s fibration theorem is highly influential in many areas of geometry and topol-
ogy. For example, in complex and algebraic geometry it implies that that the higher direct
images along 𝑓 of a constant sheaf form a local system on 𝑌 (see [18]).
∙ Let 𝑦0 ∈ 𝑌 be a given basepoint in 𝑌 . The theorem also tells us that we can think of a
proper submersion 𝑓 ∶ 𝑋 → 𝑌 as a family of diffeomorphic copies of the fiber 𝑓 −1 (𝑦0 )
varying over the points in 𝑌 .
Proof of Theorem 12.23: The assertion is clear if 𝑋 is the empty set. So assume 𝑋 and
𝑓 −1 (𝑦) are nonempty for 𝑦 ∈ 𝑌 . Since 𝑓 is a submersion, 𝑦 is a regular value for 𝑓 . Hence
𝑍 ∶= 𝑓 −1 (𝑦) is a submanifold of 𝑋. Then the relative version of the Tubular Neighborhood
Theorem 12.14 says that there is a neighborhood 𝑍 𝜀 of 𝑍 which is open in 𝑋 and a submersion
𝜋 ∶ 𝑍 𝜀 → 𝑍 for which the restriction to 𝑍 is the identity. Then we can form the map
𝑞 ∶= (𝜋, 𝑓|𝑍 𝜀 ) ∶ 𝑍 𝜀 → 𝑍 × 𝑌 .
Since 𝑓 is proper, 𝑍 = 𝑓 −1 (𝑦) is compact. Moreover, the Preimage Theorem 4.6 tells us
that 𝑍 is of dimension dim 𝑋 − dim 𝑌 and 𝑇𝑧 (𝑍) = Ker (𝑑𝑓𝑧 ). This implies that the derivative
𝑑𝑞𝑧 of 𝑞 at any point 𝑧 ∈ 𝑍 ⊂ 𝑍 𝜀 is an isomorphism, as 𝑑𝑓𝑧 is surjective onto 𝑇𝑦 𝑌 and 𝑑𝜋𝑧
is surjective onto 𝑇𝑧 𝑍. Hence 𝑑𝑞𝑧 is a surjective linear map and since the dimension of 𝑇𝑧 𝑍 𝜀
is equal to dim 𝑋 = dim 𝑍 + dim 𝑌 , 𝑑𝑞𝑧 must be an isomorphism. Thus we can apply the
generalized Inverse Function Theorem 12.4 to 𝑍 ⊂ 𝑍 𝜀 and 𝑞. Hence we get that there is a
neighborhood 𝑍 ⊂ 𝑊 ⊂ 𝑍 𝜀 of 𝑍 which is open in 𝑍 𝜀 such that 𝑞|𝑊 is a diffeomorphism
onto an open neighborhood of 𝑞(𝑍) in 𝑍 × 𝑌 .
Finally, since 𝑓 is proper, it is a closed map. Hence, Lemma 12.24 below implies that
there is an open neighborhood 𝑈 ⊂ 𝑌 around 𝑦 such that 𝑍 ⊂ 𝑓 −1 (𝑈 ) ⊂ 𝑊 . Then we
have 𝑞(𝑓 −1 (𝑈 )) = 𝑈 × 𝑍 and have shown that 𝜑 ∶= 𝑞𝑓 −1 (𝑈 ) ∶ 𝑓 −1 (𝑈 ) → 𝑈 × 𝑓 −1 (𝑦) is a
diffeomorphism such that 𝑓|𝑓 −1 (𝑈 ) = 𝑝𝑈 ◦𝜑.
Proof: We only show the implication we need. The other direction is left as an exercise.
Since 𝑊 is open, its complement 𝑋 ⧵ 𝑊 is closed in 𝑋. Since 𝑓 is closed, the image 𝑓 (𝑋 ⧵ 𝑊 )
is closed in 𝑌 . Hence the complement 𝑈 ∶= 𝑌 ⧵ 𝑓 (𝑋 ⧵ 𝑊 ) is open in 𝑌 . As 𝑓 −1 (𝑦) ⊂ 𝑊 , we
know 𝑦 ∉ 𝑓 (𝑋 ⧵ 𝑊 ) and hence 𝑦 ∈ 𝑈 and 𝑓 −1 (𝑦) ⊆ 𝑓 −1 (𝑈 ) ⊆ 𝑊 .
We are going to review what we have learned about transversality and show that transversality
is actually a generic property. This is a quite long and technical chapter. You may want to skip
258 12.4. Thom’s Transversality Theorem
some details for the first reading and get back to them later. The good news is that the results we
prove here will lay the ground for the intersection theory we will develop in the next chapter.
So hang in there, it is worth it!
∙ Main application:
Our main application will be to reach our goal to define an interesting intersection theory
for smooth manifolds which helps us deciding difficult questions. For example, we would like
to use it to show that 𝕊2 and ℝP2 are not homeomorphic. So assume we have a compact smooth
𝑘-dimensional manifold (without boundary), 𝑌 an 𝑛-dimensional manifold, and 𝑍 a closed 𝑚-
dimensional submanifold of 𝑌 . Let 𝑓 ∶ 𝑋 → 𝑌 be a smooth map which is transverse to 𝑍.
Then the Preimage Theorem 6.2 tells us that 𝑓 −1 (𝑍) is a 𝑘 + 𝑚 − 𝑛-dimensional submanifold
of 𝑋. Since it is a closed subset in the compact space 𝑋, 𝑓 −1 (𝑍) is also compact.
Proof: Since both 𝐹 and 𝜕𝐹 are transversal to 𝑍, the Preimage Theorem 10.15 implies
that 𝑊 ∶= 𝐹 −1 (𝑍) is a submanifold of 𝑋 × 𝑆 with boundary
∙ Claim: 𝑓𝑠 −
⋔ 𝑍.
If 𝑒 is zero, we are done: Since 𝑓𝑠 is the restriction of 𝐹 to 𝑋 × {𝑠}, it follows that 𝑑(𝑓𝑠 )𝑥
is the restriction of 𝑑𝐹(𝑥,𝑠) to 𝑇𝑥 (𝑋) × {0} ⊂ 𝑇𝑥 (𝑋) × 𝑇𝑠 (𝑆), i.e., the diagram
𝑇𝑥 (𝑋) × {0} / 𝑇 (𝑋) × 𝑇 (𝑆)
𝑥 𝑠
𝑑𝐹(𝑥,𝑠)
𝑑(𝑓𝑠 )𝑥 (
𝑇𝑧 (𝑌 )
However, 𝑒 need not be zero. But we may use the projection 𝜋 to modify 𝑤 and 𝑒 as
follows: It is an exercise to check that
𝑑𝜋(𝑥,𝑠) ∶ 𝑇𝑥 (𝑋) × 𝑇𝑠 (𝑆) → 𝑇𝑠 (𝑆)
is just projection onto the second factor. In fact, this holds for every projection map from a
product of manifolds.
Now we use the assumption that 𝑠 is a regular value of 𝜋. For this implies that the restric-
tion of 𝑑𝜋(𝑥,𝑠) to 𝑇(𝑥,𝑠) (𝑊 )
𝑇(𝑥,𝑠) (𝑊 ) / 𝑇 (𝑋) × 𝑇 (𝑆)
𝑥 𝑠
𝑑𝜋(𝑥,𝑠)
𝑑𝜋(𝑥,𝑠) )|𝑇(𝑥,𝑠) (𝑊 ) '
𝑇𝑠 (𝑆)
260 12.4. Thom’s Transversality Theorem
is surjective. In particular, the fiber over 𝑒 ∈ 𝑇𝑠 (𝑆) is nonempty, and there is some vector of
the form (𝑢, 𝑒) in 𝑇(𝑥,𝑠) (𝑊 ).
and we remember that both 𝑑𝐹(𝑥,𝑠) (𝑤, 𝑒)−𝑎 and 𝑑𝐹(𝑥,𝑠) (𝑢, 𝑒) belong to 𝑇𝑧 (𝑍). Hence 𝑑(𝑓𝑠 )𝑥 (𝑣)−
𝑎 is in 𝑇𝑧 (𝑍) as we wished to prove.
∙ Claim: 𝜕𝑓𝑠 −
⋔ 𝑍.
This is a special instance of what we just proved, for the case of the boundaryless manifold
𝜕𝑋 and the map 𝜕𝐹 ∶ (𝜕𝑋) × 𝑆 → 𝑌 .
This shows that transversality for smooth maps 𝑋 → ℝ𝑁 is generic in the following sense:
𝐹 ∶ 𝑋 × 𝑆 → ℝ𝑁 , 𝐹 (𝑥, 𝑠) = 𝑓 (𝑥) + 𝑠.
Thus 𝑑𝐹(𝑥,𝑠) is surjective at any (𝑥, 𝑠). Hence 𝐹 is a submersion. This implies that 𝐹
is transverse to every submanifold 𝑍 ⊂ ℝ𝑁 . Now we can apply the Transversality
Theorem 12.25 we have just proven:
Let 𝑍 ⊂ ℝ𝑁 be a manifold. Since 𝐹 and 𝜕𝐹 are transverse to 𝑍, for almost every 𝑠 ∈ 𝑆,
the map 𝑓𝑠 (𝑥) = 𝑓 (𝑥) + 𝑠 is transversal to 𝑍. Moreover, the map
∙ This shows us that transversality is generic for maps 𝑋 → ℝ𝑁 . We would like to gen-
eralize this result to an arbitrary boundaryless smooth manifold 𝑌 ⊂ ℝ𝑁 and smooth
map 𝑓 ∶ 𝑋 → 𝑌 .
Chapter 12. Tubular Neighborhoods and Transversality 261
∙ Given a submanifold 𝑍 ⊂ 𝑌 , we have just learned how to vary a smooth map 𝑓 ∶ 𝑋 →
𝑌 ⊂ ℝ𝑁 as a family of maps 𝑋 → ℝ𝑁 such that 𝑓𝑠 − ⋔ 𝑍 for arbitrarily small 𝑠, where we
consider 𝑍 as a submanifold in ℝ𝑁 . It remains to project these maps down to 𝑌 such
that a small perturbation 𝑓𝑠 of 𝑓 remains transversal to the given submanifold 𝑍 ⊂ 𝑌 .
We can do this using normal bundles and tubular neighboorhoods we constructred in
the previous section.
𝑆 → 𝑌 , 𝑠 → 𝐹 (𝑥, 𝑠) is a submersion.
from the Tubular Neighborhood Theorem 12.10. Since 𝜋 restricts to the identity on 𝑌 , we
have
𝐹 (𝑥, 0) = 𝜋(𝑓 (𝑥) + 0) = 𝑓 (𝑥).
is the translation of a linear map. Thus 𝑑𝜑𝑠 is just given by multiplying a vector in 𝑇𝑠 (𝑆) =
ℝ𝑀 by the real number 𝜀(𝑓 (𝑥)) > 0 to get a vector in 𝑇𝜑(𝑠) (𝑌 𝜀 ) ⊂ ℝ𝑁 . This derivative is just
𝜀(𝑓 (𝑥)) times the identity of ℝ𝑁 , and therefore surjective. Thus 𝜑 is a submersion.
𝜑 𝜋
← 𝑌 , 𝑠 → 𝐹 (𝑥, 𝑠) is a submersion.
← 𝑌 𝜀 ←←→
𝐹 (𝑥, −) = 𝜋◦𝜑 ∶ 𝑆 ←←←→
Proof: For the family of mappings 𝐹 of the previous corollary of the 𝜀-Neighborhood
Theorem, the Transversality Theorem implies that 𝑓𝑠 −
⋔ 𝑍 and 𝜕𝑓𝑠 −
⋔ 𝑍 for almost all 𝑠 ∈ 𝑆.
But each 𝑓𝑠 is homotopic to 𝑓 , the homotopy being
𝑋 × 𝐼 → 𝑌 , (𝑥, 𝑡) → 𝐹 (𝑥, 𝑡𝑠).
In fact, Tubular Neighborhood Theorem 12.10 allows us to prove a stronger form of the
Transversality Homotopy Theorem 12.27. In order to be able to formulate it, we need some
terminology.
Proof: Let 𝐶 ′ be any closed set contained in 𝑈 that contains 𝐶 in its interior, and let {𝜌𝑖 }
be a partition of unity subordinate to the open cover {𝑈 , 𝑋 ⧵ 𝐶 ′ }. Here it comes handy that
we proved the existence of partition of unity for arbitrary subsets of ℝ𝑁 . Then just take 𝛾 to
be the sum of those 𝜌𝑖 that vanish outside of 𝑋 ⧵ 𝐶 ′ .
Second, let 𝛾 be the function in the above lemma for the closed subset 𝐶 and the open
neighborhood 𝑈 of 𝐶 in 𝑋. We set 𝜏 ∶= 𝛾 2 . Since
Now we modify the map 𝐹 ∶ 𝑋 × 𝑆 → 𝑌 which we defined in (12.1) in proving the Homo-
topy Theorem 12.27, where 𝑆 is the unit ball in ℝ𝑀 . and set
∙ Claim: 𝐺 −
⋔ 𝑍.
For suppose that (𝑥, 𝑠) ∈ 𝐺−1 (𝑍), and let us assume first 𝜏(𝑥) ≠ 0. Then the map
𝑆 → 𝑌 , 𝑟 → 𝐺(𝑥, 𝑟),
To show the claim when 𝜏(𝑥) = 0, we need to check that the image of the derivative 𝑑𝐺(𝑥,𝑠)
is big enough. To do this, we introduce the map
We observe that 𝐺 = 𝐹 ◦𝑚. Hence in order to calculate the derivative of 𝐺, we can apply
the chain rule. Since we are interested in the case where 𝜏(𝑥) = 0 and 𝑑𝜏𝑥 = 0 we get
Moreover, since 𝐹 (𝑥, 0) = 𝑓 (𝑥) for all 𝑥 by construction of 𝐹 , we know 𝐹|𝑋×{0} = 𝑓 . This
implies
𝑑𝐹(𝑥,𝑠) (𝑣, 0) = 𝑑𝐹(𝑥,0) (𝑣, 0) = 𝑑𝑓𝑥 (𝑣).
Hence we get
and therefore
Now 𝜏(𝑥) = 0, implies 𝑥 ∈ 𝑈 by definition of 𝛾 and 𝜏. But by the choice of 𝑈 above, this
implies 𝑓 −
⋔ 𝑍 at 𝑥. Hence (12.3) implies 𝐺 −
⋔ 𝑍 at (𝑥, 𝑠).
(a) The Transversality Theorem 12.25 says that when a homotopy 𝐹 is transversal
to 𝑍, then, in this homotopy family, almost every 𝑓𝑠 = 𝐹 (−, 𝑠) is transversal to
𝑍.
Chapter 12. Tubular Neighborhoods and Transversality 265
(b) The Transversality Homotopy Theorem 12.27 says that given a map 𝑓 and a sub-
manifold 𝑍, then there exists a map 𝑔 transversal to 𝑍 and 𝑔 is homotopic to
𝑓.
(c) The Extension Theorem 12.29 says that, given a map 𝑓 which is transversal to 𝑍
on a subset 𝐶, then we can always replace 𝑓 with a homotopic map 𝑔 which is
transversal to 𝑍 everywhere (not only on 𝐶) and 𝑓 = 𝑔 on an open set containing
𝐶.
(a) is a generalization of Sard’s Theorem 10.16. For (b) and (c), the key for the proof
is the Tubular Neighborhood Theorem 12.10.
266 12.5. Exercises and more examples
12.5 Exercises and more examples
12.5.1 Transversality
from previous exercise sets. Is the set Im (𝑓 ), the image of 𝑓 in ℝ2 , a manifold? Is the
set (𝑔◦𝑓 )−1 (1) a manifold?
𝑓 𝑔
Exercise 12.3 Let 𝑋 ←←←→ ← 𝑍 be a sequence of smooth maps between manifolds,
← 𝑌 ←←→
and let 𝑊 ⊂ 𝑍 be a submanifold. Assume that 𝑔 is transversal to 𝑊 . Show:
𝑓−
⋔ 𝑔 −1 (𝑊 ) if and only if (𝑔◦𝑓 ) −
⋔𝑊.
Exercise 12.4 Let 𝑉 be a vector space, and let Δ be the diagonal of 𝑉 × 𝑉 . For a linear
map 𝐴 ∶ 𝑉 → 𝑉 , consider the graph Γ(𝐴) = {(𝑣, 𝐴𝑣) ∶ 𝑣 ∈ 𝑉 }. Show that Γ(𝐴) − ⋔ Δ if
and only if +1 is not an eigenvalue of 𝐴.
Exercise 12.5 Let 𝑓 ∶ 𝑋 → 𝑋 be a map, and let 𝑥 be a fixed point of 𝑓 , i.e., 𝑓 (𝑥) = 𝑥.
If +1 is not an eigenvalue of 𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝑋) → 𝑇𝑥 (𝑋), then 𝑥 is called a Lefschetz fixed
point of 𝑓 . The map 𝑓 is called a Lefschetz map if all its fixed points are Lefschetz. Prove
that if 𝑋 is compact and 𝑓 is Lefschetz, then 𝑓 has only finitely many fixed points.
Hint: Show that the intersection of the graph of 𝑓 and the diagonal of 𝑋 is a 0-
Chapter 12. Tubular Neighborhoods and Transversality 267
dimensional submanifold of 𝑋 × 𝑋.
Prove that 𝑆𝑘7 is a 7-dimensional manifold by showing that the intersection is transver-
sal in ℂ5 ⧵ {0}.
Hint:
( At some point you may want ) to show that, at a point 𝑧 = (𝑧1 , … , 𝑧5 ), the vector
𝑤 ≔ 𝑚2 𝑧1 , 𝑚2 𝑧2 , 𝑚2 𝑧3 , 𝑚3 𝑧4 , 6𝑘−1
𝑚
𝑧5 , with 𝑚 ≔ 2 ⋅ 3 ⋅ (6𝑘 − 1), lies in one of the tangent
spaces but not in the other.
13. Intersection Theory modulo 2
A classical geometric approach to classifying maps is to study their fibres. This is directly
related to other fundamental problems in mathematics. For example, if 𝑓 ∶ 𝑋 → 𝑌 is a map
defined by an equation and given a value 𝑦 ∈ 𝑌 , the set {𝑥 ∈ 𝑋 ∶ 𝑓 (𝑥) = 𝑦} is the set of
solutions of the equation 𝑓 (𝑥) = 𝑦. In geometric terms, we could rephrase the question which
𝑥 solve equation 𝑓 (𝑥) = 𝑦 by asking how 𝑓 meets or intersects the subspace {𝑦} in 𝑌 .
Building on the methods we have developed so far, we are going to exploit this geometric
idea to derive interesting and powerful invariants. We will start with intersection numbers
modulo 2, i.e., intersection numbers with values in ℤ∕2. In order to define a ℤ-valued invariant
we will have to introduce orientations later.
We would like to generalize the notion of intersection numbers. A first obstacle is that if 𝑋
and 𝑍 do not intersect transversally, then it makes in general no sense to count the points in
𝑋 ∩ 𝑍. Hence, once again, transversality is key.
Luckily, we have learned how to move or deform manifolds to make intersections transver-
sal: we can alter them in homotopic families. And since embeddings form a stable class of
maps, i.e., for any homotopy 𝑖𝑡 of an embedding 𝑖0 , there is an 𝜀 > 0 such that 𝑖𝑡 is still an
embedding for all 𝑡 < 𝜀, any small homotopy of 𝑖 gives us another embedding 𝑋 → 𝑌 and thus
268
Chapter 13. Intersection Theory modulo 2 269
Figure 13.1: A tangential or non-transverse intersection is not stable. Any slight perturbation
will change the number of intersection points. We need to avoid such situations.
But we still have to be careful. For the intersection number may depend on how we move
or deform the manifold.
Figure 13.2: We can move the two circles in the plane and either move them apart or make them
intersect transversely. In both cases we get a stable number of intersection points. However, 0
is not equal 2, unless we work modulo 2. That is what we are going to do.
For example, take two circles in ℝ2 . Assume that they intersect non-transversally, i.e.,
they touch each other in a point such that both tangent spaces agree and together just span a line.
Then we can move the circles by a simple translations 𝑥 → 𝑥 + 𝑡𝑎 in direction 𝑎 such that they
intersect either in two points or in no points. In both cases, the intersection is transversal, but
the intersection numbers do not agree. We observe, however, that the parity of the intersection
numbers is preserved, i.e., up to a multiple of 2 the intersection numbers after moving into a
transversal intersection agree.
𝐼2 (𝑓 , 𝑍) ∶= #𝑓 −1 (𝑍) mod 2.
Of course, we need to check that the intersection number does not depend on the choice of
homotopic map. The key technical result that allows us to show independence is the Extension
Theorem 12.29 which says the following: Let 𝑓 ∶ 𝑋 → 𝑌 be a smooth map, 𝑌 boundaryless,
and 𝑍 a closed submanifold of 𝑌 without boundary. Let 𝐶 be a closed subset of 𝑋. Assume
that 𝑓 −
⋔ 𝑍 on 𝐶 and 𝜕𝑓 − ⋔ 𝑍 on 𝐶 ∩ 𝜕𝑋. Then there exists a smooth map 𝑔 ∶ 𝑋 → 𝑌
homotopic to 𝑓 such that 𝑔 −⋔ 𝑍 and 𝜕𝑔 − ⋔ 𝑍, and on a neighborhood of 𝐶 we have 𝑔 = 𝑓 .
Let 𝑋, 𝑌 and 𝑍 ⊂ 𝑌 are boundaryless manifolds. The product 𝑋 × [0, 1] is then a manifold
with boundary. We let 𝐶 be the boundary of 𝑋 × [0, 1], i.e., 𝐶 is the closed subset
Now we apply the Extension Theorem 12.29 to the case of a smooth homotopy
𝐹 ∶ 𝑋 × [0, 1] → 𝑌 .
Then 𝜕𝐹 , i.e., 𝐹 restricted to the boundary of 𝑋 × [0, 1], is given by the two maps
𝐺 ∶ 𝑋 × [0, 1] → 𝑌 with 𝐆 −
⋔ 𝐙 and 𝜕𝐺 −
⋔ 𝑍,
Hence
since we assume that dim 𝑋 + dim 𝑍 = dim 𝑌 . Moreover, the boundary of 𝐹 −1 (𝑍) is
𝜕𝐹 −1 (𝑍) = 𝐹 −1 (𝑍) ∩ 𝜕(𝑋 × [0, 1]) = 𝑓0−1 (𝑍) × {0} ∪ 𝑓1−1 (𝑍) × {1}.
Since 𝑋 is compact, 𝐹 −1 (𝑍) is compact. Hence Lemma 10.20, which follows from the classi-
fication of compact one-manifolds, implies that 𝜕𝐹 −1 (𝑍) must have an even number of points.
Thus, computing mod 2, we get
𝑓0 ∼ 𝑔0 ∼ 𝑔1 ∼ 𝑓1 , and hence 𝑓0 ∼ 𝑓1 .
Now that we have a solid notion of intersection numbers modulo 2 for maps and submanifolds,
let us return to the situation we started with:
𝐼2 (𝑋, 𝑍) ∶= 𝐼2 (𝑖, 𝑍)
∙ For example, on the torus 𝑌 = 𝕊1 × 𝕊1 , the two circles 𝕊1 × {1} and {1} × 𝕊1
have complimentary dimensions and nonzero mod 2 intersection number.
∙ An important and interesting example is the central circle 𝑋 on the open Möbius
band 𝑌 . See Figure 13.3. In Exercise 13.6 we show that 𝐼2 (𝑋, 𝑋) = 1. With-
out actually calculating the intersection number, we can already deduce from the
Intermediate Value Theorem in Calculus that we cannot continuously deform 𝑋
such that it does not intersect itself anymore.
Chapter 13. Intersection Theory modulo 2 273
Figure 13.3: We can construct the Möbius band by taking a rectangle and glue to opposite sides
after a twist. The central line then becomes a central circle on the band. If we move it a bit from
its initial position, the new curve has to intersect the original central line. This is a consequence
of the Intermediate Value Theorem, since the two ends of the curve must be on opposite sides
of the central line because of the twist.
Theorem 13.9 (Boundary Theorem for intersection numbers) Suppose that 𝑋 is the
boundary of some compact manifold 𝑊 and 𝑔 ∶ 𝑋 → 𝑌 is a smooth map. If 𝑔 can be
extended to all of 𝑊 , then 𝐼2 (𝑔, 𝑍) = 0 for every closed submanifold 𝑍 in 𝑌 which
satisfies dim 𝑋 + dim 𝑍 = dim 𝑌 .
We will see some applications of the previous two theorems in the exercises.
In this section we discuss an example for how intersection theory can be used to show the exis-
tence of non-trivial elements in the fundamental group. Recall from Section 9.2 real projective
𝑛-space ℝP𝑛 which consists of the set of equivalence classes [𝑥0 ∶ 𝑥1 ∶ … ∶ 𝑥𝑛 ] of (𝑛+1)-tuples
of real numbers with the equivalence relation
Recall that we showed that ℝP𝑛 is an n-dimensional smooth manifold. Alternatively, we can
describe ℝP𝑛 as a the quotient of 𝕊𝑛 under the equivalence relation 𝑥 ∼ −𝑥, i.e., we identify
antipodal points on 𝕊𝑛 to obtain ℝP𝑛 :
𝕊𝑛 → ℝP𝑛 , 𝑥 → [𝑥]
which sends a point 𝑥 in 𝕊𝑛 to its equivalence class [𝑥] in ℝP𝑛 . One can show that ℝP1 and 𝕊1
are diffeomorphic. However, this is the exception special to dimension one as we will now find
out.
We begin with a recollection of the following definition which we previously studied in the
exercises. See Exercise 8.3 and Exercise 10.7.
Remark 13.13 This implies that the fundamental group of 𝕊𝑛 , for all 𝑛 ≥ 2, is trivial,
i.e., 𝜋1 (𝕊𝑛 , 𝑥) = 0 for every 𝑥 ∈ 𝕊𝑛 .
In order to prove Theorem 13.12 we are going to use the following lemma:
Proof of Lemma 13.14: Let 𝑈 = 𝕊𝑛 ⧵{𝑝} and 𝑉 = 𝕊𝑛 ⧵{−𝑝}. Then 𝕊𝑛 is the union the two
open subsets 𝑈 and 𝑉 . Both 𝑈 and 𝑉 are diffeomorphic to ℝ𝑛 via stereographic projection. By
Lebesgue’s number lemma3 , we can find a subdivision 𝑏0 < ⋯ < 𝑏𝑚 of the unit interval [0, 1]
such that, for each 𝑖, the set 𝛾([𝑏𝑖−1 , 𝑏𝑖 ]) is contained either in 𝑈 or in 𝑉 . If there is an index
𝑖 such that 𝛾(𝑏𝑖 ) ∉ 𝑈 ∩ 𝑉 , i.e., if 𝛾(𝑏𝑖 ) ∈ {𝑝, −𝑝}, then we modify the subdivision as follows:
Each of the sets 𝛾([𝑏𝑖−1 , 𝑏𝑖 ]) and 𝛾([𝑏𝑖 , 𝑏𝑖+1 ]) is either contained in 𝑈 or in 𝑉 . If 𝛾([𝑏𝑖 ]) ∈ 𝑈 ,
then both of these sets must lie in 𝑈 . Then we may delete 𝑏𝑖 from our subdivision to obtain a
new subdivision 𝑐0 < … < 𝑐𝑚−1 that still satisfies the condition that 𝛾([𝑐𝑖−1 , 𝑐𝑖 ]) is contained
either in 𝑈 or in 𝑉 for each 𝑖. If 𝛾([𝑏𝑖 ]) ∈ 𝑉 we proceed similarly. After a finite number of
steps, this leads to a subdivision 𝑎0 < ⋯ < 𝑎𝑛 of [0, 1] such that 𝛾([𝑎𝑖−1 , 𝑎𝑖 ]) is contained either
in 𝑈 or in 𝑉 and 𝛾(𝑎𝑖 ) ∈ 𝑈 ∩ 𝑉 for each 𝑖.
As a consequence, we can show that the intersection theory for circles on 𝕊𝑛 is rather simple.
Example 13.15 (Intersecting circles on 𝕊2 ) Let us look at the concrete case for 𝕊2 .
We define a smooth map
𝑗 ∶ 𝕊1 → 𝕊2 , (cos(2𝜋𝑡), sin(2𝜋𝑡)) → (cos(2𝜋𝑡), sin(2𝜋𝑡), 0)
which maps the circle on the intersection of the sphere with the 𝑥𝑦-plane in ℝ3 . This map
is homotopic to the constant map 𝕊1 → 𝕊2 , 𝑡 → (0, 0, 1). We can show this directly us-
ing the homotopy ((cos(2𝜋𝑡), sin(2𝜋𝑡)), 𝑠) → (𝑠 cos(2𝜋𝑡), 𝑠 sin(2𝜋𝑡), 1−𝑠). Hence we can
smoothly
( 1 move 𝑗(𝕊1 ) away from itself and get that, on 𝕊2 , the self-intersection number
)
𝐼2 𝑗(𝕊 ), 𝑗(𝕊1 ) is zero.
3
A fact from general topology: Let (𝑋, 𝑑) be a compact metric space, and let be an open cover of 𝑋. Then
there is a number 𝜆 > 0 such that for every 𝑥 ∈ 𝑋, there is a 𝑈 ∈ with 𝐵𝜆 (𝑥) ⊂ 𝑈 .
276 13.4. Intersecting circles on 𝕊𝑛 versus ℝP𝑛
Our next goal is to use the fact that 𝕊𝑛 is simply-connected for 𝑛 ≥ 2 to show that 𝕊𝑛 and
ℝP𝑛 are not homeomorphic for 𝑛 ≥ 2. In fact, we will prove a stronger statement. We therefore
recall the following definition:
Lemma 13.17 Let 𝑋 and 𝑌 be topological spaces. Assume that 𝑋 and 𝑌 are homotopy
equivalent. Then 𝑋 is simply-connected if and only if 𝑌 is simply-connected.
Moreover, we have
Thus Γ is a homotopy between 𝛾 and the constant map with value 𝑓 (𝑐) ∈ 𝑌 . This shows that
𝑌 is simply-connected. By applying the argument with the roles of 𝑋 and 𝑌 reversed, we get
the assertion.
Chapter 13. Intersection Theory modulo 2 277
Proof: The case 𝑛 = 1 follows from ℝP1 ≅ 𝕊1 and the fact that 𝕊1 is not simply-connected.
This follows from deg2 (𝑧 → 𝑧) = 1. So now we assume 𝑛 ≥ 2. Our strategy is to find a smooth
map 𝑓 ∶ 𝕊1 → ℝP𝑛 and a suitable closed submanifold 𝑍 ⊂ ℝP𝑛 such that 𝐼2 (𝑓 , 𝑍) is defined
and non-zero. Then we know that 𝑓 is not smoothly homotopic to a constant map. For, if
it was, then 𝑓 would be homotopic to a constant map whose image was disjoint to 𝑍. Then
we would have 𝑓 − ⋔ 𝑍, and hence 𝐼2 (𝑓 , 𝑍) = 0. This is not possible if our strategy works
out. Then we use Whitney’s Approximation Theorem 12.20 which implies that if there was a
continuous homotopy between 𝑓 and a constant map, then there also was a smooth homotopy.
Since the latter does not exist, the former cannot exist either. So let us find suitable 𝑓 and 𝑍.
𝕊𝑛−1 is a closed subset in 𝕊𝑛 , 𝑍 is closed in ℝP𝑛 by definition of the quotient topology on ℝP𝑛 .
Hence the assumptions for applying intersection theory are satisfied.
𝐼2 (𝑓 , 𝑍) = #𝑓 −1 (𝑍) = 1
Theorem 13.18 implies that the fundamental group of ℝP𝑛 is non-trivial. Using techniques
from algebraic topology we can say even more:
Remark 13.19 (Fundamental group of ℝP𝑛 ) In Exercise 13.7 we will show that
2[𝑓 ] = 0 in 𝜋1 (ℝP2 ), and hence [𝑓 ] generates a subgroup ℤ∕2 inside 𝜋1 (ℝP2 ). One
can then check that 𝜋1 (ℝP2 ) ≅ ℤ∕2 and extend this computation, for example using
induction and the Seifert–van Kampen Theorem, to show
Example 13.20 (Intersecting circles on ℝP2 ) Let us look at the concrete case 𝑛 = 2.
The map 𝑓 of the above proof is given by
𝑓 ∶ 𝕊1 → ℝP2 , 𝑡 → [cos(𝑡∕2) ∶ sin(𝑡∕2) ∶ 0].
In Exercise 13.7, we study the map 𝑓 in more detail and check that 𝑓 is proper and
injective. The submanifold 𝑍
𝑍 = {[𝑥] = [𝑥0 ∶ 𝑥1 ∶ 𝑥2 ] ∈ ℝP2 |𝑥0 = 0}
corresponds to the image of 𝕊1 under the map
𝕊1 → ℝP2 , 𝑡 → [0 ∶ sin(𝑡∕2) ∶ cos(𝑡∕2)].
The intersection of the image of 𝑓 and 𝑍 therefore is the intersection of the images of
the two circles in ℝP2 . This intersection consists of the single point
[0 ∶ 1 ∶ 0] ∈ 𝑓 (𝕊1 ) ∩ 𝑍.
Chapter 13. Intersection Theory modulo 2 279
Hence, in ℝP2 , there is exactly one intersection point. If we had considered the map
and submanifold corresponding to 𝑓 and 𝑍 for 𝕊2 instead of ℝP2 , we would have gotten
two intersection points: (0, 1, 0) and (0, −1, 0). Hence on 𝕊2 , the corresponding mod 2
intersection number is zero. This fits well with our the fact that 𝕊2 is simply-connected.
On ℝP2 , however, we have 𝐼2 (𝑓 , 𝑍) which implies that it is impossible to move 𝑓 (𝕊1 )
and 𝑍 within ℝP2 such that they do not meet.
By Lemma 13.17, Theorem 13.12 and Theorem 13.18 imply that there cannot be a homo-
topy equivalence between 𝕊𝑛 and ℝP𝑛 for 𝑛 ≥ 2. Since there cannot be a homeomorphism
between spaces which are not homotopy equivalent, we have proven the following theorem:
We will now discuss an important and at first glance maybe surprising application of Theo-
rem 13.21. An algebra structure on ℝ𝑛 is a ℝ-bilinear multiplication map
ℝ𝑛 × ℝ𝑛 → ℝ𝑛 , (𝑎, 𝑏) → 𝑎𝑏.
Note that it is not required that this multiplication has an identity element. Such a map is called a
division algebra structure if there are no zero-divisors, i.e., 𝑎𝑏 = 0 implies 𝑎 = 0 or 𝑏 = 0. It is
called commutative if 𝑎𝑏 = 𝑏𝑎 for all 𝑎, 𝑏 ∈ ℝ𝑛 . Examples of finite-dimensional commutative
division algebras over ℝ with an identity element are given by ℝ and ℂ = ℝ2 . However, there
is not much more:
𝑓̄ ∶ ℝP𝑛−1 → 𝕊𝑛−1 .
∙ Claim: 𝑓̄ is injective.
4
The following proof follows Hatcher’s book [6] on Algebraic Topology.
280 13.5. ℝ𝑛 as a commutative division algebra
To show the claim we assume 𝑓 (𝑥) = 𝑓 (𝑦). This implies
√
|𝑥2 |
𝑥2 = 𝛼 2 𝑦2 for 𝛼 = > 0.
|𝑦2 |
Thus we have
𝑥2 − 𝛼 2 𝑦2 = 0.
Using commutativity and the fact that 𝛼 is a real number and multiplication is ℝ-bilinear we
see that this equation factors into
(𝑥 + 𝛼𝑦)(𝑥 − 𝛼𝑦) = 0.
𝑥 = ±𝛼𝑦.
Since |𝑥| = |𝑦| = 1 and 𝛼 is a real number, this implies |𝛼| = 1 and hence
𝑥 = ±𝑦.
Thus 𝑥 and 𝑦 determine the same point in ℝP𝑛−1 . This proves the claim that 𝑓̄ is injective.
Now we use the nice topological properties of ℝP𝑛−1 and 𝕊𝑛−1 . First we observe that 𝑓̄ is a
map between compact Hausdorff spaces. Since 𝑓̄ is injective, it is a homeomorphism onto its
image. Moreover, since ℝP𝑛−1 is compact, its image in 𝕊𝑛−1 is also compact and hence closed.
Since ℝP𝑛−1 and 𝕊𝑛−1 are smooth manifolds of the same dimension, invariance of domain5 ,
which we stated in Theorem 10.26, implies that the image is also open in 𝕊𝑛−1 . Since 𝕊𝑛−1 is
a connected, 𝑓̄ must be a homeomorphism. By Theorem 13.21, we know that for 𝑛 ≥ 3 there
cannot be a homeomorphism between ℝP𝑛−1 and 𝕊𝑛−1 . Hence our initial assumption must
have been wrong.
Remark 13.23 (Adams’ Theorem: Hopf invariant one) There is a vast generalisation
of the above result: Let ℝ𝑛 × ℝ𝑛 → ℝ𝑛 be a division algebra structure on ℝ𝑛 , without
assuming it is commutative. For 𝑛 = 4, there are the Hamiltonians, or Quaternions,
ℍ ≅ ℝ4 with a multiplication which is associative and almost as good as the one in ℂ
and ℝ, but it is not commutative. For 𝑛 = 8, there are the Octonions 𝕆 ≅ ℝ8 . The
multiplication is neither commutative nor associative. And that’s it! This is a deep
result. And surprisingly it is equivalent to a topological problem on the behavior of
tangent spaces on spheres. It was solved first by Adams. The prove goes way beyond
the methods of this class, unfortunately. However, it turned out to be not that hard using
complex 𝐾-theory. A key tool for the study is the Hopf invariant. We have seen its
mod 2 version in Section 11.3 and we will define the ℤ-valued version later via a series
of exercises in Section 15.4 using intersection theory and linking numbers.
5
Let 𝑈 ⊂ ℝ𝑛 be an open subset and ℎ ∶ 𝑈 → ℝ𝑛 an injective continuous map. Then ℎ(𝑈 ) is open in ℝ𝑛 .
Chapter 13. Intersection Theory modulo 2 281
13.6 Exercises and more examples
𝑓 𝑔
Exercise 13.1 Let 𝑋 ←←←→ ← 𝑍 be a sequence of smooth maps between manifolds
← 𝑌 ←←→
with 𝑋 compact, and let 𝑊 ⊂ 𝑍 be a submanifold. Assume that 𝑔 is transversal to 𝑊 ,
so that 𝑔 −1 (𝑊 ) is a submanifold in 𝑌 . Show that
𝐼2 (𝑓 , 𝑔 −1 (𝑊 )) = 𝐼2 (𝑔◦𝑓 , 𝑊 ).
In particular, verify that if one of these two intersection numbers is defined, then the other
one is defined as well.
(c) Show that 𝕊1 is not simply-connected. Recall that we call a manifold 𝑋 simply-
connected if it is connected and if every map of the circle 𝕊1 into 𝑋 is homotopic
to a constant map.
Hint: Consider the identity map.
Exercise 13.3 (a) Show that intersection theory is trivial in contractible boundaryless
manifolds: if 𝑌 is boundaryless and contractible, i.e., its identity map is homotopic
to a constant map, and dim 𝑌 > 0, then 𝐼2 (𝑓 , 𝑍) = 0 for every smooth map
𝑓 ∶ 𝑋 → 𝑌 such that 𝑋 compact and 𝑍 closed, dim 𝑋 ≥ 1 and dim 𝑋 + dim 𝑍 =
dim 𝑌 . In particular, intersection theory is trivial in Euclidean space.
(b) Prove that no compact boundaryless manifold - other than the one-point space - is
contractible.
Hint: Consider the identity map.
Exercise 13.5 Two compact manifolds 𝑋 and 𝑍 of the same dimension in 𝑌 are called
cobordant in 𝑌 if there exists a compact manifold with boundary 𝑊 ⊂ 𝑌 × [0, 1] such
that
𝜕𝑊 = 𝑋 × {0} ∪ 𝑍 × {1}.
(a) Show that if we can deform 𝑋 into 𝑍, i.e., if there is a smooth homotopy from the
embedding 𝑖0 ∶ 𝑋 → 𝑌 of 𝑋 in 𝑌 to an embedding 𝑖1 ∶ 𝑋 → 𝑌 with 𝑖1 (𝑋) = 𝑍
such that each 𝑖𝑡 is an embedding, then 𝑋 and 𝑍 are cobordant.
Note that the standard image of a cobordism, a pair of pants as in Figure 13.4,
illustrates that the converse is false: 𝑋 and 𝑍 are cobordant, but we cannot deform
𝑋 into 𝑍, since 𝑋 has one connected component whereas 𝑍 has two.
(b) Show that if 𝑋 and 𝑍 are cobordant in 𝑌 , then for every compact submanifold 𝐶
in 𝑌 with dimension complementary to 𝑋 and 𝑍, i.e., dim 𝑋 + dim 𝐶 = dim 𝑍 +
dim 𝐶 = dim 𝑌 (where dim 𝑋 = dim 𝑍 because they are cobordant), we have
Hint: Let 𝑓 be the restriction to 𝑊 of the projection map 𝑌 × [0, 1] → 𝑌 , and use
the Boundary Theorem 13.9.
Figure 13.4: A pair of pants is a cobordism between the upper boundary 𝑋 and the lower
boundary 𝑍. While 𝑋 is connected, 𝑍 has two connected components.
The following exercises require some computations for abstract manifolds. They shed light
on very interesting phenomenons and provide the proof for important results we mentioned in
the main text.
Exercise 13.7 Let 𝑋 be the open Möbius band given as the quotient of [−1, 1] × (−1, 1)
modulo the equivalence relation (−1, 𝑥) ∼ (1, −𝑥). Let 𝑍 denote the central circle given
by 𝑍 = {(𝑥, 0) ∶ 𝑥 ∈ [−1, 1]} ⊂ 𝑋.
(b) Conclude using Exercise 13.6 that there cannot be a submersion 𝑔 ∶ 𝑋 → ℝ such
that 𝑍 = 𝑔 −1 (0).
Exercise 13.8 Let 𝑓 ∶ 𝕊1 → ℝP2 be the map defined by sending 𝑡 ∈ [0, 2𝜋] to
[cos(𝑡∕2) ∶ sin(𝑡∕2) ∶ 0].
(a) Show that 𝑓 is a smooth embedding, i.e., explain why it is smooth, injective, and
proper.
(b) Write 𝑍 for the image of 𝑓 in ℝP2 and consider it as a submanifold of ℝP2 . Prove
that 𝐼2 (𝑍, 𝑍) = 1.
(d) We write 2𝑓 for the map 2𝑓 ∶ 𝕊1 → ℝP2 defined by sending 𝑡 ∈ [0, 2𝜋] to [cos(𝑡) ∶
sin(𝑡) ∶ 0]. Show that 2𝑓 is homotopic to a constant map by constructing a concrete
homotopy. Do you see why the homotopy works for 2𝑓 but not for 𝑓 ?
Aside: The observations for 𝑓 and 2𝑓 are an indicator for the fact that the fun-
damental group of ℝP2 is cyclic of order two, i.e., 𝜋1 (ℝP2 ) ≅ ℤ∕2, where 𝑓
represents the non-trivial element and [2𝑓 ] = 0. Apart from the formula for 2𝑓
this is another explanation of our choice for the name 2𝑓 .
14. Orientation
Our next goal is to improve our definition of degree and intersection number and remedy the
defect that they only have even and odd values.
One of the reasons for this limitation was that a homotopy can move a non-transversal inter-
section into, for example, either an empty intersection or an intersection in two points. The
idea to handle this phenomenon is to take into account in which direction the intersection hap-
pens and to count intersection points with signs:
Figure 14.1: We can again move the two circles in the plane and either move them apart or
make them intersect transversely. In both cases we get a stable number of intersection points.
However, 0 is still not equal 2. But this time we keep track of how the points are oriented by
looking at how the relation of the directions of tangent vectors change.
The technical solution to implement this idea is to introduce orientations. We will see that,
unfortunately, not all manifolds are orientable. For those manifolds that orientable, however,
we will introduce and study integer-valued invariants in the next chapters. In order to get a first
idea why the claimed solution might be reasonable we look back at an important example first.
284
Chapter 14. Orientation 285
If we knew in addition that the number deg(𝑓 ) is homotopy-invariant, i.e., it only depends on
the class of 𝑓 under the equivalence relation given by homotopy between maps, then we could
use the degree to distinguish between possibly infinitely many different homotopy classes of
maps 𝕊1 → 𝕊1 .
The way we came up with the formula deg2 (𝑓 ) = 𝑞 modulo 2 was to look at the graph of
𝑔 in Figure 11.5. Then we used the Intermediate Value Theorem and observed that the graph
crosses the horizontal lines 𝑔(𝑠) + 𝑘 for an integer 𝑘 ∈ {0, 1, … , 𝑞 − 1} an odd number of times
and lines 𝑔(𝑠) + 𝑘 for any other integer 𝑘 an even number of times. We would like to improve
this count by taking all the values with 𝑔(𝑡) = 𝑔(𝑠) + 𝑘 into account. But there might be many
more such values than just 𝑞. How can we fix this?
The idea is to remember how the graph of 𝑔 crosses the lines 𝑔(𝑠) + 𝑘: when 𝑡 increases
𝑔(𝑡) either increases while crossing 𝑔(𝑠) + 𝑘 or decreases. In other words, the derivative of 𝑔
is either positive or negative at 𝑡𝑘 with 𝑔(𝑡𝑘 ) = 𝑔(𝑠) + 𝑘. Note that we cannot have 𝑔 ′ (𝑡𝑘 ) = 0,
since 𝑦 is regular and hence det(𝑑𝑓𝑥 ) ≠ 0 and therefore also 𝑔 ′ (𝑡) ≠ 0 for all 𝑡 with 𝑝(𝑔(𝑡)) = 𝑦.
Now if 𝑔 ′ (𝑡𝑘 ) > 0, we count 𝑡𝑘 with value +1, if 𝑔 ′ (𝑡𝑘 ) < 0, we count 𝑡𝑘 with value −1.
See Figure 14.2. Counting all 𝑡 ∈ [𝑠, 𝑠 + 1) with 𝑔(𝑡) = 𝑔(𝑠) + 𝑘 for 𝑘 ∈ ℤ, this gives us the
desired value 𝑞. Hence if we can improve our definition of the degree such that we can use this
way of counting points on the graph of 𝑔, we would get deg(𝑓 ) = 𝑞.
Figure 14.2: The idea is to keep track of how the graph passes the horizontal lines: if 𝑔 increases
while passing the horizontal line we count the intersection as +1 and if 𝑔 decreases we count it
as −1.
To generalise this procedure, we need to describe 𝑔 ′ (𝑡𝑘 ) > 0 or 𝑔 ′ (𝑡𝑘 ) < 0 in more general
terms. In higher dimensions, we cannot just look at the sign of 𝑔 ′ (𝑡) or 𝑓 ′ (𝑡). A more fancy
way of describing the sign of the derivative of 𝑔 ′ (𝑡) is to say that the tangent space of the graph
of 𝑔 has a positive or a negative orientation. A line has exactly two orientations, depending
on if we walk in one direction or the other. It turns out that this can be generalised to higher
dimensional vector spaces and thereby to higher dimensional tangent spaces too. This is what
we are going to do next.
286 14.2. Orientation
14.2 Orientation
An orientation for a finite dimensional real vector space 𝑉 is an equivalence class of ordered
bases where the relation is defined as follows: the ordered basis (𝑣1 , … , 𝑣𝑛 ) has the same ori-
entation as the basis (𝑣′1 , … , 𝑣′𝑛 ) if the matrix 𝐴 with
It has the opposite orientation if det(𝐴) < 0. The fact that this an equivalence relation
follows from the multiplicativity of the determinant function. Thus each finite dimensional
vector space has precisely two orientations, corresponding to the two equivalence classes of
ordered bases.
Remark 14.1 (Warning) The ordering of the basis elements is essential. Interchang-
ing the positions of two basis vectors changes the sign of the orientation since the the
determinant of the corresponding permutation matrix is negative.
In the case of the zero dimensional vector space it is convenient to define an orientation
as the symbol +1 or −1.
Now we translate orientations from vector spaces to orientations for manifolds. The idea is to
orient each tangent space. However, this needs to be done in a compatible way.
We now introduce an important distinction to our terminology and distinguish between the
possibility to make a choice or that such a choice actually has been made:
Hence an oriented manifold is a pair consisting of a manifold together with a chosen ori-
entation.
Example 14.5 For zero-dimensional manifolds, orientations are very simple: to each
point 𝑥 ∈ 𝑋 we assign an orientation number +1 or −1.
Proof: We show in Exercise 14.4 that if 𝑑𝑓𝑥 preserves orientation at one point 𝑥, then 𝑓
preserves orientation at every point.
Then 𝑟𝑖 is an isomorphism which reverses the orientation. The composition 𝑟𝑖 ◦𝑟𝑗 of two
reflections is an isomorphism which preserves the orientation. This shows that −Idℝ𝑛 ,
288 14.2. Orientation
the negative of the identity of ℝ𝑛 , preserves the orientation if 𝑛 is even and −Idℝ𝑛 reverses
the orientation if 𝑛 is odd.
Now we can consider the reflection 𝑟𝑖 ∶ 𝕊𝑛 → 𝕊𝑛 on the 𝑛-sphere given by
The above observation on the composition of reflections and the negative of the identity
yield the following result:
Lemma 14.9 (Orientation and the antipodal map) The antipodal map 𝑎 ∶ 𝕊𝑛 → 𝕊𝑛 ,
𝑎(𝑥) = −𝑥, is a diffeomorphism which preserves the orientation if 𝑛 is odd and reverses
the orientation if 𝑛 is even.
Proof: Assume we are given two orientations on 𝑋. In fact, we know that there are at least
two, since given one, we can reverse signs everywhere and get another orientation. We need to
show that there are not more than two.
We do this by showing that the set of points at which two orientations agree and the set
where they disagree are both open. Consequently, two orientations of a connected manifold
are either identical or opposite.
𝑑(𝜙−1 ◦𝜙′ )0 ∶ ℝ𝑘 → ℝ𝑘
If the orientations on 𝑇𝑥 (𝑋) induced by 𝜙 and 𝜙′ , respectively, do not agree, the same
argument shows that the set of points where the orientations do not agree is open.
Definition 14.12 (Product orientation) Assume 𝑋 and 𝑌 are oriented and one of
them is without boundary. Then 𝑋 × 𝑌 is a manifold with boundary and inherits an
orientation in the following way: At a point (𝑥, 𝑦) ∈ 𝑋 × 𝑌 , let 𝛼 = (𝑣1 , … , 𝑣𝑘 ) and
𝛽 = (𝑤1 , … , 𝑤𝑚 ) be ordered bases of 𝑇𝑥 (𝑋) and 𝑇𝑦 (𝑌 ), respectively. We denote the
ordered basis ((𝑣1 , 0), … , (𝑣𝑘 , 0), (0, 𝑤1 ), … , (0, 𝑤𝑚 )) of 𝑇𝑥 (𝑋) × 𝑇𝑦 (𝑌 ) = 𝑇(𝑥,𝑦) (𝑋 × 𝑌 )
by (𝛼 × 0, 0 × 𝛽) . Now it comes handy that we related orientations of ordered bases to
signs. For we can define the orientation of 𝑇𝑥 (𝑋) × 𝑇𝑦 (𝑌 ) simply by determining a sign
by setting
Remark 14.13 (Classification) Not all manifolds are orientable. The most famous
example being the Möbius band. We will give a rigorous proof of this fact in Theo-
rem 14.16. Other examples of non-orientable manifolds include the Klein bottle and
the real projective plane ℝP2 . We will discuss the latter case in Theorem 14.15.
As a consequence, we see that the question whether a smooth manifold is orientable
or not helps classifying manifolds up to diffeomorphism: There is the class of orientable
manifolds, and the class of non-orientable manifolds.
(𝑔◦𝑓 )∗ 𝔬𝑍 = 𝑓 ∗ (𝑔 ∗ 𝔬𝑍 ) as orientations on 𝑋.
Proof: Since 𝑓 is a local diffeomorphism, at every point 𝑥 ∈ 𝑋, there are open subsets
𝑈 ⊂ 𝑋 and 𝑉 ⊂ 𝑌 with 𝑥 ∈ 𝑈 and 𝑓 (𝑥) ∈ 𝑉 such that 𝑓|𝑈 ∶ 𝑈 → 𝑉 is a diffeomorphism. By
choosing 𝑈 and 𝑉 small enough, we can assume that there are local parametrizations 𝜙 ∶ 𝑊 →
𝑈 ⊂ 𝑋 and 𝜓 ∶ 𝑊 → 𝑉 ⊂ 𝑌 such that 𝑓 ◦𝜙 = 𝜓 with 𝑊 ⊂ ℝ𝑛 open. Since 𝑌 is oriented,
we can choose 𝜓 such that 𝑑𝜓𝑤 determines the orientation on 𝑇𝑦 𝑌 for all 𝑦 = 𝜓(𝑤) in 𝑉 .
Since 𝑑𝑓𝑥 is an isomorphism for all 𝑥 by assumption, we can define an orientation on 𝑇𝑥 𝑋 by
requiring det(𝑑𝑓𝑥 ) > 0. This defines an orientation on 𝑇𝑥 𝑋 for all 𝑥 ∈ 𝑈 . It remains to check
that this orientation on the tangent spaces for points in 𝑈 ⊂ 𝑋 does not depend on the choice
of 𝜙. So let 𝜙′ ∶ 𝑊 ′ → 𝑈 ′ ⊂ 𝑋 be another local parametrization of 𝑋 with 𝑈 ∩ 𝑈 ′ ≠ ∅. Then
we can find a local parametrization 𝜓 ′ ∶ 𝑊 ′ → 𝑉 ′ ⊂ 𝑌 of 𝑌 such that 𝑓|𝑈 ′ ∶ 𝑈 ′ → 𝑉 ′ is a
diffeomorphism. Since 𝑌 is oriented, the orientation of 𝑇𝑦 𝑌 determined by 𝑑𝜓𝑤 and by 𝑑𝜓𝑤′ ,
with 𝜓(𝑤) = 𝑦 = 𝜓(𝑤′ ) are the same for all 𝑦 in 𝑉 ∩ 𝑉 ′ . In other words, we have
as well. Hence the orientation of 𝑇𝑥 𝑋 via 𝑑𝜙𝑤 and 𝑑𝜙′𝑤′ is the same for all 𝑤, 𝑤′ ∈ 𝑊 ∩ 𝑊
with 𝜙(𝑤) = 𝑥 = 𝜙′ (𝑤′ ). This shows that we have defined an orientation on 𝑋. The facts
that 𝑓 preserves orientations, uniqueness and functoriality follow from the construction of the
induced orientation.
Theorem 14.15 (Orientability of real projective space) Real projective space ℝP𝑛 is
orientable if and only if 𝑛 is odd.
This shows that ℝP𝑛 can be oriented only if 𝑎 preserves orientation. By Lemma 14.9 this is
the case if only if 𝑛 is odd. This proves that ℝP𝑛 is not orientable if 𝑛 is even.
Now we assume that 𝑛 is odd. We will now define an orientation of ℝP𝑛 . At every point
[𝑥] ∈ ℝP𝑛 , the fiber under 𝑞 consists of two antipodal points 𝑞 −1 ([𝑥]) = {𝑥, −𝑥} ⊂ 𝕊𝑛 . Given
a point 𝑥 ∈ 𝕊𝑛 , we can use stereographic projection from a point ≠ ±𝑥 on 𝕊𝑛 as one local
Chapter 14. Orientation 291
parametrization 𝜙 for both 𝑥 and −𝑥. The orientation of 𝑇𝑥 and 𝑇−𝑥 is then determined
𝕊𝑛 𝕊𝑛
such that the determinants of 𝑑𝜙𝜙−1 (𝑥) and 𝑑𝜙𝜙−1 (−𝑥) both have positive sign. Now we observe
that, for any choice of basis of 𝑇[𝑥] ℝP𝑛 , det(𝑑𝑞𝑥 ) and det(𝑑𝑞−𝑥 ) must have the same sign since
the products det(𝑑𝜙𝜙−1 (𝑥) ) ⋅ det(𝑑𝑞𝑥 ) and det(𝑑𝜙𝜙−1 (−𝑥) ) ⋅ det(𝑑𝑞−𝑥 ) must be either both positive
or both negative. This shows that we can define an orientation on 𝑇[𝑥] ℝP𝑛 by requiring that
det(𝑑𝑞𝑥 ) has positive sign and if we had used 𝑑𝑞−𝑥 instead of 𝑑𝑞𝑥 we would get the same
orientation. Since 𝕊𝑛 is oriented, this defines an orientation on ℝP𝑛 for 𝑛 odd.
Proof: Let 𝑋 denote the Möbius band. We assume that 𝑋 is orientable and assume that we
have chosen an orientation. We will now show that this leads to a contradiction to previously
obtained results. Let 𝑍 ⊂ 𝑋 be the central circle. Then 𝑍 is orientable as a circle, and it
is a submanifold of codimension one. By Exercise 14.6, this implies that the normal bundle
𝑁(𝑍, 𝑋) is trivial. By Theorem 12.15, this implies that there is a submersion 𝑔 ∶ 𝑈 → ℝ1
defined on an open subset 𝑈 ⊂ 𝑋 such that 𝑍 = 𝑔 −1 (0). By Exercise 13.7, such a 𝑔 does not
exist. This shows that 𝑋 cannot be equipped with an orientation.
It is now a long and technical endeavour to check how orientations behave under the main
constructions and relate to the concepts we have developed so far. We will go through them one
by one.
Let 𝑋 be an oriented smooth manifold with boundary. Then the boundary submanifold 𝜕𝑋
inherits an orientation as follows:
At every point 𝑥 ∈ 𝜕𝑋, 𝑇𝑥 (𝜕𝑋) is a subspace of codimension one in 𝑇𝑥 (𝑋). Its orthogo-
nal complement in 𝑇𝑥 (𝑋), is a line which contains exactly two unit vectors: one is pointing
inward into 𝑇𝑥 (𝑋), the other one is pointing outward away from 𝑇𝑥 (𝑋).
Figure 14.3: We define an outward normal vector 𝑛𝑥 as the image 𝑑𝜙0 (−𝑒𝑘 ) of the outward
pointing standard basis vector −𝑒𝑘 .
We denote the outward pointing unit normal vector by 𝑛𝑥 . We checked in Exercise 10.4
that the construction of 𝑛𝑥 does not depend on the choice of 𝜙 and that the assignment 𝑥 → 𝑛𝑥
is a smooth map on 𝜕𝑋. See Figure 14.4.
Since both the assignment 𝑥 → 𝑛𝑥 and the choice of sign for ordered bases on 𝑇𝑥 (𝑋) are
smooth, this defines an orientation on 𝜕𝑋 which is called the boundary orientation.
Figure 14.4: The outward pointing unit vector 𝑛𝑥 is the vector orthogonal to the tangent line at
𝑥 to 𝕊1 and pointing away from the origin.
1
Note that the inner product on 𝑇𝑥 (𝑋) is induced by the standard inner product on ℝ𝑁 , where 𝑋 ⊂ ℝ𝑁 and
hence 𝑇𝑥 (𝑋) ⊂ ℝ𝑁 .
Chapter 14. Orientation 293
Example 14.19 (Unit interval) Let us look at the compact interval 𝑋 = [0, 1] with its
standard orientation inherited from being a subset in ℝ. Note that local parametrizations
of [0, 1] are given by
As an application of product and boundary orientations, we would like to orient the product
[0, 1] × 𝑋 for a boundaryless smooth oriented manifold 𝑋 which is the domain of all homo-
topies on 𝑋. This will be crucial for the homotopy invariance of intersection numbers in the
next section.
We just learned that products and boundaries inherit orientations. For each 𝑡 ∈ [0, 1], the
slice 𝑋𝑡 ∶= {𝑡} × 𝑋 is diffeomorphic to 𝑋, and the orientation on 𝑋𝑡 should be such that the
diffeomorphism
For future applications, we are particularly interested in the orientation of the boundary
We start with 𝑋1 : We see from the local parametrization 𝜓 above that along 𝑋1 the outward-
pointing normal vector is
If we calculate the orientation induced from the product structure, then we get
We learn from these two equations, that the boundary orientation of 𝑋1 is just the orien-
tation of 𝑋 as a copy in the product [0, 1] × 𝑋.
We see from the local parametrization 𝜙 that along 𝑋0 the outward-pointing normal vector
is
𝑛(0,𝑥) = (−1, 0) = (−1, 0, … , 0) ∈ 𝑇0 ([0, 1]) × 𝑇𝑥 (𝑋).
Hence the orientation on 𝑇0 ([0, 1]) is opposite to the standard orientation of ℝ. Hence the
formula for product orientations yields
sign ((−1, 0), 0 × 𝛽)) = sign (−1) ⋅ sign (𝛽) = −sign (𝛽).
Thus the boundary orientation on 𝑋0 is the reverse of its orientation as a copy of 𝑋 in the
product [0, 1] × 𝑋.
𝜕([0, 1] × 𝑋) = 𝑋1 ∪ (−𝑋0 ).
𝜕([0, 1] × 𝑋) = 𝑋1 − 𝑋0 .
Our next goal is to orient preimages. In order to do so, we will have to look at direct sums (not
just products) of vector spaces, and we need to orient those guys.
Chapter 14. Orientation 295
Figure 14.5: The vectors 𝑛𝑥0 and 𝑛𝑥1 point in opposite directions. This provides the two bound-
ary circles with opposite orientations. This fact will turn out to be crucial for the definition of
intersection numbers later.
It follows immediately from the way matrices on direct sums are put together that
this formula determines an orientation on the third space if two orientations are given.
Note again that the order of the summands 𝑉1 and 𝑉2 is crucial.
a
Note that this not only means, orientations on 𝑉1 and 𝑉2 determine an orientation on 𝑉 , but also ori-
entations on 𝑉 and, say, 𝑉2 determine an orientation on 𝑉1 .
If 𝑓 (𝑥) = 𝑧 ∈ 𝑍, then
Hence, by Definition 14.21 on the orientation of a direct sum, we only need to choose an ori-
296 14.6. Example: Fibers of the Hopf fibration
entation on 𝑁𝑥 (𝑆; 𝑋) to obtain a direct sum orientation on 𝑇𝑥 (𝑆). Since 𝑓 −
⋔ 𝑍, we have
where we use 𝑑𝑓𝑥 (𝑇𝑥 (𝑆)) = 𝑇𝑧 (𝑍) for the last step. Thus the orientations on 𝑍 and 𝑌 induce
a direct sum orientation on 𝑑𝑓𝑥 (𝑁𝑥 (𝑆; 𝑋)). It remains to show that this also induces an
orientation on 𝑁𝑥 (𝑆; 𝑋): We have
and hence the restriction of 𝑑𝑓𝑥 to 𝑁𝑥 (𝑆; 𝑋) is in fact an isomorphism onto its image. There-
fore the induced orientation on 𝑑𝑓𝑥 (𝑁𝑥 (𝑆; 𝑋)) defines an orientation on 𝑁𝑥 (𝑆; 𝑋) via the iso-
morphism 𝑑𝑓𝑥 . Since the orientations on 𝑋, 𝑌 and 𝑍 vary smoothly and 𝑑𝑓𝑥 also depends
smoothly on 𝑥, the induced orientation on 𝑇𝑥 (𝑆) varies smoothly with 𝑥.
Then we define the sign of 𝛽𝑇𝑥 𝑓 −1 (𝑍) and hence the orientation of 𝑓 −1 (𝑍) such that
( ) ( ) ( )
sign 𝛽𝑁𝑥 (𝑓 −1 (𝑍);𝑋) ⋅ sign 𝛽𝑇𝑥 𝑓 −1 (𝑍) = sign 𝛽𝑇𝑥 𝑋 .
Remark 14.23 (Orthogonality is not important) Note that we did not really use
that 𝑁𝑥 (𝑆; 𝑋) is orthogonal to 𝑇𝑥 (𝑆). All we needed was a direct sum decomposition
𝐻 ⊕ 𝑇𝑥 (𝑆) = 𝑇𝑥 (𝑋) with a space 𝐻 with an orientation induced by the orientation of
𝑋. We will exploit this fact in the proof below.
Recall the Hopf fibration 𝜋 that we have seen previously: We consider 𝕊3 as a subset of ℂ2 ,
i.e., 𝕊3 = {(𝑧0 , 𝑧1 ) ∈ ℂ2 ∶ |𝑧0 |2 + |𝑧1 |2 = 1}, and 𝕊2 as a subset of ℂ × ℝ, i.e., 𝕊2 = {(𝑧, 𝑥) ∈
ℂ × ℝ ∶ |𝑧|2 + 𝑥2 = 1}. Then the Hopf fibration 𝜋 is the map 𝕊3 → 𝕊2 given by
( )
𝜋(𝑧0 , 𝑧1 ) = 2𝑧0 𝑧̄ 1 , |𝑧0 |2 − |𝑧1 |2 .
Chapter 14. Orientation 297
Consider the point 𝑎 = (0, 0, 1) on 𝕊2 ⊂ ℝ3 ≅ ℂ × ℝ. In a previous exercise we have
determined the fiber 𝜋 −1 (𝑎). Now we would like to compute its orientation as a preimage
under 𝜋.
Let 𝑞 = (𝑥0 , 𝑦0 , 0, 0) ∈ 𝜋 −1 (𝑎) be a point in the fiber over 𝑎. The tangent space 𝑇𝑞 𝕊3 is the
vector space
𝑇𝑞 𝕊3 = {𝐮 ∈ ℝ4 ∶ 𝐮 ⟂ 𝑞}
⎧ ⎛−𝑦0 ⎞ ⎛0⎞ ⎛0⎞⎫
⎪ ⟂ ⎜ 𝑥0 ⎟ 4 ⎜0⎟ 4 ⎜0⎟⎪
= span ⎨𝑞 = ⎜ ⎟ , 𝐞3 = ⎜1⎟ , 𝐞4 = ⎜0⎟⎬ .
⎪ ⎜ 0 ⎟ ⎜ ⎟ ⎜ ⎟⎪
⎩ ⎝ 0 ⎠ ⎝0⎠ ⎝1⎠⎭
The orientation of 𝑇𝑞 𝕊3 as a boundary of the unit ball is such that the outward pointing
vector 𝑞 together with the basis vectors of 𝑇𝑞 𝕊3 form a positively oriented basis of ℝ4 . The
determinant of the matrix expressing the basis (𝑞, 𝑞 ⟂ , 𝐞43 , 𝐞44 ) in the standard basis of ℝ4 equals
𝑥20 + 𝑦20 = 1 > 0. In particular, it is positive and the basis (𝑞 ⟂ , 𝐞43 , 𝐞44 ) is a positively oriented
basis of 𝑇𝑞 𝕊3 .
The tangent space 𝑇𝑞 𝜋 −1 (𝑎) equals the kernel of 𝑑 𝜋̃𝑞 , where we consider the map 𝜋̃ ∶ ℝ4 ≅
ℂ2 → ℂ × ℝ ≅ ℝ3 using the same formula as for 𝜋, i.e., 𝜋 = 𝜋̃|𝕊3 . In an exercise we computed
this map as represented by the matrix
⎛ 0 0 𝑥0 𝑦0 ⎞
𝑑 𝜋̃𝑞 = 2 ⋅ ⎜ 0 0 𝑦0 −𝑥0 ⎟ .
⎜ ⎟
⎝𝑥0 𝑦0 0 0 ⎠
The kernel of this map is the span of the vector 𝑞 ⟂ that we have just seen above. The normal
space 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ) ⊂ 𝑇𝑞 𝕊3 of vectors which are orthogonal to 𝑇𝑞 𝜋 −1 (𝑎) is the span of (𝐞43 , 𝐞44 ).
The map 𝑑 𝜋̃𝑞 sends 𝐞3 and 𝐞4 to, respectively,
⎛𝑥0 ⎞ ⎛ 𝑦0 ⎞
⎜ 𝑦 ⎟ ⎜−𝑥 ⎟
𝑑 𝜋̃𝑞 (𝐞43 ) = 2 ⎜ 0 ⎟ , 𝑑 𝜋̃𝑞 (𝐞44 ) = 2 ⎜ 0 ⎟ .
⎜0⎟ ⎜ 0 ⎟
⎝0⎠ ⎝ 0 ⎠
We need to check the orientation of this basis: The tangent space 𝑇𝑎 𝕊2 has a basis (𝐞31 , 𝐞32 )
as a subspace in ℝ3 . This basis is positively oriented since, together with the outward pointing
vector 𝑎 = 𝐞33 , the basis (𝐞33 , 𝐞31 , 𝐞32 ) is a positively oriented basis of ℝ3 .2 The matrix 𝐴 which
expresses (𝑑 𝜋̃𝑞 (𝐞33 ), 𝑑 𝜋̃𝑞 (𝐞44 )) in terms of the basis (𝐞31 , 𝐞32 ) is given by
( )
𝑥0 𝑦0
𝐴=2⋅ .
𝑦0 −𝑥0
2
We make two permutations which lead to multiplying with (−1)2 = +1.
298 14.7. Orientation on boundary of preimage
We see that det 𝐴 = 4(−𝑥20 − 𝑦20 ) = −4 < 0 is negative. Hence the basis (𝑑 𝜋̃𝑞 (𝐞33 ), 𝑑 𝜋̃𝑞 (𝐞44 )) is a
negatively oriented basis of 𝑇𝑎 𝕊2 . This defines an orientation on the normal space 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 )
by declaring the orientation of the basis (𝐞43 , 𝐞44 ) to be negative.
𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ) ⊕ 𝑇𝑞 𝜋 −1 (𝑎) = 𝑇𝑞 𝕊3
We check this by looking at the basis (𝐞43 , 𝐞44 , 𝑞 ⟂ ) of 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ) ⊕ 𝑇𝑞 𝜋 −1 (𝑎). As a basis
of 𝑇𝑞 𝕊3 this basis is positively oriented, since it arises by two permutations from the positively
oriented basis (𝑞 ⟂ , 𝐞43 , 𝐞44 ). Since the sign of (𝐞43 , 𝐞44 ) is negative as a basis of 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ), we
need that 𝑞 ⟂ also has negative sign. Hence the vector 𝑞 ⟂ provides a negatively oriented basis
of 𝑇𝑞 𝜋 −1 (𝑎).
We will compute the orientation of the fiber of 𝜋 at another point in the exercises. You
should try to solve this exercise even though it is a bit involved.
It turns out that there is a formula that relates these two orientations:
𝐻 ⊕ 𝑇𝑥 (𝜕𝑆) = 𝑇𝑥 (𝜕𝑋).
𝐻 ⊕ 𝑇𝑥 (𝑆) = 𝑇𝑥 (𝑋).
For we have
and
Hence we may use 𝐻 to define the direct sum orientation of both 𝑆 and 𝜕𝑆 at 𝑥.
Since 𝐻 ⊂ 𝑇𝑥 (𝜕𝑋) ⊂ 𝑇𝑥 (𝑋), the maps 𝑑𝑓𝑥 and 𝑑(𝜕𝑓 )𝑥 agree on 𝐻, i.e.,
the intersection Ker (𝑑𝑓𝑥 ) ∩ 𝐻 is {0}. Hence the restrictions of 𝑑𝑓𝑥 and 𝑑(𝜕𝑓 )𝑥 to 𝐻 are
isomorphisms onto their common image.
Thus 𝑓 − ⋔ 𝑍 and 𝜕𝑓 − ⋔ 𝑍 imply that we have two direct sum decompositions 𝑑𝑓𝑥 (𝐻) ⊕
𝑇𝑧 (𝑍) = 𝑇𝑧 (𝑌 ) = 𝑑(𝜕𝑓 )𝑥 (𝐻) ⊕ 𝑇𝑧 (𝑍), and the two orients of 𝐻 via these direct sums agree.
To conclude, we obtained that 𝐻 has a well-defined orientation. Hence we can use this
unique orientation on 𝐻 to orient
It remains to check how this orientation of 𝑇𝑥 (𝜕𝑆) relates to the orientation of the boundary
induced from the orientation of 𝑇𝑥 (𝑆). Let 𝑛𝑥 be the outward unit vector to 𝜕𝑆 in 𝑇𝑥 (𝑆), and
let ℝ ⋅ 𝑛𝑥 represent the one-dimensional subspace spanned by 𝑛𝑥 . We orient this space by
assigning the sign +1 to the basis (𝑛𝑥 ). Even though 𝑛𝑥 need not be perpendicular to all of
𝑇𝑥 (𝜕𝑋), it suffices to know that 𝑛𝑥 lies in the half-space pointing away from 𝑇𝑥 (𝑋) to know
that the orientations of ℝ ⋅ 𝑛𝑥 , 𝑇𝑥 (𝜕𝑋) and 𝑇𝑥 (𝑋) are related by the direct sum
𝑇𝑥 (𝑋) = ℝ ⋅ 𝑛𝑥 ⊕ 𝑇𝑥 (𝜕𝑋).
Now we use that 𝐻 is complementary to both 𝑇𝑥 (𝑆) in 𝑇𝑥 (𝑋) and 𝑇𝑥 (𝜕𝑆) in 𝑇𝑥 (𝜕𝑋) and
plugg this into the above direct sum to get
𝐻 ⊕ 𝑇𝑥 (𝑆) = ℝ ⋅ 𝑛𝑥 ⊕ 𝐻 ⊕ 𝑇𝑥 (𝜕𝑆).
This equation is already almost what we need, since we would like to compare the orientations
𝑇𝑥 (𝑆) and ℝ ⋅ 𝑛𝑥 ⊕ 𝑇𝑥 (𝜕𝑆). For doing so, we need to move ℝ ⋅ 𝑛𝑥 passed 𝐻. If dim 𝐻 = 𝑚, 𝐻
300 14.8. Example: Simply-connected manifolds are orientable
has 𝑚 basis vectors (𝑤1 , … , 𝑤𝑚 ). Remembering the rule for orienting direct sums, this means
we have to apply exactly 𝑚 transpositions to the ordered set
Since 𝐻 appears on both sides as the first summand, we can disregard it for the computation
and get that if 𝜕𝑆 is oriented as a preimage under 𝜕𝑓 , then its orientation relates to the one of
𝑇𝑥 (𝑆) by
𝑇𝑥 (𝑆) = ℝ ⋅ 𝑛𝑥 ⊕ 𝑇𝑥 (𝜕𝑆).
Thus
The following theorem shows that a lot of manifolds are orientable. Recall that a manifold 𝑋
is called simply-connected if it is connected and every smooth map 𝕊1 → 𝑋 is homotopic to a
constant map.
Proof: We start by picking any point 𝑥 ∈ 𝑋, and choose an orientation for the tangent
space 𝑇𝑥 (𝑋). Since 𝑇𝑥 (𝑋) is a vector space, this is always possible. Now let 𝑦 ∈ 𝑋 be any
other point in 𝑋. Since 𝑋 is simply-connected, 𝑋 is connected. By a previous exercise, since
𝑋 is a smooth manifold, 𝑋 is therefore even path-connected. Hence there is a smooth map
𝛾 ∶ [0, 1] → 𝑋 with 𝛾(0) = 𝑥 and 𝛾(1) = 𝑦. For every point in 𝑧 ∈ 𝛾([0, 1]) we choose a local
parametrization 𝜙𝑧 ∶ 𝑉𝑧 → 𝑈𝑧 around 𝑧. By shrinking 𝑉𝑧 if necessary, we can assume that
each 𝑉𝑧 is an open ball in ℝ𝑘 .
The sets 𝑈𝑧 ∩ 𝛾([0, 1]) is open in 𝛾([0, 1]), and the collection of {𝑈𝑧 ∩ 𝛾([0, 1])} for all
𝑧 ∈ 𝛾([0, 1]) is an open covering of 𝛾([0, 1]). Since [0, 1] is compact and 𝛾 continuous, the
image 𝛾([0, 1]) is compact. Hence finitely many of the 𝑈𝑧 suffice to cover 𝛾([0, 1]). We label
these open sets 𝑈1 , … , 𝑈𝑚 and order them such that 𝑈𝑖 ∩ 𝑈𝑖+1 ≠ ∅ and 𝑥 ∈ 𝑈1 , 𝑦 ∈ 𝑈𝑚 .
For 𝑈1 , we choose the orientation which is compatible with the chosen orientation of
𝑇𝑥 (𝑋). This means: Let 𝜙1 ∶ 𝑈1 → 𝑋 be the associated local parametrization with 𝜙1 (0) = 𝑥.
Chapter 14. Orientation 301
∙ If 𝑑(𝜙1 )0 ∶ ℝ𝑘 → 𝑇𝑥 (𝑋) is orientation preserving, we orient the vector space 𝑇𝑎 (𝑈1 )
such that 𝑑(𝜙1 )𝜙−1 (𝑧) ∶ ℝ𝑘 → 𝑇𝑎 (𝑋) is orientation preserving for all 𝑎 ∈ 𝑈1 .
1
Hence after possibly replacing 𝜙1 with 𝜙̃ 1 , we can assume that 𝑑(𝜙1 )0 is orientation pre-
serving, and we orient all 𝑇𝑎 (𝑈1 ) as above. For 𝑈2 , we choose the orientation which is compat-
ible with the orientation of the 𝑇𝑎 (𝑋) for all points 𝑎 ∈ 𝑈1 ∩ 𝑈2 : If 𝑑(𝜙2 )𝜙−1 (𝑎) is orientation
2
preserving on 𝑇𝑎 (𝑋) for 𝑎 ∈ 𝑈1 ∩ 𝑈2 , we orient 𝑇𝑎 (𝑋) such that 𝑑(𝜙2 )𝜙−1 (𝑎) ∶ ℝ𝑘 → 𝑇𝑎 (𝑋)
2
is orientation preserving for all 𝑎 ∈ 𝑈2 . If it is not orientation preserving, then we replace
𝜙2 (𝑣) with 𝜙2 (−𝑣). Continuing this way, we obtain an orientation for 𝑈𝑚 and therefore an
orientation for 𝑇𝑦 (𝑋) after finitely many steps. See Figure 14.6.
It remains to show that the induced orientation on 𝑇𝑦 (𝑋) does not depend on the choice
of 𝛾 and the 𝑈𝑖 ’s. So let 𝜔 ∶ [0, 1] → 𝑋 be another smooth path with 𝜔(0) = 𝑥 and 𝜔(1) = 𝑦.
As for 𝛾, we choose open sets 𝑊1 , … , 𝑊𝑙 covering all points in 𝜔([0, 1]) with 𝑥 ∈ 𝑊1 and
𝑦 ∈ 𝑊𝑙 and 𝑊𝑖 ∩ 𝑊𝑖+1 ≠ ∅. Then we orient 𝑇𝑦 (𝑋) following the same procedure using the
𝑊𝑖 ’s.
Arriving at 𝑦, we do not know a priori whether the orientation of 𝑇𝑦 (𝑋) induced by 𝛾 and
the orientation of 𝑇𝑦 (𝑋) induced by 𝜔 agree. But now we use that 𝑋 is simply-connected.
Figure 14.6: We choose a path from 𝑥 to 𝑦 and open sets which overlap the path. Then we orient
the tangent spaces such that everything is compatible on overlaps. Simply-connectedness will
make sure that the choice of path and open sets did not matter, since we can shrink any path to
a point.
For, walking first along 𝛾 and then back on 𝜔 defines, after readjusting the speed and
smoothing things out, a loop 𝛼 ∶ [0, 1] → 𝑋 with 𝛼(0) = 𝑥 = 𝛼(1), i.e., a smooth map 𝛼 ∶ 𝕊1 →
302 14.9. Summary
𝑋. Walking along 𝛼, we obtain an isomorphism
≅
𝑇 (𝛼) ∶ 𝑇𝑥 (𝑋) = 𝑇𝛼(0) (𝑋) ←←←→
← 𝑇𝛼(1) (𝑋) = 𝑇𝑥 (𝑋)
by composing
𝑑(𝜙1 )−1
∙ 𝑑(𝜙2 )∙ 𝑑(𝜙2 )−1
∙ 𝑑(𝜙2 )∙ 𝑑(𝜓𝑚−1 )−1
∙ 𝑑(𝜓𝑚 )∙
← ℝ𝑘 ←←←←←←←←←←→
𝑇𝑥 (𝑋) ←←←←←←←←←←←←←→ ← ℝ𝑘 ←←←←←←←←←←→
← 𝑇𝑧 (𝑋) ←←←←←←←←←←←←←→ ← ℝ𝑘 ←←←←←←←←←←←→
← ⋯ ←←←←←←←←←←←←←←←←←→ ← 𝑇𝑥 (𝑋)
where the subscript ∙ stands for the varying points at which we take derivatives.
For each 𝑡 ∈ [0, 1], 𝐹 (−, 𝑡) defines a smooth loop from 𝑥 to 𝑥. Using the above procedure
for orienting tangent spaces along a path, we obtain an isomorphism
≅
← 𝑇𝐹 (1,𝑡) (𝑋) = 𝑇𝑥 (𝑋) for each 𝑡 ∈ [0, 1].
𝑇 (𝐹 (−, 𝑡)) ∶ 𝑇𝑥 (𝑋) = 𝑇𝐹 (0,𝑡) (𝑋) ←←←→
which is continuous, since each point of 𝑋 is contained an open neighborhood on which the
orientation is determined by the derivatives of local parametrizations, and these derivatives
vary smoothly with the base-points.
Since each 𝑇 (𝐹 (−, 𝑡)) is an isomorphism, its determinant is either strictly positive > 0 or
strictly negative < 0. Since [0, 1] is connected and 𝑡 → det(𝑇 (𝐹 (−, 𝑡))) is continuous, we have
either det(𝑇 (𝐹 (−, 𝑡))) > 0 or det(𝑇 (𝐹 (−, 𝑡))) < 0 for all 𝑡 ∈ [0, 1].
Hence we must have det(𝑇 (𝐹 (−, 𝑡))) > 0 for all 𝑡 ∈ [0, 1]. In other words, 𝑇 (𝐹 (−, 𝑡)) must
be orientation preserving for all 𝑡, and in particular, 𝑇 (𝛼) is orientation preserving.
This shows that the orientation of 𝑇𝑦 (𝑋) does not depend on the choice of 𝛾.
14.9 Summary
Let us summarise the key points we should remember from this chapter:
Chapter 14. Orientation 303
∙ An orientation of a vector space is a choice of a sign, +1 or −1, for an equivalence of
orderings of a bases. We can think of it as choosing a positive and negative direction.
∙ Orientations help us classifying manifolds: there is a box with orientable and a box with
non-orientable manifolds.
∙ For any compact oriented one-dimensional manifold with boundary, the sum of the
orientation numbers at the boundary points is zero.
𝜕([0, 1] × 𝑋) = 𝑋1 − 𝑋0 .
This is the key point for defining homotopy invariant intersection numbers soon.
(a) Show that replacing one 𝑣𝑖 by a multiple 𝑐𝑣𝑖 yields an equivalently oriented ordered
basis if 𝑐 > 0, and an oppositely oriented one if 𝑐 < 0.
(b) Show that transposing two elements, i.e., interchanging the places of 𝑣𝑖 and 𝑣𝑗 for
𝑖 ≠ 𝑗, yields an oppositely oriented ordered basis.
(c) Show that subtracting from one 𝑣𝑖 a linear combination of the others yields an
equivalently oriented ordered basis.
(d) Suppose that 𝑉 is the direct sum of 𝑉1 and 𝑉2 . Show that the direct sum orientation
of 𝑉 from 𝑉1 ⊕ 𝑉2 equals (−1)(dim 𝑉1 )(dim 𝑉2 ) times the orientation from 𝑉2 ⊕ 𝑉1 .
Exercise 14.2 The upper half space ℍ𝑘 is oriented by the standard orientation of ℝ𝑘 .
Thus 𝜕ℍ𝑘 acquires a boundary orientation. But 𝜕ℍ𝑘 may be identified with ℝ𝑘−1 . Show
that the boundary orientation agrees with the standard orientation of ℝ𝑘−1 if and only if
𝑘 is even.
(a) Write down the orientation of 𝕊2 as the boundary of the closed unit ball 𝔹3 in
ℝ3 , by specifying a positively oriented ordered basis for the tangent space at each
(𝑎, 𝑏, 𝑐) ∈ 𝕊2 .
(b) Show that the boundary orientation of 𝕊𝑘 equals the orientation of 𝕊𝑘 = 𝑔 −1 (1) as
the preimage under the map
𝑔 ∶ ℝ𝑘+1 → ℝ, 𝑥 → |𝑥|2 .
Exercise 14.7 (a) Let 𝑉 be a vector space. Show that both orientations on 𝑉 define
the same product orientation on 𝑉 × 𝑉 .
(c) Suppose that 𝑋 is not orientable. Show that 𝑋 × 𝑌 is never orientable, no matter
what manifold 𝑌 may be. In particular, 𝑋 × 𝑋 is not orientable.
Hint: First show that 𝑋 × ℝ𝑚 is not orientable, and then use that every 𝑌 has an
open subset diffeomorphic to ℝ𝑚 .
(d) Prove that there exists a natural orientation on some neighborhood of the diagonal
Δ in 𝑋 × 𝑋, whether or not 𝑋 can be oriented.
But note that Δ itself is orientable if and only if 𝑋 × 𝑋 is orientable. Why?
Hint: Cover a neighborhood of Δ by local parametrizations 𝜙 × 𝜙 ∶ 𝑈 × 𝑈 →
𝑋 × 𝑋, where 𝜙 ∶ 𝑈 → 𝑋 is a local parametrization of 𝑋, then apply the previous
observations.
Exercise 14.8 Recall the Hopf fibration 𝜋 that we have seen previously: We consider
𝕊3 as a subset of ℂ2 , i.e., 𝕊3 = {(𝑧0 , 𝑧1 ) ∈ ℂ2 ∶ |𝑧0 |2 + |𝑧1 |2 = 1}, and 𝕊2 as a subset
of ℂ × ℝ, i.e., 𝕊2 = {(𝑧, 𝑥) ∈ ℂ × ℝ ∶ |𝑧|2 + 𝑥2 = 1}. Then the Hopf fibration 𝜋 is the
map 𝕊3 → 𝕊2 given by
( )
𝜋(𝑧0 , 𝑧1 ) = 2𝑧0 𝑧̄ 1 , |𝑧0 |2 − |𝑧1 |2 .
We have seen in Section 11.1 that the mod 2 degree is a powerful invariant of smooth maps
between manifolds. Now we remedy the defect that it was merely an element in ℤ∕2 and define
an integer-valued degree. We will then study several applications.
Then the degree of 𝑓 is defined as follows: Let 𝑥 ∈ 𝑋 be a regular point of 𝑓 . With our
assumptions this means that 𝑑𝑓𝑥 ∶ 𝑇𝑥 𝑋 → 𝑇𝑓 (𝑥) 𝑌 is a linear isomorphism between oriented
vector spaces. The sign of 𝑑𝑓𝑥 is defined to be +1 if 𝑑𝑓𝑥 preserves orientations and it is defined
to be −1 if 𝑑𝑓𝑥 reverses orientations. Now, for a regular value 𝑦 ∈ 𝑌 of 𝑓 , we define the integer
∑
deg(𝑓 ; 𝑦) = sign 𝑑𝑓𝑥 .
𝑥∈𝑓 −1 (𝑦)
Figure 15.1: We calculate the degree by counting the number of points in the fiber of a regular
value. The number is well-defined if we take signs determined by orientation into account. In
dimension one, this boils down to looking at the sign of the derivative or by checking whether
the graph hits the horizontal line corresponding to a value from below or above.
306
Chapter 15. The Brouwer Degree 307
Proof: By the Stack of Records Theorem 4.14, we can find a neighborhood 𝑈 of 𝑦 such
that the preimage 𝑓 −1 (𝑈 ) is a disjoint union 𝑉1 ∪ ⋯ ∪ 𝑉𝑛 , where each 𝑉𝑖 is an open set in 𝑋
mapped by 𝑓 diffeomorphically onto 𝑈 . Hence, for all points 𝑧 ∈ 𝑈 , we have #𝑓 −1 ({𝑧}) = 𝑛.
It remains to take orientations into account. Since 𝑓|𝑉𝑖 ∶ 𝑉𝑖 → 𝑈 is a diffeomorphism, we
know that
𝑑𝑓𝑥𝑖 ∶ 𝑇𝑥𝑖 (𝑋) → 𝑇𝑦 (𝑌 )
is an isomorphism. Now both 𝑇𝑥𝑖 (𝑋) and 𝑇𝑦 (𝑌 ) are oriented, and hence 𝑑𝑓𝑥𝑖 is either orientation
preserving or reversing. But by our definition of orientations on manifolds, we have
∙ either det(𝑑𝑓𝑥𝑖 ) > 0 and hence, for all 𝑧 ∈ 𝑈 , det(𝑑𝑓𝑤𝑖 ) > 0, where 𝑤𝑖 is the unique
point in 𝑉𝑖 with 𝑓 (𝑤𝑖 ) = 𝑧; in other words, 𝑑𝑓𝑤𝑖 preserves orientations for all points
𝑤𝑖 ∈ 𝑉𝑖 ;
∙ or det(𝑑𝑓𝑥𝑖 ) < 0 and hence, for all 𝑧 ∈ 𝑈 , det(𝑑𝑓𝑤𝑖 ) < 0, where 𝑤𝑖 is the unique point
in 𝑉𝑖 with 𝑓 (𝑤𝑖 ) = 𝑧; in other words, 𝑑𝑓𝑤𝑖 reverses orientations for all points 𝑤𝑖 ∈ 𝑉𝑖 .
Thus the orientation number is the same for all points in 𝑉𝑖 . Hence the sum of orientation
numbers of the points in 𝑓 −1 (𝑧) is the same for all points 𝑧 ∈ 𝑈 . Consequently, the function
𝑌 → ℤ, 𝑦 → deg(𝑓 ; 𝑦)
is locally constant.
Now we show a very useful theorem about the degree which generalizes Theorem 11.8.
Theorem 15.2 (Boundary Theorem for deg) Assume that 𝑋 = 𝜕𝑊 is the boundary
of a compact oriented manifold 𝑊 and that 𝑋 is oriented as the boundary of 𝑊 . If
𝑓 ∶ 𝑋 → 𝑌 can be extended to a smooth map 𝐹 ∶ 𝑊 → 𝑌 , then deg(𝑓 ; 𝑦) = 0 for
every regular value 𝑦 of 𝑓 .
Proof: First we suppose that 𝑦 is a regular value for both 𝐹 and 𝑓 . By Theorem 10.14,
𝐹 −1 (𝑦) is a compact submanifold of 𝑊 with boundary of dimension dim 𝑊 − dim 𝑌 = 1.
Thus, by Theorem 10.17, 𝐹 −1 (𝑦) is the disjoint union of finitely many connected components
which are diffeomorphic to either 𝑆 1 or [0, 1]. Since 𝑆 1 does not have boundary points, only
the boundary points of the components diffeomorphic to [0, 1] lie on 𝑋 = 𝜕𝑊 . Let 𝐴 be one
such component diffeomorphic to [0, 1] and let 𝑎 and 𝑏 denote the boundary points of 𝐴, i.e.,
𝜕𝐴 = {𝑎} ∪ {𝑏}. We will show that
Since 𝑓 −1 (𝑦) = (𝐹|𝜕𝑊 )−1 (𝑦) consists of the finitely many boundary points of the components
diffeomorphic to [0, 1], this implies that the sum of the signs of all 𝑑𝑓𝑥 for 𝑥 ∈ 𝑓 −1 (𝑦) is zero
as claimed.
308 15.1. The integral-valued Brouwer Degree
To show (15.1) we look at how the tangent spaces at points in 𝐴 are oriented. So let 𝑥 ∈ 𝐴
and let (𝑣1 , 𝑣2 , … , 𝑣𝑛 , 𝑤) be a positively oriented bases for 𝑇𝑥 𝑊 such that 𝑤 is tangent to the
one-dimensional submanifold 𝐴, i.e., 𝑤 forms a basis of the one-dimensional vector subspace
𝑇𝑥 𝐴 ⊂ 𝑇𝑥 𝑊 , while (𝑣1 , … , 𝑣𝑛 ) is an ordered basis of the normal space 𝑁𝑥 (𝐴; 𝑊 ) ⊂ 𝑇𝑥 𝑊 .
Since we have a direct sum decomposition 𝑁𝑥 (𝐴; 𝑊 ) ⊕ 𝑇𝑥 𝐴 = 𝑇𝑥 𝑊 , we get the equation
Thus we have
where the right-hand side is determined by the given orientation of 𝑇𝑦 𝑌 . Thus, we conclude
that 𝑤 is a positively oriented basis of 𝑇𝑥 𝐴 if and only if 𝑑𝐹𝑥 sends the basis (𝑣1 , … , 𝑣𝑛 ) to a
positively oriented basis in 𝑇𝑦 𝑌 .
Let 𝑤(𝑥) be the positively oriented unit vector 𝑇𝑥 𝐴. Then 𝑤(𝑥) depends smoothly of 𝑥 and
points outward at one boundary point, say 𝑏, and inward at the other boundary point 𝑎. At the
boundary points we have the commutative diagrams
𝑇𝑎 𝑋 ⊕ 𝑇𝑎 𝐴 𝑇𝑎 𝑊 and 𝑇𝑏 𝑋 ⊕ 𝑇𝑏 𝐴 𝑇𝑏 𝑊
𝑑𝑓𝑎 𝑑𝑓𝑏
𝑑𝐹𝑎 𝑑𝐹𝑏
y y
𝑇𝑦 𝑌 𝑇𝑦 𝑌 .
The kernel of 𝑑𝐹𝑥 𝑇𝑥 𝐴 and, at the boundary points, 𝑑𝐹𝑥 restricts to 𝑑𝑓𝑥 . We just learned that
the sign of 𝑤(𝑥) is positive if and only if sign 𝑑𝑓𝑥 = +1. Thus, since 𝑤(𝑏) is positively oriented
as the outward pointing vector and 𝑤(𝑎) negatively, we conclude
Summing over all arcs in 𝐹 −1 (𝑦), we have proved the assertion when 𝑦 is a regular value for 𝐹 .
Now suppose that 𝑦0 is a regular value for 𝑓 , but not for 𝐹 . The function 𝑦 → deg(𝑓 ; 𝑦)
is constant within some open neighborhood 𝑈 of 𝑦0 by Lemma 15.1. By Sard’s Theorem 7.1
the open subset 𝑈 contains a regular value for 𝐹 . Hence we can choose a regular value 𝑦 for 𝐹
within 𝑈 and get
Lemma 15.3 (Homotopy Lemma for deg) Let 𝑓 , 𝑔 ∶ 𝑋 → 𝑌 be two smooth maps.
Assume 𝑓 and 𝑔 are smoothly homotopic. If 𝑦 ∈ 𝑌 is a regular value for both 𝑓 and
Chapter 15. The Brouwer Degree 309
𝑔, then
𝑓 −1 (𝑦) = 𝑔 −1 (𝑦).
Since 𝜕𝐹 can be extended to the compact oriented manifold [0, 1] × 𝑋, this difference must be
zero by Theorem 15.2.
Proof: Let 𝑦 and 𝑧 be two regular values of 𝑓 . By the Isotopy Lemma 11.3 we can choose a
diffeomorphism ℎ ∶ 𝑋 → 𝑌 which isotopic to the identity and with ℎ(𝑦) = 𝑧. Every diffeomor-
phism either preserves or reverses the orientation at every point. Hence we know sign 𝑑𝑥 ℎ = ±1
for all 𝑥. Since ℎ is isotopic to the identity, we must have sign 𝑑ℎ𝑥 = +1 for all 𝑥. Thus ℎ pre-
serves orientation, and we get
deg(ℎ◦𝑓 ; 𝑧) = deg(𝑓 ; 𝑧)
In the exercises we are going to show that the degree is multiplicative in the following sense:
In all examples we keep the assumption that 𝑋 and 𝑌 are smooth manifolds without boundary,
𝑋 is compact, 𝑌 is connected and dim 𝑋 = dim 𝑌 .
Example 15.6 (Constant maps have degree zero) Let 𝑓 ∶ 𝑋 → 𝑌 be a constant map
with value 𝑦0 . Then deg(𝑓 ) = 0. For, any value 𝑦 ≠ 𝑦0 is a regular value and 𝑓 −1 (𝑦) = ∅.
In particular, we get:
𝑓𝑚
𝕊O 1 / 𝕊1
O
𝜙 𝜙
ℝ / ℝ.
𝑡→𝑚𝑡
Taking derivatives yields, where we note that 𝑡 → 𝑚𝑡 is a linear map and therefore equal
Chapter 15. The Brouwer Degree 311
to its derivative:
𝑑(𝑓𝑚 )𝑧
𝑇𝑧 (𝕊1 ) / 𝑇 𝑚 (𝕊1 )
O 𝑧 O
𝑑𝜙𝑡 𝑑𝜙𝑚𝑡
ℝ / ℝ.
𝑡→𝑚𝑡
which means that 𝑑(𝑓𝑚 )𝑧 is the linear map given by multiplication by 𝑚. Hence, when
𝑚 > 0, 𝑓𝑚 wraps the circle uniformly around itself 𝑚 times preserving orientation. The
map is everywhere regular and orientation preserving, so its degree is the number
of preimages of any point. And that number is 𝑚. Similarly, when 𝑚 < 0 the map
is everywhere regular but orientation reversing. As each point has |𝑚| preimages, the
degree is −|𝑚| = 𝑚. Finally, when 𝑚 = 0 the map is constant, so its degree is zero.
Remark 15.11 (One homotopy class 𝕊1 → 𝕊1 for each integer) One immediate
consequence of this calculation (which could not have been proven with mod 2 theory)
is the interesting fact that the circle admits an infinite number of homotopically distinct
maps since deg(𝑧𝑚 ) = 𝑚 implies that 𝑓𝑛 and 𝑓𝑚 are not homotopic if 𝑛 ≠ 𝑚. We provide
a more complete picture of the self-maps of 𝕊1 in Theorem 15.18.
Example 15.12 (Self-maps of 𝕊2 and complex polynomials) Let 𝑝(𝑧) = 𝑧𝑚 +𝑎1 𝑧𝑚−1 +
⋯ + 𝑎𝑚 be a complex monic polynomial. We may consider 𝑝 as a smooth map ℂ → ℂ,
and as in Equation 4.1, it induces a smooth map 𝑓 ∶ 𝕊2 → 𝕊2 which sends ∞ to ∞, i.e.,
𝑓 sends the north pole to the north pole. We claim that the map 𝑓 has degree 𝑚.
To justify our claim we first observe that the homotopy
Proof: For 𝑚 = 0 it suffices to take any constant map. For 𝑘 = 1 and 𝑚 ∈ ℤ, we proved
the assertion in Example 15.10. We can generalize this case as follows. First we assume that
𝑚 is positive,√i.e., we assume√𝑚 ≥ 1. We can describe the coordinates of any point in 𝕊𝑘 as
the tuple (𝑥, 1 − |𝑥|2 cos 𝑡, 1 − |𝑥|2 sin 𝑡) where 𝑥 = (𝑥1 , … , 𝑥𝑘−1 ) denotes the first 𝑘 − 1
coordinates and |𝑥|2 = 𝑥21 + ⋯ 𝑥2𝑘−1 . We define the smooth map 𝑓𝑚 ∶ 𝕊𝑘 → 𝕊𝑘 defined by
√ √ √ √
(𝑥, 1 − |𝑥|2 cos 𝑡, 1 − |𝑥|2 sin 𝑡) → (𝑥, 1 − |𝑥|2 cos(𝑚𝑡), 1 − |𝑥|2 sin(𝑚𝑡)).
The map 𝑓𝑚 is surjective and by Sard’s Theorem 7.1 we can find a regular value 𝑝 ∈ 𝕊𝑘 . The
preimage of 𝑝 consists of exactly 𝑚 points. It remains to check the sign of the orientation at each
preimage point 𝑞 ∈ 𝑓 −1 (𝑝). With respect to the bases of 𝑇𝑞 𝕊𝑘 and 𝑇𝑓𝑚 (𝑞) 𝕊𝑘 that are induced by
the standard basis in ℝ𝑘 , the derivative 𝑑(𝑓𝑚 )𝑞 at 𝑞 is given by the (𝑘 × 𝑘)-matrix which consists
of the (𝑘 − 1 × 𝑘 − 1)-identity matrix and the integer 𝑚 in the bottom right-hand corner. This
matrix has determinant 𝑚 > 0. Thus, the orientation is preserved at each point in 𝑞 ∈ 𝑓 −1 (𝑝),
and we get deg(𝑓𝑚 ) = 𝑚.
If 𝑚 < 0 is negative, we consider the composition 𝑓|𝑚| ◦𝑟1 where 𝑟1 denotes the reflection
in the first coordinate. Since the degree is multiplicative by Lemma 15.5, this map has degree
𝑚 by the first case and Example 15.9.
Remark 15.14 (Homotopy groups of spheres - surjectivity) Let [𝕊𝑘 , 𝕊𝑘 ] denote the
set of smooth maps 𝕊𝑘 → 𝕊𝑘 modulo homotopy. By Theorem 15.4 we can think of the
degree as a map
deg ∶ [𝕊𝑘 , 𝕊𝑘 ] ←→ ℤ.
By Theorem 15.13, this map is surjective. We will later prove Hopf’s Theorem 16.1
which tells us that this map is injective as well.
In the exercises we are going ro prove the following two applications of the degree:
𝕋 2 = ℝ2 ∕ℤ2 ←→ ℝ2 ∕ℤ2 = 𝕋 2 .
We denote the induced maps on 𝕋 2 by the same symbols 𝐴 and 𝐵 respectively. The
effect of 𝐴 on 𝕋 2 is to revolve 𝑛 times around the circle 𝕊1 × {𝑦} in 𝕋 2 , for some 𝑦 ∈ 𝕊1 .
While the effect of 𝐵 on 𝕋 2 is to revolve 𝑛 times around the circle {𝑥} × 𝕊1 in 𝕋 2 , for
some 𝑥 ∈ 𝕊1 . Hence, for any point 𝑞 ∈ 𝕋 2 , the preimages 𝐴−1 (𝑞) and 𝐵 −1 (𝑞) consist of
exactly 𝑛 points.
The derivative 𝑑𝐴𝑝 of 𝐴 at any point 𝑝 ∈ 𝕋 2 equals the linear map ℝ2 → ℝ2 repre-
sented by 𝐴, i.e., 𝑑𝐴𝑝 = 𝐴, and similarly 𝑑𝐵𝑝 = 𝐵. In particular, we get
Since 𝑛 > 0, the determinant of 𝑑𝐴𝑝 and 𝑑𝐵𝑝 is positive at every point 𝑝 which is sent
to 𝑞. Thus we can conclude
deg(𝐴) = 𝑛 = deg(𝐵).
Theorem 15.18 (Hopf Degree Theorem in dimension one) Two smooth maps
𝑓0 , 𝑓1 ∶ 𝕊1 → 𝕊1 are homotopic if and only if they have the same degree.
Proof of Theorem 15.18: We already know that if 𝑓0 and 𝑓1 are homotopic, then we have
deg(𝑓0 ) = deg(𝑓1 ). So assume deg(𝑓0 ) = deg(𝑓1 ), and we need to show 𝑓0 ∼ 𝑓1 . Recall from
Figure 11.4 the map 𝑝 defined by
𝑝 ∶ ℝ → 𝕊1 , 𝑡 → 𝑒2𝜋𝑖𝑡 ,
and showed that every smooth map 𝑓 ∶ 𝕊1 → 𝕊1 can be lifted1 to a map 𝑔 ∶ ℝ → ℝ with
such that 𝑓0 (𝑝(𝑡)) = 𝑝(𝑔0 (𝑡)), 𝑓1 (𝑝(𝑡)) = 𝑝(𝑔1 (𝑡)). Then the map
𝑓0 = 𝑝◦𝑔0 to 𝑝◦𝑔1 = 𝑓1 .
It remains to prove:
Claim: 𝑞 = deg(𝑓 ).
∙ First, note that if 𝑓 is not surjective, then we can pick a point 𝑦 ∉ 𝑓 (𝕊1 ). Thus 𝑦 is
automatically a regular value. Since #𝑓 −1 (𝑦) = 0, we must have deg(𝑓 ) = 0. In this
case, we need to have 𝑞 = 0, i.e., 𝑔(𝑡 + 1) = 𝑔(𝑡). For otherwise 𝑝◦𝑔 was surjective and
hence 𝑓 would be surjective.
Note that, since the stereographic projection map 𝕊1 ⧵ {𝑦} → ℝ is a diffeomorphism
and ℝ is contractible, this shows that 𝕊1 ⧵ {𝑦} is contractible. Hence 𝑓 is a map to a
contractible space and therefore homotopic to a constant map and has degree 0.
1
The idea was to lift piecewise locally and to patch the pieces together.
Chapter 15. The Brouwer Degree 315
∙ Now we assume that 𝑓 is surjective. Let 𝑦 ∈ 𝕊1 be a regular value of 𝑓 , and let 𝑧 ∈
𝑓 −1 (𝑦). Since 𝑝 is surjective, there is a 𝑡 ∈ ℝ with 𝑝(𝑡) = 𝑧. Since 𝑦 is a regular value, 𝑓
is a local diffeomorphism around 𝑧. Its derivative is related to the one of 𝑔 by the chain
rule
Hence the determinant of 𝑑𝑝𝑡 at any 𝑡 is positive (in fact equal +2𝜋). Thus the sign of the
determinant of 𝑑𝑓𝑧 equals the sign of 𝑑𝑔𝑡 ∈ ℝ.
As above, let 𝑦 ∈ 𝕊1 be a regular value of 𝑓 and 𝑧 ∈ 𝑓 −1 (𝑦). Let us fix a 𝑡0 ∈ ℝ with
𝑝(𝑡0 ) = 𝑧. When we walk from 𝑡0 to 𝑡0 + 1 we need to count how many preimages of 𝑦
we collect along the way, with their orientation (!).
∙ We start with the case 𝑞 = 0, i.e., 𝑔(𝑡 + 1) = 𝑔(𝑡). It will actually teach us the key ideas
we need to remember from this proof.
We need to count how often 𝑔(𝑠) = 𝑔(𝑡0 ) with 𝑑𝑔𝑠 = 𝑔 ′ (𝑠) > 0 and how often 𝑔(𝑠) = 𝑔(𝑡0 )
with 𝑑𝑔𝑠 = 𝑔 ′ (𝑠) < 0. Note that since 𝑦 is regular, 𝑑𝑔𝑠 is always ≠ 0 for such an 𝑠.
Since 𝑔 is a smooth function ℝ → ℝ, this is now just an exercise in Calculus. Using the
periodicity of 𝑔, i.e., that 𝑔 ′ (𝑡0 ) must have the same sign as 𝑔 ′ (𝑡0 + 1), we see that there
are exactly as many points 𝑠 with 𝑔(𝑠) = 𝑔(𝑡0 ) and 𝑑𝑔𝑠 = 𝑔 ′ (𝑠) > 0 as there are points
with 𝑔(𝑠) = 𝑔(𝑡0 ) and 𝑑𝑔𝑠 = 𝑔 ′ (𝑠) < 0. Thus deg(𝑓 ) = 0. See Figure 15.2.
Figure 15.2: We count the intersection points with the sign of the derivative. If 𝑞 = 0, there are
as many crossing points from below with positive derivative as there are crossing points from
above with negative derivative.
Figure 15.3: We count the intersection points with the sign of the derivative. If 𝑞 > 0, there
are exactly 𝑞 more crossing points from below with positive derivative than there are crossing
points from above with negative derivative.
Thus in total the orientation numbers for 𝑔 −1 ([𝑔(𝑡0 )+𝑖, 𝑔(𝑡0 )+𝑖+1]) add up to +2. Repeating
this for all 𝑖 = 0, 1, … , 𝑞 − 1 gives a sum of orientation numbers equal to 𝑞, since we have to
account for that we counted the inner points twice.
Since the sum of orientation numbers of 𝑓 equals the one of 𝑔, this shows deg(𝑓 ) = 𝑞.
∙ Finally, if 𝑞 < 0, the same argument works with signs and directions reversed.
Chapter 15. The Brouwer Degree 317
15.3 Exercises and more examples
Aside: Note that we would not have been able to make the only if -conclusion with
the mod 2-degree. For in ℤ∕2, we cannot distinguish between 1 and −1.
Exercise 15.3 Use the degree to prove the Fundament Theorem of Algebra: Given a
monic complex polynomial
𝑝(𝑧) = 𝑧𝑚 + 𝑎1 𝑧𝑚−1 + ⋯ + 𝑎𝑚 .
Exercise 15.4 Explain why the degree cannot be used to prove that there is a zero in ℝ
for every monic real polynomial.
318 15.3. Exercises and more examples
𝑓 𝑔
Exercise 15.6 Let 𝑋 ←←←→ ← 𝑍 be a sequence of smooth maps between manifolds
← 𝑌 ←←→
with 𝑋 and 𝑌 compact, 𝑌 and 𝑍 connected. Assume that all three manifolds are oriented
and boundaryless and dim 𝑋 = dim 𝑌 = dim 𝑍. Show that
𝑓
Exercise 15.7 Let 𝕊𝑘 ←←←→
← 𝕊𝑘 be a smooth map. Show that if deg(𝑓 ) ≠ (−1)𝑘+1 , then 𝑓
must have a fixed point.
Hint: Assume that 𝑓 had no fixed point. Show that then the composition of 𝑓 with
the antipodal map would be homotopic to the identity map.
𝑓
← 𝕊𝑘 be a smooth map. In this exercise we prove the following
Exercise 15.9 Let 𝕊𝑘 ←←←→
statement: if deg(𝑓 ) is odd, then 𝑓 must send some pair of antipodal points to antipodal
points.
Chapter 15. The Brouwer Degree 319
(b) Now further assume that 𝑘 is even. Show that this implies deg(𝑔) = 0.
(c) Now assume that 𝑘 is odd. Use Exercise 15.6 and Exercise 15.8 to deduce that this
implies deg(𝑔) is even.
In the following exercises, we will study some basic properties of the Hopf invariant. Recall
that we have already defined the linking number and the Hopf invariant in a mod 2-version in
Section 11.3. Now we improve the constructions to get ℤ-valued invariants. In particular, we
show that the Hopf invariant of the Hopf fibration equals one. This is a much stronger result
than what we proved previously in Theorem 11.27.
Figure 15.4: The red circle is the boundary of a compact manifold 𝐷. In both cases the green
curve intersects the manifold 𝐷. The linking number detects the intersection. The circles on
the left-hand side are linked and cannot be moved apart. However, multiple intersections may
occur. The curves on the right-hand side are not linked and can be moved. The linking number
detects this by counting the linking points with signs.
Hint: Check what happens with the orientation numbers at points when we switch
the order of 𝑋 and 𝑌 , and think of our computation of the degree of the antipodal
map on 𝕊𝑘 .
(b) Assume that 𝑋 is the boundary of an oriented manifold 𝑊 which is disjoint from
𝑌 . Show that this implies 𝐿(𝑋, 𝑌 ) = 0.
Hint: Use the Boundary Theorem 15.2 for degrees.
Chapter 15. The Brouwer Degree 321
As in Definition 11.21, we extend the definition of the linking number to submanifolds 𝑋, 𝑌
in 𝕊𝑘+1 : Assume 𝑋 and 𝑌 are compact, oriented and boundaryless, and dim 𝑋 + dim 𝑌 = 𝑘.
Since the sphere is connected and 𝑋 and 𝑌 are closed subsets, there must be a point 𝑝 which
is not contained in either 𝑋 or 𝑌 . We identity 𝕊𝑘+1 ⧵ {𝑝} with ℝ𝑘+1 via the diffeomorphism
defined by stereographic projection from 𝑝. Then we consider 𝑋 and 𝑌 as submanifolds of
ℝ𝑘+1 and define the linking number 𝐿(𝑋, 𝑌 ) as above.
The number ( )
𝐻(𝑓 ) ∶= 𝐿 𝑓 −1 (𝑤), 𝑓 −1 (𝑧)
is called the Hopf invariant of 𝑓 . This is a famous invariant that played a crucial role in the
development of mathematics. As in Section 11.3.2 one can show that 𝐻(𝑓 ) does not depend on
the choice of 𝑤 and 𝑧 and only depends on the homotopy class of 𝑓 . We skip this verification
here. Denoting by 𝜋2𝑛−1 (𝕊𝑛 ) the (2𝑛−1)-homotopy group of 𝕊𝑛 , we can view the Hopf invariant
as a map
𝐻 ∶ 𝜋2𝑛−1 (𝕊𝑛 ) → ℤ.
Exercise 15.11 We are going to study some of the basic properties of the Hopf invariant
in this exercise:
𝑓 𝑔
𝕊2𝑛−1 ←←←→
← 𝕊𝑛 ←←→
← 𝕊𝑛 .
Show that
𝐻(𝑔◦𝑓 ) = 𝐻(𝑓 ) ⋅ deg(𝑔)2 .
Now we are going to compute the Hopf invariant for the Hopf fibration 𝜋 ∶ 𝕊3 → 𝕊2 . We
have met 𝜋 several times before and have solved several of the following problems in the main
text and previous exercises. We collect and reprove all the results for computing 𝐻(𝜋). We
recommend that you solve each of the tasks again to get more practice. The calculation of
𝐻(𝜋) = 1 is a key new step which goes beyond the computation modulo 2.
(b) Show that 𝜋(𝑧0 , 𝑧1 ) = 𝜋(𝑤0 , 𝑤1 ) if and only if there is a complex number 𝛼 with
|𝛼|2 = 𝛼 𝛼̄ = 1 such that (𝑤0 , 𝑤1 ) = (𝛼𝑧0 , 𝛼𝑧1 ).
322 15.4. Linking number and the Hopf invariant via exercises
(c) Show that, for every point 𝑝 ∈ 𝕊2 , the fiber 𝜋 −1 (𝑝) is diffeomorphic to 𝕊1 .
We proved previously that there is exactly one homotopy class of maps 𝕊1 → 𝕊1 for every
integer 𝑛 ∈ ℤ. By our classification of one-manifolds, we can read this also as follows:
For every compact, connected, boundaryless one-manifold 𝑋, there is exactly one homo-
topy class of maps 𝑋 → 𝕊1 for every integer 𝑛 ∈ ℤ.
Now we are going to generalize this result to higher dimensions. Recall that the degree of
a map 𝑓 ∶ 𝑋 → 𝕊𝑘 as in the theorem is defined as
∑
deg(𝑓 ) = sign (𝑑𝑓𝑥 )
𝑥∈𝑓 −1 (𝑦)
where 𝑦 is a regular value of 𝑓 and sign (𝑑𝑓𝑥 ) is +1 if 𝑑𝑓𝑥 preserves orientations and −1 if 𝑑𝑓𝑥
reverses orientations. We will refer to this sign rule as our usual orientation convention.
Together with Theorem 15.13 and since every continuous map between spheres is homo-
topic to a smooth map we then get:
Corollary 16.2 (Homotopy groups of spheres - injectivity) Let 𝜋𝑘 (𝕊𝑘 ) denote the 𝑘th
homotopy group of 𝕊𝑘 . The degree defines a bijection
deg ∶ 𝜋𝑘 (𝕊𝑘 ) → ℤ.
Remark 16.3 Note that the situation is similar, though different for non-orientable
manifolds: Two maps of a compact, connected, non-orientable, boundaryless 𝑘-
manifold 𝑋 to 𝕊𝑘 are homotopic if and only if they have the same degree modulo
2.
Now we start our march towards a proof Hopf’s theorem. We will follow the guideline of
Guillemin-Pollack [5]. But it is worth noting that there are many different ways to prove this
theorem and we will present Pontryagin’s proof as presented in Milnor’s book [13] in ??.
323
324 16.1. Strategy for proving Hopf’s theorem
∙ (Strategy for the proof) Assume given two maps 𝑓0 and 𝑓1 from 𝑋 to 𝕊𝑘 .
∙ For the proof of the Extension Theorem 16.13, use the Isotopy Lemma 11.3 to
move 𝑊 inside some ball 𝔹 ⊂ ℝ𝑘+1 with Int(𝑊 ) ⊂ 𝔹. This reduces to checking
an extension statement on balls and spheres.
In order to make this strategy work, we need to prove a series of technical results. This
will occupy the rest of the chapter. Two main technical ingredients are isotopies which allow
to move points, and winding numbers which help us calculating degrees. We will also need
the following version of the Isotopy Lemma:
𝐸(𝑥1 , 𝑥2 , … , 𝑥𝑘 ) = (−𝑥1 , 𝑥2 , … , 𝑥𝑘 ).
Proof: First we remark that it suffices to deal with the case that 𝐸 preserves orientations.
For if 𝐸 is orientation reversing, then 𝑟1 ◦𝐸 preserves orientations. Then if there is a homotopy
𝐹 between 𝑟1 ◦𝐸 and Id, then, after composing all maps with 𝑟1 , 𝑟1 ◦𝐹 is a homotopy between
Chapter 16. Hopf Degree Theorem 325
𝐸 = 𝑟1 ◦𝑟1 ◦𝐸 and 𝑟1 .
Now let 𝑘 = 2 and assume that 𝐸 has only complex eigenvalues. Then 𝐸𝑡 = 𝑡𝐸 + (1 − 𝑡)Id
is a linear homotopy between Id and 𝐸. Moreover, each ( 𝐸𝑡 is)a linear isomorphism. To show
𝑎 𝑏
this we show that det(𝐸𝑡 ) ≠ 0 for all 𝑡 ∈ [0, 1]. If 𝐸 = , then we get
𝑐 𝑑
(𝑎 + 𝑑 − 2)2 − 4(𝑎𝑑 − 𝑏𝑐 − 𝑎 − 𝑑 + 1)
=(𝑎 + 𝑑)2 − 4(𝑎 + 𝑑) + 4 − 4(𝑎𝑑 − 𝑏𝑐) + 4(𝑎 + 𝑑) − 4
=(𝑎 + 𝑑)2 − 4(𝑎𝑑 − 𝑏𝑐).
Now we show the induction step: So assume 𝑘 ≥ 2 and the assertion to be true in all
dimensions < 𝑘. Then 𝐸 has either at least one real eigenvalue or at least one complex eigen-
value. Let 𝑉 ⊂ ℝ𝑘 be the corresponding eigenspace, which is either one- or two-dimensional.
Then 𝐸 maps 𝑉 into itself. Hence ℝ𝑘 splits into a direct sum ℝ𝑘 = 𝑉 ⊕ 𝑊 . By choosing a
basis of ℝ𝑘 consisting of a basis of 𝑉 and one for 𝑊 , we can represent 𝐸 as a matrix of the
form
( )
𝐴 𝐵
𝐸= .
0 𝐶
As for many results on maps between spheres, the winding number is useful concept. We used
it before with values modulo 2. Now we need an integral version:
𝑓 (𝑥) − 𝑧
𝑢 ∶ 𝑋 → 𝕊𝑘 , 𝑥 → .
|𝑓 (𝑥) − 𝑧|
As a formula:
𝑊 (𝑓 , 𝑧) = deg(𝑢).
The winding number will be the main tool in the proof of Hopf’s theorem. In order to
exploit it effectively, we investigate some of its properties:
Proof: After possibly translating things, we can assume 𝑥 = 0 = 𝑧, which keeps the
notation simpler. We set 𝐴 = 𝑑𝑓0 . We are going to show that 𝑊 (𝐴, 0) can be used to calculate
𝑊 (𝜕𝑓 , 0). This will follow if we show that we can choose 𝔹 small enough such that there is a
homotopy 𝐹𝑡 ∶ 𝜕𝔹 × [0, 1] → 𝕊𝑘−1 between 𝐴𝑥∕|𝐴𝑥| and 𝜕𝑓 (𝑥)∕|𝜕𝑓 (𝑥)|. For then
( ) ( )
𝜕𝑓 (𝑥) 𝐴𝑥
𝑊 (𝜕𝑓 , 0) = deg = deg = 𝑊 (𝐴, 0).
|𝜕𝑓 (𝑥)| |𝐴𝑥|
Chapter 16. Hopf Degree Theorem 327
Now we are going to construct the homotopy 𝐹𝑡 : By Taylor theory, we can write
We define
Since 𝑥 = 0 is a regular point, we know that 𝐴 is an isomorphism. Hence the image of the
unit ball in ℝ𝑘 under 𝐴 strictly contains a closed ball of some radius 𝑟 > 0. Since every linear
isomorphism is a diffeomorphism, we also know that 𝐴 maps boundaries to boundaries, i.e.,
𝕊𝑘−1 to the boundary of the closed ball of radius 𝑟. Hence
As a consequence,
|𝐴(𝑥∕|𝑥|)| > 𝑐 and thus |𝐴𝑥| > |𝑟𝑥| for all 𝑥 ∈ ℝ𝑘 ⧵ {0}.
Now we use (16.1). Since 𝜀(𝑥)∕|𝑥| → 0 as 𝑥 → 0, we can choose a ball 𝔹 small enough
such that
𝑟
𝜀(𝑥)∕|𝑥| < for all 𝑥 ∈ 𝜕𝔹.
2
Then we have
𝑟 𝑟
|𝑓𝑡 (𝑥)| = |𝐴𝑥| − 𝑡|𝜀(𝑥)| > 𝑟|𝑥| − |𝑥| = |𝑥|,
2 2
𝑖.𝑒., |𝑓𝑡 (𝑥)| > 0 for all 𝑥 ∈ 𝜕𝔹.
Now we compute 𝑊 (𝐴, 0). Therefor we apply the Linear Isotopy Lemma 16.4 and get
that 𝐴 is homotopic to the identity if it preserves orientations, and homotopic to the reflection
map (𝑥1 , 𝑥2 , … , 𝑥𝑘 ) → (−𝑥1 , 𝑥2 , … , 𝑥𝑘 ) if it reverses orientations. In the former case, we have
𝑊 (𝐴, 0) = +1, and in the latter case 𝑊 (𝐴, 0) = −1.
This result determines how local diffeomorphisms can wind. Now we are going to use this
information to count preimages:
Lemma 16.7 (Step 2) Let 𝑓 ∶ 𝔹 → ℝ𝑘 be a smooth map defined on some closed ball
𝔹 in ℝ𝑘 . Suppose that 𝑧 is a regular value of 𝑓 that has no preimages on the boundary
sphere 𝜕𝔹, and let 𝜕𝑓 ∶ 𝜕𝔹 → ℝ𝑘 be its restriction to the boundary. Then the number
of preimages of 𝑧, counted with our usual orientation convention, equals the winding
number 𝑊 (𝜕𝑓 , 𝑧).
328 16.2. The Special Case
Proof: By the Stack of Records Theorem 4.14, we know that 𝑓 −1 (𝑧) is a finite set {𝑥1 , … , 𝑥𝑛 },
and we can choose disjoint balls 𝔹𝑖 around each 𝑥𝑖 . Since 𝑓 −1 (𝑧) is disjoint from 𝜕𝔹 by as-
sumption, we can shrink these balls such that 𝔹𝑖 ∩ 𝜕𝔹 = ∅ and so that each 𝔹𝑖 is sufficiently
small so that Step 1 can be applied. Let 𝜕𝑓𝑖 = 𝑓|𝜕𝔹𝑖 . Then Step 1, i.e., Lemma 16.6, implies
that the number of preimage points, counted with our usual orientation convention, equals
∑𝑛
𝑖=1 𝑊 (𝜕𝑓𝑖 , 𝑧). Let 𝔹 ∶= 𝔹 ⧵ ∪𝑖 𝔹𝑖 and consider the map
′
𝑓 (𝑥) − 𝑧
𝑢 ∶ 𝜕𝔹 → 𝕊𝑘−1 , 𝑥 → .
|𝑓 (𝑥) − 𝑧|
Since 𝑓 (𝑥) ≠ 𝑧 on 𝔹′ , this map extends to all of 𝔹′ . This implies
𝑊 (𝑓|𝜕𝔹′ , 𝑧) = deg(𝑢) = 0.
This implies
∑
𝑛
𝑊 (𝑓|𝜕𝔹′ , 𝑧) = 𝑊 (𝜕𝑓 , 𝑧) − 𝑊 (𝜕𝑓𝑖 , 𝑧).
𝑖=1
∑𝑛
Hence in total we get 𝑊 (𝜕𝑓 , 𝑧) = 𝑖=1 𝑊 (𝜕𝑓𝑖 , 𝑧).
Proof: For simplicity, we assume that 𝔹 is centred at 0. Then we can write every non-zero
point 𝑥 ∈ 𝔹 uniquely as 𝑥 = 𝑡𝑦 for some 𝑦 ∈ 𝜕𝔹 and some 𝑡 ∈ [0, 1]. By assumption, there is
a homotopy 𝑔𝑡 ∶ 𝜕𝔹 → 𝑌 with 𝑔1 = 𝜕𝑓 and 𝑔0 being a constant map.
Now it remains to use smooth bump function to turn 𝐹 into a smooth homotopy.1
1
It is already smooth except, possibly, on 𝜕𝔹.
Chapter 16. Hopf Degree Theorem 329
Lemma 16.9 (The Special Case) Let 𝑓 ∶ 𝕊𝑘 → 𝕊𝑘 be a smooth map of degree zero.
Then 𝑓 is homotopic to a constant map.
Theorem 16.10 (Winding number zero) Any smooth map 𝑓 ∶ 𝕊𝑘 → ℝ𝑘+1 ⧵ {0}
having winding number zero with respect to the origin is homotopic to a constant
map.
Proof of Theorem 16.10: By assumption, the degree of the map 𝑓 ∕|𝑓 | is zero. By the
special case, this implies that 𝑓 ∕|𝑓 | is homotopic to a constant map. But 𝑓 ∕|𝑓 | and 𝑓 are
homotopic via the homotopy
Proof: Since 𝑓 −1 (0) is a finite, we can choose a ball 𝔹 centred at the origin with 𝑓 −1 (0) ⊂
Int(𝔹). By assumption, the sum of preimage points is zero when counted with the usual ori-
entation convention. By Step 2, i.e., Lemma 16.7, the map 𝜕𝑓 ∶ 𝜕𝔹 → ℝ𝑘 ⧵ {0} has winding
number zero. Since 𝜕𝔹 is diffeomorphic to 𝕊𝑘−1 , we may consider 𝜕𝑓 as a map from 𝕊𝑘−1 to
ℝ𝑘 ⧵ {0}.
Since we are assuming the special case being true in dimension 𝑘 − 1, we can apply its
consequence, i.e., Theorem 16.10, in that dimension. Thus, 𝜕𝑓 is homotopic to a constant map.
Hence
𝑓|ℝ𝑘 ⧵Int(𝔹) ∶ ℝ𝑘 ⧵ Int(𝔹) → ℝ𝑘 ⧵ {0}
is a map to which we can apply Step 3, i.e., Lemma 16.8. This implies that 𝑓 extends to a
smooth map 𝑔 ∶ ℝ𝑘 → ℝ𝑘 ⧵ {0} with 𝑓 = 𝑔 outside the compact space 𝔹.
Now we get back to the proof of the special case. So we are given a smooth map 𝑓 ∶ 𝕊𝑘 →
𝕊𝑘 with deg(𝑓 ) = 0.
The idea of the proof is to show that 𝑓 is homotopic to a map ℎ ∶ 𝕊𝑘 → 𝕊𝑘 ⧵ {𝑏}, where
𝑏 is some point in 𝕊𝑘 . But 𝕊𝑘 ⧵ {𝑏} is diffeomorphic to ℝ𝑘 via stereographic projection from
330 16.2. The Special Case
𝑏. Since ℝ𝑘 is contractible, this implies ℎ is homotopic to a constant map. Then 𝑓 is also
homotopic to a constant map.
So we need to show:
By Sard’s Theorem 7.1, we can choose distinct regular values 𝑎 and 𝑏 of 𝑓 . By the Stack of
Records Theorem 4.14, the preimage sets are finite, say 𝑓 −1 (𝑎) = {𝑎1 , … , 𝑎𝑛 } and 𝑓 −1 (𝑏) =
{𝑏1 , … , 𝑏𝑚 }.
Since 𝑘 > 1, we can apply the Isotopy Theorem 11.10, which is an extension of the Isotopy
Lemma 11.3, to the points {𝑎2 , … , 𝑎𝑛 } in 𝑌 ∶= 𝕊𝑘 ⧵ {𝑏} to get a diffeomorphism which is
isotopic to the identity, compactly supported, and moves the points 𝑎𝑖 into 𝑈 .
Since homotopy is a transitive relation, we can therefore assume that 𝑈 is an open neigh-
borhood of 𝑓 −1 (𝑎) with 𝑏 ∉ 𝑓 (𝑈 ).
is a smooth map from ℝ𝑘 to ℝ𝑘 . Since 𝑎 is a regular value of 𝑓 , 0 is a regular value of 𝛽◦𝑓 ◦𝛼.
Moreover, since 𝑓 −1 (𝑎) is finite, (𝛽◦𝑓 ◦𝛼)−1 (0) is finite as well.
Now we use the assumption deg(𝑓 ) = 0. For this means that the number of preimages of
𝑎 under 𝑓 is zero when counted with our usual orientation convention. Hence the number
of preimages of 0 under 𝛽◦𝑓 ◦𝛼 is zero when counted with the usual orientation convention.
Thus, we can apply Lemma 16.11 to 𝛽◦𝑓 ◦𝛼 ∶ ℝ𝑘 → ℝ𝑘 and get a map 𝑔 ∶ ℝ𝑘 → ℝ𝑘 ⧵ {0}
such that 𝑔 = 𝛽◦𝑓 ◦𝛼 outside a compact set 𝐵 and 𝑔 is homotopic to 𝛽◦𝑓 ◦𝛼 on ℝ𝑘 .
𝛽 −1 ◦𝑔◦𝛼 −1 = 𝑓 on 𝑈 ⧵ 𝛼 −1 (𝐵).
ℎ ∶ 𝕊𝑘 → 𝕊𝑘 ⧵ {𝑏}
defined by setting
{
𝑓 on 𝕊𝑘 ⧵ 𝛼 −1 (𝐵)
ℎ=
𝛽 −1 ◦𝑔◦𝛼 −1 on 𝛼 −1 (𝐵)
is smooth, and ℎ is the desired map homotopic to 𝑓 . This proves the special case.
Chapter 16. Hopf Degree Theorem 331
16.3 The Extension Theorem for degree zero maps
Next we need to learn how to extend maps defined on the boundary to the whole manifold. This
is possible for maps of degree zero. This is an important result. We start with the simpler case
of maps to Euclidean space:
Now we choose a smooth bump function 𝜌 that is constant 1 on 𝜕𝑊 and 0 outside some
compact subset of 𝑈 . Then we can extend 𝑓 to all of 𝑊 by letting it be
𝜌 ⋅ 𝐹 on 𝑈 , and 0 outside of 𝑈 .
Theorem 16.13 (Extension Theorem for degree zero maps) Let 𝑊 be a com-
pact, connected, oriented 𝑘 + 1-dimensional smooth manifold with boundary, and
let 𝑓 ∶ 𝜕𝑊 → 𝕊𝑘 be a smooth map. Then 𝑓 extends to a globally defined map
𝐹 ∶ 𝑊 → 𝕊𝑘 with 𝜕𝐹 = 𝑓 if and only if deg(𝑓 ) = 0.
Proof: We already know that if 𝑓 can be extended to all of 𝑊 , then deg(𝑓 ) = 0. It remains
to show the opposite direction.
So let 𝑓 be as in the theorem, and assume deg(𝑓 ) = 0. By Lemma 16.12 above, we can
extend 𝑓 to a smooth map 𝐹 ∶ 𝑊 → ℝ𝑘+1 . By the Transversality Extension Theorem 12.29,
we can assume that 0 is a regular value of 𝐹 . Since 𝑊 is compact of dimension 𝑘 + 1, we know
that 𝐹 −1 (0) is a finite set. Hence we can apply the corollary to the Isotopy Lemma 11.3 to this
finite set, and move 𝐹 −1 (0) inside Int(𝔹) where 𝔹 is a closed ball contained Int(𝑊 ).
𝐹
In particular, since 𝐹 −1 (0) ⊂ Int(𝔹), the map |𝐹 |
extends to 𝑊 ′ ∶= 𝑊 ⧵ Int(𝔹). Hence
𝑊 (𝐹|𝜕𝑊 , 0) = 𝑊 (𝑓 , 0) = deg(𝑓 ) = 0,
Hence we get
Now Theorem 16.10, i.e., the consequence of the special case, implies that 𝜕𝐹 is homotopic
to a constant map. By Step 3, i.e., Lemma 16.8, this implies that 𝜕𝐹 extends to a map 𝐺 ∶ 𝑊 →
ℝ𝑘+1 ⧵ {0}. Then the map 𝐺∕|𝐺| ∶ 𝑊 → 𝕊𝑘 is the global extension of 𝑓 .
Let 𝑓0 and 𝑓1 be two maps 𝑋 → 𝕊𝑘 and let 𝑊 ∶= 𝑋 × [0, 1]. We define a map 𝑓 ∶ 𝜕𝑊 → 𝕊𝑘
by setting
{
𝑓0 on 𝑋 × {0}
𝑓=
𝑓1 on 𝑋 × {1}.
By the Extension Theorem 16.13, 𝑓 extends to a map on all of 𝑊 if and only if deg(𝑓 ) = 0.
By definition, such an extension would be a homotopy between 𝑓0 and 𝑓1 . Thus we have
𝑓0 ∼ 𝑓1 ⇐⇒ deg(𝑓 ) = 0.
It remains to relate deg(𝑓 ) to deg(𝑓0 ) and deg(𝑓1 ). But, since 𝜕𝑊 = (𝑋 × {1}) ∪ (𝑋 × {0})
with the opposite orientation on 𝑋 × {0}, it follows that
Thus
𝑓0 ∼ 𝑓1 ⇐⇒ deg(𝑓1 ) = deg(𝑓0 ).
17. Vector Fields and the Poincaré–Hopf Index Theorem
Let 𝑋 ⊂ ℝ𝑁 be a smooth manifold. Recall from the exercises that a vector field on 𝑋 is a
smooth assignment of a vector tangent to 𝑋 at each point 𝑥, i.e., a smooth map 𝐯 ∶ 𝑋 → ℝ𝑁
such that 𝐯(𝑥) ∈ 𝑇𝑥 (𝑋) for every 𝑥.
Vector fields play a crucial role in many applications, for example in mathematical physics.
To provide efficient tools to understand them is an important motivation for differential topol-
ogy. A priori one might expect that vector fields are free to move in every way they want. It
turns out, however, that the topology of 𝑋 provides pretty strong restrictions for what is al-
lowed and what is not. This is the content of the Poincaré–Hopf Index Theorem 17.16 that we
are going to prove. Since 𝐯(𝑥) varies smoothly with 𝑥, the most interesting points 𝑥 ∈ 𝑋 are
where 𝐯(𝑥) = 0. For, in a small neighborhood of such a point, 𝐯 can change directions radically.
Hence the main points we need to investigate are the zeros of the vector field.
We have already seen some examples of vector fields in the exercises. Here are some more
examples:
Example 17.1 (Vector field on an open subset) Let 𝑈 ⊂ ℝ𝑘 be an open subset. Recall
that the tangent space 𝑇𝑢 𝑈 is just ℝ𝑘 for every 𝑢 ∈ 𝑈 . Hence a vector field 𝐯 on 𝑈 is just
a smooth map 𝑈 → ℝ𝑘 . Let 𝑒1 , … , 𝑒𝑘 be the standard basis of ℝ𝑘 . Then 𝐯 is of the form
∑
𝐯 = 𝑘𝑖=1 𝑣𝑖 𝑒𝑖 for smooth functions 𝑣𝑖 ∶ 𝑈 → ℝ.
333
334 17.2. Index of a vector field at a zero
from linear algebra that for any such functional there is a vector 𝐯(𝑥) ∈ 𝑇𝑥 𝑋 such that
where 𝐯(𝑥)𝑇 denotes the transpose of 𝐯(𝑥) and ⋅ is matrix multiplication. We may think
of this assignment 𝑥 → 𝐯(𝑥) ∈ 𝑇𝑥 𝑋 as a vector field. This is called the gradient field
of 𝑓 and we denote it by 𝐠𝐫𝐚𝐝(𝑓 ). We note that if 𝑧 ∈ 𝑋 is a zero of 𝐠𝐫𝐚𝐝(𝑓 ), then
𝑑𝑓𝑧 (𝑤) = 𝐠𝐫𝐚𝐝(𝑓 )(𝑧)𝑇 ⋅ 𝑤 = 0 for all 𝑤. Hence 𝑑𝑓𝑧 = 0 and 𝑧 is a critical point of
𝑓 . Conversely, if 𝑧 is a critical point of 𝑓 , then 𝑑𝑓𝑧 = 0 and 𝐠𝐫𝐚𝐝(𝑓 )(𝑧) must be zero.
Thus, 𝑧 is a zero the gradient field of 𝑓 if and only if 𝑧 is a critical point.
In the special case 𝑋 = ℝ𝑘 , the gradient field of 𝑓 can be represented by the Jacobian
(1 × 𝑘)-matrix at 𝑥, i.e.,
( )
𝜕𝑓 𝜕𝑓
𝐠𝐫𝐚𝐝(𝑓 )(𝑥) = (𝑥), … , (𝑥) ,
𝜕𝑥1 𝜕𝑥𝑘
∑𝑘
𝜕𝑓
𝐠𝐫𝐚𝐝(𝑓 )(𝑥) = (𝑥)𝑒𝑖 .
𝑖=1
𝜕𝑥𝑖
Definition 17.4 (Index at the origin) We assume that the vector field 𝐯 on ℝ𝑛 has an
isolated zero at the origin, i.e., there is a small radius 𝜀 > 0 such that 𝐯 has no other
zeros in the closed ball 𝔹̄ 𝑛𝜀 around the origin with radius 𝜀. Then we can define the map
𝐯(𝑦)
𝐯̄ ∶ 𝕊𝑛−1 → 𝕊𝑛−1 , 𝑦 → .
𝜀 |𝐯(𝑦)|
We equip both spheres with the standard orientation, i.e., they are oriented as the
boundary of 𝔹̄ 𝑛𝜀 and 𝔹̄ 𝑛1 , respectively. We define the index of 𝐯 at 0, denoted ind0 (𝐯), the
be the degree of 𝐯:̄
̄
ind0 (𝐯) ∶= deg(𝐯).
Note:
∙ We can think the map 𝑦 → 𝐯(𝑦)∕|𝐯(𝑦)| as a measure of the variation of the direction of
𝐯 around the origin.
∙ The choice of the radius does not matter for the definition of ind0 , since if we choose
another radius, say 𝜀′ < 𝜀, then 𝐯̄ can be extended to a compact manifold 𝑊 = 𝔹̄ 𝑛𝜀 ⧵ 𝔹𝑛𝜀′
where 𝔹̄ denotes the closed ball and 𝔹 the open ball. The boundary of 𝑊 is 𝜕𝑊 =
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 335
𝜀 − 𝕊𝜀′ . Hence the Boundary Theorem 15.2 implies that the two degrees that arise
𝑛−1
𝕊𝑛−1
from using 𝕊𝑛−1
𝜀 and 𝕊𝑛−1
𝜀′
, respectively, agree.
∙ We can easily construct vector fields with a zero with arbitrary index as follows: In the
plane of complex numbers the polynomial 𝑧 → 𝑧𝑘 defines a smooth vector field with a
zero of index 𝑘 at the origin, and the function 𝑧 → 𝑧̄ 𝑘 defines a vector field with a zero
of index −𝑘. See Figure 17.1.
∙ More examples of vector fields, or for some the curves that the fields are tangent to, in
Figure 17.2 and Figure 17.3.
Figure 17.1: Vector fields arising from the maps 𝑧 → 𝑧1 and 𝑧 → 𝑧̄ with indices 1 and −1,
respectively.
Figure 17.2: Source and sink have index +1, while saddle points have index −1.
Now let 𝑋 be a smooth manifold and let 𝐯 be a vector field on 𝑋. We would like to transfer
the definition of the index of 𝐯 at a point 𝑥 ∈ 𝑋 from Euclidean space to 𝑋. The idea is of how
to do this is the familiar one: we use a local parametrization to transfer information between 𝑋
and Euclidean space.
Definition 17.5 (Pullback of vector fields) Let 𝑋 and 𝑌 be smooth manifolds and
let 𝐯 be a vector field on 𝑌 . Assume that there is a diffeomorphism 𝑓 ∶ 𝑋 → 𝑌 . Then
we define the pullback vector field, denoted 𝑓 ∗ 𝐯, by assigning to 𝑥 the vector which
336 17.2. Index of a vector field at a zero
Figure 17.3: Two types of spiral movements. One has index +1, the other one has index +2,
since we have a positive contribution from above and below.
To compute the right-hand side, let (𝑒1 , … , 𝑒𝑘 ) denote the standard basis of ℝ𝑘 . For
𝑢 ∈ 𝑈 , the vectors 𝑑𝜙𝑢 (𝑒1 ), … , 𝑑𝜙𝑢 (𝑒𝑘 ) form a basis of 𝑇𝜙(𝑢) 𝑋. We need to determine
how 𝑓𝜙(𝑢) acts on each 𝑑𝜙𝑢 (𝑒𝑖 ). By definition of the induced derivative we have
𝜙 𝑓
where 𝑑(𝑓 ◦𝜙)𝑢 is the derivative of the smooth map 𝑈 ←←←→ ← ℝ. Hence, with respect
← 𝑋 ←←←→
to the basis (𝑑𝜙𝑢 (𝑒1 ), … , 𝑑𝜙𝑢 (𝑒𝑘 )), we can represent 𝑑𝑓𝜙(𝑢) by the Jacobian (1×𝑘)-matrix
( )
𝜕(𝑓 ◦𝜙) 𝜕(𝑓 ◦𝜙)
𝑑𝑓𝜙(𝑢) = (𝑢), … , (𝑢) .
𝜕𝑢1 𝜕𝑢𝑘
The vector 𝐠𝐫𝐚𝐝(𝑓 )(𝜙(𝑢)) ∈ 𝑇𝜙(𝑢) 𝑋 such that 𝑑𝑓𝜙(𝑢) (𝑤) = (𝐠𝐫𝐚𝐝(𝑓 )(𝜙(𝑢)))𝑇 ⋅ 𝑤 for all
𝑤 ∈ 𝑇𝜙(𝑢) 𝑋, is then given by
∑
𝑘
𝜕(𝑓 ◦𝜙)
𝐠𝐫𝐚𝐝(𝑓 )(𝜙(𝑢)) = (𝑢)𝑑𝜙𝑢 (𝑒𝑖 ) ∈ 𝑇𝜙(𝑢) 𝑋.
𝑖=1
𝜕𝑢𝑖
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 337
For a basis vector 𝑑𝜙𝑢 (𝑒𝑗 ) ∈ 𝑇𝜙(𝑢) 𝑋, we then get
In order to obtain 𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 ) it remains to apply 𝑑𝜙−1 𝑢 and to express the effect of
𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 )(𝑢) in terms of the standard basis 𝑒1 , … , 𝑒𝑘 of ℝ𝑘 . For the latter we observe
that we have
( ∗ )𝑇
𝜙 𝐠𝐫𝐚𝐝(𝑓 )(𝑢) ⋅ 𝑒𝑗 = (𝐠𝐫𝐚𝐝(𝑓 )(𝜙(𝑢)))𝑇 ⋅ 𝑑𝜙𝑢 (𝑒𝑗 ) ∈ ℝ.
Moreover, Equation 17.1 tells us which real number that is. Thus, in terms of the stan-
dard basis of ℝ𝑘 , we can express 𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 )(𝑢) as follows: We define smooth functions
𝑔𝑖𝑗 ∶ 𝑈 → ℝ by the formula
Finally, recall from Example 17.1 that every vector field 𝐯 on 𝑈 ⊂ ℝ𝑘 is of the form
∑
𝐯 = 𝑗 𝑣𝑗 𝑒𝑗 for smooth functions 𝑣𝑗 ∶ 𝑈 → ℝ. Then we can write 𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 ) as
∑
𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 ) = 𝑣𝑗 𝑒𝑗
𝑗
Definition 17.7 (Index of a zero) Let 𝑋 be a smooth manifold and let 𝐯 be a vector
field on 𝑋 with an isolated zero at 𝑧 ∈ 𝑋. Let 𝜙 ∶ 𝑈 → 𝑋 be a local parametrization
with 𝜙(0) = 𝑧. We define the index of 𝐯 at 𝑧 to be
We need to show that this definition does not depend on the choice of the local parametriza-
tion 𝜙. We do this in two steps via the following two lemmata:
∙ Note that the assertion of Lemma 17.8 is quite different from the situation on the sphere
𝕊𝑛 where there exists orientation preserving diffeomorphism which is not smoothly ho-
motopic to the identity.
That the index of a zero of a vector field on a smooth manifold is well-defined will now
follow from the next lemma:
Proof: Since we only need to study the zeros in a small open neighborhood of the origin,
we may assume that 𝑈 and 𝑈 ′ are open balls around the origin with sufficiently small radii 𝜀
and 𝜀′ , respectively. First we assume that 𝑓 preserves orientation. Then by Lemma 17.8 we
can find a smooth isotopy
𝑓𝑡 ∶ 𝑈 ′ × [0, 1] → ℝ𝑛 , with 𝑓0 = id, 𝑓1 = 𝑓 , and 𝑓𝑡 (0) = 0 for all 𝑡.
For each 𝑡, we set 𝐯𝑡 ∶= 𝑓𝑡∗ 𝐯 on 𝑓𝑡−1 (𝑈 ) ⊂ ℝ𝑛 . Since 𝑓𝑡 is a smooth isotopy, each 𝑓𝑡 is a
diffeomorphism and the vector field 𝐯𝑡 is well-defined with an isolated zero at the origin for
every 𝑡 ∈ [0, 1]. For each 𝑡 ∈ [0, 1], let 𝜀𝑡 > 0 be the maximal radius such that 𝐯𝑡 has no zeros
other than the origin in 𝔹𝑛𝜀 . The function [0, 1] → ℝ+ , 𝑡 → 𝜀𝑡 is continuous and hence takes
𝑡
a minimum when 𝑡 varies in [0, 1]. Let 𝜀 > 0 be that minimum. Then we have a well-defined
smooth map
𝐯 (𝑦)
𝐯̄ 𝑡 ∶ 𝕊𝑛−1 → 𝕊𝑛−1 , 𝑦 → 𝑡 .
𝜀 |𝐯𝑡 (𝑦)|
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 339
As 𝐯0 = 𝐯 and 𝐯1 = 𝑓 ∗ 𝐯, we deduce
from the invariance of deg under homotopy. This proves the assertion for an orientation pre-
serving diffeomorphism.
Now we assume that 𝑓 reverses orientation. Then we can write 𝑓 as the composition of
an orientation preserving diffeomorphism and a reflection 𝑟 ∶ ℝ𝑛 → ℝ𝑛 which switches the sign
of exactly one coordinate, i.e., 𝑟 sends (𝑥1 , … , 𝑥𝑖 , … , 𝑥𝑛 ) to (𝑥1 , … , −𝑥𝑖 , … , 𝑥𝑛 ). By the first
case, we may therefore assume that 𝑓 = 𝑟. We write 𝐯′ = 𝑓 ∗ 𝐯. Then the associated map 𝐯̄ ′ on
the 𝜀-sphere 𝕊𝑛−1
𝜀 → 𝕊𝑛−1 with 𝐯̄ ′ (𝑦) = 𝐯′ (𝑦)∕|𝐯′ (𝑦)| satisfies
𝐯̄ ′ = 𝑟−1 ◦𝐯◦𝑟
̄
since 𝑟 is a linear isomorphism and therefore 𝑑𝑟0 = 𝑟. Using the definition of the degree this
shows deg(𝐯) ̄ = deg(𝐯̄ ′ ) and completes the proof of the lemma.
Lemma 17.10 (Index of a zero is well-defined) Let 𝑋 be a smooth manifold and let
𝐯 be a vector field on 𝑋 with an isolated zero at 𝑧 ∈ 𝑋. Then the index of 𝐯 at 𝑧 does
not depend on the choice of a local parametrization around 𝑧.
𝜙∗ 𝐯 = (𝜓 −1 ◦𝜙)∗ (𝜓 ∗ 𝐯).
Example 17.11 (Index sum on spheres) Recall the the vector field 𝐯 on the 𝑛-
dimensional sphere 𝕊𝑛 defined in Example 17.2 given by
𝐯(𝑥) = 𝑁 − (𝑁 ⋅ 𝑥)𝑥
where 𝑁 ∈ 𝕊𝑛 denotes the north pole. and ⋅ denotes the inner product defined by consid-
ering 𝑁 and 𝑥 as vectors in ℝ𝑛+1 . We compute the effect of 𝐯 on 𝑥 = (𝑥1 , … , 𝑥𝑛+1 ) ∈ 𝕊𝑛
as
This shows that 𝐯 has exactly two zeros: the north pole 𝑁 = (0, … , 0, 1) and the south
pole 𝑆 = (0, … , 0, −1). Now we compute the index of 𝐯 at both zeros. By definition,
we compute the index at a zero by choosing a local parametrization and pulling back the
vector field to ℝ𝑛 . For this example, it is convenient to use the stereographic projections
from the poles of 𝕊𝑛 . First, we look at the zero at 𝑆. Let 𝜙𝑁 ∶ ℝ𝑛 → 𝕊𝑛 ⧵ 𝑝𝑁 be the
340 17.3. The Euler characteristic - algebraic topology in a nutshell
stereographic projection form the north pole. We then restrict 𝜙𝑁 to the open lower
hemisphere on §𝑛 as an open neighborhood 𝑉𝑆 of 𝑆 which does not contain any other
zeros of 𝐯. This has the advantage that the restriction of 𝜙𝑁 to 𝑉𝑆 is a diffeomorphism of
𝑉𝑆 to the open ball 𝑈𝑆 ∶= 𝔹𝑛1 ⊂ ℝ𝑛 . Thus the boundary of the closure 𝑈 𝑆 of 𝑈𝑆 is just
the unit sphere 𝕊𝑛−1 ∈ ℝ𝑛 . Hence 𝜙𝑁 send a point 𝑢 ∈ 𝜕𝑈 𝑆 to 𝜙𝑁 (𝑢) = (𝑢1 , … , 𝑢𝑛 , 0) ∈
𝕊𝑛 ⊂ ℝ𝑛+1 . Thus, by Equation 17.3, we get
This does not mean that 𝜙∗𝑁 𝐯 is constant, since we still have to apply the derivative
(𝑑(𝜙𝑁 )𝑢 )−1 . We computed this derivative in Exercise 2.7. The formula there yields
⎛1 0 … 0 𝑢1 ⎞ ⎛0⎞ ⎛𝑢1 ⎞
⎜0 1 … 0 𝑢2 ⎟ ⎜⋮⎟ ⎜𝑢2 ⎟
𝜙∗𝑁 𝐯(𝑢) = ⎜ ⎟ ⋅ ⎜ ⎟ = ⎜ ⎟ for all 𝑢 ∈ 𝜕𝑈 𝑆
⎜⋮ ⋮ … ⋮ ⋮ ⎟ ⎜0⎟ ⎜ ⋮ ⎟
⎝ 0 … 0 1 𝑢𝑛 ⎠ ⎝ 1 ⎠ ⎝ 𝑢𝑛 ⎠
Thus, on the boundary of the closure of the neighborhood around the zero 𝑆, 𝜙∗𝑁 𝐯 acts
as the identity. This shows that the index of 𝐯 at 𝑆 is +1:
To compute the index at the north pole 𝑁 we use the stereographic projection 𝜙𝑆 and
use the upper hemisphere as a neighborhood 𝑉𝑁 of 𝑁. We then get again 𝐯(𝜙𝑁 (𝑢) =
(0, … , 0, 1) for all 𝑢 ∈ 𝜕𝑈 𝑁 . However, the vector field 𝜙∗𝑆 𝐯 now acts as
This shows that 𝜙∗𝑆 𝐯 acts as the antipodal map 𝑎𝕊𝑛−1 on 𝕊𝑛−1 . Hence the index of 𝐯 at
𝑁 is equals the degree of the antipodal map on 𝕊𝑛−1 which is (−1)𝑛 by Exercise 15.1:
We observe that, in this example, the sum of the indices of the zeros of the vector field
𝐯 is 0 if 𝑛 is odd and is 2 if 𝑛 is even. We will see in the next section that this sum is
actually independent of the particular choice of vector field.
In order to state the main result of this chapter we have to take a brief detour to algebraic topol-
ogy and introduce the Euler characteristic. There are of course many different ways to define
the Euler characteristic of a smooth manifold 𝑋. However, the definitions that are accessible
with the techniques developed in this book, for example as the self-intersection number of the
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 341
diagonal embedded into 𝑋 × 𝑋, do not convey the property that we are interested here: the
Euler characteristic is a purely topology invariant, i.e., it only depends on the topology of 𝑋
and not it structure as a smooth manifold. Other definition, for example via triangulation, are
more intuitive and seem more elementary. To make them precise and independent of choices,
however, requires a lot of work. Since we assume the reader to be interested or familiar with
algebraic topology anyway we will define the Euler characteristic via singular homology.
We will now briefly introduce singular homology groups. We refer the reader to any general
introduction to algebraic topology for any details and further explanation and motivation, for
example [6] or [16]. One may view the underlying idea as to map certain model spaces with
precisely understood properties into the topological spaces we would like to understand. There
are different choices for these model spaces, for example one can use spheres and define homo-
topy groups. For singular homology one uses spaces with a very nice combinatorial behavior.
A rigorous way to do this is In algebraic topology is the following.
∑
𝑛
𝑛
Δ = {(𝑡0 , … , 𝑡𝑛 ) ∈ ℝ 𝑛+1
∶ 𝑡𝑖 = 1, 𝑡𝑖 ≥ 0 for all 𝑖}.
𝑖=0
The standard simplices are related by face maps for 0 ≤ 𝑖 ≤ 𝑛 which can be described as
with the 0 inserted at the 𝑖th coordinate where 𝑡0 is the 0th coordinate. Let 𝑋 be a topological
space. A singular 𝑛-simplex in 𝑋 is a continuous map 𝜎 ∶ Δ𝑛 → 𝑋. We denote by Sing𝑛 (𝑋)
the set of all 𝑛-simplices in 𝑋. For example, Sing0 (𝑋) is just the set of points of 𝑋. But, in
general, Sing𝑛 (𝑋) carries more interesting information for 𝑛 ≥ 1. For 0 ≤ 𝑖 ≤ 𝑛, we can use
the face maps 𝜙𝑛𝑖 to define maps
by sending an 𝑛-simplex 𝜎 to the 𝑛 − 1-simplex defined by precomposition with the 𝑖th face
inclusion. The image 𝑑𝑖𝑛 (𝜎) = 𝜎◦𝜙𝑛𝑖 is called the 𝑖th face of 𝜎.
Figure 17.4: The face maps from the one-dimensional simplex into the two-dimensional one.
There are three maps corresponding to the three edges of Δ2 . Then 𝜎 maps the simplices to 𝑋.
342 17.3. The Euler characteristic - algebraic topology in a nutshell
To make this construction accessible to algebraic tools, we define the group 𝑆𝑛 (𝑋) of sin-
gular 𝑛-chains in 𝑋 as the free abelian group generated by 𝑛-simplices, i.e.,
Thus an 𝑛-chain is a finite ℤ-linear combination of simplices. We define the boundary operator
as the homomorphism of abelian groups determined by
∑
𝑛
𝜕𝑛 ∶ 𝑆𝑛 (𝑋) → 𝑆𝑛−1 (𝑋), 𝜕𝑛 (𝜎) = (−1)𝑖 𝑑𝑖𝑛 𝜎.
𝑖=0
A key property of the operators 𝜕𝑛 is that their composition vanishes, i.e., 𝜕𝑛+1 ◦𝜕𝑛 = 0. This
implies that the sequence of pairs {𝑆𝑛 (𝑋), 𝜕𝑛 } forms a chain complex:
𝜕𝑛+1 𝜕𝑛 𝜕𝑛−1 𝜕2 𝜕1 𝜕0
⋯ ←←←←←←←→
← 𝑆𝑛 (𝑋) ←←←←→ ← ⋯ ←←←←→
← 𝑆𝑛−1 (𝑋) ←←←←←←←→ ← 𝑆1 (𝑋) ←←←←→
← 𝑆0 (𝑋) ←←←←→
← 0.
Definition 17.12 (Singular homology) The 𝑛th singular homology group of 𝑋 with
coefficients in ℤ is defined to be the quotient group
∙ For each 𝑛, the group 𝐻𝑛 (𝑋; ℤ) only depends on the homotopy type of 𝑋, i.e. 𝑓 ∶ 𝑌 →
𝑌 is a homotopy equivalence, then 𝑓∗ ∶ 𝐻𝑛 (𝑋; ℤ) → 𝐻𝑛 (𝑌 ; ℤ) is an isomorphism.
∙ This follows from the fact that the assignment 𝑓 → 𝑓∗ is invariant under homotopy, i.e.,
𝑓 ≃ 𝑔 implies 𝑓∗ = 𝑔∗ .
∙ Intuitively, the 𝑛th singular homology measures the following property of 𝑋: A 𝑛-cycle
𝛼, i.e., an element in the kernel of 𝜕𝑛 , is a closed 𝑛-dimensional loop on 𝑋. The cycle 𝛼
vanishes in homology if it is a boundary, i.e., if there is an element 𝛽 in 𝑆𝑛+1 (𝑋) such that
𝜕𝑛+1 (𝛽) = 𝛼. In other words, 𝛼 vanishes if it can be filled by a chain of one dimension
higher. Hence we may think of a nonzero element in 𝐻𝑛 (𝑋; ℤ) as a tool to detect an
𝑛-dimensional hole in 𝑋.
𝑓
𝕊𝑛−1
_ /𝑌
𝔹̄ 𝑛 /𝑌 ∪ 𝔹 ̄ 𝑛.
𝑓
The Euler characteristic may now be viewed as a very rough summary of the information the
homology groups contain. First we reduce the information of homology groups to their rank,
i.e., we let 𝑏𝑖 denote the rank of 𝐻𝑖 (𝑋; ℤ) as an abelian group. The number 𝑏𝑖 is called the 𝑖th
Betti number of 𝑋. If 𝑋 is a 𝑘-dimensional manifold, then we have 𝑏𝑖 = 0 for all 𝑖 ≥ 𝑘 + 1.
Hence we may take the alternating sum of the Betti numbers:
∑
𝑘
∑
𝑘
𝜒(𝑋) ∶= (−1)𝑖 𝑏𝑖 = (−1)𝑖 rank𝐻𝑖 (𝑋; ℤ).
𝑖=1 𝑖=1
∙ Since, for all 𝑖, the groups 𝐻𝑖 (𝑋; ℤ) only depend on the topology of 𝑋, it follows that
Euler characteristic of 𝑋 only depends on the topology of 𝑋. In fact, 𝜒(𝑋) only depends
on the homotopy type of 𝑋.
The Euler characteristic can often be computed by other methods. The following situation
is of particular interest in Morse theory which will play again a role later in this chapter.
Remark 17.15 (Euler characteristic of a cell complex) Assume that the manifold 𝑋 is
homotopy equivalent to a cell complex 𝑌 , i.e., a space which is obtained by successively
gluing 𝑖-dimensional cells together for 𝑖 = 1, … , 𝑘. Let 𝑐𝑖 denote the number of 𝑖-
dimensional cells we attach. In this case, the Euler characteristic of 𝑋 can be computed
344 17.4. The Poincaré–Hopf Index Theorem
as
∑
𝑘
𝜒(𝑋) = (−1)𝑖 𝑐𝑖 . (17.4)
𝑖=1
Equation 17.4 comes quite close to the familiar formula of Euler which says that the
alternating sum 𝑣 − 𝑒 + 𝑓 is an invariant of a surface 𝑆, where 𝑣, 𝑒, 𝑓 denote the number
of vertices, edges and faces, respectively, of a polygon which covers 𝑆. For example,
the Euler characteristic of the 2-sphere is two and the Euler characteristic of the torus is
zero.
∙ Theorem 17.16 consists of two remarkable statements: First, the sum of the indices is
independent of the vector field, i.e., the sum is entirely determined by the manifold 𝑋.
Secondly, it is just the topology of 𝑋 that matters since the latter determines the Euler
characteristic of 𝑋.
∙ The Euler characteristic is a topological invariant while the nature of the index is an-
alytic. Hence we may think of the theorem as a version of an index theorem. The
generalizations of such results are extremely influential and important in many fields of
mathematics, mostly for the reason that they actually connect different branches.
∙ We have seen in Example 17.11 that the index sum for some vector field on 𝕊2𝑛 is 2.
Hence the index sum for any vector field on 𝕊2𝑛 must be 2. In particular, every vector
field on 𝕊2𝑛 must have at least one zero.
∙ The theorem can be extended to manifolds with boundary by requiring that the vector field
has to point outwards at every boundary point. We will omit the proof of this extended
result here.
∙ For example, we know from the exercises that odd-dimensional spheres admit non-
vanishing vector fields. This again shows 𝜒(𝕊2𝑘+1 ) = 0.
In fact, the dimension of the manifold has a strong influence on the Euler characteristic.
Proof: Let 𝐯 be a vector field on 𝑋. Then we can multiply each 𝐯(𝑥) by −1 to get a new field
we denote by −𝐯. If 𝑧 is a zero of 𝐯, 𝑧 is a zero of −𝐯 as well. The induced maps 𝕊𝑛−1
𝜀 → 𝕊𝑛−1
we use to compute the indices at 𝑧 of 𝐯 and −𝐯, respectively, differ by a composition with the
antipodal map 𝑎𝕊𝑛−1 on 𝕊𝑛−1 . This changes the degree by multiplication by (−1)𝑛 which is −1,
since 𝑛 is odd. Thus, by Theorem 17.16, the index sum of 𝐯 and −𝐯 satisfy
∑ ∑ ∑
ind𝑧 (𝐯) = ind𝑧 (−𝐯) = − ind𝑧 (𝐯).
𝑧∈𝐯−1 (0) 𝑧∈(−𝐯)−1 (0) 𝑧∈𝐯−1 (0)
Hence the index sum must be zero. By Theorem 17.16 this implies that the Euler characteristic
is zero as well.
We can verify that 𝐯(𝑥) ∈ 𝑇𝑥 𝕋 = 𝑇(𝑥1 ,𝑦1 ) 𝕊1 × 𝑇(𝑥2 ,𝑦2 ) 𝕊1 ⊂ ℝ4 . Moreover, we see 𝐯(𝑥)
is never zero. Hence there are nowhere vanishing vector fields on the torus. We deduce
from this example and Theorem 17.16 that the index sum of any vector field on the torus
and that the Euler characteristic of the torus are both zero.
We split the proof of Theorem 17.16 into two steps: the assertion on the independence of the
index sum of the vector field, and the identification with the Euler characteristic. We begin with
independence and will prove the following theorem:
We prove Theorem 17.22 in several steps. The key idea is show that the index sum equals
the degree of a particular map that only depends on 𝑋.
𝑔 ∶ 𝜕𝑋 → 𝕊𝑛−1
In the following we are going to use without mentioning the following observation:
Proof: Let 𝑧1 , … , 𝑧𝑘 be the zeros of 𝐯 which are not on the boundary of 𝑋. After removing
an 𝜀-ball around each zero for a sufficiently small 𝜀 we obtain again a smooth manifold with
boundary which we denote by 𝑌 . The map 𝐯(𝑦) ̄ = 𝐯(𝑦)∕|𝐯(𝑦)| defines a smooth map 𝑌 → 𝕊𝑛−1 .
Since 𝐯̄ is defined on all of 𝑌 , the degree of the restriction of 𝐯̄ to 𝜕𝑌 is zero by the Boundary
Theorem 15.2. However, the degree of the restriction of 𝐯̄ to 𝜕𝑌 equals the sum of the degrees
of the restrictions of 𝐯̄ to the components of 𝜕𝑌 . One of the components of 𝜕𝑌 is 𝜕𝑋. By
assumption, 𝐯 points outward at all points on 𝜕𝑋. This implies 𝐯̄ |𝜕𝑋 is homotopic to the Gauss
map 𝑔 which sends 𝑥 ∈ 𝜕𝑋 to the outward unit normal vector at 𝑥. Since the degree is invariant
under homotopy, this implies deg(𝐯̄ |𝜕𝑋 ) = deg(𝑔).
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 347
The other boundary components of 𝜕𝑌 are 𝑛 − 1-dimensional 𝜀-spheres with the boundary
orientation induced by the orientation of 𝑋. By construction of 𝑌 the boundary orientation is
such that the normal vector pointing into the 𝜀-sphere completes to a positive oriented basis.
That is, the orientation of each of the 𝜀-spheres, which are the components of 𝜕𝑌 , is the opposite
of the standard orientation. Thus, by definition of the index of a zero of 𝐯, the degree of the
∑
restriction of 𝐯̄ to the other boundary components of 𝜕𝑌 equals − 𝑘𝑖=0 ind𝑧𝑖 (𝐯), the negative
of the index sum. Hence, in total, we have shown
∑
𝑘
deg(𝑔) − ind𝑧𝑖 (𝐯) = deg(𝐯̄ |𝜕𝑌 ) = 0.
𝑖=0
In order to extend Lemma 17.25 to arbitrary manifolds we will now study the derivative of
the vector field.
∙ Note that it follows from the Inverse Function Theorem 12.4 that, if 𝐯 is nondegenerate
at 𝑧, then 𝑧 is an isolated zero.
Proof: We may assume 𝑧 = 0. We can find a sufficiently small open ball 𝔹 around the
origin inside 𝑈 We may then consider 𝐯|𝔹 as a diffeomorphism into ℝ𝑛 . Now we can apply
Lemma 17.8 and the method used in the proof of Lemma 17.9. If 𝐯 preserves orientation, then
we can smoothly deform it into the identity without introducing any new zeros. In this case
the index equals +1. If 𝐯 reverses orientation, then we can smoothly deform it into a reflection
without introducing any new zeros. In this case the index equals −1. This proves the lemma.
348 17.5. Poincaré–Hopf Theorem - Independence
Proof: We choose a local parametrization 𝜙 ∶ 𝑈 → 𝑋 around 𝑧. Let 𝑒𝑖 denote the 𝑖th basis
vector of ℝ𝑛 . For a point 𝑢 = (𝑢1 , … , 𝑢𝑛 ) ∈ 𝑈 and writing 𝜙 = (𝜙1 , … , 𝜙𝑁 ), we set
1
⎛ 𝜕𝜙 (𝑢) ⎞
⎜ 𝜕𝑢𝑖 ⎟ 𝜕𝜙
𝑡𝑖 (𝑢) ∶= 𝑑𝜙𝑢 (𝑒𝑖 ) = ⎜ ⋮ ⎟ =∶ (𝑢). (17.5)
⎜ 𝜕𝜙 (𝑢)⎟
𝑁 𝜕𝑢𝑖
⎝ 𝜕𝑢
𝑖
⎠
By definition of the tangent space and the derivative, the 𝑡1 (𝑢), … , 𝑡𝑛 (𝑢) then form a basis of
the tangent space 𝑇𝜙(𝑢) 𝑋. We need to determine the image of the 𝑡𝑖 (𝑢) under the linear transfor-
mation 𝑑𝐰𝜙(𝑢) . We fix an 𝑖 and then have
𝜕(𝐰◦𝜙)
𝑑𝐰𝜙(𝑢) (𝑡𝑖 (𝑢)) = 𝑑(𝐰◦𝜙)𝑢 (𝑒𝑖 ) = (𝑢) (17.6)
𝜕𝑢𝑖
using the shortened notation of (17.5) on the right-hand side. Now we let 𝐯 = 𝜙∗ 𝐰 denote
pullback vector field on 𝑈 along 𝜙. Since 𝑈 is an open subset of ℝ𝑛 , the tangent space 𝑇𝑢 𝑈 at
∑
any 𝑢 ∈ 𝑈 is just ℝ𝑛 . We may then write 𝐯 as 𝐯 = 𝑛𝑖=1 𝑣𝑖 𝑒𝑖 with suitable smooth functions
𝑣1 , … , 𝑣𝑛 ∶ 𝑈 → ℝ. By definition we have 𝐯(𝑢) = 𝑑𝜙−1
𝑢 ◦𝐰◦𝜙(𝑢) and 𝑡 (𝑢) = 𝑑𝜙𝑢 (𝑒 ) so that
𝑖 𝑖
∑
𝐰(𝜙(𝑢)) = 𝑑𝜙𝑢 (𝐯(𝑢)) = 𝑣𝑖 (𝑢)𝑡𝑖 (𝑢).
𝑖
Taking partial derivatives with respect to 𝑢𝑖 and using the product rule then yields
𝜕(𝐰◦𝜙) ∑ 𝜕𝑣𝑗
𝑑𝐰𝑧 (𝑡𝑖 (𝑢))
̄ = (𝑢)
̄ = ̄ 𝑗 (𝑢).
(𝑢)𝑡 ̄ (17.8)
𝜕𝑢𝑖 𝑗
𝜕𝑢𝑖
This shows that, at the zero 𝑧, the image of the basis vector 𝑡𝑖 under 𝑑𝐰𝑧 is a linear combination
of the basis vectors of 𝑇𝑧 𝑋. Thus, the image of 𝑑𝐰𝑧 in ℝ𝑁 is contained in the subspace 𝑇𝑧 𝑋.
This proves the first assertion of the lemma.
Moreover, (17.8) also shows that the determinant of the linear transformation
𝑑𝐰𝑧 ∶ 𝑇𝑧 𝑋 → 𝑇𝑧 𝑋
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 349
𝜕𝑣
equals the determinant of the matrix with (𝑖, 𝑗)th entry 𝜕𝑢𝑗 (𝑢).
̄ Thus we get det 𝑑𝐰𝑧 = det 𝑑𝐯𝑢̄ .
𝑖
Since 𝜙 is a diffeomorphism between open neighborhoods of the zeros 𝑢̄ of 𝐯 and 𝑧 of 𝐰, the
indices ind𝑧 (𝐰) and ind𝑢̄ (𝐯) are equal. Moreover, by the Inverse Function Theorem 12.4 the
zero 𝑧 is isolated if det 𝑑𝐰𝑧 ≠ 0, since then there is a small open neighborhood of 𝑧 such that
𝑧 is the only point where 𝐰(𝑥) vanishes.
By Lemma 17.28 we know ind𝑢̄ (𝐯) is +1 or −1 according to whether det 𝑑𝐯𝑢̄ is positive or
negative. Thus, ind𝑧 (𝐰) is +1 or −1 according to whether det 𝑑𝐰𝑧 is positive or negative. This
proves the lemma.
Lemma 17.29 allows us, in particular, to extend the definition of nondegenerate zeros to
arbitrary vector fields.
Nondegenerate zeros of the gradient field of a function can be detected by studying the
Hessian matrix:
Proof: We already know that 𝑧 ∈ 𝑋 is a zero of 𝐠𝐫𝐚𝐝(𝑓 ) if and only if 𝑧 is a critical point
of 𝑓 . Let 𝑧 ∈ 𝑋 be a zero of 𝐠𝐫𝐚𝐝(𝑓 ). Let 𝜙 ∶ 𝑈 → 𝑋 be a local parametrization of 𝑋 with
𝜙(𝑢) = 𝑧. Then it follows from Equation 17.2 that the derivative 𝑑(𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 ))𝑢 of 𝜙∗ 𝐠𝐫𝐚𝐝(𝑓 )
at 𝑢 is represented, with respect to the standard basis of ℝ𝑘 , by a matrix which is the product
of the Hessian matrix of 𝑓 ◦𝜙 at 𝑢 and the real (𝑘 × 𝑘)-matrix 𝐺(𝑢) with (𝑖, 𝑗)-entry 𝑔 𝑖 𝑗(𝑢). By
definition, 𝑧 is a nondegenerate critical point of 𝑓 if and only if the Hessian matrix of 𝑓 ◦𝜙 at 𝑢
is invertible. Thus, to prove the assertion it suffices to show that det 𝐺(𝑢) ≠ 0.
Let 𝐴𝑢 denote the matrix which represents 𝑑𝜙𝑢 with respect to the bases 𝑒1 , … , 𝑒𝑘 of ℝ𝑘 and
𝑑𝜙𝑢 (𝑒1 ), … , 𝑑𝜙𝑢 (𝑒𝑘 ) of 𝑇𝜙(𝑢) 𝑋. Then 𝑑𝜙𝑢 (𝑒𝑖 )𝑇 is the 𝑖th row and 𝑑𝜙𝑢 (𝑒𝑗 ) is the 𝑗th column of
𝑀𝑢 . Thus, by definition of 𝐺(𝑢) and the 𝑔𝑖𝑗 (𝑢), we have 𝐺(𝑢) = 𝑀𝑢𝑇 ⋅𝑀𝑢 . Since 𝑀𝑢 is invertible,
this shows that 𝐺(𝑢) is an invertible real symmetric matrix. Hence it is positive definite and all
its eigenvalues are strictly positive. In particular, we see that det 𝐺(𝑢) > 0.
Remark 17.32 (Existence of vector fields with finitely many zeros on compact
manifolds) Since functions with only nondegenerate critical points are generic by The-
orem 7.13, it follows from Lemma 17.31 that there are vector fields on every smooth
manifold with only nondegenerate zeros. In particular, every smooth manifold admits a
vector field with only isolated zeros. Moreover, if 𝑋 is compact, then the set of isolated
zeros is closed and hence finite. Thus, on every compact manifold there exists a vector
field with only finite many zeros which are all isolated.
350 17.5. Poincaré–Hopf Theorem - Independence
Now we let 𝑋 ⊂ ℝ𝑛 be a compact 𝑘-dimensional smooth manifold without boundary. For
𝜀 > 0, we let
𝑋 𝜀 = {𝑦 ∈ ℝ𝑛 ∶ |𝑦 − 𝑥| < 𝜀(𝑥) for some 𝑥 ∈ 𝑋}
be the open subspace in ℝ𝑛 containing 𝑋 of the 𝜀-Neighborhood Lemma 12.8. We then write
𝑋̄ 𝜀 = {𝑦 ∈ ℝ𝑛 ∶ |𝑦 − 𝑥| ≤ 𝜀(𝑥) for some 𝑥 ∈ 𝑋}
for the closure of 𝑋 𝜀 . Note that if we choose 𝜀 > 0 small enough, then 𝑋̄ 𝜀 is a smooth 𝑛-
dimensional manifold with boundary.
Theorem 17.33 (Index sum and Gauss map on 𝜀-neighborhood) Let 𝑋 and 𝑋̄ 𝜀 be
as above. Let 𝐰 be a vector field on 𝑋 with only finitely many zeros all of which are
nondegenerate. Then the index sum of 𝐰 is equal to the degree of the Gauss map
𝑔 ∶ 𝜕 𝑋̄ 𝜀 → 𝕊𝑘−1 .
In particular, the index sum does not depend on the choice of vector field.
Proof: Given a point 𝑦 ∈ 𝑋̄ 𝜀 . We know from ?? that, if we choose 𝜀 small enough, we can
find the point 𝜋(𝑦) ∈ 𝑋 which is closest to 𝑦 and such that the vector 𝑦 − 𝜋(𝑦) is perpendicular
to the tangent space of 𝑋 at 𝜋(𝑦). Moreover, we can consider the assignment 𝑦 → 𝜋(𝑦) as a
smooth map 𝜋 ∶ 𝑋̄ 𝜀 → 𝑋. Since 𝑦 − 𝜋(𝑦) is perpendicular to 𝑇𝜋(𝑦) 𝑋, the tangent space 𝑇𝑦 𝑋̄ 𝜀
at 𝑦 equals the tangent space at 𝜋(𝑦). For a point 𝑦 on the boundary 𝜕 𝑋̄ 𝜀 , we have |𝑦 − 𝜋(𝑦)| = 𝜀
and the vector 𝑦 − 𝜋(𝑦) points outward from 𝜋(𝑦) to 𝑦. Thus the outward unit normal vector
at 𝑦 ∈ 𝜕 𝑋̄ 𝜀 is
𝑔(𝑦) = (𝑦 − 𝜋(𝑦))∕𝜀.
Thus the Gauss map on 𝜕 𝑋̄ 𝜀 is given by
𝑔 ∶ 𝜕 𝑋̄ 𝜀 → 𝕊𝑛−1 , 𝑦 → (𝑦 − 𝜋(𝑦))∕𝜀.
Now we extend the vector field 𝐯 to a vector field 𝐰 on 𝑋̄ 𝜀 by setting
𝐰(𝑦) = (𝑦 − 𝜋(𝑦)) + 𝐯(𝜋(𝑦)).
We will now show that 𝑋̄ 𝜀 and 𝐰 satisfy the hypotheses of Lemma 17.25. Since 𝑋 is compact,
𝑋̄ 𝜀 is a bounded and closed subset of ℝ𝑛 and therefore a compact 𝑛-dimensional submanifold
of ℝ𝑛 with boundary.
Since 𝑔(𝑦) points outward and the inner product of 𝐰(𝑦) and 𝑔(𝑦), considered as vectors in ℝ𝑛 ,
is given by
𝐰(𝑦) ⋅ 𝑔(𝑦) = [(𝑦 − 𝜋(𝑦)) + 𝐯(𝜋(𝑦))] ⋅ [(𝑦 − 𝜋(𝑦))∕𝜀]
= [(𝑦 − 𝜋(𝑦)) ⋅ (𝑦 − 𝜋(𝑦))]∕𝜀 + [𝐯(𝜋(𝑦) ⋅ (𝑦 − 𝜋(𝑦))]∕𝜀
= 𝜀2 ∕𝜀 + 0
=𝜀>0
where we use that 𝑦 is on the boundary of 𝑋̄ 𝜀 and the 𝑦 − 𝜋(𝑦) is perpendicular to the tangent
vector 𝐯(𝜋(𝑦) ∈ 𝑇𝜋(𝑦) 𝑋.
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 351
∙ Claim: 𝐯 and 𝐰 have the same zeros.
Since the vectors 𝑦−𝜋(𝑦) and 𝐯(𝜋(𝑦)) are perpendicular, they cannot cancel each others when-
ever they are nonzero. Thus, 𝐰(𝑦) can only be zero if both 𝑦 − 𝜋(𝑦) and 𝐯(𝜋(𝑦)) are the zero
vector. Since 𝑦 − 𝜋(𝑦) = 0 only if 𝑦 ∈ 𝑋, we see that 𝐰 vanishes exactly when 𝐯 does.
Let 𝑧 ∈ 𝑋 be a zero of 𝐰 and therefore of 𝐯 by the previous claim. By Lemma 17.29 we can
compute both ind𝑧 (𝐰) and ind𝑧 (𝐯) as the the derivatives 𝑑𝐰𝑧 and 𝑑𝐯𝑧 , respectively, and we
know that 𝑑𝐯𝑧 is a linear transformation of 𝑇𝑧 𝑋 into itself, and 𝑑𝐰𝑧 is a linear transformation
of 𝑇𝑧 𝑋̄ 𝜀 into itself. Since 𝑇𝑧 𝑋 ⊂ 𝑇𝑧 𝑋̄ 𝜀 and hence 𝑇𝑧 𝑋̄ 𝜀 = 𝑇𝑧 𝑋 ⊕ 𝑇𝑧 𝑋 ⟂ , we can determine the
effect of 𝑑𝐰𝑧 by computing its effect on vectors in 𝑇𝑧 𝑋 and the orthogonal complement 𝑇𝑧 𝑋 ⟂
separately. Since the restriction of 𝜋 to 𝑋 is the identity, the derivative of 𝑦 → 𝑦 − 𝜋(𝑦) in the
direction of 𝑇𝑧 𝑋 acts trivially. Thus we get
In the direction orthogonal to 𝑇𝑧 𝑋, however, 𝑑𝐯𝑧 acts trivially and 𝜋 is constant along a fixed
line perpendicular to 𝑋. Thus we get
Since 𝑇𝑧 𝑋 and 𝑇𝑧 𝑋 ⟂ are orthogonal to each other and together span all of 𝑇𝑧 𝑋̄ 𝜀 , we can com-
pute the determinant of 𝑑𝐰𝑧 as the product
det 𝑑𝐰𝑧 = det(𝑑𝐰𝑧 )|𝑇𝑧 𝑋 ⋅ det(𝑑𝐰𝑧 )|𝑇𝑧 𝑋 ⟂ = det 𝑑𝐯𝑧 ⋅ 1 = det 𝑑𝐯𝑧 .
This proves the claim by Lemma 17.29. Hence, in particular, the index sums of 𝐰 and of 𝐯 are
equal. Moreover, since det 𝑑𝐰𝑧 = det 𝑑𝐯𝑧 ≠ 0, all zeros of 𝐰 are isolated by Lemma 17.29.
Thus we can apply Lemma 17.25 which shows that the index sum of 𝐰 is equal to the degree
of the Gauss map 𝑔. Thus the index sum of 𝐯 is equal to the degree of the Gauss map 𝑔 which
proves the theorem.
It remains to remove the assumption that the zeros of the vector field have to be nondegen-
erate.
Proof: First we assume that 𝐯 is a vector field defined on an open subset 𝑈 ⊂ ℝ𝑘 , and
assume that 𝑧 is the only zero of 𝐯. We choose a smooth bump function 𝜆 ∶ 𝑈 → [0, 1] which
has the value 1 on a small ball 𝔹𝜀′ around 𝑧 and has the value 0 outside a ball 𝔹𝜀 around 𝑧 with
slightly larger radius 𝜀. Since 𝐯 is a smooth map and has no other zeros than 𝑧, we can choose
a regular value 𝑦 of 𝐯. We then define a new vector field 𝐰 on 𝑈 by
Again, since 𝔹̄ 𝜀′ is compact, we can apply Lemma 17.25 to see that the index sum of 𝐯 and
the index sum of 𝐰 on 𝔹̄ 𝜀′ equals the degree of the Gauss map 𝑔 ∶ 𝜕 𝔹̄ 𝜀′ = 𝕊𝑘−1
𝜀′
→ 𝕊𝑘−1 , i.e.,
∑ ∑
ind𝑧 (𝐯) = deg(𝑔) = ind𝑧 (𝐰).
𝑧∈𝐯−1 (0) 𝑢∈𝐰−1 (0)
Now we let 𝑋 ⊂ ℝ𝑛 and 𝐯 be as assumed in the theorem. Let 𝑧𝑖 be one of the finitely many
isolated zeros 𝑧1 , … , 𝑧𝑚 of 𝐯. Then we can choose a local parametrization 𝜙𝑖 ∶ 𝑈𝑖 → 𝑋 around
𝑧. Let 𝜆𝑖 ∶ 𝑈𝑖 → [0, 1] smooth bump function and 𝑦𝑖 ∈ ℝ𝑘 be a regular value of 𝜙∗𝑖 𝐯 as in the
first case above. Then we can form the new vector field 𝐰𝑖 on 𝑋 by setting
Now we can check as above that 𝐰𝑖 has finitely many zeros and the zeros of 𝐰𝑖 which lie in the
image of 𝜙𝑖 are all nondegenerate. Moreover, the index sums of 𝐯 and 𝐰𝑖 are equal. Choosing
the local parametrizations around all the 𝑧𝑖 small enough so that they do not overlap, we can
perform this process for each zero and define a vector field 𝐰 on 𝑋 by
∑
𝑚
𝐰(𝑥) ∶= 𝐯(𝑥) − 𝜆𝑖 (𝜙−1
𝑖 (𝑥))𝑑𝜙𝜙−1 (𝑥) (𝑦𝑖 ).
𝑖
𝑖=1
Now we can conclude that Theorem 17.33 and Theorem 17.34 together imply Theorem 17.22.
Theorem 17.22 reduces the proof of Theorem 17.16 to finding at least one vector field on a
given manifold 𝑋 with index sum equal to 𝜒(𝑋). The idea is to find a suitable Morse function
𝑓 ∶ 𝑋 → ℝ such that the index sum of its gradient field equals the Euler characteristic of 𝑋.
We will now sketch how this can be achieved.
∙ One can show that the index is well-defined, i.e., the number of negative eigenvalues the
Hessian of 𝑓 at 𝑧 is the same for every local coordinate system.
Remark 17.36 (Index of a critical point and bilinear forms) Recall that the index of
a bilinear form 𝐵 on a vector space 𝑉 is defined to be maximal dimension of a vector
subspace of 𝑉 on which 𝐵 is negative definite. The Hessian matrix 𝐻𝑧 (𝑓 ) of a smooth
function 𝑓 ∶ 𝑋 → ℝ at 𝑧, computed in a local coordinate system around 𝑧 and with
respect to a suitable basis, is a symmetric matrix and hence defines a symmetric bilinear
form 𝐻𝑧 (𝑓 ) on 𝑇𝑧 𝑋 by setting 𝑤 → 𝑤𝑇 ⋅ 𝐻𝑧 (𝑓 ) ⋅ 𝑤. It then follows that the index of 𝑧
as a critical point equals the index of the Hessian at 𝑧 as a symmetric bilinear form.
Now we can relate the index of a critical point to the index of a zero as follows:
for all 𝑥 ∈ 𝑋 where 𝑠 is the number of negative eigenvalues of the Hessian 𝐻(𝑓 )0 of 𝑓 at 0.
From Equation 17.9 we deduce that
Writing 𝐯 = 𝐠𝐫𝐚𝐝(𝑓 ) this shows that the induced map 𝐯̄ ∶ 𝕊𝑘−1 → 𝕊𝑘−1 is the composition of
the reflection of each of the first 𝑠 coordinate. Since the reflection of an individual coordinate
has degree −1, this implies that 𝐯̄ has degree (−1)𝑠 .
Proof: Let 𝑧 be a zero of 𝐠𝐫𝐚𝐝(𝑓 ) and let 𝜆 be its index as a critical point. By Lemma 17.37,
𝑧 contributes with (−1)𝜆 to the index sum of 𝐠𝐫𝐚𝐝(𝑓 ). Hence, if there are 𝑐𝜆 many such zeros,
they contribute with (−1)𝜆 𝑐𝜆 to the index sum. Since the zeros of 𝐠𝐫𝐚𝐝(𝑓 ) are the critical points
∑
of 𝑓 , this shows that the sum 𝜆 (−1)𝜆 𝑐𝜆 is exactly the index sum of 𝑓 .
By Theorem 7.13 every smooth manifold admits a Morse function. Hence we can always
∑
find a Morse function and we know that the corresponding sum 𝜆 (−1)𝜆 𝑐𝜆 is independent of
our choice of function. It remains to relate this sum to the Euler characteristic, i.e., it remains
∑
to show 𝜆 (−1)𝜆 𝑐𝜆 = 𝜒(𝑋). The details of this identification are beyond the scope of these
notes. The idea, however, is based on the discussion of homotopy types at the beginning of
Section 7.2 where we sketched how a Morse function on 𝑋 helps us finding a cell complex
which is homotopy equivalent to 𝑋.
We now hint briefly at the general procedure and refer to [12, Part I] for the details. Let 𝑋
be a compact smooth 𝑘-dimensional manifold and let 𝑓 ∶ 𝑋 → ℝ be a Morse function. For
𝑎 ∈ ℝ, set
𝑋 𝑎 ∶= 𝑓 −1 ((−∞, 𝑎]) = {𝑥 ∈ 𝑋 ∶ 𝑓 (𝑥) ≤ 𝑎}.
Now let 𝑎1 < ⋯ < 𝑎𝑛 be real numbers such that 𝑋 𝑎𝑖 contains exactly 𝑖 critical points, and
𝑋 𝑎𝑛 = 𝑋. We then have ⊂ 𝑋 𝑎1 ⊂ … ⊂ 𝑋 𝑎𝑛 = 𝑋. Let 𝔹̄ 𝜆 denote the unit ball in ℝ𝜆 . We think
of 𝔹̄ 𝜆 as a 𝜆-dimensional cell. In 𝑋 𝑎𝑖 there is exactly one critical point which is not contained
in 𝑋 𝑎𝑖−1 . Let 𝜆𝑖 denote the index of this critical point. We then get
where the first isomorphism can be deduced from the idea we sketched at beginning of Sec-
tion 7.2 and the second follows from excision.
Now let 𝑏𝜆 (𝑋 𝑎𝑖 , 𝑋 𝑎𝑖−1 ) denote the relative Betti number, i.e., the rank of the abelian group
𝐻𝜆 (𝑋 𝑎𝑖 , 𝑋 𝑎𝑖−1 ; ℤ). We then get
∑
𝑘
𝜒(𝑋 𝑎𝑖 , 𝑋 𝑎𝑖−1 ) ∶= 𝑏𝜆 (𝑋 𝑎𝑖 , 𝑋 𝑎𝑖−1 ) = 𝑐𝜆
𝑖=1
where 𝑐𝜆 denotes the number of critical points of 𝑓 of index 𝜆. Since the Euler characteristic is
additive (see [12, Part I, §5]), this shows
∑
𝑘
∑
𝑘
𝜒(𝑋) = 𝜒(𝑋 𝑎𝑖 , 𝑋 𝑎𝑖−1 ) = 𝑐0 − 𝑐1 + 𝑐2 − + ⋯ ± 𝑐𝑘 = (−1)𝜆 𝑐𝜆 .
𝑖=1 𝑖=1
Together with Theorem 17.38 this shows that the index sum of the gradient field equals the
Euler characteristic and finishes the proof of Theorem 17.16.
Chapter 17. Vector Fields and the Poincaré–Hopf Index Theorem 355
17.7 Existence of vector fields with no zeros
Given a compact oriented smooth manifold 𝑋, it is a natural question whether we can find a
nowhere vanishing vector field 𝐯 on 𝑋. Theorem 17.16 provides a necessary condition: if 𝐯
has no zeros, then the index sum is zero and hence the Euler characteristic of 𝑋 must be zero
too. Hopf showed in [8] that this is also a sufficient condition:
∙ The proof is based on the ideas we developed for the proof of the Hopf Degree Theo-
rem 16.1. It shows once more the power of Brouwer degree as an invariant.
Since smooth manifolds of odd dimension have Euler characteristic zero by Corollary 17.19,
we get the following consequence.
Corollary 17.40 (Odd dimensional manifolds have nowhere vanishing vector fields)
Let 𝑋 be a compact smooth manifold without boundary. If the dimension 𝑛 of 𝑋 is odd,
then 𝑋 possesses a nowhere vanishing vector field.
Proof of Theorem 17.39: First we assume that 𝑋 = ℝ𝑘 and that we have a vector field 𝐯
on 𝑋 with only finitely many zeros. Then the vector field is just a smooth map 𝐯 ∶ ℝ𝑘 → ℝ𝑘 .
Since 𝐯 has only finitely many zeros, there is a closed ball 𝔹̄ = 𝔹̄ 𝑘𝑟 containing all the zeros of 𝐯 in
its interior. Using the technique of the proof of Theorem 17.34 we can assume that the zeros of
𝐯 are nondegenerate. Moreover, we can assume that the origin in ℝ𝑘 is not a zero of 𝐯. Since
there are no zeros of 𝐯 on the boundary of 𝔹,̄ 𝐯 points either inward or outward at all boundary
points of 𝔹.̄ Since the index sum of 𝐯 is zero, we can replace 𝐯 with −𝐯 if 𝐯 points inwards.
Hence we can assume that 𝐯 points outward at every boundary point of 𝔹. ̄ Since 𝔹̄ is compact,
we can apply Lemma 17.25 and obtain that the index sum of 𝐯 equals the degree of the Gauss
map 𝑔 ∶ 𝜕 𝔹̄ = 𝕊𝑘−1
𝑟 → 𝕊𝑘−1 . By assumption, the index sum of 𝐯 is zero, and hence the degree
of 𝑔 is zero. Since 𝑔 and 𝐯̄ = 𝐯∕|𝐯| are homotopic, this show that 𝐯̄ has degree zero. Hence
the winding number of 𝐯|𝔹̄ around the origin is zero. Thus, 𝜕𝐯|𝔹̄ = 𝐯|𝜕𝔹̄ ∶ 𝜕 𝔹̄ → ℝ𝑘 ⧵ {0} is
homotopic to a constant map by Theorem 16.10. Hence
𝐯|ℝ𝑘 ⧵𝔹 ∶ ℝ𝑘 ⧵ 𝔹 → ℝ𝑘 ⧵ {0}
is a map to which we can apply Lemma 16.8. This implies that 𝐯|ℝ𝑘 ⧵𝔹 extends to a smooth map
𝐰 ∶ ℝ𝑘 → ℝ𝑘 ⧵ {0}
with 𝐰 = 𝐯 outside the compact space 𝔹. ̄ Hence we have constructed a vector field 𝐰 on ℝ𝑘
without zeros which equals 𝐯 outside a compact subset.
Solution (Exercise 2.1) (a) We have remarked in the main text that 𝑓 is smooth,
since each component of 𝑓 is a polynomial. To get some more exercise, we could
calculate all partial derivatives (in all degrees) and check that they exist and are
continuous. So let us calculate the partial derivatives. We denote the two compo-
nents of 𝑓 by 𝑓1 (𝑥, 𝑦) = 𝑥2 − 𝑦2 and 𝑓2 (𝑥, 𝑦) = 2𝑥𝑦. The first partial derivatives
are
𝜕𝑓1 𝜕𝑓1 𝜕𝑓2 𝜕𝑓2
= 2𝑥, = −2𝑦, = 2𝑦, = 2𝑥.
𝜕𝑥 𝜕𝑦 𝜕𝑥 𝜕𝑦
All these functions are differentiable. Hence we can calculate the second deriva-
tives:
𝜕 2 𝑓1 𝜕 2 𝑓1 𝜕 2 𝑓1 𝜕 2 𝑓1
= 2, = 0, = 0, = −2,
𝜕𝑥𝜕𝑥 𝜕𝑥𝜕𝑦 𝜕𝑦𝜕𝑥 𝜕𝑦𝜕𝑦
𝜕 2 𝑓2 𝜕 2 𝑓2 𝜕 2 𝑓2 𝜕 2 𝑓2
= 0, = 2, = 2, = 0.
𝜕𝑥𝜕𝑥 𝜕𝑥𝜕𝑦 𝜕𝑦𝜕𝑥 𝜕𝑦𝜕𝑦
The second derivatives are again all differentiable and we see that the next deriva-
tives will all vanish. This implies that all further partial derivatives vanish and
therefor are differentiable. This shows that 𝑓 is smooth.
The determinant of the Jacobian at 𝑝 = (𝑥, 𝑦) is 4𝑥2 + 4𝑦2 . This is a positive real
number for every 𝑝 in 𝑈 = ℝ2 ⧵{(0, 0)}. This implies that the Jacobian is invertible
at every point in 𝑈 .
(c) Even though 𝑓|𝑈 has an invertible derivative at every point, it is not invertible itself.
For 𝑓|𝑈 is not injective. For example, 𝑓 (1, 1) = (0, 2) = 𝑓 (−1, −1).
357
358 A.2. Smooth manifolds
Solution (Exercise 2.2) (a) The assertion is true if 𝑋, 𝑌 and 𝑍 are open subsets
of ℝ𝑁 , ℝ𝑀 , ℝ𝐿 , respectively. For in this case this is just the Chain Rule from
Calculus. Now let 𝑥 ∈ 𝑋 and 𝑦 = 𝑓 (𝑥). Since 𝑔 is smooth at 𝑦, there is an
open subset 𝑉 ⊂ ℝ𝑀 with 𝑦 ∈ 𝑉 and a smooth map 𝐺 ∶ 𝑉 → ℝ𝐿 such that
𝐺|𝑉 ∩𝑌 = 𝑔|𝑉 ∩𝑌 . Since 𝑓 is smooth at 𝑥, there is an open subset 𝑈 ⊂ ℝ𝑁 with
𝑥 ∈ 𝑈 and a smooth map 𝐹 ∶ 𝑈 → ℝ𝑀 such that 𝐹|𝑈 ∩𝑋 = 𝑓|𝑈 ∩𝑋 . After replacing
𝑈 with 𝑈 ∩ 𝑓 −1 (𝑉 ) if necessary (to make sure that the image of 𝐹 lies in 𝑉 ), we
have
𝐺◦𝐹 = 𝑔◦𝑓 on 𝑋 ∩ 𝑈 , since 𝑓 (𝑋 ∩ 𝑈 ) ⊂ 𝑌 ∩ 𝑉 .
By the Chain Rule, we know that 𝐺◦𝐹 ∶ 𝑈 → ℝ𝐿 is smooth, since 𝑈 is open in
ℝ𝑁 . This shows that 𝑔◦𝑓 is smooth.
(b) By assumption 𝑓 and 𝑔 are smooth and have smooth inverses 𝑓 −1 and 𝑔 −1 respec-
tively. Since 𝑓 and 𝑔 are bijective, so is 𝑔◦𝑓 . By the previous point, both 𝑔◦𝑓 and
(𝑔◦𝑓 )−1 = 𝑓 −1 ◦𝑔 −1 are smooth.
+|𝑦|2
=𝑦
and
𝑟 √ 2𝑟𝑥 2 √
𝑟𝑓 (𝑥) 𝑟 −|𝑥| 𝑟2 𝑥 𝑟2 − |𝑥|2
𝑔(𝑓 (𝑥)) = √ =√ =√ ⋅√
𝑟2 + 𝑟2|𝑟𝑥| 𝑟2 − |𝑥|2 𝑟2 (𝑟2 − |𝑥|2 + |𝑥|2 )
2
𝑟2 + 𝑓 (𝑥)|2
−|𝑥|2
= 𝑥.
Both 𝑓 and 𝑔 are smooth, since they are the composite of several smooth maps.
Hence the previous exercise shows that they are both smooth.
(b) Let 𝑥 ∈ 𝑋. By definition of a smooth manifold, there is an open subset 𝑉 ⊂ 𝑋
with 𝑥 ∈ 𝑉 , an open subset 𝑈 ⊂ ℝ𝑘 and a diffeomorphism 𝜙 ∶ 𝑈 → 𝑉 . Let
𝑢 ∈ 𝑈 be the inverse 𝜙−1 (𝑥). If 𝑢 is not the origin in ℝ𝑘 , we compose 𝜙 with the
translation 𝑇𝑢 ∶ ℝ𝑘 → ℝ𝑘 defined by 𝑇𝑢 (𝑦) = 𝑦 + 𝑢 which satisfies 𝑇𝑢 (0) = 𝑢. Note
that 𝑇𝑢 is a diffeomorphism, since it is invertible and both 𝑇𝑢 and its inverse 𝑇−𝑢
have the identity matrix as their Jacobian matrix at any point. Hence all higher
partial derivatives vanish and exist.
Thus after composing 𝜙 with 𝑇𝑢 , we can assume 𝜙(0) = 𝑥. Now it suffices to
choose a small enough radius 𝑟 such that 𝜙(𝐵𝑟𝑘 (0)) ⊂ 𝑉 . Then 𝜙|𝐵𝑘 (0) ∶ 𝐵𝑟𝑘 (0) →
𝑟
𝜙(𝐵𝑟𝑘 (0)) ⊂ 𝑋 is the desired local parametrization.
Appendix A. Solutions to exercises 359
(c) By the previous point, for every 𝑥 ∈ 𝑋 there is a diffeomorphism 𝜙 ∶ 𝐵𝑟𝑘 (0) → 𝑉
for some open subset 𝑉 ⊂ 𝑋 with 𝑥 ∈ 𝑉 . Now it suffices to precompose 𝜙 with
the diffeomorphism 𝑔 ∶ ℝ𝑘 → 𝐵𝑟𝑘 (0) of the first point in this exercise.
Solution (Exercise 2.4) Every linear map ℝ𝑛 → ℝ𝑚 is smooth, and the derivative
is equal to the map itself. Hence, given a 𝑘-dimensional vector subspace 𝑉 of ℝ𝑁 , it
suffices to choose a basis in 𝑉 to get a linear isomorphism 𝜙 ∶ 𝑉 → ℝ𝑘 . This map serves
as a parametrization, since it is a diffeomorphism.
Now given a linear map 𝑓 ∶ 𝑉 → ℝ𝑚 , the composite 𝜙◦𝑓 ∶ ℝ𝑘 → ℝ𝑚 is linear and
therefore smooth. Since 𝜙 is a diffeomorphism, this implies that 𝑓 must be smooth too.
Solution (Exercise 2.5) (a) Let 𝑎 > 0 be a real number. We want to show that the
subset
𝐻𝑎 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥2 + 𝑦2 − 𝑧2 = 𝑎} ⊂ ℝ3
is a 2-dimensional manifold. Hence we need to find local parametrizations. First,
there is a diffeomorphism
( √ )
𝜙 ∶ ℝ2 ⧵ 𝐵𝑎 ((0, 0)) → 𝐻 ∩ {𝑧 > 0}, (𝑥, 𝑦) → 𝑥, 𝑦, 𝑥2 + 𝑦2 − 𝑎
where 𝐵𝑎 ((0, 0)) denotes the closed ball of radius 𝑎 around the origin, i.e.,
Note that 𝐻 ∩{𝑧 > 0} is an open subset of 𝐻, because it is equal to the intersection
of 𝐻 with the open subset {𝑧 > 0} ⊂ ℝ3 . The inverse to 𝜙 is the projection map
𝜙−1 ∶ 𝐻 ∩ {𝑧 > 0} → ℝ2 , (𝑥, 𝑦, 𝑧) → (𝑥, 𝑦).
This map is smooth, since it can be extended to a smooth map on the whole of
ℝ3 to ℝ2 . We know that 𝜙 is smooth, since its component functions are infinitely
often differentiable functions in each variable on the open subset ℝ2 ⧵ 𝐵𝑎 ((0, 0)).
We can for example calculate the Jacobian matrix in the standard basis at a point
( )
3 2 𝜕𝜙1 ∕𝜕𝑥 𝜕𝜙2 ∕𝜕𝑥 𝜕𝜙3 ∕𝜕𝑥
𝑑𝜙𝑥 ∶ ℝ → ℝ ,
𝜕𝜙1 ∕𝜕𝑦 𝜕𝜙2 ∕𝜕𝑦 𝜕𝜙3 ∕𝜕𝑦
( )
1 0 𝑥(𝑥2 + 𝑦2 − 𝑎)−1∕2
= .
0 1 𝑦(𝑥2 + 𝑦2 − 𝑎)−1∕2
On the open set ℝ2 ⧵ 𝐵𝑎 ((0, 0)), the entries of this matrix are continuously differ-
entiable functions.
The local parametrization for 𝐻 ∩ {𝑧 < 0} is similarly given by
( √ )
𝜙 ∶ ℝ2 ⧵ 𝐵𝑎 ((0, 0)) → 𝐻 ∩ {𝑧 < 0}, (𝑥, 𝑦) → 𝑥, 𝑦, − 𝑥2 + 𝑦2 − 𝑎 .
It remains to cover the points in 𝐻 ∩ {𝑧 = 0}. We are going to cover those points
by the following four open sets together with local parametrizations:
( √ )
𝐵√𝑎 ((0, 0)) → 𝐻 ∩ {𝑥2 + 𝑧2 < 𝑎}, (𝑥, 𝑧) → 𝑥, 𝑧2 − 𝑥2 + 𝑎, 𝑧
360 A.2. Smooth manifolds
√ ( )
𝐵√𝑎 ((0, 0)) → 𝐻 ∩ {𝑥2 + 𝑧2 < 𝑎}, (𝑥, 𝑧) → 𝑥, − 𝑧2 − 𝑥2 + 𝑎, 𝑧
(√ )
𝐵√𝑎 ((0, 0)) → 𝐻 ∩ {𝑦2 + 𝑧2 < 𝑎}, (𝑦, 𝑧) → 𝑧2 − 𝑦2 + 𝑎, 𝑦, 𝑧
( √ )
𝐵√𝑎 ((0, 0)) → 𝐻 ∩ {𝑦2 + 𝑧2 < 𝑎}, (𝑦, 𝑧) → − 𝑧2 − 𝑦2 + 𝑎, 𝑦, 𝑧 .
𝐻0 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 ∶ 𝑥2 + 𝑦2 − 𝑧2 = 0} ⊂ ℝ3
Solution (Exercise 2.6) For 0 < 𝑏 < 𝑎, let 𝑇 (𝑎, 𝑏) denote the set of points in ℝ3 at
distance 𝑏 from the circle of radius 𝑎 in the 𝑥𝑦-plane. We can parametrize these points
as follows: First the points in the 𝑥𝑦-plane which lie on the circle of radius 𝑎 satisfy
A point in the plane in the direction of a fixed point (𝑎 cos 𝑡, 𝑎 sin 𝑡, 0) which lies on the
circle of radius 𝑏 around the point (𝑎 cos 𝑡, 𝑎 sin 𝑡, 0) has coordinates (𝑎+𝑏 cos 𝑠) cos 𝑡, (𝑎+
𝑏 cos 𝑠) sin 𝑡, 𝑏 sin 𝑠) with 𝑠 ∈ [0, 2𝜋). For we have
𝜙 ∶ 𝑆 1 × 𝑆 1 → 𝑇 (𝑎, 𝑏),
(cos 𝑡, sin 𝑡, cos 𝑠, sin 𝑠) → ((𝑎 + 𝑏 cos 𝑠) cos 𝑡, (𝑎 + 𝑏 cos 𝑠) sin 𝑡, 𝑏 sin 𝑠) .
Appendix A. Solutions to exercises 361
⎛𝑎 + 𝑏𝑥3 0 𝑏𝑥1 0⎞
4 3
𝑑𝜙𝑝 ∶ ℝ → ℝ , 𝑑𝜙𝑝 = ⎜ 0 𝑎 + 𝑏𝑥3 𝑏𝑥2 0⎟ .
⎜ ⎟
⎝ 0 0 0 𝑏⎠
To check the other equation, we recall that 𝑦1 , 𝑦2 and 𝑦3 are connected by the condition
of being on 𝑇 (𝑎, 𝑏) which means
2 2
⎛ ⎞ ⎛ ⎞
⎜ 𝑎𝑦1 ⎟ ⎜ 𝑎𝑦2 ⎟ 2 2
⎜𝑦1 − √ ⎟ + ⎜𝑦2 − √ ⎟ + 𝑦3 = 𝑏
⎜ 𝑦21 + 𝑦22 ⎟ ⎜ 𝑦21 + 𝑦22 ⎟
⎝ ⎠ ⎝ ⎠
(√ )2
⇐⇒ 𝑦21 + 𝑦22 − 𝑎 + 𝑦23 = 𝑏2
√
⇐⇒ 𝑦21 + 𝑦22 + 𝑦23 + 𝑎2 − 2𝑎 𝑦21 + 𝑦22 = 𝑏2 .
⎛ 𝑦22 𝑦 𝑦 ⎞
⎜ (𝑦21 +𝑦22 )3∕2 − (𝑦2 +𝑦1 22)3∕2 0 ⎟
1 2
⎜ 𝑦1 𝑦2 𝑦21 ⎟
𝑑𝜓𝑞 ∶ ℝ3 → ℝ4 , 𝑑𝜓𝑞 = ⎜− (𝑦21 +𝑦22 )3∕2 (𝑦1 +𝑦22 )3∕2
2 0 ⎟.
⎜ 𝑦1 𝑦2 ⎟
⎜ 𝑏(𝑦2 +𝑦2 )1∕2 𝑏(𝑦21 +𝑦22 )1∕2
0 ⎟
⎜ 1
0
2
0 1∕𝑏⎟⎠
⎝
Since 𝑏 < 𝑎 we know 𝑦21 + 𝑦22 ≠ 0 and all partial derivatives are continuous smooth
functions. Hence 𝜓 is smooth. This proves that 𝜙 is a global diffeomorphism
Solution (Exercise 2.7) (a) Let 𝑁 = (0, … , 0, 1) ∈ 𝑆 𝑘 be the north pole on the
𝑘-dimensional sphere. The stereographic projection 𝜙−1𝑁
from 𝑆 𝑘 ⧵ {𝑁} onto ℝ𝑘
is the map which sends a point 𝑝 to the point at which the line through 𝑁 and 𝑝
intersects the subspace in ℝ𝑘+1 defined by 𝑥𝑘+1 = 0. In order to get from 𝑁 to 𝑝,
we walk in the direction of the vector
To find 𝜙−1
𝑁
(𝑝) we need to find the real number 𝜆 such that the (𝑘 + 1)-st coordinate
of 𝑁 + 𝜆 ⋅ 𝑣 is 0. Hence we need to solve
1
1 + 𝜆(𝑥𝑘+1 − 1) = 0 ⇐⇒ 𝜆 = .
1 − 𝑥𝑘+1
Thus
1
𝜙−1
𝑁 (𝑥1 , … , 𝑥𝑘+1 ) = (𝑥 , … , 𝑥𝑘 ).
1 − 𝑥𝑘+1 1
(b) We calculate the inverse 𝜙𝑁 : Given the point 𝑥 = (𝑥1 , … , 𝑥𝑘 ) ∈ ℝ𝑘 . To find its
image under 𝜙𝑁 , we walk from 𝑁 in the direction of the vector 𝑤 = 𝑥 − 𝑁 until
we reach the sphere, i.e. we need to find 𝜆 ∈ ℝ such that
𝑁 + 𝜆 ⋅ 𝑤 = (𝜆𝑥1 , … , 𝜆𝑥𝑘 , 1 − 𝜆)
Appendix A. Solutions to exercises 363
⎛ 1 0 … 0
𝑥1 ⎞
⎜ 1−𝑥𝑘+1 1
(1−𝑥𝑘+1 )2 ⎟
𝑥2
⎜ 0 … 0 ⎟
𝑑(𝜙−1
𝑁 )𝑥 = ⎜
1−𝑥𝑘+1 (1−𝑥𝑘+1 )2 .
⎟
⎜ ⋮ ⋮ … ⋮ ⋮ ⎟
⎜ 0 0 … 1 𝑥𝑘 ⎟
⎝ 1−𝑥𝑘+1 (1−𝑥𝑘+1 )2 ⎠
(c) The formulae for the projection from the south pole 𝑆 = (0, … , 0, −1) ∈ 𝕊𝑘 are
similar:
1
𝜙−1
𝑆 (𝑥1 , … , 𝑥𝑘+1 ) = 1 + 𝑥 (𝑥1 , … , 𝑥𝑘 ).
𝑘+1
and, while we get the same 𝜆,
1 ( 2
)
𝜙𝑆 (𝑥) = 𝑆 + 𝜆(𝑥 − 𝑆) = 2𝑥1 , … , 2𝑥𝑘 , 1 − |𝑥| .
1 + |𝑥|2
To check that both 𝜙𝑆 and 𝜙−1 𝑆
are both smooth is completely analogous. Since
all points are covered by these two parametrizations, 𝕊𝑘 is a smooth manifold. We
note that the formula for the derivative of 𝜙−1
𝑆
is given by
⎛ 1 0 … 0
−𝑥1 ⎞
⎜ 1+𝑥𝑘+1 1
(1+𝑥𝑘+1 )2 ⎟
−𝑥2
⎜ 0 … 0 ⎟
𝑑(𝜙−1
𝑆 )𝑥 = ⎜
1+𝑥𝑘+1 (1+𝑥𝑘+1 )2 .
⎟
⎜ ⋮ ⋮ … ⋮ ⋮ ⎟
⎜ 0 0 … 1 −𝑥𝑘 ⎟
⎝ 1+𝑥𝑘+1 (1+𝑥𝑘+1 )2 ⎠
364 A.2. Smooth manifolds
( )2
(2𝑧0 𝑧̄ 1 ) ⋅ (2𝑧̄ 0 𝑧1 ) + |𝑧0 |2 − |𝑧1 |2
= 4|𝑧0 |2 |𝑧1 |2 + |𝑧0 |4 − 2|𝑧0 |2 |𝑧1 |2 + |𝑧1 |4
( )2
= |𝑧0 |2 + |𝑧1 |2
=1
We have |𝑤0 |2 − |𝑤1 |2 = |𝑧0 |2 − |𝑧1 |2 and |𝑤0 |2 + |𝑤1 |2 = 1 = |𝑧0 |2 + |𝑧1 |2 .
Putting these together implies
Remembering that neither of the numbers can be zero, by rewriting these equations
we get
𝑤0 𝑧̄ 𝑤 𝑧̄
= 0 and 1 = 1 .
𝑧0 𝑤̄ 0 𝑧1 𝑤̄ 1
Hence, looking at the left-hand side, there is a complex number 𝛼 ≠ 0 such that
𝑤0 𝑧̄ 1
𝛼= = 0 = , and thus 𝑤0 = 𝛼𝑧0 with 𝛼 𝛼̄ = 1.
𝑧0 𝑤̄ 0 𝛼̄
𝑤0 𝑧̄ 1
On the other hand, we also know 𝑧0
= 𝑤̄ 1
. Combining these equations yields
𝑤0 𝑧̄ 𝑤
𝛼= = 1 = 1 , and thus 𝑤1 = 𝛼𝑧1 .
𝑧0 𝑤̄ 1 𝑧1
(c) Let 𝑝 ∈ 𝕊2 be a fixed point and fix a point (𝑧0 , 𝑧1 ) in 𝕊3 with 𝜋(𝑧0 , 𝑧1 ) = 𝑝. By the
previous point, we have that the points in 𝜋 −1 (𝑝) are parametrized by the complex
Appendix A. Solutions to exercises 365
Solution (Exercise 2.9) We can choose any basis of 𝑉 to define a linear isomorphism
𝜙 ∶ ℝ𝑘 → 𝑉 which is a diffeomorphism. Given a point 𝑥 ∈ 𝑉 , we modify 𝜙 by adding
𝑥 and get a new diffeomorphism (not linear anymore!)
𝜓 ∶ ℝ𝑘 → 𝑉 , 𝑤 → 𝜙(𝑤) + 𝑥.
Solution (Exercise 2.10) We only answer the question about the tangent space of
𝕋 (𝑎, 𝑏) in ℝ3 . For all points apart from (𝑎 + 𝑏, 0, 0) we can parametrize 𝕋 (𝑎, 𝑏) ⊂ ℝ3 by
and
In order to cover also the point (𝑎 + 𝑏, 0, 0) it suffices to rotate our parametrization by the
angle 𝜋 in the 𝑥𝑦−plane and use the diffeomorphism
√
Solution (Exercise 2.11) Around the point ( 𝑎, 0, 0) on 𝐻𝑎 we can choose the local
parametrization
(√ )
𝜙 ∶ 𝐵√𝑎 ((0, 0)) → 𝐻 ∩ {𝑦2 − 𝑧2 < 𝑎}, (𝑦, 𝑧) → 𝑧2 − 𝑦2 + 𝑎, 𝑦, 𝑧 .
The derivative in the standard basis at a point (𝑦, 𝑧) is the linear map
⎛− √ 𝑦 √
𝑧 ⎞
⎜ 𝑧2 −𝑦2 +𝑎 𝑧2 −𝑦2 +𝑎 ⎟
𝑑𝜙(𝑦,𝑧) ∶ ℝ2 → ℝ3 , 𝑑𝜙(𝑦,𝑧) =⎜ 1 0 ⎟.
⎜ 0 1 ⎟
⎝ ⎠
Hence at a point (𝑢, 𝑣, 𝑤) ∈ 𝐻𝑎 the image of the standard basis of ℝ2 is
⎛−𝑣∕𝑢⎞ ⎛𝑤∕𝑢⎞
⎜ 1 ⎟ and ⎜ 0 ⎟
⎜ ⎟ ⎜ ⎟
⎝ 0 ⎠ ⎝ 1 ⎠
√
where we write 𝑢 = 𝑤2 − 𝑣2 + 𝑎). Hence the tangent space at 𝑇(𝑢,𝑣,𝑤) (𝐻𝑎 ) is spanned
by these two vectors.
√
For (𝑢, 𝑣, 𝑤) = ( 𝑎, 0, 0) we get that 𝑇(√𝑎,0,0) (𝐻𝑎 ) is simply spanned by
⎛0⎞ ⎛0⎞
⎜1⎟ and ⎜0⎟ .
⎜ ⎟ ⎜ ⎟
⎝0⎠ ⎝1⎠
Appendix A. Solutions to exercises 367
Solution (Exercise 2.12) (a) The inverse map to 𝐹 is the projection 𝜋 onto the
first factor, for we obviously have 𝜋◦𝐹 = Id𝑋 and 𝐹 ◦𝜋 = IdΓ(𝑓 ) . Let 𝑋 ⊂ ℝ𝑁 ,
𝑌 ⊂ ℝ𝑀 , and let 𝑓 be smooth. Then for any point 𝑥 ∈ 𝑋, there is an open
subset 𝑈 ⊂ ℝ𝑁 and a smooth map 𝑓̃ ∶ 𝑈 → ℝ𝑀 with 𝑓̃𝑋∩𝑈 = 𝑓𝑋∩𝑈 . Then
(𝑓̃ × Id) ∶ 𝑈 → ℝ𝑁 × ℝ𝑀 is a smooth extension of 𝐹𝑋∩𝑈 . Hence 𝐹 is smooth.
The inverse map 𝜋 is a smooth map, since it extends to the smooth projection
on all of ℝ𝑁 × ℝ𝑀 . Hence 𝐹 is a diffeomorphism when 𝑓 is smooth. Hence
any local parametrization 𝜙 ∶ 𝑉 → 𝑋 can be extended to a local parametrization
𝐹 ◦𝜙 ∶ 𝑉 → Γ(𝑓 ). Thus the graph Γ(𝑓 ) is a manifold if 𝑋 is.
𝑋O
𝐹 /𝑋×𝑌
O
𝜙 𝜙×𝜓
𝑈 /𝑈 ×𝑊
𝐺=𝜓 −1 ◦𝐹 ◦𝜙
where 𝐺 is the map defined by 𝑣 → (𝑣, 𝜓 −1 (𝑓 (𝜙(𝑣))). Thus 𝐺 is the map Id𝑉 ×
(𝜓 −1 ◦𝑓 𝜙). Hence 𝑑𝐺0 ∶ ℝ𝑘 → ℝ𝑘 × ℝ𝑙 is the linear map
Thus in the commutative diagram below, 𝑑𝐹𝑥 has to be defined as Id𝑇𝑥 (𝑋) × 𝑑𝑓𝑥 :
ℝ𝑘 / ℝ𝑘 × ℝ𝑙 .
𝑑𝐺0 =Idℝ𝑘 ×(𝑑𝜓𝑓−1(𝑥) ◦𝑑𝑓𝑥 ◦𝑑𝜙0 )
𝑇(𝑥,𝑓 (𝑥)) (Γ(𝑓 )) = 𝑑𝜓0 (ℝ𝑘 ) = 𝑑𝐹𝑥 (𝑑𝜙0 (ℝ𝑘 )) = 𝑑𝐹𝑥 (𝑇𝑥 (𝑋)).
368 A.2. Smooth manifolds
Finally, by the previous point, we know 𝑑𝐹𝑥 = Id𝑇𝑥 (𝑋) × 𝑑𝑓𝑥 and get
𝑇(𝑥,𝑓 (𝑥)) (Γ(𝑓 )) = (Id𝑇𝑥 (𝑋) × 𝑑𝑓𝑥 )(𝑇𝑥 (𝑋)) = Γ(𝑑𝑓𝑥 ) ⊂ 𝑇𝑥 (𝑋) × 𝑇𝑓 (𝑥) (𝑌 )
Thus
Solution (Exercise 3.1) The derivative of 𝑓 at 𝑥 ∈ ℝ𝑛 is just given by the linear map
𝑑𝑓𝑥 ∶ ℝ𝑛 → ℝ𝑛 , 𝑣 → 𝐴 ⋅ 𝑣. Hence 𝑑𝑓𝑥 is an isomorphism if and only if 𝐴 is invertible.
So 𝑓 is a local diffeomorphism if and only if 𝐴 is invertible. Now 𝑓 is a diffeomorphism
if and only if the map 𝑥 → 𝑓 (𝑥) − 𝑏 = 𝐴𝑥 is a diffeomorphism which is the case if and
only if 𝐴 is invertible. For, if this is the case then 𝑦 → 𝐴−1 𝑦 + 𝐴−1 𝑏 is the inverse of 𝑓 .
Hence 𝑓 is a diffeomorphism if and only if 𝐴 is invertible.
this shows 𝑡 = 𝑠.
For each 𝑡, 𝑑𝑓𝑡 is a linear map ℝ → ℝ2 . Since Ker (𝑑𝑓𝑡 ) is a vector subspace of
ℝ, it is either {0} or ℝ itself. Since 𝑑𝑓𝑡 is not the zero matrix for any 𝑡, 𝑑𝑓𝑡 must
be injective for all 𝑡 ∈ ℝ.
Solution (Exercise 3.4) The map 𝑓 is not an embedding, since it is not injective. But
we can check it is an immersion by showing that the derivative is injective everywhere.
370 A.3. The Inverse Function Theorem, immersions and embeddings
The derivative of 𝑓 at (𝑠, 𝑡) is
In order to show that 𝑑𝑓(𝑠,𝑡) is injective, we need to check that it has full or maximal
rank, i.e. rank 2. Hence we need to check that the two column vectors are always linearly
independent. To simplify notation, we set 𝑥 = sin 𝑠, 𝑦 = cos 𝑠, 𝑢 = sin 𝑡, and 𝑣 = cos 𝑡.
Now assume there are two real numbers 𝜆 and 𝜇 such that
𝜇(2 + 𝑦)𝑢 = 0
𝜇(2 + 𝑦)𝑣 = 0.
But since 𝑢 and 𝑣 are never both 0, we know that the vectors (𝑣, 𝑢) and (𝑢, 𝑣) are linearly
independent. Hence we must have 𝜆 = 0 and 𝜇(2 + 𝑦) = 0. The latter implies 𝜇 = 0,
since 2 + 𝑦 ≠ 0.
Solution (Exercise 3.5) Let 𝑎 and 𝑏 be two relatively prime integers with 𝑎 ≠ 0.
Recall that, for 𝑡1 , 𝑡2 ∈ 𝑅, we have
𝑒2𝜋𝑖𝑡1 = 𝑒2𝜋𝑖𝑡2 ⇐⇒ 𝑡1 − 𝑡2 ∈ ℤ.
Since 𝑎 and 𝑏 are integers, 𝑡1 − 𝑡2 ∈ ℤ implies 𝑎𝑡1 − 𝑎𝑡2 ∈ ℤ and 𝑏𝑡1 − 𝑏𝑡2 ∈ ℤ. Hence
we have
𝑒2𝜋𝑖𝑡1 = 𝑒2𝜋𝑖𝑡2 ⇒ 𝑒2𝜋𝑖𝑎𝑡1 = 𝑒2𝜋𝑖𝑎𝑡2 and 𝑒2𝜋𝑖𝑏𝑡1 = 𝑒2𝜋𝑖𝑏𝑡2 .
This shows that we have a well-defined map
𝑔𝑎,𝑏 ∶ 𝕊1 → 𝕊1 × 𝕊1 , 𝑒2𝜋𝑖𝑡 → (𝑒2𝜋𝑖𝑎𝑡 , 𝑒2𝜋𝑖𝑏𝑡 ).
Hence, for 𝑓 ∶ ℝ → 𝕊1 , 𝑡 → 𝑒2𝜋𝑖𝑡 , we get a commutative diagram
𝛾𝑎,𝑏
ℝ / 𝕊1 × 𝕊1
;
𝑓 𝑔𝑎,𝑏
𝕊1 .
Appendix A. Solutions to exercises 371
Solution (Exercise 3.6) (a) The derivative of 𝑓 at 𝑡 is the map 𝑑𝑓𝑡 ∶ ℝ → ℝ2 given
by the Jacobian matrix
where we have used several trigonometric identities. Since the derivative 𝑑𝑓𝑡 is
nontrivial for all 𝑡, it is always injective as a linear map ℝ → ℝ2 . Hence 𝑓 is an
immersion.
(b) But 𝑓 is not a homeomorphism onto Im (𝑓 ). See Figure A.1. For, consider the open
subset (𝜋∕4, 3𝜋∕4) in (0, 3𝜋∕4). If 𝑓 was a homeomorphism, then 𝑓 ((𝜋∕4, 3𝜋∕4))
had to be open in Im (𝑓 ) as well. That means that around any point, for exam-
ple the point 𝑓 (𝜋∕2) = (0, 0), there had to an open neighborhood contained in
𝑓 ((𝜋∕4, 3𝜋∕4)). By the definition of the open sets in Im (𝑓 ) as a subspace of ℝ2 ,
there had to be an open ball 𝐵𝜀 (0, 0) ∈ ℝ2 with
since | sin(2𝑡)(cos 𝑡, sin 𝑡)| < 𝜀 for all 𝑡 < 𝜖∕2 (where we use sin 𝑥 ≤ 𝑥 and
|(cos 𝑡, sin 𝑡)| = 1). Hence 𝑓 cannot be an open map and therefore not a home-
omorphism.
• Is the map ℎ ∶ [0, 3𝜋∕4] → ℝ3 , 𝑡 → (sin(2𝑡) cos 𝑡, sin(2𝑡) sin 𝑡, 2𝑡) an embed-
ding?
Answer: Yes, this time we have a map which is an immersion, it is one-to-
one this time 𝑓 (0) = (0, 0, 0) ≠ (0, 0, 𝜋) = 𝑓 (𝜋∕2), and it is defined on a
compact space and is therefore a proper map.
Figure A.1: The origin is the critical point. We cannot separate the two branches of the graph
with open subsets.
Solution (Exercise 3.7) By the Local Immersion Theorem, we can choose lo-
cal parametrizations 𝜙 ∶ 𝑉 → 𝑍 and 𝜓 ∶ 𝑊 → 𝑋 around 𝑧 with 𝑉 ⊂ ℝ𝑘 and
𝑊 = 𝑉 ⊕ 𝑉 ′ ⊂ ℝ𝑛 such that
inclusion /𝑋
𝑍O O
𝜙 𝜓
canonical / 𝑊 ⊆ ℝ𝑛
ℝ𝑘 ⊇ 𝑉
immersion
commutes. The map 𝜓 is a diffeomorphism onto its image 𝜓(𝑊 ) ⊂ 𝑋. The inverse
map 𝜓 −1 ∶ 𝜓(𝑊 ) → 𝑊 is a local coordinate system on the open neighborhood 𝜓(𝑊 )
around 𝑧 ∈ 𝑋. We write 𝑥𝑖 ∶ 𝜓(𝑊 ) → ℝ for the 𝑖th component of 𝜓 −1 , i.e. a point
𝑝 ∈ 𝜓(𝑊 ) has the local coordinates (𝑥1 (𝑝), … , 𝑥𝑛 (𝑝)) = (𝜓1−1 (𝑝), … , 𝜓𝑛−1 (𝑝)). Since
the above diagram commutes and 𝜙 is a diffeomorphism onto its image, we have
𝜙(𝑉 ) = 𝑍 ∩ 𝜓(𝑊 ).
Hence, since the lower horizontal map is the canonical immersion, the points in 𝑍∩𝜓(𝑊 )
are exactly those on which the coordinate functions 𝑥𝑘+1 , … , 𝑥𝑛 vanish. Relabelling the
open subset 𝜓(𝑊 ) as 𝑈 we have
𝑍 ∩ 𝑈 = {𝑝 ∈ 𝑈 such that 𝑥𝑘+1 (𝑝) = ⋯ = 𝑥𝑛 (𝑝) = 0}.
Appendix A. Solutions to exercises 373
A.4 Submersions and regular values
Solution (Exercise 4.1) We can assume that 𝑈 = 𝑋, since we just need to consider
𝑓
the map 𝑈 ⊆ 𝑋 ←←←→← 𝑌 . Let 𝑦 be any point in 𝑓 (𝑋). We need to show that there is an
open neighborhood 𝑊 around 𝑦 which is contained in 𝑓 (𝑋). Let 𝑥 be a point in 𝑋 with
𝑓 (𝑥) = 𝑦 which exists since 𝑦 ∈ 𝑓 (𝑋). By the Local Submersion Theorem, we can
choose local parametrizations 𝜙 ∶ 𝑉 → 𝑋 around 𝑥 with 𝑉 ⊂ ℝ𝑛 open and 𝜓 ∶ 𝑉 ′ → 𝑌
around 𝑦 with 𝑉 ′ ⊂ ℝ𝑚 open such that the induced map 𝑉 → 𝑉 ′ is the canonical
submersion:
𝑓
𝑋O /𝑌
O
𝜙 𝜓
canonical /𝑉′
𝑉
submersion
Solution (Exercise 4.2) The derivative of 𝑔 at a point (𝑥, 𝑦) is given by the 1×2-matrix
( )
𝑑𝑔(𝑥,𝑦) = 2𝑥 −2𝑦 .
As a linear map from ℝ2 to ℝ, 𝑑𝑔(𝑥,𝑦) is surjective whenever it is not the zero map. Hence
𝑑𝑔(𝑥,𝑦) is surjective for all (𝑥, 𝑦) ≠ (0, 0).
Thus the set of regular values of 𝑔 is the subset ℝ ⧵ {0}. Since 𝑔(0, 0) = 0, the only
critical value is 0.
Since the derivative of 𝑔 is not surjective at all points, 𝑔 is not a submersion.
of the norm of the 𝑖th row vector of 𝐴. Since 𝐴 ∈ 𝑂(𝑛), we have 𝐴𝐴𝑇 = 𝐼 and the 𝑖th
diagonal entry in 𝐴𝐴𝑇 is equal 1. This shows that 𝑂(𝑛) is contained in the product of
∏ ∏ 2
𝑛 spheres 𝕊𝑛−1 in ℝ𝑛 = ℝ𝑛 = 𝑀(𝑛). Hence 𝑂(𝑛) is bounded. But 𝑂(𝑛) is also
2
closed in ℝ𝑛 , since we can define it as the inverse image of the closed point 𝐼 ∈ 𝑆(𝑛)
under the map 𝑀(𝑛) → 𝑆(𝑛) sending 𝐴 to 𝐴𝐴𝑇 . Thus 𝑂(𝑛) is closed and bounded in
2
ℝ𝑛 and therefore compact.
2
Solution (Exercise 4.5) We consider 𝑂(𝑛) as a subspace in 𝑀(𝑛) = ℝ𝑛 . We
defined 𝑂(𝑛) as 𝑓 −1 (𝐼) under the map 𝑓 ∶ 𝑀(𝑛) → 𝑆(𝑛), 𝑓 (𝐴) = 𝐴𝐴𝑇 . We have
checked in the proof of Theorem 4.12 that 𝐼 is a regular value for 𝑓 . As a consequence
of the Preimage Theorem we saw that this implies that 𝑇𝐼 (𝑂(𝑛)) equals the kernel of
𝑑𝑓𝐼 ∶ 𝑀(𝑛) = 𝑇𝐼 (𝑀(𝑛)) → 𝑇𝐼 (𝑆(𝑛)) = 𝑆(𝑛). We calculated the derivative 𝑑𝑓𝐴 for any
𝐴 ∈ 𝑂(𝑛) in the proof of Theorem 4.12: it is given by 𝑑𝑓𝐴 (𝐵) = 𝐵𝐴𝑇 + 𝐴𝐵 𝑇 . For
𝐴 = 𝐼, this gives 𝑑𝑓𝐼 (𝐵) = 𝐵 + 𝐵 𝑇 . Hence the kernel of 𝑑𝑓𝐼 is the space of matrices
satisfying 𝐵 + 𝐵 𝑇 = 0, i.e., 𝐵 𝑇 = −𝐵.
(b) A 2×2-matrix 𝐴 has rank 0 if and only if it is the zero matrix. Thus 𝐴 ∈ 𝑀(2)⧵{0}
has rank 1 if and only if it does not have rank 2, i.e., if and only if it is not invertible.
Hence 𝐴 ∈ 𝑀(2) ⧵ {0} has rank 1 if and only if det 𝐴 = 0. By the previous point,
the determinant function is a submersion 𝑀(2) ⧵ {0} → ℝ. Hence 𝑅1 = det −1 (0)
is a submanifold of dimension 4 − 1 = 3 by the Preimage Theorem.
Appendix A. Solutions to exercises 375
Solution (Exercise 4.7) We consider the function 𝑄 defined in the hint. Since 𝑃 is
homogeneous, we know 𝑄 is always 0. Hence its derivative with respect to 𝑡 is zero as
well. Hence we get
∑
0 = 𝜕𝑄∕𝜕𝑡 = 𝑥𝑖 𝜕𝑃 ∕𝜕𝑥𝑖 (𝑡𝑥1 , … , 𝑡𝑥𝑘 ) − 𝑚𝑡𝑚−1 𝑃 (𝑡𝑥1 , … , 𝑡𝑥𝑘 ) (A.1)
𝑖
where we apply the chain rule to the first summand of 𝑄 which is the composite 𝑡 →
𝑡𝑥 → 𝑃 (𝑡𝑥). Setting 𝑡 = 1 in (A.1) yields Euler’s identity (4.2).
⎛ 𝑧1 ⎞
𝑑𝑃𝑥 ∶ ℝ𝑘 → ℝ, (𝑧1 , … , 𝑧𝑘 ) →(𝜕𝑃 ∕𝜕𝑥1 (𝑥) … 𝜕𝑃 ∕𝜕𝑥𝑘 (𝑥)) ⋅ ⎜ ⋮ ⎟
⎜ ⎟
⎝𝑧𝑘 ⎠
∑
= 𝑧𝑖 𝜕𝑃 ∕𝜕𝑥𝑖 (𝑥).
𝑖
To show that 𝑑𝑃𝑥 is nonsingular, i.e. surjective, it suffices to show that 𝑑𝑃𝑥 is
nontrivial. But applying 𝑑𝑃𝑥 to 𝑥 and using Euler’s identity yields
∑
𝑑𝑃𝑥 (𝑥) = 𝑥𝑖 𝜕𝑃 ∕𝜕𝑥𝑖 (𝑥1 , … , 𝑥𝑘 ) = 𝑚𝑃 (𝑥1 , … , 𝑥𝑘 ).
𝑖
Hence if 𝑥 = (𝑥1 , … , 𝑥𝑘 ) is not a zero of 𝑃 , then 𝑑𝑃𝑥 (𝑥) is nonzero. Hence all
nonzero real numbers are regular values of 𝑃 . The Preimage Theorem now implies
that 𝑃 −1 (𝑎) is a 𝑘 − 1-dimensional submanifold of ℝ𝑘 for all 𝑎 ≠ 0.
(b) Given two real numbers 𝑎, 𝑏 > 0, then (𝑏∕𝑎)1∕𝑚 exists and we if 𝑃 (𝑥) = 𝑎, we have
Similarly, if both 𝑎, 𝑏 < 0 are negative, then (𝑏∕𝑎)1∕𝑚 exists and the same argument
shows that 𝑃 −1 (𝑎) and 𝑃 −1 (𝑏) are diffeomorphic.
Hence we get
𝑇𝐼 (𝑆𝐿(𝑛)) = Ker (𝑑(det)𝐼 ) = {𝐴 ∈ 𝑀(𝑛) ∶ tr (𝐴) = 0}.
In other words, the tangent space to 𝑆𝐿(𝑛) at the identity is the space of matrices
whose trace vanishes.
Appendix A. Solutions to exercises 377
Solution (Exercise 4.10) (a) We write (𝑧0 , 𝑧1 ) = (𝑥0 , 𝑦0 , 𝑥1 , 𝑦1 ) for real coordinates
𝑥0 , 𝑦0 , 𝑥1 , 𝑦1 . First we get
( )
̃ 0 , 𝑧1 ) = 2𝑧0 𝑧̄ 1 , |𝑧0 |2 − |𝑧1 |2
𝜋(𝑧
( )
= 2(𝑥0 𝑥1 + 𝑦0 𝑦1 ) + 𝑖2(−𝑥0 𝑦1 + 𝑦0 𝑥1 ), (𝑥20 + 𝑦20 ) − (𝑥21 + 𝑦21 ) .
Let 𝑞 = (𝑥0 , 𝑥1 , 𝑥1 , −𝑥0 ) ∈ 𝜋 −1 (𝑏) be a point in the fiber over 𝑏. Since not
both 𝑥0 and 𝑥1 can be zero, we assume that 𝑥0 ≠ 0. The tangent space 𝑇𝑞 𝕊3
is the vector space
𝑇𝑞 𝕊3 = {𝐮 ∈ ℝ4 ∶ 𝐮 ⟂ 𝑞}
⎧ ⎛−𝑥1 ⎞ ⎛−𝑥1 ⎞ ⎛𝑥0 ⎞⎫
⎪ ⟂ ⎜ 𝑥0 ⎟ ⟂ ⎜ 0 ⎟ ⟂ ⎜ 0 ⎟⎪
= span ⎨𝑞1 = ⎜ ⎟ , 𝑞2 = ⎜ 𝑥 ⎟ , 𝑞3 = ⎜ 0 ⎟⎬ .
⎪ ⎜ 0 ⎟ ⎜ 0⎟ ⎜ ⎟⎪
⎩ ⎝ 0 ⎠ ⎝ 0 ⎠ ⎝𝑥0 ⎠⎭
Now we consider the map 𝑑 𝜋̃𝑞 . We computed this map as represented by the
matrix
⎛𝑥1 −𝑥0 𝑥0 𝑥1 ⎞
𝑑 𝜋̃𝑞 = 2 ⋅ ⎜𝑥0 𝑥1 𝑥1 −𝑥0 ⎟ .
⎜ ⎟
⎝𝑥0 𝑥1 −𝑥1 𝑥0 ⎠
⎛−𝑥1 ⎞
⎜𝑥 ⎟
The kernel of this map is the span of the vector 𝑞0⟂ = ⎜ 0 ⎟. This implies
⎜ 𝑥0 ⎟
⎝ 𝑥1 ⎠
that 𝑑𝜋𝑞 is surjective onto 𝑇𝑎 𝕊2 .
More concretely, the tangent space 𝑇𝑏 𝕊2 consists of the vectors which are
orthogonal to 𝑏 in ℝ3 . Hence it has a basis (𝐞31 , 𝐞33 ) as a subspace in ℝ3 . The
map 𝑑 𝜋̃𝑞 ∶ 𝑇𝑞 ℝ4 → 𝑇𝑏 ℝ3 sends
This map is surjective, as one can check by using the conditions we have on
𝑥0 and 𝑥1 . Hence 𝑞 is a regular point for 𝜋. Since 𝑞 was any point in the fiber
over 𝑎, we have shown that 𝑏 is a regular value for 𝜋.
(d) By a previous point, we need to show that 𝑑 𝜋̃𝑞 restricted to 𝑇𝑞 𝕊3 is surjective onto
𝑇𝜋(𝑞) 𝕊2 at every 𝑞 ∈ 𝕊3 . Since the tangent space 𝑇𝜋(𝑞) 𝕊2 of 𝕊2 is two-dimensional,
we need to check that the image of 𝑑 𝜋̃𝑞 restricted to Ker (𝑑(𝑔4 )𝑞 ) spans a two-
dimensional subspace. Since Ker (𝑑(𝑔4 )𝑞 ) is of dimension 3, it suffices to show
that 𝑑 𝜋̃𝑞 has rank 3, which implies that the kernel of 𝑑 𝜋̃𝑞 has dimension 1. Hence
we need to show that 𝑑 𝜋̃𝑞 always has 3 linear independent columns.
We can show this for example by calculating the determinants of appropriate 3 × 3-
minors. Ignoring the factor 2 in our formula for 𝑑 𝜋̃𝑞 we look at the minors 𝐴𝑗 of
the remaining matrix where we omit the 𝑗th column:
( )
𝑎 𝑏
Solution (Exercise 5.1) (a) Every 𝐴 = ∈ 𝑆𝑂(2) satisfies
𝑐 𝑑
( )( ) ( 2 ) ( )
𝑇 𝑎 𝑐 𝑎 𝑏 𝑎 + 𝑐 2 𝑎𝑏 + 𝑐𝑑 1 0
𝐴 𝐴= = = .
𝑏 𝑑 𝑐 𝑑 𝑎𝑏 + 𝑐𝑑 𝑏2 + 𝑑 2 0 1
Together with det 𝐴 = 𝑎𝑑 − 𝑏𝑐 = 1 we get four linear equations for the complex
numbers 𝑎, 𝑏, 𝑐, 𝑑, and their complex conjugates. Unraveling these equations
shows that we can write 𝐴 as
( )
𝑎 𝑏
𝐴= with 𝑎𝑎̄ + 𝑏𝑏̄ = 1.
−𝑏̄ 𝑎̄
is a diffeomorphism.
Figure A.2: The column vectors are orthogonal to each other and they determine each other.
inverses are unique (as maps of sets), (𝑓 −1 )|𝑉 must agree with (𝑓|𝑈 )−1 . Hence 𝑓 −1
is a smooth map on an open neighborhood of 𝑦. Since 𝑦 was arbitrary, we see that
𝑓 −1 is smooth at every point and therefore smooth.
(c) We would like to show dim 𝑋 = rank(𝑓 ) = dim 𝑌 . Because then the Inverse
Function Theorem implies that 𝑓 is a local diffeomorphism, and, since 𝑓 is also
bijective, 𝑓 would be a diffeomorphism by the first point and we were done.
Assume 𝑋 ⊆ ℝ𝑀 and 𝑌 ⊆ 𝑅𝑁 , dim 𝑋 = 𝑚, dim 𝑌 = 𝑛, and set 𝑟 ∶= rank(𝑓 ).
By definition of the rank, we have 𝑚 ≥ 𝑟 and 𝑛 ≥ 𝑟. We want to show 𝑚 = 𝑟 = 𝑛.
For any point 𝑥 ∈ 𝑋, the linear map 𝑑𝑓𝑥 has rank 𝑟. Recall that for a linear
map 𝐿 ∶ ℝ𝑚 → ℝ𝑛 of rank 𝑟, we can choose a basis of ℝ𝑛 such that the first 𝑟
basis vectors 𝑏1 , … , 𝑏𝑟 span the image of 𝐿 and the remaining 𝑛 − 𝑟 basis vectors
𝑏𝑟+1 , … , 𝑏𝑛 span the orthogonal complement of 𝐿 in ℝ𝑛 . Then we choose a basis
of ℝ𝑚 such that the 𝑖th basis vector is sent to 𝑏𝑖 . The matrix representing 𝐿 in
these bases has the 𝑟 × 𝑟-identity matrix sitting in the upper left corner and zeros
elsewhere. Then, as in the proof of the Local Immersion (or Submersion) Theorem,
we can choose local parametrizations 𝜙 ∶ 𝑈 → 𝑋 around 𝑥 and 𝜓 ∶ 𝑉 → 𝑌
around 𝑦 such that the map 𝜃 ∶ 𝑈 → 𝑉 in the commutative diagram
𝑓
𝑋O /𝑌
O
𝜙 𝜓
𝑈 /𝑉
𝜃=𝜓 −1 ◦𝑓 ◦𝜙
382 A.5. Lie groups
has the form 𝜃(𝑥1 , … , 𝑥𝑚 ) = (𝑥1 , … , 𝑥𝑟 , 0) ∈ ℝ𝑛 . (Note that the 0 at the end of
𝜃(𝑥) only occur if 𝑟 < 𝑛.)
If 𝑚 > 𝑟, then for a sufficiently small 𝜖 > 0, 𝜃(𝑥1 , … , 𝑥𝑟 , 𝜖, 0) = (𝑥1 , … , 𝑥𝑟 , 0) and
𝜃 is not injective. Since 𝜙 and 𝜓 are diffeomorphisms, this would imply that 𝑓 is
not injective which contradicts that 𝑓 is bijective. Hence we can assume 𝑚 = 𝑟
and 𝑓 is an immersion.
Assume we had 𝑟 < 𝑛. Then, after possibly shrinking 𝑈 , we can assume that
𝑈 is a small open 𝐵𝜖 (0) around 0 in ℝ𝑚 and that 𝜃(𝐵̄ 𝜖 (0)) ⊆ 𝑉 (where 𝐵̄ 𝜖 (0)
denotes the closed ball of radius 𝜖: 𝐵̄ 𝜖 (0) = {𝑥 ∈ ℝ𝑚 ∶ |𝑥| ≤ 𝜖}). Since 𝐵̄ 𝜖 (0)
is compact, so is 𝜃(𝐵̄ 𝜖 (0)). Hence 𝜃(𝐵̄ 𝜖 (0)) is closed in 𝑉 and is contained in
𝑉 ∩ (ℝ𝑟 × {0}). Hence 𝜃(𝐵̄ 𝜖 (0)) does not contain any open subsets of 𝑉 . Since
𝜙 and 𝜓 are diffeomorphisms, this implies that 𝑓 (𝜙(𝐵̄ 𝜖 (0))) is closed and does
not contain any nonempty open subset of 𝑌 . Since we can cover 𝑋 by such local
parametrizations, we see that 𝑓 (𝑋) is the union of subsets which do not contain
any nonempty open subset of 𝑌 .
Now if 𝑋 could be assumed to be compact, then 𝑓 (𝑋) is compact, and 𝑓 (𝑋) can
be covered by finitely many closed subsets which do not contain any nonempty
open subset of 𝑌 . That would imply that 𝑓 (𝑋) is itself a closed subset which does
not contain any nonempty open subset of 𝑌 . Hence 𝑓 (𝑋) cannot be all of 𝑌 , and
𝑓 would not be surjective.
In general, for any open cover of a subspace in ℝ𝑀 , we can always choose a count-
able subcover. This implies that 𝑓 (𝑋) is the countable union of subsets which do
not contain any nonempty open subset of 𝑌 . By Baire’s Category Theorem, this
implies that 𝑓 (𝑋) does not contain any nonempty open subset of 𝑌 . Hence 𝑓 (𝑋)
cannot be equal 𝑌 and 𝑓 would not be surjective.
(d) We learned in Theorem 5.3 that a Lie group homomorphism has constant rank.
Hence we just need to apply the previous point.
𝑗 𝜇
𝐺O /𝐺×𝐺 /𝐺
O O
𝜙𝑔 𝜙𝑔 ×𝜙ℎ 𝜙𝑔ℎ
𝑈𝑔 /𝑈 ×𝑈 /𝑈 .
𝛾 𝑔 ℎ 𝜃 𝑔ℎ
where we define the maps 𝛾 and 𝜃 such that the diagram commutes. Since 𝜙𝑔 (0) =
𝑔 and 𝜙ℎ (0) = ℎ, we must have 𝛾(𝑢) = (𝑢, 0) ∈ 𝑈𝑔 × 𝑈ℎ to make the left hand
diagram commute. Moreover, we must have 𝜃(0, 0) = 0 ∈ 𝑈𝑔ℎ .
Taking derivatives at 𝑔 and using 𝑇(𝑔,ℎ) (𝐺 × 𝐺) = 𝑇𝑔 (𝐺) × 𝑇ℎ (𝐺) gives
𝑑𝑗𝑔 𝑑𝜇(𝑔,ℎ)
𝑇𝑔 (𝐺) / 𝑇 (𝐺) × 𝑇 (𝐺) / 𝑇 (𝐺)
𝑔
O O ℎ 𝑔ℎ
O
𝑑(𝜙𝑔 )0 𝑑(𝜙𝑔 )0 ×𝑑(𝜙ℎ )0 𝑑(𝜙𝑔ℎ )0
ℝ𝑛 / ℝ𝑛 × ℝ𝑛 / ℝ𝑛 .
𝑑𝛾0 𝑑𝜃0
Since 𝛾(𝑢) = (𝑢, 0), we have 𝑑𝛾0 (𝑣) = (𝑣, 0) and hence 𝑑𝑗𝑔 (𝑋) = (𝑋, 0). Since
𝜇◦𝑗 = 𝑅ℎ , we have 𝑑𝜇(𝑔,ℎ) ◦𝑑𝑗𝑔 = 𝑑(𝑅ℎ )𝑔 . Thus
As we have just learned 𝑑𝜇(𝑒,𝑒) (𝑋, 𝑑𝜄𝑒 (𝑋)) = 𝑋 + 𝑑𝜄𝑒 (𝑋) = 0, and hence 𝑑𝜄𝑒 (𝑋) =
−𝑋.
384 A.5. Lie groups
𝐺
𝜄 /𝐺
O
𝐿𝑔−1 𝑅𝑔−1
𝐺 / 𝐺.
𝜄
One easily checks that it commutes. Taking the derivative at 𝑔 of the top map
yields a commutative diagram of derivatives
𝑑𝜄𝑔
𝑇𝑔 (𝐺) / 𝑇 −1 (𝐺)
𝑔
O
𝑑(𝐿𝑔−1 )𝑔 𝑑(𝑅𝑔−1 )𝑒
𝑇𝑒 (𝐺) / 𝑇 (𝐺).
𝑑𝜄𝑒 𝑒
We just calculated the effect of the map 𝑑𝜄𝑒 ∶ 𝑇𝑒 (𝐺) → 𝑇𝑒 (𝐺) as 𝑋 → −𝑋. Hence,
since all maps in the above diagram are linear, we get
Solution (Exercise 5.5) Given elements 𝑔, ℎ ∈ 𝐺. Let 𝑅ℎ−1 denote the right translation
with ℎ−1 . We define the smooth map 𝑗ℎ by
Note that 𝑗ℎ (𝑔ℎ) = (𝑔ℎℎ−1 , ℎ) = (𝑔, ℎ) ∈ 𝐺 × 𝐺. For the tangent spaces we get
Since 𝜇◦𝑗ℎ = Id𝐺 , we also have 𝑑𝜇(𝑔,ℎ) ◦𝑑(𝑗ℎ )𝑔ℎ = Id𝑇𝑔ℎ (𝐺) . In particular,
is surjective. Since we started with arbitrary elements 𝑔 and ℎ, this shows that 𝜇 is a
submersion.
where we have used 𝑑(𝐿𝐴−1 )𝐴 (𝐵) = 𝐴−1 𝐵 which can be easily checked, since matrix
multiplication is linear. Hence, after replacing 𝐵 with 𝐴𝐵, we get
A.6.1 Transversality
1 1
𝐶 = (𝐶 + 𝐶 𝑡 ) + (𝐶 − 𝐶 𝑡 ) for any 𝐶 ∈ 𝑀(𝑛).
2 2
(c) Yes, 𝑆𝐿(𝑛) and 𝑂(𝑛) do not meet transversally in 𝑀(𝑛), since 𝑆𝑂(𝑛) is contained
in 𝑆𝐿(𝑛). Hence we also have 𝑇𝐴 (𝑂(𝑛)) = 𝑇𝐴 (𝑆𝑂(𝑛)) ⊆ 𝑇𝐴 (𝑆𝐿(𝑛)), and these
tangent spaces do not span all of 𝑇𝐴 (𝑀(𝑛)) = 𝑀(𝑛).
Now we assume 𝑓 −
⋔ 𝑔 −1 (𝑊 ), i.e.,
We need to show (𝑔◦𝑓 ) − ⋔ 𝑊 . So let 𝑥 ∈ 𝑋 be a point such that 𝑔(𝑓 (𝑥)) ∈ 𝑊 and let
𝑐 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑍). By (A.6), there are vectors 𝑏1 ∈ 𝑇𝑓 (𝑥) (𝑌 ) and 𝑏2 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) such
that
𝑑𝑔𝑓 (𝑥) (𝑏1 ) + 𝑏2 = 𝑐.
By (A.7), there are vectors 𝑎 ∈ 𝑇𝑥 (𝑋) and 𝑏3 ∈ 𝑇𝑓 (𝑥) (𝑔 −1 (𝑊 )) such that
𝑑𝑓𝑥 (𝑎) + 𝑏3 = 𝑏1 .
𝑐 = 𝑑𝑔𝑓 (𝑥) (𝑑𝑓𝑥 (𝑎) + 𝑏3 ) + 𝑏2 = 𝑑𝑔𝑓 (𝑥) (𝑑𝑓𝑥 (𝑎1 )) + 𝑑𝑔𝑓 (𝑥) (𝑏3 ) + 𝑏2 .
By the chain rule, we have 𝑑(𝑔◦𝑓 )𝑥 = 𝑑𝑔𝑓 (𝑥) ◦𝑑𝑓𝑥 . By a previous exercise, we
know 𝑇𝑓 (𝑥) (𝑔 −1 (𝑊 )) = (𝑑𝑔𝑓 (𝑥) )−1 (𝑇𝑔(𝑓 (𝑥)) (𝑊 ) in 𝑇𝑓 (𝑥) (𝑌 ). In particular, 𝑑𝑔𝑓 (𝑥) (𝑏3 ) ∈
𝑇𝑔(𝑓 (𝑥)) (𝑊 ) and thus
𝑑𝑔𝑓 (𝑥) (𝑏3 ) + 𝑏2 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑊 ).
Hence we have
𝑐 = 𝑑(𝑔◦𝑓 )𝑥 (𝑎) + 𝑑𝑔𝑓 (𝑥) (𝑏3 ) + 𝑏2 ∈ [Im (𝑑(𝑔◦𝑓 )𝑥 ) + 𝑇𝑔(𝑓 (𝑥)) (𝑊 )] ⊂ 𝑇𝑔(𝑓 (𝑥)) (𝑍).
Im (𝑑(𝑔◦𝑓 )𝑥 ) + 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) = 𝑇𝑔(𝑓 (𝑥)) (𝑍) for all 𝑥 ∈ 𝑋 with 𝑔(𝑓 (𝑥)) ∈ 𝑊 .
Im (𝑑(𝑔◦𝑓 )𝑥 ) + 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) = 𝑇𝑔(𝑓 (𝑥)) (𝑍) for all 𝑥 ∈ 𝑋 with 𝑔(𝑓 (𝑥)) ∈ 𝑊 . (A.4)
Let 𝑥 ∈ 𝑋 be a point such that 𝑓 (𝑥) ∈ 𝑔 −1 (𝑊 ) and let 𝑏 ∈ 𝑇𝑓 (𝑥) (𝑌 ). Since 𝑔(𝑓 (𝑥)) ∈ 𝑊 ,
we can use (A.8) to find vectors 𝑎 ∈ 𝑇𝑥 (𝑋) and 𝑐 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) such that
By the chain rule, we have 𝑑(𝑔◦𝑓 )𝑥 (𝑎) = 𝑑𝑔𝑓 (𝑥) (𝑑𝑓𝑥 (𝑎)). Thus
In other words,
If we can find a suitable 𝑤, then we just set 𝑣 ∶= 𝑣1 − 𝑤. Hence, by taking the difference
of the two equations, we see that the question is reduced to checking whether we can find
a 𝑤 such that 𝑣2 − 𝑣1 = 𝐴𝑤 − 𝑤 = (𝐴 − 𝐼)𝑤 where 𝐼 is the identity map of 𝑉 . But such
a 𝑤 exists for any choice of 𝑣1 and 𝑣2 if and only if 𝐴 − 𝐼 is invertible, i.e. if and only if
det(𝐴 − 𝐼) ≠ 0 which happens if and only if +1 is not an eigenvalue of 𝐴 (because the
eigenvalues are the 𝜆 such that det(𝐴 − 𝜆𝐼) = 0).
𝑓 ∶ ℂ5 ⧵ {0} → ℂ,
(𝑧1 , … , 𝑧5 ) → 𝑧21 + 𝑧22 + 𝑧23 + 𝑧34 + 𝑧6𝑘−1
5
.
Setting 𝑍 = 𝑓 −1 (0), we need to show that 𝑍 and 𝕊9 meet transversally. The tangent
space to 𝑍 in a point 𝑧 ∈ 𝑍 is the kernel of the derivative 𝑑𝑓𝑧 . Since 𝑓 is a polynomial
in the variables 𝑧1 , … , 𝑧5 , we can use our usual rules for partial differentiation to get the
following matrix for 𝑑𝑓𝑧 (in the standard basis):
( )
𝑑𝑓𝑧 ∶ ℂ5 → ℂ, 𝑑𝑓𝑧 = 2𝑧1 , 2𝑧2 , 2𝑧3 , 3𝑧24 , (6𝑘 − 1)𝑧6𝑘−25
.
( )
𝑥 −𝑦
Recall that we can represent every element 𝑥+𝑖𝑦 ∈ ℂ by the real 2×2-matrix .
𝑦 𝑥
Then we see that 𝑑𝑓𝑧 is a real 2 × 10-matrix. Its maximal rank (as a matrix with entries in
ℝ) is therefore 2. And, in fact, for every 𝑧 ≠ 0, 𝑑𝑓𝑧 has rank 2, since it maps surjectively
onto ℂ ≅ ℝ2 . Thus 0 is a regular value for 𝑓 and the tangent space 𝑇𝑧 (𝑍) is the kernel
of 𝑑𝑓𝑧 .
Writing a complex number 𝑧 = 𝑥 + 𝑖𝑦, we can express 𝕊9 as the fiber of the smooth
map
𝑔 ∶ ℂ5 ≅ ℝ10 → ℝ,
(𝑧1 , … , 𝑧5 ) → 𝑥21 + 𝑦21 + 𝑥22 + 𝑦22 + ⋯ + 𝑥25 + 𝑦25 − 1
at the regular value 0, i.e., 𝕊9 = 𝑔 −1 (0) ⊂ ℂ5 ≅ ℝ10 . The tangent space to 𝕊9 at 𝑧 is then
given by the kernel of the derivative (in standard bases)
( )
𝑑𝑔𝑧 ∶ ℂ5 = ℝ10 → ℝ, 𝑑𝑔𝑧 = 2𝑥1 , 2𝑦1 , 2𝑥2 , 2𝑦2 , … , 2𝑥5 , 2𝑦5 .
Thus, as expected, the tangent space 𝑇𝑧 (𝕊9 ) consists of all vectors 𝑤 in ℝ10 which are
orthogonal to 𝑧, i.e., which satisfy 𝑤 ⋅ 𝑧 = 0.
The tangent space of 𝕊9 is of dimension 9 and the tangent space of ℝ10 ⧵ {0} is of
dimension 10. Hence in order to show that 𝑍 and 𝕊9 meet transversally in ℝ10 ⧵ {0} we
need to show: For every 𝑧 ∈ 𝑍 ∩ 𝕊9 , there is at least one vector 𝑤 in 𝑇𝑧 (𝑍) which does
not belong to 𝑇𝑧 𝕊9 . Then we have 𝑇𝑧 (𝑍) + 𝑇𝑧 (𝕊9 ) ⊆ 𝑇𝑧 (ℝ10 ⧵ {0}) is a vector subspace
of the same dimension as 𝑇𝑧 (ℝ10 ⧵ {0}) and therefore equal 𝑇𝑧 (ℝ10 ⧵ {0}).
So let 𝑧 = (𝑧1 , … , 𝑧5 ) be a fixed point in 𝑍 ∩ 𝕊9 . The tangent space 𝑇𝑧 (𝑍) is the
kernel of 𝑑𝑓𝑧 . Hence we need to find at least one vector 𝑤 ∈ ℂ5 = ℝ10 with 𝑑𝑓𝑧 (𝑤) = 0
and 𝑤 ⋅ 𝑧 ≠ 0.
390 A.6. Transversality
⎛𝑥1 ⎞
⎜ 𝑦1 ⎟
( ) ⎜𝑥 ⎟
𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚
𝑤⋅𝑧= 𝑥 , 𝑦 , 𝑥 , 𝑦 , 𝑥 , 𝑦 , 𝑥 , 𝑦 , 𝑥 , 𝑦 ⋅ ⎜ 2⎟
2 1 2 1 2 2 2 2 2 3 2 3 3 4 3 4 6𝑘 − 1 5 6𝑘 − 1 5 ⎜ ⋮ ⎟
⎜𝑥 ⎟
⎜ 5⎟
⎝ 𝑦5 ⎠
𝑚 2 𝑚 2 𝑚 2 𝑚 2 𝑚 2
= |𝑧1 | + |𝑧2 | + |𝑧3 | + |𝑧4 | + |𝑧 |
2 2 2 3 6𝑘 − 1 5
>0
which is bigger than zero, since 𝑧 is a point on 𝕊9 . Thus 𝑤 is a vector in 𝑇𝑧 (ℝ10 ) which
is in 𝑇𝑧 (𝑍), but not in 𝑇𝑧 (𝕊9 ), and we have shown
Hence 𝑍 and 𝕊9 meet transversally in ℝ10 ⧵{0}. The codimension of 𝑍 ∩𝕊9 in ℝ10 ⧵{0}
is 2 + 1 by the codimension formula. Thus dim(𝑍 ∩ 𝕊9 ) = 10 − 3 = 7.
Appendix A. Solutions to exercises 391
A.7 Smooth Homotopy
Solution (Exercise 8.1) (a) Let 𝑓 and 𝑔 be two smooth maps 𝑌 → 𝑋. Let 𝐹 be
a homotopy from the identity map of 𝑋 to the constant map 𝑋 → {𝑥0 } for some
𝑥0 ∈ 𝑋. Then we can use 𝐹 to define a homotopy from 𝑓 to 𝑌 → {𝑥0 } and a
homotopy from 𝑌 → {𝑥0 } to 𝑔. Setting these two homotopies together yields a
homotopy 𝐻 from 𝑓 to 𝑔. It only remains to make sure that 𝐻 is smooth. To
achieve this we apply the technique used in the main text. After composing with
a smooth bump function, we can assume 𝐹 (𝑥, 𝑡) = 𝑥 for all (𝑥, 𝑡) ∈ 𝑋 × [0, 1∕4]
and 𝐹 (𝑥, 𝑡) = 𝑥0 for all (𝑥, 𝑡) ∈ 𝑋 × [3∕4, 1]. Then we can define 𝐻 by
{
𝐹 (𝑓 (𝑦), 2𝑡) 𝑡 ∈ [0, 1∕2]
𝐻 ∶ 𝑌 × [0, 1] → 𝑋, (𝑦, 𝑡) →
𝐹 (𝑔(𝑦), 2(1 − 𝑡)) 𝑡 ∈ [1∕2, 1].
Solution (Exercise 8.2) By Sard’s Theorem 7.1, there is a regular value 𝑦 ∈ 𝕊𝑛 for
𝑓 . Assume there is a point 𝑥 ∈ 𝑓 −1 (𝑦). Since dim 𝑇𝑥 𝑋 = 𝑘 and dim 𝑇𝑦 𝕊𝑛 = 𝑛, 𝑑𝑓𝑥
cannot be surjective if 𝑘 < 𝑛. Thus, if 𝑘 < 𝑛, then 𝑓 −1 (𝑦) must be empty. Hence we can
assume that the image of 𝑓 is contained in 𝑈 ∶= 𝕊𝑛 ⧵{𝑦}. Now we can use stereographic
projection from 𝑦 to define a diffeomorphism 𝜓 ∶ 𝑈 → ℝ𝑛 . Thus, 𝜓◦𝑓 is homotopic to a
constant map. Composing the homotopy with the inverse of 𝜓 defines a homotopy from
𝑓 to a constant map.
Solution (Exercise 8.3) For 𝑘 = 1, the antipodal map is (𝑥, 𝑦) → (−𝑥, −𝑦). The map
( )( )
1 1 cos(𝜋𝑡) − sin(𝜋𝑡) 𝑥
𝐹1 ∶ 𝕊 × [0, 1] → 𝕊 , ((𝑥, 𝑦), 𝑡) → .
sin(𝜋𝑡) cos(𝜋𝑡) 𝑦
is a smooth homotopy from the identity of 𝕊1 to the antipodal map. To convince ourselves
that 𝐹1 (𝑥, 𝑦,(𝑡) is an element in 𝕊
) , we can either just calculate its norm or observe that
1
cos(𝜋𝑡) − sin(𝜋𝑡)
the matrix is an element in 𝑂(2) for every 𝑡. Elements in 𝑂(2)
sin(𝜋𝑡) cos(𝜋𝑡)
preserve the scalar product and hence the norm of vectors in ℝ2 .
For an arbitrary odd 𝑘, we have 𝕊𝑘 ⊂ ℝ𝑘+1 and 𝑘 + 1 is even. Then we define a
smooth homotopy from the identity in 𝕊𝑘 to the antipodal map by
𝐹𝑘 ∶ 𝕊𝑘 × [0, 1] → 𝕊𝑘 ,
((𝑥1 , 𝑦1 ), … , (𝑥(𝑘+1)∕2 , 𝑦(𝑘+1)∕2 ), 𝑡) → (𝐹1 (𝑥1 , 𝑦1 , 𝑡), … , 𝐹1 ((𝑥(𝑘+1)∕2 , 𝑦(𝑘+1)∕2 , 𝑡)
392 A.7. Smooth Homotopy
Again, for every 𝑡, 𝐹𝑘 (−, −, 𝑡) ∶ ℝ𝑘+1 → ℝ𝑘+1 is an element in 𝑂(𝑘 + 1) and preserves
the norm on ℝ𝑘+1 .
[𝑥] = {𝑦 ∈ 𝑋 ∶ 𝑥 ∼ 𝑦}.
A crucial feature of equivalence relations is that equivalence classes are either equal or
disjoint, i.e. for any 𝑥 and 𝑦 in 𝑋 we have either [𝑥] = [𝑦] or [𝑥] ∩ [𝑦] = ∅. We are going
to use this fact in the following way: If we can show that every equivalence class [𝑥] is
an open subset of 𝑋, then we know that every [𝑥] is also a closed subset. For, 𝑋 ⧵ [𝑥]
is the union of all the other open classes and is therefore open itself (arbitrary unions of
open sets are open).
So, given an arbitrary point 𝑥0 ∈ 𝑋, we would like to show that [𝑥0 ] is open. Let
𝑥 ∈ 𝑋 be a point in [𝑥]. Since 𝑋 is a smooth manifold, there is a local parametrization
𝜙 ∶ 𝐵𝜖 (0) → 𝑈 with 𝜙(0) = 𝑥, where 𝑈 is open in 𝑋 and 𝐵𝜖 (0) is the open ball of radius
𝜖 in ℝdim 𝑋 . Given any 𝑦 ∈ 𝑈 , let 𝜙−1 (𝑦) be its preimage in 𝐵𝜖 (0). In 𝐵𝜖 (0), all points
are path-connected to 0. Hence there is a smooth path
with 𝛾(0) = 0 and 𝛾(1) = 𝜙−1 (𝑦). Since 𝜙 is a diffeomorphism, the composite 𝜙◦𝛾 is a
smooth path from 𝑥 to 𝑦 in 𝑋, i.e. 𝑥 ∼ 𝑦.
This shows that 𝑈 is contained in [𝑥0 ]. Thus [𝑥0 ] is an open subset in 𝑋, since every
point 𝑥 ∈ [𝑥0 ] has an open neighborhood in 𝑋 which is completely contained in [𝑥0 ].
Thus [𝑥0 ] is a nonempty, open and closed subset of 𝑋. Since 𝑋 is connected, this
implies [𝑥0 ] = 𝑋. Thus 𝑋 is path-connected.
Solution (Exercise 8.6) Let 𝑘 > 1 and 𝑓 ∶ 𝕊1 → 𝕊𝑘 be a smooth map. At any point
𝑥 ∈ 𝕊1 , the derivative 𝑑𝑓𝑥 ∶ 𝑇𝑥 (𝕊1 ) → 𝑇𝑓 (𝑥) (𝕊𝑘 ) is a linear map from a one-dimensional
Appendix A. Solutions to exercises 393
space to a 𝑘-dimensional space. Hence 𝑑𝑓𝑥 is not surjective for any 𝑥. Thus the only way
𝑝 ∈ 𝕊𝑘 can be a regular value is when it is not in the image of 𝑓 . But, by Sard’s Theorem,
almost every 𝑝 ∈ 𝕊𝑘 is a regular value. Hence there must be a point 𝑝 ∈ 𝕊𝑘 ⧵ 𝑓 (𝕊1 ).
Fix such a point 𝑝. Then, after possibly rotating our coordinate system, we can use 𝑝 as
a center for stereographic projection. This gives us a diffeomorphism 𝕊𝑘 ⧵ {𝑝} → ℝ𝑘 .
Since ℝ𝑘 is contractible, every smooth map into ℝ𝑘 is homotopic to a constant map.
Since the image of 𝑓 is contained in 𝕊𝑘 ⧵ {𝑝} ≅ ℝ𝑘 , 𝑓 is homootopic to a constant map.
Since 𝕊𝑘 is connected, this shows that 𝕊𝑘 is simply-connected.
Solution (Exercise 8.7) (a) The assumption that |𝑓 (𝑥) − 𝑔(𝑥)| < 2 implies that
𝑓 (𝑥) and 𝑔(𝑥) are never antipodal points. In particular, the straight line segment
from 𝑓 (𝑥) to 𝑔(𝑥) in ℝ𝑘+1 does not go through the origin for all 𝑥 ∈ 𝑋. Hence the
vector (1 − 𝑡)𝑓 (𝑥) + 𝑡𝑔(𝑥) is nonzero for all 𝑥 ∈ 𝑋 and all 𝑡 ∈ [0, 1]. Thus, we can
form the well-defined map
Solution (Exercise 8.8) (a) If 𝑘 is odd, then 𝑘 + 1 is even and we can define the
map
This map can be extended to a linear map ℝ𝑘+1 → ℝ𝑘+1 and therefore 𝑠 is smooth.
For each 𝑥 ∈ 𝕊𝑘 , 𝑠(𝑥) is nonzero and satisfies 𝑥 ⟂ 𝑠(𝑥). Thus 𝑠(𝑥) is a tangent
vector at 𝑥, i.e. 𝑠(𝑥) ∈ 𝑇𝑥 (𝕊𝑘 ) ⧵ {0}. Hence
(b) Given a vector field 𝜎 ∶ 𝕊𝑘 → 𝑇 (𝑆 𝑘 ) which has no zeros. Let 𝜎(𝑥) = (𝑥, 𝑠(𝑥)).
Since 𝑠(𝑥) ≠ 0 for every 𝑥 ∈ 𝕊𝑘 , we can define a new vector field by
𝑠(𝑥)
𝑥 → .
|𝑠(𝑥)|
By replacing 𝑠 with this new non-vanishing vector field, we can assume |𝑠(𝑥)| = 1.
Hence we can assume 𝑠(𝑥) ∈ 𝕊𝑘 and 𝑠(𝑥) ⋅ 𝑥 = 0 for every 𝑥 ∈ 𝕊𝑘 .
Now we define the map
But this is the subspace of ℝP2 given by removing the point [0 ∶ 0 ∶ 1]. Since any
subspace consisting of just one point is closed in ℝP2 , we have shown that 𝑋 can
be identified with an open subset of ℝP2 :
(b) Every line in ℝ2 is determined by the point where it crosses the 𝑥-axis and a direc-
tion which can be expressed by an angle ∈ [0, 2𝜋]. Since we have not specified a
direction for the line, two angles which differ by adding 𝜋 determine the same line.
Any angle between 0 and 2𝜋 can be described by a point on the unit circle, where
the points 𝑠 and −𝑠 on 𝕊1 correspond to angles which differ by adding 𝜋. Hence
any line in ℝ2 is determined by a (𝑠, 𝑥) ∈ 𝕊1 × ℝ where 𝑠 is uniquely determined
up to multiplying with ±1.
𝜙1 𝜙−1
2
𝜙−1
1
(𝑉1 ∩ 𝑉2 ) ←←←←→ ← 𝜙−1
← 𝑉1 ∩ 𝑉2 ←←←←←←→ 2
(𝑉1 ∩ 𝑉2 )
and
𝜙2 𝜙−1
1
𝜙−1
2
(𝑉1 ∩ 𝑉2 ) ←←←←→ ← 𝜙−1
← 𝑉1 ∩ 𝑉2 ←←←←←←→ 2
(𝑉1 ∩ 𝑉2 )
are given by
1
(𝑥, 𝑦) → (𝑥, −𝑦)
𝑥2 + 𝑦2
and are therefore smooth maps.
(b) We copy the relation we used before and just add a condition to make sure that the
norms are respected. We define ∼𝑠 by
(𝑧0 , 𝑤0 ) ∼𝑠 (𝑧1 , 𝑤1 )
⇐⇒ 𝑧1 = 𝜆𝑧0 and 𝑤1 = 𝜆𝑤0 for some 𝜆 ∈ ℂ ⧵ {0} with |𝜆| = 1.
(0, 1) if 𝑤 = 0
where we consider 𝕊2 as a subset of ℂ × ℝ as before. We need to check that this
map has the desired properties:
∙ We need to check that ℎ([𝑧 ∶ 𝑤]) actually is a point on 𝕊2 . This is true for
ℎ([𝑧 ∶ 0]) = (0, 1) and for the other points we check
2𝑧∕𝑤 ̄ 𝑤̄
2𝑧∕ (|𝑧∕𝑤|2 − 1)2
|ℎ([𝑧 ∶ 𝑤])|2 = ⋅ +
|𝑧∕𝑤|2 + 1 |𝑧∕𝑤|2 + 1 (|𝑧∕𝑤|2 + 1)2
4|𝑧∕𝑤|2 + |𝑧∕𝑤|4 − 2|𝑧∕𝑤|2 + 1
=
(|𝑧∕𝑤|2 + 1)2
|𝑧∕𝑤|4 + 2|𝑧∕𝑤|2 + 1
=
(|𝑧∕𝑤|2 + 1)2
= 1.
Appendix A. Solutions to exercises 397
∙ We need to check that this map is well-defined, i.e., if 𝜆 ≠ 0 ∈ ℂ, we need
to check that ℎ sends [𝑧 ∶ 𝑤] and [𝜆𝑧 ∶ 𝜆𝑤] to the same point. This is true,
since we have |𝑧∕𝑤| = |(𝜆𝑧)∕(𝜆𝑤)|.
∙ For (𝑧, 0) ∈ 𝕊3 , we compute
1
ℎ(𝜑([𝑧 ∶ 𝑤])) = (2𝑧∕𝑤, |𝑧∕𝑤|2 − 1)
|𝑧∕𝑤|2 + 1
|𝑤|2
= (2𝑧∕𝑤, |𝑧∕𝑤|2 − 1)
|𝑧| + |𝑤|
2 2
̄ |𝑧|2 − |𝑤|2 )
= (2𝑧∕𝑤 ⋅ (𝑤𝑤),
̄ |𝑧|2 − |𝑤|2 )
= (2𝑧 ⋅ 𝑤,
= 𝜋(𝑧, 𝑤)
(d) The fiber consists of all points (𝑧, 𝑤) in 𝕊3 with [𝑧 ∶ 𝑤] = [𝑧0 ∶ 𝑤0 ]. By the
definition of ℂP1 , that means (𝑧, 𝑤) is in the fiber if and only if there is a 𝜆 ∈ ℂ⧵{0}
such that (𝑧, 𝑤) = (𝜆𝑧0 , 𝜆𝑤0 ). The additional feature we need to remember is that
(𝑧, 𝑤) is in 𝕊3 just as we did for the relation ∼𝑠 . Hence 𝜆 needs to satisfy |𝜆| = 1.
Summarising, we have
In other words, the points in the fiber 𝜑−1 ([𝑧0 ∶ 𝑤0 ]) are in one-to-one correspon-
dence to points 𝜆 on the circle 𝕊1 ⊂ ℂ. This shows again that the fiber of the Hopf
map at any point is diffeomorphic to 𝕊1 .
Solution (Exercise 9.3) (a) We write down local coordinate charts. For 𝑧 ∈ ℂ⧵{0},
let [𝑧] be its equivalence class in 𝐻 2 . We pick a point [𝑧0 ] ∈ 𝐻 2 . Choosing
𝜀 > 0 small enough, i.e., in our case 0 < 𝜀 < 𝜆∕2 is enough, the open ball
𝐵𝜀2 (𝑧0 ) ⊂ ℂ⧵{0} does not contain any point 𝑧 with [𝑧] = [𝑧0 ]. Hence the restriction
𝐵𝜀2 (𝑧0 ) → 𝐻 2 of the quotient map to 𝐵𝜀2 (𝑧0 ) is a homeomorphism onto its image
𝐵̄ 𝜀2 ([𝑧0 ]) ⊂ 𝐻 2 . The inverse is given by sending a point [𝑧] ∈ 𝐵̄ 𝜀2 ([𝑧0 ]) to the point
𝑧 ∈ 𝐵𝜀2 (𝑧0 ). Since 𝜀 is small enough, there is a unique such lift in 𝐵𝜀2 (𝑧0 ). This
defines a homeomorphism
is just the identity. Since this is a smooth map, we have shown that 𝐻 2 is an abstract
smooth 2-manifold.
(b) We write [𝑧] for the image of 𝑧 ∈ 𝐴 in 𝐴∕ℤ. We claim that the map
where [𝑧]𝐴 and [𝑧]𝐻 denote the equivalence classes of 𝑧 ∈ ℂ in 𝐴∕ℤ and 𝐻 2 ,
respectively. We claim that ℎ is a homeomorphism:
This shows that 𝐻 2 is compact, since it is the image of a compact space under a
continuous map.
(c) For (𝑧, 𝑤) ∈ ℂ2 ⧵ {0}, let [𝑧, 𝑤] be its equivalence class in 𝐻 4 . We pick a point
[𝑧0 , 𝑤0 ] ∈ 𝐻 4 . Choosing 𝜀 > 0 small enough, the open ball 𝐵𝜀4 (𝑧0 ) ⊂ ℂ2 ⧵ {0}
does not contain any point (𝑧, 𝑤) with [𝑧, 𝑤] = [𝑧0 , 𝑤0 ]. Hence the restriction
𝐵𝜀4 (𝑧0 , 𝑤0 ) → 𝐻 4 to 𝐵𝜀2 (𝑧0 ) is a homeomorphism onto its image 𝐵̄ 𝜀4 ([𝑧0 , 𝑤0 ]) ⊂
𝐻 4 . The inverse is given by sending a point [𝑧, 𝑤] ∈ 𝐵̄ 𝜀4 ([𝑧0 , 𝑤0 ]) to the point
(𝑧, 𝑤) ∈ 𝐵𝜀4 (𝑧0 , 𝑤0 ). Since 𝜀 is small enough, there is a unique such lift in
𝐵𝜀4 (𝑧0 , 𝑤0 ). This defines a homeomorphism
is just the identity. Since this is a smooth map, we have shown that 𝐻 4 is an abstract
smooth 4-manifold.
Appendix A. Solutions to exercises 399
(d) We consider 𝕊3 as a subset in ℂ2 ⧵ {0} and 𝕊1 as the quotient [1, 1∕𝜆]∕(1 ∼ 1∕𝜆),
i.e., the closed interval [1, 1∕𝜆] ⊂ ℝ where we identify the endpoints. We define
a map
∙ Our copy of 𝕊1 equals the quotient ℝ∕ℤ where we identify two real numbers
𝑠1 and 𝑠2 if 𝑠1 − 𝑠2 ∈ ℤ. This makes it easy to provide local charts. Around
any 𝑥 ∈ ℝ we can look at the open neighborhood (−𝜀 + 𝑥, 𝑥 + 𝜀) which
maps homeomorphically onto its image in 𝕊1 = ℝ∕ℤ under the quotient map
ℝ → ℝ∕ℤ. On points in ℂ2 ⧵ {0} and 𝕊3 , 𝑓 and its inverse are just scaling
by a real number, some integer power of 𝜆. So composition with local charts
results in a smooth map, since the local charts on 𝕊1 = ℝ∕ℤ are just identity
maps and the local charts on 𝕊3 are diffeomorphisms as we have seen many
times before. Hence 𝑓 is a diffeomorphism as it is a smooth homeomorphism
with a smooth inverse.
where the first 1 and the sum are at position 𝑖 + 1, and the second 1 is at position 𝑗 + 1.
Note that this actually yields an element in 𝐻(𝑚, 𝑛), and not just ℝP𝑚 × ℝP𝑛 , since the
400 A.8. Abstract Smooth Manifolds
We can check that the change of coordinate maps are smooth just as we did for real
∑
projective space, since the sum 𝑚𝑖=0 𝑥𝑖 𝑦𝑖 is a polynomial and hence smooth.
Appendix A. Solutions to exercises 401
A.9 Manifolds with Boundary
𝑓 ◦𝜙 = 𝜓◦𝜃.
Boundary points of 𝑋 are those which are in the image 𝜙(𝜕𝑈 ) = 𝜙(𝑈 ∩ 𝜕ℍ𝑘 ). Sim-
ilarly, boundary points of 𝑋 are those which are in the image 𝜓(𝜕𝑉 ) = 𝜓(𝑉 ∩ 𝜕ℍ𝑘 ).
Hence we need to show 𝜃(𝜕𝑈 ) ⊂ 𝜕𝑉 , for then
The argument is again based on the Inverse Function Theorem. Suppose there is
a point 𝑢 ∈ 𝜕𝑈 which is mapped to an interior point 𝑣 = 𝜃(𝑢) in 𝑉 . Since 𝜃 is a
diffeomorphism, the derivative 𝑑(𝜃 −1 )𝑣 ∶ ℝ𝑘 → ℝ𝑘 of its inverse is an isomorphism.
But, since 𝑣 ∈ Int(𝑉 ), 𝑉 contains a neighborhood 𝑊 of 𝑣 that is open in ℝ𝑘 . Thus the
Inverse Function Theorem implies that 𝜃 −1 (𝑊 ) contains a neighborhood of 𝑢 that is open
in ℝ𝑘 . Hence 𝑢 is also an interior point in 𝑈 which contradicts the assumption 𝑢 ∈ 𝜕𝑈 .
Solution (Exercise 10.3) (a) The image of 𝐹 is the product 𝕊1 × [−1∕2, 1∕2]. This
is a product of a manifold without a boundary 𝕊1 and the manifold [−1∕2, 1∕2]
with boundary. The boundary of [−1∕2, 1∕2] constists of the disjoint union of
{−1∕2} and {1∕2}. By Lemma 10.7, we get
As one can check by calculating the partial derivatives, each of these maps are
diffeomorphisms, and the union of their images covers 𝑌 . Hence we can use these
four maps as local parametrizations of 𝑌 .
The boundary of 𝑌 is then given by the union of the points
𝜕𝑌 = 𝜙+ ((−𝜋, 𝜋) × {0}) ∪ 𝜙− ((−𝜋, 𝜋) × {0}) ∪ 𝜓+ ((0, 2𝜋) × {0}) ∪ 𝜓− ((0, 2𝜋) × {0}).
Hence
1 1 1
𝜕𝑌 = {((1 + cos(𝑡∕2)) cos 𝑡, (1 + cos(𝑡∕2)) sin 𝑡, sin(𝑡∕2)) ∈ ℝ3 ∶ 𝑡 ∈ ℝ}.
2 2 2
𝜙 ∶ 𝑈 → 𝕊1 , 𝑡 → (cos(𝑡∕2), sin(𝑡∕2))
and
1 1 1
𝜓 ∶ 𝑈 → 𝜕𝑌 , 𝑡 → ((1 + cos(𝑡∕2)) cos 𝑡, (1 + cos(𝑡∕2)) sin 𝑡, sin(𝑡∕2))
2 2 2
where 𝑈 ⊂ ℝ is some sufficiently small open subset. Then 𝜙 and 𝜓 serve as local
parametrizations of 𝕊1 and 𝜕𝑌 , respectively, for suitable choices of 𝑈 . But the
induced map 𝜃 ∶ 𝑈 → 𝑈 which arises as the composite 𝜓 −1 ◦𝜑|𝕊1 ◦𝜙 is just the
identity 𝑡 → 𝑡. Hence 𝜑|𝕊1 is a local diffeomorphism.
Solution (Exercise 10.4) (a) We proceed as before when we showed that tangent
spaces are well-defined.
Let 𝜓 ∶ 𝑉 → 𝑋 be another local parametrization around 𝑥 with 𝜓(0) = 𝑥, where 𝑉
is an open subset of ℍ𝑘 . By shrinking both 𝑈 and 𝑉 , we can assume 𝜙(𝑈 ) = 𝜓(𝑉 )
(replace 𝑈 by 𝜙−1 (𝜙(𝑈 ) ∩ 𝜓(𝑉 )) ⊂ 𝑈 and 𝑉 by 𝜓 −1 (𝜙(𝑈 ) ∩ 𝜓(𝑉 )) ⊂ 𝑉 ).
Then the map
𝜃 ∶= 𝜓 −1 ◦𝜙 ∶ 𝑈 → 𝑉
is a diffeomorphism (its the composite of two diffeomorphisms). By definition of
𝜃, we have 𝜙 = 𝜓◦𝜃. Differentiating yields
(where we have used the chain rule). This implies that the image of 𝑑𝜙0 is con-
tained in the image of 𝑑𝜓0 :
(b) The codimension of 𝑇𝑥 (𝜕𝑋) in 𝑇𝑥 (𝑋) is one. Thus the orthogonal complement
of 𝑇𝑥 (𝜕𝑋) is one-dimensional and is spanned by one vector. By definition of 𝜕𝑋
as the image of the points in 𝜕𝐻𝐻 𝑘 under local parametrizations, we know that
𝑑𝜙0 (𝑒𝑘 ) spans the complement of 𝑇𝑥 (𝜕𝑋) in 𝑇𝑥 (𝑋), since 𝑑𝜙0 is an isomorphism
404 A.9. Manifolds with Boundary
and 𝑒𝑘 = (0, … , 0, 1) is nonzero and not contained in 𝑑𝜙0 (ℍ𝑘 ). We also know by
the definition of 𝐻𝑥 (𝑋) that 𝑑𝜙0 (𝑒𝑘 ) ∈ 𝐻𝑥 (𝑋), and therefore 𝑑𝜙0 (−𝑒𝑘 ) ∉ 𝐻𝑥 (𝑋).
But we do not know whether 𝑑𝜙0 (−𝑒𝑘 ) is orthogonal to 𝑇𝑥 (𝜕𝑋) in 𝑇𝑥 (𝑋). To make
𝑑𝜙0 (−𝑒𝑘 ) into a vector which is orthogonal to 𝑇𝑥 (𝜕𝑋), we apply the Gram-Schmidt
process. It produces a unit vector which is orthogonal to 𝑇𝑥 (𝜕𝑋). We denote this
vector 𝑛(𝑥), this is the outward unit normal vector to 𝜕𝑋. Note that −𝑛(𝑥) is a
unit vector contained in 𝐻𝑥 (𝑋) and orthogonal to 𝑇𝑥 (𝜕𝑋), this is the inward unit
normal vector to 𝜕𝑋.
(c) From what we have learned in the previous point, we can construct 𝑛(𝑥) by ap-
plying the Gram-Schmidt orthonormalization process to 𝑑𝜙0 (−𝑒𝑘 ). This process
depends smoothly on the coefficients in the matrix representing 𝑑𝜙0 . Since the
derivative 𝑑𝜙𝑢 depends smoothly on 𝑢, 𝑑𝜙𝑢 (−𝑒𝑘 ) depends smoothly on 𝑢. By the
independence of the choice of local parametrization, we see that 𝑛(𝑦) = 𝑑𝜙𝑢 (−𝑒𝑘 )
for all 𝑦 ∈ 𝜙(𝜕𝑈 ) which is an open neighborhood of 𝑥 in 𝜕𝑋, where 𝜙(𝑢) = 𝑦.
Thus, in total we see that 𝑛(𝑥) depends smoothly on 𝑥 in 𝜕𝑋.
𝜕𝑓 ∶ 𝜕𝑋 → 𝑌 , (−1, 𝑦) → 1 + 𝑦2 .
Hence the derivative of 𝜕𝑓 is given by the 1 × 1-matrix (𝜕𝑓 )(−1,𝑦) = 2𝑦. This is
a linear map which is surjective if and only if 𝑦 ≠ 0. Since (−1, 0) ∈ 𝜕𝑋 and
𝜕𝑓 (−1, 0) = 1, we see that 1 is not a regular value of 𝜕𝑓 .
(b) The preimage 𝑓 −1 (1) is just the unit sphere 𝕊1 . Hence the boundary 𝜕(𝑓 −1 (1)) is
empty. However,
Let 𝑔 ∶ 𝕊𝑛−1 → 𝕊𝑛−1 be the map 𝑣 → 𝐴𝑣∕|𝐴𝑣|. Now we use the assumption on 𝐴: if
then 𝐴𝑣 has only nonnegative entries, since the entries in 𝑣 are all nonnegative and all
the entries in 𝐴 are by assumption nonnegative. Since |𝐴𝑣| > 0 is nonnegative as well,
we know that 𝑔(𝑣) is an element in 𝑄. Thus we can restrict 𝑔 to a map 𝑔 ∶ 𝑄 → 𝑄.
≅
Now we can compose with a homeomorphism 𝜑 ∶ 𝑄 ←←←→
← 𝔹𝑛−1 to get a continuous
map
𝜑 𝑔 𝜑−1
𝑓 ∶ 𝔹𝑛−1 ←←←→
← 𝑄 ←←→ ← 𝔹𝑛−1 .
← 𝑄 ←←←←←←→
By the Brouwer Fixed Point Theorem for continuous maps, 𝑓 must have a fixed point
𝑦 ∈ 𝔹𝑛−1 with 𝑓 (𝑦) = 𝑦. Hence the image of 𝑤 ∶= 𝜑−1 (𝑦) is a vector in 𝕊𝑛−1 ⊂ ℝ𝑛 with
(d) Assume there was such a homotopy 𝐹 ∶ 𝕊1 × [0, 1] → 𝑋 = ℝ2 ⧵ {(0, 0)}. Then we
get a homotopy 𝐻 between the identity map of ℝ2 ⧵ {(0, 0)} and a constant map
as follows: Let 𝐺 ∶ 𝑋 × [0, 1] → 𝑋 be a homotopy between the identity of 𝑋, i.e.,
𝐺(𝑝, 0) = 𝑝, and the map 𝑟, i.e., 𝐺(𝑝, 1) = 𝑝∕|𝑝|. Then we define a homotopy by
This would imply that ℝ2 ⧵ {(0, 0)} is contractible. This contradicts our result that
ℝ2 ⧵ {(0, 0)} is not simply-connected.
406 A.10. Brouwer Degree mod 2 and Borsuk–Ulam Theorem
A.10 Brouwer Degree mod 2 and Borsuk–Ulam Theorem
Solution (Exercise 11.1) Let 𝑧 be a regular value for 𝑔. If 𝑧 is not in the image of 𝑔,
then deg2 (𝑔) and deg2 (𝑔◦𝑓 ) both vanish and the claim is true. So let us assume that 𝑧 is
in the image of 𝑔. We showed in a previous exercise that 𝑧 then is a regular value for 𝑔◦𝑓
if and only if every 𝑦 ∈ 𝑔 −1 (𝑧) is a regular value for 𝑓 . Hence we can use 𝑧 to compute
deg2 (𝑔◦𝑓 ) if and only if we can use 𝑦 ∈ 𝑔 −1 (𝑧) to compute deg2 (𝑓 ). If 𝑧 is not in the
image of 𝑔◦𝑓 , then 𝑦 is not in the image of 𝑓 . In this case both deg2 (𝑔◦𝑓 ) and deg2 (𝑓 )
vanish and the claim is true. So assume there are points 𝑥 ∈ 𝑋 with 𝑔(𝑓 (𝑥)) = 𝑧. Then
it remains to apply the definition of deg2 . By our assumptions, we have
where we use that the number of points in the fiber 𝑓 −1 (𝑦) is the same for every 𝑦 ∈ 𝑔 −1 (𝑧)
which is a regular value for 𝑓 .
deg2 (𝑓 ) = #𝑓 −1 (𝑦) = 0.
(b) Let us assume deg2 (𝑓 ) ≠ 0 and derive a contradiction. By the previous point, if
deg2 (𝑓 ) ≠ 0, then 𝑓 is surjective. But that means 𝑌 = 𝑓 (𝑋). Since 𝑓 is continuous
and 𝑋 is compact, the image of 𝑋 under 𝑓 is compact. Hence 𝑌 would be compact
as the continuous image of a compact space. This contradicts the assumption.
Hence we must have deg2 (𝑓 ) = 0.
(c) Let 𝑓 ∶ 𝕊1 → 𝕊1 be a smooth map without fixed points. We define the map
𝑓 (𝑥)(1 − 𝑡) − 𝑡𝑥
𝐹 (𝑥, 𝑡) ∶ 𝕊1 × [0, 1] → 𝕊1 , (𝑥, 𝑡) →
|𝑓 (𝑥)(1 − 𝑡) − 𝑡𝑥|
is a homotopy between 𝑓 and 𝛼.
Since 𝛼 −1 (𝑥) = −𝑥 for all 𝑥 ∈ 𝕊1 , there is exactly one preimage point for each 𝑥.
Hence deg2 (𝛼) = 1. Since 𝑓 and 𝛼 are homotopic, the invariance of deg2 under
homotopy implies deg2 (𝑓 ) = 1. By the first point, deg2 (𝑓 ) = 1 implies that 𝑓 is
surjective.
Since |𝑧|7 dominates the absolute value of 𝑓𝑡 (𝑧) for all 𝑡 ∈ [0, 1], or in other words, since
the second summand in the parantheses goes to 0 when 𝑧 → ∞ in
𝑓𝑡 (𝑧) cos(|𝑧|2 )
= 1 + 𝑡 (1 + 93𝑧4 ),
𝑧7 𝑧7
𝑓𝑡 (𝑧) has no zero on the boundary of a closed ball 𝑊 ⊂ ℂ of large enough radius. Thus
the homotopy
𝑓𝑡 (𝑧)
∶ 𝜕𝑊 → 𝕊1
|𝑓𝑡 (𝑧)|
Thus, by the Boundary Theorem 11.8 for deg2 , 𝑓 (𝑧) must have a zero inside 𝑊 .
408 A.10. Brouwer Degree mod 2 and Borsuk–Ulam Theorem
For 𝑧 ≠ 0, we consider
𝑝𝑡 (𝑧) (𝑎 𝑎𝑚 )
1
= 1 + 𝑡 ⋅ + ⋯ + .
𝑧𝑚 𝑧 𝑧𝑚
𝑎 𝑎
As 𝑧 → ∞ moves towards infinity, the term 𝑧1 + ⋯ + 𝑧𝑚𝑚 → 0 moves towards zero.
Hence, if 𝑊 is a closed ball around the origin in ℂ with sufficiently large radius, none
of the 𝑝𝑡 has a zero on 𝜕𝑊 .
Thus the homotopy
𝑝𝑡
∶ 𝜕𝑊 → 𝕊1
|𝑝𝑡 |
Solution (Exercise 11.5) Assume that 𝑓1 , … , 𝑓𝑘 did not have a common zero. Then
we can form the smooth odd map
Now we can apply Theorem 11.18 to 𝑓 and 𝐿 being the 𝑥𝑘+1 -axis. Hence 𝑓 intersects
𝐿 at least once. But 𝑥 with 𝑓 (𝑥) ∈ 𝐿 is a common zero of the 𝑓1 , … , 𝑓𝑘 . Hence the
𝑓1 , … , 𝑓𝑘 must have had a common zero after all.
Then each 𝑓𝑖 is smooth and odd. The functions 𝑓1 , … , 𝑓𝑘 satisfy the assumption of the
previous exercise. Hence there is a common zero of the 𝑓1 , … , 𝑓𝑘 which is the desired
Appendix A. Solutions to exercises 409
point 𝑝 ∈ 𝕊𝑘 .
Solution (Exercise 11.7) Since the 𝑝𝑖 ’s are all homogeneous of odd order, they satisfy
Moreover, for any 𝑥 ∈ ℝ𝑛+1 ⧵ {0}, we can consider the associated map
( )
𝑛 𝑥
𝑞𝑖 ∶ 𝕊 → ℝ, 𝑥 → 𝑝𝑖 .
|𝑥|
By the first exercise, we know that these 𝑛 maps 𝑞1 , … , 𝑞𝑛 must have a common zero
on 𝕊𝑛 , say 𝑥0 ∈ 𝕊𝑛 .
Since the 𝑝𝑖 are homogeneous, 𝑞𝑖 (𝑥0 ) = 𝑝𝑖 (𝑥0 ) = 0 implies
Hence the line spanned by 𝑥0 in ℝ𝑛+1 is the desired line on which all 𝑝𝑖 vanish simulta-
neously.
Solution (Exercise 11.8) Assume such a map 𝑓 existed. Then we could define the
continuous map
√
𝑔 ∶ 𝔹2 = {(𝑥, 𝑦) ∈ ℝ2 ∶ 𝑥2 + 𝑦2 ≤ 1} → 𝕊1 , 𝑔(𝑥, 𝑦) ∶= 𝑓 (𝑥, 𝑦, 1 − 𝑥2 − 𝑦2 ).
Note that 𝑔 is continuous, since 𝑔 is the composite of 𝑓 with the inverse of the projection
the projection from the upper hemisphere of the sphere to 𝔹2 . Then, by the assumption
on 𝑓 , we have
for any (𝑥, 𝑦) ∈ 𝕊1 = 𝜕𝔹2 , i.e., for (𝑥, 𝑦) such that 1 − 𝑥2 − 𝑦2 = 0. In particular,
𝑔|𝜕𝔹2 ∶ 𝕊1 → 𝕊1 satisfies the assumptions of the Borsuk–Ulam Theorem. Hence, by the
Borsuk–Ulam Theorem, deg2 (𝑔) = 1. But 𝑔|𝜕𝔹2 can be extended to a continuous map
on all of 𝔹2 . Thus, by the Boundary Theorem for degrees, deg2 (𝑔) = 0. Hence the
assumption that 𝑓 exists, leads to a contradiction. Thus 𝑓 cannot exist.
Solution (Exercise 11.9) Assume that there is no such point 𝑝. Then we can consider
the map 𝑔 ∶ 𝕊2 → 𝕊1 given by
𝑓 (𝑝) − 𝑓 (−𝑝)
𝑔(𝑝) = .
|𝑓 (𝑝) − 𝑓 (−𝑝)|
By the assumption, 𝑔 is smooth and satisfies 𝑔(−𝑝) = −𝑔(𝑝) for all 𝑝 ∈ 𝕊2 . This contra-
410 A.10. Brouwer Degree mod 2 and Borsuk–Ulam Theorem
dicts the result of the previous exercise.
A.11.1 Transversality
1 1
𝐶 = (𝐶 + 𝐶 𝑡 ) + (𝐶 − 𝐶 𝑡 ) for any 𝐶 ∈ 𝑀(𝑛).
2 2
(c) Yes, 𝑆𝐿(𝑛) and 𝑂(𝑛) do not meet transversally in 𝑀(𝑛), since 𝑆𝑂(𝑛) is contained
in 𝑆𝐿(𝑛). Hence we also have 𝑇𝐴 (𝑂(𝑛)) = 𝑇𝐴 (𝑆𝑂(𝑛)) ⊆ 𝑇𝐴 (𝑆𝐿(𝑛)), and these
tangent spaces do not span all of 𝑇𝐴 (𝑀(𝑛)) = 𝑀(𝑛).
Now we assume 𝑓 −
⋔ 𝑔 −1 (𝑊 ), i.e.,
We need to show (𝑔◦𝑓 ) − ⋔ 𝑊 . So let 𝑥 ∈ 𝑋 be a point such that 𝑔(𝑓 (𝑥)) ∈ 𝑊 and let
𝑐 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑍). By (A.6), there are vectors 𝑏1 ∈ 𝑇𝑓 (𝑥) (𝑌 ) and 𝑏2 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) such
that
𝑑𝑔𝑓 (𝑥) (𝑏1 ) + 𝑏2 = 𝑐.
By (A.7), there are vectors 𝑎 ∈ 𝑇𝑥 (𝑋) and 𝑏3 ∈ 𝑇𝑓 (𝑥) (𝑔 −1 (𝑊 )) such that
𝑑𝑓𝑥 (𝑎) + 𝑏3 = 𝑏1 .
𝑐 = 𝑑𝑔𝑓 (𝑥) (𝑑𝑓𝑥 (𝑎) + 𝑏3 ) + 𝑏2 = 𝑑𝑔𝑓 (𝑥) (𝑑𝑓𝑥 (𝑎1 )) + 𝑑𝑔𝑓 (𝑥) (𝑏3 ) + 𝑏2 .
By the chain rule, we have 𝑑(𝑔◦𝑓 )𝑥 = 𝑑𝑔𝑓 (𝑥) ◦𝑑𝑓𝑥 . By a previous exercise, we
know 𝑇𝑓 (𝑥) (𝑔 −1 (𝑊 )) = (𝑑𝑔𝑓 (𝑥) )−1 (𝑇𝑔(𝑓 (𝑥)) (𝑊 ) in 𝑇𝑓 (𝑥) (𝑌 ). In particular, 𝑑𝑔𝑓 (𝑥) (𝑏3 ) ∈
𝑇𝑔(𝑓 (𝑥)) (𝑊 ) and thus
𝑑𝑔𝑓 (𝑥) (𝑏3 ) + 𝑏2 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑊 ).
Hence we have
𝑐 = 𝑑(𝑔◦𝑓 )𝑥 (𝑎) + 𝑑𝑔𝑓 (𝑥) (𝑏3 ) + 𝑏2 ∈ [Im (𝑑(𝑔◦𝑓 )𝑥 ) + 𝑇𝑔(𝑓 (𝑥)) (𝑊 )] ⊂ 𝑇𝑔(𝑓 (𝑥)) (𝑍).
Im (𝑑(𝑔◦𝑓 )𝑥 ) + 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) = 𝑇𝑔(𝑓 (𝑥)) (𝑍) for all 𝑥 ∈ 𝑋 with 𝑔(𝑓 (𝑥)) ∈ 𝑊 .
Im (𝑑(𝑔◦𝑓 )𝑥 ) + 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) = 𝑇𝑔(𝑓 (𝑥)) (𝑍) for all 𝑥 ∈ 𝑋 with 𝑔(𝑓 (𝑥)) ∈ 𝑊 . (A.8)
Let 𝑥 ∈ 𝑋 be a point such that 𝑓 (𝑥) ∈ 𝑔 −1 (𝑊 ) and let 𝑏 ∈ 𝑇𝑓 (𝑥) (𝑌 ). Since 𝑔(𝑓 (𝑥)) ∈ 𝑊 ,
we can use (A.8) to find vectors 𝑎 ∈ 𝑇𝑥 (𝑋) and 𝑐 ∈ 𝑇𝑔(𝑓 (𝑥)) (𝑊 ) such that
By the chain rule, we have 𝑑(𝑔◦𝑓 )𝑥 (𝑎) = 𝑑𝑔𝑓 (𝑥) (𝑑𝑓𝑥 (𝑎)). Thus
In other words,
If we can find a suitable 𝑤, then we just set 𝑣 ∶= 𝑣1 − 𝑤. Hence, by taking the difference
of the two equations, we see that the question is reduced to checking whether we can find
a 𝑤 such that 𝑣2 − 𝑣1 = 𝐴𝑤 − 𝑤 = (𝐴 − 𝐼)𝑤 where 𝐼 is the identity map of 𝑉 . But such
a 𝑤 exists for any choice of 𝑣1 and 𝑣2 if and only if 𝐴 − 𝐼 is invertible, i.e. if and only if
det(𝐴 − 𝐼) ≠ 0 which happens if and only if +1 is not an eigenvalue of 𝐴 (because the
eigenvalues are the 𝜆 such that det(𝐴 − 𝜆𝐼) = 0).
𝑓 ∶ ℂ5 ⧵ {0} → ℂ,
(𝑧1 , … , 𝑧5 ) → 𝑧21 + 𝑧22 + 𝑧23 + 𝑧34 + 𝑧6𝑘−1
5
.
Setting 𝑍 = 𝑓 −1 (0), we need to show that 𝑍 and 𝕊9 meet transversally. The tangent
space to 𝑍 in a point 𝑧 ∈ 𝑍 is the kernel of the derivative 𝑑𝑓𝑧 . Since 𝑓 is a polynomial
in the variables 𝑧1 , … , 𝑧5 , we can use our usual rules for partial differentiation to get the
following matrix for 𝑑𝑓𝑧 (in the standard basis):
( )
𝑑𝑓𝑧 ∶ ℂ5 → ℂ, 𝑑𝑓𝑧 = 2𝑧1 , 2𝑧2 , 2𝑧3 , 3𝑧24 , (6𝑘 − 1)𝑧6𝑘−25
.
( )
𝑥 −𝑦
Recall that we can represent every element 𝑥+𝑖𝑦 ∈ ℂ by the real 2×2-matrix .
𝑦 𝑥
Then we see that 𝑑𝑓𝑧 is a real 2 × 10-matrix. Its maximal rank (as a matrix with entries in
ℝ) is therefore 2. And, in fact, for every 𝑧 ≠ 0, 𝑑𝑓𝑧 has rank 2, since it maps surjectively
onto ℂ ≅ ℝ2 . Thus 0 is a regular value for 𝑓 and the tangent space 𝑇𝑧 (𝑍) is the kernel
of 𝑑𝑓𝑧 .
Writing a complex number 𝑧 = 𝑥 + 𝑖𝑦, we can express 𝕊9 as the fiber of the smooth
map
𝑔 ∶ ℂ5 ≅ ℝ10 → ℝ,
(𝑧1 , … , 𝑧5 ) → 𝑥21 + 𝑦21 + 𝑥22 + 𝑦22 + ⋯ + 𝑥25 + 𝑦25 − 1
at the regular value 0, i.e., 𝕊9 = 𝑔 −1 (0) ⊂ ℂ5 ≅ ℝ10 . The tangent space to 𝕊9 at 𝑧 is then
given by the kernel of the derivative (in standard bases)
( )
𝑑𝑔𝑧 ∶ ℂ5 = ℝ10 → ℝ, 𝑑𝑔𝑧 = 2𝑥1 , 2𝑦1 , 2𝑥2 , 2𝑦2 , … , 2𝑥5 , 2𝑦5 .
Thus, as expected, the tangent space 𝑇𝑧 (𝕊9 ) consists of all vectors 𝑤 in ℝ10 which are
orthogonal to 𝑧, i.e., which satisfy 𝑤 ⋅ 𝑧 = 0.
The tangent space of 𝕊9 is of dimension 9 and the tangent space of ℝ10 ⧵ {0} is of
dimension 10. Hence in order to show that 𝑍 and 𝕊9 meet transversally in ℝ10 ⧵ {0} we
need to show: For every 𝑧 ∈ 𝑍 ∩ 𝕊9 , there is at least one vector 𝑤 in 𝑇𝑧 (𝑍) which does
not belong to 𝑇𝑧 𝕊9 . Then we have 𝑇𝑧 (𝑍) + 𝑇𝑧 (𝕊9 ) ⊆ 𝑇𝑧 (ℝ10 ⧵ {0}) is a vector subspace
of the same dimension as 𝑇𝑧 (ℝ10 ⧵ {0}) and therefore equal 𝑇𝑧 (ℝ10 ⧵ {0}).
So let 𝑧 = (𝑧1 , … , 𝑧5 ) be a fixed point in 𝑍 ∩ 𝕊9 . The tangent space 𝑇𝑧 (𝑍) is the
kernel of 𝑑𝑓𝑧 . Hence we need to find at least one vector 𝑤 ∈ ℂ5 = ℝ10 with 𝑑𝑓𝑧 (𝑤) = 0
and 𝑤 ⋅ 𝑧 ≠ 0.
Appendix A. Solutions to exercises 415
⎛𝑥1 ⎞
⎜ 𝑦1 ⎟
( ) ⎜𝑥 ⎟
𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚 𝑚
𝑤⋅𝑧= 𝑥 , 𝑦 , 𝑥 , 𝑦 , 𝑥 , 𝑦 , 𝑥 , 𝑦 , 𝑥 , 𝑦 ⋅ ⎜ 2⎟
2 1 2 1 2 2 2 2 2 3 2 3 3 4 3 4 6𝑘 − 1 5 6𝑘 − 1 5 ⎜ ⋮ ⎟
⎜𝑥 ⎟
⎜ 5⎟
⎝ 𝑦5 ⎠
𝑚 2 𝑚 2 𝑚 2 𝑚 2 𝑚 2
= |𝑧1 | + |𝑧2 | + |𝑧3 | + |𝑧4 | + |𝑧 |
2 2 2 3 6𝑘 − 1 5
>0
which is bigger than zero, since 𝑧 is a point on 𝕊9 . Thus 𝑤 is a vector in 𝑇𝑧 (ℝ10 ) which
is in 𝑇𝑧 (𝑍), but not in 𝑇𝑧 (𝕊9 ), and we have shown
Hence 𝑍 and 𝕊9 meet transversally in ℝ10 ⧵{0}. The codimension of 𝑍 ∩𝕊9 in ℝ10 ⧵{0}
is 2 + 1 by the codimension formula. Thus dim(𝑍 ∩ 𝕊9 ) = 10 − 3 = 7.
416 A.12. Intersection Theory modulo 2
A.12 Intersection Theory modulo 2
𝑓−
⋔ 𝑔 −1 (𝑊 ) if and only if (𝑔◦𝑓 ) −
⋔𝑊.
Thus,
𝐼2 (𝑓 , 𝑔 −1 (𝑊 )) is defined if and only if 𝐼2 (𝑔◦𝑓 , 𝑊 ) is defined.
Now it remains to observe that the finite numbers satisfy
Solution (Exercise 13.2) (a) Let 𝑍 ⊂ 𝑌 be any closed submanifold with dim 𝑋 +
dim 𝑍 = dim 𝑌 . For dim 𝑋 ≥ 1, this implies dim 𝑌 > 0 and dim 𝑍 = dim 𝑌 −
dim 𝑋 < dim 𝑌 . In particular, 𝑍 is not all of 𝑌 and there exist points in 𝑌 which
are not in 𝑍.
We can assume that 𝑌 is connected, since 𝑓 being homotopic to a constant map
implies that the image of 𝑓 lies in only one connected component of 𝑌 . Hence any
part of 𝑍 in a different connected component satisfies 𝑓 −1 (𝑍) = ∅.
Assume that 𝑓 is homotopic to the constant map 𝑔0 ∶ 𝑋 → {𝑦0 } ⊂ 𝑌 . If 𝑦0 ∈ 𝑍,
then 𝑔 and 𝑍 do not meet transversally, since dim 𝑍 < dim 𝑌 and dim{𝑦0 } = 0.
But since 𝑌 is connected and a smooth manifold, it is path-connected. Hence there
is a path 𝛾 from 𝑦0 to a point 𝑦1 with 𝑦1 ∉ 𝑍. Then 𝑔0 is homotopic to the constant
map 𝑔1 ∶ 𝑋 → {𝑦1 } ⊂ 𝑌 by composing 𝑔0 with the path 𝛾. Since being smoothly
homotopic is a transitive relation, we have 𝑓 ≃ 𝑔1 .
Since 𝑦 ∉ 𝑍, we have 𝑔 −
1 1⋔ 𝑍. This means 𝑔 −1 (𝑍) = ∅, i.e.,
1
(c) Since 𝕊1 is compact, the intersection number 𝐼2 (id𝕊1 , {𝑥}) is defined for every
point 𝑥 ∈ 𝕊1 where we consider intersections in 𝕊1 , i.e., 𝑌 = 𝑋 = 𝕊1 . Since the
Appendix A. Solutions to exercises 417
id−1
𝕊1
(𝑥) = {𝑥}, we have 𝐼2 (id𝕊1 , {𝑥}) = 1 ≠ 0. But we just learned that if id𝕊1 was
homotopic to a constant map, then 𝐼2 (Id𝕊1 , {𝑥}) had to be zero, since dim 𝕊1 = 1.
Thus there is at least one map 𝕊1 → 𝕊1 which is not homotopic to a constant map.
In fact, up to homotopy there is one homotopy class of maps 𝕊1 → 𝕊1 for every
integer 𝑛 ∈ ℤ represented by taking 𝑛th powers in ℂ: 𝑧 → 𝑧𝑛 . We will learn more
about that later.
Solution (Exercise 13.3) (a) Let 𝑌 be contractible and dim 𝑌 > 0. Let 𝑓 ∶ 𝑋 → 𝑌
be a smooth map with 𝑋 compact and 𝑍 ⊂ 𝑌 closed, and dim 𝑋 + dim 𝑍 =
dim 𝑌 . Since 𝑌 is contractible, the identity map id𝑌 is homotopic to a constant
map 𝑌 → {𝑦} ⊂ 𝑌 . Thus 𝑓 is homotopic to the constant map 𝑋 → {𝑦} ⊂ 𝑌 .
Since dim 𝑌 > 0, we have {𝑦} ≠ 𝑌 and there are points in 𝑌 different from 𝑦.
Hence, since dim 𝑋 ≥ 1, the previous exercise implies 𝐼2 (𝑓 , 𝑍) = 0.
Finally, we recall that Euclidean space ℝ𝑘 is contractible for all 𝑘.
(b) If 𝑋 is compact, then the intersection number 𝐼2 (id𝑋 , {𝑥}) is defined for every
point 𝑥 ∈ 𝑋 where we consider intersections in 𝑋 (with 𝑌 = 𝑋 and 𝑍 = {𝑥}). For
the dimensions are complementary and id𝑋 meets every submanifold transversally.
This intersection number satisfies 𝐼2 (id𝑋 , {𝑥}) = #{𝑥} = 1, since 𝑥 has exactly
one preimage under id𝑋 .
If 𝑋 is contractible and dim 𝑋 ≥ 1, we can apply the previous point. Or we note
that we have id𝑋 ∼ 𝑔 where 𝑔 ∶ 𝑋 → {𝑦} is some constant map. Since dim 𝑋 ≥ 1,
we have 𝑔 −⋔ {𝑥} for 𝑥 ∈ 𝑋 if and only if 𝑥 ≠ 𝑦. Hence 𝐼2 (𝑔, {𝑥}) = #𝑔 −1 (𝑥) = 0.
However, since id𝑋 ≃ 𝑔, we have 𝐼2 (id𝑋 , {𝑥}) = 𝐼2 (𝑔, {𝑥}). This is impossible.
Hence, if dim 𝑋 ≥ 1, 𝑋 cannot be both compact and contractible.
If dim 𝑋 = 0, then the only way 𝑋 can be contractible is that it consists of just one
point. This is a compact space.
Thus
0 = 𝐼2 (𝜕𝑓 , 𝐶)
= #(𝜕𝑓 )−1 (𝐶)
= #((𝑋 ∩ 𝐶) × {0}) + #((𝑍 ∩ 𝐶) × {1})
= #(𝑋 ∩ 𝐶) + #(𝑍 ∩ 𝐶)
= 𝐼2 (𝑋, 𝐶) + 𝐼2 (𝑍, 𝐶).
and 𝑓1 being the embedding of 𝑍 ′ into 𝑋. (Note that, by abuse of notation, we have
not distinguished between a point in [−1, 1] × (−1, 1) and its equivalence class in
𝑋.) Moreover, there is only one point where 𝑍 and 𝑍 ′ meet, namely the image of
420 A.12. Intersection Theory modulo 2
𝑓 (0) in 𝑋, i.e., the point with coordinates 𝑝 = (0, 0). Finally, the intersection of 𝑍
and 𝑍 ′ is transverse. We check this locally for the induced maps on the codomains
of local charts: the tangent space to 𝑋 at 𝑝 (is the )
𝑥-axis in ℝ2 and the tangent space
1
to 𝑍 ′ at 𝑝 is the line spanned by the vector ∈ ℝ2 . Hence 𝑇𝑝 (𝑍) and 𝑇𝑝 (𝑍 ′ )
1∕2
span a two-dimensional subspace in the two-dimensional vector space 𝑇𝑝 (𝑋), i.e.,
they span all of 𝑇𝑝 (𝑋).
Thus we can conclude:
Solution (Exercise 13.8) (a) We have seen this map in Section 10.1 when we
introduced the idea of intersection theory. There we showed that 𝑓 is smooth by
checking it for the induced map on local charts. It is proper, since its domain 𝕊1 is
compact. It is injective, since cos(𝑡1 ∕2) = ± cos(𝑡2 ∕2) implies 𝑡2 = 𝑡1 + 2𝜋, and
similarly for sin.
(b) We consider 𝑍 = Im (𝑓 ) as a submanifold of ℝP2 . We now need to show we
can deform 𝑍 such that it intersects itself in an odd number of points. Thinking
of 𝑍 as being the image of the horizontal equator on 𝕊2 under the quotient map
𝕊2 → ℝP2 we could try the image of a vertical equator. So let 𝑔 ∶ 𝕊1 → ℝP2 be
the map defined by sending 𝑡 ∈ [0, 2𝜋] to [0 ∶ sin(𝑡∕2) ∶ cos(𝑡∕2)]. Just as for 𝑓 ,
we can check that 𝑔 is a smooth embedding. Properness and injectivity follow as
for 𝑓 . For smoothness we observe look at the effect of 𝑔 on the codomains of local
charts: We look at the diagram
𝑔
𝕊O 1 / ℝP2
𝜙𝑖 𝜓𝑗
𝑊𝑖 / ℝ2
𝑔
To add some visual understanding note that 𝐿 is the 𝑦-axis and 𝐿⟂ is the 𝑥𝑧-plane
in ℝ3 . In particular, 𝑇𝑝 (ℝP2 ) is a two-dimensional vector space. Hence, in order to
show transversality, it suffices to show that the one-dimensional subspaces 𝑇𝑝 (𝑍)
and 𝑇𝑝 (𝑍 ′ ) are spanned by two linearly independent vectors.
⎛0⎞
A linear map 𝐿 → 𝐿⟂ is determined by where it sends the vector 𝑣𝑝 = ⎜1⎟ which
⎜ ⎟
⎝0⎠
spans 𝐿. By abusing notation a bit, we can think of
as local parametrizations of ℝP2 around 𝑝. Then the image of 𝑑𝑓𝜋 is the one-
dimensional subspace in Homℝ (𝐿, 𝐿⟂ ) spanned by linear maps which send 𝐿 to
the 𝑥-axis. And the image of 𝑑𝑔𝜋 is the one-dimensional subspace in Homℝ (𝐿, 𝐿⟂ )
spanned by linear maps which send 𝐿 to the 𝑧-axis. These two subspaces together
span all of 𝑇𝑝 (ℝP2 ). Hence 𝑍 −
⋔ 𝑍 ′.
Finally, we need to check that 𝑓 and 𝑔 are homotopic. Let 𝜑 ∶ ℝ → ℝ be a smooth
bump function such that
{
0 𝑥 ≤ 1∕4
𝜑(𝑡) =
1 𝑥 ≥ 3∕4.
such that 𝐹 (𝑡, 0) = 𝑓 (𝑡) and 𝐹 (𝑡, 1) = 𝑔(𝑡) for all 𝑡. Summarising we have proven
𝐼2 (𝑍, 𝑍) = 1.
422 A.12. Intersection Theory modulo 2
(c) Recall that any constant map 𝑐 with value not in 𝑍 is transverse to 𝑍 and has mod
2-intersection number 𝐼2 (𝑐, 𝑍) = 0. Since ℝP2 is path-connected, all constant
maps with value in ℝP2 are homotopic to each other. Hence 𝑓 is not homotopic to
a constant map.
(d) We pick a point on ℝP2 , say 𝑄 = [0 ∶ 0 ∶ 1], and let 𝑞 ∶ 𝕊1 → ℝP2 be the constant
map with value 𝑄. Let 𝜑 ∶ ℝ → ℝ again be a smooth bump function such that
{
0 𝑥 ≤ 1∕4
𝜑(𝑡) =
1 𝑥 ≥ 3∕4.
does not work for 𝑓 , i.e., it does not provide a homotopy between 𝑓 and the constant
map 𝑐. One problem is that it does not induce a map on 𝕊1 × [0, 1], since
√ √
̃ 𝑠) = [ 1 − 𝜑(𝑠) ∶ 0 ∶ 𝜑(𝑠)]
𝐻(0,
whereas √ √
̃
𝐻(2𝜋, 𝑠) = [− 1 − 𝜑(𝑠) ∶ 0 ∶ 𝜑(𝑠)].
There is a minus sign occurring in the first coordinate because
Hence
̃ 𝑠) ≠ 𝐻(2𝜋,
𝐻(0, ̃ 𝑠) in ℝP2 when 0 < 𝜑(𝑠) < 1.
This indicates that we cannot continuously deform the loop corresponding to 𝑓 to
a constant map without cutting it open at some point. And the above calculation
of the intersection number for 𝑓 shows that it is impossible to remedy this defect.
Appendix A. Solutions to exercises 423
A.13 Orientation
(c) Subtracting from one 𝑣𝑖 a linear combination of the others corresponds to multi-
plying 𝛽 with a matrix that we obtain from the identity matrix by subtracting the
corresponding linear combination of rows from the 𝑖th row. We know from Linear
Algebra that this operation does not change the determinant of the matrix. Hence
the determinant of the change-of-basis-matrix is still +1.
(d) Suppose that 𝑉 is the direct sum of 𝑉1 and 𝑉2 . Let (𝑣1 , … , 𝑣𝑘 ) be an ordered
positively oriented basis of 𝑉1 and (𝑤1 , … , 𝑤𝑚 ) an ordered positively oriented
basis of 𝑉2 . Then (𝑣1 , … , 𝑣𝑘 , 𝑤1 , … , 𝑤𝑚 ) is an ordered positively oriented basis
of 𝑉1 ⊕ 𝑉2 , and (𝑤1 , … , 𝑤𝑚 , 𝑣1 , … , 𝑣𝑘 ) is an ordered positively oriented basis of
𝑉2 ⊕ 𝑉1 . Switching from the given positive basis of 𝑉1 ⊕ 𝑉2 to the positive basis of
𝑉2 ⊕ 𝑉1 corresponds to transposing exactly (dim 𝑉1 )(dim 𝑉2 ) many elements in the
basis. Hence the determinant of the change-of-basis-matrix is (−1)(dim 𝑉1 )(dim 𝑉2 ) .
Solution (Exercise 14.2) Let (𝑒1 , … , 𝑒𝑘 ) be the ordered basis of ℝ𝑘 which defines the
standard orientation of ℝ𝑘 . The orientation of ℍ𝑘 is given by the standard orientation
of ℝ𝑘 restricted to the subspace ℍ𝑘 ⊂ ℝ𝑘 . The boundary orientation of 𝜕ℍ𝑘 is given by
requiring that, at any point 𝑥 ∈ 𝜕ℍ𝑘 , the outward pointing unit normal vector 𝑛𝑥 = −𝑒𝑘
fits into a positively oriented basis for ℝ𝑘
⎛0 1 … 0⎞
⎜0 0 1 … 0⎟
⎜ ⎟
𝐴=⎜⋮ ⋮ ⋱ ⋮⎟ .
⎜0 0 … 1⎟
⎜−1 0 … 0⎟⎠
⎝
The matrix 𝐴 can be transormed into the diagonal matrix
⎛1 0 … 0 ⎞
⎜0 1 … 0 ⎟
𝐷=⎜
⎜⋮ ⋱ ⋮ ⎟⎟
⎝0 0 … −1⎠
424 A.13. Orientation
by interchanging two columns exactly 𝑘 − 1 times. Hence det(𝐷) = (−1)𝑘−1 det(𝐴). But
det(𝐷) = −1. Thus det(𝐴) = 1 > 0 if and only if (−1)𝑘 = 1, i.e., if 𝑘 is even.
(b) The boundary orientation of 𝕊𝑘 is, at any point 𝑥 ∈ 𝕊𝑘 , given on 𝑇𝑥 (𝕊𝑘 ) by chosing
the ordered basis (𝑛𝑥 , 𝑣1 , … , 𝑣𝑘 ) to be positively oriented where 𝑛𝑥 is the outward
pointing unit normal vector in 𝑇𝑥 (ℝ𝑘+1 ) = ℝ𝑘+1 and (𝑣1 , … , 𝑣𝑘 ) is an ordered basis
of 𝑇𝑥 (𝕊𝑘 ). But since 𝕊𝑘 ⊂ ℝ𝑘+1 is of codimension one, 𝑛𝑥 spans the orthogonal
complement 𝑁𝑥 (𝕊𝑘 , ℝ𝑘+1 ) of 𝑇𝑥 (𝕊𝑘 ) in ℝ𝑘+1 . Hence the orientation of 𝑇𝑥 (𝕊𝑘 )
induced by the direct sum
Solution (Exercise 14.4) Assume that 𝑑𝑓𝑥0 ∶ 𝑇𝑥0 (𝑋) → 𝑇𝑓 (𝑥0 ) (𝑌 ) preserves orien-
tation at some point 𝑥0 ∈ 𝑋. Since 𝑓 is a diffeomorphism, 𝑑𝑓𝑥 is an isomorphism for
all 𝑥 ∈ 𝑋. Hence det(𝑑𝑓𝑥 ) ≠ 0 for all 𝑥 ∈ 𝑋. In particular, the two disjoint open
subsets 𝑈 ∶= {𝑥 ∈ 𝑋 ∶ det(𝑑𝑓𝑥 ) > 0} and 𝑉 ∶= {𝑥 ∈ 𝑋 ∶ det(𝑑𝑓𝑥 ) < 0} cover 𝑋.
By assumption 𝑥0 ∈ 𝑈 , and hence 𝑈 is nonempty. Since 𝑋 is connected, this implies
𝑈 = 𝑋.
and the fact that 𝑑(𝑖𝑦 )|𝑁𝑦 (𝑆,𝑋) is an isomorphism onto its image. Since all these vector
spaces are subspaces in 𝑇𝑦 (𝑌 ), and are oriented as subspaces of 𝑇𝑦 (𝑌 ), we can identify
𝑁𝑦 (𝑆, 𝑋) with its image under 𝑑𝑖𝑦 in 𝑇𝑦 (𝑌 ) and can rewrite this equation as
𝑁𝑦 (𝑆, 𝑋) ⊕ 𝑇𝑦 (𝑍) = 𝑇𝑦 (𝑌 ).
Now let 𝑁𝑦 (𝑆, 𝑍) be the orthogonal complement of 𝑇𝑦 (𝑆) in 𝑇𝑦 (𝑍). Then the orien-
tation of 𝑇𝑦 (𝑆) is determined by the direct sum
We learned in the first exercise that the signs of the orientations of 𝑁𝑦 (𝑆, 𝑋) ⊕
𝑁𝑦 (𝑆, 𝑍) and 𝑁𝑦 (𝑆, 𝑍) ⊕ 𝑁𝑦 (𝑆, 𝑋) differ by (−1)(dim 𝑁𝑦 (𝑆,𝑋))(dim 𝑁𝑦 (𝑆,𝑍)) . Now it re-
mains to remark that, by definition of the normal spaces as orthogonal complements, we
have
Solution (Exercise 14.6) First, we assume that 𝑍 is orientable. At any point 𝑧 ∈ 𝑍, let
(𝑣1 , … , 𝑣𝑛 ) be an oriented basis of 𝑇𝑧 𝑍. Since 𝑇𝑧 𝑍 is a vector subspace of codimension
one in 𝑇𝑧 𝑋 we can choose a single vector 𝑛𝑧 ∈ (𝑇𝑧 𝑍)⟂ = 𝑁𝑧 (𝑍, 𝑋) ⊂ 𝑇𝑧 𝑋 such that
(𝑛𝑧 , 𝑣1 , … , 𝑣𝑛 ) is an oriented basis of 𝑇𝑧 𝑋. Since both 𝑋 and 𝑍 are oriented we can
make this choice of 𝑛𝑧 smoothly for every 𝑧 ∈ 𝑍. Since 𝑛𝑧 has to be non-trivial in order
to be part of a basis we can define a diffeomorphism
Solution (Exercise 14.7) (a) Any basis of 𝑉 ×𝑉 consists of the product (𝛼 ×0, 0×𝛽)
426 A.13. Orientation
where 𝛼 and 𝛽 are ordered bases of 𝑉 . The sign of this basis satisfies
Switching the orientation of 𝑉 changes both signs, sign (𝛼) and sign (𝛽). Changing
both signs simultaneously results in multiplying with (−1)2 = 1. Hence the sign
of the basis of 𝑉 × 𝑉 is independent of the choice of orientation for 𝑉 .
Changing the orientation of 𝑋 means changing the orientation of both 𝑇𝑥 (𝑋) and
𝑇𝑦 (𝑋). As in the previous point, this means multiplying the sign of any ordered
basis of 𝑇(𝑥,𝑦) (𝑋 × 𝑋) by +1. Hence the product orientation on 𝑋 × 𝑋 is the same
for all choices of orientation on 𝑋.
(c) Let 𝑋 be a smooth manifold which is not orientable. Any Euclidean space ℝ𝑚
is oriented as a manifold by the choice of the standard orientation of the tangent
space 𝑇𝑧 (ℝ𝑚 ) = ℝ𝑚 for any 𝑧 ∈ ℝ𝑚 . For any points 𝑥 ∈ 𝑋 and 𝑧 ∈ ℝ𝑚 , the
tangent space 𝑇(𝑥,𝑧) (𝑋 × ℝ𝑚 ) is just 𝑇𝑥 (𝑋) × ℝ𝑚 . If there was a smooth choice
for an orientation of 𝑋 × ℝ𝑚 , then each tangent space 𝑇𝑥 (𝑋) of 𝑋 would inherit a
smooth choice of orientation from the product 𝑇𝑥 (𝑋) × ℝ𝑚 . This contradicts the
non-orientability of 𝑋.
Now let 𝑌 by any smooth manifold. If 𝑋 × 𝑌 was orientable, then also 𝑋 × 𝑈 for
an open subspace 𝑈 ⊂ 𝑌 which is diffeomorphic to some ℝ𝑚 . But then 𝑋 × ℝ𝑚
would also inherit an orientation which is not possible. Applied to 𝑌 = 𝑋, we see
that 𝑋 × 𝑋 is not orientable.
Solution (Exercise 14.8) To determine the fiber over 𝑏, we write 𝑧0 = 𝑥0 + 𝑖𝑦0 and
𝑧1 = 𝑥1 + 𝑖𝑦1 . Then we get
1
𝜋(𝑧0 , 𝑧1 ) = (0, 1, 0) ⇒ 2𝑧0 𝑧̄ 1 = 𝑖 and |𝑧0 |2 = |𝑧1 |2 =
2
1
⇒ 𝑦0 = 𝑥1 , 𝑦1 = −𝑥0 and 𝑥20 + 𝑥21 = .
2
Thus the fiber over 𝑏 has the form
𝑖
𝜋 −1 (𝑏) = {(𝑧0 , 𝑧1 ) ∈ 𝕊3 ∶ 𝑧̄ 1 = }
2𝑧0
= {(𝑥0 , 𝑦0 , 𝑥1 , 𝑦1 ) ∈ 𝕊3 ∶ 𝑦0 = 𝑥1 , 𝑦1 = −𝑥0 }.
Let 𝑞 = (𝑥0 , 𝑥1 , 𝑥1 , −𝑥0 ) ∈ 𝜋 −1 (𝑏) be a point in the fiber over 𝑏. Since not both 𝑥0
and 𝑥1 can be zero, we assume that 𝑥0 ≠ 0. The tangent space 𝑇𝑞 𝕊3 is the vector space
𝑇𝑞 𝕊3 = {𝐮 ∈ ℝ4 ∶ 𝐮 ⟂ 𝑞}
⎧ ⎛−𝑥1 ⎞ ⎛−𝑥1 ⎞ ⎛𝑥0 ⎞⎫
⎪ ⟂ ⎜ 𝑥0 ⎟ ⟂ ⎜ 0 ⎟ ⟂ ⎜ 0 ⎟⎪
= span ⎨𝑞1 = ⎜ ⎟ , 𝑞2 = ⎜ 𝑥 ⎟ , 𝑞3 = ⎜ 0 ⎟⎬ .
⎪ ⎜ 0 ⎟ ⎜ 0⎟ ⎜ ⎟⎪
⎩ ⎝ 0 ⎠ ⎝ 0 ⎠ ⎝𝑥0 ⎠⎭
The orientation of 𝑇𝑞 𝕊3 as a boundary of the unit ball is such that the outward point-
ing vector 𝑞 together with the basis vectors of 𝑇𝑞 𝕊3 form a positively oriented basis of
ℝ4 . The matrix expressing the basis (𝑞, 𝑞1⟂ , 𝑞2⟂ , 𝑞3⟂ ) in the standard basis of ℝ4 is
⎛ 𝑥0 −𝑥1 −𝑥1 𝑥0 ⎞
⎜ 𝑥1 𝑥0 0 0⎟
⎜𝑥 0 𝑥 0 ⎟.
⎜ 1 0 ⎟
⎝−𝑥0 0 0 𝑥0 ⎠
The determinant of this matrix is
2𝑥40 + 2𝑥20 𝑥21 = 2𝑥20 (𝑥20 + 𝑥21 ) = 𝑥20 > 0.
In particular, it is positive and the basis (𝑞1⟂ , 𝑞2⟂ , 𝑞3⟂ ) is a positively oriented basis of 𝑇𝑞 𝕊3 .
The tangent space 𝑇𝑞 𝜋 −1 (𝑏) equals the kernel of 𝑑 𝜋̃𝑞 . In a previous exercise, we
computed this map as represented by the matrix
⎛𝑥1 −𝑥0 𝑥0 𝑥1 ⎞
𝑑 𝜋̃𝑞 = 2 ⋅ ⎜𝑥0 𝑥1 𝑥1 −𝑥0 ⎟ .
⎜ ⎟
⎝𝑥0 𝑥1 −𝑥1 𝑥0 ⎠
⎛−𝑥1 ⎞
⎜𝑥 ⎟
The kernel of this map is the span of the vector 𝑞0⟂ = ⎜ 0 ⎟. The normal space
⎜ 𝑥0 ⎟
⎝ 𝑥1 ⎠
𝑁𝑞 (𝜋 −1 (𝑏); 𝕊3 ) ⊂ 𝑇𝑞 𝕊3 of vectors which are orthogonal to 𝑇𝑞 𝜋 −1 (𝑏) is the span of
(𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ ). The map 𝑑 𝜋̃𝑞 sends 𝑞1⟂ − 𝑞2⟂ and 𝑞3⟂ to, respectively,
⎛ −2𝑥20 ⎞ ⎛−2𝑥0 𝑥1 ⎞
⟂ ⟂
𝑑 𝜋̃𝑞 (𝑞1 − 𝑞2 ) = 2 ⎜ 0 ⎟ , 𝑑 𝜋̃𝑞 (𝑞3 ) = 2 ⎜ 0 ⎟ .
⟂
⎜ ⎟ ⎜ 2 ⎟
⎝−2𝑥0 𝑥1 ⎠ ⎝ 2𝑥0 ⎠
428 A.13. Orientation
These two vectors form a basis (𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ), 𝑑 𝜋̃𝑞 (𝑞3⟂ )) of 𝑇𝑏 𝕊2 . We need to check the
orientation of this basis.
The tangent space 𝑇𝑏 𝕊2 has a basis (𝐞31 , 𝐞33 ) as a subspace in ℝ3 . This basis is negatively
oriented, since, together with the outward pointing vector 𝑏 = 𝐞32 , the basis (𝐞32 , 𝐞31 , 𝐞33 )
is a negatively oriented basis of ℝ3 . For we need to make one permutation to get the
standard basis which leads to multiplying the sign with −1. The matrix 𝐵 which expresses
(𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ), 𝑑 𝜋̃𝑞 (𝑞3⟂ )) in terms of the basis (𝐞31 , 𝐞33 ) is given by
( )
−𝑥20 −𝑥0 𝑥1
𝐵 =4⋅ .
−𝑥0 𝑥1 𝑥20
We see that det 𝐵 = 16(−𝑥40 − 𝑥20 𝑥21 ) = −16𝑥20 (𝑥20 + 𝑥21 ) = −8𝑥20 < 0 is negative. Hence
the basis (𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ), 𝑑 𝜋̃𝑞 (𝑞3⟂ )) is a negatively oriented basis of 𝑇𝑏 𝕊2 . This defines an
orientation on the normal space 𝑁𝑞 (𝜋 −1 (𝑏); 𝕊3 ) by declaring the orientation of the basis
(𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ ) to be negative.
𝑁𝑞 (𝜋 −1 (𝑏); 𝕊3 ) ⊕ 𝑇𝑞 𝜋 −1 (𝑏) = 𝑇𝑞 𝕊3
⎛1 0 1 ⎞
⎜−1 0 1 ⎟.
⎜ ⎟
⎝ 0 1 𝑥1 ∕𝑥0 ⎠
The determinant of this matrix is −2. In particular, it is negative. Since we checked that
the basis (𝑞1⟂ , 𝑞2⟂ , 𝑞3⟂ ) is a negatively oriented basis, we see that the orientation of the basis
(𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ , 𝑞0⟂ ) of 𝑇𝑞 𝕊3 is positive. Since the sign of (𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ ) is positive as a basis
of 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ), we need that the basis 𝑞0⟂ also has positive sign. Hence the vector 𝑞0⟂
provides a positively oriented basis of 𝑇𝑞 𝜋 −1 (𝑏).
Appendix A. Solutions to exercises 429
A.14 The Brouwer Degree
Solution (Exercise 15.1) (a) For 1 ≤ 𝑖 ≤ 𝑘 + 1, let 𝑟𝑖 be the reflection map on the
𝑖th coordinate:
The antipodal map is equal to the composition of reflections 𝑟1 ◦𝑟2 ◦ ⋯ ◦𝑟𝑘+1 . Each
reflection 𝑟𝑖 is a diffeomorphism which reverses the orientation on 𝕊𝑘 . The com-
position of two such reflections 𝑟𝑖 ◦𝑟𝑖+1 , however, is then a diffeomorphism which
preserves the orientation on 𝕊𝑘 . Hence, if 𝑘 = 2𝑛 is even, 𝑎 = 𝑟1 ◦𝑟2 ◦ ⋯ ◦𝑟2𝑛+1
has degree −1. Whereas if 𝑘 = 2𝑛 − 1 is odd, 𝑎 = 𝑟1 ◦𝑟2 ◦ ⋯ ◦𝑟2𝑛 has degree +1.
In other words, deg(𝑎) = (−1)𝑘+1 .
(b) As pointed out, we know that 𝑎 is homotopic to the identity if 𝑘 is odd. By the
previous point, deg(𝑎) = −1 if 𝑘 is even. Since deg is homotopy invariant, deg(𝑎) =
−1 ≠ 1 = deg(1id) implies that the antipodal map is not homotopic to the identity
if 𝑘 is even.
Solution (Exercise 15.2) Recall that we have proven the existence part for 𝑘 odd
before:
If 𝑘 is odd, then 𝑘 + 1 is even and we can define the map
𝑠 ∶ 𝕊𝑘 → ℝ𝑘+1 , (𝑥1 , … , 𝑥𝑘+1 ) → (−𝑥2 , 𝑥1 , −𝑥3 , 𝑥4 , … , −𝑥𝑘+1 , 𝑥𝑘 ).
This map can be extended to a linear map ℝ𝑘+1 → ℝ𝑘+1 and therefore 𝑠 is smooth. For
each 𝑥 ∈ 𝕊𝑘 , 𝑠(𝑥) is nonzero and satisfies 𝑥 ⟂ 𝑠(𝑥). Thus 𝑠(𝑥) is a tangent vector at 𝑥,
i.e. 𝑠(𝑥) ∈ 𝑇𝑥 (𝕊𝑘 ) ⧵ {0}. Hence
𝜎 ∶ 𝕊𝑘 → 𝑇 (𝕊𝑘 ), 𝜎(𝑥) ∶= (𝑥, 𝑠(𝑥))
is the desired non-vanishing vector field on 𝕊𝑘 .
Recall that we also have shown that if 𝕊𝑘 has a vector field which has no zeros, then
its antipodal map 𝑥 → −𝑥 is homotopic to the identity:
Given a vector field 𝜎 ∶ 𝕊𝑘 → 𝑇 (𝕊𝑘 ) which has no zeros. Let 𝜎(𝑥) = (𝑥, 𝑣(𝑥)). Since
𝑣(𝑥) ≠ 0 for every 𝑥 ∈ 𝕊𝑘 , we can define a new vector field by
𝑣(𝑥)
𝑥 → 𝑤(𝑥) = .
|𝑣(𝑥)|
430 A.14. The Brouwer Degree
By replacing 𝑠 with this new non-vanishing vector field, we can assume |𝑣(𝑥)| = 1.
Hence we can assume 𝑣(𝑥) ∈ 𝕊𝑘 and 𝑣(𝑥) ⋅ 𝑥 = 0 for every 𝑥 ∈ 𝕊𝑘 .
Now we define the map
where we use 𝑥⋅𝑥 = 1 = 𝑣(𝑥)⋅𝑣(𝑥) and 𝑥⋅𝑣(𝑥) = 0. Thus 𝐹 (𝑥, 𝑡) is a vector of norm 1 for
every 𝑥 and every 𝑡. Moreover, 𝐹 is a smooth map with 𝐹 (𝑥, 0) = 𝑥 and 𝐹 (𝑥, 1) = −𝑥,
i.e. 𝐹 is a smooth homotopy from the identity to the antipodal map on 𝕊𝑘 .
Hence, by homotopy invariance of deg, if 𝕊𝑘 has a vector field which has no zeros,
then deg(𝑎) = deg(id) = 1. By the previous exercise, since deg(𝑎) = (−1)𝑘+1 and hence
𝑘 must be even.
Note that we would not have been able to make this conclusion with the mod 2-degree.
For in ℤ∕2, we cannot distinguish between 1 and −1.
Solution (Exercise 15.3) We begin as in the solution to Exercise 11.4. We use the
homotopy from 𝑝0 (𝑧) = 𝑧𝑚 to 𝑝1 (𝑧) = 𝑝(𝑧) defined by
We then observe that, if 𝑊 is a closed ball around the origin in ℂ with sufficiently
large radius, none of the 𝑝𝑡 has a zero on 𝜕𝑊 . Hence the homotopy
𝑝𝑡
∶ 𝜕𝑊 → 𝕊1
|𝑝𝑡 |
Thus, if 𝑚 > 0, 𝑝∕|𝑝| does not extend to all of 𝑊 , since otherwise its degree had to
be zero. Hence 𝑝 must have a zero inside 𝑊 .a
a
Note that the final argument was not available for deg2 if 𝑚 is even.
Appendix A. Solutions to exercises 431
Solution (Exercise 15.4) Let us try to set up the argument we used in Exercise 15.3:
Let
𝑝(𝑥) = 𝑥𝑚 + 𝑎1 𝑥𝑚−1 + ⋯ + 𝑎𝑚
be a monic real polynomial. Define a homotopy from 𝑝0 (𝑥) = 𝑥𝑚 to 𝑝1 (𝑥) = 𝑝(𝑥) by
Solution (Exercise 15.5) (a) First, 𝑔 is in fact a map 𝕊1 → 𝜕𝔻0 , since any point 𝑧
with |𝑧| = 1 is sent to point with
|𝑔(𝑧) − 𝑧0 | = 𝑧0 + 𝑟𝑧 − 𝑧0 = |𝑟𝑧| = 𝑟 since |𝑧| = 1.
Moreover, 𝑔 is smooth and has an inverse given by
𝑧 − 𝑧0
𝑔 −1 ∶ 𝜕𝔻0 → 𝕊1 , 𝑧 → .
|𝑧 − 𝑧0 |
Note that 𝑔 −1 is also smooth, since both taking norms and dividing by |𝑧 − 𝑧0 | are
smooth operations in ℂ ⧵ {𝑧0 }.
Hence we know that 𝑔 either preserves or reverses orientations. To check that 𝑔
preserves orientations, it suffices to note that the derivative of 𝑔 at any point, for
example 𝑑𝑔1 , is given by multiplication with 𝑟 > 0.
(b) Let 𝑦 ∈ 𝕊1 . Since 𝑔 is a diffeomorphism, 𝑦 is a regular value for 𝑝 if and only if it
is a regular value for 𝑝◦𝑔. Moreover, 𝑔 defines a bijection between the finite sets
{𝑧 ∈ 𝜕𝐷0 |𝑝(𝑧) = 𝑦} and {𝑧 ∈ 𝕊1 |𝑝(𝑔(𝑧)) = 𝑦}. Since 𝑔 preserves orientations,
the orientation numbers at each point in these sets agree. Hence the degrees of the
above maps are the same.
(c) Since 𝑝(𝑧) ≠ 0 for all 𝑧 ∈ 𝔻0 ⧵ {𝑧0 } by our choice of 𝔻0 , we have |𝑞(𝑧0 + 𝑡𝑟𝑧)| ≠ 0
for all |𝑧| = 1. Hence we have a well-defined smooth map
𝑧𝑚 𝑞(𝑧0 + 𝑡𝑟𝑧)
𝐻 ∶ 𝕊1 × [0, 1] → 𝕊1 , (𝑧, 𝑡) → .
|𝑞(𝑧0 + 𝑡𝑟𝑧)|
432 A.14. The Brouwer Degree
𝑧𝑚 𝑞(𝑧0 ) 𝑞(𝑧0 ) 𝑚
ℎ0 (𝑧) = = ⋅𝑧
𝑞(𝑧0 )| 𝑞(𝑧0 )|
and
𝑧𝑚 𝑞(𝑧0 + 𝑟𝑧) (𝑟𝑧)𝑚 𝑞(𝑧0 + 𝑟𝑧)
ℎ1 (𝑧) = = 𝑚
|𝑞(𝑧0 + 𝑟𝑧)| 𝑟 |𝑞(𝑧0 + 𝑟𝑧)|
𝑚
(𝑧0 + 𝑟𝑧 − 𝑧0 ) 𝑞(𝑧0 + 𝑟𝑧)
=
|𝑧0 + 𝑟𝑧 − 𝑧0 |𝑚 |𝑞(𝑧0 + 𝑟𝑧)|
𝑝(𝑔(𝑧))
= .
|𝑝(𝑔(𝑧))|
Hence ℎ𝑡 is the desired homotopy.
(d) We know from the calculation in Example 15.10 that the map 𝕊1 → 𝕊1 , 𝑧 → 𝑧𝑚 ,
has degree 𝑚. Since multiplying with a constant 𝑐 does not change the degree,
we also have deg(𝑧 → 𝑐 ⋅ 𝑧𝑚 ) = deg(ℎ0 ) = 𝑚. Since degrees are homotopy
invariant, the previous point shows deg(ℎ1 ) = deg(ℎ0 ). As we observed before,
deg(𝑝∕|𝑝|) = deg(ℎ1 ). Hence we can conclude deg(𝑝∕|𝑝|) = 𝑚.
Solution (Exercise 15.6) Let 𝑧 be a regular value for 𝑔. If 𝑧 is not in the image of 𝑔,
then deg(𝑔) and deg(𝑔◦𝑓 ) both vanish and the claim is true. So let us assume that 𝑧 is in
the image of 𝑔. We showed in a previous exercise that 𝑧 then is a regular value for 𝑔◦𝑓
if and only if every 𝑦 ∈ 𝑔 −1 (𝑧) is a regular value for 𝑓 . Hence we can use 𝑧 to compute
deg(𝑔◦𝑓 ) if and only if we can use 𝑦 ∈ 𝑔 −1 (𝑧) to compute deg(𝑓 ). If 𝑧 is not in the image
of 𝑔◦𝑓 , then 𝑦 is not in the image of 𝑓 . In this case both deg(𝑔◦𝑓 ) and deg(𝑓 ) vanish
and the claim is true.
So assume there are points 𝑥 ∈ 𝑋 with 𝑔(𝑓 (𝑥)) = 𝑧. Given such an 𝑥, the chain rule
gives us
𝑑(𝑔◦𝑓 )𝑥 = 𝑑𝑔𝑓 (𝑥) ◦𝑑𝑓𝑥
Since 𝑥 and 𝑓 (𝑥) are regular points by assumption, all three derivatives in this equa-
tion are isomorphisms. Recall that we equip 𝑥 with orientation number +1 if 𝑑𝑓𝑥 pre-
serves orientation and with orientation number −1 if 𝑑𝑓𝑥 reverses orientation. Moreover,
whether 𝑑𝑓𝑥 preserves or reverses orientation is determined by the sign of its determi-
nant. Similarly, for 𝑓 (𝑥) with respect to 𝑑𝑔𝑓 (𝑥) and 𝑥 with respect to 𝑑(𝑔◦𝑓 )𝑥 . By the
chain rule and since the determinant is a multiplicative function, we get the formula for
signs:
sign (det(𝑑(𝑔◦𝑓 )𝑥 )) = sign (det(𝑑𝑔𝑓 (𝑥) )) ⋅ sign (det(𝑑𝑓𝑥 )).
Appendix A. Solutions to exercises 433
Since the degree is the sum of the orientation numbers at all points in the fiber, we get
∑
deg(𝑔◦𝑓 ) = sign (det(𝑑(𝑔◦𝑓 )𝑥 ))
𝑥∈(𝑔◦𝑓 )−1 (𝑧)
∑
= [sign (det(𝑑𝑔𝑓 (𝑥) )) ⋅ sign (det(𝑑𝑓𝑥 ))]
𝑥∈(𝑔◦𝑓 )−1 (𝑧)
( ) ( )
∑ ∑
= sign (det(𝑑𝑔𝑦 )) ⋅ sign (det(𝑑𝑓𝑥 ))
𝑦∈𝑔 −1 (𝑧) 𝑥∈𝑓 −1 (𝑦)
= deg(𝑔) ⋅ deg(𝑓 ).
Note that we used here that we can compute deg 𝑓 using any regular value 𝑦 ∈ 𝑌 for 𝑓
with 𝑔(𝑦) = 𝑧.
which implies deg(𝑓 ) = (−1)𝑘+1 . Thus, if 𝑓 does not have fixed point, then deg(𝑓 ) =
(−1)𝑘+1 . This proves the claim.
Solution (Exercise 15.8) At every point [𝑥] ∈ ℝP𝑘 , the fiber under 𝑞 consists of two
antipodal points 𝑞 −1 ([𝑥]) = {𝑥, −𝑥} ⊂ 𝕊𝑘 . Let 𝑎 ∶ 𝕊𝑘 → 𝕊𝑘 be the antipodal map. We
have 𝑞(𝑥) = 𝑞(−𝑥) for all 𝑥 ∈ 𝕊𝑘 , i.e., we have 𝑞 = 𝑞◦𝑎. This implies 𝑑𝑞−𝑥 ◦𝑑𝑎𝑥 = 𝑑𝑞𝑥
for all 𝑥 ∈ 𝕊𝑘 . In particular, this implies det(𝑑𝑞−𝑥 ) ⋅ det(𝑑𝑎𝑥 ) = det(𝑑𝑞𝑥 ). If 𝑘 is odd,
then 𝑎 preserves orientation, i.e, det(𝑑𝑎𝑥 ) > 0. Thus, det(𝑑𝑞−𝑥 ) and det(𝑑𝑞𝑥 ) have the
same sign. This implies that 𝑥 and −𝑥 contribute to the degree with the same orientation
number. Hence deg(𝑞) is either +2 or −2. This also shows that, if det(𝑑𝑞𝑥 ) > 0, i.e., if 𝑞
preserves orientations, then deg(𝑞) = +2.
(c) Since 𝑘 is odd, the degree of the quotient map 𝑞 ∶ 𝕊𝑘 → ℝP𝑘 is defined. Since 𝑔
satisfies 𝑔(𝑥) = 𝑔(−𝑥) and 𝑞 is a quotient map, 𝑔 can be written as a composition
𝑞 [𝑔]
𝕊𝑘 ←←→
← ℝP𝑘 ←←←←←→
← 𝕊𝑘 .
(d) The previous points show that the assumption that 𝑓 ∶ 𝕊𝑘 → 𝕊𝑘 sends no pair of
antipodal points to antipodal points implies that deg(𝑓 ) = deg(𝑔) must be even.
Since deg(𝑓 ) is odd, there must be at least one pair of antipodal points 𝑥0 , −𝑥0
such that 𝑓 (−𝑥0 ) = −𝑓 (𝑥0 ). This proves the claim.
Appendix A. Solutions to exercises 435
A.15 Linking Number and the Hopf Invariant
Solution (Exercise 15.10) (a) Let 𝑧 ∈ 𝕊𝑘 be a regular value for 𝜆 and let (𝑥, 𝑦) ∈
𝑋 × 𝑌 with 𝜆(𝑥, 𝑦) = 𝑧. Consider the derivative
As we learned in Section 14.2.1, switching the order of the factors in the domain
corresponds to making dim 𝑋 ⋅dim 𝑌 many flips of basis vectors. Each flip requires
to multiply orientation numbers with a factor (−1). Hence the map 𝑠 ∶ 𝑋 × 𝑌 →
𝑌 × 𝑋 induces multiplication by (−1)𝑚𝑛 on the degree. In addition, we change the
value of 𝜆 by sending 𝑥 − 𝑦 to 𝑦 − 𝑥 = −(𝑥 − 𝑦). Hence we compose with the
antipodal map 𝑎 on 𝕊𝑘 . This map has degree (−1)𝑘+1 as we learned in a previous
exercise. Still using that we showed in an exercise that deg sends composition of
maps to products, this implies in total
where we use 𝑘 = 𝑚 + 𝑛.
(b) Since 𝑌 does not have a boundary, the product 𝑊 × 𝑌 is a smooth manifold with
boundary 𝜕(𝑊 × 𝑌 ) = 𝜕𝑊 × 𝑌 = 𝑋 × 𝑌 . Since 𝑊 and 𝑌 are disjoint, 𝜆 extends
to a smooth map on 𝑊 × 𝑌 . Then the Boundary Theorem for degrees implies that
deg(𝜆) = 0.
Solution (Exercise 15.11) (a) Since taking preimages preserves codimensions, the
dimension of 𝑓 −1 (𝑤) and 𝑓 −1 (𝑧) for any regular values 𝑤 and 𝑧 for 𝑓 is given by
Hence, if 𝑛 is odd, then dim 𝑓 −1 (𝑤) + 1 = dim 𝑓 −1 (𝑧) = 𝑛 are odd and hence
𝐻(𝑓 ) = 𝐿(𝑓 −1 (𝑤), 𝑓 −1 (𝑧)) = −𝐿(𝑓 −1 (𝑧), 𝑓 −1 (𝑤)) = −𝐻(𝑓 ) by a previous exer-
cise. Thus we must have 𝐻(𝑓 ) = 0.
(b) By Sard’s Theorem we can find a point 𝑎 ∈ 𝕊𝑛 which is a regular value for both 𝑔
and 𝑔◦𝑓 . The fiber 𝑔 −1 (𝑎) consists of a finite number of points, say 𝑎1 , … , 𝑎𝑟 in
𝕊𝑛 . We can assume that these points are ordered such that the orientation numbers
of 𝑔 at 𝑎1 , … , 𝑎𝑝 are positive, while the orientation numbers of 𝑔 at 𝑎𝑝+1 , … , 𝑎𝑟
are negative. Then we have deg(𝑔) = 2𝑝 − 𝑟. The fiber 𝑓 −1 (𝑔 −1 (𝑎)) then consists
436 A.15. Linking Number and the Hopf Invariant
Similarly, we can choose 𝑏 such that the fiber 𝑓 −1 (𝑔 −1 (𝑏)) consists of a disjoint
union of the submanifolds 𝑓 −1 (𝑏𝑗 ) ⊂ 𝕊2𝑛−1 for 𝑗 = 1, … , 𝑠 and such that the
𝑓 −1 (𝑎𝑖 ) and 𝑓 −1 (𝑏𝑗 are mutually disjoint. Again we order these points are ordered
such that the orientation numbers of 𝑔 at 𝑏1 , … , 𝑏𝑠 are positive, while the orienta-
tion numbers of 𝑔 at 𝑏𝑞+1 , … , 𝑏𝑠 are negative. Then we have deg(𝑔) = 2𝑞 − 𝑠. The
fiber 𝑓 −1 (𝑔 −1 (𝑏)) then consists of a disjoint union of the oriented submanifolds
𝑓 −1 (𝑎𝑖 ) ⊂ 𝕊2𝑛−1 :
𝑓 −1 (𝑔 −1 (𝑎)) × 𝑓 −1 (𝑔 −1 (𝑏))
= ⊔𝑖,𝑗 (±𝑓 −1 (𝑎𝑖 )) × (±𝑓 −1 (𝑏𝑗 ))
= ⊔𝑝,𝑞
𝑖=1,𝑗=1
(𝑓 −1 (𝑎𝑖 ) × 𝑓 −1 (𝑏𝑗 )) ⊔ ⊔𝑟,𝑞
𝑖=𝑝+1,𝑗=1
(−(𝑓 −1 (𝑎𝑖 ) × 𝑓 −1 (𝑏𝑗 )))
⊔ ⊔𝑝,𝑠
𝑖=1,𝑗=𝑞+1
(−(𝑓 −1 (𝑎𝑖 ) × 𝑓 −1 (𝑏𝑗 ))) ⊔ ⊔𝑟,𝑠
𝑖=𝑝+1,𝑗=𝑞+1
(𝑓 −1 (𝑎𝑖 ) × 𝑓 −1 (𝑏𝑗 )).
𝐻(𝑔◦𝑓 ) = deg(𝜆)
∑
𝑝,𝑞
∑
𝑟,𝑞
∑
𝑝,𝑠
∑
𝑟,𝑠
= deg(𝜆𝑖𝑗 ) − deg(𝜆𝑖𝑗 ) − deg(𝜆𝑖𝑗 ) + deg(𝜆𝑖𝑗 )
𝑖=1,𝑗=1 𝑖=𝑝+1,𝑗=1 𝑖=1,𝑗=𝑞+1 𝑖=𝑝+1,𝑗=𝑞+1
( )2
(2𝑧0 𝑧̄ 1 ) ⋅ (2𝑧̄ 0 𝑧1 ) + |𝑧0 |2 − |𝑧1 |2
= 4|𝑧0 |2 |𝑧1 |2 + |𝑧0 |4 − 2|𝑧0 |2 |𝑧1 |2 + |𝑧1 |4
( )2
= |𝑧0 |2 + |𝑧1 |2
=1
Remembering that neither of the numbers can be zero, the left hand equation gives
𝑤 𝑧̄
us 𝑧 0 = 𝑤̄1 . Moreover, we have |𝑤0 |2 − |𝑤1 |2 = |𝑧0 |2 − |𝑧1 |2 and |𝑤0 |2 + |𝑤1 |2 =
0 1
1 = |𝑧0 |2 +|𝑧1 |2 . Putting these together implies 𝑧20 = 𝑤20 and 𝑧21 = 𝑤21 . This shows
that the desired 𝛼 with 𝛼 𝛼̄ = 1, i.e., 𝛼̄ = 𝛼1 , exists.
Now we assume that (𝑤0 , 𝑤1 ) = (𝛼𝑧0 , 𝛼𝑧1 ) with |𝛼|2 = 𝛼 𝛼̄ = 1: Then we compute
( )
𝜋(𝑤0 , 𝑤1 ) = 2𝑤0 𝑤̄ 1 , |𝑤0 |2 − |𝑤1 |2
( )
= 2𝛼𝑧0 𝛼̄ 𝑧̄ 1 , |𝛼|2 |𝑧0 |2 − |𝛼|2 |𝑧1 |2
( )
= 2𝑧0 𝑧̄ 1 , |𝑧0 |2 − |𝑧1 |2
= 𝜋(𝑧0 , 𝑧1 ).
(c) Let 𝑝 ∈ 𝕊2 be a fixed point and fix a point (𝑧0 , 𝑧1 ) in 𝕊3 with 𝜋(𝑧0 , 𝑧1 ) = 𝑝. By the
previous point, we have that the points in 𝜋 −1 (𝑝) is parametrized by the complex
number 𝛼 with |𝛼|2 = 1. The latter condition means 𝛼 ∈ 𝕊1 ⊂ ℂ. Hence we get a
bijective map
𝕊1 → 𝜋 −1 (𝑝), 𝛼 → (𝛼𝑧0 , 𝛼𝑧1 ).
Since this map just consists of multiplication with nonzero complex numbers, we
can conclude that it is a diffeomorphism (where we consider𝕊1 and 𝜋 −1 (𝑝) as sub-
sets in real Euclidean space).
⎛ 𝑥1 𝑦1 𝑥0 𝑦0 ⎞
𝑑 𝜋̃𝑞 = 2 ⋅ −𝑦1 𝑥1 𝑦0 −𝑥0 ⎟ .
⎜
⎜ ⎟
⎝ 𝑥0 𝑦0 −𝑥1 −𝑦1 ⎠
438 A.15. Linking Number and the Hopf Invariant
Recall: Since it has been a while that we looked at tangent spaces, let us see
why this claim is true. So let 𝑔4 ∶ ℝ4 → ℝ and 𝑔3 ∶ ℝ3 → ℝ be the usual
smooth maps such that 𝕊3 = 𝑔4−1 (1) and 𝕊2 = 𝑔3−1 (1) respectively. Then we have
𝑇𝑞 𝕊3 = Ker (𝑑(𝑔4 )𝑞 ) ⊂ 𝑇𝑞 ℝ4 = ℝ4 and 𝑇𝑝 𝕊2 = Ker (𝑑(𝑔3 )𝑝 ) ⊂ 𝑇𝑝 ℝ3 = ℝ3 .
We know that 𝜋(𝑞) ̃ ∈ 𝕊2 if 𝑞 ∈ 𝕊3 . Actually, our calculation above shows that
𝜋(𝑞)
̃ ∈ 𝕊 if and only if 𝑞 ∈ 𝕊3 . This implies 𝕊3 = 𝜋̃ −1 (𝑔3−1 (1)) = (𝑔3 ◦𝜋)
2 ̃ −1 (1). In
particular, 𝑔3 (𝜋(𝑞))
̃ = 1 is constant on 𝕊3 . Hence, for every 𝑞 ∈ 𝕊3 , the image of
the restriction (𝑑 𝜋̃𝑞 )|𝑇𝑞 𝕊3 is contained in the kernel of 𝑑(𝑔3 )𝜋(𝑞)
̃ which is 𝑇𝜋(𝑞)
̃ 𝕊 .
2
𝑇𝑞 𝕊3 = {𝐮 ∈ ℝ4 ∶ 𝐮 ⟂ 𝑞}
⎧ ⎛−𝑦0 ⎞ ⎛0⎞ ⎛0⎞⎫
⎪ ⟂ ⎜ 𝑥0 ⎟ 4 ⎜0⎟ 4 ⎜0⎟⎪
= span ⎨𝑞 = ⎜ ⎟ , 𝐞3 = ⎜1⎟ , 𝐞4 = ⎜0⎟⎬ .
⎪ ⎜ 0 ⎟ ⎜ ⎟ ⎜ ⎟⎪
⎩ ⎝ 0 ⎠ ⎝0⎠ ⎝1⎠⎭
Appendix A. Solutions to exercises 439
The orientation of 𝑇𝑞 𝕊3 as a boundary of the unit ball is such that the outward
pointing vector 𝑞 together with the basis vectors of 𝑇𝑞 𝕊3 form a positively
oriented basis of ℝ4 . The determinant of the matrix expressing the basis
(𝑞, 𝑞 ⟂ , 𝐞43 , 𝐞44 ) in the standard basis of ℝ4 equals 𝑥20 +𝑦20 = 1 > 0. In particular,
it is positive and the basis (𝑞 ⟂ , 𝐞43 , 𝐞44 ) is a positively oriented basis of 𝑇𝑞 𝕊3 .
The tangent space 𝑇𝑞 𝜋 −1 (𝑎) equals the kernel of 𝑑 𝜋̃𝑞 . We computed this map
as represented by the matrix
⎛ 0 0 𝑥0 𝑦0 ⎞
𝑑 𝜋̃𝑞 = 2 ⋅ ⎜ 0 0 𝑦0 −𝑥0 ⎟ .
⎜ ⎟
⎝𝑥0 𝑦0 0 0 ⎠
The kernel of this map is the span of the vector 𝑞 ⟂ that we have just seen
above. The normal space 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ) ⊂ 𝑇𝑞 𝕊3 of vectors which are or-
thogonal to 𝑇𝑞 𝜋 −1 (𝑎) is the span of (𝐞43 , 𝐞44 ). The map 𝑑 𝜋̃𝑞 sends 𝐞3 and 𝐞4 to,
respectively,
⎛𝑥0 ⎞ ⎛ 𝑦0 ⎞
⎜ 𝑦 ⎟ ⎜−𝑥 ⎟
𝑑 𝜋̃𝑞 (𝐞43 ) = 2 ⎜ 0 ⎟ , 𝑑 𝜋̃𝑞 (𝐞44 ) = 2 ⎜ 0 ⎟ .
⎜0⎟ ⎜ 0 ⎟
⎝0⎠ ⎝ 0 ⎠
These two vectors form a basis (𝑑 𝜋̃𝑞 (𝐞33 ), 𝑑 𝜋̃𝑞 (𝐞44 )) of 𝑇𝑎 𝕊2 . We need to check
the orientation of this basis. The tangent space 𝑇𝑎 𝕊2 has a basis (𝐞31 , 𝐞32 ) as
a subspace in ℝ3 . This basis is positively oriented, since, together with the
outward pointing vector 𝑎 = 𝐞33 , the basis (𝐞33 , 𝐞31 , 𝐞32 ) is a positively oriented
basis of ℝ3 .a The matrix 𝐴 which expresses (𝑑 𝜋̃𝑞 (𝐞33 ), 𝑑 𝜋̃𝑞 (𝐞44 )) in terms of
the basis (𝐞31 , 𝐞32 ) is given by
( )
𝑥0 𝑦0
𝐴=2⋅ .
𝑦0 −𝑥0
We see that det 𝐴 = 4(−𝑥20 − 𝑦20 ) = −4 < 0 is negative. Hence the basis
(𝑑 𝜋̃𝑞 (𝐞33 ), 𝑑 𝜋̃𝑞 (𝐞44 )) is a negatively oriented basis of 𝑇𝑎 𝕊2 . This defines an
orientation on the normal space 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ) by declaring the orientation
of the basis (𝐞43 , 𝐞44 ) to be negative.
𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ) ⊕ 𝑇𝑞 𝜋 −1 (𝑎) = 𝑇𝑞 𝕊3
Let 𝑞 = (𝑥0 , 𝑥1 , 𝑥1 , −𝑥0 ) ∈ 𝜋 −1 (𝑏) be a point in the fiber over 𝑏. Since not
both 𝑥0 and 𝑥1 can be zero, we assume that 𝑥0 ≠ 0. The tangent space 𝑇𝑞 𝕊3
is the vector space
𝑇𝑞 𝕊3 = {𝐮 ∈ ℝ4 ∶ 𝐮 ⟂ 𝑞}
⎧ ⎛−𝑥1 ⎞ ⎛−𝑥1 ⎞ ⎛𝑥0 ⎞⎫
⎪ ⟂ ⎜ 𝑥0 ⎟ ⟂ ⎜ 0 ⎟ ⟂ ⎜ 0 ⎟⎪
= span ⎨𝑞1 = ⎜ ⎟ , 𝑞2 = ⎜ 𝑥 ⎟ , 𝑞3 = ⎜ 0 ⎟⎬ .
⎪ ⎜ 0 ⎟ ⎜ 0⎟ ⎜ ⎟⎪
⎩ ⎝ 0 ⎠ ⎝ 0 ⎠ ⎝𝑥0 ⎠⎭
The orientation of 𝑇𝑞 𝕊3 as a boundary of the unit ball is such that the outward
pointing vector 𝑞 together with the basis vectors of 𝑇𝑞 𝕊3 form a positively
oriented basis of ℝ4 . The matrix expressing the basis (𝑞, 𝑞1⟂ , 𝑞2⟂ , 𝑞3⟂ ) in the
standard basis of ℝ4 is
⎛ 𝑥0 −𝑥1 −𝑥1 𝑥0 ⎞
⎜ 𝑥1 𝑥0 0 0⎟
⎜𝑥 0 𝑥 0 ⎟.
⎜ 1 0 ⎟
⎝−𝑥0 0 0 𝑥0 ⎠
The determinant of this matrix is
In particular, it is positive and the basis (𝑞1⟂ , 𝑞2⟂ , 𝑞3⟂ ) is a positively oriented
basis of 𝑇𝑞 𝕊3 .
The tangent space 𝑇𝑞 𝜋 −1 (𝑏) equals the kernel of 𝑑 𝜋̃𝑞 . We computed this map
as represented by the matrix
⎛𝑥1 −𝑥0 𝑥0 𝑥1 ⎞
𝑑 𝜋̃𝑞 = 2 ⋅ ⎜𝑥0 𝑥1 𝑥1 −𝑥0 ⎟ .
⎜ ⎟
⎝𝑥0 𝑥1 −𝑥1 𝑥0 ⎠
⎛−𝑥1 ⎞
⎜𝑥 ⎟
The kernel of this map is the span of the vector 𝑞0⟂ = ⎜ 0 ⎟. The normal
⎜ 𝑥0 ⎟
⎝ 𝑥1 ⎠
space 𝑁𝑞 (𝜋 (𝑏); 𝕊 ) ⊂ 𝑇𝑞 𝕊 of vectors which are orthogonal to 𝑇𝑞 𝜋 −1 (𝑏) is
−1 3 3
Appendix A. Solutions to exercises 441
the span of (𝑞1⟂ −𝑞2⟂ , 𝑞3⟂ ). The map 𝑑 𝜋̃𝑞 sends 𝑞1⟂ −𝑞2⟂ and 𝑞3⟂ to, respectively,
⎛ −2𝑥20 ⎞ ⎛−2𝑥0 𝑥1 ⎞
𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ) = 2 ⎜ 0 ⎟ , 𝑑 𝜋̃𝑞 (𝑞3⟂ ) = 2 ⎜ 0 ⎟ .
⎜ ⎟ ⎜ 2 ⎟
⎝−2𝑥0 𝑥1 ⎠ ⎝ 2𝑥0 ⎠
These two vectors form a basis (𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ), 𝑑 𝜋̃𝑞 (𝑞3⟂ )) of 𝑇𝑏 𝕊2 . We need to
check the orientation of this basis. The tangent space 𝑇𝑏 𝕊2 has a basis (𝐞31 , 𝐞33 )
as a subspace in ℝ3 . This basis is negatively oriented, since, together with the
outward pointing vector 𝑏 = 𝐞32 , the basis (𝐞32 , 𝐞31 , 𝐞33 ) is a negatively oriented
basis of ℝ3 . For we need to make one permutation to get the standard basis
which leads to multiplying the sign with −1. The matrix 𝐵 which expresses
(𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ), 𝑑 𝜋̃𝑞 (𝑞3⟂ )) in terms of the basis (𝐞31 , 𝐞33 ) is given by
( )
−𝑥20 −𝑥0 𝑥1
𝐵 =4⋅ .
−𝑥0 𝑥1 𝑥20
We see that det 𝐵 = 16(−𝑥40 − 𝑥20 𝑥21 ) = −16𝑥20 (𝑥20 + 𝑥21 ) = −8𝑥20 < 0 is neg-
ative. Hence the basis (𝑑 𝜋̃𝑞 (𝑞1⟂ − 𝑞2⟂ ), 𝑑 𝜋̃𝑞 (𝑞3⟂ )) is a negatively oriented basis
of 𝑇𝑏 𝕊2 . This defines an orientation on the normal space 𝑁𝑞 (𝜋 −1 (𝑏); 𝕊3 ) by
declaring the orientation of the basis (𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ ) to be negative.
𝑁𝑞 (𝜋 −1 (𝑏); 𝕊3 ) ⊕ 𝑇𝑞 𝜋 −1 (𝑏) = 𝑇𝑞 𝕊3
⎛1 0 1 ⎞
⎜−1 0 1 ⎟.
⎜ ⎟
⎝ 0 1 𝑥1 ∕𝑥0 ⎠
The determinant of this matrix is −2. In particular, it is negative. Since we
checked that the basis (𝑞1⟂ , 𝑞2⟂ , 𝑞3⟂ ) is a negatively oriented basis, we see that
the orientation of the basis (𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ , 𝑞0⟂ ) of 𝑇𝑞 𝕊3 is positive. Since the
sign of (𝑞1⟂ − 𝑞2⟂ , 𝑞3⟂ ) is positive as a basis of 𝑁𝑞 (𝜋 −1 (𝑎); 𝕊3 ), we need that
the basis 𝑞0⟂ also has positive sign. Hence the vector 𝑞0⟂ provides a positively
oriented basis of 𝑇𝑞 𝜋 −1 (𝑏).
(f) By definition of 𝐻(𝜋), we need to choose two distinct regular values 𝑎 and 𝑏 of
𝜋 and calculate the linking number of 𝜋 −1 (𝑎) and 𝜋 −1 (𝑏). Since we showed that
each value is regular, we can for example choose 𝑎 = (0, 0, 1) and 𝑏 = (0, 1, 0) on
𝕊2 ⊂ ℝ3 ≅ ℂ × ℝ.
To calculate the linking number of 𝜋 −1 (𝑎) and 𝜋 −1 (𝑏) we need to choose a point
on 𝕊3 disjoint from these two subsets and stereographically project 𝕊3 from this
point onto ℝ3 . By our choice of 𝑎 and 𝑏, we get that the north pole 𝑁 = (0, 0, 0, 1)
442 A.15. Linking Number and the Hopf Invariant
is neither on 𝜋 −1 (𝑎) nor on 𝜋 −1 (𝑏). Recall that the formula for the stereographic
projection 𝜙−1
𝑁
∶ 𝕊3 ⧵ {𝑁} → ℝ3 is, with the notation we use here, given by
1
(𝑥0 , 𝑦0 , 𝑥1 , 𝑦1 ) → (𝑥 , 𝑦 , 𝑥 ).
1 − 𝑦1 0 0 1
Hence we get
𝑆𝑎 ≔ 𝜙−1 −1
𝑁 (𝜋 (𝑎))
= {𝐯 = (𝑣0 , 𝑣1 , 𝑣2 ) ∈ ℝ3 ∶ 𝑣20 + 𝑣21 = 1 and 𝑣2 = 0}
and
𝑆𝑏 ≔ 𝜙−1 −1
𝑁 (𝜋 (𝑏))
{ }
(1 − 𝑤0 )2
= 𝐰 = (𝑤0 , 𝑤1 , 𝑤2 ) ∈ ℝ ∶ 𝑤1 = 𝑤2 and 𝑤0 + 𝑤1 =
3 2 2
.
2
𝑣1 = 𝑤1 = 0 and 𝑣0 − 𝑤0 = |𝑣0 − 𝑤0 |.
The latter condition, implies that 𝑣0 − 𝑤0 is positive. This does not look very
helpful at first glance, but we also know
and
(1 − 𝑤0 )2 √
𝑤20 = ⇐⇒ 𝑤0 = ± 2 − 1.
2
Now we can check for the four possible permutations of the signs
√ whether they
yield 𝑣0 − 𝑤0 ≥ √ 0 and get three points: one with 𝑣0 = 1, 𝑤
√0 = 2 − 1, one with
𝑣0 = 1, 𝑤0 = − 2 − 1, and one with 𝑣0 = −1, 𝑤0 = − 2 − 1. Hence we get
three points (𝐯, 𝐰) in 𝑆𝑎 × 𝑆𝑏 with 𝜆(𝐯, 𝐰) = 𝑝.
It remains to check the derivatives of 𝜆 at these points. We have to show that the
determinants at each point are nonzero and that the sum of the signs is +1. For
then we get deg(𝜆) = +1 as claimed.
Since 𝑆𝑎 is the unit circle in the 𝑥𝑦-plane, the tangent space of 𝑆𝑎 at a point 𝐯 is
given by
(1 − 𝑤0 )2
𝑔𝑏 ∶ 𝑃 → ℝ, 𝐰 = (𝑤0 , 𝑤1 , 𝑤1 ) → 𝑤20 + 𝑤21 = .
2
After a simple computation we get
The derivative of 𝑔𝑏 as a map from 𝑃 → ℝ is given by the matrix (we could also
consider it as a map ℝ3 → ℝ2 )
Hence we get
This will help us, since 𝑔3 ◦𝜆̃ is constant on 𝑆𝑎 × 𝑆𝑏 . Hence 𝑑 𝜆̃(𝐯,𝐰) sends the
subspace 𝑇𝐯 × 𝑇𝐰 𝑆𝑏 ∈ ℝ3 × ℝ3 to 𝑇𝑝 𝕊2 ⊂ ℝ3 .
𝜕 𝜆̃𝑖 𝜕 𝜆̃𝑖
We determine 𝑑 𝜆̃(𝐯,𝐰) by computing its partial derivatives 𝜕𝑣𝑗
(𝐯, 𝐰) and 𝜕𝑤𝑗
(𝐯, 𝐰)
with respect to the variables 𝑣0 , 𝑣1 , 𝑣2 and 𝑤0 , 𝑤1 , 𝑤2 : For 𝑖 ≠ 𝑗, we have
For 𝑖 = 𝑗, we get
⎧(𝑣1 − 𝑤1 )2 + (𝑣2 − 𝑤2 )2 if 𝑖 = 0
𝜕 𝜆̃𝑖 1 ⎪
(𝐯, 𝐰) = ⋅ ⎨(𝑣0 − 𝑤0 )2 + (𝑣2 − 𝑤2 )2 if 𝑖 = 1
𝜕𝑣𝑖 |𝐯 − 𝐰|3 ⎪(𝑣 − 𝑤 )2 + (𝑣 − 𝑤 )2
⎩ 0 0 1 1 if 𝑖 = 2
and
𝜕 𝜆̃𝑖 𝜕 𝜆̃
(𝐯, 𝐰) = − 𝑖 (𝐯, 𝐰).
𝜕𝑤𝑖 𝜕𝑣𝑖
Now we evaluate these formulae at the points (𝐯, 𝐰) with 𝜆(𝐯, 𝐰) = 𝑝. For each
such point we found 𝑣1 = 𝑣2 = 𝑤1 = 𝑤2 = 0. Hence we get
𝜕 𝜆̃𝑖 𝜕 𝜆̃𝑖
(𝐯, 𝐰) = 0 = (𝐯, 𝐰)
𝜕𝑣𝑗 𝜕𝑤𝑗
444 A.15. Linking Number and the Hopf Invariant
for 𝑖 ≠ 𝑗,
{
𝜕 𝜆̃𝑖 1 0 if 𝑖 = 0
(𝐯, 𝐰) = ⋅
𝜕𝑣𝑖 |𝑣0 − 𝑤0 | 1 if 𝑖 = 1, 2
and
{
𝜕 𝜆̃𝑖 1 0 if 𝑖 = 0
(𝐯, 𝐰) = ⋅
𝜕𝑤𝑖 |𝑣0 − 𝑤0 | −1 if 𝑖 = 1, 2.
Thus in total we get that the sum of the orientation numbers is +1 − 1 + 1 = +1.
Hence we have shown 𝐻(𝜋) = deg(𝜆) = 1.
a
We make two permutations which lead to multiplying with (−1)2 = +1.
b
Imagining 𝑆𝑏 as a deformation of the unit circle lying in the plane 𝑦 = 𝑧 in ℝ3 and the vector 𝐛 points
upwards at a point with positive 𝑥-coordinate. We checked that the orientation of 𝑆𝑏 is opposite to the
standard orientation of the circle.
Bibliography
[1] Bjørn Dundas, A short course in differential topology, Cambridge Mathematical Textbooks,
Cambridge University Press, Cambridge, 2018.
[2] David Eisenbud, Harold I. Levine, An algebraic formula for the degree of a 𝐶 ∞ map germ,
Ann. of Math. (2) 106 (1977), no.1, 19–44.
[3] David Eisenbud, An algebraic approach to the topological degree of a smooth map, Bull.
Amer. Math. Soc. 84 (1978), no.5, 751–764.
[4] William Fulton, Algebraic topology, Graduate Texts in Mathematics 153, Springer-Verlag,
New York, 1995.
[5] Victor Guillemin, Alan Pollack, Differential topology, Reprint of the 1974 original, AMS
Chelsea Publishing, Providence, RI, 2010.
[6] Allen Hatcher, Algebraic topology, Cambridge University Press, Cambridge, 2002.
[7] Heinz Hopf, Über die Abbildungen von Sphären auf Sphären niedrigerer Dimension, Fun-
damenta Mathematicae 25 (1935), 427–440.
[11] John M. Lee, Introduction to smooth manifolds, Graduate Texts in Mathematics 218,
Springer-Verlag, New York, 2013.
[12] John W. Milnor, Morse Theory, Annals of Mathematics Studies, No. 51, Princeton Uni-
versity Press, Princeton, NJ, 1963.
[13] John W. Milnor, Topology from the differentiable viewpoint, Princeton Landmarks in
Mathematics, Revised reprint of the 1965 original, Princeton University Press, Princeton,
NJ, 1997.
[14] John W. Milnor, James D. Stasheff, Characteristic classes, Annals of Mathematics Stud-
ies, No. 76, Princeton University Press, Princeton, NJ, 1974.
[15] James R. Munkres, Topology, Prentice Hall, Inc., Upper Saddle River, NJ, Second edition,
2000.
[16] Gereon Quick, Algebraic Topology - Lecture Notes, online notes from a course in 2018.
446
Bibliography 447
[17] Loring W. Tu, An introduction to manifolds, Universitext, Springer-Verlag, New York,
2011.
[18] Claire Voisin, Hodge theory and complex algebraic geometry. I. Translated from the
French original by Leila Schneps. Cambridge Studies in Advanced Mathematics, 76. Cam-
bridge University Press, Cambridge, 2002.
Index
448
Index 449
Milnor hypersurface, 185 Boundary (degree), 233, 234, 307,
Morse function, 131, 354 335, 407, 435
Morse Lemma, 132 Boundary (intersection number), 273,
282
neighborhood, 17 Brouwer Fixed Point, 203, 205, 209
normal bundle, 244, 249, 251, 305 Classification of one-manifolds, 197,
201, 211, 271
open ball, 14 Constant Rank, 99, 382
orientable, 287, 288, 290, 291, 298, 305 Dirichlet’s Approximation, 69
orientation Ehresmann Fibration, 256
boundary, 292, 298, 332 Embedding, 71
opposite, 289 Extension (transversality), 262, 331
preimage, 296, 298 Extension for degree zero maps, 331
pullback, 289 Heine–Borel, 21
real vector space, 286 Hopf Degree, 313, 323
smooth manifold, 287 Hopf nowhere vanishing vector fields,
345, 355
path-connected, 23, 103, 149
Invariance of domain, 206
proper map, 71
Inverse Function, 59, 61, 65, 85, 208,
real projective space, 151, 155, 163, 274, 242, 243, 251, 257
280 Isotopy, 218, 330
regular value, 80, 81, 83, 85, 86, 91, 195, Local Immersion, 65, 70, 109
235, 307, 309 Local Submersion, 77, 79, 125
relative open, 15 Perron–Frobenius, 209
retraction, 202 Poincaré–Hopf Index, 344
Preimage, 80, 162, 195, 245, 258,
simply-connceted, 276 268, 374, 376
simply-connected, 149, 209, 274, 277, Preimage with boundary, 196, 211,
281, 300 271
singular chain, 342 Sard, 124, 128, 149, 173, 213, 235,
singular homology, 342 281, 312
smooth map, 24, 26, 35, 152 Sard with boundary, 197
stable property, 143 Stack of Records, 86, 91, 307, 328,
stereographic projection, 34, 35, 56, 149, 330
186, 233, 236, 275, 281 Topological invariance of domain, 206
Stiefel manifold, 160 Transversality, 258
submanifold, 35, 66, 80, 111, 114 Transversality Homotopy, 262, 273
submersion, 75 Tubular Neighborhood, 247, 253, 257,
331
tangent bundle, 51 Whitney Approximation, 140, 253,
tangent space, 40, 155 277
2-sphere, 42 Whitney Embedding, 174, 175
Grassmannian, 167 Whitney Immersion, 173
real projective space, 157 torus, 36, 43, 57, 64, 68, 130, 134, 158,
unit circle, 41 272, 313, 344, 345
Theorem transverse, 111, 113, 114, 116, 128, 268,
Baire Category, 382 270, 272, 386, 411
Borsuk–Ulam, 220 tubular neighborhood, 247
450 Index
vector bundle, 53, 164 isolated zero, 334
vector field, 150, 333 nondegenerate zero, 347
derivative, 348 pullback, 335
gradient, 333, 336, 353 torus, 345
index of a zero, 334, 337 velocity vector, 57
index sum, 339, 344, 351
index sum of gradient field, 353 winding number, 219, 324, 326, 327, 329