Lectures
Lectures
Liviu I. Nicolaescu
Shape is a fascinating and intriguing subject which has stimulated the imagination of
many people. It suffices to look around to become curious. Euclid did just that and came
up with the first pure creation. Relying on the common experience, he created an abstract
world that had a life of its own. As the human knowledge progressed so did the ability of
formulating and answering penetrating questions. In particular, mathematicians started
wondering whether Euclid’s “obvious” absolute postulates were indeed obvious and/or
absolute. Scientists realized that Shape and Space are two closely related concepts and
asked whether they really look the way our senses tell us. As Felix Klein pointed out
in his Erlangen Program, there are many ways of looking at Shape and Space so that
various points of view may produce different images. In particular, the most basic issue
of “measuring the Shape” cannot have a clear cut answer. This is a book about Shape,
Space and some particular ways of studying them.
Since its inception, the differential and integral calculus proved to be a very versatile
tool in dealing with previously untouchable problems. It did not take long until it found
uses in geometry in the hands of the Great Masters. This is the path we want to follow
in the present book.
In the early days of geometry nobody worried about the natural context in which the
methods of calculus “feel at home”. There was no need to address this aspect since for the
particular problems studied this was a non-issue. As mathematics progressed as a whole
the “natural context” mentioned above crystallized in the minds of mathematicians and
it was a notion so important that it had to be given a name. The geometric objects which
can be studied using the methods of calculus were called smooth manifolds. Special cases
of manifolds are the curves and the surfaces and these were quite well understood. B.
Riemann was the first to note that the low dimensional ideas of his time were particular
aspects of a higher dimensional world.
The first chapter of this book introduces the reader to the concept of smooth manifold
through abstract definitions and, more importantly, through many we believe relevant
examples. In particular, we introduce at this early stage the notion of Lie group. The
main geometric and algebraic properties of these objects will be gradually described as we
progress with our study of the geometry of manifolds. Besides their obvious usefulness in
geometry, the Lie groups are academically very friendly. They provide a marvelous testing
ground for abstract results. We have consistently taken advantage of this feature through-
out this book. As a bonus, by the end of these lectures the reader will feel comfortable
manipulating basic Lie theoretic concepts.
To apply the techniques of calculus we need things to derivate and integrate. These
i
ii
“things” are introduced in Chapter 2. The reason why smooth manifolds have many
differentiable objects attached to them is that they can be locally very well approximated
by linear spaces called tangent spaces . Locally, everything looks like traditional calculus.
Each point has a tangent space attached to it so that we obtain a “bunch of tangent spaces”
called the tangent bundle. We found it appropriate to introduce at this early point the
notion of vector bundle. It helps in structuring both the language and the thinking.
Once we have “things to derivate and integrate” we need to know how to explicitly
perform these operations. We devote the Chapter 3 to this purpose. This is perhaps
one of the most unattractive aspects of differential geometry but is crucial for all further
developments. To spice up the presentation, we have included many examples which
will found applications in later chapters. In particular, we have included a whole section
devoted to the representation theory of compact Lie groups essentially describing the
equivalence between representations and their characters.
The study of Shape begins in earnest in Chapter 4 which deals with Riemann manifolds.
We approach these objects gradually. The first section introduces the reader to the notion
of geodesics which are defined using the Levi-Civita connection. Locally, the geodesics
play the same role as the straight lines in an Euclidian space but globally new phenomena
arise. We illustrate these aspects with many concrete examples. In the final part of this
section we show how the Euclidian vector calculus generalizes to Riemann manifolds.
The second section of this chapter initiates the local study of Riemann manifolds.
Up to first order these manifolds look like Euclidian spaces. The novelty arises when we
study “second order approximations ” of these spaces. The Riemann tensor provides the
complete measure of how far is a Riemann manifold from being flat. This is a very involved
object and, to enhance its understanding, we compute it in several instances: on surfaces
(which can be easily visualized) and on Lie groups (which can be easily formalized). We
have also included Cartan’s moving frame technique which is extremely useful in concrete
computations. As an application of this technique we prove the celebrated Theorema
Egregium of Gauss. This section concludes with the first global result of the book, namely
the Gauss-Bonnet theorem. We present a proof inspired from [38] relying on the fact
that all Riemann surfaces are Einstein manifolds. The Gauss-Bonnet theorem will be a
recurring theme in this book and we will provide several other proofs and generalizations.
One of the most fascinating aspects of Riemann geometry is the intimate correlation
“local-global”. The Riemann tensor is a local object with global effects. There are cur-
rently many techniques of capturing this correlation. We have already described one in
the proof of Gauss-Bonnet theorem. In Chapter 5 we describe another such technique
which relies on the study of the global behavior of geodesics. We felt we had the moral
obligation to present the natural setting of this technique and we briefly introduce the
reader to the wonderful world of the calculus of variations. The ideas of the calculus of
variations produce remarkable results when applied to Riemann manifolds. For example,
we explain in rigorous terms why “very curved manifolds” cannot be “too long” .
In Chapter 6 we leave for a while the “differentiable realm” and we briefly discuss the
fundamental group and covering spaces. These notions shed a new light on the results
of Chapter 5. As a simple application we prove Weyl’s theorem that the semisimple Lie
groups with definite Killing form are compact and have finite fundamental group.
Chapter 7 is the topological core of the book. We discuss in detail the cohomology
iii
of smooth manifolds relying entirely on the methods of calculus. In writing this chapter
we could not, and would not escape the influence of the beautiful monograph [22], and
this explains the frequent overlaps. In the first section we introduce the DeRham coho-
mology and the Mayer-Vietoris technique. Section 2 is devoted to the Poincaré duality, a
feature which sets the manifolds apart from many other types of topological spaces. The
third section offers a glimpse at homology theory. We introduce the notion of (smooth)
cycle and then present some applications: intersection theory, degree theory, Thom iso-
morphism and we prove a higher dimensional version of the Gauss-Bonnet theorem at the
cohomological level. The fourth section analyzes the role of symmetry in restricting the
topological type of a manifold. We prove Élie Cartan’s old result that the cohomology
of a symmetric space is given by the linear space of its bi-invariant forms. We use this
technique to compute the lower degree cohomology of compact semisimple Lie groups. We
conclude this section by computing the cohomology of complex Grassmannians relying on
Weyl’s integration formula and Schur polynomials. The chapter ends with a fifth section
containing a concentrated description of Čech cohomology.
Chapter 8 is a natural extension of the previous one. We describe the Chern-Weil
construction for arbitrary principal bundles and then we concretely describe the most im-
portant examples: Chern classes, Pontryagin classes and the Euler class. In the process,
we compute the ring of invariant polynomials of many classical groups. Usually, the con-
nections in principal bundles are defined in a global manner, as horizontal distributions.
This approach is geometrically very intuitive but, at a first contact, it may look a bit
unfriendly in concrete computations. We chose a local approach build on the reader’s ex-
perience with connections on vector bundles which we hope will attenuate the formalism
shock. In proving the various identities involving characteristic classes we adopt an invari-
ant theoretic point of view. The chapter concludes with the general Gauss-Bonnet-Chern
theorem. Our proof is a variation of Chern’s proof.
Chapter 9 is the analytical core of the book.1 Many objects in differential geometry
are defined by differential equations and, among these, the elliptic ones play an important
role. This chapter represents a minimal introduction to this subject. After presenting
some basic notions concerning arbitrary partial differential operators we introduce the
Sobolev spaces and describe their main functional analytic features. We then go straight
to the core of elliptic theory. We provide an almost complete proof of the elliptic a
priori estimates (we left out only the proof of the Calderon-Zygmund inequality). The
regularity results are then deduced from the a priori estimates via a simple approximation
technique. As a first application of these results we consider a Kazhdan-Warner type
equation which recently found applications in solving the Seiberg-Witten equations on
a Kähler manifold. We adopt a variational approach. The uniformization theorem for
compact Riemann surfaces is then a nice bonus. This may not be the most direct proof but
it has an academic advantage. It builds a circle of ideas with a wide range of applications.
The last section of this chapter is devoted to Fredholm theory. We prove that the elliptic
operators on compact manifolds are Fredholm and establish the homotopy invariance of the
index. These are very general Hodge type theorems. The classical one follows immediately
from these results. We conclude with a few facts about the spectral properties of elliptic
1
In the current edition, this is Chapter 10
iv
operators.
The last chapter is entirely devoted to a very important class of elliptic operators
namely the Dirac operators. The important role played by these operators was singled
out in the works of Atiyah and Singer and, since then, they continue to be involved in the
most dramatic advances of modern geometry. We begin by first describing a general notion
of Dirac operators and their natural geometric environment, much like in [14]. We then
isolate a special subclass we called geometric Dirac operators. Associated to each such
operator is a very concrete Weitzenböck formula which can be viewed as a bridge between
geometry and analysis, and which is often the source of many interesting applications. The
abstract considerations are backed by a full section describing many important concrete
examples.
In writing this book we had in mind the beginning graduate student who wants to
specialize in global geometric analysis in general and gauge theory in particular. The
second half of the book is an extended version of a graduate course in differential geometry
we taught at the University of Michigan during the winter semester of 1996.
The minimal background needed to successfully go through this book is a good knowl-
edge of vector calculus and real analysis, some basic elements of point set topology and
linear algebra. A familiarity with some basic facts about the differential geometry of curves
of surfaces would ease the understanding of the general theory, but this is not a must.
Some parts of the chapter on elliptic equations may require a more advanced background
in functional analysis.
The theory is complemented by a large list of exercises. Quite a few of them contain
technical results we did not prove so we would not obscure the main arguments. There
are however many non-technical results which contain additional information about the
subjects discussed in a particular section. We left hints whenever we believed the solution
is not straightforward.
Personal note It has been a great personal experience writing this book, and I sincerely
hope I could convey some of the magic of the subject. Having access to the remarkable
science library of the University of Michigan and its computer facilities certainly made my
job a lot easier and improved the quality of the final product.
I learned differential equations from Professor Viorel Barbu, a very generous and en-
thusiastic person who guided my first steps in this field of research. He stimulated my
curiosity by his remarkable ability of unveiling the hidden beauty of this highly technical
subject. My thesis advisor, Professor Tom Parker, introduced me to more than the funda-
mentals of modern geometry. He played a key role in shaping the manner in which I regard
mathematics. In particular, he convinced me that behind each formalism there must be
a picture, and uncovering it, is a very important part of the creation process. Although
I did not directly acknowledge it, their influence is present throughout this book. I only
hope the filter of my mind captured the full richness of the ideas they so generously shared
with me.
My friends Louis Funar and Gheorghe Ionesei 2 read parts of the manuscript. I am
grateful to them for their effort, their suggestions and for their friendship. I want to thank
Arthur Greenspoon for his advice, enthusiasm and relentless curiosity which boosted my
2
He passed away while I was preparing the second edition. He was the ultimate poet of mathematics.
v
spirits when I most needed it. Also, I appreciate very much the input I received from the
graduate students of my “Special topics in differential geometry” course at the University
of Michigan which had a beneficial impact on the style and content of this book.
At last, but not the least, I want to thank my family who supported me from the
beginning to the completion of this project.
Ann Arbor, 1996.
Rarely in life is a man given the chance to revisit his “youthful indiscretions”. With
this second edition I have been given this opportunity, and I have tried to make the best
of it.
The first edition was generously sprinkled with many typos, which I can only attribute
to the impatience of youth. In spite of this problem, I have received very good feedback
from a very indulgent and helpful audience, from all over the world.
In preparing the new edition, I have been engaged on a massive typo hunting, supported
by the wisdom of time, and the useful comments that I have received over the years from
many readers. I can only say that the number of typos is substantially reduced. However,
experience tells me that Murphy’s Law is still at work, and there are still typos out there
which will become obvious only in the printed version.
The passage of time has only strengthened my conviction that, in the words of Isaac
Newton, “in learning the sciences examples are of more use than precepts”. The new
edition continues to be guided by this principle. I have not changed the old examples, but
I have polished many of my old arguments, and I have added quite a large number of new
examples and exercises.
The only major addition to the contents is a new chapter (Chapter 9) on classical
integral geometry. This is a subject that captured my imagination over the last few years,
and since the first edition developed all the tools needed to understand some of the juiciest
results in this area of geometry, I could not pass the chance to share with a curious reader
my excitement about this line of thought.
One novel feature in our presentation of the classical results of integral geometry is the
use of tame geometry. This is a recent extension of the better know area of real algebraic
geometry which allowed us to avoid many heavy analytical arguments, and present the
geometric ideas in as clear a light as possible.
Notre Dame, 2007.
I started writing the first edition 25 years ago as a fresh PhD and I was warned
by many “adults” that this was a wrong career move. I embarked in this project with
all the energy, enthusiasm, inexperience and confidence of young age. It was meant to
be an honest presentation of the basic elements of differential geometry used in global
analysis. By “honest” presentation I understood that I should include clear and detailed
explanations for many of the folklore results, examples and points of view that are harder
to trace in the literature and had helped in my research.
i
It was an exciting experience writing the first edition and I was rewarded for my effort
it in many ways. There was an immediate reward for the time spent immersed in the
minutiæ of many examples and proofs. The detailed understanding I achieved allowed me
to push my own research to new directions and in such a depth that I did not believe I
was capable of at that age.
There was a long term reward since it turns out that the useful facts and examples
and computations that were harder to trace are still useful and harder to trace, but now
I can always turn to this book for details. Personal bias aside, I keep a copy of my book
on my desk since I frequently need to look up something in it.
There is a personal reward, as an author. I have been receiving input and acknowl-
edgments from many readers from all the corners of the world. I have implemented all
the corrections and suggestions I have received. A book lives through its readers and
apparently the present one is still alive.
So what is knew in the third edition? There are some obvious additions reflecting
my current research interests. There is the new Chapter 11 on spectral geometry leading
to the original results in Subsection 11.4.5. This is the first place where they appear in
published form.
Describing new results in a monograph rather than in a research journal has allowed
me to go to a level of detail and provide perspective that is not possible in a journal. In
particular, I have added two new subsections 4.2.5 and 5.2.3 on Riemannian geometry
containing facts that, surprisingly, are not familiar to many geometers. These play an
important role in Subsection 11.4.5.
There are less conspicuous changes. I have been using various parts of the book for
graduate courses I taught over the years. The new edition contains the useful feedback
I have received from my students. I have enhanced and cleaned many proofs and I have
added new examples. Of course I have corrected the typos pointed out to me by readers.
It is likely I have introduced new ones.
Notre Dame, 2019.
ii
Contents
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
1 Manifolds 3
1.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1.1 Space and Coordinatization . . . . . . . . . . . . . . . . . . . . . . . 3
1.1.2 The implicit function theorem . . . . . . . . . . . . . . . . . . . . . . 5
1.2 Smooth manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2.1 Basic definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2.2 Partitions of unity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.2.3 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.2.4 How many manifolds are there? . . . . . . . . . . . . . . . . . . . . . 22
3 Calculus on Manifolds 81
3.1 The Lie derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
3.1.1 Flows on manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
3.1.2 The Lie derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
3.1.3 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
3.2 Derivations of Ω• (M ) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
iii
iv CONTENTS
7 Cohomology 243
7.1 DeRham cohomology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243
7.1.1 Speculations around the Poincaré lemma . . . . . . . . . . . . . . . 243
7.1.2 Čech vs. DeRham . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247
7.1.3 Very little homological algebra . . . . . . . . . . . . . . . . . . . . . 249
7.1.4 Functorial properties of the DeRham cohomology . . . . . . . . . . . 256
7.1.5 Some simple examples . . . . . . . . . . . . . . . . . . . . . . . . . . 259
7.1.6 The Mayer–Vietoris principle . . . . . . . . . . . . . . . . . . . . . . 261
7.1.7 The Künneth formula . . . . . . . . . . . . . . . . . . . . . . . . . . 265
7.2 The Poincaré duality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
7.2.1 Cohomology with compact supports . . . . . . . . . . . . . . . . . . 267
7.2.2 The Poincaré duality . . . . . . . . . . . . . . . . . . . . . . . . . . . 271
7.3 Intersection theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275
7.3.1 Cycles and their duals . . . . . . . . . . . . . . . . . . . . . . . . . . 275
7.3.2 Intersection theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . 280
7.3.3 The topological degree . . . . . . . . . . . . . . . . . . . . . . . . . . 285
7.3.4 The Thom isomorphism theorem . . . . . . . . . . . . . . . . . . . . 287
7.3.5 Gauss–Bonnet revisited . . . . . . . . . . . . . . . . . . . . . . . . . 290
7.4 Symmetry and topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 294
7.4.1 Symmetric spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 294
7.4.2 Symmetry and cohomology . . . . . . . . . . . . . . . . . . . . . . . 297
7.4.3 The cohomology of compact Lie groups . . . . . . . . . . . . . . . . 300
7.4.4 Invariant forms on Grassmannians and Weyl’s integral formula . . . 301
7.4.5 The Poincaré polynomial of a complex Grassmannian . . . . . . . . 309
7.5 Čech cohomology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
7.5.1 Sheaves and presheaves . . . . . . . . . . . . . . . . . . . . . . . . . 315
7.5.2 Čech cohomology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 320
Manifolds
1.1 Preliminaries
1.1.1 Space and Coordinatization
Mathematics is a natural science with a special modus operandi. It replaces concrete
natural objects with mental abstractions which serve as intermediaries. One studies the
properties of these abstractions in the hope they reflect facts of life. So far, this approach
proved to be very productive.
The most visible natural object is the Space, the place where all things happen. The
first and most important mathematical abstraction is the notion of number. Loosely
speaking, the aim of this book is to illustrate how these two concepts, Space and Number,
fit together.
It is safe to say that geometry as a rigorous science is a creation of ancient Greeks.
Euclid proposed a method of research that was later adopted by the entire mathematics.
We refer of course to the axiomatic method. He viewed the Space as a collection of points,
and he distinguished some basic objects in the space such as lines, planes etc. He then
postulated certain (natural) relations between them. All the other properties were derived
from these simple axioms.
Euclid’s work is a masterpiece of mathematics, and it has produced many interesting
results, but it has its own limitations. For example, the most complicated shapes one
could reasonably study using this method are the conics and/or quadrics, and the Greeks
certainly did this. A major breakthrough in geometry was the discovery of coordinates
by René Descartes in the 17th century. Numbers were put to work in the study of Space.
Descartes’ idea of producing what is now commonly referred to as Cartesian coor-
dinates is familiar to any undergraduate. These coordinates are obtained using a very
special method (in this case using three concurrent, pairwise perpendicular lines, each one
endowed with an orientation and a unit length standard. What is important here is that
they produced a one-to-one mapping
We call such a process coordinatization. The corresponding map is called (in this case)
3
4 CHAPTER 1. MANIFOLDS
r
θ
There are many ways to coordinatize the configuration space of a motion of a particle,
and for each choice of coordinates we get a different description of the motion. Clearly, all
these descriptions must “agree” in some sense, since they all reflect the same phenomenon.
In other words, these descriptions should be independent of coordinates. Differential ge-
ometry studies the objects which are independent of coordinates.
The coordinatization process had been used by people centuries before mathematicians
accepted it as a method. For example, sailors used it to travel from one point to another
on Earth. Each point has a latitude and a longitude that completely determines its
position on Earth. This coordinatization is not a global one. There exist four domains
delimited by the Equator and the Greenwich meridian, and each of them is then naturally
coordinatized. Note that the points on the Equator or the Greenwich meridian admit two
different coordinatizations which are smoothly related.
The manifolds are precisely those spaces which can be piecewise coordinatized, with
smooth correspondence on overlaps, and the intention of this book is to introduce the
reader to the problems and the methods which arise in the study of manifolds. The next
section is a technical interlude. We will review the implicit function theorem which will
be one of the basic tools for detecting manifolds.
Loosely speaking, a continuous function is differentiable at a point if, near that point,
it admits a “best approximation” by a linear map.
When F is differentiable at u0 ∈ U , the operator T in the above definition is uniquely
determined by
d 1
Th = |t=0 F (u0 + th) = lim (F (u0 + th) − F (u0 )) .
dt t→0 t
We will use the notation T = Du0 F and we will call T the Fréchet derivative of F at u0 .
Assume that the map F : U → Y is differentiable at each point u ∈ U . Then F is said
to be of class C 1 , if the map u 7→ Du F ∈ L(X, Y ) is continuous. F is said to be of class
C 2 if u 7→ Du F is of class C 1 . One can define inductively C k and C ∞ (or smooth) maps.
6 CHAPTER 1. MANIFOLDS
Exercise 1.1.3. (a) Let U ⊂ L(Rn , Rn ) denote the set of invertible n × n matrices. Show
that U is an open subset of L(Rn , Rn ).
(b) Let F : U → U be defined as A → A−1 . Show that DA F (H) = −A−1 HA−1 for any
n × n matrix H.
(c) Show that the Fréchet derivative of the map det : L(Rn , Rn ) → R, A 7→ det A, at
A = 1Rn ∈ L(Rn , Rn ) is given by tr H, i.e.,
d
|t=0 det(1Rn + tH) = tr H, ∀H ∈ L(Rn , Rn ). ⊔
⊓
dt
Theorem 1.1.4 (Inverse function theorem). Let X, Y be two Banach spaces, U ⊂ X open
and F : U ⊂ X → Y a smooth map. If at a point u0 ∈ U the derivative Du0 F ∈ L(X, Y )
is invertible, then there exits an open neighborhood U1 of u0 in U such that F (U1 ) is an
open neighborhood of v0 = F (u0 ) in Y and F : U1 → F (U1 ) is bijective, with smooth
inverse. ⊔
⊓
The spirit of the theorem is very clear: the invertibility of the derivative Du0 F “prop-
agates” locally to F because Du0 F is a very good local approximation for F .
More formally, if we set T = Du0 F , then
F (u0 + h) = F (u0 ) + T h + r(h),
where r(h) = o(khk) as h → 0. The theorem states that, for every v sufficiently close to
v0 , the equation F (u) = v has a unique solution u = u0 + h, with h very small. To prove
the theorem one has to show that, for kv − v0 kY sufficiently small, the equation below
v0 + T h + r(h) = v
1.2. SMOOTH MANIFOLDS 7
has a unique solution. We can rewrite the above equation as T h = v − v0 − r(h) or,
equivalently, as h = T −1 (v − v0 − r(h)). This last equation is a fixed point problem that
can be approached successfully via the Banach fixed point theorem.
Theorem 1.1.5 (Implicit function theorem). Let X, Y , Z be Banach spaces, U ⊂ X,
V ⊂ Y open sets and
F :U ×V →Z
a smooth map. Let (x0 , y0 ) ∈ U × V , and z0 := F (x0 , y0 ). Set
F2 : V → Z, F2 (y) = F (x0 , y).
Assume that Dy0 F2 ∈ L(Y, Z) is invertible. Then there exist open neighborhoods U ⊂ U
of x0 in X, V ⊂ V of y0 in Y , and a smooth map G : U → V such that the set S of
solution (x, y) of the equation F (x, y) = z0 which lie inside U × V can be identified with
the graph of G, i.e.,
(x, y) ∈ U × V ; F (x, y) = z0 = (x, G(x)) ∈ U × V ; x ∈ U .
In pre-Bourbaki times, the classics regarded the coordinate y as a function of x defined
implicitly by the equality F (x, y) = z0 .
Proof. Consider the map
H : X × Y → X × Z, ξ = (x, y) 7→ (x, F (x, y)).
The map H is a smooth map, and at ξ0 = (x0 , y0 ) its derivative Dξ0 H : X × Y → X × Z
has the block decomposition
1X 0
Dξ0 H = .
Dξ0 F1 Dξ0 F2
Above, DF1 (respectively DF2 ) denotes the derivative of x 7→ F (x, y0 ) (respectively the
derivative of y 7→ F (x0 , y)). The linear operator Dξ0 H is invertible, and its inverse has
the block decomposition
1X 0
(Dξ0 H)−1 = .
−1 −1
− (Dξ0 F2 ) ◦ (Dξ0 F1 ) (Dξ0 F2 )
Thus, by the inverse function theorem, the equation (x, F (x, y)) = (x, z0 ) has a unique
solution (x̃, ỹ) = H −1 (x, z0 ) in a neighborhood of (x0 , y0 ). It obviously satisfies x̃ = x and
F (x̃, ỹ) = z0 . Hence, the set {(x, y); F (x, y) = z0 } is locally the graph of x 7→ H −1 (x, z0 ).
⊔
⊓
Uj
Ui
ψi ψ
j
ψ ψ −1
j i
m
m R
R
(called charts or local coordinates) such that, Ψi (Ui ) is open in Rm and, if Ui ∩Uj 6=
∅, then the transition map
Ψj ◦ Ψ−1 m
i : Ψi (Ui ∩ Uj ) ⊂ R → Ψj (Ui ∩ Uj ) ⊂ R
m
⊔
is smooth. (We say that the various charts are smoothly compatible; see Figure 1.2).⊓
(b) Since a manifold is a second countable space we can always work with atlases that are
at most countable. ⊔
⊓
The first and the most important example of manifold is Rn itself. The natural smooth
structure consists of an atlas with a single chart, 1Rn : Rn → Rn . To construct more
examples we will use the implicit function theorem .
Definition 1.2.3. (a) Let M , N be two smooth manifolds of dimensions m and respec-
tively n. A continuous map f : M → N is said to be smooth if, for any local charts φ
on M and ψ on N , the composition ψ ◦ f ◦ φ−1 (whenever this makes sense) is a smooth
map Rm → Rn .
(b) A smooth map f : M → N is called a diffeomorphism if it is invertible and its inverse
is also a smooth map. ⊔
⊓
Example 1.2.4. The map t 7→ et is a diffeomorphism (−∞, ∞) → (0, ∞). The map
t 7→ t3 is a homeomorphism R → R, but it is not a diffeomorphism! ⊔
⊓
Ψ : U → Rm
a smooth, one-to-one map with open image and smooth inverse. Then Ψ defines local
coordinates over U compatible with the existing atlas of M . Thus (U, Ψ) can be added
to the original atlas and the new smooth structure is diffeomorphic with the initial one.
Using Zermelo’s Axiom we can produce a maximal atlas (no more compatible local chart
can be added to it). ⊔
⊓
Our next result is a general recipe for producing manifolds. Historically, this is how
manifolds entered mathematics.
Proposition 1.2.6. Let M be a smooth manifold of dimension m and f1 , . . . , fk ∈
C ∞ (M ). Define
Z = Z(f1 , . . . , fk ) = p ∈ M ; f1 (p) = · · · = fk (p) = 0 .
10 CHAPTER 1. MANIFOLDS
Assume that the functions f1 , . . . , fk are functionally independent along Z, i.e., for each
p ∈ Z, there exist local coordinates (x1 , . . . , xm ) defined in a neighborhood of p in M such
that xi (p) = 0, i = 1, . . . , m, and the matrix
∂f1 ∂f1 ∂f1
1 2 · · · ∂xm
∂ f~ ∂x
..
∂x
. . ..
|p := . .
. .
. .
∂~x ∂fk ∂fk ∂fk
∂x1 ∂x2 · · · ∂xm x1 =···=xm =0
Proof. Step 1: Constructing the charts. Let p0 ∈ Z, and denote by (x1 , . . . , xm ) local
coordinates near p0 such that xi (p0 ) = 0. One of the k × k minors of the matrix
∂f1 ∂f1 ∂f1
∂x1 ∂x2
· · · ∂x m
∂f~ ..
|p := . .
.. .
.. ..
.
∂~x ∂fk ∂fk ∂fk
∂x1 ∂x2 · · · ∂xm x1 =···=xm =0
is nonzero. Assume this minor is determined by the last k columns (and all the k lines).
We can think of the functions f1 , . . . , fk as defined on an open subset U of Rm . Split
Rm as Rm−k × Rk , and set
X = Rm−k , Y = Rk , Z = Rk ,
and F : X × Y → Z given by
f1 (x)
..
x 7→ . ∈ Rk .
fk (x))
∂F
In this case, DF2 = ∂x ′′ : Rk → Rk is invertible since its determinant corresponds to
our nonzero minor. Thus, in a product neighborhood Up0 = Up′ 0 × Up′′0 of p0 , the set Z is
the graph of some function
i.e.,
Z ∩ Up 0 = (x′ , g(x′ ) ) ∈ Rm−k × Rk ; x′ ∈ Up′ 0 , |x′ | small .
We now define ψp0 : Z ∩ Up0 → Rm−k by
ψp
( x′ , g(x′ ) ) 7−→
0
x′ ∈ Rm−k .
Step 2. The transition maps for the charts constructed above are smooth. The details
are left to the reader. ⊔
⊓
1.2. SMOOTH MANIFOLDS 11
Proposition 1.2.6 shows that any submanifold N ⊂ M has a natural smooth structure
so it becomes a manifold per se. Moreover, the inclusion map i : N ֒→ M is smooth.
Exercise 1.2.9. Suppose that M is a smooth m-dimensional manifold. Prive that S ⊂ M
is a codimension k-submanifold of M if and only if, for any p0 ∈ M , there exists a
coordinate chart (U, Ψ) with local coordinates (x1 , . . . , xm ) such that p0 ∈ U and
U ∩S = p ∈ U ; x1 (p) = · · · = xk (p) = 0 . ⊔
⊓
(i) 0 ≤ fβ ≤ 1.
(iii) The family (supp fβ ) is locally finite, i.e., any point x ∈ M admits an open neigh-
borhood intersecting only finitely many of the supports supp fβ .
P
(iv) β fβ (x) = 1 for all x ∈ M . ⊔
⊓
We include here for the reader’s convenience the basic existence result concerning
partitions of unity. For a proof we refer to [128].
Proposition 1.2.11. (a) For any open cover U = (Uα )α∈A of a smooth manifold M there
exists at least one smooth partition of unity (fβ )β∈B subordinated to U such that supp fβ
is compact for any β.
(b) If we do not require compact supports, then we can find a partition of unity in which
B = A and φ = 1A . ⊔
⊓
12 CHAPTER 1. MANIFOLDS
1.2.3 Examples
Manifolds are everywhere, and in fact, to many physical phenomena which can be modelled
mathematically one can naturally associate a manifold. On the other hand, many problems
in mathematics find their most natural presentation using the language of manifolds. To
give the reader an idea of the scope and extent of modern geometry, we present here a
short list of examples of manifolds. This list will be enlarged as we enter deeper into the
study of manifolds.
Example 1.2.13. (The n-dimensional sphere) This is the codimension 1 submanifold of
Rn+1 given by the equation
n
X
2
|x| = (xi )2 = r 2 , x = (x0 , . . . , xn ) ∈ Rn+1 .
i=0
One checks that, along the sphere, the differential of |x|2 is nowhere zero, so by Proposition
1.2.6, S n is indeed a smooth manifold. In this case one can explicitly construct an atlas
(consisting of two charts) which is useful in many applications. The construction relies on
stereographic projections.
Let N and S denote the North and resp. South pole of S n (N = (0, . . . , 0, 1) ∈ Rn+1 ,
S = (0, . . . , 0, −1) ∈ Rn+1 ). Consider the open sets UN = S n \ {N } and US = S n \ {S}.
They form an open cover of S n . The stereographic projection from the North pole is the
map σN : UN → Rn such that, for any P ∈ UN , the point σN (P ) is the intersection of the
line N P with the hyperplane {xn = 0} ∼ = Rn .
The stereographic projection from the South pole is defined similarly.
• For P ∈ UN we denote by y 1 (P ), . . . , y n (P ) the coordinates of σN (P ).
• For Q ∈ US we denote by z 1 (Q), . . . , z n (Q) ) the coordinates of σS (Q).
Exercise 1.2.14. Show that the functions y i , z j constructed in the above example satisfy
yi
z i = P , ∀i = 1, . . . , n. ⊔
⊓
n j 2
j=1 (y )
1.2. SMOOTH MANIFOLDS 13
Example 1.2.15. (The n-dimensional torus) This is the codimension n submanifold of the
vector space R2n with Cartesian coordinates (x1 , y1 ; ... ; xn , yn ), defined by the equalities
Note that T 1 is diffeomorphic with the 1-dimensional sphere S 1 (unit circle). As a set T n
is a direct product of n circles (see Figure 1.3)
T n = x21 + y12 = 1 × x2n + yn2 = 1 = S 1 × · · · × S 1 . ⊔
⊓
Example 1.2.17. (The connected sum of two manifolds) Let M1 and M2 be two manifolds
of the same dimension m. Pick pi ∈ Mi (i = 1, 2), choose small open neighborhoods Ui of
pi in Mi and then local charts ψi identifying each of these neighborhoods with B2 (0), the
ball of radius 2 in Rm .
Let Vi ⊂ Ui correspond (via ψi ) to the annulus {1/2 < |x| < 2} ⊂ Rm . Consider
x
φ: 1/2 < |x| < 2 → 1/2 < |x| < 2 , φ(x) = .
|x|2
The action of φ is clear: it switches the two boundary components of {1/2 < |x| < 2},
and reverses the orientation of the radial directions.
Now “glue” V1 to V2 using the “prescription” given by ψ2−1 ◦ φ ◦ ψ1 : V1 → V2 . In this
way we obtain a new topological space with a natural smooth structure induced by the
smooth structures on Mi . Up to a diffeomeorphism, the new manifold thus obtained is
independent of the choices of local coordinates ([24]), and it is called the connected sum
of M1 and M2 and is denoted by M1 #M2 (see Figure 1.4). ⊔
⊓
Example 1.2.18. (The real projective space RPn ) As a topological space RPn is the
quotient of Rn+1 modulo the equivalence relation
def
x ∼ y ⇐⇒ ∃λ ∈ R∗ : x = λy.
14 CHAPTER 1. MANIFOLDS
V V
1 2
The maps ψk define local coordinates on the projective space. The transition map on the
overlap region Uk ∩ Um = {[x0 , . . . , xn ] ; xk xm 6= 0} can be easily described. Set
The equality
This shows the map ψk ◦ ψm −1 is smooth and proves that RPn is a smooth manifold. Note
1
that when n = 1, RP is diffeomorphic with S 1 . One way to see this is to observe that
the projective space can be alternatively described as the quotient space of S n modulo the
equivalence relation which identifies antipodal points . ⊔
⊓
Example 1.2.19. (The complex projective space CPn ) The definition is formally identical
to that of RPn . CPn is the quotient space of Cn+1 \ {0} modulo the equivalence relation
def
x ∼ y ⇐⇒ ∃λ ∈ C∗ : x = λy.
The open sets Uk are defined similarly and so are the local charts ψk : Uk → Cn . They
satisfy transition rules similar to (1.2.1) so that CPn is a smooth manifold of dimension
2n. ⊔
⊓
In the above example we encountered a special (and very pleasant) situation: the
−1 : U → V
gluing maps not only are smooth, they are also holomorphic as maps ψk ◦ ψm
n
where U and V are open sets in C . This type of gluing induces a “rigidity” in the
underlying manifold and it is worth distinguishing this situation.
Definition 1.2.21 (Complex manifolds). A complex manifold is a smooth, 2n-dimensional
manifold M which admits an atlas {(Ui , ψi ) : Ui → Cn } such that all transition maps are
holomorphic. ⊔
⊓
The complex projective space is a complex manifold. Our next example naturally
generalizes the projective spaces described above.
Example 1.2.22. (The real and complex Grassmannians Grk (Rn ), Grk (Cn ))
Suppose V is a real vector space of dimension n. For every 0 ≤ k ≤ n we denote by
Grk (V ) the set of k-dimensional vector subspaces of V . We will say that Grk (V ) is the
linear Grassmannian of k-planes in E. When V = Rn we will write Grk,n (R) instead of
Grk (Rn ).
We would like to give several equivalent descriptions of the natural structure of smooth
manifold on Grk (V ). To do this it is very convenient to fix an Euclidean metric on V .
Any k-dimensional subspace L ⊂ V is uniquely determined by the orthogonal projec-
tion onto L which we will denote by PL . Thus we can identify Grk (V ) with the set of
rank k projectors
Projk (V ) := P : V → V ; P ∗ = P = P 2 , rank P = k .
Let use observe that the rank of an orthogonal projector is determined by the equality
rank P = tr P.
P : Grk (V ) → Projk (V ), L 7→ PL
16 CHAPTER 1. MANIFOLDS
P 2 = P, tr P = k.
This proves that Projk (V ) is a closed subset of End+ (V ). From the equality
1 1 k
kP k2 = tr P 2 = tr P = , ∀P ∈ Projk (V )
2 2 2
we deduce that Projk (V ) is also bounded subset of End+ (V ). The bijection
P : Grk (V ) → Projk (V ), L 7→ PL
induces a topology on Grk (V ), and with this topology Grk (V ) is a compact metric space.
We want to show that Grk (V ) has a natural structure of smooth manifold compatible
with this topology. To see this, we define for every L ⊂ Grk (V ) the set
Grk (V, L) := U ∈ Grk (V ); U ∩ L⊥ = 0 .
U ∩ L⊥ = 0 ⇐⇒ 1 − PL + PU : V → V is an isomorphism. (1.2.3)
U ∩ L⊥ = 0 ⇐⇒ U + L⊥ = V ⇐⇒ U ⊥ ∩ L = 0.
0 = PL (1 − PL + PU )v = PL PU v = 0 =⇒ PU v ∈ U ∩ ker PL = U ∩ L⊥ = 0.
1.2. SMOOTH MANIFOLDS 17
v ∈ U ⊥ ∩ L = 0.
v = PL⊥ x + PU x ∈ L⊥ + U.
(b) We have to show that, for every K ∈ Grk (V, L), there exists ε > 0 such that any U
satisfying
kPU − PK k < ε
intersects L⊥ trivially. Since K ∈ Grk (V, L) we deduce from (a) that the map 1 −PL −PK :
V → V is an isomorphism. Note that
k(1 − PL − PK ) − (1 − PL − PU )k = kPK − PU k.
The space of isomorphisms of V is an open subset of End(V ). Hence there exists ε > 0
such that, for any subspace U satisfying kPU − PK k < ε, the endomorphism (1 − PL − PU )
is an isomorphism. We now conclude using part (a). ⊔
⊓
is an open cover of Grk (V ). Note that for every L ∈ Grk (V ) we have a natural map
that associates to each linear map S : L → L⊥ its graph (see Figure 1.5)
ΓS = {x + Sx ∈ L + L⊥ = V ; x ∈ L}.
U ∩ ker PL = U ∩ L⊥ = 0.
S : L → L⊥ , Sx = PL⊥ HU x.
We will show that the bijection (1.2.4) is a homeomorphism. We first prove that it is
continuous by providing an explicit description of the orthogonal projection PΓS
18 CHAPTER 1. MANIFOLDS
v
v Γ
L S
Sx
v x L
L
and,
x
PΓS v = .
Sx
Hence PΓS has the block decomposition
1L
PΓS = · (1L + S ∗ S)−1 (1L + S ∗ S)−1 S ∗
S
(1L + S ∗ S)−1 (1L + S ∗ S)−1 S ∗
= . (1.2.5)
S(1L + S ∗ S)−1 S(1L + S ∗ S)−1 S ∗
This proves that the map
is continuous. Note that if U ∈ Grk (V, L), then with respect to the decomposition V =
L + L⊥ the projector PU has the block form
PL PU IL PL PU IL⊥
A B
PU = = ,
C D
PL⊥ PU IL PL⊥ PU IL⊥
Hom(L, L⊥ ) ∋ S 7→ ΓS ∈ Grk (V )
is also continuous. Moreover, the above formulæ show that if U ∈ Grk (V, L0 )∩Grk (V, L1 ),
then we can represent U in two ways,
and the correspondence S0 → S1 is smooth. This shows that Grk (V ) has a natural
structure of smooth manifold of dimension
Exercise 1.2.24. Show that Grk (Cn ) is a complex manifold of complex dimension k(n −
k)). ⊔
⊓
Example 1.2.25. (Lie groups). A Lie group is a smooth manifold G together with a
group structure on it such that the map
is smooth. These structures provide an excellent way to formalize the notion of symmetry.
T · T t = 1.
To describe its smooth structure we will use the Cayley transform trick as in [113] (see
also the classical [133]). Set
Mn (R)# := T ∈ Mn (R) ; det(1 + T ) 6= 0 .
The matrices in Mn (R)# are called non exceptional. Clearly 1 ∈ O(n)# = O(n)∩Mn (R)#
so that O(n)# is a nonempty open subset of O(n). The Cayley transform is the map
# : Mn (R)# → Mn (R) defined by
A 7→ A# = (1 − A)(1 + A)−1 .
Define ΨT : T · O(n)# → o(n) by S 7→ (T −1 · S)# . One can show that the collection
T · O(n)# , ΨT
T ∈O(n)
The group SO(n) is a Lie group as well and dim SO(n) = dim O(n).
(e) The unitary group U (n) is defined as
where T ∗ denotes the conjugate transpose (adjoint) of T . To prove that U (n) is a manifold
one uses again the Cayley transform trick. This time, we coordinatize the group using the
space u(n) of skew-adjoint (skew-Hermitian) n × n complex matrices (A = −A∗ ). Thus
U (n) is a smooth manifold of dimension
Inside U (n) sits the normal subgroup SU (n), the kernel of the group homomorphism
det : U (n) → S 1 . SU (n) is also called the special unitary group. This a smooth manifold
of dimension n2 − 1. In fact the Cayley transform trick allows one to coordinatize SU (n)
using the space
su(n) = {A ∈ u(n) ; tr A = 0}. ⊔
⊓
Exercise 1.2.26. (a) Prove the properties (i)-(iii) of the Cayley transform, and then show
#
that T · O(n) , ΨT T ∈O(n) defines a smooth structure on O(n).
(b) Prove that U (n) and SU (n) are manifolds.
(c) Show that O(n), SO(n), U (n), SU (n) are compact spaces.
(d) Prove that SU (2) is diffeomorphic with S 3 (Hint: think of S 3 as the group of unit
quaternions.) ⊔
⊓
Exercise 1.2.27. Let SL(n; K) denote the group of n × n matrices of determinant 1 with
entries in the field K = R, C. Use the implicit function theorem to show that SL(n; K) is
a smooth manifold of dimension dK (n2 − 1), where dK = dimR K. ⊔
⊓
22 CHAPTER 1. MANIFOLDS
Exercise 1.2.28. (Quillen). Suppose V0 , V1 are two real, finite dimensional Euclidean
space, and T : V0 → V1 is a linear map. We denote by T ∗ is adjoint, T ∗ : V1 → V0 , and by
ΓT the graph of T ,
ΓT = (v0 , v1 ) ∈ V0 ⊕ V1 ; v1 = T v0 l .
We form the skew-symmetric operator
v0 0 T∗ v0
X : V0 ⊕ V1 → V0 ⊕ V1 , X · = · .
v1 −T 0 v1
CT = (1 − X)(1 + X)−1 ,
RT2 = 1, RT∗ = RT ,
Again we face the same question: do we get non-diffeomorphic surfaces for different
choices of g? Figure 1.6 suggests that this may be the case but this is no rigorous argument.
We know another example of compact surface, the projective plane RP2 , and we nat-
urally ask whether it looks like one of the surfaces constructed above. Unfortunately,
we cannot visualize the real projective plane (one can prove rigorously it does not have
enough room to exist inside our 3-dimensional Universe). We have to decide this question
using a little more than the raw geometric intuition provided by a picture. To kill the
suspense, we mention that RP2 does not belong to the family of donuts. One reason is
that, for example, a torus has two faces: an inside face and an outside face (think of a car
rubber tube). RP2 has a weird behavior: it has “no inside” and “no outside”. It has only
one side! One says the torus is orientable while the projective plane is not.
We can now connect sum any numbers of RP2 ’s to any donut an thus obtain more
and more surfaces, which we cannot visualize and we have yet no idea if they are pairwise
distinct. A classical result in topology says that all compact surfaces can be obtained in
this way (see [90]), but in the above list some manifolds are diffeomorphic, and we have
to describe which. In dimension 3 things are not yet settled3 and, to make things look
hopeless, in dimension ≥ 4 Question 1 is algorithmically undecidable .
We can reconsider our goals, and look for all the manifolds with a given property X.
In many instances one can give fairly accurate answers. Property X may refer to more
than the (differential) topology of a manifold. Real life situations suggest the study of
manifolds with additional structure. The following problem may give the reader a taste
of the types of problems we will be concerned with in this book.
Question 2 Can we wrap a planar piece of canvas around a metal sphere in a one-
to-one fashion? (The canvas is flexible but not elastic).
3
Things are still not settled in 2007, but there has been considerable progress due to G. Perelman’s
proof of the Poincaré’s and Thursron’s conjectures.
24 CHAPTER 1. MANIFOLDS
A simple do-it-yourself experiment is enough to convince anyone that this is not pos-
sible. Naturally, one asks for a rigorous explanation of what goes wrong. The best ex-
planation of this phenomenon is contained in the celebrated Theorema Egregium (Golden
Theorem) of Gauss. Canvas surfaces have additional structure (they are made of a special
material), and for such objects there is a rigorous way to measure “how curved” are they.
One then realizes that the problem in Question 2 is impossible, since a (canvas) sphere is
curved in a different way than a plane canvas.
There are many other structures Nature forced us into studying them, but they may
not be so easily described in elementary terms.
A word to the reader. The next two chapters are probably the most arid in geometry
but, keep in mind that, behind each construction lies a natural motivation and, even if we
do not always have the time to show it to the reader, it is there, and it may take a while
to reveal itself. Most of the constructions the reader will have to “endure” in the next two
chapters constitute not just some difficult to “swallow” formalism, but the basic language
of geometry. It might comfort the reader during this less than glamorous journey to carry
in the back of his mind Hermann Weyl’s elegantly phrased advise.
“It is certainly regrettable that we have to enter into the purely formal aspect
in such detail and to give it so much space but, nevertheless, it cannot be
avoided. Just as anyone who wishes to give expressions to his thoughts with
ease must spend laborious hours learning language and writing, so here too the
only way we can lessen the burden of formulæ is to master the technique of
tensor analysis to such a degree that we can turn to real problems that concern
us without feeling any encumbrance, our object being to get an insight into
the nature of space [...]. Whoever sets out in quest of these goals must possess
a perfect mathematical equipment from the outset.”
H. Weyl: Space, Time, Matter.
Chapter 2
Natural Constructions on
Manifolds
The goal of this chapter is to introduce the basic terminology used in differential geometry.
The key concept is that of tangent space at a point which is a first order approximation of
the manifold near that point. We will be able to transport many notions in linear analysis
to manifolds via the tangent space.
(S 2 ) : x2 + y 2 + z 2 = 1 in R3 .
We want to find the plane passing through the North pole N (0, 0, 1) that is “closest” to
the sphere. The classics would refer to such a plane as an osculator plane.
The natural candidate for this osculator plane would be a plane given by a linear
equation that best approximates the defining equation x2 + y 2 + z 2 = 1 in a neighborhood
of the North pole. The linear approximation of x2 + y 2 + z 2 near N seems like the best
candidate. We have
x2 + y 2 + z 2 − 1 = 2(z − 1) + O(2),
where O(2) denotes a quadratic error. Hence, the osculator plane is z = 1, Geometrically,
it is the horizontal affine plane through the North pole. The linear subspace {z = 0} ⊂ R3
is called the tangent space to S 2 at N .
The above construction has one deficiency: it is not intrinsic, i.e., it relies on objects
“outside” the manifold S 2 . There is one natural way to fix this problem. Look at a smooth
path γ(t) on S 2 passing through N at t = 0. Hence, t 7→ γ(t) ∈ R3 , and
|γ(t)|2 = 1. (2.1.1)
25
26 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
If we differentiate (2.1.1) at t = 0 we get (γ̇(0), γ(0)) = 0, i.e., γ̇(0) ⊥ γ(0), so that γ̇(0) lies
in the linear subspace z = 0. We deduce that the tangent space consists of the tangents
to the curves on S 2 passing through N .
This is apparently no major conceptual gain since we still regard the tangent space
as a subspace of R3 , and this is still an extrinsic description. However, if we use the
stereographic projection from the South pole we get local coordinates (u, v) near N , and
any curve γ(t) as above can be viewed as a curve t 7→ (u(t), v(t)) in the (u, v) plane. If
φ(t) is another curve through N given in local coordinates by t 7→ (u(t), v(t)), then
γ̇(0) = φ̇(0) ⇐⇒ u̇(0), v̇(0) = u̇(0), v̇(0) .
The right hand side of the above equality defines an equivalence relation ∼ on the set of
smooth curves passing trough (0, 0). Thus, there is a bijective correspondence between the
tangents to the curves through N , and the equivalence classes of “∼”. This equivalence
relation is now intrinsic modulo one problem: “∼” may depend on the choice of the local
coordinates. Fortunately, as we are going to see, this is a non-issue. ⊔
⊓
where
α(t) = (xα (t)) := x1α (t), . . . , xm
α (t) ,
and
β(t) = (xβ (t)) = x1β (t), . . . , xm
β (t) .
We write this α ∼1 β. ⊔
⊓
Proof. Choose local coordinates (x1 , . . . , xm ) near p such that xi (p) = 0, ∀i, and let α and
β be two smoothcurves through p. In the above local coordinates the curves α, β become
xiα (t) , xiβ (t) . Construct a new curve γ through p whose x coordinates are given by
xiγ (t) = xiα (t) + xiβ (t) .
We want to emphasize that the above curve depends on α, β and the choice of local
coordinates (x1 , . . . , xm ). We will indicate this using the notation α +x β when referring
to γ. Set
[α̇(0)] + [β̇(0)] := [γ̇(0)].
The zero vector in Tp M is described by a curve α such that ẋiα (0) = 0, ∀i. For this
operation to be well defined one has to check several things.
(a) The equivalence class [γ̇(0)] is independent of coordinates. In other words, if
(x1 , . . . , xm ) and (y 1 , . . . , y m ) are local coordinates near p such that
then
α +x β ∼1 α +y β.
(b) If a curve represents the zero vector in some local coordinates (xi ) at p, then it
represents the zero vector in any other choice of local coordinates at p.
(c) If [α˙1 (0)] = [α˙2 (0)] and [β˙1 (0)] = [β˙2 (0)], then
From this point on we will omit the brackets [ – ] in the notation of a tangent vector.
Thus, [α̇(0)] will be written simply as α̇(0).
As one expects, all the above notions admit a nice description using local coordinates.
Let (x1 , . . . , xm ) be coordinates near p ∈ M such that xi (p) = 0, ∀i. Consider the curves
We set
∂xk (p) := ėk (0) = δk1 , . . . , δkn . (2.1.2)
Often, when the point p is clear from the context, we will omit it in the above notation.
☞ Note that the vectors ∂xk (p) depend on the choice of local coordinates (x1 , . . . , xm ).
28 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
Lemma 2.1.7. The collection ∂xk (p) 1≤k≤m
is a basis of Tp M .
we have
α̇(0) = ẋ1α (0), . . . , ẋm 1 m
α (0) = ẋα (0) 1, 0, . . . , 0 + · · · + ẋα (0) 0, . . . , 0, 1
Xm
(2.1.3)
= ẋ1α (0)∂x1 + · · · + ẋm
α (0)∂ x m = ẋiα (0)∂xi .
i=1
⊔
⊓
Deduce from the above that Tp0 M can be identified with the vector subspace of Rn
spanned by the vectors ~v ∈ Rn with the property that there exists a smooth path α :
(ε, ε) → M ⊂ Rn such that
dα(0)
α(0) = p0 , = ~v . ⊔
⊓
dt
Exercise 2.1.9. Let F : RN → Rk be a smooth map. Assume that
(a) M = F −1 (0) 6= ∅;
(b) rank Dx F = k, for all x ∈ M .
Then M is a smooth manifold of dimension N − k and
Tx M ∼
= ker Dx F, ∀x ∈ M. ⊔
⊓
Example 2.1.10. We want to describe T1 G, where G is one of the Lie groups discussed
in Section 1.2.2.
(a) G = O(n). Let (−ε, ε) ∋ s 7→ T (s) be a smooth path of orthogonal matrices such that
T (0) = 1. Then T t (s) · T (s) = 1. Differentiating this equality at s = 0 we get
Ṫ t (0) + Ṫ (0) = 0.
The matrix Ṫ (0) defines a vector in T1 O(n), so the above equality states that this tangent
space lies inside the space of skew-symmetric matrices, i.e., T1 O(n) ⊂ o(n). On the other
hand, we proved in Section 1.2.2 that dim G = dim o(n) so that
T1 O(n) = o(n).
2.1. THE TANGENT BUNDLE 29
(b) G = SL(n; R). Let (−ε, ε) ∋ s 7→ T (s) be a smooth path in SL(n; R) such that
T (0) = 1. Then det T (s) = 1 and differentiating this equality at s = 0 we get (see
Exercise 1.1.3)
tr Ṫ (0) = 0.
Thus, the tangent space at 1 lies inside the space of traceless matrices, i.e. T1 SL(n; R) ⊂
sl(n; R). Since (according to Exercise 1.2.27) dim SL(n; R) = dim sl(n; R) we deduce
• which tangent space does it belong to, i.e., we know the point p = π(v) ∈ M ,
• the coordinates of v in the basis ∂xi (p) ,
X
v= X i (v)∂xi (p).
i
Ψx : T U → U x × Rm ⊂ Rm × Rm ,
where U x is the image of U in Rm via the coordinates (xi ). We can now use the map Ψx
to transfer the topology on Rm × Rm to T U . Again, we have to make sure this topology
is independent of local coordinates.
To see this, pick a different coordinate system y = (y i ) on U . The coordinate inde-
pendence referred to above is equivalent to the statement that the transition map
Ψy ◦ Ψ−1 x m y
x : U × R −→ T U −→ U × R
m
is a homeomorphism.
30 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
α(t) = x + tX.
Proof. Recall that Tp M is the space of tangent vectors to curves through p. If α(t) is such
a curve (α(0) = p), then β(t) = f (α(t)) is a smooth curve in N through q = f (p), and we
define
Df (α̇(0)) := β̇(0).
One checks easily that if α1 ∼1 α2 , then f (α1 ) ∼1 f (α2 ), so that Df is well defined. To
prove that the map Df : Tp M → Tq N is linear it suffices to verify this in any particular
local coordinates (x1 , . . . , xm ) near p, and (y 1 , . . . , y n ) near q, such that xi (p) = 0, y j (q) =
0, ∀i, j, since any two choices differ (infinitesimally) by a linear substitution. Hence, we
can regard f as a collection of maps
Then
F (α(t)) = yα1 (t), . . . , yαn (t) = y 1 ( xiα (t) ), . . . , y n ( xiα (t) ) ,
F (β(t)) = yβ1 (t), . . . , yβn (t) = y 1 ( xiβ (t) ), . . . , y n ( xiβ (t) ) ,
F (α(t) + β(t) ) = y 1 ( xiα (t) + xiβ (t) ), . . . , y n ( xiα (t) + xiβ (t) ) .
n n m
!
d (2.1.3) X X X ∂y j
DF α̇(0) = t=0
F (α(t)) = ẏαj (0)∂yj = ẋiα (0) ∂yj ,
dt ∂xi
j=1 j=1 i=1
n n m
!
d (2.1.3 ) X X X ∂y j i
DF β̇(0) = F (β(t)) = ẏβj (0)∂yj = ẋ (0) ∂yj ,
dt t=0 ∂xi β
j=1 j=1 i=1
n X
d X m
∂y j i i
DF α̇(0) + β̇(0) = t=0
F α(t) + β(t) = ẋ α (0) + ẋ β (0) ∂yj
dt ∂xi
j=1 i=1
= DF α̇(0) + DF β̇(0) .
This shows
jthat Df : Tp M → Tq N is the linear operator given in these bases by the
∂y
matrix ∂xi . In particular, this implies that Df is also smooth. ⊔
⊓
1≤j≤n, 1≤i≤m
2.1.3 Transversality
In this subsection we want to explain rigorously a phenomenon with which the reader may
already be intuitively acquainted. We describe it first in a special case.
Suppose M is a submanifold of dimension 2 in R3 . Then, a simple thought experiment
suggests that most horizontal planes will not be tangent to M . Equivalently, if we denote
by f the restriction to M of the function (x, y, z) 7→ z , then for most real numbers h the
level set f −1 (h) = M ∩ {z = h} does not contain a point where the differential of f is
zero, so that most level sets f −1 (h) are smooth submanifolds of M of codimension 1, i.e.,
smooth curves on M .
We can ask a more general question. Given two smooth manifolds X, Y , a smooth map
f : X → Y , is it true that for “most” y ∈ Y the level set f −1 (y) is a smooth submanifold of
X of codimension dim Y ? This question has a positive answer, known as Sard’s theorem.
Definition 2.1.14. Suppose that Y is a smooth, connected manifold of dimension m.
(a) We say that a subset S ⊂ Y is negligible if, for any coordinate chart of Y , Ψ : U →
Rm , the set Ψ(S ∩ U ) ⊂ Rm has Lebesgue measure zero in Rm .
Definition 2.1.16. Suppose U, V are finite dimensional real Euclidean vector spaces,
O ⊂ U is an open subset, and F : O → V is a smooth map. Then a point u ∈ O is a
critical point of F if and only if
rank (Du F : U → V ) < min dim U, dim V . ⊔
⊓
Exercise 2.1.17. Show that if F : M → N is a smooth map and dim N ≤ dim M , hen for
every q ∈ N \ ∆F the fiber F −1 (q) is either empty, or a submanifold of M of codimension
dim N . ⊔
⊓
Theorem 2.1.18 (Sard). Suppose U, V are finite dimensional real Euclidean vector spaces,
O ⊂ U is an open subset, and F : O → V is a smooth map. Assume dim V ≤ dim U .
Then the discriminant set ∆F is negligible.
Proof. We follow the elegant approach of J. Milnor [97] and L. Pontryagin [111]. Set
n = dim U , and m = dim V . We will argue inductively on the dimension n.
For every positive integer k we denote by Cr kF ⊂ Cr F the set of points u ∈ O such
that all the partial derivatives of F up to order k vanish at u. We obtain a decreasing
filtration of closed sets
Cr F ⊃ Cr 1F ⊃ Cr 2F ⊃ · · · .
The case n = 0 is trivially true so we may assume n > 0, and the statement is true for
any n′ < n, and any m ≤ n′ . The inductive step is divided into three intermediary steps.
Step 1. The set F (Cr F \ Cr 1F ) is negligible.
Step 2. The set F (Cr kF \ Cr k+1 F ) is negligible for all k ≥ 1.
k
Step 3. The set F (Cr F ) is negligible for some sufficiently large k.
Step
1. Set Cr ′F := Cr F \ Cr 1F . We will show that there exists a countable open cover
Oj j≥1 of Cr ′F such that F (Oj ∩Cr ′F is negligible for all j ≥ 1. Since Cr ′F is separable,
it suffices to prove that every point u ∈ Cr ′F admits an open neighborhood N such that
F N ∩ Cr ′F is negligible.
Suppose u0 ∈ Cr ′F . Assume first that there exist local coordinates (x1 , . . . , , xn )
defined in a neighborhood N of u0 , and local coordinates (y 1 , . . . , y m ) near v0 = F (u0 )
such that,
xi (u0 ) = 0, ∀i = 1, . . . n, y j (v0 ) = 0, ∀j = 1, . . . m,
and the restriction of F to N is described by functions y j = y j (x1 , . . . , xm ) such that
y 1 = x1 .
For every t ∈ R we set
Nt := (x1 , . . . , xn ) ∈ N; x1 = t ,
2.1. THE TANGENT BUNDLE 33
and we define
Gt : Nt → Rm−1 , (t, x2 , . . . , xn ) 7→ y 2 (t, x2 , . . . , xn ), . . . , y m (t, x2 , . . . , xn ) .
Observe that [
F N ∩ Cr ′F = {t} × Gt (Cr Gt ).
t
The inductive assumption implies that the sets Gt (Cr Gt ) have trivial (m − 1)-dimensional
Lebesgue measure. Using Cavalieri’s principle or Fubini’s theorem we deduce that F (N ∩
Cr ′F ) has trivial m-dimensional Lebesgue measure.
To conclude Step 1 is suffices to prove that the above simplifying assumption concerning
the existence of nice coordinates is always fulfilled. To see this, choose local coordinates
(s1 , . . . , sn ) near u0 and coordinates (y 1 , . . . , y m ) near v0 such that
si (u0 ) = 0, ∀i = 1, . . . n, y j (v0 ) = 0, ∀j = 1, . . . m,
x1 = y 1 (s1 , . . . , sn ), xi = si , ∀i = 2, . . . , n.
The implicit function theorem shows that the collection of functions (x1 , . . . , xn ) defines
a coordinate system in a neighborhood of u0 . We regard y j as functions of xi . From the
definition we deduce y 1 = x1 .
(k) (k)
Step 2. Set Cr F := Cr kF \ Cr k+1 F . Since u0 ∈ Cr F , we can find local coordinates
(s1 , . . . , sn ) near u0 and coordinates (y 1 , . . . , y m ) near v0 such that
si (u0 ) = 0, ∀i = 1, . . . n, y j (v0 ) = 0, ∀j = 1, . . . m,
∂ j y1
(u0 ) = 0, ∀j = 1, . . . , k,
∂(s1 )j
and
∂ k+1 y 1
(u0 ) 6= 0.
∂(s1 )k+1
Define
∂ k y1
x1 (s) = ,
∂(s1 )k
and set xi := si , ∀i = 2, . . . , n.
Then the collection (xi ) defines smooth local coordinates on an open neighborhood N
of u0 , and Cr kF ∩N is contained in the hyperplane {x1 = 0}. Define
Then
Cr kG ∩N = Cr kG , F (Cr kF ∩N) = G(Cr kG ),
34 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
(k)
and the induction assumption implies that G(Cr kG ) is negligible. By covering Cr F with
a countably many open neighborhood {Nℓ }ℓ>1 such that F (Cr kF ∩Nℓ ) is negligible we
(k)
conclude that F (Cr F ) is negligible.
n
Step 3. Suppose k > m . We will prove that F (Cr kF ) is negligible. More precisely, we
will show that, for every compact subset S ⊂ O, the set F (S ∩ Cr kF ) is negligible.
From the Taylor expansion around points in Cr kF ∩S we deduce that there exist num-
bers r0 ∈ (0, 1) and λ0 > 0, depending only on S, such that, if C is a cube with edge
r < r0 which intersects Cr kF ∩S, then
Cover Cr kF ∩S by finitely many cubes {Cℓ }1≤ℓ≤L , of edges < r0 , such that their interiors
are disjoint. For every positive integer P , subdivide each of the cubes Cℓ into P n sub-cubes
Cℓσ of equal sizes. For every sub-cube Cℓσ which intersects Cr kF we have
λ1
µm ( F (Cℓσ ) ) ≤ λ1 µn (Cℓσ )mk/n = µn (Cℓ ).
P mk
We deduce that
X
µm F (Cℓ ∩ Cr kF ) ≤ µm ( F (Cℓσ ∩ Cr kF ) ) ≤ P n−mk µn (C).
σ
n
If we let P → ∞ in the above inequality, we deduce that when k > m we have
µm ( F (Cℓ ∩ Cr kF ) ) = 0, ∀ℓ = 1, . . . , L. ⊔
⊓
ZS := F −1 (S) ⊂ Λ × X.
ZS ⊂ Λ × X → Λ,
(b) From (2.1.4) we deduce that there exists a trivializing cover, i.e., an open cover U
of M , and for every U ∈ U a diffeomorphism
ΨU : E |U → U × F, v 7→ (p = π(v), ΦU
p (v))
p 7→ ΦV U (p) ∈ Aut(F )
is smooth.
36 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
In our special case, the map ΦV U (p) is explicitly defined by the matrix (2.1.4)
j
∂y
A(p) = (p) .
∂xi 1≤i,j≤m
In the above formula, the functions (xi ) are local coordinates on U , and the functions (y j )
are local coordinates on V .
The properties (a) and (b) make no mention of the special relationship between E =
T M and M . There are many other quadruples (E, π, M, F ) with these properties and
they deserve a special name.
Definition 2.1.23. A vector bundle is a quadruple (E, π, M, F ) such that
• π : E → M is a surjective submersion,
The manifold E is called the total space, and M is called the base space. The vector
space F is called the standard fiber, and its dimension (over the field of scalars K) is called
the rank of the bundle. A line bundle is a vector bundle of rank one. ⊔
⊓
Roughly speaking, a vector bundle is a smooth family of vector spaces. Note that the
properties (b1) and (b2) imply that the fibers π −1 (p) of a vector bundle have a natural
structure of linear space. In particular, one can add elements in the same fiber. Moreover,
the addition and scalar multiplication operations on π −1 (p) depend smoothly on p. The
smoothness of the addition operation this means that the addition is a smooth map
+ : E ×M E = (u, v) ∈ E × E; π(u) = π(v) → E.
R × E → E.
There is an equivalent way of defining vector bundles. To describe it, let us introduce
a notation. For any vector space F over the field K = R, C we denote by GLK (F ), (or
simply GL(F ) if there is no ambiguity concerning the field of scalars K) the Lie group of
linear automorphisms F → F .
According to Definition 2.1.23, we can find an open cover (Uα ) of M such that each of
the restrictions Eα = E |Uα is isomorphic to a product Ψα : Eα ∼ = F × Uα . Moreover, on
the overlaps Uα ∩ Uβ , the transition maps gαβ = Ψα Ψ−1
β can be viewed as smooth maps
gαβ : Uα ∩ Uβ → GL(F ).
(a) gαα = 1F
gαβ : Uα ∩ Uβ → GL(F )
satisfying the cocycle condition, we can reconstruct a vector bundle by gluing the product
bundles Eα = F × Uα on the overlaps Uα ∩ Uβ according to the gluing rules
the point (v, x) ∈ Eα is identified with the point gβα (x)v, x ∈ Eβ ∀x ∈ Uα ∩ Uβ .
(a) Show that ∼ is an equivalence relation, and E = X/ ∼ equipped with the quotient
topology has a natural structure of smooth manifold.
(b) Show that the projection π : X → M , (u, x, α) 7→ x descends to a submersion E → M .
(c) Prove that (E, π, M, V ) is naturally a smooth vector bundle. ⊔
⊓
⊔
Exercise 2.1.26. Find a gluing cocycle description of the tangent bundle of the 2-sphere.⊓
In the sequel, we will prefer to think of vector bundles in terms of gluing cocycles.
π
Definition 2.1.27. (a) A section in a vector bundle E → M defined over the open subset
U ⊂ M is a smooth map s : U → E such that
s(p) ∈ Ep = π −1 (p), ∀p ∈ U ⇐⇒ π ◦ s = 1U .
38 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
The space of smooth sections of E over U will be denoted by Γ(U, E) or C ∞ (U, E). Note
that Γ(U, E) is naturally a vector space. We will use the simpler notation C ∞ (E) when
referring to the space of sections of E over M .
(b) A section of the tangent bundle of a smooth manifold is called a vector field on that
manifold. The space of vector fields over on open subset U of a smooth manifold is denoted
by Vect (U ). ⊔
⊓
• The map F covers f , i.e., F (Ep1 ) ⊂ Ef2(p) , ∀p ∈ M1 . Equivalently, this means that
the diagram below is commutative
E1
F
w E2
π1 π2
u u
M1
f
w M2
• The induced map F : Ep1 → Ef2(p) is linear.
The composition of bundle maps is defined in the obvious manner and so is the identity
morphism so that one can define the notion of bundle isomorphism in the standard way.
(b) If E and F are two vector bundles over the same manifold, then we denote by
Hom(E, F ) the space of bundle maps E → F which cover the identity 1M . Such bundle
maps are called bundle morphisms. ⊔
⊓
Example 2.1.32. Consider the trivial vector bundle RnM → M over the smooth manifold
M . A section of this vector bundle is a smooth map u : M → Rn . We can think of u as a
smooth family of vectors (u(x) ∈ Rn )x∈M .
An endomorphism of this vector bundle is a smooth map A : M → EndR (Rn ). We can
think of A as a smooth family of n × n matrices
1
a1 (x) a12 (x) · · · a1n (x)
a2 (x) a2 (x) · · · a2 (x)
1 2 n
Ax = .. .. .. .. , aij (x) ∈ C ∞ (M ).
. . . .
an1 (x) an2 (x) · · · ann (x)
The map A is a gauge transformation if and only if det Ax 6= 0, ∀x ∈ M . ⊔
⊓
Exercise 2.1.33. Suppose E1 , E2 → M are two smooth vector bundles over the smooth
manifold with standard fibers F1 , and respectively F2 . Assume that both bundles are
defined by a common trivializing cover (Uα )α∈A and gluing cocycles
gβα : Uαβ → GL(F1 ), hβα : Uαβ → GL(F2 ).
Prove that there exists a bijection between the vector space of bundle morphisms Hom(E, F ),
and the set of families of smooth maps
Tα : Uα → Hom(F1 , F2 ); α ∈ A ,
satisfying the gluing conditions
−1
Tβ (x) = hβα (x)Tα (x)(x)gβα , ∀x ∈ Uαβ . ⊔
⊓
Exercise 2.1.34. Let V be a vector space, M a smooth manifold, {Uα } an open cover
of M , and gαβ , hαβ : Uα ∩ Uβ → GL(V ) two collections of smooth maps satisfying the
cocycle conditions. Prove the two collections define isomorphic vector bundles if and only
they are cohomologous, i.e., there exist smooth maps φα : Uα → GL(V ) such that
hαβ = φα gαβ φ−1
β . ⊔
⊓
If π denotes the natural projection π : Uk,n → Grk (Cn ), then for each L ∈ Grk (Cn ) the
fiber over L coincides with the subspace in Cn defined by L. Note that UC C
n−1 = U1,n . ⊔
⊓
gβα : Uα ∩ Uβ → GL(1, C) ∼
= C∗ ,
Consider for example the manifold M = S 2 ⊂ R3 . Denote as usual by N and S the North
and respectively South pole. We have an open cover
S 2 = U0 ∪ U∞ , U0 = S 2 \ {S}, U1 = S 2 \ {N }.
In this case, we have only a single nontrivial overlap, UN ∩ US . Identify U0 with the
complex line C, so that the North pole becomes the origin z = 0.
For every n ∈ Z we obtain a complex line bundle Ln → S 2 , defined by the open cover
{U0 , U1 } and gluing cocycle
u0 : U0 → C, u1 : U1 → C,
which along the overlap U0 ∩ U1 satisfy the equality u1 (z) = z −n u0 (z). For example, if
n ≥ 0, the pair of functions
u0 (z) = z n , u1 (p) = 1, ∀p ∈ U1
Exercise 2.1.39. We know that CP1 is diffeomorphic to S 2 . Prove that the universal
line bundle Un → CP1 is isomorphic with the line bundle L−1 constructed in the above
example. ⊔
⊓
2.1. THE TANGENT BUNDLE 41
Prove that the closure of I in Cn+1 × CPn is a smooth manifold diffeomorphic to the total
space of the universal line bundle Un → CPn . This manifold is called the complex blowup
of Cn+1 at the origin. ⊔
⊓
The family of vector bundles is very large. The following construction provides a very
powerful method of producing vector bundles.
Definition 2.1.41. Let f : X → M be a smooth map, and E a vector bundle over M
defined by an open cover (Uα ) and gluing cocycle (gαβ ). The pullback of E by f is the
vector bundle f ∗ E over X defined by the open cover f −1 (Uα ), and the gluing cocycle
( gαβ ◦ f ). ⊔
⊓
One can check easily that the isomorphism class of the pullback of a vector bundle E is
independent of the choice of gluing cocycle describing E. The pullback operation defines
a linear map between the space of sections of E and the space of sections of f ∗ E.
More precisely, if s ∈ Γ(E) is defined by the open cover (Uα ), and the collection of
smooth maps ( sα ), then the pullback f ∗ s is defined by the open cover f −1 (Uα ), and
the smooth maps ( sα ◦ f ). Again, there is no difficulty to check the above definition is
independent of the various choices.
Exercise 2.1.42. For every positive integer k consider the map
Exercise 2.1.43. Let E → X be a rank k (complex) smooth vector bundle over the
manifold X. Assume E is ample, i.e. there exists a finite family s1 , . . . , sN of smooth
sections of E such that, for any x ∈ X, the collection {s1 (x), . . . , sN (x)} spans Ex . For
each x ∈ X we set n X o
Sx := v ∈ CN ; v i si (x) = 0 .
i
Exercise 2.1.44. Show that any vector bundle over a smooth compact manifold is ample.
Thus any vector bundle over a compact manifold is a pullback of some tautological bundle!
⊔
⊓
The notion of vector bundle is trickier than it may look. Its definition may suggest
that a vector bundle looks like a direct product manif old×vector space since this happens
at least locally. We will denote by KnM the bundle Kr × M → M .
42 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
π
Definition 2.1.45. A rank r vector bundle E → M (over the field K = R, C) is called
trivial or trivializable if there exists a bundle isomorphism E ∼ = KrM . A bundle isomor-
phism E → KrM is called a trivialization of E, while an isomorphism Kr → E is called a
framing of E.
A pair (trivial vector bundle, trivialization) is called a trivialized, or framed bundle. ⊔
⊓
r
Remark 2.1.46. Let us explain why we refer to a bundle isomorphism ϕ : KM → E as
a framing.
Denote by (e1 , . . . , er ) the canonical basis of Kr . We can also regard the vectors
ei as constant maps M → Kr , i.e., as (special) sections of KrM . The isomorphism ϕ
determines sections fi = ϕ(ei ) of E with the property that for every x ∈ M the collection
(f1 (x), . . . , fr (x) ) is a frame of the fiber Ex .
This observation shows that we can regard any framing of a bundle E → M of rank r
as a collection of r sections u1 , . . . , ur which are pointwise linearly independent. ⊔
⊓
One can naively ask the following question. Is every vector bundle trivial? We can
even limit our search to tangent bundles. Thus we ask the following question.
Is it true that for every smooth manifold M the tangent bundle T M is trivial (as a
vector bundle)?
Let us look at some positive examples.
Example 2.1.47. T S 1 ∼ = RS 1 Let θ denote the angular coordinate on the circle. Then
∂θ is a globally defined, nowhere vanishing vector field on S 1 . We thus get a map
RS 1 → T S 1 , (s, θ) 7→ (s∂θ , θ) ∈ Tθ S 1
Dθ |θ=0 Rθ : T1 S 1 → Tθ S 1 .
Moreover,
∂θ = Dθ |θ=0 Rθ j.
The existence of the trivializing vector field ∂θ is due to to our ability to “move freely and
coherently” inside S 1 . One has a similar freedom inside a Lie group as we are going to see
in the next example. ⊔
⊓
Example 2.1.48. For any Lie group G the tangent bundle T G is trivial.
To see this let n = dim G, and consider e1 , . . . , en a basis of the tangent space at the
origin, T1 G. We denote by Rg the right translation (by g) in the group defined by
Rg : x 7→ x · g, ∀x ∈ G.
2.2. A LINEAR ALGEBRA INTERLUDE 43
Rg is a diffeomorphism with inverse Rg−1 so that the differential DRg defines a linear
isomorphism DRg : T1 G → Tg G. Set
Ei (g) = DRg (ei ) ∈ Tg G, i = 1, · · · , n.
Since the multiplication G × G → G, (g, h) 7→ g · h is a smooth map we deduce that the
vectors Ei (g) define smooth vector fields over G. Moreover, for every g ∈ G, the collection
{E1 (g), . . . , En (g)} is a basis of Tg G so we can define without ambiguity a map
X
Φ : RnG → T G, (g; X 1 , . . . X n ) 7→ (g; X i Ei (g)).
One checks immediately that Φ is a vector bundle isomorphism and this proves the claim.
In particular T S 3 is trivial since the sphere S 3 is a Lie group (unit quaternions). (Using
the Cayley numbers one can show that T S 7 is also trivial; see [112] for details). ⊔
⊓
We see that the tangent bundle T M of a manifold M is trivial if and only if there exist
vector fields X1 , . . . , Xm (m = dim M ) such that for each p ∈ M , X1 (p), . . . , Xm (p) span
Tp M . This suggests the following more refined question.
Problem Given a manifold M , compute v(M ), the maximum number of pointwise linearly
independent vector fields over M . Obviously 0 ≤ v(M ) ≤ dim M and T M is trivial if and
only if v(M ) = dim M . A special instance of this problem is the celebrated vector field
problem: compute v(S n ) for any n ≥ 1.
We have seen that v(S n ) = n for n =1,3 and 7. Amazingly, these are the only cases
when the above equality holds. This is a highly nontrivial result, first proved by J.F.Adams
in [3] using very sophisticated algebraic tools. This fact is related to many other natural
questions in algebra. For a nice presentation we refer to [91].
The methods we will develop in this book will not suffice to compute v(S n ) for any n,
but we will be able to solve “half” of this problem. More precisely we will show that
v(S n ) = 0 if and only if n is an even number.
In particular, this shows that T S 2n is not trivial. In odd dimensions the situation is far
more elaborate (a complete answer can be found in [3]).
Exercise 2.1.49. v(S 2k−1 ) ≥ 1 for any k ≥ 1. ⊔
⊓
Equivalently, the vector space T(E, F ) can be identified with the space of functions c :
E ×F → K with finite support. The space T(E, F ) has a natural basis consisting of “Dirac
functions”
1 if (x, y) = (e, f )
δe,f : E × F → K, (x, y) 7→
0 if (x, y) 6= (e, f )
In particular, we have an injection1
λδe,f − δλe,f , λδe,f − δe,λf , δe+e′ ,f − δe,f − δe′ ,f , δe,f +f ′ − δe,f − δe,f ′ ,
E ⊗K F := T(E, F )/R(E, F ),
e ⊗ f := π(δe,f ).
'' w E ⊗ F
ι
E×F
'' Φ
φ )' u . ⊔
⊓
The proof of this result is left to the reader as an exercise. Note that if (ei ) is a basis
of E, and (fj ) is a basis of F , then (ei ⊗ fj ) is a basis of E ⊗ F , and therefore
Exercise 2.2.2. Using the universality property of the tensor product prove that there
exists a natural isomorphism E ⊗ F ∼
= F ⊗ E uniquely defined by e ⊗ f 7→ f ⊗ e. ⊔
⊓
The above construction can be iterated. Given three vector spaces E1 , E2 , E3 over
the same field of scalars K we can construct two triple tensor products:
(E1 ⊗ E2 ) ⊗ E3 ∼
= E1 ⊗ (E2 ⊗ E3 ). ⊔
⊓
The above exercise implies that there exists a unique (up to isomorphism) triple tensor
product which we denote by E1 ⊗ E2 ⊗ E3 . Clearly, we can now define multiple tensor
products: E1 ⊗ · · · ⊗ En .
Definition 2.2.4. (a) For any two vector spaces U, V over the field K we denote by
Hom(U, V ), or HomK (U, V ) the space of K-linear maps U → V .
(b) The dual of a K-linear space E is the linear space E ∗ defined as the space HomK (E, K)
of K-linear maps E → K. For any e∗ ∈ E ∗ and e ∈ E we set
he∗ , ei := e∗ (e). ⊔
⊓
The above constructions are functorial. More precisely, we have the following result.
and satisfying
(S1 ⊗ S2 ) ◦ (T1 ⊗ T2 ) = (S1 ◦ T1 ) ⊗ (S2 ◦ T2 ).
(b) Any linear operator A : E → F induces a linear operator A† : F ∗ → E ∗ uniquely
defined by
hA† f ∗ , ei = hf ∗ , Aei, ∀e ∈ E, f ∗ ∈ F ∗ .
The operator A† is called the transpose or adjoint of A. Moreover,
Remark 2.2.7. Any basis (ei )1≤i≤n of the n-dimensional K-vector space determines a
basis (ei )1≤i≤n of the dual vector space V ∗ uniquely defined by the conditions
(
1 i=j
hei , ej i = δji =
0 i 6= j.
We say that the basis (ei ) is dual to the basis (ej ). The quantity (δji ) is called the
Kronecker symbol.
A vector v ∈ V admits a decomposition
n
X
v= v i ei ,
i=1
Moreover,
n
X
hv ∗ , vi = vi∗ v i .
i=1
Classically, a vector v in V is represented by a one-column matrix
1
v
..
v = . ,
vn
v ∗ = [v1∗ . . . vn∗ ].
Then
v1
hv ∗ , vi = [v1∗ . . . vn∗ ] · ... ,
vn
where the · denotes the multiplication of matrices. ⊔
⊓
Using the functoriality of the tensor product and of the dualization construction one
proves easily the following result.
Proposition 2.2.8. (a) There exists a natural isomorphism
E∗ ⊗ F ∗ ∼
= (E ⊗ F )∗ ,
uniquely defined by
E ∗ ⊗ F ∗ ∋ e∗ ⊗ f ∗ 7−→ Le∗ ⊗f ∗ ∈ (E ⊗ F )∗ ,
2.2. A LINEAR ALGEBRA INTERLUDE 47
where
hLe∗ ⊗f ∗ , x ⊗ yi = he∗ , xihf ∗ , yi, ∀x ∈ E, y ∈ F.
In particular, this shows E ∗ ⊗F ∗ can be naturally identified with the space of bilinear maps
E × F → K.
(b) The adjunction morphism E ∗ ⊗ F → Hom (E, F ), given by
where
Te∗ ⊗f (x) := he∗ , xif, ∀x ∈ E,
is an isomorphism.2 ⊔
⊓
Tsr (V ) := V ⊗r ⊗ (V ∗ )⊗s ,
Example 2.2.10. According to Proposition 2.2.8 a tensor of type (1, 1) can be identified
with a linear endomorphism of V , i.e.,
T11 (V ) ∼
= End (V ),
V · · × V} → K.
| × ·{z ⊔
⊓
k
A tensor of type (r, 0) is called contravariant, while a tensor of type (0, s) is called
covariant. The tensor algebra of V is defined to be
M
T(V ) := Tsr (V ).
r,s
We use the term algebra since the tensor product induces bilinear maps
′ r+r ′
⊗ : Tsr × Tsr′ → Ts+s ′.
T = Tji11...j
...ir
e ⊗ · · · ⊗ eir ⊗ ej1 ⊗ · · · , ⊗ejs ,
s i1
where we use Einstein convention to sum over indices which appear twice, once as a
superscript, and the second time as a subscript.
Using the adjunction morphism in Proposition 2.2.8, we can identify the space T11 (V )
with the space End(V ) a linear isomorphisms. Using the bases (ei ) and (ej ), and Einstein’s
convention, the adjunction identification can be described as the correspondence which
associates to the tensor A = aij ei ⊗ ej ∈ T11 (V ), the linear operator LA : V → V which
maps the vector v = v j ej to the vector LA v = aij v j ei . ⊔
⊓
uniquely defined by
∀vi ∈ V, uj ∈ V ∗ .
In the coordinates determined by a basis (ei ) of V , the contraction can be described
as
(tr T )ji22...i
...js
r
= T ii2 ...ir
ij2 ...js ,
where again we use Einstein’s convention. In particular, we see that the contraction
coincides with the usual trace on T11 (V ) ∼
= End (V ).
In this subsection we will describe two subspaces invariant under this action. These
are special instances of the so called Schur functors. (We refer to [46] for more general
constructions.) Define
1 X
S r : T r (V ) → T r (V ), S r (t) := σt,
r!
σ∈Sr
and P
1
r r r! σ∈Srǫ(σ)σt if r ≤ dim V
Ar : T (V ) → T (V ), Ar (t) := .
0 if r > dim V
Above, we denoted by ǫ(σ) the signature of the permutation σ. Note that
A0 = S 0 = 1K .
The following results are immediate. Their proofs are left to the reader as exercises.
Lemma 2.2.13. The operators Ar and S r are projectors of T r (V ), i.e.,
S 2r = S r , A2r = Ar .
Moreover,
σS r (t) = S r (σt) = S r (t), σAr (t) = Ar (σt) = ǫ(σ)Ar (t), ∀t ∈ T r (V ). ⊔
⊓
Definition 2.2.14. A tensor T ∈ T r (V ) is called symmetric (respectively skew-symmetric)
if
S r (T ) = T (respectively Ar (T ) = T ).
The nonnegative integer r is called the degree of the (skew-)symmetric tensor.
The space of symmetric tensors (respectively skew-symmetric ones) of degree r will be
denoted by S r V (and respectively Λr V ). ⊔
⊓
Set M M
S • V := S r V, and Λ• V := Λr V.
r≥0 r≥0
Definition 2.2.15. The exterior product is the bilinear map
∧ : Λr V × Λs V → Λr+s V,
defined by
(r + s)!
ω r ∧ η s := Ar+s (ω ⊗ η), ∀ω r ∈ Λr V, η s ∈ Λs V. ⊔
⊓
r!s!
Proposition 2.2.16. The exterior product has the following properties.
(a) (Associativity)
(α ∧ β) ∧ γ = α ∧ (β ∧ γ), ∀α, β γ ∈ Λ• V.
In particular,
X
v1 ∧ · · · ∧ vk = k!Ak (v1 ⊗ ... ⊗ vk ) = ǫ(σ)vσ(1) ⊗ ... ⊗ vσ(k) , ∀vi ∈ V.
σ∈Sk
(b) (Super-commutativity)
ω r ∧ η s = (−1)rs η s ∧ ω r , ∀ω r ∈ Λr V, ω s ∈ Λs V.
50 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
ω r ∧1 η s := Ar+s (ω ⊗ η),
which will turn out to be associative and will force ∧ to be associative as well.
L of ∧r1 consider the quotient
To prove the associativity algebra Q∗ = T ∗ /I∗ , where T ∗
is the associative algebra ( r≥0 T (V ), +, ⊗), and I∗ is the bilateral ideal generated by
the set of squares {v ⊗ v/ v ∈ V }. Denote the (obviously associative) multiplication in Q∗
by ∪. The natural projection π : T ∗ → Q∗ induces a linear map π : Λ• V → Q∗ . We will
complete the proof of the proposition in two steps.
Step 1. We will prove that the map π : Λ• V → Q∗ is a linear isomorphism, and moreover
(u + v)⊗2 ∈ I∗ , ∀u, v ∈ V
ω = A∗ (ω) = 0.
The surjectivity of π follows immediately from (2.2.2). Indeed, any π(T ) can be alterna-
tively described as π(ω) for some ω ∈ Λ• V . It suffices to take ω = A∗ (T ).
To prove (2.2.1) it suffices to consider only the special cases when α and β are mono-
mials:
α = Ar (e1 ⊗ · · · ⊗ er ), β = As (f1 ⊗ · · · ⊗ fs ).
We have
π(α∧1 β) = π Ar+s Ar (e1 ⊗ · · · ⊗ er ) ⊗ As (f1 ⊗ · · · ⊗ fs )
(2.2.2)
= π Ar (e1 ⊗ · · · ⊗ er ) ⊗ As (f1 ⊗ · · · ⊗ fs )
def
= π Ar (e1 ⊗ · · · ⊗ er ) ∪ π As (f1 ⊗ · · · ⊗ fs ) = π(α) ∪ π(β).
Thus ∧1 is associative.
2.2. A LINEAR ALGEBRA INTERLUDE 51
Exercise 2.2.17. Finish the proof of part (b) in the above proposition. ⊔
⊓
The space Λ• V is called the exterior algebra of V . The operation ∧ is called the
exterior product. The exterior algebra is a Z-graded algebra, i.e.,
(Λr V ) ∧ (Λs V ) ⊂ Λr+s V, ∀r, s.
Note that Λr V = 0 for r > dim V (pigeonhole principle).
Definition 2.2.18. Let V be an n-dimensional K-vector space. The one dimensional
vector space Λn V is called thedeterminant line of V , and it is denoted by det V . ⊔
⊓
[y w Λ• V
ιV
V
[[
φ ℄
[ uΦ ,
A
i.e., Φ ◦ ιV = φ.
Exercise 2.2.20. Prove Proposition 2.2.19. ⊔
⊓
Exercise 2.2.21. Formulate and prove the analogue of Proposition 2.2.19 for the algebra
S• V . ⊔
⊓
(b) Prove that for every sufficiently small z ∈ C we have the equalities
X d X
det(1V + zA) = σr (A)z r , − log det(1 − zA) = ψr (A)z r−1 . ⊔
⊓
dz
j≥0 r≥1
The functors Λ• and S • have an exponential like behavior, i.e., for any finite dimen-
sional vectors spaces V, W , there exists natural isomorphisms of vector spaces
Λ• V ⊗ Λ• W ∼
= Λ• (V ⊕ W ). (2.2.4a)
S• V ⊗ S• W ∼
= S • (V ⊕ W ). (2.2.4b)
To define the isomorphism in (2.2.4a) consider the bilinear map
φ : Λ• V × Λ• W → Λ• (V ⊕ W ),
uniquely determined by
φ(v1 ∧ · · · ∧ vr , w1 ∧ · · · ∧ ws ) = v1 ∧ · · · ∧ vr ∧ w1 ∧ · · · ∧ ws .
The universality property of the tensor product implies the existence of a linear map
Φ : Λ• V ⊗ Λ• W → Λ• (V ⊕ W ),
φ : S • V ⊗ S • W → S • (V ⊕ W )
φ(v1 · · · vj , w1 · · · wk ) = v1 · · · vj · w1 · · · wk .
To prove that it is an isomorphism, fix basis (ei )1≤i≤m of V and basis (fj )1≤j≤n of W .
Note that the collection I
e ⊗ f J ; I ∈ Zm n
≥0 , J ∈ Z≥0
54 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
which is a basis of S • (V ⊕ W ).
We want to mention a few general facts about Z-graded vector spaces, i.e., vector
spaces equipped with a direct sum decomposition
M
V = Vn .
n∈Z
(We will always assume that each Vn is finite dimensional.) The vectors in Vn are said
to be homogeneous, of degree n. For example, the ring of polynomials K[x] is a K-graded
vector space. The spaces Λ• V and S • V are Z-graded vector spaces.
The direct sum of two Z-graded vector spaces V and W is a Z-graded vector space
with
(V ⊕ W )n := Vn ⊕ Wn .
The tensor product of two Z-graded vector spaces V and W is a Z-graded vector space
with M
(V ⊗ W )n := Vr ⊗ W s .
r+s=n
To any Z-graded vector space V one can naturally associate a formal series PV (t) ∈
Z[[t, t−1 ]] by X
PV (t) := (dimK Vn )tn .
n∈Z
Remark 2.2.29. If we try to compute χ(K[x]) using the first formula in Definition 2.2.28
we get χ(K[x]) = 1/2, while the second formula makes no sense (divergent series). ⊔
⊓
2.2. A LINEAR ALGEBRA INTERLUDE 55
and n n−1
1 1 d 1
PS •
V (t) = = .
1−t (n − 1)! dt 1−t
In particular, dim Λ• V = 2n and χ(Λ• V ) = 0.
Proof. From (2.2.4a) and (2.2.4b) we deduce using Exercise 2.2.27 that for any vector
spaces V and W we have
PΛ• (V ⊕W ) (t) = PΛ• V (t) · PΛ• W (t) and PS • (V ⊕W ) (t) = PS • V (t) · PS • W (t).
[•, •]s : A × A → A,
defined on homogeneous elements ω i ∈ Ai , η j ∈ Aj by
[ω i , η j ]s := ω i η j − (−1)ij η j ω j .
An s-algebra is called s-commutative, if the suppercommutator is trivial, [•, •]s ≡ 0. ⊔
⊓
.
56 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
where Tji ∈ End (E i , E j ). We can use this block decomposition to describe a structure of
s-algebra on End (E). The even endomorphisms have the form
T00 0
,
0 T11
[D, Lx ]End(A)
s = LDx , (2.2.5)
End(A)
where [ , ]s denotes the supercommutator in End (A) (with the s-structure defined
in Example 2.2.32), while for any z ∈ A we denoted by Lz the left multiplication operator
a 7→ z · a.
An s-derivation is called even (respectively odd), if it is even (respectively odd) as an
element of the s-algebra End (A). ⊔
⊓
Remark 2.2.35. The relation (2.2.5) is a super version of the usual Leibniz formula.
Indeed, assuming D is homogeneous (as an element of the s-algebra End (A)) then equality
(2.2.5) becomes
D(xy) = (Dx)y + (−1)|x||D| x(Dy),
As usual, a hat indicates a missing entry. The derivation iu∗ is called the interior derivation
by u∗ or the contraction by u∗ . Often, one uses the alternate notation u∗ to denote this
derivation. ⊔
⊓
and ∀x ∈ A, Rx is a s-derivation.
When A is purely even, i.e., A1 = {0}, then A is called simply a Lie algebra. The
multiplication in a (s-) Lie algebra is called the (s-)bracket. ⊔
⊓
The above definition is highly condensed. In down-to-earth terms, the fact that Rx is
a s-derivation for all x ∈ A is equivalent with the super Jacobi identity
Example 2.2.39. Let E be a vector space (purely even). Then A = End (E) is a Lie
algebra with bracket given by the usual commutator: [a, b] = ab − ba. ⊔
⊓
Proposition 2.2.40. Let A = A0 ⊕ A1 be a s-algebra, and denote by Ders (A) the vector
space of s-derivations of A.
(a) For any D ∈ Ders (A), its homogeneous components D 0 , D 1 ∈ End (A) are also
s-derivations.
End(A)
(b) For any D, D ′ ∈ Ders (A), the s-commutator [D, D ′ ]s is again an s-derivation.
x
(c) ∀x ∈ A the bracket B : a 7→ [a, x]s is a s-derivation called the bracket derivation
determined by x. Moreover
[B x , B y ]End
s
(A)
= B [x,y]s , ∀x, y ∈ A. ⊔
⊓
58 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
b instead
To emphasize the super-nature of the tensor product we will use the symbol “⊗”
of the usual “⊗”. ⊔
⊓
b •V ∼
V ∗ ⊗Λ = Ders (Λ• V ),
∗ ⊗ω ∗ ⊗ω
uniquely determined by v ∗ × ω 7→ D v , where D v is s-derivation defined by
∗ ⊗ω
Dv (v) = hv ∗ , viω, ∀v ∈ V.
then we see that the space of s-traces is isomorphic with the dual of the quotient space
A/[A, A]s .
then
trs T = tr T00 − tr T11 . ⊔
⊓
Exercise 2.2.45. Prove the above proposition. ⊔
⊓
2.2. A LINEAR ALGEBRA INTERLUDE 59
2.2.4 Duality
Duality is a subtle and fundamental concept which permeates all branches of mathematics.
This section is devoted to those aspects of the atmosphere called duality which are relevant
to differential geometry. In the sequel, all vector spaces will be tacitly assumed finite
dimensional, and we will use Einstein’s convention without mentioning it. K will denote
one of the fields R or C.
Definition 2.2.46. A pairing between two K-vector spaces V and W is a bilinear map
B : V × W → K. ⊔
⊓
Example 2.2.48. Let V be a finite dimensional real vector space. Any symmetric non-
degenerate quadratic form (•, •) : V × V → R defines a (self)duality, and in particular a
natural isomorphism
L := I(•,•) : V → V ∗ .
When (•, •) is positive definite, then the operator L is called metric duality or lowering-
the-indices map. This operator can be nicely described in coordinates as follows. Pick a
basis (ei ) of V , and set
gij := (ei , ej ).
Let (ej ) denote the dual basis of V ∗ defined by
hej , ei i = δij , ∀i, j.
The action of L is then
Lei = gij ej . ⊔
⊓
Example 2.2.49. Consider V a real vector space and ω : V × V → R a skew-symmetric
bilinear form on V . The form ω is said to be symplectic if this pairing is a duality. In
this case, the induced operator Iω : V → V ∗ is called symplectic duality. ⊔
⊓
B = B1 ⊗ B2 : (V1 ⊗ V2 ) × (W1 ⊗ W2 ) → R,
uniquely determined by
Λr B : Λr V × Λr W → K
uniquely determined by
h • , • i : Λr V ∗ × Λr V → R,
Λr V ∗ ∼ ∗
= (Λr V ) .
This shows that we can regard the elements of Λr V ∗ as skew-symmetric r-linear forms
V r → K.
A duality B : V × W → K naturally induces a duality B † : V ∗ × W ∗ → K by
B † (v ∗ , w∗ ) : =hv ∗ , I−1 ∗
B w i,
( • , • ) : Λr V × Λr V → R,
2.2. A LINEAR ALGEBRA INTERLUDE 61
determined by
In a Euclidean vector space V the inner product induces the metric duality L : V → V ∗ .
r−1
This induces an operator L : Tsr (V ) → Ts+1 (V ) defined by
T = Tji11...j
...ir
e ⊗ · · · ⊗ eir ⊗ ej1 ⊗ · · · ⊗ ejs ,
s i1
then
i2 ...ir
(LT )jj 1 ...jr
= gij Tjii12...j
...ir
s
,
where gij = (ei , ej ). The inverse of the metric duality L−1 : V ∗ → V induces a linear
r+1
operation Tsr (V ) → Ts−1 (V ) called raising the indices.
Exercise 2.2.55 (Cartan). Let V be an Euclidean vector space. For any v ∈ V denote by
ev (resp. iv ) the linear endomorphism of Λ∗ V defined by ev ω = v ∧ ω (resp. iv = ıv∗ where
ıv∗ denotes the interior derivation defined by v ∗ ∈ V ∗ -the metric dual of v; see Example
2.2.36). Show that for any u, v ∈ V
There is an equivalent way of looking at orientations. To describe it, note that any
nontrivial volume form µ on V uniquely specifies an orientation or µ given by
Then
µ1 ∼ µ2 ⇐⇒ or µ1 = or µ2 .
To every orientation or we can associate an equivalence class [µ]or of volume forms such
that
µ(ω)or(ω) > 0, ∀ω ∈ det V \ {0}.
Thus, we can identify the set of orientations with the set of equivalence classes of nontrivial
volume forms.
Equivalently, to specify an orientation on V it suffices to specify a basis ω of det V .
The associated orientation or ω is uniquely characterized by the condition
or ω (ω) = 1.
To any basis {e1 , ..., en } of V one can associate a basis e1 ∧ · · · ∧ en of det V . Note that a
permutation of the indices 1, . . . , n changes the associated basis of det V by a factor equal
to the signature of the permutation. Thus, to define an orientation on a vector space, it
suffices to specify a total ordering of a given basis of the space.
An ordered basis of an oriented vector space (V, or) is said to be positively oriented
if so is the associated basis of det V .
Definition 2.2.58. Given two orientations or 1 , or 2 on the vector space V we define
or 1 /or 2 ∈ {±1}
to be
or 1 /or 2 := or 1 (ω)or 2 (ω), ∀ω ∈ det V \ {0}.
2.2. A LINEAR ALGEBRA INTERLUDE 63
We will say that or 1 /or 2 is the relative signature of the pair of orientations or 1 , or 2 . ⊔
⊓
Assume now that V is an Euclidean space. Denote the Euclidean inner product by
g( • , • ). The vector space det V has an induced Euclidean structure, and in particular,
there exist exactly two length-one-vectors in det V . If we fix one of them, call it ω, and
we think of it as a basis of det V , then we achieve two things.
• First, it determines a volume form µg defined by
µg (λω) = λ.
If V = R2 with its standard metric, and the orientation given by e1 ∧ e2 , prove that
| Detor
g (v1 ∧ v2 )|
Definition 2.2.61. Let (V, g, or) be an oriented, Euclidean space and denote by Detorg
the associated volume form. The Berezin integral or ( berezinian) is the linear form
Z~
: Λ• V → R,
g
For any Euclidean space (V, g) we denote End− (V ) = End− (V, g) the space of skew-
symmetric linear maps V → V . a natural isomorphism
End− (V ) → Λ2 V ∗ , |; End− (V ) ∋ A 7→ ωA ∈ Λ2 V,
(2.2.10)
ωA (v1 , v2 ) = g(Av1 , v2 ), ∀v1 , v2 ∈ V.
where aij are the entris of the N × N matrix describing the endomorphism A in the basis
(ei ),
aij = g(Aej , ei ) = g(ei , Aej ).
Given an orientation or on (V, g) we define the pfaffian of A ∈ End− (V ) by
P f (A) = P f or
g (A) := P f(ωA ).
Exercise 2.2.65. Let (V, g, or) be an oriented Euclidean space of dimension 2n. Consider
A : V → V a skewsymmetric endomorphism and a positively oriented orthonormal frame
e1 , . . . , e2n . Prove that
(−1)n X
P f or
g (A) = ǫ(σ)aσ(1)σ(2) · · · aσ(2n−1)σ(2n)
2n n!
σ∈S2n
X
= (−1)n ǫ(σ)aσ(1)σ(2) · · · aσ(2n−1)σ(2n) ,
σ∈S′2n
where aij = g(ei , Aej ) is the (i, j)-th entry in the matrix representing A in the basis (ei ),
and S′2n denotes the set of permutations σ ∈ S2n satisfying
Let (V, g, or) be an n-dimensional, oriented, real Euclidean vector space. The metric
duality Lg : V → V ∗ induces both a metric, and an orientation on V ∗ . In the sequel we
will continue to use the same notation Lg to denote the metric duality Tsr (V ) → Tsr (V ∗ ) ∼
=
Tsr (V ).
2.2. A LINEAR ALGEBRA INTERLUDE 65
Definition 2.2.66. Suppose (V, g, or) oriented Euclidean space, and r is a nonnegative
integer, r ≤ dim V . The r-th Hodge pairing is the pairing
Ξ = Ξrg,or : Λr V ∗ × Λn−r V → R,
defined by
Ξ(ω r , η n−r ) := Detor
g L−1 r
g ω ∧η
n−r
, ω r ∈ Λr V ∗ , η n−r ∈ Λn−r V. ⊔
⊓
∗ = IΞ : Λr V ∗ → Λn−r V ∗
The above definition obscures the meaning of the ∗-operator. We want to spend some
time clarifying its significance.
Let α ∈ Λr V ∗ so that ∗α ∈ Λn−r V ∗ . Denote by h•, •i the standard pairing
Λn−r V ∗ × Λn−r V → R,
and by (•, •) the induced metric on Λn−r V ∗ . Then, by definition, for every β ∈ Λn−s V ∗
the operator ∗ satisfies
Detg (L−1 −1 −1
g α ∧ Lg β) = ∗α, Lg β = (∗α, β), ∀β ∈ Λ
n−r ∗
V . (2.2.11)
Let ω denote the unit vector in det V defining the orientation. Then (2.2.11) can be
rewritten as
hα ∧ β, ωi = (∗α, β), ∀β ∈ Λn−r V ∗ .
Thus
r ∗ n−r ∗
α ∧ β = (∗α, β) Det or
g , ∀α ∈ Λ V , ∀β ∈ Λ V . (2.2.12)
Example 2.2.69. Let V be the standard Euclidean space R3 with standard basis e1 , e2 , e3 ,
and orientation determined by e1 ∧ e2 ∧ e3 . Then
∗1 = e1 ∧ e2 ∧ e3 , ∗(e1 ∧ e2 ∧ e3 ) = 1,
Proposition 2.2.70. Suppose (V, g, or) is an oriented, real Euclidean space of dimension
n. Then the associated Hodge ∗-operator satisfies
∗(∗ω) = (−1)p(n−p) ω ∀ω ∈ Λp V ∗ ,
Detg (∗1) = 1,
and
α ∧ ∗β = (α, β) ∗ 1, ∀α ∈ Λk V ∗ , ∀β ∈ Λn−k V ∗ . ⊔
⊓
Exercise 2.2.71. Let (V, g, ε) be an n-dimensional, oriented, Euclidean space. For every
t > 0 Denote by gt the rescaled metric gt = t2 g. If ∗ is the Hodge operator corresponding
to the metric g and orientation or, and ∗t is the Hodge operator corresponding to the
metric gt and the same orientation, show that
n
Detor or
tg = t Detg ,
and
∗t ω = tn−2p ∗ ω ∀ω ∈ Λp V ∗ . ⊔
⊓
Definition 2.2.72. Let V be a real vector space. For any r ≥ 0 we define an r-density
to be a function f : det V → R such that
where
ıor µ(ω) = or(ω)µ(ω), ∀ω ∈ det V \ {0}.
i.e., X
Re h = (ei ⊗ ei + f i ⊗ f i ).
i
Also
ω(ei , fj ) = −Im h(ei , iej ) = δij , ω(ei , ej ) = ω(fi , fj ) = 0 ∀i, j,
which shows that X
ω = −Im h = ei ∧ f i .
i
Any complex
√ space V can be also thought of as a real vector space. The multiplication
by i = −1 defines a real linear operator which we denote by J. Obviously J satisfies
J 2 = −1V .
Conversely, if V is a real vector space then any real operator J : V → V as above
defines a complex structure on V by
(a + bi)v = av + bJv, ∀v ∈ V, a + bi ∈ C.
V ∼
= ker (J − i) V ∼
= (ker J + i). ⊔
⊓
Set
V 1,0 := ker(J − i) ∼
=C V V 0,1 := ker(J + i) ∼
=C V .
= V 1,0 ⊕ V 0,1 ∼
Thus VC ∼ = V ⊕ V . We obtain an isomorphism of Z-graded complex vector
spaces
Λ• VC ∼
= Λ• V 1,0 ⊗ Λ• V 0,1 .
If we set
We can define similarly Λp,q V ∗ as the Λp,q -construction applied to the real vector space
Vr∗ equipped with the complex structure J ∗ . Note that
Λ1,0 V ∗ ∼
= (Λ1,0 V )∗c ,
Λ0,1 V ∗ ∼
= (Λ0,1 V )∗c ,
and, more generally
Λp,q V ∗ ∼
= (Λp,q V )∗c .
If h is a Hermitian metric on the complex vector space (V, J), then we have a natural
isomorphism of complex vector spaces
Vc∗ ∼
= (Vr∗ , J ∗ ) ∼
=C (V, −J) ∼
=C V ,
so that
Λp,q V ∗ ∼
=C Λq,p V.
The Euclidean metric g = Re h, and the associated 2-form ω = −Im h are related by
Moreover, ω is a (1, 1)-form. To see this it suffices to pick a unitary basis (ei ) of V , and
construct as usual the associated real orthonormal basis {e1 , f1 , · · · , en , fn } (fi = Jei ).
Denote by {ei , f i ; i = 1, . . . , n} the dual orthonormal basis in Vr∗ . Then J ∗ ei = −f i , and
if we set
1 1
εi := √ (ei + if i ), ε̄j := √ (ej − if j ),
2 2
then
Λ1,0 V ∗ = spanC {εi } Λ0,1 = spanC {ε̄j },
and X
ω=i εi ∧ ε̄i .
Let V be a complex vector space, and e1 , . . . , en be a basis of V over C. This is not a real
basis of V since dimR V = 2 dimC V . We can however complete this to a real basis. More
precisely, the vectors e1 , ie1 , . . . , en , ien form a real basis of V .
2.2. A LINEAR ALGEBRA INTERLUDE 69
Proposition 2.2.75. Suppose (e1 , . . . , en ) and (f1 , . . . , fn ) are two complex bases of V ,
and Z = (zkj )1≤j,k is the complex matrix describing the transition from the basis e to the
basis f , i.e., X j
fk = zk ej , ∀1 ≤ k ≤ n.
j
Then
f1 ∧ if1 ∧ · · · ∧ fn ∧ ifn = | det Z|2 e1 ∧ ie1 ∧ · · · ∧ en ∧ ien .
Proof. We write
zkj = xjk + iykj , xjk , ykj ∈ R, êj = iej , fˆk = ifk .
Then X j X j X j
fk = (xk + iykj )ej = xk ej + yk êj ,
j j j
and X X
fˆk = − ykj ej + xjk êj .
j j
and X, Y denote the n × n real matrices with entries (xjk ) and respectively (ykj ).
We want to show that
Let
A := Z ∈ EndC (Cn ); | det Z|2 = det Zb .
We will prove that A = EndC (Cn ).
The set A is nonempty since it contains all the diagonal matrices. Clearly A is a closed
subset of EndC (Cn ), so it suffices to show that A is dense in EndC (Cn ).
Observe that the correspondence
is an endomorphism of R-algebras. Then, for every Z ∈ EndC (Cn ), and any complex
linear automorphism T ∈ AutC (Cn ), we have
T\
ZT −1 = TbZ
bTb−1 .
70 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
Hence
Z ∈ A ⇐⇒ T ZT −1 ∈ A, ∀T ∈ AutC (Cn ).
In other words, if a complex matrix Z satisfies (2.2.16), then so will any of its conjugates.
In particular, A contains all the diagonalizable n × n complex matrices, and these form a
dense subset of EndC (Cn ) (see Exercise 2.2.76). ⊔
⊓
Exercise 2.2.76. Prove that the set of diagonalizable n × n complex matrices form a
dense subset of the vector space EndC (Cn ). ⊔
⊓
1 n
Detg = Detor
g
c
= ω ,
n!
and we conclude
P f h ω = 1.
† −1
(gαβ ) : Uαβ → GL( VE∗ ), Λr gαβ : Uαβ → GL(Λr VE ),
where † denotes the transpose of a linear map, satisfy the cocycle condition, and therefore
define vector bundles which we denote by E ⊕ F , E ⊗ F , E ∗ , and respectively Λr E. In
particular, if r = rankK E, the bundle Λr E has rank 1. It is called the determinant line
bundle of E, and it is denoted by det E. Given the adjunction isomorphism VE∗ ⊗ VF ∼ =
Hom(VE , VF ), we set
Hom(E, F ) := E ∗ ⊗ F.
We set
End(E) := Hom(E, E).
The reader can check easily that these vector bundles are independent of the choices of
transition maps used to characterize E and F (use Exercise 2.1.34). The bundle E ∗ is
called the dual of the vector bundle E. The direct sum E ⊕ F is also called the Whitney
sum of vector bundles. All the functorial constructions on vector spaces discussed in the
previous section have a vector bundle correspondent. (Observe that a vector space can be
thought of as a vector bundle over a point.)
These above constructions are natural in the following sense. Let E ′ and F ′ be vector
bundles over the same smooth manifold M ′ . Any bundle maps S : E → E ′ and T : F →
F ′ , both covering the same diffeomorphism φ : M → M ′ , induce bundle morphisms
S ⊕ T : E ⊕ F → E′ ⊕ T ′, S ⊗ T : E ⊗ F → E′ ⊗ F ′,
covering φ, a morphism
S † : (E ′ )∗ → E ∗ ,
covering φ−1 etc.
Exercise 2.3.1. Prove the assertion above. ⊔
⊓
Example 2.3.2. Let E, F , E ′ and F ′ be vector bundles over a smooth manifold M . Con-
sider bundle isomorphisms S : E → E ′ and T : F → F ′ covering the same diffeomorphism
of the base, φ : M → M . Then (S −1 )† : E ∗ → (E ′ )∗ is a bundle isomorphism covering φ,
so that we get an induced map (S −1 )† ⊗ T : E ∗ ⊗ F → (E ′ )∗ ⊗ F ′ . Note that we have a
natural identification
C ∞ Hom(E, F ) = C ∞ (E ∗ ⊗ F ) ∼
= Hom(E, F ),
where we recall that Hom(E, F ) denotes the space of smooth bundle morphisms E → F .⊓
⊔
T ∗ M := (T M )∗ .
Definition 2.3.4. (a) A tensor field of type (r, s) over the open set U ⊂ M is a section
of Tsr (M ) over U .
(b) A degree r differential form (r-form for brevity) is a section of Λr (T ∗ M ). The space
of (smooth) r-forms over M is denoted by Ωr (M ). We set
M
Ω• (M ) := Ωr (M ).
r≥0
gx : Tx M × Tx M → R
If we view the tangent bundle as a smooth family of vector spaces, then a tensor field
can be viewed as a smooth selection of a tensor in each of the tangent spaces. In particular,
a Riemann metric defines a smoothly varying procedure of measuring lengths of vectors
in tangent spaces.
Example 2.3.5. It is often very useful to have a local description of these objects.
If (x1 , . . . , xn ) are local coordinates on an open set U ⊂ M , then the vector fields
( ∂x∂ 1 , . . . , ∂x∂ n ) trivialize T M |U , i.e., they define a framing of the restriction of T M to
U . We can form a dual framing of T ∗ M |U , using the 1-forms dxi , i = 1, . . . , n. They
satisfy the duality conditions
hdxi , ∂xj i = δji , ∀i, j.
A basis in Tsr (Tx M ) is given by
∂xi1 ⊗ ... ⊗ ∂xir ⊗ dxj1 ⊗ ... ⊗ dxjs ; 1 ≤ i1 , ... , ir ≤ n, 1 ≤ j1 , ... , js ≤ n .
T = Tji11...j
...ir
s
∂xi1 ⊗ ... ⊗ ∂xir ⊗ dxj1 ⊗ ... ⊗ dxjs .
In the above equality we have used Einstein’s convention. In particular, an r-form ω has
the local description
X
ω= ωi1 ...ir dxi1 ∧ · · · ∧ dxir , ωi1 ...ir = ω ∂xi1 , · · · , ∂xir ,
1≤i1 <···<ir ≤n
2.3. TENSOR FIELDS 73
Remark 2.3.6. (a) A covariant tensor field, i.e., a (0, s)-tensor field S, naturally defines
a C ∞ (M )-multilinear map
s
M
S: Vect (M ) → C ∞ (M ),
1
(X1 , ... , Xs ) 7→ p 7→ Sp X1 (p), . . . , Xs (p) ∈ C ∞ (M ).
Conversely, any such map uniquely defines a (0, s)-tensor field. In particular, an r-form η
can be identified with a skew-symmetric C ∞ (M )-multilinear map
r
M
η: Vect (M ) → C ∞ (M ).
1
Notice that the wedge product in the exterior algebras induces an associative product in
Ω• (M ) which we continue to denote by ∧.
(b) Let f ∈ C ∞ (M ). Its Fréchet derivative Df : T M → T R ∼
= R × R is naturally a 1-form.
Indeed, we get a smooth C ∞ (M )-linear map df : Vect (M ) → C ∞ (M ) defined by
In the sequel we will always regard the differential of a smooth function f as a 1-form and
to indicate this we will use the notation df (instead of the usual Df ). ⊔
⊓
S : Vect (M ) × · · · × Vect (M ) → C ∞ (M ).
| {z }
k
Then f∗ S is a (0, k) tensor field on N . Let q ∈ N , and set p := f −1 (q). Then, for any
Y1 , . . . , Yk ∈ Tq N , we have
(f∗ S)q (Y1 , . . . , Yk ) = Sp f∗−1 Y1 , . . . , (f∗ )−1 Yk
= Sp (Dp f )−1 Y1 , . . . , (Dp f )−1 Yk . ⊔
⊓.
74 CHAPTER 2. NATURAL CONSTRUCTIONS ON MANIFOLDS
For covariant tensor fields a more general result is true. More precisely, any smooth
map f : M → N defines a linear map
f ∗ : Ts0 (N ) → Ts0 (M ),
f ∗ = (f∗−1 )† ,
The map F defines the usual polar coordinates. It is a diffeomorphism onto the open
subset
U := R2 \ {(x, 0); x ≥ 0}.
∂ ∂
For simplicity, we will write ∂r , ∂x instead of ∂r , ∂x etc. We have
F ∗ dx = d(r cos θ) = cos θdr − rsinθdθ, F ∗ dy = d(r sin θ) = sinθdr + r cos θdθ,
F ∗ (dx ∧ dy) = d(r cos θ) ∧ d(rsinθ) = (cos θdr − rsinθdθ) ∧ (sinθdr + r cos θdθ)
= r(cos2 θ + sin2 θ)dr ∧ dθ = rdr ∧ dθ.
To compute F∗ ∂r and F∗ ∂θ we use the chain rule which implies
∂x ∂y 1
F∗ ∂r = ∂x + ∂y = cos θ∂x + sin θ∂y = (x∂x + y∂y )
∂r ∂r r
1
= (x∂x + y∂y ).
(x2 + y 2 )1/2
The Euclidean metric is described over U by the symmetric (0, 2)–tensor g = dx2 + dy 2 .
The pullback of g by F is the symmetric (0, 2)–tensor
2 2
F ∗ (dx2 + dy 2 ) = d(r cos θ) + d(r sin θ)
All the operations discussed in the previous section have natural extensions to ten-
sor fields. There exists a tensor multiplication, a Riemann metric defines a duality
L : Vect (M ) → Ω1 (M ) etc. In particular, there exists a contraction operator
r+1
tr : Ts+1 (M ) → Tsr (M )
defined by
∀Xi ∈ Vect (M ), ∀ωj ∈ Ω1 (M ). In local coordinates the contraction has the form
n o
tr Tji00...j
...ir
s
= {Tijii11...j
...ir
s
}.
Let us observe that a Riemann metric g on a manifold M induces metrics in all the
associated tensor bundles Tsr (M ). If we choose local coordinates (xi ) on an open set U
then, as explained above, the metric g can be described as
X
g= gij dxi ⊗ dxj ,
i,j
T = Tji11...j
...ir
∂ i ⊗ ... ⊗ ∂xir ⊗ dxj1 ⊗ ... ⊗ dxjs .
s x 1
If we denote by (gij ) the inverse of the matrix (gij ), then, for every point p ∈ U , the length
of T (p) ∈ Tsr (M )p is the number |T (p)|g defined by
We have already met examples of fiber bundles when we discussed vector bundles.
These were “smooth families of vector spaces”. A fiber bundle wants to be a smooth
family of copies of the same manifold. This is a very loose description, but it offers a first
glimpse at the notion about to be discussed.
The model situation is that of direct product X = F × B, where B and F are smooth
manifolds. It is convenient to regard this as a family of manifolds (Fb )b∈B . The manifold
B is called the base, F is called the standard (model) fiber, and X is called the total
space. This is an example of trivial fiber bundle.
In general, a fiber bundle is obtained by gluing a bunch of trivial ones according to
a prescribed rule. The gluing may encode a symmetry of the fiber, and we would like to
spend some time explaining what do we mean by symmetry.
Definition 2.3.11. (a) Let M be a smooth manifold, and G a Lie group. We say the
group G acts on M from the left (respectively right), if there exists a smooth map
Φ : G × M → M, (g, m) 7→ Φg m,
Then the counterclockwise rotations about the z-axis define a smooth left action of S 1 on
S 2 . More formally, if we use cylindrical coordinates (r, θ, z),
x = r cos θ, y = r sin θ, z = 0,
Example 2.3.14. Let G be a Lie group. For any g ∈ G denote by Lg (resp. Rg ) the left
(resp right) translation by g. In this way we get the tautological left (resp. right) action
of G on itself. ⊔
⊓
(e) a collection of local trivializations, i.e., an open cover (Uα ) of the base B, and dif-
feomorphisms Ψα : F × Uα → π −1 (Uα ) such that
π ◦ Ψα (f, b) = b, ∀(f, b) ∈ F × Uα ,
F × Uα
'')
Ψα
w π−1(Uα)
)' ^[^[[π
Uα
We can form the transition (gluing) maps Ψαβ : F × Uαβ → F × Uαβ , where Uαβ =
Uα ∩ Uβ , defined by Ψαβ = ψα−1 ◦ ψβ . According to (e), these maps can be written
as
ψαβ (f, b) = (Tαβ (b)f, b),
where Tαβ (b) is a diffeomorphism of F depending smoothly upon b ∈ Uαβ .
G × F ∋ (g, x) 7→ g · x = Tg x ∈ F.
(g) There exist smooth maps gαβ : Uαβ → G satisfying the cocycle condition
The choice of an open cover (Uα ) in the above definition is a source of arbitrariness
since there is no natural prescription on how to perform this choice. We need to describe
when two such choices are equivalent.
Two open covers (Uα ) and (Vi ), together with the collections of local trivializations
are said to be equivalent if, for all α, i, there exists a smooth map
Tαi : Uα ∩ Vi → G,
Φ−1
α Ψi (f, x) = (Tαi (x)f, x).
Definition 2.3.17. (a) A fiber bundle is an object defined by conditions (a)-(d) and (f) in
the above definition. (One can think the structure group is the group of diffeomorphisms
of the standard fiber).
(b) A section of a fiber bundle E → B is a smooth map s : B → E such that π ◦ s = 1B ,
π
−1
i.e., s(b) ∈ π (b), ∀ b ∈ B. ⊔
⊓
Example 2.3.18. A rank r vector bundle (over K = R, C) is a GL(r, K)-fiber bundle with
standard fiber Kr , and where the group GL(r, K) acts on Kr in the natural way. ⊔
⊓
2.3. TENSOR FIELDS 79
Example 2.3.19. Let G be a Lie group. A principal G-bundle is a G-fiber bundle with
fiber G, where G acts on itself by left translations. Equivalently, a principal G-bundle
over a smooth manifold M can be described by an open cover U of M and a G-cocycle,
i.e., a collection of smooth maps
gU V : U ∩ V → G U, V ∈ U,
P × G → P, (p, g) 7→ p · g,
such that its orbits coincide with the fibers of the bundle P , and there exists a trivializing
cover
Ψα : G × Uα → π −1 (Uα ) ,
such that
Ψα (hg, u) = Ψα (h, u) · g, ∀g, h ∈ G, u ∈ Uα . ⊔
⊓
Exercise 2.3.21 (The frame bundle of a manifold). Let M n be a smooth manifold.
Denote by F (M ) the set of frames on M , i.e.,
(a) Prove that F (M ) can be naturally organized as a smooth manifold such that the
natural projection p : F (M ) → M , (m; X1 , ... , Xn ) 7→ m is a submersion.
(b) Show F (M ) is a principal GL(n, R)-bundle. The bundle F (M ) is called the frame
bundle of the manifold M .
Hint: A matrix T = (Tji ) ∈ GL(n, K) acts on the right on F (M ) by
Exercise 2.3.24 (The Hopf bundle). If we identify the unit odd dimensional sphere S 2n−1
with the submanifold
(z1 , ... , zn ) ∈ Cn ; |z0 |2 + · · · + |zn |2 = 1
The space of orbits of this action is naturally identified with the complex projective space
CPn−1 .
(a) Prove that p : S 2n−1 → CPn−1 is a principal S 1 bundle called Hopf bundle. (p is the
obvious projection). Describe one collection of transition maps.
(b) Prove that the tautological line bundle over CPn−1 is associated to the Hopf bundle
via the natural action of S 1 on C1 . ⊔
⊓
Exercise 2.3.25. Let E be a vector bundle over the smooth manifold M . Any metric h
on E (euclidian or Hermitian) defines a submanifold S(E) ⊂ E by
Prove that S(E) is a fibration over M with standard fiber a sphere of dimension rank E −1.
The bundle S(E) is usually called the sphere bundle of E. ⊔
⊓
Chapter 3
Calculus on Manifolds
This chapter describes the “kitchen” of differential geometry. We will discuss how one
can operate with the various objects we have introduced so far. In particular, we will
introduce several derivations of the various algebras of tensor fields, and we will also
present the inverse operation of integration.
(a) Φ0 (m) = m, ∀m ∈ M .
The conditions (a) and (b) above show that a flow is nothing but a left action of the
additive (Lie) group (R, +) on M .
81
82 CHAPTER 3. CALCULUS ON MANIFOLDS
d
X(p) = XΦ (p) := |t=0 Φt (p), ∀p ∈ M,
dt
i.e., X(p) is the tangent vector to the smooth path t 7→ Φt (p) at t = 0. This path is called
the flow line through p. ⊔
⊓
Example 3.1.5. Consider the flow etA on Rn generated by a n×n matrix A. Its generator
is the vector field XA on Rn defined by
d
XA (u) = |t=0 etA u = Au. ⊔
⊓
dt
Proposition 3.1.6. Let M be a smooth n-dimensional manifold. The map
is a surjection. Moreover, if Φi : Ni → M (i=1,2) are two local flows such that XΦ1 = XΦ2 ,
then Φ1 = Φ2 on N1 ∩ N2 .
γx (0) = x. (3.1.2)
Moreover, as a consequence of the smooth dependence upon initial data we deduce that
the map
ΦK : NK = (−ε, ε) × K → M, (x, t) 7→ γx (t),
is smooth.
3.1. THE LIE DERIVATIVE 83
Now we can cover M by open, precompact, local coordinate neighborhoods (Kα )α∈A ,
and as above, we get smooth maps Φα : Nα = (−εα , εα ) × Kα → M solving the initial
value problem (3.1.1-2). Moreover, by uniqueness, we deduce
Φα = Φα on Nα ∩ Nβ .
Define [
N := Nα ,
α∈A
and set Φ : N → M , Φ = Φα on Nα .
The uniqueness of solutions of initial value problems for ordinary differential equations
implies that Φ satisfies all the conditions in the definition of a local flow. Tautologically,
X is the infinitesimal generator of Φ. The second part of the proposition follows from the
uniqueness in initial value problems. ⊔
⊓
The family of local flows on M with the same infinitesimal generator X ∈ Vect(M ) is
naturally ordered according to their domains,
(Φ1 : N1 → M ) ≺ (Φ2 : N2 → M )
if and only if N1 ⊂ N2 . This family has a unique maximal element which is called the
local flow generated by X, and it is denoted by ΦX .
Exercise 3.1.7. Consider the unit sphere
S 2 = (x, y, z) ∈ R3 ; x2 + y 2 + z 2 = 1 .
For every point p ∈ S 2 we denote by X(p) ∈ Tp R3 , the orthogonal projection of the vector
k = (0, 0, 1) onto Tp S 2 .
(a) Prove that p 7→ X(p) is a smooth vector field on S 2 , and then describe it in cylindrical
coordinates (z, θ), where
Xf := LX f = hdf, Xi = df (X).
h•, •i : C ∞ (T ∗ M ) × C ∞ (T M ) → C ∞ (M ),
C ∞ (T ∗ M ) × C ∞ (T M ) ∋ (α, X) 7→ α(X) ∈ C ∞ (M ).
In particular, LX is a derivation of C ∞ (M ).
Proof. Let Φt = ΦtX be the local flow generated by X. Assume for simplicity that it is
defined for all t. The map Φt acts on C ∞ (M ) by the pullback of its inverse, i.e.
Φt∗ = (Φ−t )∗ .
Lemma 3.1.10. Let X, Y ∈ Vect (M ). Then the Lie derivative of Y along X is a new
vector field LX Y which, viewed as a derivation of C ∞ (M ), coincides with the commutator
of the two derivations of C ∞ (M ) defined by X and Y i.e.
LX Y f = [X, Y ]f, ∀ f ∈ C ∞ (M ).
The vector field [X, Y ] = LX Y is called the Lie bracket of X and Y . In particular the Lie
bracket induces a Lie algebra structure on Vect (M ).
Note in particular that [∂xi , ∂xj ] = 0, i.e., the basic vectors ∂xi commute as derivations.
So far we have not proved the vector field in (3.1.4) is independent of coordinates. We
will achieve this by identifying it with the intrinsically defined vector field LX Y .
Set γ(t) = Φt m so that we have a parametrization γ(t) = (xi (t)) with ẋi = X i . Then
Φ−t m = γ(−t) = γ(0) − γ̇(0)t + O(t2 ) = xi0 − tX i + O(t2 ) ,
and
j ∂Y j
Yγ(−t) + O(t2 ).
= Ymj − tX i (3.1.5)
∂xi
Note that Φ−t
∗ : Tγ(0) M → Tγ(−t) M is the linearization of the map
(xi ) 7→ xi − tX i + O(t2 ) ,
(1 − A)−1 = 1 + A + A2 + · · · ,
where A is a matrix of operator norm strictly less than 1, we deduce that the differential
Φt∗ = (Φ−t
∗ )
−1
: Tγ(−t) M → Tγ(0) M,
Lemma 3.1.11. For any differential form ω ∈ Ω1 (M ), and any vector fields X, Y ∈
V ect(M ) we have
(LX ω)(Y ) = LX (ω(Y )) − ω([X, Y ]), (3.1.8)
where X · ω(Y ) denotes the (Lie) derivative of the function ω(Y ) along the vector field X.
86 CHAPTER 3. CALCULUS ON MANIFOLDS
Proof. Denote by Φt the local flow generated by X. We have Φt∗ ω = (Φ−t )∗ ω, i.e., for any
p ∈ M , and any Y ∈ Vect (M ), we have
(Φt∗ ω)p (Yp ) = ωΦ−t p Φ−t
∗ Yp .
Denote by γ(t) the path t 7→ Φt (p). We set ωi (t) = ωi (γ(t)), X0i = X i (p), and Y0i = Y i (p).
Using (3.1.6) we deduce
X X ∂X i j
(Φt∗ ω)p (Yp ) = ωi (−t) · Y0i − t Y 0 + O(t 2
) .
∂xj
i j
Hence
d X X ∂X i
−(LX ω)Y = |t=0 (Φt∗ ω)p (Yp ) = − ω̇i (0)Y0i − ωi (0) j Y0j .
dt ∂x
i i,j
d X X X ∂Y i
X · ω(Y ) = |t=0 ωi (t)Y i (t) = ω̇i (0)Y0i + ωi (0)X0j j .
dt ∂x
i i i,j
We deduce that
X i ∂X i j
j ∂Y
X · ω(Y ) − (LX ω)Y = ωi (0) X0 j − Y = ωp ([X, Y ]p ). ⊔
⊓
∂x ∂xj 0
i,j
Observe that if S, T are two tensor fields on M such that both LX S and LX T exist,
then using (3.1.3) we deduce that LX (S ⊗ T ) exists, and
LX (S ⊗ T ) = LX S ⊗ T + S ⊗ LX T. (3.1.9)
Since any tensor field is locally a linear combination of tensor monomials of the form
X1 ⊗ · · · ⊗ Xr ⊗ ω1 ⊗ · · · ⊗ ωs , Xi ∈ Vect (M ), ωj ∈ Ω1 (M ),
we deduce that the Lie derivative LX S exists for every X ∈ Vect (M ), and any smooth
tensor field S. We can now completely describe the Lie derivative on the algebra of tensor
fields.
Proposition 3.1.12. Let X be a vector field on the smooth manifold M . Then the Lie
derivative LX is the unique derivation of T∗∗ (M ) with the following properties.
(a) LX f = hdf, Xi = Xf , ∀f ∈ C ∞ (M ).
(b) LX Y = [X, Y ], ∀X, Y ∈ Vect (M ).
r+1
(c) LX commutes with the contraction tr : Ts+1 (M ) → Tsr (M ).
Moreover, LX is a natural operation, i.e., for any diffeomorphism φ : M → N we have
φ∗ ◦ LX = Lφ∗ X ◦ φ∗ , ∀X ∈ Vect (M ), i.e., φ∗ (LX ) = Lφ∗ X .
3.1. THE LIE DERIVATIVE 87
Proof. The fact that LX is a derivation follows from (3.1.9). Properties (a) and (b) were
proved above. As for part (c), in its simplest form, when T = Y ⊗ ω, where Y ∈ Vect (M ),
and ω ∈ Ω1 (M ), the equality
LX tr T = tr LX T
is equivalent to
LX (ω(Y )) = (LX ω)(Y ) + ω(LX (Y ), (3.1.10)
which is precisely (3.1.8).
Since LX is a derivation of the algebra of tensor fields, its restriction to C ∞ (M ) ⊕
Vect (M ) ⊕ Ω1 (M ) uniquely determines the action on the entire algebra of tensor fields
which is generated by the above subspace. The reader can check easily that the general
case of property (c) follow from this observation coupled with the product rule (3.1.9).
The naturality of LX is another way of phrasing the coordinate independence of this
operation. We leave the reader to fill in the routine details. ⊔
⊓
[LX , LY ] = L[X,Y ] ,
as derivations of the algebra of tensor fields on M . In particular, this says that Vect (M )
as a space of derivations of T∗∗ (M ) is a Lie subalgebra of the Lie algebra of derivations.
so that [LX , LY ] = L[X,Y ] on Vect (M ). Finally, since the contraction commutes with both
LX and LY it obviously commutes with LX LY − LY LX . The corollary is proved. ⊔
⊓
LX (Ω∗ (M )) ⊂ Ω∗ (M ).
Φt ◦ Ψs = Ψs ◦ Ψt , ∀s, t ∈ R.
Prove that if
p ∈ M ; X(p) = 0 = p ∈ M ; Y (p) = 0 .
the two flows commute if and only if [X, Y ] = 0. ⊔
⊓
3.1.3 Examples
Example 3.1.17. Let ω = ωi dxi be a 1-form on Rn . If X = X j ∂xj is a vector field on
Rn , then LX ω = (LX ω)k dxk is defined by
∂X i
(LX ω)k = (LX ω)(∂xk ) = Xω(∂xk ) − ω(LX ∂xk ) = X · ωk + ω ∂ i .
∂xk x
Hence
∂ωk ∂X j
LX ω = Xj j + ωj k dxk .
∂x ∂x
P
In particular, if X = ∂xi = j δij ∂xj , then
n
X ∂ωk
LX ω = L∂xi ω = dxk .
∂xi
k=1
P i∂ ,
If X is the radial vector field X = ix xi then
X
LX ω = (X · ωk + ωk )dxk . ⊔
⊓
k
∂F ∂F ∂F
LX (dx) = dF := dx + dy + dz, LX (dy) = dG, LX (dz) = dH
∂x ∂y ∂z
3.1. THE LIE DERIVATIVE 89
so that
∂F ∂G ∂H
LX (dv) = + + dv = (div X)dv.
∂x ∂y ∂z
In particular, we deduce that if div X = 0, the local flow generated by X preserves the
form dv. We will get a better understanding of this statement once we learn integration
on manifolds, later in this chapter. ⊔
⊓
Example 3.1.19. (The exponential map of a Lie group). Consider a Lie group
G. Any element g ∈ G defines two diffeomorphisms of G: the left (Lg ), and the right
translation (Rg ) on G,
A tensor field T on G is called left (respectively right) invariant if for any g ∈ G (Lg )∗ T =
T (respectively (Rg )∗ T = T ). The set of left invariant vector fields on G is denoted by
LG .
The naturality of the Lie bracket implies
d
|t=0 (g · exp (tX)) = Xg .
dt
We can write (g · exp(tX)) = Lg exp(tX) so that
d d
|t=0 (Lg exp(tX)) = (Lg )∗ |t=0 exp(tX) = (Lg )∗ X = Xg , (left invariance).
dt dt
90 CHAPTER 3. CALCULUS ON MANIFOLDS
The reason for the notation exp (tX) is that when G = GL(n, K), then the Lie algebra of G
is the Lie algebra gl(n, K) of all n × n matrices with the bracket given by the commutator
of two matrices, and for any X ∈ LG we have (Exercise)
X 1
exp (X) = eX = Xk. ⊔
⊓
k!
k≥0
Exercise 3.1.20. Prove the statements left as exercises in the example above. ⊔
⊓
Exercise 3.1.21. Let G be a matrix Lie group, i.e., a Lie subgroup of some general linear
group GL(N, K). This means the tangent space T1 G can be identified with a linear space
of matrices. Let X, Y ∈ T1 G, and denote by exp(tX) and exp(tY ) the 1-parameter groups
with they generate, and set
3. Show that o(n) ⊂ gl(n, R) (defined in Section 1.2.2) is a Lie subalgebra with respect
to the commutator [ · , · ]. Similarly, show that u(n), su(n) ⊂ gl(n, C) are real Lie
subalgebras of gl(n, C), while su(n, C) is even a complex Lie subalgebra of gl(n, C).
4. Prove that we have the following isomorphisms of real Lie algebras,
= o(n), LU (n) ∼
LO(n) ∼ = u(n), LSU (n) ∼
= su(n) and LSL(n,C) ∼
= sl(n, C).
⊔
⊓
1
d3 (X, Y ) = ([X, [X, Y ]] + [Y, [Y, X]]) etc.
12
For more details we refer to [58, 113]. ⊔
⊓
iX ω : =tr (X ⊗ ω), ∀ω ∈ Ωr (M ).
[LX , iY ]s = LX iY − iY LX = i[X,Y ] . ⊔
⊓
The proof uses the fact that the Lie derivative commutes with the contraction operator,
and it is left to the reader as an exercise.
The above s-derivations by no means exhaust the space of s-derivations of Ω• (M ). In
fact we have the following fundamental result.
Proposition 3.2.3. There exists an odd s-derivation d on the s-algebra of differential
forms Ω• ( · ) uniquely characterized by the following conditions.
(b) d2 = 0.
(c) d is natural, i.e., for any smooth function φ : N → M , and for any form ω on M ,
we have
dφ∗ ω = φ∗ dω(⇐⇒ [φ∗ , d] = 0).
92 CHAPTER 3. CALCULUS ON MANIFOLDS
X
∂ωi1 ...ir i
= dx ∧ (dxi1 ∧ · · · ∧ dxir ). (3.2.1)
∂xi
1≤i1 <···<ir ≤n
Thus, the form dω is uniquely determined on any coordinate neighborhood, and this
completes the proof of the uniqueness of d.
dω |U = dω |V on U ∩ V.
Denote by (x1 , ... , xn ) the local coordinates on U , and by (y 1 , ... , y n ) the local coordinates
along V , so that on the overlap U ∩ V we can describe the y’s as functions of the x’s. Over
U we have X
ω= ωi1 ...ir dxi1 ∧ ... ∧ dxir
1≤i1 <···<ir ≤n
X
∂ωi1 ...ir i
dω = dx ∧ (dxi1 ∧ ... ∧ dxir ),
∂xi
1≤i1 <···<ir ≤n
X
bj1 ...jr j
∂ω
dω = dy (dy j1 ∧ · · · ∧ dy jr ).
∂y j
1≤j1 <···<jr ≤n
∂y j1 ∂y jr
ωi1 ...ir = · · · bj ...j .
ω
∂xi1 ∂xir 1 r
3.2. DERIVATIONS OF Ω• (M ) 93
Hence
X r
∂ωi1 ...ir ∂y j1 ∂ 2 y jk ∂y jr ∂y j1 ∂y jr ∂ ω
bj1 ...jr
= · · · · · · b
ω j 1 ...j r + · · · ,
∂xi ∂x i 1 i
∂x ∂x i k ∂x i r ∂x i 1 ∂x i r ∂xi
k=1
where in the above equality we also sum over the indices j1 , ..., jr according to Einstein’s
convention. We deduce
X X ∂ωi1 ...ir i
dx ∧ dxi1 ∧ ... ∧ dxir
∂xi
i 1≤i1 <···<ir ≤n
r
XX ∂y j1 ∂ 2 y jk ∂y jr
= ··· · · · bj ...j dxi ∧ dxi1 ∧ ... ∧ dxir
ω
∂xi1 ∂xi ∂xik ∂xir 1 r
i k=1
r
XX ∂y j1 ∂y jr ∂ ω
bj1 ...jr i
+ ··· dx ∧ dxi1 ∧ ... ∧ dxir . (3.2.2)
∂xi1 ∂x i r ∂xi
i k=1
Notice that
∂2 ∂2
= ,
∂xi ∂xik ∂xik ∂xi
while dxi ∧ dxik = −dxik ∧ dxi so that the first term in the right hand side of (3.2.2)
vanishes. Consequently on U ∩ V
∂ωi1 ...ir i i1 ir ∂y j1 ∂y jr ∂ ω
bj1 ...jr
i
dx ∧ dx · · · ∧ dx = i
· · · i
∧ dxi ∧ dxi1 · · · dxir
∂x ∂x 1 ∂x r ∂xi
j1 jr
∂ωbj1 ...jr i ∂y i1 ∂y ir
= dx ∧ dx ∧ ··· ∧ dx
∂xi ∂xi1 ∂xir
bj1 ...jr j
∂ω
ωj1 ...jr ) ∧ dy j1 ∧ · · · ∧ dy jr =
= (db dy ∧ dy j1 ∧ · · · ∧ dy jr .
∂y j
This proves dω |U = dω |V over U ∩ V . We have thus constructed a well defined linear map
d : Ω• (M ) → Ω•+1 (M ).
To prove that d is an odd s-derivation it suffices to work in local coordinates and show
that the (super)product rule on monomials.
Let θ = f dxi1 ∧ · · · ∧ dxir and ω = gdxj1 ∧ · · · ∧ dxjs . We set for simplicity
Then
d(θ ∧ ω) = d(f gdxI ∧ dxJ ) = d(f g) ∧ dxI ∧ dxJ
= (df · g + f · dg) ∧ dxI ∧ dxJ
= df ∧ dxI ∧ dxJ + (−1)r (f ∧ dxI ) ∧ (dg ∧ dxJ )
= dθ ∧ ω + (−1)deg θ θ ∧ dω.
94 CHAPTER 3. CALCULUS ON MANIFOLDS
∂f 2
d2 f = dxi ∧ dxj .
∂xi ∂xj
The desired conclusion follows from the identities
∂f 2 ∂f 2
= and dxi ∧ dxj = −dxj ∧ dxi .
∂xi ∂xj ∂xj ∂xi
P
Finally, let φ be a smooth map N → M and ω = I ωI dxI be an r-form on M . Here I
runs through all ordered multi-indices 1 ≤ i1 < · · · < ir ≤ dim M . We have
X
dN (φ∗ ω) = dN (φ∗ ωI ) ∧ φ∗ (dxI ) + φ∗ ω I ∧ d(φ∗ dxI ) .
I
For functions, the usual chain rule gives dN (φ∗ ωI ) = φ∗ (dM ωI ). In terms of local coor-
dinates (xi ) the map φ looks like a collection of smooth n functions φi = xi ◦ φ = φ∗ (xi )
and since the pullback is a morphism with respect to the exterior multiplication we get
dN φi1 ∧ · · · ∧ dN φir .
In particular, dN (dφI ) = 0. We put all the above together and we deduce
1.
[d, d]s = 2d2 = 0.
3.
[d, LX ]s = dLX − LX d = 0, ∀X ∈ Vect (M ).
D := [d, iX ]s = diX + iX d.
On the other hand, since LX commutes with the contraction operator, we deduce
Hence
(dω)(X, Y, Z) = X(ω(Y, Z)) − ω([X, Y ], Z) − ω(Y, [X, Z]) − d(iX ω)(Y, X). (3.2.6)
3.2.2 Examples
Example 3.2.8. (The exterior derivative in R3 ).
(a) Let f ∈ C ∞ (R3 ). Then
∂f ∂f ∂f
df = dx + dy + dz.
∂x ∂y ∂z
dω = dP ∧ dx + dQ ∧ dy + dR ∧ dz
∂Q ∂P ∂R ∂Q ∂P ∂R
= − dx ∧ dy + − dy ∧ dz + − dz ∧ dx,
∂x ∂y ∂y ∂z ∂z ∂x
so that dω looks very much like a curl.
(c) Let ω = P dy ∧ dz + Qdz ∧ dx + Rdx ∧ dy ∈ Ω2 (R3 ). Then
∂P ∂Q ∂R
dω = + + dx ∧ dy ∧ dz.
∂x ∂y ∂z
Example 3.2.9. Let G be a connected Lie group. In Example 3.1.19 we defined the Lie
algebra LG of G as the space of left invariant vector fields on G. Set
Ωrlef t (G) ∼
= Λr L∗G .
The exterior derivative of a form in Ω∗lef t can be described only in terms of the algebraic
structure of LG .
Indeed, let ω ∈ L∗G = Ω1lef t (G). For X, Y ∈ L∗G we have (see (3.2.3) )
Since ω, X and Y are left invariant, the scalars ω(X) and ω(Y ) are constants. Thus, the
first two terms in the above equality vanish so that
More generally, if ω ∈ Ωrlef t , then the same arguments applied to (3.2.3) imply that for all
X0 , ..., Xr ∈ LG we have
X
dω(X0 , X1 , ..., Xr ) = (−1)i+j ω([Xi , Xj ], X1 , ..., X̂i , ..., X̂j , ..., Xr ). (3.2.9)
0≤i<j≤r
⊔
⊓
1. It should measure how fast is a given section changing along a direction given by a
vector field X. Hence it has to be an operator
where we recall (see Definition 2.1.27) that C ∞ (E) denotes the space of smooth
sections of E over M .
98 CHAPTER 3. CALCULUS ON MANIFOLDS
2. If we think of the usual directional derivative, we expect that after “rescaling” the
direction X the derivative along X should only rescale by the same factor, i.e.,
∀f ∈ C ∞ (M ) : ∇f X u = f ∇X u.
∇X (f u) = (Xf )u + f ∇X u, ∀f ∈ C ∞ (M ), u ∈ C ∞ (E).
The conditions (a) and (b) can be rephrased as follows: for any u ∈ C ∞ (E), the map
∇u : Vect (M ) → C ∞ (E), X 7→ ∇X u,
is C ∞ (M )-linear so that it defines a bundle morphism (see Definition 2.1.30(b) and Ex-
ample 2.3.2)
∇u ∈ Hom(T M, E) ∼ = C ∞ (T ∗ M ⊗ E).
Summarizing, we can formulate the following definition.
Definition 3.3.1. A covariant derivative (or linear connection) on E is a K-linear map
∇ : C ∞ (E) → C ∞ (T ∗ M ⊗ E),
∇(f u) = df ⊗ u + f ∇u. ⊔
⊓
Example 3.3.2. Let KrM ∼ = Kr ×M be the rank r trivial vector bundle over M . The space
∞
C (KM ) of smooth sections coincides with the space C ∞ (M, Kr ) of Kr -valued smooth
functions on M . We can define
∇0 : C ∞ (M, Kr ) → C ∞ (M, T ∗ M ⊗ Kr )
is again a connection. ⊔
⊓
Ωk (F ) : =C ∞ (Λk T ∗ M ⊗ F ).
We will refer to these sections as differential k-forms with coefficients in the vector bundle
F. ⊔
⊓
3.3. CONNECTIONS ON VECTOR BUNDLES 99
Proposition 3.3.4. Let E be a vector bundle. The space A(E) of linear connections on
E is an affine space modeled on Ω1 (End (E)).
Proof. We first prove that A (E) is not empty. To see this, choose an open cover {Uα } of
M such that E |Uα is trivial ∀α. Next, pick a smooth partition of unity (µβ ) subordinated
to this cover.
Since E |Uα is trivial, it admits at least one connection, the trivial one, as in the above
example. Denote such a connection by ∇α . Now define
X
∇ := µα ∇α .
α
One checks easily that ∇ is a connection so that A(E) is nonempty. To check that A (E)
is an affine space, consider two connections ∇0 and ∇1 . Their difference A = ∇1 − ∇0 is
an operator
A : C ∞ (E) → C ∞ (T ∗ M ⊗ E),
satisfying A(f u) = f A(u), ∀u ∈ C ∞ (E). Thus,
A ∈ C ∞ Hom(E, T ∗ M ⊗ E) ∼ = C ∞ (T ∗ M ⊗ E ∗ ⊗ E) ∼
= Ω1 (E ∗ ⊗ E) ∼
= Ω1 End(E) .
Conversely, given ∇0 ∈ A(E) and A ∈ Ω1 End(E) one can verify that the operator
∇A = ∇0 + A : C ∞ (E) → Ω1 (E).
The tensorial operations on vector bundles extend naturally to vector bundles with
connections. The guiding principle behind this fact is the product formula.
where
h•, •i : C ∞ (E ∗ ) × C ∞ (E) → C ∞ (M )
is the pairing induced by the natural duality between the fibers of E ∗ and E.
∀T ∈ End(E), u ∈ C ∞ (E).
100 CHAPTER 3. CALCULUS ON MANIFOLDS
If ∇ is compatible with a metric h then, for any u, v ∈ C ∞ (E) and any X ∈ Vect (M ) we
have
X · h(u, v) = h(∇X Du, v) + h(u, ∇X v). (3.3.2)
It is often useful to have a local description of a covariant derivative. This can be obtained
using Cartan’s moving frame method.
Let E → M be a K-vector bundle of rank r over the smooth manifold M . Pick a
coordinate neighborhood U such E |U is trivial. A moving frame 2 is a bundle isomorphism
φ : KrU = Kr × M → E |U .
Consider the sections eα = φ(δα ), α = 1, ..., r, where δα are the natural basic sections
of the trivial bundle KrU . As x moves in U , the collection (e1 (x), ..., er (x)) describes a basis
of the moving fiber Ex , whence the terminology moving frame. A section u ∈ C ∞ (E |U )
can be written as a linear combination
u = uα eα uα ∈ C ∞ (U, K).
∇u = duα ⊗ eα + uα ∇eα .
Thus, the covariant derivative is completely described by its action on a moving frame.
To get a more concrete description, pick local coordinates (xi ) over U . Then ∇eα ∈
Ω1 (E |U ) so that we can write
The form Γ = dxi ⊗ Γi is called the connection 1-form associated to the choice of local
gauge. A moving frame allows us to identify sections of E |U with Kr -valued functions on
U , and we can rewrite (3.3.3) as
∇u = du + Γu. (3.3.4)
If the connection ∇ is compatible with a metric h and we choose the local moving frame
(eα ) defined to be h-orthonormal, then (3.3.2) implies that for every. vector field CX we
have
h(∇X eα , eβ ) = −h(eα , ∇X eβ ).
2
A moving frame is what physicists call a choice of local gauge.
3.3. CONNECTIONS ON VECTOR BUNDLES 101
Choosing local coordinates (xi ) we deduce that if ∇ is compatible with a metric h and
(eα ) is an orthonormal moving frame, then the associated connection 1-form
X
Γ= dxi ⊗ Γi
i
∇fα = Γ̂βα fβ .
Consider a section σ of E |U . With respect to the local frame (eα ) the section σ has a
decomposition
σ = uα eα ,
while with respect to (fβ ) it has a decomposition
σ = ûβ fβ .
u = gû. (3.3.5)
Now, we can identify the E-valued 1-form ∇σ with a Kr -valued 1-form in two ways: either
using the frame e, or using the frame f . In the first case, the derivative ∇σ is identified
with the Kr -valued 1-form
du + Γu,
while in the second case it is identified with
dû + Γ̂û.
du + Γu = g(dû + Γ̂û).
Hence
Γ̂ = g −1 dg + g−1 Γg.
The above relation is the transition rule relating two local gauge descriptions of the same
connection.
102 CHAPTER 3. CALCULUS ON MANIFOLDS
{moving f rames} ∼
= {local trivialization}
should be treated carefully. These are like an object and its image in a mirror, and there
is a great chance of confusing the right hand with the left hand.
∼
= ∼
=
More concretely, if tα : Eα −→ Kr × Uα (respectively tβ : Eβ −→ Kr × Uβ ) is a
trivialization of a bundle E over an open set Uα (respectively Uβ ), then the transition
map “from α to β” over Uα ∩ Uβ is gβα = tβ ◦ t−1 r
α . The standard basis in K , denoted by
(δi ), induces two local moving frames on E:
eα,i = t−1 −1
α (δi ) and eβ,i = tβ (δi ).
On the overlap Uα ∩ Uβ these two frames are related by the local gauge transformation
−1
eβ,i = gβα eα,i = gαβ eα,i .
This is precisely the opposite way the two trivializations are identified. ⊔
⊓
The above arguments can be reversed producing the following global result.
Proposition 3.3.5. Let E → M be a rank r smooth vector bundle, and (Uα ) a trivializing
cover with transition maps gαβ : Uα ∩Uβ → GL(r; K). Then any collection of matrix valued
1-forms Γα ∈ Ω1 (End KrUα ) satisfying
−1 −1 −1 −1
Γβ = (gαβ dgαβ ) + gαβ Γα gαβ = −(dgβα )gβα + gβα Γα gβα over Uα ∩ Uβ , (3.3.6)
We can use the local description in Proposition 3.3.5 to define the notion of pullback
of a connection. Suppose we are given the following data.
• A smooth map f : N → M .
• A rank r K-vector bundle E → M defined by the open cover (Uα ), and transition
maps gβα : Uα ∩ Uβ → GL(Kr ).
• A connection ∇ on E defined by the 1-forms Γα ∈ Ω1 End(KrUα ) satisfying the
gluing conditions (3.3.6).
Then, these data define a connection f ∗ ∇ on f ∗ E described by the open cover f −1 (Uα ),
transition maps gβα ◦ f and 1-forms f ∗ Γα . This connection is independent of the various
choices and it is called the pullback of ∇ by f
3.3. CONNECTIONS ON VECTOR BUNDLES 103
where
d
Γt = Γ = γ̇ Γ ∈ Ω0 End(Kr ) = End(Kr ).
dt
More explicitly, if the path γ(t) is given by the smooth map
t 7→ γ(t) = x1 (t), . . . , xn (t) ,
Γt eβ = ẋi Γαiβ eα .
1.
d∇ |Ω0 (E) = ∇,
2. ∀ω ∈ Ωr (M ), η ∈ Ωs (E)
d∇ (ω ∧ η) = dω ∧ η + (−1)r ω ∧ d∇ η.
Using a partition of unity one shows that any η ∈ Ωr (E) is a locally finite combination of
monomials as above so the above definition induces an operator Ωr (E) → Ωr+1 (E). We
let the reader check that this extension satisfies conditions (a) and (b) above.
Uniqueness. Any operator with the properties (a) and (b) acts on monomials as in (3.3.12)
so it has to coincide with the operator described above using a given partition of unity. ⊔
⊓
Example 3.3.9. The trivial bundle KM has a natural connection ∇0 - the trivial con-
nection. This coincides with the usual differential d : Ω0 (M ) ⊗ K → Ω1 (M ) ⊗ K. The
0
extension d∇ is the usual exterior derivative. ⊔
⊓
There is a major difference between the usual exterior derivative d, and an arbitrary
d∇ . In the former case we have d2 = 0, which is a consequence of the commutativity
[∂xi , ∂xj ] = 0, where (xi ) are local coordinates on M . In the second case, the equality
(d∇ )2 = 0 does not hold in general. Still, something very interesting happens.
Lemma 3.3.10. For any smooth function f ∈ C ∞ (M ), and any ω ∈ Ωr (E) we have
Proof. We compute
(d∇ )2 (f ω) = d∇ (df ∧ ω + f d∇ ω)
As a map Ω0 (E) → Ω2 (E), the operator (d∇ )2 can be identified with a section of
HomK (E, Λ2 T ∗ M ⊗R E) ∼
= E ∗ ⊗ Λ2 T ∗ M ⊗R E ∼
= Λ2 T ∗ M ⊗R EndK (E).
Definition 3.3.11. For any connection ∇ on a smooth vector bundle E → M , the object
(d∇ )2 ∈ Ω2 (EndK (E)) is called the curvature of ∇, and it is usually denoted by F (∇). ⊔
⊓
106 CHAPTER 3. CALCULUS ON MANIFOLDS
Example 3.3.12. Consider the trivial bundle KrM . The sections of this bundle are smooth
Kr -valued functions on M . The exterior derivative d defines the trivial connection on KrM ,
and any other connection differs from d by a Mr (K)-valued 1-form on M . If A is such a
form, then the curvature of the connection d + A is the 2-form F (A) defined by
The ∧-operation above is defined for any vector bundle E as the bilinear map
uniquely determined by
(ω r ⊗ A) ∧ (η s ⊗ B) = ω j ∧ η k ⊗ AB, A, B ∈ C ∞ End(E) . ⊔
⊓
We conclude this subsection with an alternate description of the curvature which hope-
fully will shed some light on its analytical significance.
Let E → M be a smooth vector bundle on M and ∇ a connection on it. Denote its
curvature by F = F (∇) ∈ Ω2 (End (E)). For any X, Y ∈ Vect (M ) the quantity F (X, Y )
is an endomorphism of E. In the remaining part of this section we will give a different
description of this endomorphism.
For any vector field Z, we denote by iZ : Ωr (E) → Ωr−1 (E) the C ∞ (M ) − linear
operator defined by
iZ (ω ⊗ u) = (iZ ω) ⊗ u, ∀ω ∈ Ωr (M ), u ∈ Ω0 (E).
For any vector field Z, the covariant derivative ∇Z : C ∞ (E) → C ∞ (E) extends naturally
as a linear operator Ωr (E) → Ωr (E), which we continue to denote by ∇Z , uniquely defined
by the requirements
∇Z (ω ⊗ u) := (LZ ω) ⊗ u + ω ⊗ ∇Z u.
The operators d∇ , iZ , ∇Z : Ω• (E) → Ω• (E) satisfy the usual super-commutation identi-
ties.
iZ d∇ + d∇ iZ = ∇Z . (3.3.13)
iX iY + iY iX = 0. (3.3.14)
∇X iY − iY ∇X = i[X,Y ] . (3.3.15)
For any u ∈ Ω0 (E) we compute using (3.3.13)-(3.3.15)
If in the above formula we take X = ∂xi and Y = ∂xj , where (xi ) are local coordinates on
M , and we set ∇i := ∇∂xi , ∇j = ∇∂xj , then we deduce
Fij = −Fji := F ∂xi , ∂xj = [∇i , ∇j ]. (3.3.17)
Thus, the endomorphism Fij measures the extent to which the partial derivatives ∇i , ∇j
fail to commute. This is in sharp contrast with the classical calculus and an analytically
oriented reader may object to this by saying we were careless when we picked the con-
nection. Maybe an intelligent choice will restore the classical commutativity of partial
derivatives so we should concentrate from the very beginning to covariant derivatives ∇
such that F (∇) = 0.
Definition 3.3.13. A connection ∇ such that F (∇) = 0 is called flat. ⊔
⊓
3.3.4 Holonomy
The reader may ask a very legitimate question: why have we chosen to name curvature,
the deviation from commutativity of a given connection. In this subsection we describe the
geometric meaning of curvature, and maybe this will explain this choice of terminology.
Throughout this subsection we will use Einstein’s convention.
Let E → M be a rank r smooth K-vector bundle, and ∇ a connection on it. Consider
local coordinates (x1 , ..., xn ) on an open subset U ⊂ M such that E |U is trivial. Pick a
108 CHAPTER 3. CALCULUS ON MANIFOLDS
moving frame (e1 , ..., er ) of E over U . The connection 1-form associated to this moving
frame is
Γ = Γi dxi = (Γαiβ )dxi , 1 ≤ α, β ≤ r.
It is defined by the equalities (∇i := ∇∂xi )
∇i eβ = Γαiβ eα . (3.3.20)
Using (3.3.17) we compute
Fij eβ = (∇i ∇j − ∇j ∇i )eβ = ∇i (Γj eβ ) − ∇j (Γi eβ )
α
∂Γjβ ∂Γαjβ γ α γ α
= − eα + Γ Γ
jβ iγ − Γ iβ jγ eα ,
Γ
∂xi ∂xj
so that
∂Γj ∂Γi
Fij = − + Γ Γ
i j − Γ Γ
j i . (3.3.21)
∂xi ∂xj
Though the above equation looks very complicated it will be the clue to understanding
the geometric significance of curvature.
p p
3 2
p p
0 1
Assume for simplicity that the point of coordinates (0, ..., 0) lies in U . Denote by
T1s the parallel transport (using the connection ∇) from (x1 , ..., xn ) to (x1 + s, x2 , ..., xn )
along the curve τ 7→ (x1 + τ, x2 , ..., xn ). Define T2t in a similar way using the coordinate
x2 instead of x1 .
Look at the parallelogram Ps,t in the “plane” (x1 , x2 ) described in Figure 3.1, where
p0 = (0, . . . , 0), p1 = (s, 0, . . . , 0), p2 = (s, t, 0, . . . , 0), p3 = (0, t, 0, . . . , 0).
We identify E|U with the trivial bundle E0 × U where E0 is the fiber of E over p0 . We now
perform the counterclockwise parallel transport along the boundary of Ps,t . The outcome
is a linear map Ts,t : E0 → E0 . Set F12 := F ( ∂x∂ 1 ∂x∂ 2 ) |(0,...,0) . F12 is also an endomorphism
of E0 .
Proposition 3.3.15. We have
∂2
F12 = − Ts,t .
∂s∂t
3.3. CONNECTIONS ON VECTOR BUNDLES 109
We see that the parallel transport of an element u ∈ E0 along a closed path may not
return it to itself. The curvature is an infinitesimal measure of this deviation.
The term C1 is an endomorphism of E0 whose exact form is not relevant to our computa-
tions. In the sequel the letter C (eventually indexed) will denote constants.
T2t T1s = 1 − tΓ2 (p1 ) + C2 t2 + O(t3 ) T1s
= 1 − tΓ2 (p1 ) + C2 t2 + O(t3 ) 1 − sΓ1 (p0 ) + C1 s2 + O(s3 )
= 1 − sΓ1 (p0 ) − tΓ2 (p1 ) + tsΓ2 (p1 )Γ1 (p0 ) + C1 s2 + C2 t2 + O(3).
Above and in the sequel O(k) denotes an error ≤ C(s2 + t2 )k/2 as s, t → 0. Now use the
approximation
∂Γ2
Γ2 (p1 ) = Γ2 (p0 ) + s 1 (p0 ) + O(2),
∂x
to deduce
( )
∂Γ
T2t T1s = 1 − sΓ1 − tΓ2 − st
2
− Γ2 Γ1 |p0
∂x1 (3.3.23)
+C1 s2 + C2 t2 + O(3).
Similarly, we have
T1−s T2t T1s = 1 + sΓ1 (p2 ) + C3 s2 + O(3) T2t T1s .
The Γ-term in the right-hand-side of the above equality can be approximated as
∂Γ1 ∂Γ1
Γ1 (p2 ) = Γ1 (p0 ) + s 1
(p0 ) + t 2 (p0 ) + O(2).
∂x ∂x
Using (3.3.23) we get after an elementary computation
∂Γ1 ∂Γ2
T1 T2 T1 = 1 − tΓ2 + st
−s t s
− + Γ2 Γ1 − Γ1 Γ2 |p0
∂x2 ∂x1 (3.3.24)
2 2
+C4 s + C5 t + O(3).
Finally, we have
T2−t T1−s T2t T1s = 1 + tΓ2 (p3 ) + C6 t2 + O(3) T1−s T2t T1s ,
with
∂Γ2
Γ2 (p3 ) = Γ2 (p0 ) + t (p0 ) + C7 t2 + O(3).
∂x2
110 CHAPTER 3. CALCULUS ON MANIFOLDS
Remark 3.3.16. If we had kept track of the various constants in the above computation
we would have arrived at the conclusion that C8 = C9 = 0, i.e.,
Alternatively, the constant C8 is the second order correction in the Taylor expansion of
s 7→ Ts,0 = 1, ∀s, so C8 = 0. The same goes for C9 . Thus we have
dTs,t dT√s,√s
−F12 = = .
dareaPs,t ds
Loosely speaking, the last equality states that the curvature is the the “amount of holon-
omy per unit of area”. ⊔
⊓
We see that the curvature measures the holonomy along infinitesimal parallelograms.
A connection can be viewed as an analytic way of trivializing a bundle. We can do so
along paths starting at a fixed point, using the parallel transport, but using different paths
ending at the same point we may wind up with trivializations which differ by a twist. The
curvature provides an infinitesimal measure of that twist.
Exercise 3.3.18. Prove that any vector bundle E over the Euclidean space Rn is trivial-
izable.
Hint: Use the parallel transport defined by a connection on the vector bundle E to
produce a bundle isomorphism E → E0 × Rn , where E0 is the fiber of E over the origin.
⊔
⊓
Proof. We will use the identities (3.3.13)–(3.3.15). For any vector fields X, Y , Z we have
End(E)
iX D E = ∇X − D E iX .
Hence,
End(E)
(D E F )(X, Y, Z) = iZ iY iX D E F = iZ iY (∇X − D E iX )F
End(E) End(E)
= iZ (∇X iY − i[X,Y ] )F − iZ (∇Y − D E iY )iX F
End(E)
= ∇X iZ iY − i[X,Z] iY − iZ i[X,Y ] F
End(E) End(E)
− ∇Y iZ iX − i[Y,Z]iX − ∇Z iY iX F
= i[X,Y ] iZ + i[Y,Z] iX + i[Z,X] iY F
End(E) End(E) End(E)
− ∇X iY iZ + ∇Y iZ iX + ∇Z iX iY F
= A(X, Y, Z) + A(Y, Z, X) + A(Z, X, Y ),
where
End(E)
A(X, Y, Z) := i[X,Y ] iZ − ∇X iY iZ F, ∀X, Y, Z ∈ Vect (M ).
We compute immediately
h i
i[X,Y ] iZ F = F (Z, [X, Y ]) = ∇E E E
Z , ∇[X,Y ] − ∇[Z,[X,Y ]] .
Example 3.3.20. Let K be the trivial line bundle over a smooth manifold M . Any
connection on K has the form ∇ω = d + ω, where d is the trivial connection, and ω is a
K-valued 1-form on M . The curvature of this connection is
F (ω) = dω.
The Bianchi identity is in this case precisely the equality d2 ω = 0. ⊔
⊓
112 CHAPTER 3. CALCULUS ON MANIFOLDS
The coefficients Γkij are usually known as the Christoffel symbols of the connection. As
usual we construct the curvature tensor
F (X, Y ) = [∇X , ∇Y ] − ∇[X,Y ] ∈ C∞ End(T M ) .
Then ∀f ∈ C ∞ (M )
T (f X, Y ) = T (X, f Y ) = f T (X, Y ),
so that T (•, •) is a tensor T ∈ Ω2 (T M ), i.e., a 2-form whose coefficients are vector fields
on M . The tensor T is called the torsion of the connection ∇. ⊔
⊓
The proof of this lemma is left to the reader as an exercise. In terms of Christoffel
symbols, the torsion has the description
We guess by now the reader is wondering how the mathematicians came up with
this object called torsion. In the remaining of this subsection we will try to sketch the
geometrical meaning of torsion.
To find such an interpretation, we have to look at the finer structure of the tangent
space at a point x ∈ M . It will be convenient to regard Tx M as an affine space modeled
by Rn , n = dim M . Thus, we will no longer think of the elements of Tx M as vectors, but
instead we will treat them as points. The tangent space Tx M can be coordinatized using
affine frames. These are pairs (p; e), where p is a point in Tx M , and e is a basis of the
3.3. CONNECTIONS ON VECTOR BUNDLES 113
underlying vector space. A frame allows one to identify Tx M with Rn , where p is thought
of as the origin.
Suppose that A, B are two affine spaces, both modelled by Rn , and (p; e) , (p; f ) are
affine frames of A and respectively B. Denote by (xi ) the coordinates in A induced by
the frame (p; e), and by (y j ) the coordinates in B induced by the frame (q; f ). An affine
isomorphism T : A → B can then be described using these coordinates as
T : Rnx → Rny x 7→ y = Sx + v,
This shows the origin x0 of Ax0 “drifts” by tX + O(t2 ). The frame (∂i ) suffers a parallel
transport measured as usual by 1 − tiX Γ + O(t2 ). The total affine transport will be
Exercise 3.3.24. Consider a smooth vector bundle E → M over the smooth manifold
M . We assume that both E and T M are equipped with connections and moreover the
connection on T M is torsionless. Denote by ∇ ˆ the induced connection on Λ2 T ∗ M ⊗
End (E). Prove that ∀X, Y, Z ∈ Vect (M )
ˆ X F (Y, Z) + ∇
∇ ˆ Y F (Z, X) + ∇ˆ Z F (X, Y ) = 0. ⊔
⊓
114 CHAPTER 3. CALCULUS ON MANIFOLDS
tβα := | det gβα |−r = | det gαβ |r : Uαβ → R>0 ֒→ GL(1, R).
Denote by C ∞ (|Λ|M ) the space of smooth sections of |Λ|M , and by C0∞ (|Λ|M ) its
subspace consisting of compactly supported densities.
It helps to have local descriptions of densities. To this aim, pick an open cover of M
consisting of coordinate neighborhoods (Uα ). Denote the local coordinates on Uα by (xiα ).
This choice of a cover produces a trivializing cover of T M with transition maps
!
∂xiα
Tαβ = ,
∂xjβ
1≤i,j≤n
where n is the dimension of M . Set δαβ = | det Tαβ |. A 1-density on M is then a collection
of functions µα ∈ C ∞ (Uα ) related by
−1
µα = δαβ µβ .
It may help to think that for each point p ∈ Uα the basis ∂x∂ 1 , ..., ∂x∂n of Tp M spans an
α α
infinitesimal parallelepiped and µα (p) is its “volume”. A change in coordinates should
be thought of as a change in the measuring units. The gluing rules describe how the
numerical value of the volume changes from one choice of units to another.
The densities on a manifold resemble in many respects the differential forms of maximal
degree. Denote by det T M = Λdim M T M the determinant line bundle of T M . A density
is a map
µ : C ∞ (det T M ) → C ∞ (M ),
3.4. INTEGRATION ON MANIFOLDS 115
such that µ(f e) = |f |µ(e), for all smooth functions f : M → R, and all e ∈ C ∞ (det T M ).
In particular, any smooth map φ : M → N between manifolds of the same dimension
induces a pullback transformation
φ∗ : C ∞ (|Λ|N ) → C ∞ (|Λ|M ),
described by
(φ∗ µ)(e) = µ (det φ∗ ) · e = | det φ∗ | µ(e), ∀e ∈ C ∞ (det T M ).
Example 3.4.2. (a) Consider the special case M = Rn . Denote by e1 , ..., en the canonical
basis. This extends to a trivialization of T Rn and, in particular, the bundle of densities
comes with a natural trivialization. It has a nowhere vanishing section |dvn | defined by
In this case, any smooth density on Rn takes the form µ = f |dvn |, where f is some smooth
function on Rn . The reader should think of |dvn | as the standard Lebesgue measure on
Rn .
If φ : Rn → Rn is a smooth map, viewed as a collection of n smooth functions
then, !
∂φi
φ∗ (|dvn |) = det · |dvn |.
∂xj
(b) Suppose M is a smooth manifold of dimension m. Then any top degree form ω ∈
Ωm (M ) defines a density |ω| on M which associates to each section e ∈ C ∞ (det T M ) the
smooth function
x 7→ |ωx ( e(x) )|
Observe that |ω| = | − ω|, so this map Ωm (M ) → C ∞ (|Λ|M ) is not linear.
(c) Suppose g is a Riemann metric on the smooth manifold M . The volume density
defined by g is the density denoted by |dVg | which associates to each e ∈ C ∞ (det T M )
the pointwise length
x 7→ |e(x)|g .
If (Uα , (xiα ) ) is an atlas of M , then on each Uα we have top degree forms
to which we associate the density |dxα |. In the coordinates (xiα ) the metric g can be
described as X
g= gα;ij dxiα ⊗ dxjα .
i,j
We denote by |gα | the determinant of the symmetric matrix gα = (gα;ij )1≤i,j≤m . Then the
restriction of |dVg | to Uα has the description
p
|dVg | = |gα | |dxα |. ⊔
⊓
116 CHAPTER 3. CALCULUS ON MANIFOLDS
The importance of densities comes from the fact that they are exactly the objects that
can be integrated. More precisely, we have the following abstract result.
Proposition 3.4.3. There exists a natural way to associate to each smooth manifold M
a linear map Z
: C0∞ (|Λ|M ) → R
M
uniquely
R defined by the following conditions.
(a) M is invariant under diffeomorphisms, i.e., for any smooth manifolds M , N of the
same dimension n, any diffeomorphism φ : M → N , and any µ ∈ C0∞ (|Λ|M ), we have
Z Z
∗
φ µ= µ.
M N
R
(b) M is a local operation, i.e., for any open set U ⊂ M , and any µ ∈ C0∞ (|Λ|M ) with
supp µ ⊂ U , we have Z Z
µ= µ.
M U
(c) For any ρ ∈ C0∞ (Rn ) we have
Z Z
ρ|dvn | = ρ(x)dx,
Rn Rn
where inR the right-hand-side stands the Lebesgue integral of the compactly supported func-
tion ρ. M is called the integral on M .
(i) A smooth partition of unity A ⊂ C0∞ (M ) such that ∀α ∈ A the support supp α lies
entirely in some precompact coordinate neighborhood Uα , and such that the cover
(Uα ) is locally finite.
(ii) For each Uα we pick a collection of local coordinates (xiα ), and we denote by |dxα |
(n = dim M ) the density on Uα defined by
∂ ∂
|dxα | ∧ ... ∧ n = 1.
∂x1α ∂xα
For any µ ∈ C ∞ (|Λ|), the product αµ is a density supported in Uα , and can be written
as
αµ = µα |dxα |,
where µα is some smooth function compactly supported on Uα . The local coordinates
allow us to interpret µα as a function on Rn . Under this identification |dxiα | corresponds
to the Lebesgue measure |dvn | on Rn , and µα is a compactly supported, smooth function.
We set Z Z
αµ := µα |dxα |.
Uα Rn
3.4. INTEGRATION ON MANIFOLDS 117
Finally, define
Z A Z XZ
def
µ= µ = αµ.
M M α∈A Uα
The above sum contains only finitely many nonzero terms since supp µ is compact, and
thus it intersects only finitely many of the Uα′ s which form a locally finite cover.
To prove property (a) we will first prove that the integral defined as above is indepen-
dent of the various choices: the partition of unity A ⊂ C0∞ (M ), and the local coordinates
(xiα )α∈A .
• Independence of coordinates. Fix the partition of unity A, and consider a new
collection of local coordinates (yαi ) on each Uα . These determine two densities |dxiα | and
respectively |dyαj |. For each µ ∈ C0∞ (|Λ|M ) we have
Similarly Z XZ
βµ = αβµ. (3.4.2)
Uβ α Uαβ
Summing (3.4.1) over α and (3.4.2) over β we get the desired conclusion.
118 CHAPTER 3. CALCULUS ON MANIFOLDS
Property (c) is clear since, for M = Rn , we can assume that all the local coordinates chosen
are Cartesian. The uniqueness of the integral is immediate, and we leave the reader to fill
in the details. ⊔
⊓
We see that det T ∗ M is trivializable if and only if it admits a nowhere vanishing section.
Such a section is called a volume form on M . We say that two volume forms ω1 and ω2
are equivalent if there exists f ∈ C ∞ (M ) such that
ω2 = ef ω1 .
3.4. INTEGRATION ON MANIFOLDS 119
This is indeed an equivalence relation, and an equivalence class of volume forms will be
called an orientation of the manifold. We denote by Or(M ) the set of orientations on
the smooth manifold M . A pair (orientable manifold, orientation) is called an oriented
manifold.
Let us observe that if M is orientable, and thus Or(M ) 6= ∅, then for every point
p ∈ M we have a natural map
Proof. Suppose or and or ′ are two orientations on M such that or p = or ′p . The function
This shows that the two forms ω ′ and ω are equivalent and thus or = or ′ . ⊔
⊓
120 CHAPTER 3. CALCULUS ON MANIFOLDS
Example 3.4.7. Suppose M is a manifold such that the Whitney sum RkM ⊕ T M is
trivial. Then M is orientable. Indeed, we have
Both det Rk and det(Rk ⊕ T M ) are trivial. We deduce det T M is trivial since
det T M ∼
= det(Rk ⊕ T M ) ⊗ (det Rk )∗ .
This trick works for example when M ∼ = S n . Indeed, let ν denote the normal line bundle.
The fiber of ν at a point p ∈ S n is the 1-dimensional space spanned by the position vector
of p as a point in Rn ; (see Figure 3.3). This is clearly a trivial line bundle since it has a
tautological nowhere vanishing section p 7→ p ∈ νp . The line bundle ν has a remarkable
feature:
ν ⊕ T S n = Rn+1 .
Hence all spheres are orientable. ⊔
⊓
3.4. INTEGRATION ON MANIFOLDS 121
Rn+1 ∼
= νp ⊕ Tp S n .
An element ω ∈ det Tp S n defines the canonical orientation if p~ ∧ ω ∈ det Rn+1 defines the
canonical orientation of Rn+1 . Above, by ~p we denoted the position vector of p as a point
inside the Euclidean space Rn+1 . We can think of p~ as the “outer” normal to the round
sphere. We call this orientation outer normal first. When n = 1 it coincides with the
counterclockwise orientation of the unit circle S 1 . ⊔
⊓
Lemma 3.4.8. A smooth manifold M is orientable if and only if there exists an open
cover (Uα )α∈A , and local coordinates (x1α , ..., xnα ) on Uα such that
!
∂xiα
det > 0 on Uα ∩ Uβ . (3.4.3)
∂xjβ
Proof. 1. We assume that there exists an open cover with the properties in the lemma,
and we will prove that det T ∗ M is trivial by proving that there exists a volume form.
Consider a partition of unity B ⊂ C0∞ (M ) subordinated to the cover (Uα )α∈A , i.e.,
there exists a map ϕ : B → A such that
supp β ⊂ Uϕ(β) ∀β ∈ B.
Define X
ω := βωϕ(β) ,
β
where for all α ∈ A we define ωα := dx1α ∧ · · · ∧ dxnα . The form ω is nowhere vanishing
since condition (3.4.3) implies that on an overlap Uα1 ∩ · · · ∩ Uαm the forms ωα1 , ..., ωαm
differ by a positive multiplicative factor.
2. Conversely, let ω be a volume form on M and consider an atlas (Uα ; (xiα )). Then
where the smooth functions µα are nowhere vanishing, and on the overlaps they satisfy
the gluing condition !
∂xiα µβ
∆αβ = det j = .
∂xβ µα
A permutation ϕ of the variables x1α , ..., xnα will change dx1α ∧ · · · ∧ dxnα by a factor ǫ(ϕ) so
we can always arrange these variables in such an order so that µα > 0. This will insure
the positivity condition
∆αβ > 0.
122 CHAPTER 3. CALCULUS ON MANIFOLDS
We can rephrase the result in the above lemma in a more conceptual way using the
notion of orientation bundle. Suppose E → M is a real vector bundle of rank r on the
smooth manifold M described by the open cover (Uα )α∈A , and gluing cocycle
The orientation bundle associated to E is the real line bundle Θ(E) → M described by
the open cover (Uα )α∈A , and gluing cocycle
Using Lemma 3.4.8 and Proposition 2.2.75 we deduce the following result.
Proposition 3.4.12. Any complex manifold is orientable. In particular, the complex
Grassmannians Grk (Cn ) are orientable. ⊔
⊓
The reader can check immediately that the product of two orientable manifolds is
again an orientable manifold. Using connected sums and products we can now produce
many examples of manifolds. In particular, the connected sums of g tori is an orientable
manifold.
By now the reader may ask where does orientability interact with integration. The
answer lies in Subsection 2.2.4 where we showed that an orientation or on a vector space
V induces a canonical, linear isomorphism ıor : det V ∗ → |Λ|V ; see (2.2.13).
Similarly, an orientation or on a smooth manifold M defines an isomorphism
For any compactly supported differential form ω on M of maximal degree we define its
integral by Z Z
ω := ıor ω.
M M
3.4. INTEGRATION ON MANIFOLDS 123
ϕ P
r
y
O
h ∂xi i
β
det j 1≤,i,j≤M
> 0, n = dim M, (3.4.4)
∂xα
and on each coordinate patch Uα the orientation is given by the top degree form dxα =
dx1α ∧ · · · ∧ dxnα .
A top degree differential form ω is described by a collection of forms
ωα = ρα dxα , ρα ∈ C ∞ (Uα ),
satisfy µα = µβ on the overlap Uαβ . Thus they glue together to a density on M , which is
precisely ıor ω.
Example 3.4.14. Consider Rthe 2-form on R3 , ω = xdy ∧ dz, and let S 2 denote the unit
sphere. We want to compute S 2 ω |S 2 , where S 2 has the canonical orientation. To compute
this integral we will use spherical coordinates (r, ϕ, θ). These are defined by (see Figure
3.4.)
x = r sin ϕ cos θ
y = r sin ϕ sin θ .
z = r cos ϕ
At the point p = (1, 0, 0) we have
∂r = ∂x = p~, ∂θ = ∂y ∂ϕ = −∂z ,
124 CHAPTER 3. CALCULUS ON MANIFOLDS
xdy ∧ dz |S 2 = sin ϕ cos θ (cos θ sin ϕdθ + sin θ cos ϕdϕ)) ∧ (− sin ϕ)dϕ
4π
= = volume of the unit ball B 3 ⊂ R3 .
3
As we will see in the next subsection the above equality is no accident. ⊔
⊓
The quantity ∆(h) is independent of h because Rh∗ dVG is a left invariant form, so it has
to be a scalar multiple of dVG . Since (Rh1 h2 )∗ = (Rh2 Rh1 )∗ = Rh∗ 1 Rh∗ 2 we deduce
G → (R \ {0}, ·).
Since G is connected ∆(G) ⊂ R+ , and since G is compact, the set ∆(G) is bounded.
If there exists x ∈ G such that ∆(x) 6= 1, then either ∆(x) > 1, or ∆(x−1 ) > 1, and
in particular, we would deduce the set (∆(xn ))n∈Z is unbounded. Thus ∆ ≡ 1 which
establishes the right invariance of dVG .
3.4. INTEGRATION ON MANIFOLDS 125
The invariant measure dVG provides a very simple way of producing invariant objects
on G. More precisely, if T is tensor field on G, then for each x ∈ G define
Z
ℓ
T x = ((Lg )∗ T )x dVG (g).
G
Exercise 3.4.16. Let G be a Lie group. For any X ∈ LG denote by ad(X) the linear
map LG → LG defined by
LG ∋ Y 7→ [X, Y ] ∈ LG .
(a) If ω denotes a left invariant volume form prove that ∀X ∈ LG
LX ω = − tr ad(X)ω.
(b) Prove that if G is a compact Lie group, then tr ad(X) = 0, for any X ∈ LG . ⊔
⊓
where f : [a, b] → R is a smooth function and df = f ′ (t)dt. In fact, the higher dimensional
formula will follow from the simplest 1-dimensional situation.
We will spend most of the time finding the correct formulation of the general version,
and this requires the concept of manifold with boundary. The standard example is the
lower half-space
Hn− = { (x1 , ..., xn ) ∈ Rn ; x1 ≤ 0 }.
Definition 3.4.17. A smooth manifold with boundary of dimension n is a topological
space with the following properties.
(c) For each point p ∈ ∂M := M \ M 0 , there exist smooth local coordinates (x1 , ..., xn )
defined on an open neighborhood N of p in Mf such that
(c1 ) M 0 ∩ N = q ∈ N; x1 (q) < 0, .
(c2 ) ∂M ∩ N = { x1 = 0 }.
Example 3.4.18. (a) A closed interval I = [a, b] is a smooth 1-dimensional manifold with
boundary ∂I = {a, b}. We can take M f = R.
(b) The closed unit ball B ⊂ R is an orientable manifold with boundary ∂B 3 = S 2 .
3 3
We can take M f = R3 .
(c) Suppose X is a smooth manifold, and f : X → R is a smooth function such that
0 ∈ R is a regular value of f , i.e.,
f (x) = 0 =⇒ df (x) 6= 0.
Define M := x ∈ X; f (x) ≤ 0 . A simple application of the implicit function theorem
shows that the pair (X, M ) defines a manifold with boundary. Note that examples (a)
and (b) are special cases of this construction. In the case (a) we take X = R, and
f (x) = (x−a)(x−b), while in the case (b) we take X = R3 and f (x, y, z) = (x2 +y 2 +z 2 )−1.
⊔
⊓
Exercise 3.4.20. Prove that any manifold with boundary is diffeomorphic to a manifold
with boundary constructed via the process described in Example 3.4.18(c). ⊔
⊓
Proposition 3.4.21. Let M be a smooth manifold with boundary. Then its boundary
∂M is also a smooth manifold of dimension dim ∂M = dim M − 1. Moreover, if M is
orientable, then so is its boundary. ⊔
⊓
is positively oriented. The differential dx1 is usually called an outer conormal since
x1 increases as we go towards the exterior of M . −dx1 is then the inner conormal for
analogous reasons. The rule by which we get the induced orientation on the boundary can
be rephrased as
We may call this rule “outer normal first” for obvious reasons. ⊔
⊓
Example 3.4.22. The canonical orientation on S n ⊂ Rn+1 coincides with the induced
orientation of S n+1 as the boundary of the unit ball B n+1 . ⊔
⊓
Exercise 3.4.23. Consider the hyperplane Hi ⊂ Rn defined by the equation {xi = 0}.
Prove that the induced orientation of Hi as the boundary of the half-space Hn,i i
+ = {x ≥ 0}
ci ∧ · · · dxn where, as usual, the hat indicates
is given by the (n − 1)-form (−1)i dx1 ∧ · · · ∧ dx
a missing term. ⊔
⊓
In the above formula d denotes the exterior derivative, and ∂M has the induced orientation.
Proof. Via partitions of unity the verification is reduced to the following two situations.
Case 1. The (n − 1)-form ω is compactly supported in Rn . We have to show
Z
dω = 0.
Rn
ω = f (x)dx2 ∧ · · · ∧ dxn ,
where f (x) is a compactly supported smooth function. The general case is a linear com-
bination of these special situations. We compute
Z Z Z Z
∂f 1 n ∂f
dω = 1
dx ∧ · · · ∧ dx = 1
1
dx dx2 ∧ · · · ∧ dxn = 0,
Rn Rn ∂x Rn−1 R ∂x
since Z
∂f
dx1 = f (∞, x2 , ..., xn ) − f (−∞, x2 , ..., xn ),
R ∂x1
and f has compact support.
128 CHAPTER 3. CALCULUS ON MANIFOLDS
Then !
X ∂f
dω = (−1)i+1 i dx1 ∧ · · · ∧ dxn .
∂x
i
One verifies as in Case 1 that
Z
∂f 1
i
dx ∧ · · · ∧ dxn = 0 for i 6= 1.
n
H− ∂x
For i = 1 we have
Z Z Z 0
∂f 1 n ∂f
1
dx ∧ · · · ∧ dx = 1
1
dx dx2 ∧ · · · ∧ dxn
n
H+ ∂x R n−1 −∞ ∂x
Z
= f (0, x2 , ..., xn ) − f (−∞, x2 , ..., xn ) dx2 ∧ · · · ∧ dxn
Rn−1
Z Z
2 n 2 n
= f (0, x , ..., x )dx ∧ · · · ∧ dx = ω.
Rn−1 ∂Hn
−
The last equality follows from the fact that the induced orientation on ∂Hn− is given by
dx2 ∧ · · · ∧ dxn . This concludes the proof of the Stokes formula. ⊔
⊓
Example 3.4.26.
Z Z
4π
xdy ∧ dz = dx ∧ dy ∧ dz = vol (B 3 ) = . ⊔
⊓
S2 B3 3
Remark 3.4.27. The above considerations extend easily to more singular situations.
For example, when M is the cube [0, 1]n its topological boundary is no longer a smooth
manifold. However, its singularities are inessential as far as integration is concerned. The
Stokes formula continues to hold
Z Z
dω = ω ∀ω ∈ Ωn−1 (I n ).
In ∂I
The boundary is smooth outside a set of measure zero and is given the induced orientation:
“ outer (co)normal first”. The above equality can be used to give an explanation for the
terminology “exterior derivative” we use to call d. Indeed if ω ∈ Ωn−1 (Rn ) and Ih = [0, h]
then we deduce Z
−n
dω |x=0 = lim h ω. (3.4.6)
h→0 ∂Ihn
dg10 dz
ω1 = − + ω0 = n + ω0 .
g10 z
Let E + be the Equator equipped with the induced orientation as the boundary of the
northern hemisphere, and E − the equator with the opposite orientation, as the boundary
of the southern hemisphere. The orientation of E + coincides with the orientation given
by the form dθ, where z = exp(iθ).
We deduce from the Stokes formula (which works for complex valued forms as well)
that Z Z Z
ω0 = 0 ω1 = − ω1 = 0.
E+ E+ E−
On the other hand, over the Equator, we have
dz
ω1 − ω0 = n = nidθ,
z
from which we deduce
Z Z
0= ω0 − ω1 = ni dθ = 2nπi !!!
E+ E+
Thus there exist no flat connections on the line bundle Ln , n 6= 0, and at fault is the
gluing cocycle defining L. In a future chapter we will quantify the measure in which the
gluing data obstruct the existence of flat connections. ⊔
⊓
such that the map T (g) is linear for any g. One also says that V has a structure of
G-module. If V is a real (respectively complex) vector space, then it is said to be a real
(respectively complex) G-module.
Example 3.4.29. Let V = Cn . Then G = GL(n, C) acts linearly on V in the tautological
manner. Moreover V ∗ , V ⊗k , Λm V and S ℓ V are complex G-modules. ⊔
⊓
Example 3.4.30. Suppose G is a Lie group with Lie algebra LG − T1 G. For every g ∈ G,
the conjugation
C g : G → G, h 7→ C g (h) = ghg −1 ,
sends the identity 1 ∈ G to itself. We denote by Adg : LG → L the differential of
h 7→ C g (h) at h = 1. The operator Adg is a linear isomorphism of LG , and the resulting
map
G 7→ Aut(LG ), g 7→ Adg ,
is called adjoint representation of G.
For example, if G = SO(n), then LG = so(n), and the adjoint representation Ad :
SO(n) → Aut(so(n)), can be given the more explicit description
Ad(g)
so(n) ∋ X 7−→ gXg −1 ∈ so(n), ∀g ∈ SO(n). ⊔
⊓
Exercise 3.4.31. Let G be a Lie group, with Lie algebra LG . Prove that for any X, Y ∈
LG we have
d
|t=0 Adexp(tX) (Y ) = ad(X)Y = [X, Y ]. ⊔
⊓
dt
Definition 3.4.32. A morphism of G-modules V1 and V2 is a linear map L : V1 → V2
such that, for any g ∈ G, the diagram below is commutative, i.e., T2 (g)L = LT1 (g).
V1
L
w V2
T1 (g) T2.(g)
u u
V1
L
w V2
The space of morphisms of G-modules is denoted by HomG (V1 , V2 ). The collection of
isomorphisms classes of complex G-modules is denoted by G − M od. ⊔
⊓
3.4. INTEGRATION ON MANIFOLDS 131
Proposition 3.4.33. The direct sum “⊕”, and the tensor product “⊗” define a structure
of semi-ring with 1 on G − M od. 0 is represented by the null representation {0}, while 1
is represented by the trivial module G → Aut (C), g 7→ 1. ⊔
⊓
Example 3.4.34. Let Ti : G → Aut (Ui ) (i = 1, 2) be two complex G-modules. Then U1∗
is a G-module given by (g, u∗ ) 7→ T1 (g−1 )† u∗ . Hence Hom (U1 , U2 ) is also a G-module.
Explicitly, the action of g ∈ G is given by
We see that HomG (U1 , U2 ) can be identified with the linear subspace in Hom (U1 , U2 )
consisting of the linear maps U1 → U2 unchanged by the above action of G. ⊔
⊓
Proposition 3.4.35 (Weyl’s unitary trick). Let G be a compact Lie group, and V a
complex G-module. Then there exists a Hermitian metric h on V which is G-invariant,
i.e., h(gv1 , gv2 ) = h(v1 , v2 ), ∀v1 , v2 ∈ V .
where dVG (g) denotes the normalized bi-invariant measure on G. One can now check
easily that h is G-invariant. ⊔
⊓
χV : G → C, χV (g) := tr T (g).
Theorem 3.4.42. Let G be a compact Lie group, U1 , U2 complex G-modules and χUi
their characters. Then the following hold.
(a)χU1 ⊕U2 = χU1 + χU2 , χU1 ⊗U2 = χU1 · χU1 .
(b) χUi (1) = dim Ui .
(c) χUi∗ = χUi -the complex conjugate of χUi .
(d) Z
χUi (g)dVG (g) = dim UiG ,
G
(e) Z
χU1 (g) · χU2 (g)dVG (g) = dim HomG (U2 , U1 ).
G
Proof. The parts (a) and (b) are left to the reader. To prove (c), fix an invariant Hermitian
metric on U = Ui . Thus, each T (g) is a unitary operator on U . The action of G on U ∗ is
given by T (g−1 )† . Since T (g) is unitary, we have T (g −1 )† = T (g). This proves (c).
Proof of (d). Consider
Z
P : U → U, P u = T (g)u dVG (g).
G
Proof of (e).
Z Z Z Z
χU1 · χU2 dVG (g) = χU1 · χU2∗ dVG (g) = χU1 ⊗U2∗ dVG (g) = χHom(U2 ,U1 )
G G G G
G
= dimC (Hom (U2 , U1 )) = dimC HomG (U2 , U1 ),
U = ⊕m
1 (mi Ui ) V = ⊕ℓ1 (nj Vj ).
P P
Hence χU = mi χVi and χV = nj χVj . The equivalence “representation” ⇐⇒ “char-
acters” stated by this corollary now follows immediately from Schur’s lemma and the
previous corollary. ⊔
⊓
Thus, the problem of describing the representations of a compact Lie group boils down
to describing the characters of its irreducible representations. This problem was completely
solved by Hermann Weyl, but its solution requires a lot more work that goes beyond the
scope of this book. We will spend the remaining part of this subsection analyzing the
equality (d) in Theorem 3.4.42.
Describing the invariants of a group action was a very fashionable problem in the
second half of the nineteenth century. Formula (d) mentioned above is a truly remarkable
result. It allows (in principle) to compute the maximum number of linearly independent
invariant elements.
Let V be a complex G-module and denote by χV its character. The complex exterior
algebra Λ•c V ∗ is a complex G-module, as the space of complex multi-linear skew-symmetric
maps
V × · · · × V → C.
3.4. INTEGRATION ON MANIFOLDS 135
Denote by bck (V ) the complex dimension of the space of G-invariant elements in Λkc V ∗ .
One has the equality Z
bck (V ) = χΛkc V ∗ dVG (g).
G
These facts can be presented coherently by considering the Z-graded vector space
M
I•c (V ) := Λkinv V ∗ .
k
To obtain a more concentrated formulation of the above equality we need to recall some
elementary facts of linear algebra.
For each endomorphism A of V denote by σk (A) the trace of the endomorphism
Λk A : Λk V → Λk V.
Equivalently, (see Exercise 2.2.25) the number σk (A) is the coefficient of tk in the charac-
teristic polynomial
σt (A) = det(1V + tA).
Explicitly, σk (A) is given by the sum
X
σk (A) = det aiα iβ (n = dim V ).
1≤i1 ···ik ≤n
Consider now the following situation. Let V be a complex G-module. Denote by Λ•r V the
space of R-multi-linear, skew-symmetric maps
V × · · · × V → R.
The vector space Λ•r V ∗ is a real G-module. We complexify it, so that Λ•r V ⊗ C is the space
of R-multi-linear, skew-symmetric maps
V × · · · × V → C,
136 CHAPTER 3. CALCULUS ON MANIFOLDS
and as such, it is a complex G-module. The real dimension of the subspace Ikr (V ) of
G-invariant elements in Λkr V ∗ will be denoted by brk (V ), so that the Poincaré polynomial
of I•r (V ) = ⊕k Ikr is X
PI•r (V ) (t) = tk brk (V ).
k
On the other hand, ) is equal to the complex dimension of Λkr V ∗ ⊗ C. Using the
brk (V
results of Subsection 2.2.5 we deduce
∗ M ∗
Λ•r V ⊗ C ∼
= Λ•c V ∗ ⊗C Λ•c V = ⊕i+j=k Λic V ∗ ⊗ Λjc V . (3.4.9)
k
Each of the above summands is a G-invariant subspace. Using (3.4.7) and (3.4.9) we
deduce
XZ X
PI•r (V ) (t) = σi T (g) σj T (g) ti+j dVG (g)
k G i+j=k
Z
= det 1V + tT (g) det 1V + t T (g) dVG (g) (3.4.10)
ZG
= det 1V + tT (g) 2
dVG (g).
G
We will have the chance to use this result in computing topological invariants of man-
ifolds with a “high degree of symmetry” like, e.g., the complex Grassmannians.
We will first define a fibered version of integration. This requires a fibered version of
orientability.
Definition 3.4.45. Let p : E → B be a smooth bundle with standard fiber F . The
bundle is said to be orientable if the following hold.
3.4. INTEGRATION ON MANIFOLDS 137
Ψβ ◦ Ψ−1
α : F × Uαβ → F × Uαβ (Uαβ = Uα ∩ Uβ )
are fiberwise orientation preserving, i.e., for each y ∈ Uαβ , the diffeomorphism
F ∋ f 7→ Ψαβ (f, y) ∈ F
If ω = f dxI ∧ dy J , f ∈ Ccpt
∞ (Rr+m ), then
Z
R 0 , |I| =
6 r
= .
E/B Rr f dxI dy J, |I| = r
R
The operator E/B is called the integration-along-fibers operator. ⊔
⊓
138 CHAPTER 3. CALCULUS ON MANIFOLDS
The proof goes exactly as in the non-parametric case, i.e., when B is a point. One
shows using partitions of unity that these local definitions can be patched together to
produce a well defined map
Z
: Ω•cpt (E) → Ω•−r
cpt (B).
E/B
dim E we have Z Z
dE ω = (−1)r dB ω.
E/B E/B
p∗ (ω ∧ p∗ η) = p∗ ω ∧ η. (3.4.11)
!
Z Z X ∂f Z
X ∂f
dE ω = dxi ∧ dxI + (−1)|I| dxI ∧ dy j ∧ dy J .
E/B Rr ∂xi Rr ∂y j
i j
The above integrals are defined to be zero if the corresponding forms do not have degree
r. Stokes’ formula shows that the first integral is always zero. Hence
Z Z X Z
∂ |I| I j J r
dE ω = (−1) dx ∧ dy ∧ dy = (−1) dB ω.
E/B ∂y j Rr E/B
j
p
(c) Consider the fibration R2 → R, (x, y) 7→ t = ax + by, a2 + b2 6= 0. Compute the
Gelfand–Leray residue dx∧dy
dt along the fiber p(x, y) = 0. ⊔
⊓
p : [0, 1] × M → M (t; m) 7→ m,
defines a ∂-bundle. The interior fiber is the open interval (0, 1). The fiber of p : ∂(I ×M ) →
M is the disjoint union of two points. ⊔
⊓
• Interior models. Rr × Rm → Rm
This is “the mother of all homotopy formulæ”. It will play a crucial part in Chapter 7
when we embark on the study of DeRham cohomology.
Exercise 3.4.55. Prove the above theorem. ⊔
⊓
Chapter 4
Riemannian Geometry
Now we can finally put to work the abstract notions discussed in the previous chapters.
Loosely speaking, the Riemannian geometry studies the properties of surfaces (manifolds)
“made of canvas”. These are manifolds with an extra structure arising naturally in many
instances. On such manifolds one can speak of the length of a curve, and the angle between
two smooth curves. In particular, we will study the problem formulated in Chapter 1: why
a plane (flat) canvas disk cannot be wrapped in an one-to-one fashinon around the unit
sphere in R3 . Answering this requires the notion of Riemann curvature which will be the
central theme of this chapter.
141
142 CHAPTER 4. RIEMANNIAN GEOMETRY
tangent space is endowed with an inner product (which can vary from point to point in a
smooth way).
Definition 4.1.1. (a) A Riemann manifold is a pair (M, g) consisting of a smooth
manifold M and a metric g on the tangent bundle, i.e., a smooth, symmetric positive
definite (0, 2)–tensor field on M . The tensor g is called a Riemann metric on M .
(b) Two Riemann manifolds (Mi , gi ) (i = 1, 2) are said to be isometric if there exists a
diffeomorphism φ : M1 → M2 such that φ∗ g2 = g1 . ⊔
⊓
gx : Tx M × Tx M → R
is an inner product on the tangent space Tx M . We will frequently use the alternative
notation (•, •)x = gx (•, •). The length of a tangent vector v ∈ Tx M is defined as usual,
Proposition 4.1.2. Let M be a smooth manifold, and denote by RM the set of Riemann
metrics on M . Then RM is a non-empty convex cone in the linear space of symmetric
(0, 2)–tensors.
Proof. The only thing that is not obvious is that RM is non-empty. We will use again
partitions of unity. Cover M by coordinate neighborhoods (Uα )α∈A . Let (xiα ) be a collec-
tion of local coordinates on Uα . Using these local coordinates we can construct by hand
the metric gα on Uα by
gα = (dx1α )2 + · · · + (dxnα )2 .
Now, pick a partition of unity B ⊂ C0∞ (M ) subordinated to the cover (Uα )α∈A , i.e., there
exits a map φ : B → A such that ∀β ∈ B supp β ⊂ Uφ(β) . Then define
X
g= βgφ(β) .
β∈B
The reader can check easily that g is well defined, and it is indeed a Riemann metric on
M. ⊔
⊓
Example 4.1.3 (The Euclidean space). The space Rn has a natural Riemann metric
g0 = (dx1 )2 + · · · + (dxn )2 .
Figure 4.2: A plane sheet and a half cylinder are “not so different”.
Remark 4.1.5. On any manifold there exist many Riemann metrics, and there is no
natural way of selecting one of them. One can visualize a Riemann structure as defining
a “shape” of the manifold. For example, the unit sphere x2 + y 2 + z 2 = 1 is diffeomorphic
2 2 2
to the ellipsoid x12 + y22 + z32 = 1, but they look “different ” (see Figure 4.1). However,
appearances may be deceiving. In Figure 4.2 it is illustrated the deformation of a sheet
144 CHAPTER 4. RIEMANNIAN GEOMETRY
of paper to a half cylinder. They look different, but the metric structures are the same
since we have not changed the lengths of curves on our sheet. The conclusion to be drawn
from these two examples is that we have to be very careful when we use the attribute
“different”. ⊔
⊓
Example 4.1.6 (The hyperbolic plane). The Poincaré model of the hyperbolic plane is
the Riemann manifold (D, g) where D is the unit open disk in the plane R2 and the metric
g is given by
1
g= (dx2 + dy 2 ). ⊔
⊓
1 − x2 − y 2
Exercise 4.1.7. Let H denote the upper half-plane
z+i
z = x + iy 7→ w = −i = u + iv
z−i
establishes an isometry (D, g) ∼
= (H, h). ⊔
⊓
Example 4.1.8 (Left invariant metrics on Lie groups). Consider a Lie group G, and
denote by LG its Lie algebra. Then any inner product h·, ·i on LG induces a Riemann
metric h = h·, ·ig on G defined by
G ∋ g 7→ h·, ·ig
L∗g h = h ∀g ∈ G.
If G is also compact, we can use the averaging technique of Subsection 3.4.2 to produce
metrics which are both left and right invariant. ⊔
⊓
(i) A line segment is the shortest path connecting two given points.
γ̈(t) = 0. (4.1.1)
Since we have not said anything about calculus of variations which deals precisely with
problems of type (i), we will use the second interpretation as our starting point. We will
soon see however that both points of view yield the same conclusion.
Let us first reformulate (4.1.1). As we know, the tangent bundle of R3 is equipped
with a natural trivialization, and as such, it has a natural trivial connection ∇0 defined
by
∇0i ∂j = 0 ∀i, j, where ∂i := ∂xi , ∇i := ∇∂i ,
i.e., all the Christoffel symbols vanish. Moreover, if g0 denotes the Euclidean metric, then
∇0i g0 (∂j , ∂k ) = ∇0i δjk − g0 (∇0i ∂j , ∂k ) − g0 (∂j , ∇0i ∂k ) = 0,
i.e., the connection is compatible with the metric. Condition (4.1.1) can be rephrased as
T (∇) = 0, ∇g = 0.
The connection ∇ is usually called the Levi–Civita connection associated to the metric g.
1
g(∇i ∂j , ∂k ) = gkℓ Γℓij = (∂i gjk − ∂k gij + ∂j gik ) .
2
Above, the scalars Γℓij denote the Christoffel symbols of ∇ in these coordinates, i.e.,
∇∂i ∂j = Γℓij ∂ℓ .
1
Γℓij = gkℓ (∂i gjk − ∂k gij + ∂j gik ) . (4.1.4)
2
Existence. It boils down to showing that (4.1.3) indeed defines a connection with the
required properties. The routine but rather tedious details are left to the reader. ⊔
⊓
γ : (a, b) → M,
satisfying
∇γ̇(t) γ̇(t) = 0, (4.1.5)
where ∇ is the Levi–Civita connection. ⊔
⊓
Using local coordinates (x1 , ..., xn ) with respect to which the Christoffel symbols are
(Γkij ),
and the path γ is described by γ(t) = (x1 (t), ..., xn (t)), we can rewrite the geodesic
equation as a second order, nonlinear system of ordinary differential equations. Set
d
= γ̇(t) = ẋi ∂i .
dt
Then,
∇ d γ̇(t) = ẍi ∂i + ẋi ∇ d ∂i = ẍi ∂i + ẋi ẋj ∇j ∂i
dt dt
k
= ẍ ∂k + Γkji ẋi ẋj ∂k (Γkij = Γkji ),
4.1. METRIC PROPERTIES 147
Since the coefficients Γkij = Γkij (x) depend smoothly upon x, we can use the classical
Banach-Picard theorem on existence in initial value problems (see e.g. [6]). We deduce
the following local existence result.
Proposition 4.1.11. Let (M, g) be a Riemann manifold. For any compact subset K ⊂
T M there exists ε > 0 such that for any (x, X) ∈ K there exists a unique geodesic
γ = γx,X : (−ε, ε) → M such that γ(0) = x, γ̇(0) = X. ⊔
⊓
One can think of a geodesic as defining a path in the tangent bundle t 7→ (γ(t), γ̇(t)).
The above proposition shows that the geodesics define a local flow Φ on T M by
Definition 4.1.12. The local flow defined above is called the geodesic flow of the Riemann
manifold (M, g). When the geodesic glow is a global flow, i.e., any γx,X is defined at each
moment of time t for any (x, X) ∈ T M , then the Riemann manifold is called geodesically
complete. ⊔
⊓
Proposition 4.1.13 (Conservation of energy). If the path γ(t) is a geodesic, then the
length of γ̇(t) is independent of time. ⊔
⊓
Proof. We have
d d
|γ̇(t)|2 = g(γ̇(t), γ̇(t)) = 2g(∇γ̇(t) , γ̇(t)) = 0. ⊔
⊓
dt dt
Thus, if we consider the sphere bundles
Sr (M ) = {X ∈ T M ; |X| = r},
Example 4.1.15. Let G be a connected Lie group, and let LG be its Lie algebra. Any
X ∈ LG defines an endomorphism ad(X) of LG by
ad(X)Y := [X, Y ].
where the bracket in the right hand side is the usual commutator of two endomorphisms.
148 CHAPTER 4. RIEMANNIAN GEOMETRY
Assume that there exists an inner product h·, ·i on LG such that, for any X ∈ LG , the
operator ad(X) is skew-adjoint, i.e.,
We can now extend this inner product to a left invariant metric h on G. We want to
describe its geodesics.
First, we have to determine the associated Levi–Civita connection. Using (4.1.3) we
get
1
h(∇X Z, Y ) = {Xh(Y, Z) − Y (Z, X) + Zh(X, Y )
2
−h([X, Y ], Z) + h([Y, Z], X) − h([Z, X], Y )}.
If we take X, Y, Z ∈ LG , i.e., these vector fields are left invariant, then h(Y, Z) = const.,
h(Z, X) = const., h(X, Y ) = const. so that the first three terms in the above formula
vanish. We obtain the following equality (at 1 ∈ G)
1
h∇X Z, Y i = {−h[X, Y ], Zi + h[Y, Z], Xi − h[Z, X], Y i}.
2
Using the skew-symmetry of ad(X) and ad(Z) we deduce
1
h∇X Z, Y i = h[X, Z], Y i,
2
so that, at 1 ∈ G, we have
1
∇X Z = [X, Z] ∀X, Z ∈ LG . (4.1.8)
2
This formula correctly defines a connection since any X ∈ Vect (G) can be written as a
linear combination X
X= αi Xi αi ∈ C ∞ (G) Xi ∈ LG .
P
If γ(t) is a geodesic, we can write γ̇(t) = γi Xi , so that
X 1X
0 = ∇γ̇(t) γ̇(t) = γ̇i Xi + γi γj [Xi , Xj ].
2
i i,j
This means that γ is an integral curve of the left invariant vector field X so that the
geodesics through the origin with initial direction X ∈ T1 G are
γX (t) = exp(tX). ⊔
⊓
4.1. METRIC PROPERTIES 149
Exercise 4.1.16. Let G be a Lie group and h a bi-invariant metric on G. Prove that its
restriction to LG satisfies (4.1.7). In particular, on any compact Lie groups there exist
metrics satisfying (4.1.7). ⊔
⊓
Definition 4.1.17. Let L be a finite dimensional real Lie algebra. The Killing pairing or
form is the bilinear map
The Lie algebra L is said to be semisimple if the Killing pairing is a duality. A Lie group
G is called semisimple if its Lie algebra is semisimple. ⊔
⊓
Exercise 4.1.18. Prove that SO(n) and SU (n) and SL(n, R) are semisimple Lie groups,
but U (n) is not. ⊔
⊓
Exercise 4.1.19. Let G be a compact Lie group. Prove that the Killing form is positive
semi-definite1 and satisfies (4.1.7).
Hint: Use Exercise 4.1.16. ⊔
⊓
(Lexp( t Y ) )∗ (Rexp( t Y ) )∗ X. ⊔
⊓
2 2
Example 4.1.21. (Geodesics on flat tori, and on SU (2)). The n-dimensional torus
Tn ∼
= S 1 × · · · × S 1 is an Abelian, compact Lie group. If (θ 1 , ..., θ n ) are the natural angular
coordinates on T n , then the flat metric is defined by
g = (dθ 1 )2 + · · · + (dθ n )2 .
The metric g on T n is bi-invariant, and obviously, its restriction to the origin satisfies the
skew-symmetry condition (4.1.7) since the bracket is 0. The geodesics through 1 will be
the exponentials
γα1 ,...,αn (t) t 7→ (eiα1 t , ..., eiαn t ) αk ∈ R.
If the numbers αk are linearly dependent over Q, then obviously γα1 ,...,αn (t) is a closed
curve. On the contrary, when the α’s are linearly independent over Q then a classical
result of Kronecker (see e.g. [59]) states that the image of γα1 ,...,αn is dense in T n !!! (see
also Section 7.4 to come)
The special unitary group SU (2) can also be identified with the group of unit quater-
nions
a + bi + cj + dk ; a2 + b2 + c2 + d2 = 1 ,
so that SU (2) is diffeomorphic with the unit sphere S 3 ⊂ R4 . The round metric on S 3 is
bi-invariant with respect to left and right (unit) quaternionic multiplication (verify this),
1
The converse of the above exercise is also true, i.e., any semisimple Lie group with positive definite
Killing form is compact. This is known as Weyl’s theorem. Its proof, which will be given later in the book,
requires substantially more work.
150 CHAPTER 4. RIEMANNIAN GEOMETRY
and its restriction to (1, 0, 0, 0) satisfies (4.1.7). The geodesics of this metric are the 1-
parameter subgroups of S 3 , and we let the reader verify that these are in fact the great
circles of S 3 , i.e., the circles of maximal diameter on S 3 . Thus, all the geodesics on S 3 are
closed. ⊔
⊓
The tangent space Tq M is a Euclidean space, and we can define Dq (r) ⊂ Tq M , the open
“disk” of radius r centered at the origin. We have the following result.
Proposition 4.1.22. Let (M, g) and q ∈ M as above. Then there exists r > 0 such that
the exponential map
expq : Dq (r) → M
4.1. METRIC PROPERTIES 151
is a diffeomorphism onto. The supremum of all radii r with this property is denoted by
ρM (q). ⊔
⊓
Definition 4.1.23. The positive real number ρM (q) is called the injectivity radius of M
at q. The infimum
ρM = inf ρM (q)
q
is the identity Tq M → Tq M .
Proposition 4.1.22 follows immediately from the above lemma using the inverse func-
tion theorem. ⊔
⊓
Now choose an orthonormal frame (e1 , ..., en ) of Tq M , and denote by (x1 , ..., xn ) the
resulting cartesian coordinates in Tq M . For 0 < r < ρM (q), any point p ∈ expq (Dq (r))
can be uniquely written as
p = expq (xi ei ),
so that the collection (x1 , ..., xn ) provides a coordinatization of the open set expq (Dq (r)) ⊂
M . The coordinates thus obtained are called normal coordinates at q, the open set
expq (Dq (r)) is called a normal neighborhood, and will be denoted by Br (q) for reasons
that will become apparent a little later.
Proposition 4.1.25. Let (xi ) be normal coordinates at q ∈ M , and denote by g ij the
expression of the metric tensor in these coordinates. Then we have
∂g ij
gij (q) = δij and (q) = 0 ∀i, j, k.
∂xk
Thus, the normal coordinates provide a first order contact between g and the Euclidean
metric.
∂
Proof. By construction, the vectors ei = ∂x i form an orthonormal basis of Tq M and this
proves the first equality. To prove the second equality we need the following auxiliary
result.
Lemma 4.1.26. In normal coordinates (xi ) (at q) the Christoffel symbols Γijk vanish at
q.
152 CHAPTER 4. RIEMANNIAN GEOMETRY
Proof. For any (m1 , ..., mn ) ∈ Rn the curve t 7→ xi (t) 1≤i≤n , xi (t) = mi t, ∀i is the
P i ∂
geodesic t 7→ expq m t ∂xi so that
Γijk (x(t))mj mk = 0.
In particular,
Γijk (0)mj mk = 0 ∀mj ∈ Rn
from which we deduce the lemma. ⊔
⊓
so that,
∂
∇ ∂ = 0 at q, ∀i, j. (4.1.9)
∂xj ∂xi
∂g ij
Using ∇g = 0 we deduce ∂xk
(q) = ( ∂x∂ k g ij ) |q = 0. ⊔
⊓
The reader may ask whether we can go one step further, and find local coordinates
which produce a second order contact with the Euclidean metric. At this step we are in for
a big surprise. This thing is in general not possible and, in fact, there is a geometric way
of measuring the “second order distance” between an arbitrary metric and the Euclidean
metric. This is where the curvature of the Levi–Civita connection comes in, and we will
devote an entire section to this subject.
Proof. Using the smooth dependence upon initial data in ordinary differential equations
we deduce that there exists an open neighborhood V of (q, 0) ∈ T M such that expm X is
well defined for all (m, X) ∈ V . We get a smooth map
We compute the differential of F at (q, 0). First, using normal coordinates (xi ) near q we
get coordinates (xi ; X j ) near (q, 0) ∈ T M . The partial derivatives of F at (q, 0) are
∂ ∂ ∂ ∂ ∂
D(q,0) F ( )= + i
, D(q,0) F ( i
)= .
∂x i ∂x i
∂X ∂X ∂X i
Thus, the matrix defining D(q,0) F has the form
1 0
,
∗ 1
In particular, we deduce that, for any m ∈ Br (q), the map expm : Dε (m) ⊂ Tm M → M
is a diffeomorphism onto its image, and
Clearly, for any m ∈ M , the curve t 7→ expm (tX) is a geodesic of length < ε joining m to
expm (X). It is the unique geodesic with this property since F : V → U is injective. ⊔
⊓
Theorem 4.1.28. Let q, r and ε as in the previous lemma, and consider the unique
geodesic γ : [0, 1] → M of length < ε joining two points of Br (q). If ω : [0, 1] → M is a
piecewise smooth path with the same endpoints as γ then
Z 1 Z 1
|γ̇(t)|dt ≤ |ω̇(t)|dt.
0 0
with equality if and only if ω([0, 1]) = γ([0, 1]). Thus, γ is the shortest path joining its
endpoints.
The proof relies on two lemmata. Let m ∈ M be an arbitrary point, and assume
0 < R < ρM (m).
Lemma 4.1.29 (Gauss). In BR (m) ⊂ M , the geodesics through m are orthogonal to the
hypersurfaces
Σδ = expq (Sδ (0)) = expm (X); |X| = δ , 0 < δ < R.
154 CHAPTER 4. RIEMANNIAN GEOMETRY
s 7→ expm (sX(t)), 0 ≤ s ≤ R.
Set
∂
∂s := f∗ ∈ Tf (s,t) M.
∂s
Define ∂t similarly. The objects ∂s , and ∂t are sections of the restriction of T M to the
image of f . Using the normal coordinates on BR (m) we can think of ∂s as a vector field
on a region in Tm M , and as such we have the equality ∂s = X(t). We have to show
∇∂s ∂s = 0.
On the other hand, since [∂s , ∂t ] = 0, we deduce (using the symmetry of the Levi–Civita
connection)
1
h∂s , ∇∂s ∂t i = h∂s , ∇∂t ∂s i = ∂t |∂s |2 = 0,
2
since |∂s | = |X(t)| = 1. We conclude that the quantity h∂s , ∂t i is independent of s. For
s = 0 we have f (0, t) = expm (0) so that ∂t |s=0 = 0, and therefore
as needed. ⊔
⊓
Lemma 4.1.30. The length of the curve ω(t) is ≥ |ρ(b) − ρ(a)|. The equality holds if
and only if X(t) = const and ρ̇(t) ≥ 0. In other words, the shortest path joining two
concentrical shells Σδ is a radial geodesic.
Proof. Let f (ρ, t) := expm (ρX(t)), so that ω(t) = f (ρ(t), t). Then
∂f ∂f
ω̇ = ρ̇ + .
∂ρ ∂t
∂f ∂f
Since the vectors ∂ρ and ∂t are orthogonal, and since
∂f
= |X(t)| = 1
∂ρ
we get
2
∂f
|ω̇|2 = |ρ̇|2 + ≥ |ρ̇|2 .
∂t
∂f
The equality holds if and only if ∂t = 0, i.e. Ẋ = 0. Thus
Z b Z b
|ω̇|dt ≥ |ρ̇|dt ≥ |ρ(b) − ρ(a)|.
a a
Equality holds if and only if ρ(t) is monotone, and X(t) is constant. This completes the
proof of the lemma. ⊔
⊓
The proof of Theorem 4.1.28 is now immediate. Let m0 , m1 ∈ Br (q), and consider a
geodesic γ : [0, 1] → M of length < ε such that γ(i) = mi , i = 0, 1. We can write
Set R = |X|. Consider any other piecewise smooth path ω : [a, b] → M joining m0 to
m1 . For any δ > 0 this path must contain a portion joining the shell Σδ (m0 ) to the shell
ΣR (m0 ) and lying between them. By the previous lemma the length of this segment will
be ≥ R − δ. Letting δ → 0 we deduce
l(ω) ≥ R = l(γ).
If ω([a, b]) does not coincide with γ([0, 1]) then we obtain a strict inequality. ⊔
⊓
Any Riemann manifold has a natural structure of metric space. More precisely, we set
n o
d(p, q) = inf l(ω); ω : [0, 1] → M piecewise smooth path joining p to q
A piecewise smooth path ω connecting two points p, q such that l(ω) = d(p, q) is said to
be minimal. From Theorem 4.1.28 we deduce immediately the following consequence.
Corollary 4.1.31. The image of any minimal path coincides with the image of a geodesic.
In other words, any minimal path can be reparametrized such that it satisfies the geodesic
equation.
156 CHAPTER 4. RIEMANNIAN GEOMETRY
Theorem 4.1.28 also implies that any two nearby points can be joined by a unique
minimal geodesic. In particular we have the following consequence.
Corollary 4.1.33. Let q ∈ M . Then for all r > 0 sufficiently small
expq (Dr (0))( = Br (q) ) = p ∈ M d(p, q) < r . (4.1.10)
Proof. The same argument used in the proof of Theorem 4.1.28 shows that for any 0 <
r < ρM (q) the radial geodesics expq (tX) are minimal. ⊔
⊓
Proposition 4.1.36. For any q ∈ M there exists 0 < R < ιM (q) such that for any r < R
the ball Br (q) is convex.
Proof. Choose ax 0 < ε < 21 ρM (q), and 0 < R < ε such that any two points m0 , m1
in BR (q) can be joined by a unique minimal geodesic [0, 1] ∋ t 7→ γm0 ,m1 (t) of length
< ε, not necessarily contained in BR (q). We will prove that ∀m0 , m1 ∈ BR (q) the map
t 7→ d(q, γm0 ,m1 (t)) is convex and thus it achieves its maxima at the endpoints t = 0, 1.
Note that
d(q, γ(t)) < R + ε < ρM (q).
The geodesic γm0 ,m1 (t) can be uniquely expressed as
γm0 ,m1 (t) = expq (r(t)X(t)) X(t) ∈ Tq M with r(t) = d(q, γm0 ,m1 (t)).
d 2
2
It suffices to show dt2 (r ) ≥ 0 for t ∈ [0, 1] if d(q, m0 ) and d(q, m1 ) are sufficiently small.
At this moment it is convenient to use normal coordinates (xi ) near q. The geodesic
γm0 ,m1 takes the form (xi (t)), and we have
r 2 = (x1 )2 + · · · + (xn )2 .
We compute easily
d2 2
(r ) = 2r 2 (ẍ1 + · · · + ẍn ) + |ẋ|2 (4.1.11)
dt2
P
where ẋ(t) = ẋi ei ∈ Tq M . The path γ satisfies the equation
Since Γijk (0) = 0 (normal coordinates), we deduce that there exists a constant C > 0
(depending only on the magnitude of the second derivatives of the metric at q) such that
ẍi ≥ −C|x||ẋ|2 .
d2 2
2
(r ) ≥ 2|ẋ|2 1 − nC|x|3 . (4.1.12)
dt
If we chose from the very beginning
R + ε ≤ (nC)−1/3 ,
then, because along the geodesic we have |x| ≤ R + ε, the right-hand-side of (4.1.12) is
nonnegative. This establishes the convexity of t 7→ r 2 (t) and concludes the proof of the
proposition. ⊔
⊓
In the last result of this subsection we return to the concept of geodesic completeness.
We will see that this can be described in terms of the metric space structure alone.
Theorem 4.1.37 (Hopf-Rinow). Let M be a Riemann manifold and q ∈ M . The following
assertions are equivalent:
(a) expq is defined on all of Tq M .
(b) The closed and bounded (with respect to the metric structure) sets of M are compact.
(c) M is complete as a metric space.
(d) M is geodesically complete. S
(e) There exists a sequence of compact sets Kn ⊂ M , Kn ⊂ Kn+1 and n Kn = M such
that if pn 6∈ Kn then d(q, pn ) → ∞.
Moreover, on a (geodesically) complete manifold any two points can be joined by a
minimal geodesic. ⊔
⊓
Remark 4.1.39. (a) Every smooth manifold admits complete Riemann metrics. We refer
to [108] for a proof and more refined statements.
(b) On a complete manifold there could exist points (sufficiently far apart) which can be
joined by more than one minimal geodesic. Think for example of a manifold where there
exist closed geodesic, e.g., the tori T n . ⊔
⊓
Example 4.1.40. Let M = R3 with the Euclidean metric. A vector field V on M has
the form
∂ ∂ ∂
V =P +Q +R .
∂x ∂y ∂z
Then
W = LV = P dx + Qdy + Rdz.
If we think of V as a field of forces in the space, then W is the infinitesimal work of V . ⊔
⊓
ε : C ∞ (T ∗ M ⊗ Λk T ∗ M ) → C ∞ (Λk+1 T ∗ M )
ε(α ⊗ β) = α ∧ β, ∀α ∈ Ω1 (M ), β ∈ Ωk (M ).
Then the exterior derivative d is related to the Levi–Civita on Λk T ∗ M connection via the
equality d = ε ◦ ∇.
Proof. We will use a strategy useful in many other situations. Our discussion about normal
coordinates will payoff. Denote temporarily by D the operator ε ◦ ∇.
The equality d = D is a local statement, and it suffices to prove it in any coordinate
neighborhood. Choose (xi ) normal coordinates at an arbitrary point p ∈ M , and set
∂
∂i := ∂x i . Note that
X
D= dxi ∧ ∇i , ∇i = ∇∂i .
i
where as usual, for any ordered multi-index I: (1 ≤ i1 < · · · < ik ≤ n), we set
Thus, at p, X
Dω = dxi ∧ ∇i (ωI dxI )
I
X X
= dxi ∧ (∂j ωI dxI + ωI ∇i (dxI )) = dxi ∧ ∂j ωI = dω.
I I
Since the point p was chosen arbitrarily this completes the proof of Proposition 4.1.41. ⊔
⊓
4.1. METRIC PROPERTIES 159
Exercise 4.1.42. Show that for any k-form ω on the Riemann manifold (M, g) the exterior
derivative dω can be expressed by
k
X
dω(X0 , X1 , . . . , Xk ) = (−1)i (∇Xi ω)(X0 , . . . , X̂i , . . . , Xk ),
i=0
The Riemann metric defines a metric in any tensor bundle Tsr (M ) which we continue
to denote by g. Thus, given two tensor fields T1 , T2 of the same type (r, s) we can form
their pointwise scalar product
Using the orientation we can construct (using the results in subsection 2.2.4) a natural
volume form on M which we denote by dVg , and we call it the metric volume. This is the
positively oriented volume form of pointwise norm ≡ 1.
If (x1 , ..., xn ) are local coordinates such that dx1 ∧ · · · ∧ dxn is positively oriented, then
p
dVg = |g|dx1 ∧ · · · ∧ dxn ,
We consider each term separately. Note first that dVg (∂1 , ..., ∂n ) = (det(g ij ))1/2 , so that
is a linear combination of products in which each product has a factor of the form ∂k g ij |p .
Such a factor is zero since we are working in normal coordinates. Thus, the first term
in(4.1.14) is zero. The other terms are zero as well since in normal coordinates at p we
have the equality
∇X ∂i = ∇∂k ∂i = 0.
Proposition 11.3.3 is proved. ⊔
⊓
∗ : Ωk (M ) → Ωn−k (M ),
uniquely determined by
In particular, ∗1 = dVg .
Example 4.1.44. To any vector field F = P ∂x + Q∂y + R∂z on R3 we associated its
infinitesimal work
WF = L(F ) = P dx + Qdy + Rdz.
The infinitesimal energy flux of F is the 2-form
The exterior derivative of WF is the infinitesimal flux of the vector field curl F
= Φcurl F = ∗Wcurl F .
The divergence of F is the scalar defined as
div F = ∗d ∗ WF = ∗dΦF
Definition 4.1.45. (a) For any smooth function f on the Riemann manifold (M, g) we
denote by grad f , or gradg f , the vector field g-dual to the 1-form df . In other words
Proposition 4.1.47. Let X be a vector field on the oriented Riemann manifold (M, g),
and denote by α the 1-form dual to X. Then
(a) div X = tr (∇X), where we view ∇X as an element of C ∞ (End (T M )) via the iden-
tifications
∇X ∈ Ω1 (T M ) ∼
= C ∞ (T ∗ M ⊗ T M ) ∼
= C ∞ (End (T M )).
(b) div X = ∗d ∗ α.
(c) If (x1 , ..., xn ) are local coordinates such that dx1 ∧ · · · ∧ dxn is positively oriented, then
1 p
div X = p ∂i ( |g|X i ),
|g|
where X = X i ∂i .
The proof will rely on the following technical result which is interesting in its own. For
simplicity, we will denote the inner products by (•, •), instead of the more precise g(•, •).
Lemma 4.1.48. Denote by δ the operator
δ = ∗d∗ : Ωk (M ) → Ωk−1 (M ).
Let α be a (k − 1)-form and β a k-form such that at least one of them is compactly
supported. Then Z Z
νn,k
(dα, β)dVg = (−1) (α, δβ)dVg ,
M M
where νn,k = nk + n + 1.
Proof. We have
Z Z Z Z
(dα, β)dVg = dα ∧ ∗β = d(α ∧ ∗β) + (−1)k α ∧ d ∗ β.
M M M M
The first integral in the right-hand-side vanishes by the Stokes formula since α ∧ ∗β has
compact support. Since
d ∗ β ∈ Ωn−k+1 (M ) and ∗2 = (−1)(n−k+1)(k−1) on Ωn−k+1 (M )
we deduce Z Z
(dα, β)dVg = (−1)k+(n−k+1)(n−k) α ∧ ∗δβ.
M M
This establishes the assertion in the lemma since
(n − k + 1)(k − 1) + k ≡ νn,k (mod 2). ⊔
⊓
162 CHAPTER 4. RIEMANNIAN GEOMETRY
d∗ := (−1)νn,k δ = (−1)νn,k ∗ d ∗ . ⊔
⊓
Proof of the proposition. Set Ω := dVg , and let (X1 , ..., Xn ) be a local moving frame
of T M in a neighborhood of some point. Then
X
(LX Ω)(X1 , ..., Xn ) = X(Ω(X1 , ..., Xn )) − Ω(X1 , ..., [X, Xi ], ..., Xn ). (4.1.16)
i
Since ∇Ω = 0 we get
X
X · (Ω(X1 , ..., Xn )) = Ω(X1 , ..., ∇X Xi , ..., Xn ).
i
Over the neighborhood where the local moving frame is defined, we can find smooth
functions fij , such that
∇Xi X = fij Xj ⇒ tr (∇X) = fii .
Part (a) of the proposition follows after we substitute the above equality in (4.1.17).
Proof of (b) For any f ∈ C0∞ (M ) we have
We have thus proved that for any compactly supported function f we have the equality
Z Z Z
− f (div X)dVg = (Xf )dVg = df (X)dVg
M M M
Z Z
= (grad f, X)dVg = (df, α)dVg .
M M
Using Lemma 4.1.48 we deduce
Z Z
− f (div X)dVg = − f δαdVg ∀f ∈ C0∞ (M ).
M M
4.1. METRIC PROPERTIES 163
Exercise 4.1.50. Let (M, g) be a Riemann manifold and X ∈ Vect (M ). Show that the
following conditions on X are equivalent.
(a) LX g = 0.
(b) g(∇Y X, Z) + g(Y, ∇Z X) = 0 for all Y, Z ∈ Vect (M ).
(A vector field X satisfying the above equivalent conditions is called a Killing vector field).
⊔
⊓
Exercise 4.1.51. Consider a Killing vector field X on the oriented Riemann manifold
(M, g), and denote by η the 1-form dual to X. Show that δη = 0, i.e., div(X) = 0. ⊔
⊓
Definition 4.1.52. Let (M, g) be an oriented Riemann manifold (possibly with bound-
ary). For any k-forms α, β define
Z Z
hα, βi = hα, βiM = (α, β)dVg = α ∧ ∗β,
M M
whenever the integrals in the right-hand-side are finite. ⊔
⊓
Corollary 4.1.54 (Gauss). Let (M, g) be a compact, oriented Riemann manifold with
boundary, and X a vector field on M . Then
Z Z
div(X)dVg = (X, ~ν )dvg∂ ,
M ∂M
where g∂ = g |∂M .
Definition 4.1.56. Let (M, g) be an oriented Riemann manifold. The geometric Lapla-
cian is the linear operator ∆M : C ∞ (M ) → C ∞ (M ) defined by
∆M = d∗ df = − ∗ d ∗ df = − div(grad f ).
Using Proposition 4.1.47 we deduce that in local coordinates (x1 , ..., xn ), the geometer’s
Laplacian takes the form
1 p
∆M = − p ∂i |g|g ij ∂j ,
|g|
where (gij ) denotes as usual the matrix inverse to (gij ). Note that when g is the Euclidean
metric, then the geometers’ Laplacian is
∆0 = −(∂i2 + · · · + ∂n2 ),
Exercise 4.1.58. (a) Prove that the only harmonic functions on a compact oriented
Riemann manifold M are the constant ones. R
(b) If u, f ∈ C ∞ (M ) are such that ∆M u = f show that M f = 0. ⊔
⊓
Exercise 4.1.59. Denote by (u1 , . . . , un ) the coordinates on the round sphere S n ֒→ Rn+1
obtained via the stereographic projection from the south pole.
(a) Show that the round metric g0 on S n is given in these coordinates by
4
g0 = (du1 )2 + · · · + (dun )2 ,
1 + r2
where r 2 = (u1 )2 + · · · + (un )2 .
(b) Show that the n-dimensional “area” of S n is
Z
2π (n+1)/2
σn = dvg0 = ,
Sn Γ( n+1
2 )
and the classical Beta integrals (see [54], or [134], Chapter XII)
Z ∞
r n−1 (Γ(n/2))2
2 n
dr = . ⊔
⊓
0 (1 + r ) 2Γ(n)
Exercise 4.1.60. Consider the Killing form on su(2) (the Lie algebra of SU (2)) defined
by
hX, Y i = −tr X · Y.
(a) Show that the Killing form defines a bi-invariant metric on SU (2), and then compute
the volume of the group with respect to this metric. The group SU (2) is given the
orientation defined by e1 ∧ e2 ∧ e3 ∈ Λ3 su(2), where ei ∈ su(2) are the Pauli matrices
i 0 0 1 0 i
e1 = e2 = e3 =
0 −i −1 0 i 0
(b) Show that the trilinear form on su(2) defined by
R(g) = F (∇),
Theorem 4.2.2 (The symmetries of the curvature tensor). The Riemann curvature tensor
R satisfies the following identities (X, Y, Z, U, V ∈ Vect (M )).
4.
g(R(X, Y )U, V ) = g(R(U, V )X, Y ).
Proof. (a) It follows immediately from the definition of R as an End(T M )-valued skew-
symmetric bilinear map (X, Y ) 7→ R(X, Y ).
(b) This is a special case of Proposition 3.3.14 about the curvature of connections com-
patible with a metric.
(c) This is a local statement so we suffices to prove it holds on each coordinate patch. The
identity is also C ∞ (M )-multilinear so it suffices to prove it only in the special case when
the vector fields X, Y , Z pairwise commute. The 1st Bianchi identity is then equivalent
to
∇X ∇Y Z − ∇Y ∇X Z + ∇Z ∇X Y − ∇X ∇Z Y + ∇Y ∇Z X − ∇Z ∇Y X = 0.
The identity now follows from the symmetry of the connection: ∇X Y = ∇Y X etc.
(d) We will use the following algebraic lemma ([80], Chapter 5).
Lemma 4.2.3. Let R : E × E × E × E → R be a quadrilinear map on a real vector space
E. Define
S(X1 , X2 , X3 , X4 ) = R(X1 , X2 , X3 , X4 ) + R(X2 , X3 , X1 , X4 ) + R(X3 , X1 , X2 , X4 ).
If R satisfies the symmetry conditions
R(X1 , X2 , X3 , X4 ) = −R(X2 , X1 , X3 , X4 )
R(X1 , X2 , X3 , X4 ) = −R(X1 , X2 , X4 , X3 ),
then
R(X1 , X2 , X3 , X4 ) − R(X3 , X4 , X1 , X2 )
1n
= S(X1 , X2 , X3 , X4 ) − S(X2 , X3 , X4 , X1 )
2 o
− S(X3 , X4 , X1 , X2 ) + S(X4 , X3 , X1 , X2 ) .
The Riemann curvature 169
The Riemann curvature tensor is the source of many important invariants associated
to a Riemann manifold. We begin by presenting the simplest ones.
Definition 4.2.6. Let (M, g) be a Riemann manifold with curvature tensor R. Any two
vector fields X, Y on M define an endomorphism of T M by
U 7→ R(U, X)Y.
If (x1 , . . . , xn ) are local coordinates on M and the curvature R has the local expression
ℓ ) then the Ricci curvature has the local description
R = (Rkij
X
ℓ
Ric = (Ricij ) = Rjℓi .
ℓ
170 Riemannian geometry
The symmetries of the Riemann curvature imply that Ric is a symmetric (0,2)-tensor (as
the metric).
Definition 4.2.7. The scalar curvature s of a Riemann manifold is the trace of the Ricci
tensor. In local coordinates, the scalar curvature is described by
(X, X) (X, Y )
|X ∧ Y | = ,
(Y, X) (Y, Y )
which is non-zero since X and Y are linearly independent. (|X ∧ Y |1/2 measures the area
of the parallelogram in Tp M spanned by X and Y .)
If we choose local coordinates (xi ) near p than we set
Rijij
Kij (p) = Kp (∂xi , ∂xj ) = 2 .
gii gjj − gij
Remark 4.2.8. (a) The sectional curvature determines the Riemann tensor. For a proof
we refer to Thm. 3.8 in [48].
(b) Given a metric g on a smooth manifold M , and a constant λ > 0, we obtained a new,
rescaled metric gλ = λ2 g. A simple computation shows that the Christoffel symbols and
Riemann tensor of gλ are equal with the Christoffel symbols and the Riemann tensor of
the metric g. In particular, this implies
Ricgλ = Ricg .
In particular, the scalar curvature changes by the same factor upon rescaling the metric.
If we thing of the metric as a quantity measured in meter2 , then the sectional curvatures
are measured in meter−2 . ⊔
⊓
The Riemann curvature 171
According to the above exercise the quantity Kp (X, Y ) depends only upon the 2-plane
in Tp M generated by X and Y . Thus Kp is in fact a function on Gr2 (p) the Grassmannian
of 2-dimensional subspaces of Tp M .
Definition 4.2.10. The function Kp : Gr2 (p) → R defined above is called the sectional
curvature of M at p. ⊔
⊓
can be organized as a smooth fiber bundle over M with standard fiber Gr2 (Rn ), n =
dim M , such that, if M is a Riemann manifold, Gr2 (M ) ∋ (p; π) 7→ Kp (π) is a smooth
map. ⊔
⊓
4.2.2 Examples
Example 4.2.12. Consider again the situation discussed in Example 4.1.15. Thus, G is
a Lie group, and h•, •i is a metric on the Lie algebra LG satisfying
Denote the Killing form by κ(X, Y ) = −tr ad(X) ad(Y ) . To compute the Ricci curva-
ture we pick an orthonormal basis E1 , . . . , En of LG . For any X = X i Ei , Y = Y j Ej ∈ LG
we have
1
Ric (X, Y ) = tr (Z 7→ [[X, Z], Y ])
4
1X 1X
= h[[X, Ei ], Y ], Ei )i = − had(Y )[X, Ei ], Ei i
4 4
i i
1X 1X
= h[X, Ei ], [Y, Ei ]i = had(X)Ei , ad(Y )Ei i
4 4
i i
1X 1 1
=− had(Y ) ad(X)Ei , Ei i = − tr ad(Y ) ad(X) = κ(X, Y ).
4 4 4
i
In particular, if G is a compact semisimple group and the metric is given by the Killing
form then the scalar curvature is
1
s(κ) = dim G. ⊔
⊓
4
Remark 4.2.13. Many problems in topology lead to a slightly more general situation
than the one discussed in the above example namely to metrics on Lie groups which are
only left invariant. Although the results are not as “crisp” as in the bi-invariant case many
nice things do happen. For details we refer to [96]. ⊔
⊓
we deduce that the only nontrivial component of the Riemann tensor is R = R1212 . The
sectional curvature is simply a function on M
1 1
K= R1212 = s(g), where |g| = det(gij ).
|g| 2
In this case, the scalar K is known as the total curvature or the Gauss curvature of the
surface.
The Riemann curvature 173
In particular, if M is oriented, and the form dx1 ∧ dx2 defines the orientation, we can
construct a 2-form
1 1 1
ε(g) = Kdvg = s(g)dVg = p R1212 dx1 ∧ dx2 .
2π 4π 2π |g|
The 2-form ε(g) is called the Euler form associated to the metric g. We want to emphasize
that this form is defined only when M is oriented.
We can rewrite this using the pfaffian construction of Subsection 2.2.4. The curvature
R is a 2-form with coefficients in the bundle of skew-symmetric endomorphisms of T M so
we can write
1 0 R1212
R = A ⊗ dVg , A = p
|g| R2112 0
Assume for simplicity that (x1 , x2 ) are normal coordinates at a point q ∈ M . Thus at q,
|g| = 1 since ∂1 , ∂2 is an orthonormal basis of Tq M . Hence, at q, dVg = dx1 ∧ dx2 , and
1 1
ε(g) = g R(∂1 , ∂2 ∂2 , ∂1 )dx1 ∧ dx2 = R1212 dx1 ∧ dx2 .
2π 2π
Hence we can write
1 1
ε(g) = P f g (−A)dvg =: P f g (−R).
2π 2π
The Euler form has a very nice interpretation in terms of holonomy. Assume √ as before
√
that (x1 , x2 ) are normal coordinates at q, and consider the square St = [0, t] × [0, t]
in the (x1 , x2 ) plane. Denote the (counterclockwise) holonomy along ∂St by Tt . This is
an orthogonal transformation of Tq M , and with respect to the orthogonal basis (∂1 , ∂2 ) of
Tq M , it has a matrix description as
cos θ(t) − sin θ(t)
Tt = .
sin θ(t) cos θ(t)
The result in Subsection 3.3.4 can be rephrased as
sin θ(t) = −tg(R(∂1 , ∂2 )∂2 , ∂1 ) + O(t2 ),
so that
R1212 = θ̇(0).
Hence R1212 is simply the infinitesimal angle measuring the infinitesimal rotation suffered
by ∂1 along St . We can think of the Euler form as a “density” of holonomy since it
measures the holonomy per elementary parallelogram. ⊔
⊓
θ α (Xβ ) = δβα .
The 1-forms θ α measure the infinitesimal displacement of a point P with respect to the
frame (Xα ). Note that the T M -valued 1-form θ = θ α Xα is the differential of the identity
map 1 : Rn → Rn expressed using the given moving frame.
Introduce the 1-forms ωβα defined by
where we set
∂Xαi
dXα := dxj ⊗ ∂i .
∂xj
We can form the matrix valued 1-form ω = (ωβα ) which measures the infinitesimal rotation
suffered by the moving frame (Xα ) following the infinitesimal displacement x 7→ x + dx.
In particular, ω = (ωβα ) is a skew-symmetric matrix since
0 = dhXα , Xβ i = hω · Xα , Xβ i + hXα , ω · Xβ i.
Since θ = d1 we deduce
dθ α = θ β ∧ ωβα , or dθ = −ω ∧ θ. (4.2.5)
Above, in the last equality we interpret ω as a n × n matrix whose entries are 1-forms,
and θ as a column matrix, or a n × 1 matrix whose entries are 1-forms.
Using the equality d2 Xβ = 0 in (4.2.4) we deduce
The equations (4.2.5)–(4.2.6) are called the structural equations of the Euclidean space.
The significance of these structural equations will become evident in a little while. ⊔
⊓
Proof. Since the collection of two forms (θ α ∧ θ β )1≤α<β≤n defines a local frame of Λ2 T ∗ Rn ,
α , uniquely determined by the conditions
there exist functions gβγ
1 α β
dθ α = g θ ∧ θ γ , gβγ
α α
= −gγβ .
2 βγ
Uniqueness. Suppose that there exist forms ωβα satisfy the conditions (a)&(b) above. Then
α such that
there exist functions fβγ
α γ
ωβα = fβγ θ .
α 1 α β γ
fβγ = (gβγ + gγα − gαβ ) (4.2.7)
2
α θ γ , where the f ’s are given by (4.2.7). We let the reader check
Existence. Define ωβα = fβγ
α
that the forms ωβ satisfy both (a) and (b). ⊔
⊓
The reader may now ask why go through all this trouble. What have we gained by
constructing the forms ω, and after all, what is their significance?
To answer these questions, consider the Levi–Civita connection ∇. Define ω̂βα by
∇Xβ = ω̂βα Xα .
Hence
∇Xγ Xβ = ω̂βα (Xγ )Xα .
Since ∇ is compatible with the Riemann metric, we deduce in standard manner that
ω̂βα = −ω̂αβ .
The differential of θ α can be computed in terms of the Levi–Civita connection (see
Subsection 4.1.5), and we have
(use θ α (Xβ ) = δβα = const) = −θ α(ω̂γδ (Xβ )Xδ ) + θ α(ω̂βδ (Xγ )Xδ )
= ω̂βα (Xγ ) − ω̂γα (Xβ ) = (θ β ∧ ω̂βα )(Xβ , Xγ ).
Thus the ω̂’s satisfy both conditions (a) and (b) of Proposition 4.2.16 so that we must
have
ω̂βα = ωβα .
176 Riemannian geometry
In other words, the matrix valued 1-form (ωβα ) is the 1-form associated to the Levi–Civita
connection in this local moving frame. In particular, using the computation in Example
3.3.12 we deduce that the 2-form
Ω = (dω + ω ∧ ω)
H+ = {(x, y) ; y > 0}
1 1
dθ y = d( dy) = 0 = (− dx) ∧ θ x .
y y
Thus the connection 1-form in this local moving frame is
" #
0 − y1
ω= 1 dx.
y 0
Note that ω ∧ ω = 0. Using the structural equations (4.2.8) we deduce that the Riemann
curvature is " #
1
0 2 0 −1
Ω = dω = y
dy ∧ dx = θx ∧ θy .
− y12 0 1 0
(T M ) |S ∼
= T S ⊕ NS S.
The NS is called the normal bundle of S ֒→ M , and its is the orthogonal complement of
T S in (T M ) |S . Thus, a section of (T M ) |S , that is a vector field X of M along S, splits
into two components: a tangential component X τ , and a normal component, X ν .
Now choose a local orthonormal moving frame (X1 , ..., Xk ; Xk+1 , ..., Xm ) such that the
first k vectors (X1 , ..., Xk ) are tangent to S. Denote the dual coframe by (θ α )1≤α≤m . Note
that
θ α |S = 0 for α > k.
Denote by (µαβ ), (1 ≤ α, β ≤ m) the connection 1-forms associated to ∇M by this frame,
and let σβα , (1 ≤ α, β ≤ k) be the connection 1-forms of ∇S corresponding to the frame
(X1 , . . . , Xk ). We will analyze the structural equations of M restricted to S ֒→ M .
dθ α = θ β ∧ µαβ 1 ≤ α, β ≤ m. (4.2.9)
k
X
dθ α = θ β ∧ µαβ , µαβ = −µβα 1 ≤ α, β ≤ k.
β=1
σβα = µαβ 1 ≤ α, β ≤ k.
∇SX Y = (∇M
XY)
τ
∀X, Y ∈ Vect (S). (4.2.10)
B. k < α ≤ m. We deduce
k
X
0= θ β ∧ µαβ .
β=1
Lemma 4.2.19 (Cartan). Let V be a d-dimensional real vector space and consider p
linearly independent elements ω1 , ... , ωp ∈ Λ1 V , p ≤ d. If θ1 , ... , θp ∈ Λ1 V are such that
p
X
θi ∧ ωi = 0,
i=1
then there exist scalars Aij , 1 ≤ i, j ≤ p such that Aij = Aji and
p
X
θi = Aij ωj .
j=1
Proof. Extend the collection (ωi )1≤i≤p to a basis (ωk )1≤k≤d of Λ1 V . Then
d
X
θi = Aik ωk
k=1
and
X p X
X d
0= θi ∧ ωi = Aik ωk ∧ ωi
i i=1 k=1
X p X
X
= Aik − Aki ωk ∧ ωi + Aik ωk ∧ ωi .
1≤k<i≤p i=1 k>p
This proves that Aik = 0 for k > p and Aik = Aki for 1 ≤ i, k ≤ p. ⊔
⊓
λ , λ > k, 1 ≤ β, γ ≤ k satisfying
Using Cartan lemma we can find smooth functions fβγ
λ λ
fβγ = fγβ , and µλβ = fβγ
λ γ
θ .
Now define
λ β
N := fβγ θ ⊗ θ γ ⊗ Xλ .
We can view N as a symmetric bilinear map
If U, V ∈ Vect (S)
U = U β Xβ = θ β (U )Xβ 1 ≤ β ≤ k,
and
V = V γ Xγ = θ γ (V )Xγ 1 ≤ γ ≤ k,
then
X k
X X X
N(U, V ) = λ β
fβγ θ (U )θ γ (V ) X λ = µλβ (V )U β Xλ .
λ>k β,γ=1 λ>k β
The Riemann curvature 179
The second fundamental form can be used to determine a relationship between the curva-
ture of M and that of S. More precisely we have the following celebrated result.
We have
RM (X, Y )X = [∇M M M
X , ∇Y ]Z − ∇[X,Y ] Z
= ∇M
X ∇SY Z + N(Y, Z) − ∇M S S
Y ∇X Z + N(X, Z) − ∇[X,Y ] Z − N([X, Y ], Z).
Take the inner product with T of both sides above. Since N(•, •) is NS -valued, we deduce
using (4.2.10)-(4.2.12)
RM (X, Y )Z, T = ∇M S
X ∇Y Z, T + ∇M
X N(Y, Z), T
−h∇M S
Y ∇X Z, T − ∇M
Y N(X, Z), T − ∇S[X,Y ] Z, T
= [∇SX , ∇SY ]Z, T − N(Y, Z), N(X, T ) + N(X, Z), N(Y, T ) − ∇S[X,Y ] Z, T .
This is precisely the equality (4.2.13). ⊔
⊓
2
The first fundamental form is the induced metric.
180 Riemannian geometry
Nn (X, Y ) := N(X, Y ), n .
Nn (X, Y ) = − ∇M
X n, Y = − ∇M
Y n, X . (4.2.14)
Nn (X, Z) Nn (X, T )
RS (X, Y )Z, T = RM (X, Y )Z, T − .
Nn (Y, Z) Nn (Y, T )
Nn (X, T ) Nn (X, Z)
RS (X, Y )Z, T = . (4.2.15)
Nn (Y, T ) Nn (Y, Z)
Remark 4.2.20. (a) The equality (4.2.15) is a truly remarkable result. On the right-
hand-side we have an extrinsic term (it depends on the “space surrounding S”), while in
the left-hand-side we have a purely intrinsic term (which is defined entirely in terms of the
internal geometry of S). Historically, the extrinsic term was discovered first (by Gauss),
and very much to Gauss surprise (?!?) one does not need to look outside S to compute
it. This marked the beginning of a new era in geometry. It changed dramatically the way
people looked at manifolds and thus it fully deserves the name of The Golden (egregium)
Theorem of Geometry.
We can now explain rigorously why we cannot wrap a plane canvas around the sphere.
Notice that, when we deform a plane canvas, the only thing that changes is the extrinsic
geometry, while the intrinsic geometry is not changed since the lengths of the“fibers” stays
the same. Thus, any intrinsic quantity is invariant under “bending”. In particular, no
matter how we deform the plane canvas we will always get a surface with Gauss curvature
0 which cannot be wrapped on a surface of constant positive curvature! Gauss himself
called the total curvature a“bending invariant”. ⊔
⊓
S m−1 denotes the unit sphere in Rm . Observe that, for any p ∈ S, we have an equality of
subspaces in Rm
Tp S = Tn(p) S m−1 .
The second fundamental form N of S is symmetric bilinear form on Tp S. Using the metric
on Tp S we can identify it with a symmetric operator
The map G is called the Gauss map of the hypersurface S ⊂ Rm . The differential of the
Gauss map is known as the shape operator of the hypersurface. . We will have more to
say about the Gauss map in Section 9.2.1.
Suppose now that S is the sphere of radius r in Rm centered at the origin, and n is
the our unit normal vector field along S. In this case the Gauss map is
1
S ∋ x 7→ x ∈ S m−1 .
r
Thus
1
Dx G = 1T S .
r x
1
We deduce from (4.2.15) that the sectional curvature of S along any 2-plane is r2 . ⊔
⊓
QA = {u ∈ R3 ; Au, u = 1},
is nonempty and smooth (use the implicit function theorem to check this). Let u0 ∈ QA .
Then
Tu0 QA = {x ∈ R3 ; Au0 , x = 0} = (Au0 )⊥ .
QA is a transversally oriented hypersurface in R3 since the map QA ∋ u 7→ Au defines a
1
nowhere vanishing section of the normal bundle. Set n = |Au| Au.
3
Consider an orthonormal frame (e0 , e1 , e2 ) of R such that e0 = n(u0 ). Denote the
Cartesian coordinates in R3 with respect to this frame by (x0 , x1 , x2 ), and set ∂i := ∂xi .
Extend (e1 , e2 ) to a local moving frame of T QA near u0 .
The second fundamental form of QA at u0 is
Nn (∂i , ∂j ) = ∂i n, ∂j |u0 .
182 Riemannian geometry
We compute
Au −1/2 1
∂i n = ∂i = ∂i ( Au, Au )Au + A∂i u
|Au| |Au|
∂i Au, Au 1
=− Au + ∂i Au.
|Au|3/2 |Au|
Hence
1
Nn (∂i , ∂j ) |u0 = A∂i u, ej |u0
|Au0 |
1 1
= ∂i u, Aej |u0 = ei , Aej . (4.2.16)
|Au0 | |Au0 |
We can now compute the Gaussian curvature at u0 .
1 Ae1 , e1 Ae1 , e2
Ku0 = .
|Au0 |2 Ae2 , e1 Ae2 , e2
1
Ku = ∀|u| = r.
r2
Thus, the round sphere has constant positive curvature. ⊔
⊓
Let n be the unit section of NΣ defining the above orientation. The Gauss map
orientation Tq Σ = ∂1 ∧ ∂2 .
Consider the Euler form εΣ on Σ with the metric induced by the Euclidean metric in R3 .
Then, taking into account our orientation conventions, we have
Nn (∂1 , ∂1 ) Nn (∂1 , ∂2 )
2πεΣ (∂1 , ∂2 ) = R1212 = det . (4.2.17)
Nn (∂2 , ∂1 ) Nn (∂2 , ∂2 )
G∗ : Tq Σ → Tn(q) S 2
acts according to
∂i 7→ −Nn (∂i , ∂1 )∂1 − Nn (∂i , ∂2 )∂2 .
Hence
∗ −Nn (∂1 , ∂1 ) −Nn (∂1 , ∂2 )
G dvS 2 = (det G∗ )dvΣ = det dvΣ
−Nn (∂2 , ∂1 ) −Nn (∂2 , ∂2 )
Nn (∂1 , ∂1 ) Nn (∂1 , ∂2 )
= det dvΣ = 2πεΣ .
Nn (∂2 , ∂1 ) Nn (∂2 , ∂2 )
Hence
1 ∗
εΣ =
G dv 2 . (4.2.18)
2π Σ S
This is one form of the celebrated Gauss–Bonnet theorem. We will have more to say about
it in Subsection 4.2.6
Note that the last equality offers yet another interpretation of the Gauss curvature.
From this point of view the curvature is a “distortion factor”. The Gauss map “stretches”
an infinitesimal parallelogram to some infinitesimal region on the unit sphere. The Gauss
curvature describes by what factor the area of this parallelogram was changed. In Chap-
ter 9 we will investigate in greater detail the Gauss map of arbitrary submanifolds of a
Euclidean space. ⊔
⊓
∇S∂ i ∂ui = 0 at p0 .
u
In other words, at p0 , the vectors x′′ij := x′′ui uj (0) are normal to Tp0 S. Hence
If we define
K : S × S → R, K(p, q) := hp, qi,
184 Riemannian geometry
⊔
⊓
While the above description is local, we leave the reader to convince him/herself that the
resulting object is a well defined global section of T ∗ M0 ⊠ T ∗ M1 .
Definition 4.2.25. A correlator on the smooth manifold M is a smooth function
K :M ×M →R
⊔
⊓
V ∋ v 7→ dv(p) ∈ Tp∗ M
is onto. ⊔
⊓
The Riemann curvature 185
Ψ1 , . . . , Ψn : M → R
such that span Ψ1 , . . . , ΨN is an ample subspace of C ∞ (M ). Define K : M × M → R
N
X
K(p, q) = Ψi (p)Ψi (q), ∀p, q ∈ M.
i=1
where (−, −)RN denotes the canonical inner product in RN . The ampleness follows from
the fact that the natural map Tp∗ Rn → Tp∗ M is onto, ∀p ∈ M .
We will refer to the correlator constructed in this fashion as the correlator associated
to an embedding. More generally if F : M → Rn is an immersion, then the function
K : M × M → R, K(p, q) = F (p), F (q)
is also a correlator. ⊔
⊓
From the definition of a correlator we see that C defines a metric (−, −)C on E. Note
that when K is the correlator determined by an embedding, the metric defined by C is
none other than the induced metric.
186 Riemannian geometry
Sx,x : Ex → Ex∗
is an isomorphism. We set
−1
T (x, y) := Sx,x Sx,y ∈ Hom(Ey , Ex )
We will refer to T (x, y) as the tunneling map associated to the correlator K. Note that
Tx,x = 1Ex .
Fix a point p0 ∈ M and local coordinates (xi )1≤i≤m in a neighborhood O of p0 in
M . Suppose that e(x) = (eα (x))1≤α≤m is a local frame of E|O . We regard it as an
isomorphism of bundles Φ(e) : RrO → E|O . The tunnelings define a smooth map
If we denote by (eα (x)) the dual moving frame of E ∗ |O , then Cx,y is described by the
matrix Se (x, y) = sαβ (x, y) 1≤α,β≤m
X
Cx,y = sαβ (x, y)eα (x) ⊗ eβ (y),
α,β
If we choose the local frame e to be orthonormal with respect to the metric (−, −)C , then
i.e.,
∗
Ty,x = Tx,y ,
The Riemann curvature 187
Lemma 4.2.28. Let e be a frame of E|O orthonormal with respect to the correlator metric
(−, −)C . Then, for any i = 1, . . . , m, the operator
is skew-symmetric.
We deduce that
∂si Te (0, s) = ∂si Te (0, s)∗ = 0,
∂xi Te (0, s) = ∂z i Te (0, s) + ∂si Te (0, s) = ∂z i Te (0, s),
∗
∂xi Te (0, s) = ∂xi Te (0, s)∗ = −∂zi Te (0, s) + ∂si Te (0, s) = −∂xi Te (0, s).
⊔
⊓
Given a coordinate neighborhood with coordinates (xi ) and a local frame e defining
and isomorphism vector bundles Φ(e) : Rm
O → E|O as above, we define the endomorphisms
Γi (e) : Rm m
O → RO , i = 1, . . . , m = dim M,
(4.2.21)
Γi (e)y = −∂xi Te (x, y)|x=y .
P
We obtain a 1-form with matrix coefficients Γ(e) := i Γi (e)dy i . The operator
∇e = d + Γ(e) (4.2.22)
g : O → GLm (R), f = e · g.
Then
Tf (x, y) = g −1 (x)Te (x, y)g(y).
We denote by dx the differential with respect to the x variable. We deduce
Proof. It suffices to prove that the connection ∇C is symmetric, i.e., its torsion is 0. Once
we choose local coordinates (xi ) on O we have a natural frame of E|O
eα = ∂α := ∂xα .
With these choices Cx,y is represented by the matrix Se (x, y) with entries
We denote by sαβ (x) the inverse of (sαβ (x, x)). Then Γi (x) is an m × m matrix
Γi (x) = Γαβi (x) 1≤α,β≤m
.
Γi (e, x) = −∂xi Se (x, x)−1 Se (x, y)
x=y
= −∂xi Se (x, x)−1 Se (x, x) − Se (x, x)−1 ∂xi Se (x, y) y=x
where X
α α
Rij (e, x) = Rβij (e, x) 1≤α,β≤m
, Rβij eα = Rij eβ ,
α
is given by
Rij (e, y) = ∂xi Γj (e, x) − ∂xj Γi (e, x) + Γi (e, x), Γj (e, x) .
Using the local frame e we represent Se (x, y) as an m × m-matrix
Se (x, y) = sαβ (x, y) 1≤α,β≤m , sαβ (x, y) = Cx,y eα (x), eβ (y) .
∂xi Γj (e, x) = ∂xi Se (x, x)−1 ∂yj Se (x, y) y=x
= −Se (x, x)−1 ∂xi Se (x, x) Se (x, x)−1 ∂yj Se (x, y) x=y
+Se (x, x)−1 ∂xi ∂yj Se (x, y) x=y
= −Se (x, x)−1 ∂xi Se (x, y) + ∂yi Se (x, y) Se (x, x)−1 ∂yj Se (x, y) x=y
x=y
+Se (x, x)−1 ∂x2i yj Se (x, y) x=y + Se (x, x)−1 ∂y2i yj Se (x, y) x=y
.
∇C
∂ i ∂xα (p0 ) = 0, ∀i, α,
x
190 Riemannian geometry
then
∂yj Se (x, y) x=y=0
= Γi (e, 0) = 0,
so that
Rij (e, 0) = ∂x2i yj Se (x, y) x=y=0
− ∂x2j yi Se (x, y) x=y=0
and
γ
Rαβij = δαγ Rβij = ∂x4α xi yβ yj K(x, y) x=y=0
− ∂x4α xj yβ yi K(x, y) x=y=0
. (4.2.24)
K :M ×M →R
In other words, K(p, q) computes the covariance of the random variables Ψ(p) and Ψ(q).
Note that
" #
X X
K(p, q) = E Aα Ψα (p) Aβ Ψβ (q)
α β
XX X
= Ψα (p)Ψβ (p) E[ Aα Aβ ] = Ψα (q)Ψα (q).
| {z } α
α,β α,β
=δαβ
4
This implies that E Ai Aj = δij , 1 ≤ i, j ≤ N , where E X denotes the expectation of the random
variable X.
The Riemann curvature 191
In other words, K(p, q) coincides with what we dubbed the correlator determined by the
embedding M ֒→ RN .
Given two vector fields X, Y ∈ Vect (M ) we obtain two new random functions X · Ψ
and Y · Ψ. The metric g defined by the correlator K has the probabilistic description
gp (Xp , Yp ) = E XΨ(p) · Y Ψ(p) ].
To see this it suffices to work in local coordinates (xi ) near p such that xi (p) = 0 and
assume X = ∂xi , Y = ∂xj . Then
E (XΨ(p) · Y Ψ(p)
" ! !#
X X
=E Aα ∂xi Ψα (0) Aβ ∂xi Ψβ (0)
α α
X
= ∂xi Ψα (0)∂xj Ψβ (0)E[ Aα Aβ ]
α,β
X
= ∂xi Ψα (0)∂xj Ψα (0) = ∂x2i yk K(x, y) x=y=0
.
α
The curvature formula (4.2.24) also has a probabilistic interpretation, but it is a bit
more elaborate. To describe it we need to introduce an appropriate language. Given an
m-dimensional vector space V and integers 0 ≤ p, q ≤ m we set
Λp,q V ∗ := Λp V ∗ ⊗ Λq V ∗ .
Then
!
X X
B∧B= bik dxi ⊗ dxk ∧ bjℓ dxj ⊗ dxℓ
ik j,ℓ
X
= bik bjℓ dxi ∧ dx ⊗ dx ∧ dxℓ ∈ Λ2,2 V ∗ .
j k
i,j,k,ℓ
192 Riemannian geometry
The symmetry of the Levi–Civita connection implies that Hf is indeed a symmetric tensor.
If (xi ) are normal coordinates at a point p, then in these coordinates, the Hessian of f at
p is given by the familiar formula
X
Hfp = ∂x2i xj f (p)dxi ⊗ dxj ∈ Λ1,1 Tp∗ M.
i,j
Note that X
Hfp ∧ Hfp = ∂x2i xk f (0)∂x2j xℓ f (0)dxi ∧ dxj ⊗ dxk ∧ dxℓ . (4.2.25)
i,j,k,ℓ
described in (2.2.10).
Now observe that the Hessian at p of the random function Ψ is the random bilinear.
form X
p p
HΨ = Aα HΨ α
.
α
Example 4.2.31 (Reproducing kernels). We have seen that any immersion of a smooth
manifold into a Euclidean space produces a correlator on that manifold. It turns out that
all correlators satisfying a stronger positivity conditions are obtained in this fashion. This
follows using the concept of reproducing kernel Hilbert space that plays an important role
in varied areas of mathematics,[110].
Consider a smooth submanifold M ⊂ RN . It is equipped with a natural correlator
(4.2.20)
K : M × M → R, K(p, q) = (p, q). (4.2.27)
For any positive integer n and any p1 , . . . , pn ∈ M define the symmetric n × n matrix
K(p1 , p1 ) K(p1 , p2 ) · · · · · · K(p1 , pn )
K(p2 , p1 ) K(p2 , p2 ) · · · · · · K(p2 , pn )
CK (p1 , . . . , pn ) := .
.. .. .. .. ..
. . . . .
K(pn , p1 ) K(pn , p2 ) · · · · · · K(pn , pn )
Note that if K is defined by (4.2.27) then, for any n ∈ N and any p1 , . . . , pn ∈ M the
symmetric matrix CK (p1 , . . . , pn ) is positive semidefinite.
A correlator K on a smooth manifold M is called a reproducing kernel if for any
p1 , . . . , pn ∈ M the symmetric matrix CK (p1 , . . . , pn ) is a positive semidefinite. The
reason for this terminology stems from the following remarkable reproducing property.
For each q ∈ M define
Kq ∈ C ∞ (M ), ; Kq (p) = K(p, q)
Denote by W ⊂ C ∞ (M ) the linear span of the family
Kq ; q ∈ M
Then the following hold (see [110, Chap.2]).
1. If X X
u= ui Kq i ∈ W , v = vi Kqi ∈ W,
i∈I i∈I
then the quantity X
(u, v)K := ui vi K(q i , q j )
i,j∈I
is independent of the decomposition of u and v as linear combination of functions
Kq . Moreover, it defines an inner product on W . Denote by H the completion of
W with respect to this inner product.
194 Riemannian geometry
ev p : W → R, ev p (w) = w(p),
is continuous with continuous with respect to the norm on W induced by the inner
product (−, −)K and thus defines an element ev p in the dual H∗ of H.
Since K is smooth the above map is smooth. Moreover, the condition (ii) in Definition
4.2.26 of a correlator shows that this map is also an immersion. ⊔
⊓
where s(g) denotes the scalar curvature of (M, g). The functional g 7→ E(g) is called the
Hilbert-Einstein functional. ⊔
⊓
Lemma 4.2.34. Let M be a compact oriented manifold without boundary and g t = (gij
t )
In the above formula h·, ·it denotes the inner product induced by g t on the space of sym-
metric (0,2)-tensors while the dot denotes the t-derivative.
s(x)
Ricg = g,
n
where s(x) denotes the scalar curvature.
Example 4.2.36. Observe that if the Riemann metric g satisfies the condition
for some smooth function λ ∈ C ∞ (M ), then by taking the traces of both sides of the
above equality we deduce
s(g)(x)
λ(x) = , n := dim M.
n
Thus, the Riemann manifold is Einstein if and only if it satisfies (4.2.28).
Using the computations in Example 4.2.12 we deduce that a certain constant multiple
of the Killing metric on a compact semisimple Lie group is an Einstein metric.
We refer to [16] for an in depth study of the Einstein manifolds. ⊔
⊓
Exercise 4.2.37. Consider a 3-dimensional Riemann manifold (M, g). Show that
s
Rijkℓ = Eik gjℓ − Eiℓ gjk + Ejℓ gik − Ejk giℓ + (giℓ gjk − gik gjℓ ).
2
In particular, this shows that on a Riemann 3-manifold the full Riemann tensor is com-
pletely determined by the Einstein tensor. ⊔
⊓
196 Riemannian geometry
Notice that when (S, g) is a compact oriented Riemann surface two very nice things
happen.
(i) (S, g) is Einstein.
R (Recall that only R1212 is nontrivial).
(ii) E(g) = 2 S εg .
Theorem 4.2.32 is thus an immediate consequence of Lemma 4.2.34.
Proof of the lemma We will produce a very explicit description of the integrand
d d d
s(gt )dVgt = s(gt ) dVgt + s(gt ) dVgt (4.2.29)
dt dt dt
d
of dt E(g t ). We will adopt a “roll up your sleeves, and just do it” strategy reminiscent to
the good old days of the tensor calculus frenzy. By this we mean that we will work in a
nicely chosen collection of local coordinates, and we will keep track of the zillion indices
we will encounter. As we will see, the computations are not as hopeless as they may seem
to be.
We will study the integrand (4.2.29) at t = 0. The general case is entirely analogous.
For typographical reasons we will be forced to introduce new notations. Thus, ĝ will
denote (gt ) for t = 0, while g t will be denoted simply by g. A hat over a quantity means
we think of that quantity at t = 0, while a dot means differentiation with respect to t at
t = 0.
Let q be an arbitrary point on S, and denote by (x1 , . . . , xn ) a collection of ĝ-normal
coordinates at q. Denote by ∇ the Levi–Civita connection of g and let Γijk denote its
Christoffel symbols in the coordinates (xi ).
Many nice things happen at q, and we list a few of them which will be used later.
Set
h = (hij ) := (ġ) = (ġij ).
The tensor h is a symmetric (0, 2)-tensor. Its ĝ-trace is the scalar
The term ġ ij can be computed by derivating the equality gik gjk = δki at t = 0. We get
so that
ġij = −hij . (4.2.36)
To evaluate the derivatives Γ̇’s we use the known formulæ
1 km
Γm
ij = g (∂i gjk − ∂k gij + ∂j gik ) ,
2
which, upon differentiation at t = 0, yield
1 1
Γ̇m
ij = (∂i ĝjk − ∂k ĝij + ∂j ĝik ) + ĝkm (∂i hjk − ∂k hij + ∂j hik )
2 2 (4.2.37)
1
= (∂i hjm − ∂m hij + ∂j him ) .
2
We substitute (4.2.36) -(4.2.37) in (4.2.35), and we get, at q
X 1X 1X 2
ṡ = − hij R̂ij + (∂k ∂i hik − ∂k 2 hii + ∂k ∂i hik ) − (∂i hkk − ∂i ∂k hik )
2 2
i,j i,k i,k
X X X
=− hij R̂ij − ∂i 2 hkk + ∂i ∂k hik
i,j i,k i,k
X
d ġiĝ + ∆M,ĝ trĝ ġ +
= −hRic, ∂i ∂k hik . (4.2.38)
i,k
198 Riemannian geometry
where L−1 i
ĝ is the raising the indices operator defined by ĝ. In the local coordinates (x )
we have
ˆ = ĝ ij (∇
trĝ (∇h) ˆ i h)jk dxk .
Using (4.2.30), and (4.2.32) we deduce that the last term in (4.2.38) can be rewritten (at
q) as
ˆ = δtrĝ (∇
δtrĝ (∇h) ˆ ġ).
We have thus established that
d ġiĝ + ∆M,ĝ trĝ ġ + δtrĝ (∇
ṡ = −hRic, ˆ ġ). (4.2.39)
so that
1 d
dV̇g = ± |ĝ|−1/2 |g|dx1 ∧ · · · ∧ dxn .
2 dt
At q the metric is Euclidean, ĝij = δij , and
d X
|g| = ġii = |ĝ| · trĝ (ġ) = h ĝ, ġ iĝ |ĝ|.
dt
i
Hence
Z * ! + Z
1 ˆ ġ) dVĝ .
Ė(g) = s(ĝ)ĝ − Ricĝ , ġ dVĝ + ∆M,ĝ trĝ ġ + δtrĝ (∇
M 2 M
ĝ
Green’s formula shows the last two terms vanish and the proof of the Lemma is concluded.
⊔
⊓
Remark 4.2.40. According to the theorem we have just proved, the Euler characteristic is
independent of the metric used to define it. Hence, the Euler characteristic is a topological
invariant of the surface. The reason for this terminology will become apparent when we
discuss DeRham cohomology, a Z-graded vector space naturally associated to a surface
whose Euler characteristic coincides with the number defined above. So far we have no
idea whether χ(S) is even an integer. ⊔
⊓
Proposition 4.2.41.
χ(S 2 ) = 2 and χ(T 2 ) = 0.
Proof. To compute χ(S 2 ) we use the round metric g0 for which K = 1 so that
Z
2 1 1
χ(S ) = dvg0 = areag0 (S 2 ) = 2.
2π S2 2π
To compute the Euler characteristic of the torus we think of it as an Abelian Lie group with
a bi-invariant metric. Since the Lie bracket is trivial we deduce from the computations in
Subsection 4.2.2 that its curvature is zero. This concludes the proof of the proposition. ⊔⊓
Proposition 4.2.42. If Si (i=1,2) are two compact oriented surfaces without boundary
then
χ(S1 #S2 ) = χ(S1 ) + χ(S2 ) − 2.
k
X
χ(S1 # · · · #Sk ) = χ(Si ) − 2(k − 1),
i=1
for any compact oriented surfaces S1 , . . . , Sk . In terms of genera, the last equality can be
rephrased as
Xk
g(S1 # · · · #Sk ) = g(Si ).
i=1
In the proof of this proposition we will use special metrics on connected sums of surfaces
which require a preliminary analytical discussion.
Consider f : (−4, 4) → (0, ∞) a smooth, even function such that
S
f
- +
S S
f f
S
1
D1
S2
D
2
Set
Sf± := Sf ∩ {±x > 0}, Sf±1 := Sf ∩ {±x > 1}
Since f is even we deduce
Z Z
1
Kg dvg = Kg dvg = 2π. (4.2.40)
±
Sf 2 Sf
On the other hand, on the neck C = {|x| ≤ 2} the metric g is locally Euclidean g =
dx2 + dθ 2 , so that over this region Kg = 0. Hence
Z
Kg dvg = 0. (4.2.41)
C
Corollary 4.2.43 (Gauss–Bonnet). Let Σg denote the connected sum of g-tori. (By
definition Σ0 = S 2 . Then
Remark 4.2.44. It is a classical result that the only compact oriented surfaces are the
connected sums of g-tori (see [90]), so that the genus of a compact oriented surface is a
complete topological invariant. ⊔
⊓
Chapter 5
This is a very exciting subject lieing at the frontier between mathematics and physics. The
limited space we will devote to this subject will hardly do it justice, and we will barely
touch its physical significance. We recommend to anyone looking for an intellectual feast
the Chapter 16 in vol.2 of “The Feynmann Lectures on Physics” [47], which in our opinion
is the most eloquent argument for the raison d’être of the calculus of variations.
203
204 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
Example 5.1.2. To any Riemann manifold (M, g) one can naturally associate two la-
grangians L1 , L2 : T M → R defined by
and
1
L2 (v, q) = gq (v, v).
2
We see that the action defined by L1 coincides with the length of a path. The action
defined by L2 is called the energy of a path. ⊔
⊓
Before we present the main result of this subsection we need to introduce a bit of
notation.
Tangent bundles are very peculiar manifolds. Any collection (q 1 , . . . , q n ) of local co-
ordinates on a smooth manifold M automatically induces local coordinates on T M . Any
point in T M can be described by a pair (v, q), where q ∈ M , v ∈ Tq M . Furthermore, v
has a decomposition
∂
v = v i ∂i , where ∂i := i .
∂q
We set q̇ i := v i so that
v = q̇ i ∂i .
The collection (q̇ 1 , . . . , q̇ n ; q 1 , . . . , q n ) defines local coordinates on T M . These are said to
be holonomic local coordinates on T M . This will be the only type of local coordinates we
will ever use.
Theorem 5.1.3 (The least action principle). Let L : R × T M → R be a lagrangian, and
p0 , p1 ∈ M two fixed points. Suppose γ : [0, 1] → M is a smooth path such that the
following hold.
(i) γ(i) = pi , i = 0, 1.
More precisely, if (q̇ j , q i ) are holonomic local coordinates on T M such that γ(t) = (q i (t)),
and γ̇ = (q̇ j (t)), then γ is a solution of the system of nonlinear ordinary differential
equations
d ∂L ∂L
k
(t, q̇ j , q i ) = k (t, q̇ j , q i ), k = 1, . . . , n = dim M.
dt ∂ q̇ ∂q
To get a better feeling of these equations consider the special case discussed in Example
5.1.1
1
L = m|q̇|2 − U (q).
2
Then
∂ ∂
L = mq̇, L = −∇U (q),
∂ q̇ ∂q
and the Euler-Lagrange equations become
These are precisely Newton’s equation of the motion of a particle of mass m in the force
field −∇U generated by the potential U .
In the proof of the least action principle we will use the notion of variation of a path.
Definition 5.1.5. Let γ : [0, 1] → M be a smooth path. A variation of γ is a smooth
map
α = αs (t) : (−ε, ε) × [0, 1] → M,
such that α0 (t) = γ(t). If moreover, αs (i) = pi ∀s, i = 0, 1, then we say that α is a
variation rel endpoints. ⊔
⊓
so that
d
|s=0 SL (αs ) = 0.
ds
Assume for simplicity that the image of γ is entirely contained in some open coordinate
neighborhood U with coordinates (q 1 , . . . , q n ). Then, for very small |s|, we can write
dαs
αs (t) = (q i (s, t)) and = (q̇ i (s, t)).
dt
Following the tradition, we set
∂α ∂q i ∂ dαs ∂ q̇ j ∂
δα := |s=0 = ∂i δα̇ := |s=0 = .
∂s ∂s ∂s dt ∂s ∂ q̇ j
The quantity δα is a vector field along γ called infinitesimal variation (see Figure 5.1). In
fact, the pair (δα; δα̇) ∈ T (T M ) is a vector field along t 7→ (γ(t), γ̇(t)) ∈ T M . Note that
d
δα̇ = dt δα, and at endpoints δα = 0.
Exercise 5.1.6. Prove that if t 7→ X(t) ∈ Tγ(t) M is a smooth vector field along γ, such
that X(t) = 0 for t = 0, 1 then there exists at least one variation rel endpoints α such that
δα = X.
Hint: Use the exponential map of some Riemann metric on M . ⊔
⊓
206 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
γ
p
q
We compute (at s = 0)
Z 1 Z 1 Z 1
d d ∂L i ∂L j
0 = SL (αs ) = L(t, α˙s , αs ) = δα dt + δα̇ dt. (5.1.2)
ds ds 0 0 ∂q i 0 ∂ q̇ j s
Integrating by parts in the second term in the right-hand-side we deduce
Z 1
∂L d ∂L
i
− i
δαi dt. (5.1.3)
0 ∂q dt ∂ q̇
The last equality holds for any variation α. From Exercise 5.1.6 we deduce that it holds
for any vector field δαi ∂i along γ. At this point we use the following classical result of
analysis.
If f (t) is a continuous function on [0,1] such that
Z 1
f (t)g(t)dt = 0 ∀g ∈ C0∞ (0, 1),
0
Remark 5.1.7. (a) In the proof of the least action principle we used a simplifying assump-
tion, namely that the image of γ lies in a coordinate neighborhood. This is true locally,
and for the above arguments to work it suffices to choose only a special type of variations,
localized on small intervals of [0, 1]. In terms of infinitesimal variations this means we
need to look only at vector fields along γ supported in local coordinate neighborhoods.
We leave the reader fill in the details.
(b) The Euler-Lagrange equations were described using holonomic local coordinates.
The minimizers of the action, if any, are objects independent of any choice of local coordi-
nates, so that the Euler-Lagrange equations have to be independent of such choices. We
check this directly.
5.1. THE LEAST ACTION PRINCIPLE 207
If (xi ) is another collection of local coordinates on M and (ẋj , xi ) are the coordinates
induced on T M , then we have the transition rules
∂xj k
xi = xi (q 1 , . . . , q n ), ẋj = q̇ ,
∂q k
so that
∂ ∂xj ∂ ∂ 2 xj k ∂
= + q̇
∂q i ∂q i ∂xj ∂q k ∂q i ∂ ẋj
∂ ∂ ẋi ∂ ∂xj ∂
= =
∂ q̇ j ∂ q̇ j ∂ ẋj ∂q i ∂ ẋj
Then
∂L ∂xj ∂L ∂ 2 xj k ∂L
= + q̇ .
∂q i ∂q i ∂xj ∂q k ∂q i ∂ ẋj
d ∂L d ∂xj ∂L ∂ 2 xj k ∂L ∂xj d ∂L
= = k i q̇ + i .
dt ∂ q̇ i dt ∂q i ∂ ẋj ∂q ∂q ∂ ẋj ∂q dt ∂ ẋj
We now see that the Euler-Lagrange equations in the q-variables imply the Euler-Lagrange
in the x-variable, i.e., these equations are independent of coordinates.
The æsthetically conscious reader may object to the way we chose to present the
Euler-Lagrange equations. These are intrinsic equations we formulated in a coordinate
dependent fashion. Is there any way of writing these equation so that the intrinsic nature
is visible “on the nose”?
If the lagrangian L satisfies certain nondegeneracy conditions there are two ways of
achieving this goal. One method is to consider a natural nonlinear connection ∇L on T M
as in [100] . The Euler-Lagrange equations for an extremal γ(t) can then be rewritten as
a “geodesics equation”
∇Lγ̇ γ̇.
The example below will illustrate this approach on a very special case when L is the
lagrangian L2 defined in Example 5.1.2, in which the extremals will turn out to be precisely
the geodesics on a Riemann manifold.
Another far reaching method of globalizing the formulation of the Euler-Lagrange
equation is through the Legendre transform, which again requires a nondegeneracy condi-
tion on the lagrangian. Via the Legendre transform the Euler-Lagrange equations become
a system of first order equations on the cotangent bundle T ∗ M known as Hamilton equa-
tions.
These equations have the advantage that can be formulated on manifolds more general
than the cotangent bundles, namely on symplectic manifolds. These are manifolds carrying
a closed 2-form whose restriction to each tangent space defines a symplectic duality (see
Subsection 2.2.4.)
Much like the geodesics equations on a Riemann manifold, the Hamilton equations
carry a lot of information about the structure of symplectic manifolds, and they are
currently the focus of very intense research. For more details and examples we refer to
the monographs [7, 38]. ⊔
⊓
208 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
Example 5.1.8. Let (M, g) be a Riemann manifold. We will compute the Euler-Lagrange
equations for the lagrangians L1 , L2 in Example 5.1.2.
1
L2 (q̇, q) = gij (q)q̇ i q̇ j ,
2
so that
∂L2 ∂L2 1 ∂gij i j
= gjk q̇ j = q̇ q̇ .
∂ q̇ k ∂q k 2 ∂q k
The Euler-Lagrange equations are
∂gjk i j 1 ∂gij i j
q̈ j gjk + i
q̇ q̇ = q̇ q̇ . (5.1.4)
∂q 2 ∂q k
Since g km gjm = δjm , we get
m km ∂gjk 1 ∂gij
q̈ + g i
− q̇ i q̇ j = 0. (5.1.5)
∂q 2 ∂q k
When we derivate with respect to t the equality gik q̇ i = gjk q̇ j we deduce
km ∂gjk i j 1 km ∂gik ∂gjk
g q̇ q̇ = g + q̇ i q̇ j .
∂q i 2 ∂q j ∂q i
We substitute this equality in (5.1.5), and we get
m 1 km ∂gik ∂gjk ∂gij
q̈ + g + − q̇ i q̇ j = 0. (5.1.6)
2 ∂q j ∂q i ∂q k
The coefficient of q̇ i q̇ j in (5.1.6) is none other than the Christoffel symbol Γm
ij so this
equation is precisely the geodesic equation. ⊔
⊓
Example 5.1.9. Consider now the lagrangian L1 (q̇, q) = (gij q̇ i q̇ j )1/2 . Note that the
action Z p1
SL q(t) = L(q̇, q)dt,
p0
is independent of the parametrization t 7→ q(t) since it computes the length of the path
t 7→ q(t). Thus, when we express the Euler-Lagrange equations for a minimizer γ0 of
this action, we may as well assume it is parameterized by arclength, i.e., |γ̇0 | = 1. The
Euler-Lagrange equations for L1 are
∂g
ij i j
d gkj q̇ j ∂q k
q̇ q̇
p = p .
dt gij q̇ i q̇ j 2 gij q̇ i q̇ j
Along the extremal we have gij q̇ i q̇ j = 1 (arclength parametrization) so that the previous
equations can be rewritten as
d 1 ∂gij i j
gkj q̇ j = q̇ q̇ .
dt 2 ∂q k
We recognize here the equation (5.1.4) which, as we have seen, is the geodesic equation in
disguise. This fact almost explains why the geodesics are the shortest paths between two
nearby points. ⊔
⊓
5.1. THE LEAST ACTION PRINCIPLE 209
• The energy: H = pi q̇ i − L.
∂L
• The generalized force: F = ∂q i
.
quantum mechanics, and one can say it is a very important building block of quantum
physics in general. In the few instances of conservation laws where the symmetry was not
apparent the conservation was always “blamed” on a “hidden symmetry”. What is then
this Noether principle?
To answer this question we need to make some simple observations.
Let X be a vector field on a smooth manifold M defining a global flow Φs . This flow
induces a flow Ψs on the tangent bundle T M defined by
Ψs (v, x) = Φs∗ (v), Φs (x) .
L ◦ Ψs = L, ∀s. ⊔
⊓
LX L = 0.
Then
d
|s=0 q i (s) = X k δki .
ds
d
To compute ds |s=0 q̇ j (s) ∂q∂ j we use the definition of the Lie derivative on M
d ∂ ∂ ∂ q̇ j k ∂X
j ∂ i ∂X
j ∂
− q̇ j j = LX (q̇ i i ) = X k
− q̇ = − q̇ ,
ds ∂q ∂q ∂q k ∂q k ∂q j ∂q i ∂q j
∂ j ∂
k ∂X
X = Xi + q̇ .
∂q i ∂q k ∂ q̇ j
Corollary 5.1.13. The lagrangian L is X-invariant if and only if
∂L j
k ∂X ∂L
Xi + q̇ = 0. (5.1.7)
∂q i ∂q k ∂ q̇ j
⊔
⊓
5.1. THE LEAST ACTION PRINCIPLE 211
y
O
Theorem 5.1.14 (E. Noether). If the lagrangian L is X-invariant, then the quantity
∂L
PX = X i = X i pi
∂ q̇ i
Euler-Lagrange ∂X i k ∂L ∂L (5.1.7)
= k
q̇ i
+ X i i = 0. ⊔
⊓
∂q ∂ q̇ ∂q
The classical conservation-of-momentum law is a special consequence of Noether’s theo-
rem.
∂L
Corollary 5.1.15. Consider a lagrangian L = L(t, q̇, q, ) on Rn . If ∂q i
= 0, i.e., the i-th
dpi
component of the force is zero, then dt = 0 along any extremal, i.e., the i-th component
of the momentum is conserved).
∂
Proof. Take X = ∂q i
in Noether’s conservation law. ⊔
⊓
ds2 = dr 2 + dz 2 + r 2 dθ 2 .
212 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
We can choose (z, θ) as local coordinates on S, then the induced metric has the form
This fact can be given a nice geometric interpretation. Consider a geodesic γ(t) =
(z(t), θ(t)) on our surface of revolution S. The angle φ between the geodesic and the
parallels z = const can be determined from the equality
hγ̇, ∂θ i r 2 θ̇
cos φ = = ,
|γ̇| · |∂θ | r|γ̇|
i.e., r cos φ = r 2 θ̇|γ̇|−1 . The conservation of energy implies that |γ̇|2 = 2L = H is constant
along the geodesics. We deduce the following classical result.
Theorem 5.1.17 (Clairaut). On a a surface of revolution r = f (z) the quantity r cos φ
is constant along any geodesic, where φ ∈ (−π, π) is the oriented angle the geodesic makes
with the parallels z = const. ⊔
⊓
Exercise 5.1.18. Describe the geodesics on the round sphere S 2 , and on the cylinder
{x2 + y 2 = 1} ⊂ R3 . ⊔
⊓
lim V (t ± h)
hց0
exists for every t ∈ [0, 1], they are vectors V (t)± ∈ Tγ (t)M , and V (t)+ = V ( t)− for all
but finitely many t-s. The space of infinitesimal variations of γ is an infinite dimensional
linear space denoted by Tγ = Tγ Ωp,q .
Definition 5.2.1. Let γ ∈ Ωp,q . A variation of γ is a continuous map
such that
Fix γ ∈ Ωp,q , and let α be a variation of γ. The velocity γ̇(t) has a finite number of
discontinuities, so that the quantity
where ∇ denotes the Levi–Civita connection. (Note that the right-hand-side depends on
α only through δα so it is really a linear function on Tγ .)
214 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
∂αs
Proof. Set α̇s = ∂t . We differentiate under the integral sign using the equality
∂
|α̇s |2 = 2h∇ ∂ α̇s , α̇s i,
∂s ∂s
and we get Z 1
d
|s=0 E(αs ) = h∇ ∂ α̇s , α̇s i |s=0 dt.
ds 0 ∂s
∂ ∂
Since the vector fields ∂s and ∂t commute we have ∇ ∂ ∂α
∂t = ∇ ∂t∂
∂α
∂s .
∂s
Let 0 = t0 < t2 < · · · < tk = 1 be a partition of [0,1] as in Definition 5.2.1. Since
αs = γ for s = 0 we conclude
k Z
X ti
E∗ (δα) = h∇ ∂ δα, γ̇i.
∂t
i=1 ti−1
E∗ (V ) = 0, ∀V ∈ Tγ . ⊔
⊓
Remark 5.2.7. Note that, a priori, a critical path may have a discontinuous first deriva-
tive. The above corollary shows that this is not the case: the criticality also implies
smoothness. This is a manifestation of a more general analytical phenomenon called el-
liptic regularity. We will have more to say about it in Chapter 11. ⊔
⊓
Set U := (−ε, ε) × (−ε, ε) ⊂ R2 and γ := α0,0 . The map α is continuous, and has
second order derivatives everywhere except maybe on finitely many “coordinate” planes
∂α
si = const, or t = const. Set δi α := ∂s |
i (0,0)
, i = 1, 2. Note that δi α ∈ Tγ .
Exercise 5.2.8. Given V1 , V2 ∈ Tγ construct a 2-parameter variation α such that Vi = δi α.
⊔
⊓
∂ 2 E(αs1 ,s2 )
E∗∗ (δ1 α, δ2 α) := |(0,0) .
∂s1 ∂s2
Theorem 5.2.9 (The second variation formula).
X Z 1
∂
E∗∗ (δ1 α, δ2 α) = − hδ2 α, ∆t δ1 αi − hδ2 α, ∇2 δ1 α − R(γ̇, δ1 α)γ̇idt, (5.2.3)
t 0 ∂t
Hence
∂2E X X ∂α
=− h∇ ∂ δ2 α, ∆1 γ̇i − hδ2 α, ∇ ∂ ∆t i
∂s1 ∂s2 t
∂s1
t
∂s1 ∂t
Z 1 Z 1
∂α
− h∇ ∂ δ2 α, ∇ ∂ γ̇idt − hδ2 α, ∇ ∂ ∇ ∂ idt. (5.2.4)
∂s1 ∂t ∂s1 ∂t ∂t
0 0
Since γ is a geodesic, we have
∆t γ̇ = 0 and ∇ ∂ γ̇ = 0.
∂t
216 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
∂
Using the commutativity of ∂t with ∂s∂ 1 we deduce
∂α ∂α
∇ ∂ ∆t = ∆t ∇ ∂ = ∆t ∇ ∂ δ1 α .
∂s1 ∂t ∂s1 ∂t ∂t
Putting all the above together we deduce immediately the equality (5.2.3). ⊔
⊓
Corollary 5.2.10.
E∗∗ (V1 , V2 ) = E∗∗ (V2 , V1 ), ∀V1 , V2 ∈ Tγ . ⊔
⊓
Proposition 5.2.12. Let γ ∈ Ωp,q be a geodesic and (αs ) a geodesic variation of γ. Then
the infinitesimal variation δα satisfies the Jacobi equation
Proof.
∂α ∂α
∇2t δα = ∇t ∇t = ∇t ∇s
∂s ∂t
∂α ∂α ∂α
= ∇s ∇t + R(γ̇, δα) = R(γ̇, δα) . ⊔
⊓
∂t ∂t ∂t
Definition 5.2.13. A smooth vector field J along a geodesic γ is called a Jacobi field if
it satisfies the Jacobi equation
∇2t J = R(γ̇, J)γ̇. ⊔
⊓
Exercise 5.2.14. Show that if J is a Jacobi field along a geodesic γ, then there exists a
geodesic variation αs of γ such that J = δα. ⊔
⊓
E∗∗ (J, V ) = 0, ∀V ∈ Tγ .
(b) Show that a vector field J along γ which vanishes at endpoints, is a Jacobi field if any
only if E∗∗ (J, W ) = 0, for all vector fields W along γ. ⊔
⊓
5.2. THE VARIATIONAL THEORY OF GEODESICS 217
γ
N
Exercise 5.2.16. Let γ ∈ Ωp,q be a geodesic. Define Jp to be the space of Jacobi fields
V along γ such that V (p) = 0. Show that dim Jp = dim M , and moreover, the evaluation
map
evq : Jp → Tp M V 7→ ∇t V (p)
is a linear isomorphism. ⊔
⊓
Definition 5.2.17. Let γ(t) be a geodesic. Two points γ(t1 ) and γ(t2 ) on γ are said to
be conjugate along γ if there exists a nontrivial Jacobi field J along γ such that J(ti ) = 0,
i = 1, 2. ⊔
⊓
Example 5.2.18. Consider γ : [0, 2π] → S 2 a meridian on the round sphere connection
the poles. One can verify easily (using Clairaut’s theorem) that γ is a geodesic. The
counterclockwise rotation by an angle θ about the z-axis will produce a new meridian,
hence a new geodesic γθ . Thus (γθ ) is a geodesic variation of γ with fixed endpoints. δγ
is a Jacobi field vanishing at the poles. We conclude that the poles are conjugate along
any meridian (see Figure 5.3). ⊔
⊓
The following result (partially) explains the geometric significance of conjugate points.
We will prove that Vt is a Jacobi field along γ which contradicts the nondegeneracy of γ.
Step 1.
E∗∗ (U, U ) ≥ 0 ∀U ∈ Tγ . (5.2.5)
Indeed, let αs denote a variation of γ such that δα = U . One computes easily that
d2
E(αs2 ) = 2E∗∗ (U, U ).
ds2
Since γ is minimal for any small s we have length (αs2 ) ≥ length (α0 ) so that
Z 1 2
1 1 1
E(αs2 ) ≥ |α̇s2 |dt = length (αs2 )2 ≥ length (α0 )2
2 0 2 2
1
= length (γ)2 = E(α0 ).
2
Hence
d2
|s=0 E(αs2 ) ≥ 0.
ds2
This proves (5.2.5).
Step 2. E∗∗ (V, V ) = 0. This follows immediately from the second variation formula and
the fact that the nontrivial portion of V is a Jacobi field.
Step 3.
E∗∗ (U, V ) = 0, ∀U ∈ Tγ .
From (5.2.5) and Step 2 we deduce
fU′ (0) = 0.
Step 3 follows from the above equality using the bilinearity and the symmetry of E∗∗ . The
final conclusion (that V is a Jacobi field) follows from Exercise 5.2.15. ⊔
⊓
is discrete. ⊔
⊓
5.2. THE VARIATIONAL THEORY OF GEODESICS 219
Definition 5.2.22. Let γ ∈ Ωp,q be a geodesic. We define its index, denoted by ind (γ),
as the cardinality of the set
Cγ = t ∈ (0, 1) ; is conjugate to γ(0)
Proof. It suffices to show that for any geodesic γ : [0, 1] → M the point γ(1) is not
conjugated to γ(0).
Let Jt be a Jacobi field along γ vanishing at the endpoints. Thus
so that Z Z Z
1 1 1
h∇2t J, Jidt = hR(γ̇, J)γ̇, Jidt = − hR(J, γ̇)γ̇, Jidt.
0 0 0
This implies ∇t J = 0 which coupled with the condition J(0) = 0 implies J ≡ 0. The
proof is complete. ⊔
⊓
The notion of conjugacy is intimately related to the behavior of the exponential map.
Proof. Let q = expq0 v, v ∈ Tq0 M . Assume first that q is a critical value for expq0 , and v
is a critical point. Then Dv expq0 (X) = 0, for some X ∈ Tv (Tq0 M ). Let v(s) be a path
in Tq0 M such that v(0) = v and v̇(0) = X. The map (s, t) 7→ expq0 (tv(s)) is a geodesic
variation of the radial geodesic γv : t 7→ expq0 (tv). Hence, the vector field
∂
W = |s=0 expq0 (tv(s))
∂s
is a Jacobi field along γv . Obviously W (0) = 0, and moreover
∂
W (1) = |s=0 expq0 (v(s)) = Dv expq0 (X) = 0.
∂s
On the other hand this is a nontrivial field since
∂
∇t W = ∇s |s=0 expq0 (tv(s)) = ∇s v(s) |s=0 6= 0.
∂t
This proves q0 and q are conjugated along γv .
Conversely, assume v is not a critical point for expq0 . For any X ∈ Tv (Tq0 M ) denote
by JX the Jacobi field along γv such that
JX (q0 ) = 0. (5.2.7)
The existence of such a Jacobi field follows from Exercise 5.2.16. As in that exercise,
denote by Jq0 the space of Jacobi fields J along γv such that J(q0 ) = 0. The map
Tv (Tq0 M ) → Jq0 X 7→ JX
is a linear isomorphism. Thus, a Jacobi field along γv vanishing at both q0 and q must
have the form JX , where X ∈ Tv (Tq0 M ) satisfies Dv expq0 (X) = 0. Since v is not a critical
point, this means X = 0 so that JX ≡ 0. ⊔
⊓
We will see in the next chapter that this corollary has a lot to say about the topology
of M .
Consider now the following experiment. Stretch the round two-dimensional sphere of
radius 1 until it becomes “very long”. A possible shape one can obtain may look like
in Figure 5.4. The long tube is very similar to a piece of cylinder so that the total (=
scalar) curvature is very close to zero, in other words is very small. The lesson to learn
from this intuitive experiment is that the price we have to pay for lengthening the sphere
is decreasing the curvature. Equivalently, a highly curved surface cannot have a large
diameter. Our next result offers a more quantitative description of this phenomenon.
Theorem 5.2.26 (Myers). Let M be an n-dimensional complete Riemann manifold. If
for all X ∈ Vect (M )
(n − 1)
Ric (X, X) ≥ |X|2 ,
r2
5.2. THE VARIATIONAL THEORY OF GEODESICS 221
then every geodesic of length ≥ πr has conjugate points and thus is not minimal. Hence
Proof. Fix a minimal geodesic γ : [0, ℓ] → M of length ℓ, and let ei (t) be an orthonormal
basis of vector fields along γ such that en (t) = γ̇(t). Set q0 = γ(0), and q1 = γ(ℓ). Since
γ is minimal we deduce
E∗∗ (V, V ) ≥ 0 ∀V ∈ Tγ .
Set Wi = sin(πt/ℓ)ei . Then
Z ℓ
E∗∗ (Wi , Wi ) = − hWi , ∇t Wi + R(Wi , γ̇)γ̇idt
0
Z ℓ
= sin2 (πt/ℓ) π 2 /ℓ2 − hR(ei , γ̇)γ̇, ei i dt.
0
Hence, at least for some Wi , we have E∗∗ (Wi , Wi ) < 0. This contradicts the minimality
of γ. The proof is complete. ⊔
⊓
We already know that the Killing form of a compact, semisimple Lie group is positive
definite; see Exercise 4.1.19. The next result shows that the converse is also true.
Corollary 5.2.27. A semisimple Lie group G with positive definite Killing pairing is
compact.
Proof. The Killing form defines in this case a bi-invariant Riemann metric on G. Its
geodesics through the origin 1 ∈ G are the 1-parameter subgroups exp(tX) which are
defined for all t ∈ R. Hence, by Hopf-Rinow theorem G has to be complete.
On the other hand, we have computed the Ricci curvature of the Killing metric, and
we found
1
Ric (X, Y ) = κ(X, Y ) ∀X, Y ∈ LG .
4
The corollary now follows from Myers’ theorem. ⊔
⊓
Exercise 5.2.28. Let M be a Riemann manifold and q ∈ M . For the unitary vectors
X, Y ∈ Tq M consider the family of geodesics
Denote by Wt = δγs the associated Jacobi field along γ0 (t). Form f (t) = |Wt |2 . Prove the
following.
(a) Wt = DtX expq (Y ) = Frechet derivative of v 7→ expq (v).
(b) f (t) = t2 − 13 hR(Y, X)X, Y iq t4 + O(t5 ).
(c) Denote by xi a collection of normal coordinates at q. Prove that
1
gkℓ (x) = δkℓ − Rkijℓ xi xj + O(3).
3
1
det gij (x) = 1 − Rij xi xj + O(3).
3
(d) Let
Dr (q) = {x ∈ Tq M ; |x| ≤ r}.
Prove that if the Ricci curvature is negative definite at q then
for all r sufficiently small. Above, vol0 denotes the Euclidean volume in Tq M while volg
denotes the volume on the Riemann manifold M . ⊔
⊓
We want to investigate the partial derivatives of η at (p0 , p0 ). Using the above normal
coordinates we regard η as a function η = η(x, y) defined in an open neighborhood of
(0, 0) ∈ Rm × Rm . The next result is classical and it is the main tool that will help us find
the Taylor expansion of η(x, y) at (0, 0).
Theorem 5.2.30. Fix y ∈ U The function U ∋ x 7→ η(x, y) satisfies the Hamilton-Jacobi
equation.
∂η(x, y) ∂η(x, y)
gkℓ (x) = 4η(x, y), ∀x, y. (5.2.8)
∂xk ∂xℓ
Proof. Our approach is inspired from [114, p.167-170]. Denote by γx,y = γx,y (t) the unique
geodesic that connects x to y parametrized such that
Z 1
S(x, y) = distg (x, y) = |γ̇x,y (t)|dt.
0
where X
L(q, q̇) = gij (q)q̇ i q̇ j .
i,j
We now let the point x vary along an arbitrary smooth path x = x(s), |s| ≪ 1 in U . Set
x0 = x(0) ∈ U . We write
d
us (t) = γx(s),y (t), δu(t) = u (t)
s=0 s
ds
We have (using Einstein’s conventions)
Z 1
i∂η d d
δu (x0 , y) = S(x(s), y) = L(us , u̇s )ds
∂xi ds s=0 ds s=0 0
Z 1 Z 1
(5.1.2) ∂L i ∂L j
= δu dt + δu̇ dt
0 ∂q i
0 ∂ q̇ j s
(integrate by parts)
Z 1
∂L d ∂L ∂L ∂L
= − δui dt − i δui = − i δui
∂q i dt ∂ q̇ i ∂ q̇ ∂ q̇
|0 {z }
=0
224 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
∂S ∂L(x, ẋ)
i
=− , ∀i.
∂x ∂ ẋi
We set
1 ∂L 1
pi := i
= gik ẋk ,
2 ∂ ẋ 2
and we introduce the Hamiltonian
1 1
H(x, p) = gij pi pj = gij gik ẋk pj = δkj ẋk pj
2 2
1 j 1
= ẋ pj = gij ẋi ẋj = L(x, ẋ).
2 2
The quantity L(x, ẋ) is constant along the geodesic γx,y and, at the endpoint x, we have
1
η(x, y) = L(x, ẋ) = H(x, p) = H xi , ∂xj η(x, y) .
4
Expanding we get
1
η(x, y) = gij ∂xi η(x, y)∂xj η(x, y). ⊔
⊓
4
Besides their intrinsic interest, the Hamilton-Jacobi equations can be used to produce
an explicit description of the Taylor expansion for the square of the geodesic distance
η(x, y). This Taylor expansion will play an important part in Subsection 11.4.5.
With this goal in mind we introduce some notation. If f = f (t1 , . . . , tN ) is a smooth
function defined in a neighborhood of 0 ∈ RN and k is a nonnegative integer, then we
denote by [f ]k the degree k-homogeneous part in the Taylor expansion of f at 0, i.e.,
1 X α
[f ]k = ∂t f t=0
tα ∈ R[t1 , . . . , tN ].
k!
|α|=k
In the coordinates (xi ) the metric g has the form (using Einstein’s summation convention
throughout)
g = gij dxi dxj ,
where gij satisfy the estimates (see Exercise 5.2.28)
1
gkℓ = δkℓ − Rikjℓ (0)xi xj + O(|x|3 ). (5.2.9)
3
For any n × n matrix A that is small enough we have
(1 − A)−1 = 1 + A + A2 + · · · .
We deduce that
1
gkℓ = δkℓ + Rikjℓ (0)xi xj + O(|x|3 ). (5.2.10)
3
5.2. THE VARIATIONAL THEORY OF GEODESICS 225
The function η satisfies a Hamilton-Jacobi equation (5.2.8) and the obvious symmetry
conditions
m
X
η(x, y) = η(y, x), η(0, x) = η(x, 0) = |x|2 := (xi )2 , η(x, x) = 0. (5.2.11)
i=1
The degree 2 part of the Taylor expansion of η satisfies similar symmetries. This implies
m
X
[η]2 = |x − y|2 = (xi − y i )2 . (5.2.12)
i=1
ui = xi − y i , vj = xj + y j .
Then
1 1
xi = (ui + vi ), y j = (vj − uj ), ∂xi = ∂ui + ∂vi .
2 2
The equality (5.2.10) can be rewritten as
1 X
gkℓ (x) = δkℓ + Rikjℓ (ui + vi )(uj + vj ) + O(3). (5.2.13)
12
i,j
Note that
[Ak ]0 = [Aℓ ]0 = [gkℓ ]1 = 0, (5.2.17)
while (5.2.15) implies that
[Ak ]1 = 2uk .
We deduce
X X X
4[η]3 = [gkl ]0 [Ak ]1 [Aℓ ]2 + [Ak ]2 [Aℓ ]1 = 2[Ak ]2 [Ak ]1 = 4 uk [Ak ]2 .
k,ℓ k k
We can rewrite this last equality as a differential equation for [η]3 namely
X
[η]3 = uk (∂uk + ∂vk )[η]3 .
k
226 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
We deduce
P = 2P2 + Q3 + Q1 ,
where the polynomials Q3 and Q1 are odd in the variable u. Since P is even in the variable
u we deduce
Q3 + Q1 = 0,
so that P2 + P0 = P = 2P2 . Hence P2 = P0 = 0 and thus
[η]3 = 0. (5.2.18)
In particular
[Ak ]2 = 0, ∀k. (5.2.19)
Going back to (5.2.16) and using (5.2.17) and (5.2.19) we deduce
X X
4[η]4 = [g kℓ ]2 [Ak ]1 [Aℓ ]1 + [gkℓ ]0 [Ak ]1 [Aℓ ]3 + [Ak ]3 [Aℓ ]1
k,ℓ k,ℓ
X X (5.2.20)
=4 [gkℓ ]2 uk uℓ + 2 uk [Ak ]3 .
k,ℓ k
We have X X
uk ∂uk P2ν = 2νP2ν , ν = 0, 1, 2, uk ∂vk P4 = 0,
k k
X X X
uk ∂vk P2 = uk Qij (δki vj + δkj vi ) = Qkj uk vj + Qjk vj uk
k k,i,j k,j
This implies P0 = 0 so that P = P4 + P2 , and we can then rewrite the above equality as
X X
P2 = 2 [gkℓ ]2 uk uℓ + Qjk + Qkj uk vj . (5.2.21)
k,ℓ k,j
Therefore it suffices to determine P2 . This can be achieved using the equality (5.2.13) in
(5.2.21). We have
X 1 X
2 [g kℓ ]2 uk uℓ = Rikjℓ (ui + vi )(uj + vj )uk uℓ
6
k,ℓ i,j,k,ℓ
1 X X X
= Rikjℓ uk uℓ vi vj + Sj (u)vj ,
6
i,j k,ℓ j
| {z }
b ij (u)
Q
1b 1X
Qij (u) = Q ij (u) = Rikjℓ uk uℓ .
6 6
k,ℓ
This determines P2 .
1X b
P2 (u, v) = Qij (u)vi vj . (5.2.22)
6
i,j
228 CHAPTER 5. ELEMENTS OF THE CALCULUS OF VARIATIONS
Hence
K 2 2 K
P2 (u, v) = (u2 v1 + u21 v22 − 2u1 u2 v1 v2 ) = (u1 v2 − u2 v1 )2 .
6 6
In higher dimensions, if we restrict (5.2.24) to the surface
n o
Σij := x ∈ M ; xk = xℓ = 0, ∀k, ℓ 6∈ {i, j }
we deduce
Q̂ii Σij
= Kij u2j , Q̂jj Σij
= Kij u2i ,
Kij 2
Q̂ij Σij
= ui vj − uj vi .
6
⊔
⊓
Chapter 6
In the previous chapters we almost exclusively studied local properties of manifolds. This
study is interesting only if some additional structure is present since otherwise all manifolds
are locally alike.
We noticed an interesting phenomenon: the global “shape” (topology) of a manifold
restricts the types of structures that can exist on a manifold. For example, the Gauss–
Bonnet theorem implies that on a connected sum of two tori there cannot exist metrics
with curvature everywhere positive because the integral of the curvature is a negative
universal constant.
We used the Gauss–Bonnet theorem in the opposite direction, and we deduced the
intuitively obvious fact that a sphere is not diffeomorphic to a torus because they have
distinct genera. The Gauss–Bonnet theorem involves a heavy analytical machinery which
may obscure the intuition. Notice that S 2 has a remarkable property which distinguishes
it from T 2 : on the sphere any closed curve can be shrunk to a point while on the torus
there exist at least two “independent” unshrinkable curves (see Figure 6.1). In particular,
this means the sphere is not diffeomorphic to a torus.
This chapter will set the above observations on a rigorous foundation.
229
230 CHAPTER 6. THE FUNDAMENTAL GROUP AND COVERING SPACES
Example 6.1.2. The unit disk in the plane is contractible. The annulus {1 ≤ |z| ≤ 2} is
homotopy equivalent to the unit circle. ⊔
⊓
Note that a loop is more than a closed curve; it is a closed curve + a description of a
motion of a point around the closed curve.
Example 6.1.4. The two loops γ1 , γ2 : I → C, γk (t) = exp(2kπt), k = 1, 2 are different
though they have the same image. ⊔
⊓
Definition 6.1.5. (a) Let γ1 , γ2 be two loops based at x0 ∈ X. The product of γ1 and γ2
is the loop
γ1 (2s) , 0 ≤ s1/2
γ1 ∗ γ2 (s) = .
γ2 (2s − 1) , 1/2 ≤ s ≤ 1
The inverse of a based loop γ is the based loop γ − defined by
Intuitively, the product of two loops γ1 and γ2 is the loop obtained by first following
γ1 (twice faster), and then γ2 (twice faster).
The following result is left to the reader as an exercise.
Lemma 6.1.6. Let α0 ≃x0 α1 , β0 ≃x0 β1 and γ0 ≃x0 γ1 be three pairs of homotopic based
loops.. Then
(a) α0 ∗ β0 ≃x0 α1 ∗ β1 .
(b) α0 ∗ α−0 ≃x0 ex0 .
(c) α0 ∗ ex0 ≃x0 α0 .
(d) (α0 ∗ β0 ) ∗ γ0 ≃x0 α0 ∗ (β0 ∗ γ0 ). ⊔
⊓
Hence, the product operation descends to an operation “·” on Ω(X, x0 )/≃x0 , the set
of homotopy classes of based loops. Moreover the induced operation is associative, it has
a unit and each element has an inverse. Hence (Ω(X, x0 )/≃x0 , ·) is a group.
Definition 6.1.7. The group (Ω(X, x0 )/≃x0 , ·) is called the fundamental group (or the
Poincaré group) of the topological space X, and it is denoted by π1 (X, x0 ). The image of
a based loop γ in π1 (X, x0 ) is denoted by [γ]. ⊔
⊓
The elements of π1 (X, x0 ) are the “unshrinkable loops” discussed at the beginning of
this chapter.
The fundamental group π1 (X, x0 ) “sees” only the connected component of X which
contains x0 . To get more information about X one should study all the groups {π1 (X, x)}x∈X .
Proposition 6.1.8. Let X and Y be two topological spaces. Fix two points, x0 ∈ X and
y0 ∈ Y . Then any continuous map f : X → Y such that f (x0 ) = y0 induces a morphism
of groups
f∗ : π1 (X, x0 ) → π1 (Y, y0 ),
satisfying the following functoriality properties.
(a) (1X )∗ = 1π1 (X,x0 ) .
(b) If
f g
(X, x0 ) → (Y, y0 ) → (Z, z0 )
are continuous maps, such that f (x0 ) = y0 and g(y0 ) = z0 , then (g ◦ f )∗ = g∗ ◦ f∗ .
(c) Let f0 , f1 : (X, x0 ) → (Y, y0 ) be two base-point-preserving continuous maps. Assume f0
is homotopic to f1 rel x0 , i.e., there exists a continuous map F : I ×X → Y , (t, x) 7→ Ft (x)
such that, Fi (x) ≡ fi (x). for i = 0, 1 and Ft (x0 ) ≡ y0 . Then (f0 )∗ = (f1 )∗ .
Proof. Let γ ∈ Ω(X, x0 ), Then f (γ) ∈ Ω(Y, y0 ), and one can check immediately that
x −
0 α
x
γ 1
α
A priori, the fundamental group of a topological space X may change as the base point
varies and it almost certainly does if X has several connected components. However, if
X is connected, and thus path connected since it is locally so, all the fundamental groups
π1 (X, x), x ∈ X are isomorphic.
α∗ : π1 (X, x0 ) → π1 (X, x1 ),
Thus, the fundamental group of a connected space X is independent of the base point
modulo some isomorphism. We will write π1 (X, pt) to underscore this weak dependence
on the base point.
Corollary 6.1.11. Two homotopically equivalent connected spaces have isomorphic fun-
damental groups.
Definition 6.1.13. A connected space X such that π1 (X, pt) = {1} is said to be simply
connected. ⊔
⊓
Exercise 6.1.14. Prove that the spheres of dimension ≥ 2 are simply connected. ⊔
⊓
6.1. THE FUNDAMENTAL GROUP 233
Exercise 6.1.15. Let G be a connected Lie group. Define a new operation “⋆ ” on Ω(G, 1)
by
(α ⋆ β)(s) = α(s) · β(s),
where · denotes the group multiplication.
(a) Prove that α ⋆ β ≃1 α ∗ β.
(b) Prove that π1 (G, 1) is Abelian. ⊔
⊓
Exercise 6.1.16. Let E → X be a rank r complex vector bundle over the smooth manifold
X, and let ∇ be a flat connection on E, i.e., F (∇) = 0. Pick x0 ∈ X, and identify the fiber
Ex0 with Cr . For any continuous, piecewise smooth γ ∈ Ω(X, x0 ) denote by Tγ = Tγ (∇)
the parallel transport along γ, so that Tγ ∈ GL (Cr ).
(a) Prove that α≃x0 β ⇒ Tα = Tβ .
(b) Tβ∗α = Tα ◦ Tβ .
Thus, any flat connection induces a group morphism
Example 6.1.17. We want to compute the fundamental group of the complex projective
space CPn . More precisely, we want to show it is simply connected. We will establish this
by induction.
For n = 1, CP1 ∼ = S 2 , and by Exercise 6.1.14, the sphere S 2 is simply connected. We
k
next assume CP is simply connected for k < n and prove the same is true for n.
Notice first that the natural embedding Ck+1 ֒→ Cn+1 induces an embedding CPk ֒→
n
CP . More precisely, in terms of homogeneous coordinates this embedding is given by
Choose as base point pt = [1, 0, . . . , 0] ∈ CPn , and let γ ∈ Ω(CPn , pt). We may assume γ
avoids the point P = [0, . . . , 0, 1] since we can homotop it out of any neighborhood of P .
We now use a classical construction of projective geometry. We project γ from P to the
hyperplane H = CPn−1 ֒→ CPn . More precisely, if ζ = [z0 , . . . , zn ] ∈ CPn , we denote by
π(ζ) the intersection of the line P ζ with the hyperplane H . In homogeneous coordinates
Geometrically, πt flows the point ζ along the line P ζ until it reaches the hyperplane H .
Note that πt (ζ) = ζ, ∀t, and ∀ζ ∈ H . Clearly, πt is a homotopy rel pt connecting
γ = π0 (γ) to a loop γ1 in H ∼
= CPn−1 based at pt. Our induction hypothesis shows that
γ1 can be shrunk to pt inside H . This proves that CPn is simply connected. ⊔
⊓
234 CHAPTER 6. THE FUNDAMENTAL GROUP AND COVERING SPACES
(i) Ob(C) is a set whose elements are called the objects of the category.
(ii) Hom(C) is a family of sets Hom (X, Y ), one for each pair of objects X and Y . The
elements of Hom (X, Y ) are called the morphisms (or arrows) from X to Y .
(C1) For any object X, there exists a unique element 1X ∈ Hom(X, X) such that,
f ◦ 1X = f g ◦ 1X = g ∀f ∈ Hom(X, Y ), ∀g ∈ Hom(Z, X).
h ◦ (g ◦ f ) = (h ◦ g) ◦ f.
Example 6.1.18. • Top is the category of topological spaces. The objects are topological
spaces and the morphisms are the continuous maps. Here we have to be careful about one
foundational issue. Namely, the collection of all topological spaces is not a set. To avoid
this problem we need to restrict to topological spaces whose subjacent sets belong to a
certain Universe. For more about this foundational issue we refer to [74].
• (Top, ∗) is the category of marked topological spaces. The objects are pairs (X, ∗),
where X is a topological space, and ∗ is a distinguished point of X. The morphisms
f
(X, ∗) → (Y, ⋄)
Example 6.1.20. Let V be a real vector space. Then the operation of right tensoring
with V is a covariant functor
L L⊗1V
⊗V : R Vect → R Vect, U U ⊗ V, (U1 → U2 ) (U1 ⊗ V → U2 ⊗ V ).
On the other hand, the operation of taking the dual defines a contravariant functor,
∗ L Lt
: R Vect → R Vect, V V ∗ , (U → V ) (V ∗ → U ∗ ).
π1 : (Top, ∗) → Gr. ⊔
⊓
In Chapter 7 we will introduce other functors very important in geometry. For more
information about categories and functors we refer to [74, 89].
Example 6.2.2. Let D be a discrete set. Then, for any topological space X, the product
X × D is a covering of X with covering projection π(x, d) = x. This type of covering space
is said to be trivial. ⊔
⊓
Exercise 6.2.3. Show that a fibration with standard fiber a discrete space is a covering.
⊔
⊓
Exercise 6.2.5. Let (M, g) and (M̃ , g̃) be two Riemann manifolds of the same dimension
such that (M̃ , g̃) is complete. Let φ : M̃ → M be a surjective local isometry i.e. Φ is
smooth and
|v|g = |Dφ(v)|g̃ ∀v ∈ T M̃ .
Prove that φ is a covering map. ⊔
⊓
expq : Tq M → M
is a covering map.
Proof. The pull-back h = exp∗q (g) is a symmetric non-negative definite (0, 2)-tensor field
on Tq M . It is in fact positive definite since the map expq has no critical points due to the
non-positivity of the sectional curvature.
The lines t 7→ tv through the origin of Tq M are geodesics of h and they are defined
for all t ∈ R. By Hopf-Rinow theorem we deduce that (Tq M, h) is complete. The theorem
now follows from Exercise 6.2.5. ⊔
⊓
Exercise 6.2.7. Let G̃ and G be two Lie groups of the same dimension and φ : G̃ → G
a smooth, surjective group morphism. Prove that φ is a covering map. In particular, this
explains why exp : R → S 1 is a covering map. ⊔
⊓
S 3 = {q ∈ H ; |q| = 1}.
The linear space R3 can be identified with the space of purely imaginary quaternions
R3 = Im H = {xi + yj + zk}.
Tq : Im H → ImH x 7→ qxq −1
S 3 ∋ q 7→ Tq ∈ SO(3)
is a group morphism.
(c) Prove the above group morphism is a covering map. ⊔
⊓
6.2. COVERING SPACES 237
X1
[℄[ w X2
F
π 1 π 2
Exercise 6.2.12. (a) Prove that the natural projection S n → RPn is a covering map.
(b) Denote by UR n
1 the tautological (real) line bundle over RP . Using a metric on this
line bundle form the associated sphere bundle S(UR n
1 ) → RP . Prove that this fiber bundle
defines a covering space isomorphic with the one described in part (a). ⊔
⊓
π
F
⊔
⊓
f u
Z wY
π
Proposition 6.2.14 (Unique Path Lifting). Let X → Y be a covering map, γ : [0, 1] → Y
a path in Y and x0 a point in the fiber over y0 = γ(0), x0 ∈ π −1 (y0 ). Then there exists at
most one lift of γ, Γ : [0, 1] → Y such that Γ(0) = x0 .
Proof. We argue by contradiction. Assume there exist two such lifts, Γ1 , Γ2 : [0, 1] → X.
Set
S := t ∈ [0, 1]; Γ1 (t) = Γ2 (t) .
The set S is nonempty since 0 ∈ S. Obviously S is closed so it suffices to prove that it is also
open. We will prove that for every s ∈ S, there exists ε > 0 such that [s−ε, s+ε]∩[0, 1] ⊂ S.
For simplicity, we consider the case s = 0. The general situation is entirely similar.
238 CHAPTER 6. THE FUNDAMENTAL GROUP AND COVERING SPACES
We will prove that there exists r0 > 0 such that [0, r0 ] ⊂ S. Pick a small open
neighborhood U of x0 such that π restricts to a homeomorphism onto π(U ). There exists
r0 > 0 such that γi [0, r0 ] ⊂ U , i = 1, 2. Since π ◦ Γ1 = π ◦ Γ2 , we deduce Γ1 |[0,r0 ] =
Γ2 |[0,r0 ] . The proposition is proved. ⊔
⊓
π
Theorem 6.2.15. Let X → Y be a covering space, and f : Z → Y be a continuous map,
where Z is a connected space. Fix z0 ∈ Z, and x0 ∈ π −1 (y0 ), where y0 = f (z0 ). Then
there exists at most one lift F : Z → X of f such that F (z0 ) = x0 .
Proof. Use the previous theorem with f : {pt} → Y , f (pt) = γ(0), and ht (pt) = γ(t). ⊔
⊓
π
Corollary 6.2.18. Let X → Y be a covering space, and y0 ∈ Y . If γ0 , γ1 ∈ Ω(Y, y0 ) are
homotopic rel y0 , then any lifts Γ0 , Γ1 which start at the same point also end at the same
point, i.e. Γ0 (1) = Γ1 (1).
π −1 (y0 ) which connects Γ0 (1) to Γ1 (1). Since the fibers are discrete, this path must be
constant. ⊔
⊓
π
Let X → Y be a covering space, and y0 ∈ Y . Then, for every x ∈ π −1 (y0 ), and any
γ ∈ Ω( Y, y0 ), denote by Γx the unique lift of γ starting at x. Set
x · γ := Γx (1).
x · γ = x · ω.
Hence, x · γ depends only upon the equivalence class [γ] ∈ π1 (Y, y0 ). Clearly
and
x · ey0 = x,
so that the correspondence
defines a right action of π1 (Y, y0 ) on the fiber π −1 (y0 ). This action is called the monodromy
of the covering. The map x 7→ x · γ is called the monodromy along γ. Note that when Y
is simply connected, the monodromy is trivial. The map π induces a group morphism
Proof. Indeed, let γ ∈ Ω(X, x0 ) such that π(γ) is trivial in π1 (Y, y0 ). The homotopy
connecting π(γ) to ey0 lifts to a homotopy connecting γ to the unique lift of ey0 at x0 ,
which is ex0 . ⊔
⊓
Proof. Necessity. If F is such a lift then, using the functoriality of the fundamental
group construction, we deduce f∗ = π∗ ◦ F∗ . This implies the inclusion (6.2.1).
Sufficiency. For any z ∈ Z, choose a path γz from z0 to z. Then αz = f (γz ) is a
path from y0 to y = f (z). Denote by Az the unique lift of αz starting at x0 , and set
F (z) = Az (1). We claim that F is a well defined map.
240 CHAPTER 6. THE FUNDAMENTAL GROUP AND COVERING SPACES
This proves that F is a well defined map. We still have to show that this map is also
continuous.
Pick z ∈ Z. Since f is continuous, for every arbitrarily small, evenly covered neigh-
borhood U of f (z) ∈ Y there exists a path connected neighborhood V of z ∈ Z such that
f (V ) ⊂ U . For any ζ ∈ V pick a path σ = σζ in V connecting z to ζ. Let ω denote
the path ω = γz ∗ σζ (go from z0 to z along γz , and then from z to ζ along σζ ). Then
F (ζ) = Ω(1), where Ω is the unique lift of f (ω) starting at x0 . Since (f (ζ) ∈ U , we deduce
that Ω(1) belongs to the local sheet Σ, containing F (z), which homeomorphically covers
U . We have thus proved z ∈ V ⊂ F −1 (Σ). The proof is complete since the local sheets Σ
form a basis of neighborhoods of F (z). ⊔
⊓
π
Definition 6.2.21. Let Y be a connected space. A covering space X → Y is said to be
universal if X is simply connected. ⊔
⊓
pi
Corollary 6.2.22. Let X1 → Y (i=0,1) be two covering spaces of Y . Fix xi ∈ Xi such
that p0 (x0 ) = p(x1 ) = y0 ∈ Y . If X0 is universal, then there exists a unique covering
morphism F : X0 → X1 such that F (x0 ) = x1 .
Corollary 6.2.23. Every space admits at most one universal covering space (up to iso-
morphism).
Theorem 6.2.24. Let Y be a connected, locally path connected space such that each of
its points admits a simply connected neighborhood. Then Y admits an (essentially unique)
universal covering space.
Sketch of proof. Assume for simplicity that Y is a metric space. Fix y0 ∈ Y . Let Py0
denote the collection of continuous paths in Y starting at y0 . It can be topologized using
the metric of uniform convergence in Y . Two paths in Py0 are said to be homotopic rel
endpoints if they we can deform one to the other while keeping the endpoints fixed. This
defines an equivalence relation on Py0 . We denote the space of equivalence classes by Ỹ ,
and we endow it with the quotient topology. Define p : Ỹ → Y by
Example 6.2.27. Let (M, g) be a complete Riemann manifold with non-positive sectional
curvature. By Cartan-Hadamard theorem, the exponential map expq : Tq M → M is a
covering map. Thus, the universal cover of such a manifold is a linear space of the same
dimension. In particular, the universal covering space is contractible!!!
We now have another explanation why exp : Rn → T n is a universal covering space of
the torus: the sectional curvature of the (flat) torus is zero. ⊔
⊓
The map Ψ is clearly a group morphism. (Think monodromy!) The injectivity and the
surjectivity of Ψ are consequences of the lifting properties of the universal cover. ⊔
⊓
Corollary 6.2.31. If the space X has a compact universal cover, then π1 (X, pt) is finite.
Proof. Indeed, the fibers of the universal cover have to be both discrete and compact.
⊔
Hence they must be finite. The map Ev in the above proof is a bijection onto Deck (X̃).⊓
Corollary 6.2.32 (H. Weyl). The fundamental group of a compact semisimple group is
finite.
Proof. Indeed, we deduce from Example 6.2.29 that the universal cover of such a group is
compact. ⊔
⊓
Exercise 6.2.35. Show that the natural inclusion U (n − 1) ֒→ U (n) induces an isomor-
phism between the fundamental groups. Conclude that the map
det : U (n) → S 1
induces an isomorphism
π1 (U (n), 1) ∼
= π1 (S 1 , 1) ∼
= Z. ⊔
⊓
Chapter 7
Cohomology
∂u ∂u
= P, = Q. (7.1.1)
∂x ∂y
As is, the formulation is still ambiguous since we have not specified the domains of the
functions u, P and Q. As it will turn out, this aspect has an incredible relevance in
geometry.
Equation (7.1.1) has another interesting feature: it is overdetermined, i.e., it imposes
too many conditions on too few unknowns. It is therefore quite natural to impose some
additional restrictions on the data P , Q just like the zero determinant condition when
solving overdetermined linear systems.
To see what restrictions one should add it is convenient to introduce the 1-form α =
P dx + Qdy. The equality (7.1.1) can be rewritten as
du = α. (7.1.2)
If (7.1.2) has at least one solution u, then 0 = d2 u = dα, so that a necessary condition for
existence is
dα = 0, (7.1.3)
i.e.,
∂P ∂Q
= .
∂y ∂x
A form satisfying (7.1.3) is said to be closed. Thus, if the equation du = α has a solution
then α is necessarily closed. Is the converse also true?
Let us introduce a bit more terminology. A form α such that the equation (7.1.2) has
a solution is said to be exact. The motivation for this terminology comes from the fact
243
244 CHAPTER 7. COHOMOLOGY
that sometimes the differential form du is called the exact differential of u. We thus have
an inclusion of vector spaces
Proof. The necessity is clear. We prove the sufficiency. We may as well assume that 0 ∈ C.
Consider the radial vector field on C
~r = xi ∂xi ,
and denote by Φt the flow it generates. More explicitly, Φt is the linear flow
Φt (x) = et x, x ∈ Rn .
The flow lines of Φt are half-lines, and since C is convex, for every x ∈ C, and any t ≤ 0
we have Φt (x) ∈ C.
We begin with an a priori study of (7.1.4). Let u satisfy du = α. Using the homotopy
formula,
L~r = di~r + i~r d,
we get
dL~r u = d(di~r + i~r d)u = di~r α ⇒ d(L~r u − i~r α) = 0.
This suggests looking for solutions ϕ of the equation
so that X
L~r ω = (L~r ωI )dxI = 0.
I
7.1. DERHAM COHOMOLOGY 245
We deduce that L~r ωI = 0, and consequently, the coefficients ωI are constants along the
flow lines of Φt which all converge at 0. Thus,
ωI = cI = const.
Each monomial cI dxI is exact, i.e., there exist ηI ∈ Ωk−1 (C) such that dηI = cI dxI . For
example, when I = 1 < 2 < · · · < k
Thus, the equality L~r ω = 0 implies ω is exact. Hence there exists η ∈ Ωk−1 (C) such that
d(ϕ − η) = α,
i.e., the differential form u := ϕ − η solves (7.1.4). Conclusion: any solution of (7.1.5)
produces a solution of (7.1.4).
We now proceed to solve (7.1.5), and to this aim, we use the flow Φt .Define
Z 0
u= (Φt )∗ (i~r α)dt. (7.1.6)
−∞
Φt (C) ⊂ C ∀t ≤ 0,
then
X Z 0
u(x) = ηIt (x)dt dxI . (7.1.7)
I −∞
Then !
X
(Φt )∗ (i~r α) = et i~r αJ (et x)dxJ ,
J
so that
|ηIt (x)| ≤ Cet |x|A(x) ∀t ≤ 0.
246 CHAPTER 7. COHOMOLOGY
The local solvability does not in any way implies global solvability. Something happens
when one tries to go from local to global.
Example 7.1.2. Consider the form dθ on R2 \{0} where (r, θ) denote the polar coordinates
in the punctured plane. To write it in cartesian coordinates (x, y) we use the equality
y
tan θ =
x
so that
y dy y2 −ydx + xdy
(1 + tan2 θ)dθ = − 2 dx + and (1 + 2 )dθ = ,
x x x x2
i.e.,
−ydx + xdy
dθ = = α.
x2 + y 2
Obviously, dα = d2 θ = 0 on R2 \ {0} so that α is closed on the punctured plane. Can we
find a smooth function u on R2 \ {0} such that du = α?
We know that we can always do this locally. However, we cannot achieve this globally.
Indeed, if this was possible, then
Z Z Z
du = α= dθ = 2π.
S1 S1 S1
The artifice in the previous example simply increases the mystery. It is still not clear
what makes it impossible to patch-up local solutions. The next subsection describes two
ways to deal with this issue.
7.1. DERHAM COHOMOLOGY 247
The Čech approach. Consider a closed 1-form ω on a smooth manifold. To solve the
equation du = ω, u ∈ C ∞ (M ) we first cover M by open sets (Uα ) which are geodesically
convex with respect to some fixed Riemann metric.
Poincaré lemma shows that we can solve du = ω on each open set Uα so that we can
find a smooth function fα ∈ C ∞ (Uα ) such that dfα = ω. We get a global solution if and
only if
fαβ = fα − fβ = 0 on each Uαβ = Uα ∩ Uβ 6= ∅.
For fixed α, the solutions of the equation du = ω on Uα differ only by additive constants,
i.e., closed 0-forms.
The quantities fαβ satisfy dfαβ = 0 on the (connected) overlaps Uαβ so they are con-
stants. Clearly they satisfy the conditions
fαβ → fαβ + cα − cβ .
We can start the alteration at some open set Uα , and work our way up from one such
open set to its neighbors, always trying to implement (7.1.9). It may happen that in the
process we might have to return to an open set whose solution was already altered. Now
we are in trouble. (Try this on S 1 , and ω = dθ.) After several attempts one can point the
finger to the culprit: the global topology of the manifold may force us to always return to
some already altered local solution.
Notice that we replaced the partial differential equation du = ω with a system of linear
equations (7.1.9), where the constants fαβ are subject to the constraints (7.1.8). This is
no computational progress since the complexity of the combinatorics of this system makes
it impossible solve in most cases.
The above considerations extend to higher degree, and one can imagine that the com-
plexity increases considerably. This is however the approach Čech adopted in order to
study the topology of manifolds, and although it may seem computationally hopeless, its
theoretical insights are invaluable.
The DeRham approach. This time we postpone asking why the global solvability
is not always possible. Instead, for each smooth manifold M one considers the Z-graded
248 CHAPTER 7. COHOMOLOGY
vector spaces
M M
B • (M ) = B k (M ), B 0 (M ) := {0} Z • (M ) := Z k (M ),
k≥0 k≥0
where
B k (M ) = dω ∈ Ωk (M ); ω ∈ Ωk−1 (M ) = exact k − forms,
and
Z k (M = η ∈ Ωk (M ); dη = 0 = closed k − forms.
Clearly B k ⊂ Z k . We form the quotients,
H k (M ) := Z k (M )/B k (M ).
Intuitively, this space consists of those closed k-forms ω for which the equation du = ω
has no global solution u ∈ Ωk−1 (M ). Thus, if we can somehow describe these spaces, we
may get an idea “who” is responsible for the global nonsolvability.
Definition 7.1.3. For any smooth manifold M the vector space H k (M ) is called the k-th
DeRham cohomology group of M . ⊔
⊓
Proof. Indeed
H 0 (M ) = Z 0 (M ) = f ∈ C ∞ (M ) ; df = 0 .
Thus, H 0 (M ) coincides with the linear space of locally constant functions. These are
constant on the connected components of M . ⊔
⊓
We see that H 0 (M ), the simplest of the DeRham groups, already contains an important
topological information. Obviously the groups H k are diffeomorphism invariants, and its
is reasonable to suspect that the higher cohomology groups may contain more topological
information.
Thus, to any smooth manifold M we can now associate the graded vector space
M
H • (M ) := H k (M ).
k≥0
every time the right-hand-side expression makes sense. The number dim H k (M ) is usually
denoted by bk (M ), and it is called the k-th Betti number of M . Hence
X
PM (t) = bk (M )tk .
k
We will spend the remaining of this chapter trying to understand what is that these
groups do and which, if any, is the connection between the two approaches outlined above.
The elements of Z n (C) are called cocycles, and the elements of B n (C) are called cobound-
aries.
Two cocycles c, c′ ∈ Z n (C) are said to be cohomologous if c−c′ ∈ B n (C). The quotient
module
H n (C) := Z n (C)/B n (C)
is called the n-th cohomology group (module) of C. It can be identified with the set of
equivalence classes of cohomologous cocycles. A cochain complex complex C is said to be
acyclic if H n (C) = 0 for all n > 0. ⊔
⊓
(where d is the exterior derivative) is a cochain complex of real vector spaces called the
DeRham complex. Its cohomology groups are the DeRham cohomology groups of the
manifold. ⊔
⊓
Example 7.1.10. Let (g, [·, ·]) be a real Lie algebra. Define
d : Λk g∗ → Λk+1 g∗ ,
by
X
(dω)(X0 , X1 , . . . , Xk ) := (−1)i+j ω([Xi , Xj ], X0 , . . . , X̂i , . . . , X̂j , . . . , Xk ),
0≤i<j≤k
H 1 (g) ∼
= (g/[g, g])∗ ,
7.1. DERHAM COHOMOLOGY 251
where [g, g] = span [X, Y ] ; X, Y ∈ g .
(b) Compute H 1 (gl(n, R)), where gl(n, R) denotes the Lie algebra of n × n real matrices
with the bracket given by the commutator.
(c) (Whitehead) Let g be a semisimple Lie algebra, i.e., its Killing pairing is nondegenerate.
Prove that H 1 (g) = {0}. (Hint: Prove that [g, g]⊥ = 0, where ⊥ denotes the orthogonal
complement with respect to the Killing pairing.) ⊔
⊓
Corollary 7.1.13. The DeRham cohomology of a smooth manifold has an R-algebra struc-
ture induced by the exterior multiplication of differential forms. ⊔
⊓
An
dn
w An+1
φn φn+1
un u
B
δn
w B n+1
(b) Two cochain morphisms φ, ψ : A• → B • are said to be cochain homotopic , and we
write this φ ≃ ψ, if there exists a degree −1 morphism χ : {An → B n−1 } such that
φ(a) − ψ(a) = ±δ ◦ χ(a) ± χ ◦ d(a).
(c) Two cochain complexes (A• , d), and (B • , δ) are said to be homotopic, if there exist
cochain morphism
φ : A → B and ψ : B → A,
such that ψ ◦ φ ≃ 1A , and φ ◦ ψ ≃ 1B . ⊔
⊓
252 CHAPTER 7. COHOMOLOGY
Example 7.1.15. The commutation rules in Subsection 3.2.1, namely [LX , d] = 0, and
[iX , d]s = LX , show that for each vector field X on a smooth manifold M , the Lie derivative
along X, LX : Ω• (M ) → Ω• (M ) is a cochain morphism homotopic with the trivial map
(≡ 0). The interior derivative iX is the cochain homotopy achieving this. ⊔
⊓
φ∗ : H • (A) → H • (B).
(b) If the cochain maps φ, ψ : A → B are cochain homotopic, then they induce identical
morphisms in cohomology, φ∗ = ψ∗ .
φ ψ
(c) (1A )∗ = 1H • (A) , and if (A•0 , d0 ) −→ (A•1 , d1 ) −→ (A•2 , d2 ) are cochain morphisms, then
(ψ ◦ φ)∗ = ψ∗ ◦ φ∗ .
(c) Obvious. ⊔
⊓
Corollary 7.1.17. If two cochain complexes (A• , d) and (B • , δ) are cochain homotopic,
then their cohomology modules are isomorphic. ⊔
⊓
φ ψ
0 → (A• , dA ) −→ (B • , dB ) −→ (C • , dC ) → 0
be a short exact sequence of cochain complexes of R-modules. This means that we have a
commutative diagram
7.1. DERHAM COHOMOLOGY 253
.. .. ..
u
. u
. u
.
dA dB dC
0 w An+1
u
φn+1
w B n+1
u
ψn+1
w C n+1
u w0
dA dB dC
0 w Au n φn
w Bu n ψn
w Cu n w0 (7.1.10)
dA dB dC
0 w An−1
u
ψn−1
w B n−1
u
ψn−1
w C n−1
u w0
dA dB dC
.. .. ..
. . .
in which the rows are exact. Then there exists a long exact sequence
∂n−1 φ∗ ψ∗ ∂
· · · → H n−1 (C) → H n (A) → H n (B) → H n (C) →
n
H n+1 (A) → · · · . (7.1.11)
We will not include a proof of this proposition. We believe this is one proof in ho-
mological algebra that the reader should try to produce on his/her own. We will just
indicate the construction of the connecting maps ∂n . This construction, and in fact the
entire proof, relies on a simple technique called diagram chasing.
Start with x ∈ H n (C). The cohomology class x can be represented by some cocycle
c ∈ Z n (C). Since ψn is surjective there exists b ∈ B n such that c = ψn (b). From the
commutativity of the diagram (7.1.10) we deduce 0 = dC ψn (b) = ψn+1 dB (b), i.e., dB (b) ∈
ker ψn+1 = range φn+1 . In other words, there exists a ∈ An+1 such that φn+1 (a) = dB n b.
We claim that a is a cocycle. Indeed,
φn+2 dA B B B
n+1 a = dn+1 φn+1 a = dn+1 dn b = 0.
a = φ−1 B −1 −1 B
n+1 ◦ d ◦ ψn (c) = φn+1 ◦ d b.
This description is not entirely precise since a depends on various choices. We let the
reader check that the correspondence Z n (C) ∋ c 7→ a ∈ Z n+1 (A) above induces a well
254 CHAPTER 7. COHOMOLOGY
defined map in cohomology, ∂n : H n (C) → H n+1 (A) and moreover, the sequence (7.1.11)
is exact.
Exercise 7.1.19. 1 Suppose R is a commutative ring with 1. For any cochain complex
(K • , dK ) of R-modules, and any integer n we denote by K[n]• the complex defined by
K[n]m = K n+m , dK[n] = (−1)n dK . We associate to any cochain map f : K • → L• two
new cochain complexes:
(a) The cone C(f )• , dC(f ) where
i+1
• • • ki+1 −dK 0 k
C(f ) = K[1] ⊕ L , dC(f ) = · .
ℓi f dL ℓi
(b) The cylinder Cyl(f ), dCyl(f )
ki dK −1K i+1 0 ki
Cyl(f )• ∼
= K • ⊕ C(f )• , dCyl(f ) ki+1 = 0 −dK 0 · ki+1 .
ℓi 0 f dL ℓi
(i) Prove that α, β f¯, δ(f ) are cochain maps, β ◦ α = 1L and α ◦ β is cochain homotopic to
1Cyl(f ) .
(ii) Show that we have the following commutative diagram of cochain complexes, where
the rows are exact.
0 w L• π̄
w C(f )• δ(f )
w K[1]• w0
α 1C(f )
u u
0 w K• f¯
w Cyl(f ) π
w C(f ) w0
1K β
u u
K•
f
w L•
(iii) Show that the connecting morphism in the long exact sequence corresponding to the
short exact sequence
f¯ π
0 → K • −→ Cyl(f ) −→ C(f ) → 0
coincides with the morphism induced in cohomology by δ(f ) : C(f ) → K[1]• .
(iv) Prove that f induces an isomorphism in cohomology if and only if the cone of f is
acyclic. ⊔
⊓
1
This exercise describes additional features of the long exact sequence in cohomology. They are partic-
ularly useful in the study of derived categories.
7.1. DERHAM COHOMOLOGY 255
L
Exercise 7.1.20. (Abstract Morse inequalities). Let C • = n≥0 C n be a cochain
complex of vector spaces over the field F. Assume each of the vector spaces C n is finite
dimensional. Form the Poincaré series
X
PC (t) = tn dimF C n ,
n≥0
and X
PH • (C) (t) = tn dimF H n (C).
n≥0
Prove that there exists a formal series R(t) ∈ Z[[t]] with non-negative coefficients such
that
PC • (t) = PH • (C) (t) + (1 + t)R(t).
In particular, whenever it makes sense, the graded spaces C ∗ and H ∗ have identical Euler
characteristics
χ(C • ) = PC (−1) = PH • (C) (−1) = χ(H • (C)). ⊔
⊓
Exercise 7.1.21. (Additivity of Euler characteristic). Let
0 → A• → B • → C • → 0
be a short exact sequence of cochain complexes of vector spaces over the field F. Prove
that if at least two of the cohomology modules H • (A), H • (B) and H • (C) have finite
dimension over F, then the same is true about the third one, and moreover
be a cochain complex of real vector spaces such that dim V n < ∞, for all n. Assume that
each V n is equipped with a Euclidean metric, and denote by d∗n : V n+1 → V n the adjoint
of dn . We can now form the Laplacians
(d) Prove that c determined in part (c) satisfies ∆n c = 0. Deduce from all the above that
the natural map
ker ∆n → H n (V • )
is a linear isomorphism. ⊔
⊓
Exercise 7.1.23. Let V be a finite dimensional real vector space, and v0 ∈ V . Define
φ∗ : Ω• (N ) → Ω• (M )
φ∗ : H • (N ) → H • (M ).
and the exterior differentiation is a quasi-derivation, so that the map it induces in coho-
mology will also be a ring morphism.
Definition 7.1.25. (a) Two smooth maps φ0 , φ1 : M → N are said to be (smoothly)
homotopic, and we write this φ0 ≃sh φ1 , if there exists a smooth map
Φ : I × M → N (t, m) 7→ Φt (m),
φ∗0 = φ∗1 : H • (N ) → H • (M ).
Proof. According to the general results in homological algebra, it suffices to show that the
pullbacks
φ∗0 , φ∗1 : Ω• (N ) → Ω• (M )
are cochain homotopic. Thus, we have to produce a map
χ : Ω• (N ) → Ω∗•−1 (M ),
such that
φ∗1 (ω) − φ∗0 (ω) = ±χ(dω) ± dχω, ∀ω ∈ Ω• (M ).
At this point, our discussion on the fibered calculus of Subsection 3.4.5 will pay off.
The projection Φ : I × M → M defines an oriented ∂-bundle with standard fiber I.
For any ω ∈ Ω• (N ) we have the equality
Z
∗ ∗ ∗ ∗
φ1 (ω) − φ0 (ω) = Φ (ω) |1×M −Φ (ω) |0×M = Φ∗ (ω).
(∂I×M )/M
We now use the homotopy formula in Theorem 3.4.54 of Subsection 3.4.5, and we deduce
Z Z Z
∗ ∗
Φ (ω) = dI×M Φ (ω) − dM Φ∗ (ω)
(∂I×M )/M (I×M )/M (I×M )/M
Z Z
= Φ∗ (dN ω) − dM Φ∗ (ω).
(I×M )/M (I×M )/M
Thus Z
χ(ω) := Φ∗ (ω)
(I×M )/M
⊔
Corollary 7.1.27. Two homotopy equivalent spaces have isomorphic cohomology rings.⊓
δ : Ω• (U ) ⊕ Ω• (V ) → Ω• (U ∩ V ), (ω, η) 7→ −ω |U ∩V +η |U ∩V .
is exact.
Proof. Obviously r is injective. The proof of the equality Range r = ker δ can be safely
left to the reader. The surjectivity of δ requires a little more effort.
The collection {U, V } is an open cover of M , so we can find a partition of unity
{ϕU , ϕV } ⊂ C ∞ (M ) subordinated to this cover, i.e.,
supp ϕU ⊂ U, supp ϕV ⊂ V, 0 ≤ ϕU , ϕV ≤ 1, ϕU + ϕV = 1.
supp ϕV ω ⊂ supp ϕV ⊂ V,
δ(−ϕV ω, ϕU ω) = (ϕV + ϕU )ω = ω.
Using the abstract results in homological algebra we deduce from the above lemma
the following fundamental result.
Theorem 7.1.29 (Mayer–Vietoris). Let M = U ∪ V be an open cover of the smooth
manifold M. Then there exists a long exact sequence
r δ ∂
· · · → H k (M ) → H k (U ) ⊕ H k (V ) → H k (U ∩ V ) → H k+1 (M ) → · · · ,
The connecting morphisms ∂ can be explicitly described using the prescriptions fol-
lowing Proposition 7.1.18 in the previous subsection. Let us recall that construction.
Start with ω ∈ Ωk (U ∩ V ) such that dω = 0. Writing as before
ω = ϕV ω + ϕU ω,
we deduce
d(ϕV ω) = d(−ϕU ω) on U ∩ V.
7.1. DERHAM COHOMOLOGY 259
Then
∂ω := η.
The reader can prove directly that the above definition is independent of the various
choices.
The Mayer–Vietoris sequence has the following functorial property.
Proposition 7.1.30. Let φ : M → N be a smooth map and {U, V } an open cover of N .
Then U ′ = φ−1 (U ), V ′ = φ−1 (V ) form an open cover of M and moreover, the diagram
below is commutative.
H k (N )
r
w H k (U ) ⊕ H k (V ) δ
w H k (U ∩ V ) ∂
w H k+1(N )
φ∗ φ∗ φ∗ φ∗ ⊔
⊓
u u u u
H k (M )
r′
w H k (U ′) ⊕ H k (V ′) δ′
w H k (U ′ ∩ V ′) ∂′
w H k+1(M )
Exercise 7.1.31. Prove the above proposition. ⊔
⊓
PS 1 (t) = 1 + t.
Each is diffeomorphic to Rn+1 . Note that the overlap Unorth ∩ Usouth is homotopically
equivalent with the equator S n . The Poincaré lemma implies that
For k > 0 the group on the left is also trivial, so that we have the isomorphisms
H k (S n ) ∼
= H k (Unorth ∩ Usouth ) ∼
= H k+1 (S n+1 ) k > 0.
Denote by Pn (t) the Poincaré polynomial of S n and set Qn (t) = Pn (t) − Pn (0) = Pn (t) − 1.
We can rewrite the above equality as
PS n (t) = 1 + tn . ⊔
⊓
Example 7.1.33. Let {U, V } be an open cover of the smooth manifold M. We assume
that all the Betti numbers of U , V and U ∩ V are finite. Using the Mayer–Vietoris short
exact sequence, and the Exercise 7.1.21 in Subsection 7.1.3, we deduce that all the Betti
numbers of M are finite, and moreover
Σ = (Σ \ disk) ∪ disk,
7.1. DERHAM COHOMOLOGY 261
If now Σ1 and Σ2 are two surfaces with finite Betti numbers, then
where the two holed surfaces intersect over an entire annulus, which is homotopically a
circle. Thus
χ(Σ1 #Σ2 ) = χ(Σ1 \ disk) + χ(Σ2 \ disk) − χ(S 1 )
= χ(Σ1 ) + χ(Σ2 ) − 2.
This equality is identical with the one proved in Proposition 4.2.42 of Subsection 4.2.5.
We can decompose a torus as a union of two cylinders. The intersection of these
cylinders is the disjoint union of two annuli so homotopically, this overlap is a disjoint
union of two circles. In particular, the Euler characteristic of the intersection is zero.
Hence
χ(torus) = 2χ(cylinder) = 2χ(circle) = 0.
We conclude as in Proposition 4.2.42 that
This is a pleasant, surprising connection with the Gauss–Bonnet theorem. And the story
is not over. ⊔
⊓
Example 7.1.35. (a) All compact manifolds are of finite type. To see this, it suffices
to cover such a manifold by finitely many open sets which are geodesically convex with
respect to some Riemann metric.
(b) If M is a finite type manifold, and U ⊂ M is a closed subset homeomorphic with
the closed unit ball in Rdim M , then M \ U is a finite type non-compact manifold. (It
suffices to see that Rn \ closed ball is of finite type).
(c) The connected sums, and the direct products of finite type manifolds are finite
type manifolds. ⊔
⊓
262 CHAPTER 7. COHOMOLOGY
In the proof of this proposition we will use the following fundamental result.
∂n : Fn (U ∩ V ) → Fn+1 (M ),
r∗ δ ∂
· · · → Fn (M ) → Fn (U ) ⊕ Fn (V ) → Fn (U ∩ V ) →
n
Fn+1 (M ) → · · · ,
where r ∗ is defined by
r ∗ = F(ıU ) ⊕ F(ıV ),
and δ is defined by
δ(x ⊕ y) = F(ıU ∩V )(y) − F(ıU ∩V )(x).
(The maps ı• denote natural embeddings.) Moreover, if N ∈ Mn is an open submanifold
of N , and {U, V } is an M V -cover of M such that {U ∩ N, V ∩ N } is an M V -cover of N ,
7.1. DERHAM COHOMOLOGY 263
Fn (U ∩ V )
∂n
w Fn+1(M )
u u
Fn (U ∩ V ∩ N )
∂n
w Fn+1(N )
The vertical arrows are the morphisms F(ı• ) induced by inclusions. ⊔
⊓
The covariant M V -functors are defined in the dual way, by reversing the orientation
of all the arrows in the above definition.
Definition 7.1.40. Let F, G be two contravariant M V -functors,
F, G : Mn → GradR Mod.
ϕ
one morphism for each M ∈ Mn , such that, for any embedding M1 ֒→ M2 , the diagram
below is commutative
Fn (M2 )
F(ϕ)
Fn (M1 ) w
φM 2 φM 1 ,
u u
Gn (M2 )
G(ϕ)
w Gn(M1)
and, for any M ∈ Mn and any M V -cover {U, V } of M , the diagram below is commutative.
Fn (U ∩ V )
∂n
w Fn+1(M )
φU ∩V φM
u u
n
G (U ∩ V )
∂n
wG n+1
(M )
The correspondence is said to be a natural equivalence if all the morphisms φM are iso-
morphisms. ⊔
⊓
Fn (U ) ⊕ Fn (Ur ) w Fn (U ∩ Ur ) ∂
w Fn+1(M ) w Fn+1(U ) ⊕ Gn+1(Ur )
u u u u n+1
Gn (U ) ⊕ Gn (Ur ) w Gn(U ∩ Ur ) ∂
w Gn+1(M ) wF n+1
(U ) ⊕ G (Ur )
The vertical arrows are defined by the correspondence φ. Note the inductive assumption
implies that in the above infinite sequence only the morphisms φM may not be isomor-
phisms. At this point we invoke the following technical result.
Lemma 7.1.42. (The five lemma.) Consider the following commutative diagram of
R-modules.
A−2 A−1 A0 wA1 w
A2 w w
u u u u u
f−2 f−1 f0 f1 f2
B−2 w B−1 w B0 w B1 w B2
If fi is an isomorphism for any i 6= 0, then so is f0 . ⊔
⊓
The five lemma applied to our situation shows that the morphisms φM must be iso-
morphisms. ⊔
⊓
Remark 7.1.44. (a) The Mayer–Vietoris principle is true for covariant M V -functors as
well. The proof is obtained by reversing the orientation of the horizontal arrows in the
above proof.
(b) The Mayer–Vietoris principle can be refined a little bit. Assume that F and
G are functors from Mn to the category of Z-graded R-algebras, and φ : F → G is a
correspondence compatible with the multiplicative structures, i.e., each of the R-module
morphisms φM are in fact morphisms of R-algebras. Then, if φRn are isomorphisms of
Z-graded R-algebras , then so are the φM ’s, for any M ∈ Mn .
(c) Assume R is a field. The proof of the Mayer–Vietoris principle shows that if F is
a MV-functor and dimR F∗ (Rn ) < ∞ then dim F∗ (M ) < ∞ for all M ∈ Mn .
(d) The Mayer–Vietoris principle is a baby case of the very general technique in alge-
braic topology called the acyclic models principle. ⊔
⊓
7.1. DERHAM COHOMOLOGY 265
Corollary 7.1.45. Any finite type manifold has finite Betti numbers. ⊔
⊓
In particular, we deduce
PM ×N (t) = PM (t) · PN (t).
F, G : Mm → GradR Alg,
( )
M M M
F : M 7→ Fr (M ) = H p (M ) ⊗ H q (N ) ,
r≥0 r≥0 p+q=r
and M M
G : M 7→ Gr (M ) = H r (M × N ),
r≥0 r≥0
where !
M M
F(f ) = f ∗ |H p (M2 ) ⊗1H q (N ) , ∀f : M1 ֒→ M2 ,
r≥0 p+q=r
and M
G(f ) = (f × 1N )∗ |H r (M2 ×N ) , ∀f : M1 ֒→ M2 .
r≥0
× : H • (M ) ⊗ H • (N ) → H • (M × N ) (ω ⊗ η) 7→ ω × η,
is called the cross product. The Künneth formula is a consequence of the following lemma.
266 CHAPTER 7. COHOMOLOGY
Proof. The only nontrivial thing to prove is that for any MV-cover {U, V } of M ∈ Mm
the diagram below is commutative.
⊕p+q=r H p (U ∩ V ) ⊗ H q (N )
∂
w ⊕p+q=r H p+1(M ) ⊗ H q (N )
φU ∩V φM
u u
H r ((U × N ) ∩ (V × N ))
∂′
w H r+1(M × N )
We briefly recall the construction of the connecting morphisms ∂ and ∂ ′ .
One considers a partition of unity {ϕU , ϕV } subordinated to the cover {U, V }. Then,
the functions ψU = πM ∗ ϕ and ψ = π ∗ ϕ form a partition of unity subordinated to the
U V M V
cover
U × N, V × N
of M × N . If ω ⊗ η ∈ H • (U ∩ V ) ⊗ H • (N ),
∂(ω ⊗ η) = ω̂ ⊗ η,
where
ω̂ |U = −d(ϕV ω) ω̂ = d(ϕU ω).
On the other hand, φU ∩V (ω ⊗ η) = ω × η, and ∂ ′ (ω × η) = ω̂ × η. This proves (a).
To establish (b), note that the inclusion
: N ֒→ Rm × N, x 7→ (0, x)
G(Rm ) ∼
= H • (N ).
Using the Poincaré lemma and the above isomorphism we can identify the morphism φRm
with 1Rm . ⊔
⊓
Thus
n n
bk (T ) = , dim H ∗ (T n ) = 2n ,
k
and χ(T n ) = 0.
One can easily describe a basis of H ∗ (T n ). Choose angular coordinates (θ 1 , . . . , θ n ) on
n
T . For each ordered multi-index I = (1 ≤ i1 < · · · < ik ≤ n) we have a closed, non-exact
7.2. THE POINCARÉ DUALITY 267
form dθ I . These monomials are linearly independent (over R) and there are 2n of them.
Thus, they form a basis of H ∗ (T n ). In fact, one can read the multiplicative structure
using this basis. We have an isomorphism of R-algebras
H • (T n ) ∼
= Λ• Rn . ⊔
⊓
(b) There exist cohomology classes e1 , . . . , er ∈ H • (E) such that their restrictions to
any fiber generate the cohomology algebra of that fiber.
ω · η = p∗ ω ∧ η, ∀ω ∈ H • (M ), η ∈ H • (E).
is a cochain complex. Its cohomology is denoted by Hcpt• (M ), and it is called the DeR-
ham cohomology with compact supports. Note that when M is compact this cohomology
coincides with the usual DeRham cohomology.
Although it looks very similar to the usual DeRham cohomology, there are many
important differences. The most visible one is that if φ : M → N is a smooth map,
and ω ∈ Ω•cpt (N ), then the pull-back φ∗ ω may not have compact support, so this new
construction is no longer a contravariant functor from the category of smooth manifolds
and smooth maps, to the category of graded vector spaces.
On the other hand, if dim M = dim N , and φ is an embedding, we can identify M with
an open subset of N , and then any η ∈ Ω•cpt(M ) can be extended by 0 outside M ⊂ N .
This extension by zero defines a push-forward map
φ∗ : Ω•cpt(M ) → Ω•cpt (N ).
268 CHAPTER 7. COHOMOLOGY
One can verify easily that φ∗ is a cochain map so that it induces a morphism
• •
φ∗ : Hcpt (M ) → Hcpt (N ).
• is a covariant functor from the category M ,
In terms of our category Mn we see that Hcpt n
to the category of graded real vector spaces. As we will see, it is a rather nice functor.
• is a covariant M V -functor, and moreover
Theorem 7.2.1. Hcpt
k 0 , k<n
Hcpt (Rn ) = .
R , k=n
The last assertion of this theorem is usually called the Poincaré lemma for compact
supports.
We first prove the Poincaré lemma for compact supports. The crucial step is the
following technical result that we borrowed from [14].
p
Lemma 7.2.2. Let E → B be a rank r real vector bundle which is orientable in the sense
described in Subsection 3.4.5. Denote by p∗ the integration-along-fibers map
p∗ : Ω•cpt(E) → Ω•−r
cpt (B).
such that,
Note that
∂E = ( {0} × (E ⊕ E) ) ⊔ ({1} × (E ⊕ E)) .
Define π t : E ⊕ E → E as the composition
π
E⊕E ∼
= {t} × E ⊕ E ֒→ I × (E ⊕ E) → E.
Observe that
∂π = π |∂E = (−π 0 ) ⊔ π 1 .
For (α, β) ∈ Ω•cpt(E) × Ω•cpt(E) define α ⊙ β ∈ Ω•cpt(E ⊕ E) by
α ⊙ β := (π 0 )∗ α ∧ (π 1 )∗ β.
7.2. THE POINCARÉ DUALITY 269
(Verify that the support of α ⊙ β is indeed compact.) For α ∈ Ωicpt(E), and β ∈ Ωjcpt(E)
we have the equalities
p∗ p∗ α ∧ β = π∗1 (α ⊙ β) ∈ Ωi+j−r
cpt (E), α ∧ p∗ p∗ β = π∗0 (α ⊙ β) ∈ Ωi+j−r
cpt (E).
Hence
Z
∗ ∗
D(α, β) = p p∗ α ∧ β − α ∧ p p∗ β = π∗1 (α ⊙ β) − π∗0 (α ⊙ β) = α ⊙ β.
∂E/E
Proof of the Poincaré lemma for compact supports. Consider δ ∈ C0∞ (Rn ) such
that Z
0 ≤ δ ≤ 1, δ(x)dx = 1.
Rn
Define the, closed, compactly supported n-form
τ := δ(x)dx1 ∧ · · · ∧ dxn .
We want to use Lemma 7.2.2 in which E is the rank n bundle over a point, i.e., E =
p
{pt} × Rn → {pt}. The integration along fibers is simply the integration map.
0 , deg ω < n
p∗ : Ωcpt(R ) → R, ω 7→ p∗ ω = R
• n
Rn ω , deg ω = n.
ω = (p∗ p∗ τ ) ∧ ω.
where
i(ω) = (ω̂, ω̂), j(ω, η) = η̂ − ω̂.
The hat ˆ denotes the extension by zero outside the support. This sequence is called the
Mayer–Vietoris short sequence for compact supports.
Lemma 7.2.3. The above Mayer–Vietoris sequence is exact.
Proof. The morphism i is obviously injective. Clearly, Range (i) = ker(j). We have to
prove that j is surjective.
Let (ϕU , ϕV ) a partition of unity subordinated to the cover {U, V }. Then, for any
η ∈ Ω∗cpt(M ), we have
ϕU η ∈ Ω•cpt(U ) and ϕV η ∈ Ω•cpt (V ).
In particular, η = j(−ϕU η, ϕV η), which shows that j is surjective. ⊔
⊓
We get a long exact sequence called the long Mayer–Vietoris sequence for compact
supports.
k k k k k+1 δ
· · · → Hcpt (U ∩ V ) → Hcpt (U ) ⊕ Hcpt (V ) → Hcpt (M ) → Hcpt → ···
k
Hcpt (N )
δ
w Hcpt
k+1 −1
(φ (U ∩ V ))
φ∗
u u
φ∗
k
Hcpt (M )
δ
w Hcpt
k+1
(U ∩ V )
Remark 7.2.4. To be perfectly honest (from a categorical point of view) we should have
considered the chain complex M M
Ω̃k = Ω−k
cpt ,
k≤0 k≤0
Given M ∈ M+
n , there is a natural pairing
n−k
h•, •iκ : Ωk (M ) × Ωcpt (M ) → R,
defined by Z
hω, ηiκ := ω ∧ η.
M
This pairing is called the Kronecker pairing. We can extend this pairing to any (ω, η) ∈
Ω• × Ω•cpt as
0 , deg ω + deg η 6= n
hω, ηiκ = R .
M ω ∧ η , deg ω + deg η = n
The Kronecker pairing induces maps
n−k
D = Dk : Ωk (M ) → (Ωcpt (M ))∗ , hD(ω), ηi = hω, ηiκ .
Above, h•, •i denotes the natural pairing between a vector space V and its dual V ∗ ,
h•, •i : V ∗ × V → R.
n−k
If ω is closed, the restriction of D(ω) to the space Zcpt of closed, compactly supported
n−k
(n − k)-forms vanishes on the subspace Bcpt = dΩn−k−1
cpt (M ). Indeed, if η = dη ′ , η ′ ∈
Ωn−k−1
cpt (M ) then
Z Z
Stokes
hD(ω), ηi = ω ∧ dη ′ = ± dω ∧ η ′ = 0.
M M
n−k
Thus, if ω is closed, the linear functional D(ω) defines an element of (Hcpt (M ))∗ . If
n−k
moreover ω is exact, a computation as above shows that D(ω) = 0 ∈ (Hcpt (M ))∗ . Hence
D descends to a map in cohomology
D : H k (M ) → (Hcpt
n−k
(M ))∗ .
272 CHAPTER 7. COHOMOLOGY
Equivalently, this means that the Kronecker pairing descends to a pairing in cohomology,
Theorem 7.2.7 (Poincaré duality). The Kronecker pairing in cohomology is a duality for
all M ∈ M+
n.
DM : H k (M ) → H̃ k (M ).
L
Lemma 7.2.8. k Dk is a correspondence of M V functors.
H k (N )
φ∗
w H k (M )
DN
u DM
u .
H̃ k (N )
φ̃∗
w H̃ k (M )
Fact B. If {U, V } is an M V -cover of M ∈ M+
n , then the diagram bellow is commutative
H k (U ∩ V )
∂
w H k+1(M )
.
u u
DU ∩V DM
H̃ k (U ∩ V )
(−1)k δ†
w H̃ k+1(M )
7.2. THE POINCARÉ DUALITY 273
Hence φ̃∗ ◦ DN = DM ◦ φ∗ .
Proof of Fact B. Let ϕU , ϕV be a partition of unity subordinated to the M V -cover
{U, V } of M ∈ M+ k
n . Consider a closed k-form ω ∈ Ω (U ∩ V ). Then the connecting
morphism in usual DeRham cohomology acts as
d(−ϕV ω) on U
∂ω = .
d(ϕU ω) on V
Choose η ∈ Ωn−k−1
cpt (M ) such that dη = 0. We have
Z Z Z Z
hDM ∂ω, ηi = ∂ω ∧ η = ∂ω ∧ η + ∂ω ∧ η − ∂ω ∧ η
M U V U ∩V
Z Z Z
=− d(ϕV ω) ∧ η + dϕU ω) ∧ η + d(ϕV ω) ∧ η.
U V U ∩V
Note that the first two integrals vanish. Indeed, over U we have the equality
d(ϕV ω) ∧ η = d ( ϕV ω ∧ η ) ,
and the vanishing now follows from Stokes formula. The second term is dealt with in a
similar fashion. As for the last term, we have
Z Z Z
deg ω
d(ϕV ω) ∧ η = dϕV ∧ ω ∧ η = (−1) ω ∧ (dϕV ∧ η)
U ∩V U ∩V U ∩V
Z
= (−1)k ω ∧ δη = (−1)k hDU ∩V ω, δηi = h(−1)k δ† DU ∩V ω, ηi.
U ∩V
This concludes the proof of Fact B. The Poincaré duality now follows from the Mayer–
Vietoris principle. ⊔
⊓
Remark 7.2.9. Using the Poincaré duality we can associate to any smooth map f : M →
N between compact oriented manifolds of dimensions m and respectively n a natural push-
forward or Gysin map
D (f ∗ )† D −1
H • (M ) →
M
(H m−• (M ))∗ → (H m−• (N ))∗ →
N
H •+n−m (N ),
Corollary 7.2.10. If M ∈ M+
n then
•
Hcpt (M ) ∼
= (H n−k (M ))• .
DtM : (Hcpt
n−k
(M ))∗∗ → (H k (M ))∗
is an isomorphism. On the other hand, for any finite dimensional vector space there exists
a natural isomorphism
V ∗∗ ∼
= V. ⊔
⊓
If M is connected H 0 (M ) ∼= H n (M ) ∼
= R so that H n (M ) is generated by any volume
form defining the orientation.
The symmetry of Betti numbers can be translated in the language of Poincaré poly-
nomials as
1
tn PM ( ) = PM (t). (7.2.1)
t
Example 7.2.12. Let Σg denote the connected sum of g tori. We have shown that
χ(Σg ) = b0 − b1 + b2 = 2 − 2g.
Consider now a compact oriented smooth manifold such that dim M = 2k. The Kro-
necker pairing induces a non-degenerate bilinear form
Z
k k
I : H (M ) × H (M ) → R I(ω, η) = ω ∧ η.
M
Exercise 7.2.14. (a) Let P ∈ Z[t] be an odd degree polynomial with non-negative integer
coefficients such that P (0) = 1. Show that if P satisfies the symmetry condition (7.2.1)
there exists a compact, connected, oriented manifold M such that PM (t) = P (t).
(b) Let P ∈ Z[t] be a polynomial of degree 2k with non-negative integer coefficients.
Assume P (0) = 1 and P satisfies (7.2.1). If the coefficient of tk is even then there exists
a compact connected manifold M ∈ M+ 2k such that PM (t) = P (t).
Hint: Describe the Poincaré polynomial of a connect sum in terms of the polynomials of
its constituents. Combine this fact with the Künneth formula. ⊔
⊓
Remark 7.2.15. The result in the above exercise is sharp. Using his intersection theorem
F. Hirzebruch showed that there exist no smooth manifolds M of dimension 12 or 20 with
Poincaré polynomials 1 + t6 + t12 and respectively 1 + t10 + t20 . Note that in each of these
cases both middle Betti numbers are odd. For details we refer to J. P. Serre, “Travaux de
Hirzebruch sur la topologie des variétés”, Seminaire Bourbaki 1953/54,n◦ 88. ⊔
⊓
Definition 7.3.2. (a) Two cycles (S0 , φ0 ), (S1 , φ1 ) ∈ Ck (M ) are said to be cobordant,
and we write this (S0 , φ0 ) ∼c (S1 , φ1 ), if there exists a compact, oriented manifold with
boundary Σ, and a smooth map Φ : Σ → M such that the following hold.
(a1) ∂Σ = (−S0 ) ⊔ S1 where −S0 denotes the oriented manifold S0 equipped with the
opposite orientation, and “⊔” denotes the disjoint union.
(a2) Φ |Si = φi , i = 0, 1.
(b) A cycle (S, φ) ∈ Ck (M ) is called trivial if there exists a (k + 1)-dimensional,
oriented manifold Σ with (oriented) boundary S, and a smooth map Φ : Σ → M such
that Φ|∂Σ = φ. We denote by Tk (M ) the set of trivial cycles.
(c) A cycle (S, φ) ∈ Ck (M ) is said to be degenerate if it is cobordant to a constant
cycle, i.e., a cycle (S ′ , φ′ ) such that φ′ is map constant on the components of S ′ . We
denote by Dk (M ) the set of degenerate cycles. ⊔
⊓
Exercise 7.3.3. Let (S0 , φ0 ) ∼c (S1 , φ1 ). Prove that (−S0 ⊔ S1 , φ0 ⊔ φ1 ) is a trivial cycle.
⊔
⊓
S0
S
1
−[S, φ] = [−S, φ] ∈ Hk (M ). ⊔
⊓
Stokes formula shows that this map is well defined, i.e., it is independent of the closed
form representing a cohomology class.
Indeed, if ω is exact, i.e., ω = dω ′ , then
Z Z
φ∗ dω ′ = dφ∗ ω ′ = 0.
S S
7.3. INTERSECTION THEORY 277
In other words, each cycle defines an element in ( H k (M ) )∗ . Via the Poincaré duality we
n−k n−k
identify the vector space ( H k (M ) )∗ with Hcpt (M ). Thus, there exists δS ∈ Hcpt (M )
such that Z Z
ω ∧ δS = φ∗ ω ∀ω ∈ H k (M ).
M S
The compactly supported cohomology class δS is called the Poincaré dual of (S, φ).
n−k
There exist many closed forms η ∈ Ωcpt (M ) representing δS . When there is no risk
of confusion, we continue denote any such representative by δS .
✍ For any differential form ω, we set (for typographical reasons), |ω| := deg ω.
where
∗ ∗
ω × η := πM ω ∧ πN η, ∀(ω, η) ∈ Ω• (M ) × Ω• (N ).
Pick (ω, η) ∈ Ω• (M ) × Ω• (N ) such that, deg ω + deg η = (m + n) − (p + q). Then, using
Exercise 7.1.50
Z Z
(m−p)|η|
(ω × η) ∧ (δS × δT ) = (−1) (ω ∧ δS ) × (η ∧ δT ).
M ×N M ×N
The above integral should be understood in the generalized sense of Kronecker pairing.
The only time when this pairing does not vanish is when |ω| = p and |η| = q. In this case
the last term equals
Z Z Z Z
q(m−p) q(m−p) ∗ ∗
(−1) ω ∧ δS η ∧ δT = (−1) φ ω ψ η
S T S T
Z
= (−1)q(m−p) (φ × ψ)∗ (ω ∧ η).
S×T
This establishes the equality (7.3.1). ⊔
⊓
∆ = ∆M : M → M × M, x 7→ (x, x).
Indeed, for any homogeneous forms α, β ∈ Ω• (M ) such that |α| + |β| = n, we have
Z X Z
|ω i |
(α × β) ∧ δ M = (−1) (α × β) ∧ (ω i × ωi )
M ×M i M ×M
X Z
i i
= (−1)|ω | (−1)|β|·|ω | (α ∧ ω i ) × (β ∧ ωi )
i M ×M
X Z Z
i i
= (−1)|ω | (−1)|β|·|ω | α∧ω i
β ∧ ωi .
i M M
The i-th summand is nontrivial only when |β| = |ω i |, and |α| = |ωi |. Using the equality
|ω i | + |ω i |2 ≡ 0(mod 2) we deduce
Z X Z Z X
i
(α × β) ∧ δ M = α∧ω β ∧ ωi = hα, ω i iκ hβ, ωi iκ .
M ×M i M M i
7.3. INTERSECTION THEORY 279
we conclude
Z Z Z X X
∗
∆ (α × β) = α∧β = ( ωi hω i , αiκ ) ∧ ( hβ, ωj iκ ω j )
M M M i j
Z X X
= hω i , αiκ hβ, ωj iκ ωi ∧ ω j = hω i , αiκ hβ, ωj iκ hωi , ω j iκ
M i,j i,j
X X
= (−1)|ωi |(n−|ωi |) hα, ω i iκ (−1)|ωi |(n−|ωi |) hβ, ωi iκ = hα, ω i iκ hβ, ωi iκ .
i i
Proof. (a) Consider a compact manifold Σ with boundary ∂Σ = −S0 ⊔ S1 and a smooth
map Φ : Σ → M such that Φ |∂Σ = φ0 ⊔ φ1 . For any closed k-form ω ∈ Ωk (M ) we have
Z Z Z Z Z
∗ ∗ Stokes ∗
0= Φ (dω) = dΦ ω = ω= φ1 ω − φ∗0 ω.
Σ Σ ∂Σ S1 S0
Part (b) is left to the reader. Part (c) is obvious. To prove (d) consider Σ = [0, 1] × S0
and
Φ : [0, 1] × S0 → M, Φ(t, x) = φ0 (x), ∀(t, x) ∈ Σ.
Note that ∂Σ = (−S0 ) ⊔ S0 so that
descends to a map
n−k
δ : Hk (M ) → Hcpt (M ).
This is usually called the homological Poincaré duality. We are not claiming that δ is an
isomorphism.
280 CHAPTER 7. COHOMOLOGY
2 C
1
1
C
2
-1 1
1
If S ⋔ Φ, then for each x ∈ φ−1 (S) we define the local intersection number at x to be
(or = orientation)
1 , or (Tφ(x) S) ∧ or(φ∗ Tx T ) = or(Tφ(x) M )
ix (S, T ) = .
−1 , or(Tφ(x) S) ∧ or(φ∗ Tx T ) = −or(Tφ(x) M )
Finally, we define the intersection number of S with T to be
X
S · T := ix (S, T ). ⊔
⊓
x∈φ−1 (S)
Our next result offers a different description of the intersection number indicating how
one can drop the transversality assumption from the original definition.
Proposition 7.3.12. Let M ∈ M+ n . Consider a compact, oriented, k-dimensional sub-
manifold S ֒→ M , and (T, φ) ∈ Cn−k (M ) a (n − k)-dimensional cycle intersecting S
transversally, i.e., S ⋔ φ. Then
Z
S ·T = δS ∧ δT , (7.3.3)
M
7.3. INTERSECTION THEORY 281
Proof. Fix a Riemann metric on M . Each point p ∈ φ(S) has a geodesically convex open
neighborhood entirely contained in N. Cover φ(S) by finitely many such neighborhoods,
+
and denote by N their union. Then N ∈ M+ n , and (S, φ) ∈ Ck (N ).
N
Denote by δS the Poincaré dual of S in N . It can be represented by a closed form in
n−k
Ωcpt (N ) which we continue to denote by δSN . If we pick a closed form ω ∈ Ωk (M ), then
ω |N is also closed, and Z Z Z
ω ∧ δSN = ω ∧ δSN = φ∗ ω.
M N S
n−k
Hence, δSN represents the Poincaré dual of S in Hcpt (M ), and moreover, supp δSN ⊂ N. ⊔
⊓
Proof. Using the transversality S ⋔ φ, the implicit function theorem, and eventually
restricting φ to a smaller ball, we deduce that, for some sufficiently “thin” neighborhood
N of S, there exist local coordinates (x1 , . . . , xn ) defined on some neighborhood U of
p ∈ M diffeomorphic with the cube
x1 = 0, . . . , xk = 0, xk+1 = y 1 , . . . , xn = y n−k .
282 CHAPTER 7. COHOMOLOGY
ǫ = S · (B, φ).
Equivalently, Z
v(ξ) = φ∗ξ δSN ,
B
where φξ : B → M is defined by
The form ωϕ is then the restriction to U ∩ S of the closed compactly supported form
We have Z Z Z Z
ω̃ϕ ∧ δSN = ωϕ ∧ δSN = ωϕ = ϕ(ξ)dξ. (7.3.4)
M U S Rk
The integral over U can be evaluated using the Fubini theorem. Write
Z Z !
F ubini k+1 n
= ǫ ϕ(ξ) f |dx ∧ · · · ∧ dx | |dξ|
S∩U Pξ
Z Z ! Z
=ǫ ϕ(ξ) δSN |dξ| = ǫ ϕ(ξ)v(ξ)|dξ|.
S Pξ S
Comparing with (7.3.4), and taking into account that ϕ was chosen arbitrarily, we deduce
that Z
φ∗ δSN = v(0) = ǫ.
B
The local transversal lemma is proved. ⊔
⊓
The transversality assumption implies that each pi has an open neighborhood Bi diffeo-
morphic to an open ball such that φi = φ |Bi is a local transversal at yi = φ(pi ). Moreover,
we can choose the neighborhoods Bi to be mutually disjoint. Then
X
S·T = S · (Bi , φi ).
i
The compact set K := φ(T \ ∪Bi ) does not intersect S so that we can find a “thin”, closed
neighborhood” N of S ֒→ M such that K ∩ N = ∅. Then, φ∗ δSN is compactly supported in
the union of the Bi ’s and
Z Z XZ
δSN ∧ δT = φ∗ δSN = φ∗i δSN .
M T i Bi
(a) S0 ∼c S1 , T0 ∼c T1 ,
(c) Si ⋔ Ti ,
284 CHAPTER 7. COHOMOLOGY
then
S0 · T0 = S1 · T1 . ⊔
⊓
Definition 7.3.17. The homological intersection pairing is the Z-bilinear map
I = IM : Hk (M ) × Hn−k (M ) → R,
(M ∈ M+
n ) defined by Z
I(S, T ) = δS ∧ δT . ⊔
⊓
M
We have proved that in some special instances I(S, T ) ∈ Z. We want to prove that,
when M is compact, this is always the case.
Theorem 7.3.18. Let M ∈ M+ n be compact manifold. Then, for any (S, T ) ∈ Hk (M ) ×
Hn−k (M ), the intersection number I(S, T ) is an integer.
The theorem will follow from two lemmata. The first one will show that it suffices to
consider only the situation when one of the two cycles is embedded. The second one will
show that, if one of the cycles is embedded, then the second cycle can be deformed so that
it intersects the former transversally. (This is called a general position result.)
Lemma 7.3.19 (Reduction-to-diagonal trick). Let M , S and T as in Theorem 7.3.18.
Then Z
n−k
I(S, T ) = (−1) δS×T ∧ δ∆ ,
M ×M
where ∆ is the diagonal cycle ∆ : M → M × M , x 7→ (x, x). (It is here where the
compactness of M is essential, since otherwise ∆ would not be a cycle).
δS×T = (−1)n−k δS × δT .
Then Z Z
(−1)n−k = δS×T ∧ δ∆ = (δS × δT ) ∧ δ∆
M ×M M ×M
Z Z
∗
= ∆ (δS × δT ) = δS ∧ δT . ⊔
⊓
M M
The proof of this result relies on Sard’s theorem. For details we refer to [62], Chapter
3.
Proof of Theorem 7.3.18 Let (S, T ) ∈ Ck (M ) × Cn−k (M ). Then,
deg F := (M × {y}) · ΓF . ⊔
⊓
Note that the intersections of ΓF with M × {y} correspond to the solutions of the
equation F (x) = y. Thus the topological degree counts these solutions (with sign).
Proposition 7.3.22. Let F : M → N as above. Then for any n-form ω ∈ Ωn (N )
Z Z
∗
F ω = deg F ω.
M N
Thus, to prove the proposition it suffices to check it for any particular form which generates
H n (N ). Our candidate will be the Poincaré dual δy of a point y ∈ N . We have
Z
δy = 1,
N
δM ×{y} = δM × δy = 1 × δy .
This corollary follows immediately from the considerations at the end of Subsection
4.2.4.
Ax
FA : S n−1 → S n−1 , x 7→ .
|Ax|
F
Exercise 7.3.27. Let M → N be a smooth map (M , N are smooth, compact oriented of
dimension n). Assume y ∈ N is a regular value of F , i.e., for all x ∈ F −1 (y) the derivative
Dx F : Tx M → Ty N
Prove that
X
deg F = deg(F, x). ⊔
⊓
F (x)=y
Exercise 7.3.28. Let M denote a compact oriented manifold, and consider a smooth
map F : M → M Regard H • (M ) as a superspace with the obvious Z2 -grading
H • (M ) = H even (M ) ⊕ H odd (M ),
and define the Lefschetz number λ(F ) of F as the supertrace of the pull back
F ∗ : H • (M ) → H • (M ).
Prove that λ(F ) = ∆ · ΓF , and deduce from this the Lefschetz fixed point theorem:
λ(F ) 6= 0 ⇒ F has a fixed point. ⊔
⊓
Remark 7.3.29. For an elementary approach to degree theory, based only on Sard’s
theorem, we refer to the beautiful book of Milnor, [97]. ⊔
⊓
7.3. INTERSECTION THEORY 287
satisfies
p∗ dE = (−1)r dB p∗ ,
so that it induces a map in cohomology
•
p∗ : Hcpt (E) → H •−r (B).
This induced map in cohomology is sometimes called the Gysin map.
Remark 7.3.30. If the standard fiber F is compact, then the total space E is also
compact. Using Proposition 3.4.48 we deduce that for any ω ∈ Ω• (E), and any η ∈ Ω• (B)
such that
deg ω + deg η = dim E,
we have
hp∗ (ω), ηiκ = hω, p∗ (η)ik
If we denote by DM : Ω• (M ) → Ωdim M −• (M )∗ the Poincaré duality isomorphism on a
compact oriented smooth manifold M , then we can rewrite the above equality as
hDB p∗ (ω), ηi = hDE ω, p∗ ηi = h(p∗ )† DE ω, ηi =⇒ DB p∗ (ω) = (p∗ )† DE ω.
Hence
p∗ = D−1 ∗ †
B (p ) DE ,
so that, in this case, the Gysin map coincides with the pushforward (Gysin) map defined
in Remark 7.2.9. ⊔
⊓
Hence Z
p∗ δσ = δσ = (−1)rm ∈ Ω0 (B).
E/B
The map p∗ is not injective in general. For example, if (S, φ) is a k-cycle in F , then
it defines a cycle in any fiber π −1 (b), and consequently in E. Denote by δS its Poincaré
m+r−k
dual in Hcpt (E). Then for any ω ∈ Ωm−k (B) we have
Z Z Z Z
(p∗ δS ) ∧ ω = δS ∧ p∗ ω = ± φ∗ p∗ ω = (p ◦ φ)∗ ω = 0,
B E D S
is an isomorphism called the Thom isomorphism. Its inverse is the Gysin map
(−1)rm p∗ : Hcpt
•
(E) → H •−r (B).
Proof. We have already established that p∗ τ = (−1)rm . To prove the reverse equality,
τ p∗ = (−1)rm , we will use Lemma 7.2.2 of Subsection 7.2.1. For β ∈ Ω•cpt (E) we have
•+dim M
Exercise 7.3.35. Show that τE = ζ∗ 1, where ζ• : H • (M ) → Hcpt (E) is the push-
forward map defined by a section ζ : M → E. ⊔
⊓
exp : NS → M,
is given by,
•+(n−k)
H • (S) ∋ ω 7−→ exp∗ δSN ∧ π ∗ ω ∈ Hcpt (M ). (7.3.5)
⊔
⊓
Definition 7.3.39. Let E → M be a real orientable vector bundle over the compact,
n (E) the Thom class of
oriented, n-dimensional, smooth manifold M . Denote by τE ∈ Hcpt
E. The Euler class of E is defined by
e(E) := ζ0∗ τE ∈ H n (M ),
where ζ0 : M → E denotes the zero section. e(T M ) is called the Euler class of M , and it
is denoted by e(M ). ⊔
⊓
Note that the sections of T M are precisely the vector fields on M . Moreover, any such
section σ : M → T M tautologically defines an n-dimensional cycle in T M , and in fact,
any two such cycles are homotopic: try a homotopy, affine along the fibers of T M .
Any two sections σ0 , σ1 : M → T M determine cycles of complementary dimension,
and thus the intersection number σ0 · σ1 is a well defined integer, independent of the two
sections. It is a number reflecting the topological structure of the manifold.
Proposition 7.3.40. Let σ0 , σ1 : M → T M be two sections of T M . Then
Z
e(M ) = σ0 · σ1 .
M
hω i , ωj iκ = δji . ∀i, j.
According to (7.3.2), we have
X i
δM = (−1)|ω | ω i × ωi .
i
Similarly, if we start first with the basis (ω i ), then its dual basis is
i |·|ω
(−1)|ω i|
ωi ,
so, taking into account that |ωj | + |ω j | · |ωj | ≡ n|ωj | (mod 2), we also have
X
δM = (−1)n·|ωj | ωj × ω j .
i
Lemma 7.3.43. N∆ ∼
= T M.
T (M × M ) |∆ ∼
= T ∆ ⊕ N∆ ∼
= T M ⊕ N∆
and
T (M × M ) |∆ = T M ⊕ T M. ⊔
⊓
τ N∆ = τ M . (7.3.6)
At this point we want to invoke a technical result whose proof is left to the reader as an
exercise in Riemann geometry.
exp |U : U → M × M
is an embedding. ⊔
⊓
Hence Z
e(M ) = χ(M ). ⊔
⊓
M
Corollary 7.3.45. For any Riemann metric h on a connected sum of g-tori Σg we have
1 1
ε(h) = sh dvh = e(Σg ) in H ∗ (Σg ). ⊔
⊓
2π 4π
7.3. INTERSECTION THEORY 293
The remarkable feature of the Gauss–Bonnet theorem is that, once we choose a metric,
we can explicitly describe a representative of the Euler class in terms of the Riemann
curvature.
The same is true for any compact, oriented, even-dimensional Riemann manifold. In
this generality, the result is known as Gauss–Bonnet–Chern, and we will have more to say
about it in the next two chapters.
We now have a new interpretation of the Euler characteristic of a compact oriented
manifold M .
Given a smooth vector field X on M , its “graph” in T M ,
ΓX = {(x, X(x)) ∈ Tx M ; x ∈ M },
is an n-dimensional submanifold of T M . The Euler characteristic is then the intersection
number
χ(M ) = ΓX · M,
where we regard M as a submanifold in T M via the embedding given by the zero section.
In other words, the Euler characteristic counts (with sign) the zeroes of the vector fields
on M . For example if χ(M ) 6= 0 this means that any vector field on M must have a zero
! We have thus proved the following result.
Corollary 7.3.46. If χ(M ) 6= 0 then the tangent bundle T M is nontrivial. ⊔
⊓
The equality χ(S 2n ) = 2 is particularly relevant in the vector field problem discussed
in Subsection 2.1.4. Using the notations of that subsection we can write
v(S 2n ) = 0.
We have thus solved “half” the vector field problem.
Exercise 7.3.47. Let X be a vector field over the compact oriented manifold M . A point
x ∈ M is said to be a non-degenerate zero of X if X(0) = 0 and
∂Xi
det |x=x0 6= 0.
∂xj
for some local coordinates (xi ) near x0 such that the orientation of Tx∗0 M is given by
dx1 ∧ · · · ∧ dxn . Prove that the local intersection number of ΓX with M at x0 is given by
∂Xi
ix0 (ΓX , M ) = sign det |x=x0 .
∂xj
This is sometimes called the local index of X at x0 , and it is denoted by i(X, x0 ). ⊔
⊓
Exercise 7.3.49. Let X be a vector field on Rn and having a non-degenerate zero at the
origin.
(a) prove that for all r > 0 sufficiently small X has no zeros on Sr = {|x| = r}.
(b) Consider Fr : Sr → S n−1 defined by
1
Fr (x) = X(x).
|X(x)|
Prove that i(X, 0) = deg Fr for all r > 0 sufficiently small.
Hint: Deform X to a linear vector field. ⊔
⊓
G × M → M (g, m) 7→ g · m,
Ψm : G ∋ g 7→ g · m ∈ M
is surjective. For any point x of a homogeneous space M we define the isotropy group at
x by
Ix = {g ∈ G ; g · x = x}.
Lemma 7.4.2. Let M be a homogeneous space with symmetry group G and x, y ∈ M .
Then
(a) Ix is a closed subgroup of G;
(b) Ix ∼
= Iy .
Proof. (a) is immediate. To prove (b), choose g ∈ G such that y = g · x. Then note that
Iy = gIx g−1 . ⊔
⊓
Remark 7.4.3. It is worth mentioning some fundamental results in the theory of Lie
groups which will shed a new light on the considerations of this section. For proofs we
refer to [61, 128].
Fact 1. Any closed subgroup of a Lie group is also a Lie group (see Remark 1.2.31). In
particular, the isotropy groups Ix of a homogeneous space are all Lie groups. They are
smooth submanifolds of the symmetry group.
7.4. SYMMETRY AND TOPOLOGY 295
Fact 2. Let G be a Lie group, and H a closed subgroup. Then the space of left cosets,
G/H := g · H; g ∈ G ,
can be given a smooth structure such that the map
G × (G/H) → G/H (g1 , g2 H) 7→ (g1 g2 ) · H
is smooth. The manifold G/H becomes a homogeneous space with symmetry group G.
All the isotropy groups subgroups of G conjugate to H.
Fact 3. If M is a homogeneous space with symmetry group G, and x ∈ M , then M is
equivariantly diffeomorphic to G/Ix , i.e., there exists a diffeomorphism
φ : M → G/Ix ,
such that φ(g · y) = g · φ(y). ⊔
⊓
Remark 7.4.5. This may not be the most elegant definition of a symmetric space, and
certainly it is not the minimal one. As a matter of fact, a Riemann manifold (M, g) is
a symmetric space if and only if there exists a smooth map σ : M × M → M satisfying
the conditions (c1),(c2) and (c3). We refer to [61, 70] for an extensive presentation of this
subject, including a proof of the equivalence of the two descriptions. Our definition has
one academic advantage: it lists all the properties we need to establish the topological
results of this section. ⊔
⊓
296 CHAPTER 7. COHOMOLOGY
The next exercise offers the reader a feeling of what symmetric spaces are all about.
In particular, it describes the geometric significance of the family of involutions σm .
Exercise 7.4.6. Let (M, h) be a symmetric space. Denote by ∇ the Levi–Civita connec-
tion, and by R the Riemann curvature tensor.
(a) Prove that ∇R = 0.
(b) Fix m ∈ M , and let γ(t) be a geodesic of M such that γ(0) = m. Show that
σm γ(t) = γ(−t). ⊔
⊓
Example 7.4.7. Perhaps the most popular example of symmetric space is the round
sphere S n ⊂ Rn+1 . The symmetry group is SO(n + 1), the group of orientation preserving
“rotations” of Rn+1 . For each m ∈ S n+1 we denote by σm the orthogonal reflection through
the 1-dimensional space determined by the radius Om. We then set im (T ) = σm T σm −1 ,
∀T ∈ SO(n + 1). We let the reader check that σ and i satisfy all the required axioms. ⊔ ⊓
Example 7.4.8. Let G be a connected Lie group and m a bi-invariant Riemann metric
on G. The direct product G × G acts on G by
(g1 , g2 ) · h = g1 hg2−1 .
This action is clearly transitive and since m is bi-invariant its is also isometric. Define
σ : G × G → G σg h = gh−1 g−1
and
i : G × (G × G) → G × G ig (g1 , g2 ) = (gg1 g−1 , gg2 g−1 ).
We leave the reader to check that these data do indeed define a symmetric space structure
on (G, m). The symmetry group is G × G. ⊔
⊓
Example 7.4.9. Consider the complex Grassmannian M = Grk (Cn ). Recall that in
Example 1.2.22 we described Grk (Cn ) as a submanifold of End+ (V ) – the linear space
of self-adjoint n × n complex matrices, via the map which associates to each complex
subspace S, the orthogonal projection PS : Cn → Cn onto S.
The linear space End+ (Cn ) has a natural metric g0 (A, B) = 21 Re tr (AB ∗ ) that restricts
to a Riemann metric g on M . The unitary group U (n) acts on End+ (Cn ) by conjugation,
Note that U (n) ⋆ M = M , and g0 is U (n)-invariant. Thus U (n) acts transitively, and
isometrically on M .
For each subspace S ∈ M define
RS := PS − PS ⊥ = 2PS − 1.
The operator RS is the orthogonal reflection through S ⊥ . Note that RS ∈ U (n), and
RS2 = 2. The map A 7→ RS ⋆ A is an involution of End+ (Cn ). It descends to an involution
of M . We thus get an entire family of involutions
Define
i : M × U (n) → U (n), iS T = RS T RS .
We leave the reader to check that the above collection of data defines a symmetric space
structure on Grk (Cn ). ⊔
⊓
Exercise 7.4.10. Fill in the details left out in the above example. ⊔
⊓
Proof. Denote by dVG (g) the normalized bi-invariant volume form on G. For any form
ω ∈ Ω∗ (M ) we define its G-average by
Z
ω= ℓ∗g ω dVG (g).
G
Now pick a partition of unity (αj ) ⊂ C ∞ (G) subordinated to the cover (Bj ), i.e.,
X
0 ≤ αj ≤ 1, supp αj ⊂ Bj , αj = 1.
j
298 CHAPTER 7. COHOMOLOGY
Set Z
aj := αj dVG (g).
G
P P
Since the volume of G is normalized to 1, and j αj = 1, we deduce that j aj = 1. For
any j = 1, . . . , m define Tj : Ω∗ (M ) → Ω∗ (M ) by
Z
Tj ω := αj (g)ℓ∗g ω dVG (g).
G
Note that X
ω= Tj ω and dTj ω = Tj dω.
j
Define Tj,t : Ω∗ (M ) → Ω∗ (M ) by
Z
Tj,t ω = αj (g)ℓ∗φj,t (t) ω dVG (g) = Tj φ∗j,t ω. (7.4.1)
G
Then Z Z
def
Us ω = Tj,es ω = αj (gj g)ℓ∗gj gs ω = α(gj g)ℓ∗gs ℓ∗gj ω dVG (g).
Br Br
For each g ∈ Br the map (s, m) 7→ Ψs (m) = gs (m) defines a local flow on M . We denote
by Xg its infinitesimal generator. Then
Z
d
(Us ω) = αj (gj g)LXg ℓ∗gs ℓ∗gj ω dVG (g)
ds Br
Z
= αj (gj g)(dM iXg + iXg dM )(ℓ∗gs ℓ∗gj ω) dVG (g).
Br
7.4. SYMMETRY AND TOPOLOGY 299
Consequently,
Tj,0 ω − Tj,1 ω = U−∞ ω − U0 ω
Z 0 Z
=− αj (gj g)(diXg + iXg d)(ℓ∗gs ℓ∗gj ω) dVG (g) ds.
−∞ Br
(An argument entirely similar to the one we used in the proof of the Poincaré lemma
shows that the above improper integral is pointwise convergent). From the above formula
we immediately read a cochain homotopy χ : Ω• (M ) → Ω•−1 (M ) connecting U−∞ to U0 .
More precisely
Z 0 Z n o
{χ(ω)} |x∈M := − αj (gj g) iXg ℓ∗gs ℓ∗gj ω |x dVG (g) ds.
−∞ Br
The proposition we have just proved has a greater impact when M is a symmetric space.
Proposition 7.4.14. Let (M, h) be an, oriented symmetric space with symmetry group
G. Then the following are true.
(a) Every invariant form on M is closed.
(b) If moreover M is compact, then the only invariant form cohomologous to zero is the
trivial one.
∗ ω. We claim ω̂
Proof. (a) Consider an invariant k-form ω. Fix m0 ∈ M and set ω̂ = σm0
is invariant. Indeed, ∀g ∈ G
ℓ∗g ω̂ = ℓ∗g σm
∗
0
ω = (σm0 g)∗ ω = (gig σm0 g)∗ ω = σm
∗
ℓ ∗ ω = σm
0 i(g)
∗
0
ω = ω̂.
ω̂ = (−1)k ω on M.
(b) Let ω be an invariant form cohomologous to zero, i.e. ω = dα. Denote by ∗ the Hodge
∗-operator corresponding to the invariant metric h. Since G acts by isometries, the form
η = ∗ω is also invariant, so that dη = 0. We can now integrate (M is compact), and use
Stokes theorem to get
Z Z Z
ω ∧ ∗ω = dα∧ = ± α ∧ dη = 0.
M M M
This forces ω ≡ 0. ⊔
⊓
From Proposition 7.4.12 and the above theorem we deduce the following celebrated
result of Élie Cartan ([25])
Corollary 7.4.15 (É. Cartan). Let (M, h) be a compact, oriented symmetric space with
compact, connected symmetry group G. Then the cohomology algebra H ∗ (M ) of M is
isomorphic with the graded algebra Ω∗inv (M ) of invariant forms on M . ⊔
⊓
In the coming subsections we will apply this result to the symmetric spaces discussed
in the previous subsection: the Lie groups and the complex Grassmannians.
Definition 7.4.19. A Lie algebra is called simple if it has no nontrivial ideals. A Lie
group is called simple if its Lie algebra is simple. ⊔
⊓
We have to describe the U (n)-invariant forms on Grk (Cn ). These forms are completely
determined by their values at a particular point in the Grassmannian. We choose this point
to correspond to the subspace S0 determined by the canonical inclusion Ck ֒→ Cn .
The isotropy of S0 is the group H = U (k) × U (ℓ). The group H acts linearly on the
tangent space V0 = TS0 Grk (Cn ). If ω is an U (n)-invariant form, then its restriction to V0
is an H-invariant skew-symmetric, multilinear map
V0 × · · · × V0 → R.
Conversely, any H-invariant element of Λ• V0∗ extends via the transitive action of U (n)
to an invariant form on Grk (Cn ). Denote by Λ•inv the space of H-invariant elements of
Λ• V0∗ . We have thus established the following result.
Proposition 7.4.27. There exists an isomorphism of graded R-algebras:
H • (Grk (Cn )) ∼
= Λ•inv . ⊔
⊓
H ∋ h 7→ Th ∈ Aut (V0 ).
Each h ∈ U (k) × U (ℓ) is conjugate to diagonal unitary matrix, i.e., there exists g ∈ H
such that ghg −1 ∈ T.
We can rephrase this fact in terms of the conjugation action of H on itself
The class functions are constant along the orbits of this conjugation action, and each
such orbit intersects the maximal torus T. In other words, a class function is completely
determined by its restriction to the maximal torus. Hence, it is reasonable to expect
that we ought to be able to describe the integral in (7.4.2) as an integral over T. This is
achieved in a very explicit manner by the next result.
Define ∆k : Tk → C by
Y i j √
∆k (θ 1 , . . . , θ k ) = (eiθ − eiθ ), i= −1.
1≤i<j≤n
On the unitary group U (k) we fix the bi-invariant metric m = mk such that at T1 U (k) we
have
m(X, Y ) := Re tr(XY ∗ ).
If we think of X, Y as k × k matrices, then
X
m(X, Y ) = Re xij ȳij .
i,j
We denote by dvk the volume form induced by this metric, and by Vk the total volume
Z
Vk := dvk .
U (k)
Theorem 7.4.28 (Weyl’s integration formula). Consider a class function ϕ on the group
G = U (k1 ) × · · · × U (ks ), denote by dg the volume form
dg = dvk1 ∧ · · · ∧ dvks ,
Z s
Y
1
= Qs kj k !
ϕ(θ 1 , . . . , θ s ) |∆kj (θ j )|2 dθ 1 ∧ · · · ∧ dθ s
j=1 (2π) j T j=1
k
θ j := (θj1 , . . . , θj j ),
k
dθ j := dθj1 ∧ · · · ∧ dθj j .
304 CHAPTER 7. COHOMOLOGY
The remainder of this subsection is devoted to the proof of this proposition. The
reader may skip this part at the first lecture, and go directly to Subsection 7.4.5 where
this formula is used to produce an explicit description of the Poincaré polynomial of a
complex Grassmannian.
Proof of Weyl’s integration formula To keep the main ideas as transparent as possi-
ble, we will consider only the case s = 1. The general situation is entirely similar. Thus,
G = U (k) and T = Tk . In this case, the volume form dg is the volume form associated to
metric mk .
We denote by Ad : G → Aut(LG ) the adjoint representation of G; see Example 3.4.30.
In this concrete case, Ad can be given the explicit description
ωϕ := ϕ(g)dg.
LG = LT ⊕ L⊥
G.
The tangent space to G/T at 1 · T ∈ G/T can be identified with L⊥ T . For this reason we
⊥
will write LG/T instead of LT .
Fix x ∈ G/T. Any g ∈ G defines a linear map Lg : Tx G/T → Tgx G/T. Moreover, if
gx = hx = y, then Lg and Lh differ by an element in the stabilizer of x ∈ G/T. This
stabilizer is isomorphic to T, and in particular it is connected.
Hence, if ω ∈ det Tx G/T, then Lg ω ∈ det Ty G/T and Lh ω ∈ det Ty G/T define the
same orientation of Ty G/T. In other words, an orientation in one of the tangent spaces of
G/T “spreads” via the action of G to an orientation of the entire manifold. Thus, we can
orient G/T by fixing an orientation on LG/T .
We fix an orientation on LG , and we orient LT using the form dθ = dθ 1 ∧ · · · ∧ dθ k .
The orientation on LG/T will be determined by the condition
The proof of Weyl’s integration formula will be carried out in three steps.
7.4. SYMMETRY AND TOPOLOGY 305
Step 1. Z Z
1
ωϕ = q ∗ ωϕ ∀ω.
G k! T×G/T
Step 2. We prove that there exists a positive constant Ck such that for any class
function ϕ on G we have
Z Z Z
k! ωϕ = q ωϕ = Ck ϕ(θ)|∆k (θ)|2 dθ.
∗
G T×G/T T
Proof. This is a pompous rephrasing of the classical statement in linear algebra that two
unitary matrices are similar if and only if they have the same spectrum, multiplicities
included. The adjoint action of N (T) on T= diagonal unitary matrices simply permutes
the entries of a diagonal unitary matrix. This action descends to an action on the quotient
W so that W ⊂ Sk .
Conversely, any permutation of the entries of a diagonal matrix can be achieved by a
conjugation. Geometrically, this corresponds to a reordering of an orthonormal basis. ⊔ ⊓
1 k
Lemma 7.4.30. Let α1 , . . . , αk ) ∈ Rk such that 1, α2π , . . . , α2π are linearly independent
over Q. Set τ := (exp(iα1 ), . . . , exp(iαk )) ∈ Tk . Then the sequence (τ n )n∈Z is dense in
Tk . (The element τ is said to be a generator of Tk .)
For the sake of clarity, we defer the proof of this lemma to the end of this subsection.
Lemma 7.4.31. Let τ ∈ Tk ⊂ G be a generator of Tk . Then q −1 (τ ) ⊂ T × G/T consists
of |W| = k! points.
Proof.
q(s, gT ) = τ ⇐⇒ gsg −1 = τ ⇐⇒ gτ g −1 = s ∈ T.
306 CHAPTER 7. COHOMOLOGY
gT g−1 ⊂ T ⇒ g ∈ N (T).
Hence
q −1 (τ ) = (g−1 τ g, gT ) ∈ T × G/T ; g ∈ N (T) ,
and thus q −1 (τ ) has the same cardinality as the Weyl group W. ⊔
⊓
Proof. Fix a point x0 = (t0 , g0 T) ∈ T×G/T. We can identify Tx0 (T×G/T) with LT ⊕LG/T
using the isometric action of T × G on T × G/T.
Fix X ∈ LT , Y ∈ LG/T . For every real number s consider
hs = q t0 exp(sX) , g0 exp(sY )T = g0 exp(sY )t0 exp(sX) exp(−sY )g0−1 ∈ G.
We want to describe
d
|s=0 h−1
0 hs ∈ T1 G = LG .
ds
Using the Taylor expansions
we deduce
h−1 −1 −1
0 hs = g0 t0 (1 + sY )t0 (1 + sX)(1 − sY )g0 + O(s )
2
= 1 + s g0 t−1 −1 −1
0 Y t0 g0 + g0 Xg0 − g0 Y g0
−1
+ O(s2 ).
Hence, the differential
can be rewritten as
The linear operator Adg is an m-orthogonal endomorphism of LG , so that det Adg = ±1.
On the other hand, since G = U (k) is connected, det Adg = det Ad1 = 1. Hence,
Consequently
det Dx0 q = det(Adt−1 − 1) = |∆k (t)|2 . ⊔
⊓
Lemma 7.4.32 shows that q is an orientation preserving map. Using Lemma 7.4.31 and
Exercise 7.3.27 we deduce deg q = |W| = k!. Step 1 is completed.
Step 2 follows immediately from Lemma 7.4.32. More precisely, we deduce
Z Z !Z
1
ωϕ = dµ ϕ(θ)∆k (θ)dθ.
G k! G/T Tk
| {z }
=:Ck
To complete Step 3, that is, to find the constant Ck , we apply the above equality in the
special case ϕ = 1. so that ωϕ = dvk . We deduce
Z
Ck
Vk = ∆k (θ)dθ,
k! Tk
so that
Vk k!
Ck = R .
Tk ∆k (θ)dθ
To compute the last integral we observe that ∆k (θ) can be expressed as a Vandermonde
determinant
1 1 ··· 1
1 2 k
eiθ eiθ ··· eiθ
1 2 k
∆k (θ) = e2iθ e2iθ ··· e2iθ
.. .. .. ..
. . . .
i(k−1)θ 1 i(k−1)θ 2 i(k−1)θ k
e e ··· e
This shows that we can write ∆k as a trigonometric polynomial
X
∆k (θ) = ǫ(σ)eσ (θ),
σ∈Sk
308 CHAPTER 7. COHOMOLOGY
where, for any permutation σ of {1, 2, . . . , k} we denoted by ǫ(σ) its signature, and by
eσ (θ) the trigonometric monomial
Qk iσ(j)θ j
1 k j=1 e
eσ(θ) = eσ (θ , . . . , θ ) = Qk .
iθ j
j=1 e
The monomials eσ are orthogonal with respect to the L2 -metric on Tk , and we deduce
Z XZ k
2
|∆k (θ)| dθ = |eσ (θ)|2 dθ = (2π)k k!.
Tk σ∈Sk T
Proof of Lemma 7.4.30 We follow Weyl’s original approach ([129, 130]) in a modern
presentation.
Let X = C(T, C) denote the Banach space of continuous complex valued functions on
T. We will prove that
n Z
1 X j
lim f (τ ) = f dt, ∀f ∈ X. (7.4.3)
n→∞ n + 1 T
j=0
e := (e1 , . . . , eℓ ), ej = exp(2πiθ j ),
dτ = dτ 1 ∧ · · · ∧ dτ k ,
dθ = dθ 1 ∧ · · · ∧ dθ ℓ .
{εα ej ; 1 ≤ α ≤ k 1 ≤ j ≤ ℓ}.
Using the Weyl integration formula we deduce that the Poincaré polynomial of Grk (Cn )
is Z
1 Y
Pk,ℓ (t) = |1 + tεα ej |2 |∆k (ε(τ ))|2 |∆ℓ (e(θ))|2 dτ ∧ dθ
k!ℓ! T k ×T ℓ
α,j
Z Y
1
= |εα + tej |2 |∆k (ε(τ ))|2 |∆ℓ (e(θ))|2 dτ ∧ dθ.
k!ℓ! T k ×T ℓ α,j
so that Z
1
Pk,ℓ (t) = Ik,ℓ (t)Ik,ℓ (t)dτ ∧ dθ. (7.4.5)
k!ℓ! T k ×T ℓ
310 CHAPTER 7. COHOMOLOGY
6 5
5 4
5 4
3 3
1 3
1
The Weyl group W = Sk × Sℓ acts on the variables (x; y) by separately permuting the
x-components and the y-components. If (σ, ϕ) ∈ W then
Traditionally, one can visualize a partition using Young diagrams. A Young diagram
is an array of boxes arranged in left justified rows (see Figure 7.4). Given a partition
(λ1 ≥ · · · ≥ λn ), its Young diagram will have λ1 boxes on the first row, λ2 boxes on the
second row etc.
7.4. SYMMETRY AND TOPOLOGY 311
The Young diagram of λ̂ is the transpose of the Young diagram of λ (see Figure 7.4).
A strict partition is a partition which is strictly decreasing on its support. Denote
by Pn the set of partitions of length ≤ n, and by P∗n the set of strict partitions λ of
length n − 1 ≤ L(λ) ≤ n. Clearly P∗n ⊂ Pn . Denote by δ = δn ∈ P∗n the partition
(n − 1, n − 2, . . . , 1, 0, . . .).
Remark 7.4.34. The correspondence Pn ∋ λ 7→ λ + δn ∈ P∗n . is a bijection. ⊔
⊓
For each λ ∈ Pn we have λ + δ ∈ P∗n , so that aλ+δ is well defined and nontrivial.
Note that aλ+δ vanishes when xi = xj , so that the polynomial aλ+δ (x) is divisible by
each of the differences (xi − xj ) and consequently, it is divisible by aδ . Hence
aλ+δ (x)
Sλ (x) :=
aδ (x)
is a well defined polynomial. It is a symmetric polynomial since each of the quantities
aλ+δ and aδ is skew-symmetric in its arguments. The polynomial Sλ (x) is called the
Schur polynomial corresponding to the partition λ. Note that each Schur polynomial Sλ
is homogeneous of degree |λ|. We have the following remarkable result.
Lemma 7.4.35. X
Jk,ℓ (t) = t|λ| Sλ̂ (x)Sλ (y),
λ∈Pk,ℓ
where
Pk,ℓ := λ ; λ1 ≤ k L(λ) ≤ ℓ .
For each λ ∈ Pk,ℓ we denoted by λ the complementary partition
λ = (k − λℓ , k − λℓ−1 , . . . , k − λ1 ).
Geometrically, the partitions in Pk,ℓ are precisely those partitions whose Young dia-
grams fit inside a ℓ × k rectangle. If λ is such a partition then the Young diagram of the
complementary of λ is (up to a 180◦ rotation) the complementary of the diagram of λ in
the ℓ × k rectangle (see Figure 7.5).
312 CHAPTER 7. COHOMOLOGY
6 7
4 6
4 5
2 3
1 3
0 1
Figure 7.5: The complementary of (6, 4, 4, 2, 1, 0) ∈ P7,6 is (7, 6, 5, 3, 3, 1) ∈ P6,7
For a proof of Lemma 7.4.35 we refer to [88], Section I.4, Example 5. The true essence
of the Schur polynomials is however representation theoretic, and a reader with a little
more representation theoretic background may want to consult the classical reference
[87], Chapter VI, Section 6.4, Theorem V, for a very exciting presentation of the Schur
polynomials, and the various identities they satisfy, including the one in Lemma 7.4.35.
Using (7.4.5), Lemma 7.4.35, and the definition of the Schur polynomials, we can
describe the Poincaré polynomial of Grk (Cn ) as
2
Z X
1
Pk,ℓ (t) = t|λ| aλ̂+δk (ε)aλ+δℓ (e) dτ ∧ dθ. (7.4.6)
k!ℓ! T k ×T ℓ λ∈Pk,ℓ
k+1
R l+1 R
l+1
k+1
R
k+1
Figure 7.6: The rectangles Rℓ+1 and Rk+1 are “framed” inside Rℓ+1
The map
Pk,ℓ ∋ λ 7→ λ ∈ Pℓ,k ,
is a bijection so that X
Pk,ℓ (t) = t2|λ| = Pℓ,k (t). (7.4.8)
λ∈Pℓ,k
Computing the Betti numbers, i.e., the number of partitions in Pk,ℓ with a given weight
is a very complicated combinatorial problem, and currently there are no exact general
formulæ. We will achieve the next best thing, and rewrite the Poincaré polynomial as a
“fake” rational function.
Denote by bk,ℓ (w) the number of partitions λ ∈ Pℓ,k with weight |λ| = w. Hence
kℓ
X
Pk,ℓ (t) = bk,ℓ (w)t2w .
w=1
Alternatively, bk,ℓ (w) is the number of Young diagrams of weight w which fit inside a k × ℓ
rectangle.
Lemma 7.4.36.
bk+1,ℓ+1 (w) = bk,ℓ+1 (w) + bk+1,ℓ (w − ℓ − 1).
k+1
Proof. Look at the (k + 1) × ℓ rectangle Rk+1 inside the (k + 1) × (l + 1)-rectangle Rℓ+1
(see Figure 7.6). Then
k+1
bk+1,ℓ+1 (w) = # diagrams of weight w which fit inside Rℓ+1
= # diagrams which fit inside Rk+1
+# diagrams which do not fit inside Rk+1 .
On the other hand,
bk+1,ℓ = # diagrams which fit inside Rk+1 .
314 CHAPTER 7. COHOMOLOGY
If a diagram does not fit inside Rk+1 this means that its first line consists of ℓ + 1 boxes.
When we drop this line, we get a diagram of weight w − ℓ − 1 which fits inside the
k × (ℓ + 1) rectangle Rℓ+1 of Figure 7.6. Thus, the second contribution to bk+1,ℓ+1 (w) is
bk,ℓ+1 (w − ℓ − 1). ⊔
⊓
Because the roles of k and ℓ are symmetric (cf. (7.4.8)) we also have
Let m = k + ℓ + 1 and set Qd,m (t) = Pd,m−d (t) = PGd (m,C) (t). The last equality can be
rephrased as
1 − tm−k
Qk+1,m (t) = Qk,m · ,
1 − t2(k+1)
so that
1 − t2(m−k) 1 − t2(m−k+1) 1 − t2(n−1)
Qk+1,m(t) = · · · · .
1 − t2(k+1) 1 − t2k 1 − t4
Now we can check easily that b1,m−1 (w) = 1, i.e.,
1 − t2m
Q1,m (t) = 1 + t2 + t4 + · · · + t2(m−1) = .
1 − t2
Hence
(1 − t2(m−k+1) ) · · · (1 − t2m )
PGk (m,C) (t) = Qk,m(t) =
(1 − t2 ) · · · (1 − t2k )
(1 − t2 ) · · · (1 − t2m )
= .
(1 − t2 ) · · · (1 − t2k )(1 − t2 ) · · · (1 − t2(m−k) )
Remark 7.4.37. (a) The invariant theoretic approach in computing the cohomology
of Grk (Cn ) was used successfully for the first time by C. Ehresmann[41]. His method
was then extended to arbitrary compact, oriented symmetric spaces by H. Iwamoto [69].
However, we followed a different avenue which did not require Cartan’s maximal weight
theory.
(b) We borrowed the idea of using the Weyl’s integration formula from Weyl’s classical
monograph [133]. In turn, Weyl attributes this line of attack to R. Brauer. Our strategy
is however quite different from Weyl’s. Weyl uses an equality similar to (7.4.5) to produce
an upper estimate for the Betti numbers (of U (n) in his case) and then produces by hand
sufficiently many invariant forms. The upper estimate is then used to established that
these are the only ones. We refer also to [123] for an explicit description of the invariant
forms on Grassmannians. ⊔
⊓
7.5. ČECH COHOMOLOGY 315
Exercise 7.4.38. Show that the cohomology algebra of CPn is isomorphic to the truncated
ring of polynomials
R[x]/(xn+1 ),
where x is a formal variable of degree 2 while (xn+1 ) denotes the ideal generated by xn+1 .
Hint: Describe Λ•inv CPn explicitly. ⊔
⊓
(a) If (Uα ) is an open cover of the open set U , and f, g ∈ S(U ) satisfy f |Uα = g |Uα , ∀α,
then f = g.
(b) If (Uα ) is an open cover of the open set U , and fα ∈ S(Uα ) satisfy
Example 7.5.3. All the presheaves discussed in Example 7.5.1 are sheaves. ⊔
⊓
We let the reader verify this is not a sheaf since the condition (b) is violated. The reason
behind this violation is that in the definition of this presheaf we included a global condition
namely the boundedness assumption. ⊔
⊓
(a1) The map π is a local homeomorphism that is, each point e ∈ E has a neighborhood
U such that π |U is a homeomorphism onto the open subset π(U ) ⊂ X. For every
x ∈ X, the set Ex := π −1 (x), is called the stalk at x.
∀r ∈ R, ∀x ∈ X, ∀u, v ∈ Ex we have r · u ∈ Ex , u + v ∈ Ex ,
by
f ∼x g ⇐⇒ ∃ open U ∋ x such thatf |U = g |U .
F
The equivalence class of f ∈ U ∋x S(U ) is denoted by [f ]x , and it is called the germ of f
at x. Set G
Sx := {[f ]x ; dom f ∋ x}, and bS := Sx .
x∈X
We let the reader check that this collection of sets satisfies the axioms of a basis of
neighborhoods as discussed e.g. in [77].
With this topology, each f ∈ S(U ) defines a continuous section of π over U
[f ] : U ∋ u 7→ [f ]u ∈ Su .
Note that each fiber Sx has a well defined structure of Abelian group
π
Proposition 7.5.8. (a) Let E → X be a space of germs. Then E b = E.
Definition 7.5.10. If S is a presheaf over the topological space X, then the sheaf b
S is
called the sheaf associated to S, or the sheafification of S. ⊔
⊓
Definition 7.5.11. (a) Let A morphism between the (pre)sheaves of Abelian groups
(modules etc.) S and S̃ over the topological space X is a collection of morphisms of
Abelian groups (modules etc.) hU : S(U ) → S̃(U ), one for each open set U ⊂ X, such
that, when V ⊂ U , we have hV ◦ rUV = r̃UV ◦ hU . Above, rUV denotes the restriction
morphisms of S, while the r̃UV denotes the restriction morphisms of S̃. A morphism h is
said to be injective if each hU is injective.
(b) Let S be a presheaf over the topological space X. A sub-presheaf of S is a pair (T, ı),
where T is a presheaf over X, and ı : T → S is an injective morphism. The morphism ı is
called the canonical inclusion of the sub-presheaf. ⊔
⊓
U 7→ ker hU ⊂ S(U ),
(a2) For any x ∈ X, the induced map hx : π0−1 (x) → π1−1 (x) is a morphism of Abelian
groups (modules etc.).
Lemma 7.5.16. Consider two sheaves S and T, and let h : S → T be a morphism. Then
h induces a morphism between the associated spaces of germs b b
h : Ŝ → T. ⊔
⊓
☞ Due to Proposition 7.5.8(a), in the sequel we will make no distinction between sheaves
and spaces of germs.
nh hn+1
· · · → Sn → Sn+1 → Sn+2 → · · · ,
ı d d d
0 ֒→ S ֒→ S0 →0 S1 →1 · · · → Sn →
n
Sn+1 → · · · . ⊔
⊓
0 → S−1 → S0 → S1 → 0.
is a resolution of the constant sheaf RM . ΩkM denotes the sheaf of k-forms on M while d
denotes the exterior differentiation. ⊔
⊓
320 CHAPTER 7. COHOMOLOGY
We define dim A to be one less the cardinality of A, dim A = |A| − 1. The set of all
q-dimensional simplices is denoted by Nq (U). Their union,
[
Nq (U)
q
The elements of C q (S, U) are called Čech q-cochains subordinated to the cover U. In other
words, a q-cochain c associates to each ordered q-simplex σ an element hc, σi ∈ Sσ .
We have face operators
~ q (U) → N
∂j = ∂j : N ~ q−1 (U), j = 0, . . . , q,
is a cochain complex.
The cohomology of this cochain complex is called the Čech cohomology of the cover U
with coefficients in the pre-sheaf S.
Example 7.5.23. Let U and S as above. A 0-cochain is a correspondence which associates
to each open set Uα ∈ U an element cα ∈ S(Uα ). It is a cocycle if, for any pair (α, β) such
that Uα ∩ β 6= ∅, we have
cβ − cα = 0.
Observe that if S is a sheaf, then the collection (cα ) determines a unique section of S.
A 1-cochain associates to each pair (α, β) such that Uαβ 6= ∅ an element
cαβ ∈ S(Uαβ ).
This correspondence is a cocycle if, for any ordered 2-simplex (α, β, γ) we have
For example, if X is a smooth manifold, and U is a good cover, then we can associate to
each closed 1-form ω ∈ Ω1 (M ) a Čech 1-cocycle valued in RX as follows.
First, select for each Uα a solution fα ∈ C ∞ (Uα ) of
dfα = ω.
Since d(fα − fβ ) ≡ 0 on Uαβ , we deduce there exist constants cαβ such that fα − fβ = cαβ .
Obviously this is a cocycle, and it is easy to see that its cohomology class is independent
of the initial selection of local solutions fα . Moreover, if ω is exact, this cocycle is a
coboundary. In other words we have a natural map
H 1 (X) → H 1 N(U), RX .
Definition 7.5.24. Consider two open covers U = (Uα )α∈A , and V = (Vβ )β∈B of the same
topological space X. We say V is finer than U, and we write this U ≺ V, if there exists a
map ̺ : B → A, such that
Vβ ⊂ U̺(β) ∀β ∈ B.
The map ̺ is said to be a refinement map. ⊔
⊓
Proposition 7.5.25. Consider two open covers U = (Uα )α∈A and V = (Vβ )β∈B of the
same topological space X such that U ≺ V. Fix a sheaf of Abelian groups S. Then the
following are true.
(a) Any refinement map ̺ induces a cochain morphism
M M
̺∗ : C q (U, S) → C q (V, S).
q q
322 CHAPTER 7. COHOMOLOGY
(c) If U ≺ V ≺ W, then
ıW W V
U = ıV ◦ ıU .
where, by definition, ̺(σ) ∈ Aq is the ordered q-simplex (̺(β0 ), . . . , ̺(βq )). The fact that
̺∗ is a cochain morphism follows immediately from the obvious equality
̺ ◦ ∂qj = ∂qj ◦ ̺.
~ q−1 (V) → N
(b) We define hj : N ~ q (U) by
The reader should check that hj (σ) is indeed an ordered simplex of U for any ordered
simplex σ of V. Note that Vσ ⊂ Uhj (σ) ∀j. Now define
by
q−1
X
hχq (c), σi := (−1)j hc, hj (σ)i |Vσ ∀c ∈ C q (U) ∀σ ∈ V(q−1) .
j=0
so that,
hj (σ) = (λ0 , . . . , λj , µj , . . . , µq ).
Then X
hχ ◦ δ(c), σi = (−1)j hδc, hj (σ)i |Vσ
q
X q+1
X
= (−1) ( (−1)k hc, ∂q+1
j k
hj (σ)i |Vσ )
j=0 k=0
q
X j
X
= (−1)j (−1)k hc, (λ0 , . . . , λ̂j , . . . , λj , µj , . . . , µq )i |Vσ
j=0 k=0
7.5. ČECH COHOMOLOGY 323
q
X
+ (−1)ℓ+1 hc, (λ0 , . . . , λj , µj , . . . , µ̂k , . . . , µq )i |Vσ
ℓ=j
q
X j−1
X
= (−1)j (−1)k hc, (λ0 , . . . , λ̂j , . . . , λj , µj , . . . , µq )i |Vσ
j=0 k=0
q
X
+ (−1)ℓ+1 hc, (λ0 , . . . , λj , µj , . . . , µ̂k , . . . , µq )i |Vσ
ℓ=j+1
q
X
+ (−1)j hc, (λ0 , . . . , λj−1 , µj , . . . , µq )i |Vσ +hc, (λ0 , . . . , λj , µj+1 , . . . , µq )i |Vσ .
j=0
The last term is a telescopic sum which is equal to
hc, (λ0 , . . . , λq i |Vσ −hc, (µ0 , . . . , µq )i |Vσ = h̺∗ c, σi − hr∗ c, σi.
If we change the order of summation in the first two term we recover the term h−δχc, σi.
Part (c) is left to the reader as an exercise. ⊔
⊓
The group H • (X, S) is called the Čech cohomology of the space X with coefficients in the
pre-sheaf S.
Let us briefly recall the definition of the direct limit. One defines an equivalence
relation on the disjoint union a
H • (U, S),
U
by
H • (U) ∋ f ∼ g ∈ H • (V) ⇐⇒ ∃W ≻ U, V : ıW W
U f = ıV g.
We denote the equivalence class of f by f . Then
!
a
• •
lim H (U) = H (U, S) / ∼ .
U
U
Example 7.5.26. Let S be a sheaf over the space X. For any open cover U = (Uα ),
a 0-cycle subordinated to U is a collection of sections fα ∈ S(Uα ) such that, every time
Uα ∩ Uβ 6= ∅, we have
fα |Uαβ = fβ |Uαβ .
According to the properties of a sheaf, such a collection defines a unique global section
f ∈ S(X). Hence, H 0 (X, S) ∼
= S(X). ⊔
⊓
h∗ : C q (U, S0 ) → C q (U, S1 ),
by
hh∗ c, σi = hU (hc, σi) ∀c ∈ C q (U, S0 ) σ ∈ U( q).
The reader can check easily that h∗ is a cochain map so it induces a map in cohomology
• •
hU
∗ : H (U, S0 ) → H (U, S1 )
which commutes with the refinements ıVU . The proposition follows by passing to direct
limits. ⊔
⊓
Sketch of proof. For each open set U ⊂ X define S(U ) := S0 (U )/S−1 (U ). Then the
correspondence U 7→ S(U ) defines a pre-sheaf on X. Its associated sheaf is isomorphic
with S1 (see Exercise 7.5.19). Thus, for each open cover U we have a short exact sequence
π
0 → C q (U, S−1 ) → C q (U, S0 ) → C q (U, S) → 0.
To conclude the proof of the proposition we invoke the following technical result. Its proof
can be found in [119].
7.5. ČECH COHOMOLOGY 325
H • (X, S) ∼
= H • (X, S′ ). ⊔
⊓
Definition 7.5.30. A sheaf S is said to be fine if, for any locally finite open cover U =
(Uα )α∈A there exist morphisms hα : S → S with the following properties.
(a) For any α ∈ A there exists a closed set Cα ⊂ Uα , and hα (Sx ) = 0 for x 6∈ Cα , where
Sx denotes the stalk of S at x ∈ X.
P
(b) α hα = 1S . This sum is well defined since the cover U is locally finite. ⊔
⊓
Example 7.5.31. Let X be a smooth manifold. Using partitions of unity we deduce that
the sheaf ΩkX of smooth k-forms is fine. More generally, if E is a smooth vector bundle
over X then the space ΩkE of E-valued k-forms is fine. ⊔
⊓
Proposition 7.5.32. Let S be a fine sheaf over a paracompact space X. Then H q (X, S) ∼
=
0 for q ≥ 1.
Proof. Because X is paracompact, any open cover admits a locally finite refinement. Thus,
it suffices to show that, for each locally finite open cover U = (Uα )α∈A , the cohomology
groups H q (U, S) are trivial for q ≥ 1. We will achieve this by showing that the identity
map C q (U, S) → C q (U, S) is cochain homotopic with the trivial map. We thus need to
produce a map
χq : C q (U, S) → C q−1 (U, S),
such that
χq+1 δq + δq−1 χq = 1. (7.5.1)
Consider the morphisms hα : S → S associated to the cover U postulated by the definition
of a fine sheaf. For every α ∈ A, σ ∈ N ~ q (U), and every f ∈ C q (U, S), we construct
htα (f ), σi ∈ S(Uσ ) as follows. Consider the open cover of Uσ
V = Uα ∩ Uσ , W := Uσ \ Cα , (supp hα ⊂ Cα ).
Note that hα f |V ∩W = 0 and, according to the axioms of a sheaf, the section hα (f |V ) can
be extended by zero to a section htα (f ), σi ∈ S(Uσ ). Now, for every f ∈ C q (U, S), define
χq f ∈ C q−1 (U, S) by X
hχq (f ), σi := htα (f ), σi.
α
The above sum is well defined since the cover U is locally finite. We let the reader check
that χq satisfies (7.5.1). ⊔
⊓
d0 d1
0 ֒→ S → S0 → S1 → · · ·
H q (X, S) ∼
= H q (Sq (X)).
Proof. The first statement in the theorem can be safely left to the reader. For q ≥ 1,
denote by Zq the kernel of the sheaf morphism dq . We set for uniformity Z0 := S. We get
a short exact sequence of sheaves
0 → Zq → Sq → Zq+1 → 0 q ≥ 0. (7.5.2)
We use the associated long exact sequence in which H k (X, Sq ) = 0, for k ≥ 1, since Sq is
a fine sheaf. This yields the isomorphisms
H m (X, Z0 ) ∼
= H 1 (X, Zm−1 ) m ≥ 1. (7.5.3)
Using again the long sequence associated to (7.5.2), we get an exact sequence
dm−1
H 0 (X, Zm−1 ) ∗→ H 0 (X, Zm ) → H 1 (X, Zm−1 ) → 0.
H • (M, RM ) ∼
= H • (M ).
H 1 (M ) ∼
= H 1 (M, RM ) and H 2 (M ) ∼
= H 2 (M, RM ). ⊔
⊓
7.5. ČECH COHOMOLOGY 327
Remark 7.5.37. The above corollary has a surprising implication. Since the Čech co-
homology is obviously a topological invariant, so must be the DeRham cohomology which
is defined in terms of a smooth structure. Hence if two smooth manifolds are homeo-
morphic they must have isomorphic DeRham groups, even if the manifolds may not be
diffeomorphic.
Such exotic situations do exist. In a celebrated paper [93], John Milnor has constructed
a family of nondiffeomorphic manifolds all homeomorphic to the sphere S 7 . More recently,
the work of Simon Donaldson in gauge theory was used by Michael Freedman to construct
a smooth manifold homeomorphic to R4 but not diffeomorphic with R4 equipped with the
natural smooth structure. (This is possible only for 4-dimensional vector spaces!) These
three mathematicians, J. Milnor, S. Donaldson and M. Freedman were awarded Fields
medals for their contributions. ⊔
⊓
Theorem 7.5.38. (Leray) Let M be a smooth manifold and U = (Uα )α∈A a good cover
of M , i.e.,
Uσ ∼
= Rdim M .
Then
H • (U, RM ) ∼
= H • (M ).
Proof. Let Zk denote the sheaf of closed k-forms on M . Using the Poincaré lemma we
deduce
Zk (Uσ ) = dΩk−1 (Uσ ).
We thus have a short exact sequence
d
0 → C q (U, Zk ) → C q (U, Ωk−1 ) →
∗
C q (U, Zk ) → 0.
Using the associated long exact sequence and the fact that Ωk−1 is a fine sheaf, we deduce
as in the proof of the abstract DeRham theorem that
H q (U, RM ) ∼
= H q−1 (U, Z1 ) ∼
= ···
∼
= H 1 (U, Zq−1 ) ∼
= H 0 (U, Zq )/d∗ H 0 (U, Ωq−1 ) ∼
= Zq (M )/dΩq−1 (M ) ∼
= H • (M ). ⊔
⊓
Remark 7.5.39. The above result is a special case of a theorem of Leray: if S is a sheaf
on a paracompact space X, and U is an open cover such that
H q (UA , S) = 0, ∀q ≥ 1 A ∈ N(U),
H q (Rn , G) = 0 q ≥ 1.
H • (M, GM ) = H • (U, GM ). ⊔
⊓
328 CHAPTER 7. COHOMOLOGY
We will say that |K| is the geometric realization of the simplicial scheme K.
A result of Borsuk-Weil (see [19]) states that if U = (Uα )α∈A is a good cover of a com-
pact manifold M , then the geometric realization of the nerve N(U) is homotopy equivalent
to M . Thus, the nerve of a good cover contains all the homotopy information about the
manifold. Since the DeRham cohomology is homotopy invariant, Leray’s theorem comes
as no suprise. What is remarkable is the explicit way in which one can extract the coho-
mological information from the combinatorics of the nerve.
While at this point, we should remark that any compact manifold admits finite good
covers. This shows that the homotopy type is determined by a finite, albeit very large,
set of data. ⊔
⊓
Example 7.5.41. Let M be a smooth manifold and U = (Uα )α∈A a good cover of M . A
1-cocycle of RM is a collection of real numbers fαβ - one for each pair (α, β) ∈ A2 such
that Uαβ 6= ∅ satisfying
fαβ + fβγ + fγα = 0,
whenever Uαβγ 6= ∅. The collection is a coboundary if there exist the constants fα such
that fαβ = fβ − fα . This is precisely the situation encountered in Subsection 7.1.2. The
abstract DeRham theorem explains why the Čech approach is equivalent with the DeRham
approach. ⊔
⊓
7.5. ČECH COHOMOLOGY 329
One can then define a “skew-symmetric” Čech cohomology following the same strategy.
The resulting cohomology coincides with the cohomology described in this subsection. For
a proof of this fact we refer to [119]. ⊔
⊓
is exact. ⊔
⊓
Characteristic Classes
gαβ : Uαβ → G.
The Lie group G operates on its Lie algebra g via the adjoint action
where
Ad(g)X := gXg−1 , ∀X ∈ g, g ∈ G.
We denote by Ad(P ) the vector bundle with standard fiber g associated to P via the
adjoint representation. In other words, Ad(P ) is the vector bundle defined by the open
331
332 CHAPTER 8. CHARACTERISTIC CLASSES
defined by
[ω k ⊗ X, η ℓ ⊗ Y ] := (ω k ∧ η ℓ ) ⊗ [X, Y ], (8.1.1)
for all ω k ∈ Ωk (M ), η ℓ ∈ Ωℓ (M ), and X, Y ∈ Ω0 Ad(P ) .
Exercise 8.1.1. Prove that for any ω, η, φ ∈ Ω∗ (Ad(P )) the following hold.
Aα ∈ Ω1 (Uα ) ⊗ g,
We will denote by A(U, g•• ) the set of connections defined by the open cover U and the
gluing cocycle g•• : U•• → G.
(b) The curvature of a connection A ∈ A(U, g•• ) is defined as the collection Fα ∈ Ω2 (Uα )⊗g
where
1
Fα = dAα + [Aα , Aα ]. ⊔
⊓
2
Remark 8.1.3. Given an open cover (Uα ) of M , then two gluing cocycles
define isomorphic principal bundles if and only if there exists smooth maps
Tα : Uα → G,
such that
hβα (x) = Tβ (x)gβα (x)Tα (x)−1 , , ∀α, β, ∀x ∈ Uαβ .
If this happens, we say that the cocycles g•• and h•• are cohomologous.
8.1. CHERN–WEIL THEORY 333
Suppose (gαβ ), (hαβ ) are two such cohomologous cocycles. Then the maps Tα induce
a well defined correspondence
given by
T
A(U, g•• ) ∋ (Aα ) 7−→ Bα := −(dTα )Tα−1 + Tα Aα Tα−1 ∈ A(U, h•• ).
Vi ⊂ Uϕ(i) , ∀i ∈ I.
ϕ
We obtain a new gluing cocycle gij : Vij → G given by
ϕ
gij (x) = gϕ(i)ϕ(j) (x), ∀i, j ∈ I, x ∈ Vij .
This new cocycle defines a principal bundle isomorphic to the principal bundle defined by
the cocycle (gβα ). If (Aα ) is a connection defined by the open cover (Uα ) and the cocycle
(gβα ), then it induces a connection
Aϕ
i = Aϕ(i) |Vi
defined by the open cover (Vi ), and the cocycle gij . We say that the connection Aϕ is the
V-refinement of the connection A. The correspondence
ϕ
TVU : A(U, g•• ) → A(V, g•• ), A 7→ Aϕ
is also a bijection.
Finally, given two pairs (open cover, gluing cocycle), (U, g•• ) and (V, h•• ), both de-
scribing the principal bundle P → M , there exists an open cover W, finer that both U
and W, and cohomologous gluing cocycles ḡ•• , h̄•• : W•• → G refining g•• and respectively
h•• , such that the induced maps
are bijections.
For simplicity, we will denote by A(P ), any of these isomorphic spaces A(U, g•• ).
☞ All the constructions that we will perform in the remainder of this section are com-
patible with the above isomorphisms but, in order to keep the presentation as transparent
as possible, in our proofs we will not keep track of these isomorphisms. We are convinced
that the reader can easily supply the obvious, and repetitious missing details. ⊔
⊓
(b) For any connection A = (Aα ) ∈ A(U, g•• ), the collection (Fα ) defines a global Ad(P )-
valued 2-form. We will denote it by FA or F (A), and we will refer to it as the curvature
of the connection A.
(c) (The Bianchi identity.)
Proof. (a) If (Aα )α∈A , (Bα )α∈A ∈ A(P ), then their difference Cα = Aα − Bα satisfies the
gluing rules
−1
Cβ = gβα Cα gβα ,
so that it defines an element of Ω1 (Ad(P )). Conversely, if (Aα ) ∈ A(P ), and ω ∈
Ω1 Ad(P ) , then ω is described by a collection of g-valued 1-forms ωα ∈ Ω1 (Uα ) ⊗ g,
satisfying the gluing rules
−1
ωβ |Uαβ = gβα ωα |Uαβ gβα .
The collection A′α := Aα + ωα is then a connection on P . This proves that if A(P ) is
nonempty, then it is an affine space modelled by Ω1 ( Ad(P ) ).
To prove that A(P ) 6= ∅ we consider a partition of unity subordinated to the cover
Uα . More precisely, we consider a family of nonnegative smooth functions uα : M → R,
α ∈ A, such that supp uα ⊂ Uα , ∀α, and
X
uα = 1.
α
−1 we deduce g −1 dg −1
Since gγα = gαγ γα γα = −(dgαγ )gαγ , so that
X
−1
Bα = − uγ (dgαγ )gαγ .
γ
so that
−1
(dgβγ )gβγ = (dgβγ )gγβ = (dgβα )gαβ + gβα (dgαγ )gγβ .
Hence X X
−1
− uγ (dgβγ )gβγ = −(dgβα )gαβ − uγ gβα (dgαγ )gγβ .
γ γ
8.1. CHERN–WEIL THEORY 335
−1 g −1 = g , so that
Using the cocycle condition again, we deduce gαγ βα γβ
X X
−1 −1
uγ gβα (dgαγ )gαγ gβα = uγ gβα (dgαγ )gγβ .
γ γ
Fβ = g −1 Fα g,
−1
where g = gαβ = gβα . We have
1
Fβ = dAβ + [Aβ , Aβ ]
2
1
= d(g −1 dg + g−1 Aα g) + [g−1 dg + g −1 Aα g, g−1 dg + g−1 Aα g].
2
−1
Set ̟ := g dg. Using (8.1.2) we get
1
Fβ = d̟ + [̟, ̟]
2
1
+d(g−1 Aα g) + [̟, g−1 Aα g] + [g−1 Aα g, g −1 Aα g]. (8.1.5)
2
We will check two things.
A. The Maurer-Cartan structural equations.
1
d̟ + [̟, ̟] = 0.
2
B.
d(g −1 Aα g) + [̟, g −1 Aα g] = g −1 (dAα )g.
Proof of A. Let us first introduce a new operation. Let gl(n, K) denote the associative
algebra of K-valued n × n matrices. There exists a natural operation
uniquely defined by
(ω k ⊗ A) ∧ (η ℓ ⊗ B) = (ω k ∧ η ℓ ) ⊗ (A · B), (8.1.6)
where ω k ∈ Ωk (Uα ), η ℓ ∈ Ωℓ (Uα ) and A, B ∈ gl(n, K) (see also Example 3.3.12). The
space gl(n, K) is naturally a Lie algebra with respect to the commutator of two matrices.
This structure induces a bracket
1
ω ∧ η = [ω, η] ∀ω, η ∈ Ω1 (Uα ) ⊗ gl(n, K). (8.1.7)
2
The Lie group lies inside GL(n, K), so that its Lie algebra g lies inside gl(n, K). We can
think of the map gαβ as a matrix valued map, so that we have
1 (8.1.2)
d(Fα ) = {[dAα , Aα ] − [Aα , dAα ]} = [dAα , Aα ]
2
1 (8.1.3)
=[Fα , Aα ] − [[Aα , Aα ], Aα ] = [Fα , Aα ].
2
The proposition is proved. ⊔
⊓
In other words, the collection ωα defines a global k-form ω ∈ Ωk (Ad(P )). Prove that the
collection
dωα + [Aα , ωα ],
defines a global Ad(P )-valued (k + 1)-form on M which we denote by dA ω. Thus, the
Bianchi identity can be rewritten as dA F (A) = 0, for any A ∈ A(P ). ⊔
⊓
8.1. CHERN–WEIL THEORY 337
Remark 8.1.6. Suppose P → M is a principal G-bundle given by the open cover U = (Uα )
and gluing cocycle gβα : Uαβ → G. A gauge transformation of P is by definition, a
collection of smooth maps
Tα : Uα → G
satisfying the gluing rules
G ֒→ C(P ) ։ M,
(see Definition 2.3.15) with standard fiber G, symmetry group G, where the symmetry
group G acts on itself by conjugation
C
G × G ∋ (g, h) 7−→ Cg (h) := ghg 1 ∈ G.
The set of gauge transformations forms a group with respect to the operation
We denote by G(U, g•• ) this group. One can verify that if P is described the (open cover,
gluing cocycle)-pair (V, h•• ), then the groups G(U, g•• ) and G(V), h•• ) are isomorphic. We
denote by G(P ) the isomorphism class of all these groups.
The group G(U, g•• ) acts on A(U, g•• ) according to
G(P ) × A(P ) ∋ (T, A) 7→ T AT −1 := −(dTα )Tα−1 + Tα Aα Tα−1 ∈ A(P ).
The group G(P ) also acts on the vector spaces Ω• Ad(P ) and, for any A ∈ A(P ),
T ∈ G(P ) we have
FT AT −1 = T FA T −1 .
We say that two connections A0 , A1 ∈ A(P ) are gauge equivalent if there exists T ∈ G(P )
such that A1 = T A0 T −1 . ⊔
⊓
(b) A principal G-bundle P over M such that E is associated to P via ρ. In other words,
there exists an open cover (Uα ) of M , and a gluing cocycle gαβ : Uαβ → G, such
that the vector bundle E can be defined by the cocycle
Example 8.1.8. Let E → M be a rank r real vector bundle over a smooth manifold
M . A metric on E allows us to talk about orthonormal moving frames. They are easily
produced from arbitrary ones via the Gramm-Schimdt orthonormalization technique. In
particular, two different orthonormal local trivializations are related by a transition map
valued in the orthogonal group O(r), so that a metric on a bundle allows one to replace an
arbitrary collection of gluing data by an equivalent (cohomologous) one with transitions in
O(r). In other words, a metric on a bundle induces an O(r) structure. The representation
ρ is in this case the natural injection O(r) ֒→ GL(r, R).
Conversely, an O(r) structure on a rank r real vector bundle is tantamount to choosing
a metric on that bundle.
Similarly, a Hermitian metric on a rank k complex vector bundle defines an U (k)-
structure on that bundle. ⊔
⊓
If dim V = n then, fixing a basis of V , we can identify S k (V ∗ ) with the space of degree k
homogeneous polynomials in n variables.
Assume now that A is a K-algebra with 1. Starting with ϕ ∈ S k (V ∗ ) we can produce
a K-multilinear map
ϕ = ϕA : (A ⊗ V ) × · · · × (A ⊗ V ) → A,
uniquely determined by
P (a1 X1 , a2 X2 , · · · ) = −P (a2 X2 , a1 X1 , · · · ).
Example 8.1.10. Let V = gl(n, C). For each matrix T ∈ V we denote by ck (T ) the
coefficient of λk in the characteristic polynomial
X
λ √
cλ (T ) := det 1 − T = ck (T )λk , (i = −1).
2πi
k≥0
Consider now a matrix Lie group G. The adjoint action of G on its Lie algebra g
induces an action on S k (g∗ ) still denoted by Ad. We denote by I k (G) the Ad-invariant
elements of S k (g∗ ). It consists of those ϕ ∈ S k (g∗ ) such that
The elements of I • (G) are usually called invariant polynomials. The space I •• (G) can
be identified (as vector space) with the space of Ad-invariant formal power series with
variables from g∗ .
Example 8.1.11. Let G = GL(n, C), so that g = gl(n, C). The map
gl(n, C) ∋ X 7→ tr exp(X),
defines an element of I •• (GL(n, C)). To see this, we use the “Taylor expansion”
X 1
exp(X) = Xk,
k!
k≥0
which yields
X 1
tr exp(X) = tr X k .
k!
k≥0
tr (gXg −1 )k = tr gX k g−1 = tr X k . ⊔
⊓
d
|t=0 ϕ(etX X1 e−tX , . . . , etX Xk e−tX ) = 0. ⊔
⊓
dt
Fi = ωi ⊗ Xi ∈ Ωdi (U ) ⊗ g, A = ω ⊗ X ∈ Ωd (U ) ⊗ g
then
P (F1 , . . . , Fi−1 , [A, Fi ], Fi+1 . . . , Fk )
= (−1)d(d1 +···di−1 ) ωω1 · · · ωk P (X1 , · · · , [X, Xi ], · · · Xk ).
8.1. CHERN–WEIL THEORY 341
gαβ : Uαβ → G.
Pick A ∈ A(P ) defined by the collection Aα ∈ Ω1 (Uα ) ⊗ g. Its curvature is then defined
by the collection
1
Fα = dAα + [Aα , Aα ].
2
Given φ ∈ I k (G), we can define as in the previous section (with A = Ωeven (Uα ), V = g)
so that the locally defined forms Pφ (Fα ) patch-up to a global 2k-form on M which we
denote by φ(FA ).
Theorem 8.1.13 (Chern–Weil). (a) The form φ(FA ) is closed, ∀A ∈ A(P ).
(b) If A0 , A1 ∈ A(P ), then the forms φ(FA0 ), and φ(FA1 ) are cohomologous. In other
words, the closed form φ(FA ) defines a cohomology class in H 2k (M ) which is independent
of the connection A ∈ A(P ).
Proof. We use the Bianchi identity dFα = −[Aα , Fα ]. The Leibniz’ rule yields
Aiα ∈ Ω1 (Uα ) ⊗ g.
342 CHAPTER 8. CHARACTERISTIC CLASSES
Set Cα := A1α − A0α . For 0 ≤ t ≤ 1 we define Atα ∈ Ω1 (Uα ) ⊗ g by Atα := A0α + tCα .
The collection (Atα ) defines a connection At ∈ A(P ), and t 7→ At ∈ A(P ) is an (affine)
path connecting A0 to A1 . Note that
C = (Cα ) = Ȧt .
t2
Fαt = Fα0 + t(dCα + [A0α , Cα ]) + [Cα , Cα ]. (8.1.12)
2
Hence,
Ḟαt = dCα + [A0α , Cα ] + t[Cα , Cα ] = dCα + [Atα , Cα ].
Consequently,
Z 1n o
φ(Fα1 ) − φ(Fα0 ) = φ(Ḟαt , Fαt , . . . , Fαt ) + · · · + φ(Fαt , . . . , Fαt , Ḟαt ) dt
0
Z 1
= φ(dCα , Fαt , . . . , Fαt ) + · · · + φ(Fαt , . . . , Fαt , dCα ) dt
0
Z 1
+ φ([Atα , Cα ], Fαt , . . . , Fαt ) + · · · + φ(Fαt , . . . , Fαt , [Atα , Cα ]) dt.
0
We claim that
Hence Z 1
φ(Fα1 ) − φ(Fα0 ) =d kφ(Ȧtα , Fαt , . . . , Fαt )dt. (8.1.13)
0
We set Z 1
Tφ (A1α , A0α ) := kφ(Ȧtα , Fαt , . . . , Fαt )dt.
0
−1 −1
Since Cβ = gβα Cα gβα , and Fβ = gβα Fα gβα
on Uαβ , we conclude from the Ad-invariance of
1 0
φ that the collection Tφ (Aα , Aα ) defines a global (2k − 1)-form on M which we denote by
Tφ (A1 , A0 ), and we name it the φ-transgression from A0 to A1 . We have thus established
the transgression formula
Remark 8.1.14. Observe that for every Ad-invariant polynomial φ ∈ I k (g), any principal
G-bundle P , any connection A ∈ A(P ), and any gauge transformation T ∈ G(P ) we have
φ FA = φ FT AT −1 .
Example 8.1.15. Consider a matrix Lie group G with Lie algebra g, and denote by P0
the trivial principal G-bundle over G, P0 = G × G. Denote by ̟ the tautological 1-form
̟ = g−1 dg ∈ Ω1 (G) ⊗ g. Note that for every left invariant vector field X ∈ g we have
̟(X) = X.
is closed.
A simple computation using the Maurer-Cartan equations shows that
Z 1
k
τφ = k−1 (t2 − 1)k−1 dt · φ(̟, [̟, ̟], · · · , [̟, ̟])
2 0
k 22k−1 k!(k − 1)!
= (−1)k−1 φ(̟, [̟, ̟], · · · , [̟, ̟])
2k−1 (2k)!
2k
= (−1)k−1 2k · φ(̟, [̟, ̟], · · · , [̟, ̟]).
k
1
The connections d and d + ̟ are in fact gauge equivalent.
344 CHAPTER 8. CHARACTERISTIC CLASSES
We thus have a natural map τ : I • (G) → H odd (G) called transgression. The elements
in the range of τ are called transgressive. When G is compact and connected, then a
nontrivial result due to the combined efforts of H. Hopf, C. Chevalley, H. Cartan, A.
Weil and L. Koszul states that the cohomology of G is generated as an R-algebra by the
transgressive elements. We refer to [26] for a beautiful survey of this subject. ⊔
⊓
Exercise 8.1.16. Let G = SU (2). The Killing form κ is a degree 2 Ad-invariant polyno-
mial on su(2). Describe τκ ∈ Ω3 (G) and then compute
Z
τκ .
G
Aα ∈ Ω1 (Uα ) ⊗ g,
(c) φ ∈ I k (G).
Output: A closed form φ(F (A)) ∈ Ω2k (M ), whose cohomology class is independent of
the connection A. We denote this cohomology class by φ(P ).
Thus, the principal bundle P defines a map, called the Chern–Weil correspondence
cw P : I • (G) → H • (M ) φ 7→ φ(P ).
''w H (N )
cwP
I • (G) •
cwF • (P )
)' u F • ⊔
⊓
H • (M )
PG ∋ P 7→ c(P ) ∈ F(BP ),
(b) F(F )(c(P )) = c(F ∗ (P )), for any smooth map F : M → N , and any principal G-
bundle P → N . ⊔
⊓
Remark 8.1.20. (a) We see that each characteristic class provides a way of measuring
the nontriviality of a principal G-bundle.
(b) A very legitimate question arises. Do there exist characteristic classes (in the DeRham
cohomology) not obtainable via the Chern–Weil construction?
The answer is negative, but the proof requires an elaborate topological technology
which is beyond the reach of this course. The interested reader can find the details in the
monograph [98] which is the ultimate reference on the subject of characteristic classes.
(b) There exist characteristic classes valued in contravariant functors other then the
DeRham cohomology. E.g., for each Abelian group A, the Čech cohomology with coeffi-
cients in the constant sheaf A defines a contravariant functor H • (−, A), and using topo-
logical techniques, one can produce H • (−, A)-valued characteristic classes. For details we
refer to [98], or the classical [124]. ⊔
⊓
I • (T n ) = S • ((tn )∗ ).
In practice one uses a more explicit description obtained as follows. Pick angular coordi-
nates 0 ≤ θ i ≤ 2π, 1 ≤ i ≤ n, and set
1
xj := − dθ j .
2πi
The x′j s form a basis of (tn )∗ , and now we can identify
I • (T n ) ∼
= R[x1 , . . . , xn ].
Thus, we can identify such a bundle with the tautological principal U (r)-bundle of uni-
tary frames. A connection on this U (r)-bundle is then equivalent with a linear connection
∇ on E compatible with a Hermitian metric h•, •i, i.e.,
∀λ ∈ C, u, v ∈ C ∞ (E), X ∈ Vect (M ).
The characteristic classes of E are by definition the characteristic classes of the tauto-
logical principal U (r)-bundle. To describe these characteristic classes we need to elucidate
the structure of the ring of invariants I • (U (r)).
The ring I • (U (r)) consists of symmetric, r-linear maps
φ : u(r) × · · · × u(r) → R,
The Lie algebra u(r) consists of r × r complex skew-hermitian matrices. Classical results
of linear algebra show that, for any X ∈ u(r), there exists T ∈ U (r), such that T XT −1 is
diagonal
T XT −1 = i diag(λ1 , . . . , λr ).
The set of diagonal matrices in u(r) is called the Cartan algebra of u(r), and we will denote
it by Cu(r) . It is a (maximal) Abelian Lie subalgebra of u(r). Consider the stabilizer
SU (r) := T ∈ U (r) ; T XT −1 = X, ∀X ∈ Cu(r) , ,
The stabilizer SU (r) is a normal subgroup of NU (r) , so we can form the quotient
called the Weyl group of U (r). As in Subsection 7.4.4, we see that the Weyl group is
isomorphic with the symmetric group Sr because two diagonal skew-Hermitian matrices are
unitarily equivalent if and only if they have the same eigenvalues, including multiplicities.
We see that Pφ is Ad-invariant if and only if its restriction to the Cartan algebra is
invariant under the action of the Weyl group.
The Cartan algebra is the Lie algebra of the (maximal) torus T n consisting of diagonal
unitary matrices. As in the previous subsection we introduce the variables
1
xj := − dθj .
2πi
The restriction of Pφ to Cu(r) is a polynomial in the variables x1 , . . . , xr . The Weyl
group Sr permutes these variables, so that Pφ is Ad-invariant if and only if Pφ (x1 , . . . , xr )
is a symmetric polynomial in its variables. According to the fundamental theorem of
symmetric polynomials, the ring of these polynomials is generated (as an R-algebra) by
the elementary ones P
c1 = x
P j j
c2 = i<j xi xj
.. .. ..
. . .
cr = x1 · · · xr
Thus
I • (U (r)) = R[c1 , c2 , . . . , cr ].
In terms of matrices X ∈ u(r) we have
X
t
ck (X)t = det 1 −
k
X ∈ I • (U (r))[t].
2πi
k
The above polynomial is known as the universal rank r Chern polynomial, and its coeffi-
cients are called the universal, rank r Chern classes.
348 CHAPTER 8. CHARACTERISTIC CLASSES
ck (E) := ck (F (∇)) ∈ H 2k (M ),
In other words, the collection nαβγ defines a Čech 2-cocycle of the constant sheaf Z.
On a more formal level, we can capture the above cocycle starting from the exact
sequence of sheaves
exp(2πi·)
0 → Z ֒→ C ∞ (·, R) −→ C ∞ (·, S 1 ) → 0.
The middle sheaf is a fine sheaf so its cohomology vanishes in positive dimensions. The
long exact sequence in cohomology then gives
δ
0 → Pic∞ (M ) → H 2 (M, Z) → 0.
The class δ(L), L ∈ Pic∞ (M ) is called the topological first Chern class and is denoted
by ctop
1 (L). This terminology is motivated by the following result of Kodaira and Spencer,
[81]:
The image of ctop
1 (L) in the DeRham cohomology via the natural morphism
H ∗ (M, Z) → H ∗ (M, R) ∼ ∗
= HDR (M )
coincides with the first Chern class obtained via the Chern–Weil procedure.
The Chern–Weil construction misses precisely the torsion elements in H 2 (M, Z). For
example, if a line bundle admits a flat connection then its first Chern class is trivial. This
may not be the case with the topological one, because line bundle may not be topologically
trivial. ⊔
⊓
Hence, exactly as in the complex case, we can naturally identify the rank r-real vector
bundles equipped with metric with principal O(r)-bundles. A connection on the principal
bundle can be viewed as a metric compatible connection in the associated vector bundle.
To describe the various characteristic classes we need to understand the ring of invariants
I • (O(r)).
As usual, we will identify the elements of I k (O(r)) with the degree k, Ad-invariant
polynomials on the Lie algebra o(r) consisting of skew-symmetric r × r real matrices. Fix
P ∈ I k (O(r)). Set m = [r/2], and denote by J the 2 × 2 matrix
0 −1
J := .
1 0
The Cartan algebra Co(r) is the Lie algebra of the (maximal) torus
m Rθ1 ⊕ · · · ⊕ Rθm ∈ O(r) , r = 2m
T = ,
Rθ1 ⊕ · · · ⊕ Rθm ⊕ 1R ∈ O(r) , r = 2m + 1
The above polynomial is called the universal rank r Pontryagin polynomial, while its
coefficients pj (X) are called the universal rank r Pontryagin classes.
The Pontryagin classes p1 (E), . . . , pm (E) of our real vector bundle E are then defined
by the equality
X
t
pt (E) = pj (E)t = det 1 −
2j
F (∇) ∈ H • (M )[t],
2π
j
where ∇ denotes a connection compatible with some (real) metric on E, while F (∇)
denotes its curvature. Note that pj (E) ∈ H 4j (M ). The polynomial pt (E) ∈ H • (M )[t] is
called the Pontryagin polynomial of E.
8.2. IMPORTANT EXAMPLES 351
The situation is different when r is even, r = 2m. To describe the ring of invariants
I • (SO(2m)), we need to study in greater detail the Cartan algebra
Co(2m) = λ1 J ⊕ · · · ⊕ λm J ∈ o(2m)
and the corresponding Weyl group action. The Weyl group WSO(2m) , defined as usual as
the quotient
WSO(2m) = NSO(2m) /SSO(2m) ,
is isomorphic to the subgroup of GL(Co(2m) ) generated by the involutions
σij : (λ1 , . . . , λi , . . . , λj , . . . , λm ) 7→ (λ1 , . . . , λj , . . . , λi , . . . , λm ),
and
ε : (λ1 , . . . , λm ) 7→ (ε1 λ1 , . . . , εm λm ),
where ε1 , . . . , εm = ±1, and ε1 · · · εm = 1. (Check this!)
Set as usual xi := −λi /2π. The Pontryagin O(2m)-invariants
X
pj (x1 , . . . , xm ) = (xi1 · · · xij )2 ,
1≤i1 <···<ij ≤m
In terms of
X = λ1 J ⊕ · · · ⊕ λm J ∈ CSO(2m) ,
we can write
−1 m
∆(X) = P f (X),
2π
where P f denotes the pfaffian of the skewsymmetric matrix X viewed as a linear map
R2m → R2m , when R2m is endowed with the canonical orientation. Note that pm = ∆2 .
352 CHAPTER 8. CHARACTERISTIC CLASSES
Proposition 8.2.7.
I • (SO(2m)) ∼
= R[Z1 , Z2 , . . . , Zm ; Y ]/(Y 2 − Zm ) (Zj = pj , Y = ∆)
Proof. We follow the approach used by H. Weyl in describing the invariants of the alternate
group ([133], Sec. II.2). The isomorphism will be established in two steps.
Step 1. The space I • (SO(2m)) is generated as an R-algebra by the polynomials
p1 , · · · , pm , ∆.
Step 2. The kernel of the morphism
ψ
R[Z1 , . . . , Zm , ; Y ] → I • (SO(2m))
G = WO(2m) /WSO(2m) ∼
= Z2 .
and moreover,
I • (O(2m)) = ker(1 − e).
For each F ∈ I • (SO(2m)) we define F + := (1 + e)F , and we observe that F + ∈ ker(1 − e).
Hence
F + = P (p1 , . . . , pm ).
On the other hand, the polynomial F − := (1 −e)F is separately odd in each of its variables.
Indeed,
F − (−x1 , x2 , . . . , xm ) = eF − (x1 , . . . , xm )
= e(1 − e)F (x1 , . . . , xm ) = −(1 − e)F (x1 , . . . , xm ) = −F − (x1 , . . . , xm ).
Hence, F − vanishes when any of its variables vanishes so that F − is divisible by their
product ∆ = x1 · · · xm ,
F − = ∆ · G.
Since eF − = −F − , and e∆ = −∆ we deduce eG = G, i.e., G ∈ I • (O(2m)). Consequently,
G can be written as
G = Q(p1 , . . . , pm ),
so that
F − = ∆ · Q(p1 , . . . , pm ).
Step 1 follows from
1 + 1
F = (F + F − ) = (P (p1 , . . . , pm ) + ∆ · Q(p1 , . . . , pm )).
2 2
8.2. IMPORTANT EXAMPLES 353
we deduce
(Y 2 − Zm ) ⊂ ker ψ,
A(p1 , . . . , pm )∆ + B(p1 , . . . , pm ) = 0.
−A(p1 , . . . , pm )∆ + B(p1 , . . . , pm ) = 0.
Let E be a rank 2m, real, oriented vector bundle over the smooth manifold M . As in
the previous subsection we deduce that we can use a metric to naturally identify E with a
principal SO(2m)-bundle and in fact, this principal bundle is independent of the metric.
Finally, choose a connection ∇ compatible with some metric on E.
1
e = e(X) = P f (−X) ∈ I m (SO(2m)).
(2π)m
(b) The Euler class of E, is the cohomology class e(E) ∈ H 2m (M ) represented by the
Euler form
1
e(∇) = P f − F (∇) ∈ Ω2m (M ).
(2π)m
(According to the Chern–Weil theorem this cohomology class is independent of the metric
and the connection.) ⊔
⊓
Example 8.2.9. Let (Σ, g) be a compact, oriented, Riemann surface and denote by ∇g
the Levi–Civita connection. The the Euler form
1
ε(g) = s(g)dvg
4π
coincides with the Euler form e(∇g ) obtained via the Chern–Weil construction. ⊔
⊓
354 CHAPTER 8. CHARACTERISTIC CLASSES
Remark 8.2.10. Let E be a rank 2m, real, oriented vector bundle over the smooth,
compact, oriented manifold M . We now have two apparently conflicting notions of Euler
classes.
A topological Euler class etop (E) ∈ H 2m (M ) defined as the pullback of the Thom class
via an arbitrary section of E.
A geometric Euler class egeom (E) ∈ H 2m (M ) defined via the Chern–Weil construction.
The most general version of the Gauss–Bonnet theorem, which will be established later
in this chapter, will show that these two notions coincide! ⊔
⊓
(F · G)∞ = (F )∞ · (G)∞ . ⊔
⊓
Similarly, given Gn ∈ R[[x]], (n ≥ 1) such that Fn (0) = 0, we can from the sequence
of sums
(0, G1 (X1 ), G1 (X1 ) + G2 (X2 ), . . . , G1 (X1 ) + · · · ⊕ Gn (Xn ), . . .),
which defines an element in R[[X1 , X2 , . . .]] denoted by G1 (X1 ) + G2 (X2 ) + · · · . When
G1 = G2 = · · · = Fn = · · · = F we denote the corresponding element (G)∞ . ⊔
⊓
8.2. IMPORTANT EXAMPLES 355
In dealing with characteristic classes of vector bundles one naturally encounters the
increasing sequences
U (1) ֒→ U (2) ֒→ · · · (8.2.2)
(in the complex case) and (in the real case)
We know that I •• (U (n)) = Sn [[xj ]]= the ring of symmetric formal power series in n
variables with coefficients in R. Set
S∞ [[xj ]] := lim Sn .
←
Given a rank n vector bundle E over a smooth manifold M , and φ ∈ I •• (U (n)), the
characteristic class φ(E) is well defined since udim M +1 = 0, for any u ∈ Ω∗ (M ). Thus
we can work with the ring I •• (U (n)) rather than I • (U (n)) as we have done so far. An
element
φ = (φ1 , φ2 , . . .) ∈ I •• (U (∞)) := lim I •• (U (n)) = S∞ [[xj ]]
←
We can present the above arguments in a more concise form as follows. Consider the
function F (x) = (1 + tx) ∈ R[[x]], where R = R[[t]] is the ring of formal power series in
the variable t. Then (F )∞ defines an element in R[[X1 , X2 , . . .]]. One sees immediately
that in fact, (F )∞ ∈ S∞ [[xj ]] [[t]] and moreover
We see that Y
ct := (1 + txj )
j≥1
We can perform a similar operation with any F ∈ R[[x]] such that F (0) = 1. We get
a semigroup morphism
X ∞
x 1 1 2 1 Bk 2k
F (x) = −x
= 1 + x + x + · · · = 1 + x + (−1)k−1 x ∈ R[[x]].
1−e 2 12 2 (2k)!
k=1
defines an element in S∞ [[xj ]] called the universal Todd class. We denote it by Td. Using
the fundamental theorem of symmetric polynomials we can write
Td = 1 + Td1 + Td2 + · · · ,
For example,
1 1 1
Td1 = c1 , Td2 = (c21 + c2 ), Td3 = c1 c2 etc.
2 12 24
Analogously, any function G ∈ R[[x]] such that G(0) = 0 defines an element
We have two examples in mind. First, consider G(x) = xk . We get the symmetric function
We have X 1 X 1
(ex − 1)∞ = (xm )∞ = sm ∈ S∞ [[xj ]]
m! m=1
m!
m≥1
The cohomology class ch(E) is called the Chern character of the bundle E. If ∇ is a
connection on E compatible with some Hermitian metric, then we can express the Chern
character of E as
∞
X 1
ch(E) = tr eF (∇) = rank (E) + tr (F (∇)∧k ),
k!
k=1
Hence
exp(F (∇1 ⊕ ∇2 )) = exp(F (∇1 )) ⊕ exp(F (∇2 )),
from which we deduce the first equality.
As for the second equality, consider the connection ∇ on E1 ⊗ E2 uniquely defined by
the product rule
is defined by
(ω k ⊗ s1 ) ⊗ (η ℓ s2 ) = (ω k ∧ η ℓ ) ⊗ (s1 ⊗ s2 ) (8.2.5)
∀si ∈ C ∞ (Ei ), ∀ω k ∈ Ωk (M ), and ∀η ℓ ∈ Ωℓ (M ).
We compute the curvature of ∇ using the equality F (∇) = (d∇ )2 . If si ∈ C ∞ (Ei ),
then
F (∇)(s1 ⊗ s2 ) = d∇ {(∇1 s1 ) ⊗ s2 + s1 ⊗ (∇2 s2 )}
= (F (∇1 )s1 ) ⊗ s2 − (∇1 s1 ) ⊗ (∇2 s2 ) + (∇1 s1 ) ⊗ (∇2 s2 ) + s1 ⊗ (F (∇2 ))
Proof. Pick an orthonormal basis (ei ) of Cn and an orthonormal basis (fj ) of Cm ) such
that, with respect to these bases A = diag(λ1 , . . . , λn ) and B = diag(µ1 , . . . , µm )). Then,
with respect to the basis (ei ⊗ fj ) of Cn ⊗ Cm , we have
Hence
exp(A ⊗ 1Cm + 1Cn ⊗ B) = diag(eλi eµj ),
so that, X
tr (A ⊗ 1Cm + 1Cn ⊗ B) = eλi eµj = tr (eA ) · tr (eB ). ⊔
⊓
and (r ≥ 0)
X
sr = xrj ∈ R[x1 , . . . , xn ].
j
Set
n
Y
f (t) := (1 − xj t).
j=1
(c) Deduce from the above formulae the following identities between universal symmetric
polynomials.
s1 = c1 , s2 = c21 − 2c2 , s3 = c31 − 3c1 c2 + 3c3 . ⊔
⊓
The real case. The sequence (8.2.3) induces a projective system
I •• (O(1)) ← I •• (O(2)) ← · · · .
We have proved that I •• (O(n)) = S⌊n/2⌋ [[x2j ]] = the ring of even, symmetric power series
in ⌊n/2⌋ variables. The inverse limit of this system is
I • • (O(∞) = lim I •• (O(r)) = S∞ [[x2j ]] := lim Sm [[x2j ]].
← ←
As in the complex case, any element of this ring is called a universal characteristic class. In
fact, for any real vector bundle E and any φ ∈ S∞ [[x2j ]] there is a well defined characteristic
class φ(E) which can be expressed exactly as in the complex case, using metric compatible
connections. If F ∈ R[[x]]♭ then (F (x2 ))∞ defines an element of S∞ [[x2j ]].
In topology, the most commonly encountered situations are the following.
A. √
x/2 X 22k−1 − 1
F (x) = √ =1+ (−1)k 2k−1 Bk xk .
sinh( x/2) 2 (2k)!
k≥1
B. Consider
√ X
x 1 1 22k
F (x) = √ = 1 + x + x2 + · · · = 1 + (−1)k−1 Bk xk .
tanh x 3 45 (2k)!
k≥1
The universal class (F (x2 ))∞ is denoted by L, and it is called the L-genus. As before, we
can write
L = 1 + L1 + L2 + · · · ,
where the Lj ’s are universal, symmetric, even, homogeneous “polynomials”. They can be
expressed in terms of the universal Pontryagin classes. The first few terms are
1 1
L1 = p1 , L2 = (7p2 − p21 ) etc.
3 45
360 CHAPTER 8. CHARACTERISTIC CLASSES
8.3.1 Reductions
In applications, the symmetries of a vector bundle are implicitly described through topo-
logical or geometric properties.
For example, if a rank r complex vector bundle E splits as a Whitney sum E = E1 ⊕E2
with rank Ei = ri , then E, which has a natural U (r)- symmetry, can be given a finer
structure of U (r1 ) × U (r2 ) vector bundle.
More generally, assume that a given rank r complex vector bundle E admits a G-
structure (P, ρ), where G is a Lie group, P is a principal G bundle, and ρ : G → U (r) is a
representation of G. Then we can perform two types of Chern–Weil constructions: using
the U (r) structure, and using the G structure. In particular, we obtain two collections
of characteristic classes associated to E. One natural question is whether there is any
relationship between them.
In terms of the Whitney splitting E = E1 ⊕ E2 above, the problem takes a more
concrete form: compute the Chern classes of E in terms of the Chern classes of E1 and
E2 . Our next definition formalizes the above situations.
Definition 8.3.1. Let ϕ : H → G be a smooth morphism of (matrix) Lie groups.
(a) If P is a principal H-bundle over the smooth manifold M defined by the open cover
(Uα ), and gluing cocycle
hαβ : Uαβ → H,
then the principal G-bundle defined by the gluing cocycle
gαβ = ϕ ◦ hαβ : Uαβ → G
is said to be the ϕ-associate of P , and it is denoted by ϕ(P ).
(b) A principal G-bundle Q over M is said to be ϕ-reducible, if there exists a principal
H-bundle P → M such that Q = ϕ(P ). ⊔
⊓
The above result should be seen as a guiding principle in proving identities between
characteristic classes, rather than a rigid result. What is important about this result is
the simple argument used to prove it.
We conclude this subsection with some simple, but very important applications of the
above principle.
Example 8.3.3. Let E and F be two complex vector bundles over the same smooth
manifold M of ranks r and respectively s. Then the Chern polynomials of E, F and
E ⊕ F are related by the identity
ct (E ⊕ F ) = ct (E) · ct (F ), (8.3.1)
where the “·” denotes the ∧-multiplication in H even (M ). Equivalently, this means
X
ck (E ⊕ F ) = ci (E) · cj (F ).
i+j=k
Remark 8.3.4. Consider the Grassmannian Grk (Cn ) of complex k-dimensional subspaces
in Cn . The universal complex vector bundle Uk,n → Grk (Cn ) is a subbundle of the trivial
bundle Cn → Grk (Cn ). The trivial bundle Cn is equipped with a canonical Hermitian
metric. We denote by Qk,n the orthogonal complement of Uk,n in Cn so that we have an
isomorphism
Cn ∼= Uk,n ⊕ Qk,n .
From the above example we deduce that
ct Uk,n ct Qk,n = ct (Cn ) = 1.
Denote by uj the j-th Chern class of Uk,n and by vℓ the ℓ-th Chern class of Qk,n . Then
k
X n−k
X
ct Uk,n = uj tj , ct Qk,n = vℓ t ℓ .
j=0 ℓ=0
One can then prove that the cohomology ring H • Grk (Cn ), R) is generated by the classes
ui , vj , which are subject to the single relation above. More formally
. X X
H • Grk (Cn ), R) ∼ = R[u ,
i jv ; 0 ≤ i ≤ k, 0 ≤ j ≤ n − k] ui )( v j ) = 1 ,
i j
deg ui = 2i, deg vj = 2j. The proof requires a more sophisticated topological machinery.
For details we refer to [99], Chapter v3, Section 6. ⊔
⊓
Exercise 8.3.5. Let E and F be two complex vector bundles over the same manifold M .
Show that
Td(E ⊕ F ) = Td(E) · Td(F ). ⊔
⊓
Exercise 8.3.6. Let E and F be two real vector bundles over the same manifold M .
Prove that
pt (E ⊕ F ) = pt (E) · pt (F ), (8.3.2)
where pt denotes the Pontryagin polynomials. ⊔
⊓
Exercise 8.3.7. Let E and F be two real vector bundles over the same smooth manifold
M . Show that
L(E ⊕ F ) = L(E) · L(F )
b
A(E b
⊕ F ) = A(E) b ).
· A(F ⊔
⊓
Example 8.3.8. The natural inclusion Rn ֒→ Cn induces an embedding ı : O(n) ֒→ U (n).
(An orthogonal map T : Rn → Rn extends by complexification to an unitary map TC :
Cn → Cn ). This is mirrored at the Lie algebra level by an inclusion
o(n) ֒→ u(n).
where λj (X) are the eigenvalues of X over C. Since X is, in effect, a real skew-symmetric
matrix we have
λj (X) = λj (X) = −λj (X).
Consequently,
2k+1 X
∗ ∗ 1
ı (c2k+1 )(X) = ı (c2k+1 (X)) = − λi1 (X) · · · λi2k+1 (X)
2πi
1≤i1 <···<i2k+1 ≤n
2k+1 X
2k+1 1
= (−1) − λi1 (X) · · · λi2k+1 (X) = −ı∗ (c2k+1 )(X).
2πi
1≤i1 <···<i2k+1 ≤n
ct (E ∗ ) = c−t (E).
(b) One can also regard E as a real, oriented vector bundle ER . Prove that
X
pit (ER ) = (−1)k t2k pk (ER ) = ct (E) · c−t (E),
k
and
cr (E) = e(ER ). ⊔
⊓
Exercise 8.3.11. (a) The natural morphisms I •• (U (r)) → I •• (O(r)) described above
induce a morphism
Φ∞ : I •• (U (∞)) → I •• (O(∞)).
364 CHAPTER 8. CHARACTERISTIC CLASSES
Φ∞ ((F )∞ ) = (F · F − )∞ ∈ I •• (O(∞)),
b
Td(E ⊗ C) = A(E)2
. ⊔
⊓
Note that
I • (SO(2k) × SO(2ℓ)) ∼
= I • (SO(2k)) ⊗R I • (SO(2ℓ).
Denote by e(ν) the Euler class in I • (SO(2ν)). We want to prove that
Xk = λ1 J ⊕ · · · ⊕ λk J and Xℓ = µ1 J ⊕ · · · ⊕ µℓ J,
2
We are not worried about convergence issues because the matrices for which we intend to apply the
formula have nilpotent entries.
8.3. COMPUTING CHARACTERISTIC CLASSES 365
We have
k+ℓ
1
ı∗ (e(k+ℓ) )(X) = − λ1 · · · λk · µ1 · · · µℓ = e(k) (Xk ) · e(ℓ) (Xℓ )
2π
Proposition 8.3.14. Let E and F be two real, oriented vector bundle of even ranks over
the same manifold M . Then
Example 8.3.15. Let E be a rank 2k real, oriented vector bundle over the smooth
manifold M . We claim that if E admits a nowhere vanishing section ξ then e(E) = 0.
To see this, fix an Euclidean metric on E so that E is now endowed with an SO(2k)-
structure. Denote by L the real line subbundle of E generated by the section ξ. Clearly,
L is a trivial line bundle, and E splits as an orthogonal sum
E = L ⊕ L⊥ .
The result proved in the above example can be reformulated more suggestively as
follows.
Corollary 8.3.16. Let E be a real oriented vector bundle of even rank over the smooth
manifold M . If e(E) 6= 0, then any section of E must vanish somewhere on M ! ⊔
⊓
366 CHAPTER 8. CHARACTERISTIC CLASSES
where ∇ is a connection on E compatible with some metric and 2r = rank (E). The next
result, which generalizes the Gauss–Bonnet theorem, will show that these two notions of
Euler class coincide.
π
Theorem 8.3.17 (Gauss–Bonnet–Chern). Let E → M be a real, oriented vector bundle
over the compact oriented manifold M . Then
i : E → E u 7→ −u ∀u ∈ E.
Since the fibers of E are odd dimensional, we deduce that i reverses the orientation in the
fibers. In particular, this implies
π∗ i∗ τE = −π∗ τE = π∗ (−τE ),
where π∗ denotes the integration along fibers. Since π∗ is an isomorphism (Thom isomor-
phism theorem), we deduce
i∗ τE = −τE .
Hence
etop (E) = −ζ0∗ i∗ τE . (8.3.4)
On the other hand, notice that
ζ0∗ i∗ = ζ0 .
Indeed,
ζ0∗ i∗ = (iζ0 )∗ = (−ζ0 )∗ = (ζ)∗ (ζ0 = −ζ0 ).
The equality etop = egeom now follows from (8.3.4).
B. rank (E) = 2k. We will use a variation of the original argument of Chern, [29]. Let
∇ denote a connection on E compatible with a metric g. The strategy of proof is very
simple. We will explicitly construct a closed form ω ∈ Ω2k
cpt (E) such that
8.3. COMPUTING CHARACTERISTIC CLASSES 367
(i) π∗ ω = 1 ∈ Ω0 (M ).
The Thom isomorphism theorem coupled with (i) implies that ω represents the Thom
2k (E). The condition (ii) simply states the sought for equality e
class in Hcpt top = egeom .
Denote by S(E) the unit sphere bundle of E,
Denote by π0 the natural projection S(E) → M , and by π0∗ (E) → S(E) the pullback
of E to S(E) via the map π0 . The vector bundle π0∗ (E) has an SO(2k)-structure, and
moreover, it admits a tautological, nowhere vanishing section
The decisive step in the proof of Gauss–Bonnet–Chern theorem is contained in the follow-
ing lemma.
Lemma 8.3.18. There exists Ψ = Ψ(∇) ∈ Ω2k−1 (S(E)) such that
and Z
Ψ(∇) = −1 ∈ Ω0 (M ). (8.3.6)
S(E)/M
The form Ψ(∇) is sometimes referred to as the global angular form of the pair (E, ∇).
For the clarity of the exposition we will conclude the proof of the Gauss–Bonnet–Chern
theorem assuming Lemma 8.3.18 which will be proved later on.
Denote by r : E → R+ the norm function
E ∋ e 7→ |e|g
368 CHAPTER 8. CHARACTERISTIC CLASSES
ρ = ρ(r) : [0, ∞) → R,
such that ρ(r) = −1 for r ∈ [0, 1/4], and ρ(r) = 0 for r ≥ 3/4. Finally, define
The differential form ω is well defined since ρ′ (r) ≡ 0 near the zero section. Obviously ω
has compact support on E, and satisfies the condition (ii) since
we deduce
Z Z Z ∞ Z
′ ′
ω=− ρ (r)dr ∧ Ψ(∇) = − ρ (r)dr · Ψ(∇)
E/M E/M 0 S(E)/M
Z
(8.3.6)
= −(ρ(1) − ρ(0)) Ψ(∇) = 1.
S(E)/M
To complete the program outlined at the beginning of the proof we need to show that ω
is closed.
dω = ρ′ (r)dr ∧ dΨ(∇) − ρ′ (r) ∧ π ∗ e(∇)
(8.3.5 )
= ρ′ (r)dr ∧ {π0∗ e(∇) − π ∗ e(∇)}.
The above form is identically zero since π0∗ e(∇) = π ∗ e(∇) on the support of ρ′ . Thus ω
is closed and the theorem is proved. ⊔
⊓
π0∗ E = L ⊕ L⊥ ,
where L is the real line bundle spanned by Υ, while L⊥ is its orthogonal complement in
π0∗ E with respect to the pullback metric g. Denote by
P : π0∗ E → π0∗ E
the orthogonal projection onto L⊥ . Using P , we can produce a new metric compatible
ˆ on π ∗ E by
connection ∇ 0
ˆ = 0.
P f (F (∇))
π0∗ E |Ex ∼
= (Ex × Ex → Ex ).
R2k = ν ⊕ T S 2k−1 ,
ˆ is then the direct
where ν denotes the normal bundle of S 2k−1 ֒→ R2k . The connection ∇
sum between the trivial connection on ν, and the Levi–Civita connection on T S 2k−1 .
370 CHAPTER 8. CHARACTERISTIC CLASSES
Fix a point p ∈ S 2k−1 , and denote by (x1 , . . . , x2k−1 ) a collection of normal coordinates
near p, such that the basis (∂xi |p ) is positively oriented. Set ∂i := ∂xi for i = 1, . . . , 2k − 1.
Denote the unit outer normal vector field by ∂0 . For α = 0, 1, . . . , 2k − 1 set f α = ∂α |p .
The vectors f α form a positively oriented orthonormal basis of R2k .
We will use Latin letters to denote indices running from 1 to 2k − 1, and Greek letters
indices running from 0 to 2k − 1.
∇i ∂α = (∇i ∂α )ν + (∇i ∂α )τ ,
where the superscript ν indicates the normal component, while the superscript τ indicates
the tangential component. Since at p
ˆ i ∂j = (∇i ∂j )τ ,
0=∇
we deduce
∇i ∂j = (∇i ∂j )ν at p.
Hence
∇i ∂j = (∇i ∂j , ∂0 )∂0 = −(∂j , ∇i ∂0 )∂0 .
Recalling that ∇ is the trivial connection in R2k , we deduce
∂
∇i ∂0 |p = ∂0 |p = f i = ∂i |p .
∂fi
Consequently,
∇i ∂j = −δji ∂0 , at p.
If we denote by θ i the local frame of T ∗ S 2k−1 dual to (∂i ), then we can rephrase the above
equality as
∇∂j = −(θ 1 + · · · + θ 2k−1 ) ⊗ ∂0 .
On the other hand, ∇i ∂0 = ∂i , i.e.,
Since (x1 , · · · , x2k−1 ) are normal coordinates with respect to the Levi–Civita connection
ˆ we deduce that ∇∂
∇, ˆ α = 0, ∀α so that
0 −θ 1 · · · −θ 2k−1
θ1 0 ··· 0
A = (∇ − ∇)ˆ |p =
θ
2 0 ··· 0
.
.. .. .. ..
. . . .
θ 2k−1 0 ··· 0
coincides with the induced Riemann metric (which is the first fundamental form). Using
Teorema Egregium we get
F t = F 0 + t2 A ∧ A = 0 ⊕ (1 − t2 )F 0 .
(−1)k X
P f (A, F, F, · · · , F ) = ǫ(σ)Aσ0 σ1 Fσ2 σ3 · · · Fσ2k−2 σ2k−1 .
2k k!
σ∈S2k
For i = 0, 1, define
Si = {σ ∈ S2k ; σi = 0}.
We deduce
X
2k k! P f (A, F, · · · , F ) = (−1)k ǫ(σ)(−θ σ1 ) ∧ θ σ2 ∧ θ σ3 ∧ · · · ∧ θ σ2k−2 ∧ θ σ2k−1
σ∈S0
X
+(−1)k ǫ(σ)θ σ0 ∧ θ σ2 ∧ θ σ3 ∧ · · · ∧ θ σ2k−2 ∧ θ σ2k−1 .
σ∈S1
The above integral can be evaluated inductively using the substitution t = cos ϕ. We have
Z 1 Z π/2
Ik = (1 − t2 )k−1 dt = (cos ϕ)2k−1 dϕ
0 0
Z π/2
π/2
= (cos ϕ)2k−2 sin ϕ|0 + (2k − 2) (cos ϕ)2k−3 (sin ϕ)2 dϕ
0
= (2k − 1)Ik−1 − (2k − 2)Ik
so that
2k − 3
Ik = Ik−1 .
2k − 2
One now sees immediately that
Z
Ψ(∇) = −1.
S 2k−1
In the next exercise we will also assume M is endowed with a Riemann structure.
Exercise 8.3.21. Consider a nondegenerate smooth section s of E, i.e.,
∇0X s 6= 0 along Z
a∇ : NZ → E |Z ,
defined by
X 7→ ∇X s X ∈ C ∞ (NZ ),
is a bundle isomorphism. Conclude that Z is endowed with a natural orientation.
(b) Show that there exists an open neighborhood N of Z ֒→ NZ ֒→ T M such that
exp : N → exp(N)
is a diffeomorphism. (Compare with Lemma 7.3.44.) Deduce that the Poincaré dual of Z
in M can be identified (via the above diffeomorphism) with the Thom class of NZ .
(c) Prove that the Euler class of E coincides with the Poincaré dual of Z . ⊔
⊓
The part (c) of the above exercise generalizes the Poincaré-Hopf theorem (see Corollary
7.3.48). In that case, the section was a nondegenerate vector field, and its zero set was a
finite collection of points. The local index of each zero measures the difference between two
orientations of the normal bundle of this finite collection of points: one is the orientation
obtained if one tautologically identifies this normal bundle with the restriction of T M to
this finite collection of points while, and the other one is obtained via the adjunction map.
In many instances one can explicitly describe a nondegenerate section and its zero set,
and thus one gets a description of the Euler class which is satisfactory for most topological
applications.
Example 8.3.22. Let U denote the tautological line bundle over the complex projective
space CPn . According to Exercise 8.3.10
c1 (U) = e(U),
374 CHAPTER 8. CHARACTERISTIC CLASSES
when we view U as a rank 2 oriented, real vector bundle. Denote by [H] the (2n − 2)-cycle
defined by the natural inclusion
We claim that the (homological) Poincaré dual of c1 (τn ) is −[H]. We will achieve this by
showing that c1 (U∗ ) = −c1 (U) is the Poincaré dual of [H].
Let P be a degree 1 homogeneous polynomial P ∈ C[z0 , . . . , zn ]. For each complex line
L ֒→ Cn+1 , the polynomial P defines a complex linear map L → C, and hence an element
of L∗ , which we denote by P |L . We thus have a well defined map
CPn ∋ L 7→ P |L ∈ L∗ = U∗ |L ,
and the reader can check easily that this is a smooth section of U∗ , which we denote by
[P ]. Consider the special case P0 = zn . The zero set of the section [P0 ] is precisely the
image of [H]. We let the reader keep track of all the orientation conventions in Exercise
8.3.21, and conclude that c1 (U∗ ) is indeed the Poincaré dual of [H]. ⊔
⊓
Show that Z
c1 (Lg ) = deg g,
S2
Ultimately, mathematics is about solving problems, and we can trace the roots of most
remarkable achievements in mathematics to attempts of solving concrete problems which,
for various reasons, were deemed very interesting by the mathematical community.
In this chapter, we will present some very beautiful applications of the techniques
developed so far. In a sense, we are going against the natural course of things, since the
problems we will solve in this chapter were some of the catalysts for the discoveries of
many of the geometric results discussed in the previous chapters.
Integral geometry, also known as geometric probability, is a mixed breed subject.
The question it addresses have purely geometric formulations, but the solutions borrow
ideas from many mathematical areas, such as representation theory and probability. This
chapter is intended to wet the reader’s appetite for unusual questions, and make him/her
appreciate the power of the technology developed in the previous chapters.
A B
α β
where X → A and X → B are maps between finite sets. One should think of α and β as
defining two different fibrations with the same total space X.
Suppose we are given a function f : X → R. We define its “average” over X as the
“integral”
X
hf iX := f (x).
x∈X
375
376 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
We can compute the average hf iX in two different ways, either summing first over the
fibers of α, or summing first over the fibers of β, i.e.
X X X X
f (x) = hf iX = f (x) . (9.1.1)
a∈A α(x)=a b∈B β(x)=b
We can reformulate the above equality in a more conceptual way by introducing the
functions X
α∗ (f ) : A → C, α∗ (f )(a) = f (x),
α(x)=a
and X
β∗ (f ) : B → C, β∗ (f )(b) = f (x).
β(x)=b
α∗ (f ) A
= β∗ (f ) B
.
All the integral geometric results that we will prove in this chapter will be of this form.
The reason such formulæ have captured the imagination of geometers comes from the fact
that the two sides hα∗ (f )iA and hβ∗ (f )iB have rather unrelated interpretations.
The geometric situation we will investigate is a bit more complicated than the above
baby case, because in the geometric case the sets X, A, B are smooth manifolds, the maps
α and β define smooth fibrations, and the function f is not really a function, but an
object which can be integrated over X, i.e., a density. It turns out that the push forward
operations do make sense for densities resulting in formulas of the type
h f iX = α∗ (f ) A
.
These formulæ generalize the classical Fubini theorem and they are often referred to as
coarea formulæ for reasons which will become apparent a bit later. To produce such results
we need to gain a deeper understanding of the notion of density introduced in Subsection
3.4.1.
Suppose V is a finite dimensional real vector space. We denote by det V the top
exterior power of V . Given a real number s we define an s-density on V to be a map
λ : det V → R such that
We denote by |Λ|s (V ) the one dimensional space of s-densities. Note that we have a
canonical identification |Λ|0 (E) = R. We will refer to 1-densities simply as densities, and
we denote the corresponding space by |Λ|(V ).
We say that an s-density λ : det V → R is positive if
We denote by |Λ|+
s (V ) the cone of positive densities.
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 377
• Let µ ∈ |Λ|+ s (U ), λ ∈ |Λ|s (V ), and suppose (wj )1≤j≤p is a basis of W . Now choose
lifts vj ∈ V of wj , such that β(vj ) = wj , and a basis (ui )1≤i≤m of U such that
α(u1 ), . . . , α(um ), v1 , . . . , vp ,
is a basis of V . We set
λ (∧i α(ui ) ) ∧ (∧j vj )
(µ\λ) ∧j wj := .
µ ∧ i ui
It is easily seen that the above definition is independent of the choices of v’s and u’s.
378 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Again it is easily verified that the above definition is independent of the various
choices.
Again one can check that this is independent of the various bases (ui ) and (wj ).
Exercise 9.1.1. Prove that the above constructions are indeed independent of the various
choices of bases. ⊔
⊓
given by
α(s) = (4s, 10s), β(x, y) = 5x − 2y.
Denote by e the canonical basis of U , by (e1 , e2 ) the canonical basis of V and by f the
canonical basis of W . We obtain canonical densities λU on U , λV on V and λW on W
given by
λU (e) = λV (e1 ∧ e2 ) = λW (f ) = 1.
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 379
Hence λV /β ∗ λW = 2λU . ⊔
⊓
Suppose now that E → M is a real vector bundle of rank n over the smooth manifold
M . Assume that it is given by the open cover (Uα ) and gluing cocycle
where V is a fixed real vector space of dimension n. Then the bundle of s-densities
associated to E is the real line bundle |Λ|s E given by the open cover (Uα ) and gluing
cocycle
| det gβα |−s : Uαβ → Aut( |Λ|s (V ) ) ∼
= R∗ .
We denote by C ∞ (|Λ|s E ) the space of smooth sections of |Λ|s E. Such a section is given
by a collection of smooth functions λα : Uα → |Λ|s (V ) satisfying the gluing conditions
−1 ∗
λβ (x) = (gβα ) λα (x) = | det gβα |−s λα (x), ∀α, β, x ∈ Uαβ .
Let us point out that if V = Rn , then we have a canonical identification |Λ|s (Rn ) → R,
and in this case a density can be regarded as a collection of smooth functions λα : Uα → R
satisfying the above gluing conditions.
An s-density λ ∈ C ∞ (|Λ|s E) is called positive if for every x ∈ M we have λ(x) ∈
+
|Λ|s (Ex ).
If φ : N → M is a smooth map, and E → M is a smooth real vector bundle, then we
obtain the pullback bundle π ∗ E → N . We have canonical isomorphisms
|Λ|s π ∗ E ∼
= π ∗ |Λ|s E,
φ∗ : C ∞ (|Λ|s E) → C ∞ (π ∗ |Λ|s E) ∼
= C ∞ (|Λ|s π ∗ E).
0 → E0 → E1 → E2 → 0
\ : C ∞ (|Λ|+ ∞ ∞
s E0 ) × C (|Λ|s E1 ) → C (|Λ|s E2 ),
/ : C ∞ (|Λ|s E1 ) × C ∞ (|Λ|+ ∞
s E2 ) → C (|Λ|s E0 ),
380 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
and
× : C ∞ (|Λ|s E0 ) × C ∞ (|Λ|s E2 ) → C ∞ (|Λ|s E1 ).
In the sequel we will almost exclusively use a special case of the above construction, when
E is the tangent bundle of the smooth manifold M . We will denote by |Λ|s (M ) the line
bundle |Λ|s (T M ), and we will refer to its sections as (smooth) s-densities on M . When
s = 1, we will use the simpler notation |Λ|M to denote |Λ|1 (M ).
As explained
in Subsection 3.4.1, an s-density on M is described by a coordinate atlas
i
Uα , (xα ) , and smooth functions λα : Uα → R satisfying the conditions
!
∂xjβ
λβ = |dβα |−s λα , where dβα = det , n = dim M. (9.1.3)
∂xiα
1≤i,j≤n
We deduce that the smooth 0-densities on M are precisely the smooth functions.
ωα = λα dx1α ∧ · · · ∧ dxnα .
The functions λα satisfy the gluing conditions λβ = d−1 βα λα , and we conclude that the
collection of functions |λα |s defines an s-density on M which we will denote by |ω|s .
Because of this fact, the s-densities are traditionally described as collections
Ωn (M ) → C ∞ (|Λ|(M ) ).
where |gα | denotes the determinant of the symmetric matrix representing the metric g in
the coordinates (xiα ). ⊔
⊓
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 381
The densities on a manifold serve one major purpose: they can be integrated. We
denote by C0 (|Λ|(M ) ) the space of continuous densities with compact support. Consider
the integration map defined in Subsection 3.4.1
Z Z
: C0 (|Λ|(M ) ) → R, |dµ| 7→ |dµ|.
M M
Let us observe that if |dρ| and |dτ | are two positive densities, then there exists a positive
function f such that
|dρ| = f |dτ |.
The existence of this function follows from the Radon-Nycodim theorem. For every x ∈ M
we have R
|dρ
f (x) = lim R U ,
U →{x} U |dτ |
where the above limit is taken over open sets shrinking to x. We will use the notation
|dρ|
f := ,
|dτ |
and we will refer to f as the Jacobian of |dρ| relative to |dτ |. If φ : M → N is a
diffeomorphism and |dρ| = (Uα , ρα |dyα |) is a density on N , then we define the pullback of
|dρ| by φ to be the density φ∗ |dρ| on M defined by
φ∗ |dρ| = φ−1 (Uα ), ρα |dyα |), yαi = xiα ◦ φ.
The classical change in variables formula now takes the form
Z Z
|dρ| = φ∗ |dρ|.
N M
Proposition 9.1.6. Suppose |dρ| and |dτ | are two G-invariant positive densities. Then
|dρ|
the Jacobian J = |dτ | is a G-invariant smooth, positive function on G.
J(x) = J(gx), ∀x ∈ M, g ∈ G. ⊔
⊓
Corollary 9.1.7. If G acts smoothly and transitively on the smooth manifold M then, up
to a positive multiplicative constant there exists at most one invariant positive density. ⊔
⊓
Observe that any (positive) density |dν| on B defines by pullback a (positive) density
Φ∗ |dν| associated to the bundle Φ∗ T B → M . If λ is a density associated to the vertical
tangent bundle T V M , then we obtain a density λ × Φ∗ |dν| on M .
Suppose |dµ| is a density on M such that Φ is proper on the support of |dµ|. Set
k := dim B, r := dim M − dim B. We would like to describe a density Φ∗ |dµ| on B called
the pushforward of |dµ| by Φ. Intuitively, Φ∗ |dµ| is the unique density on B such that for
any open subset U ⊂ B we have
Z Z
Φ∗ |dµ| = |dµ|.
U Φ−1 (U )
Proposition 9.1.8 (The pushforward of a density). There exists a smooth density Φ∗ |dµ|
on B uniquely characterized by the following condition. For every density |dν| on B we
have
Φ∗ |dµ| = Vν |dν|,
where Vν ∈ C ∞ (B) is given by
Z
Vν (b) := |dµ|/Φ∗ |dν|.
Φ−1 (b)
Proof. Fix a positive density|dν| on B. Along every fiber Mb = Φ−1 (b) we have a density
|dµ|b /Φ∗ |dν| ∈ C ∞ |Λ|(Mb ) corresponding to the short exact sequence
DΦ
0 → T Mb → (T M )|Mb −→ (Φ∗ T B)|Mb → 0.
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 383
To understand this density fix x ∈ Mb . Then we can find local coordinates (yj )1≤j≤k near
b ∈ B, and smooth functions (xi )1≤i≤r defined in a neighborhood V of x in M such that
the collection of functions (xi , y j ) defines local coordinates near x on M , and in these
coordinates the map Φ is given by the projection (x, y) 7→ y.
In the coordinates y on B we can write
Using partitions of unity and the classical Fubini theorem we obtain the coarea formula
for densities .
Theorem 9.1.9 (Coarea formula). Suppose the Φ : M → B is a submersion, |dν| is a
density on B and |dµ| is a density on M such that Φ is proper on the support of |dµ|.
Z Z Z Z !
|dµ| = Φ∗ |dµ| = |dµ|/Φ∗ |dν| |dν|. (9.1.4)
M B B Φ−1 (b)
Remark 9.1.10. Very often the submersion Φ : M → B satisfies the following condition.
For every point on the base b ∈ B there exist an open neighborhood U of b in B, a
nowhere vanishing form ω ∈ Ωk (U ), a nowhere vanishing form Ω ∈ Ωk+r (MU ), (MU :=
Φ−1 (U )), and a form η ∈ Ωr (MU ) such that
Ω = η ∧ π ∗ ω.
Then we can write |dµ| = ρ|Ω|, for some ρ ∈ C ∞ (MU ). The form ω defines a density |ω|
on U , and we conclude that
⊔
⊓
f∗ |dVg | = v(t)|dt|,
Hence
|dVg |/f ∗ |dt| = ρ|(dx1 ∧ · · · ∧ dxm )|,
so that
1
|dVt0 | |U ∩Mt0 = |∇f | · |dVg |/f ∗ |dt| and |dVg |/f ∗ |dt| = |dVt0 | |U ∩Mt0 .
|∇f |
Hence Z
1
f∗ |dVg | = v(t)|dt|, v(t) = |dVt |, (9.1.6)
Mt |∇f |
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 385
To see how this works in practice, consider the unit sphere S n ⊂ Rn+1 . We denote the
coordinates in Rn+1 by (t, x1 , . . . , xn ). We let P± ∈ S n denote the poles given t = ±1.
We denote by |dVn | the volume density on S n , and by π : S n → R the natural projection
given by
(t, x1 , . . . , xn ) 7→ t.
The map π is a submersion on the complement of the poles, M = S n \ {P± }, and π(M ) =
(−1, 1). We want to compute π∗ |dVn |.
p 1
θ
θ t
1
0 t p'
where σ m denotes the m-dimensional area of the unit m-dimensional sphere S m . The last
formula implies
Z 1 Z 1
2 n−2 n−2
−1/2 1 n
σ n = 2σ n (1 − t ) 2 |dt| = σ n−1 (1 − s) 2 s |ds| = σ n−1 B , ,
0 0 2 2
386 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
We deduce
2Γ( 12 )n+1
σn = . (9.1.10)
Γ( n+12 )
⊔
⊓
The equality (9.1.7) can be further generalized. We start with a simple linear algebra
result.
Lemma 9.1.12. Suppose that U and V are Euclidean spaces of dimensions n + k and
respectively k, n ≥ 0, and A : U → V is a surjective linear map. Then there exist Eu-
clidean coordinates x1 , . . . , xn+k on U , Euclidean coordinates y 1 , . . . , y k on V and positive
numbers µ1 , . . . , µk such that, in these coordinates the operator A is described by
y j = µj xj , 1 ≤ j ≤ k.
The numbers µ21 , . . . , µ2k are the eigenvalues of the positive symmetric operator AA∗ : V →
V so that √
µ1 · · · µk = det AA∗ .
Proof. Let W denote the orthogonal complement of ker A in U . Denote by A0 the re-
striction of A to W so that A0 : W → V is a linear isomorphism. Note that W coincides
with the range of the adjoint operator A∗ : V → U so that
A0 A∗0 = AA∗ .
B = A0 R : V → V
V B
wV
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 387
1
hA, Bi = tr(AB † ), ∀A, B ∈ End(V ),
2
induces a metric h = hn,k on Grk (V ).
Denote by O(V ) the group of orthogonal transformations of V . The group O(V ) acts
smoothly and transitively on Grk (V )
Note that
PgL = gPL g−1 .
The action of O(V ) on End+ (V ) by conjugation preserves the inner product on End+ (V ),
and we deduce that the action of O(V ) on Grk (V ) preserves the metric h.
We would like to express this metric in the graph coordinates introduced in Example
1.2.22. Consider L ∈ Grk (V ), and S ∈ Hom(L, L⊥ ). Then, for every t ∈ R, we denote by
Ut the graph of the map tS : L → L⊥ ,
1 d
h(U̇0 , U̇0 ) = tr(Ṗ02 ), Ṗ0 := |t=0 PΓtS ,
2 dt
If we write Pt := PΓtS we deduce from (1.2.5) that
(1L + t2 S ∗ S)−1 t(1L + t2 S ∗ S)−1 S ∗
Pt = .
tS(1L + t2 S ∗ S)−1 t2 S(1L + t2 S ∗ S)−1 S ∗
Hence
0 S∗
Ṗt=0 = = S ∗ PL⊥ + SPL , (9.1.13)
S 0
so that
1
h(U̇0 , U̇0 ) = tr(SS ∗ ) + tr(S ∗ S) = tr(SS ∗ ).
2
We can be even more concrete.
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 389
We can think of the collection (sαi ) as defining local coordinates on the open subset
Grk (V, L) ⊂ Grk (V ) consisting of k-planes intersecting L⊥ transversally. Hence
X
h(U̇0 , U̇0 ) = |sαi |2 . (9.1.14)
i,α
In integral geometric computations we will find convenient to relate the above coor-
dinates to the classical language of moving frames. In the sequel we make the following
notational conventions.
• We will use lower case Latin letters i, j, ... to denote indices in the range {1, ..., k}.
• We will use lower case Greek letters α, β, γ, ... to denote indices in the range
{k + 1, . . . , n}.
• We will use upper case Latin letters A, B, C, ... to denote indices in the range
{1, · · · , n}.
gt eA (0) = eA (t).
With respect to the fixed frame ( eA (0) ), the orthogonal transformation gt is given by a
matrix ( sAB (t) ), where
sAB = eA (0) • gt eB (0) .
Observe that g0 = 1V . Let Pt denote the projection onto Lt . Then
Pt = gt P0 gt−1 ,
d
so that, if we set X = dt |t=0 gt , we have
Ṗ0 = [X, P0 ].
390 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
With respect to the decomposition V = L0 + L⊥0 the projector P0 has the block decom-
position
1L0 0
P0 = .
0 0
The operator X is represented by a skew-symmetric matrix with entries
Hence X
1
h(L̇0 , L̇0 ) = tr(Ṗ0 , Ṗ0 ) = tr(XL⊥ ,L0 XL∗ ⊥ ,L0 ) = |ṡαi |2 . (9.1.15)
2 0 0
α,i
The above computations show that the differential of L at p0 is described by the (n−k)×k
matrix of 1-forms on M
Dp0 L = (θαi ), θαi := eα • dei .
More precisely, this means that if X = (u̇a ) ∈ Tp0 M , and we let “ ˙ ” denote the differen-
tiation along the flow lines of X, then
X
Dp0 L(X) = (xαi ) ∈ TL(0) Grk (V ), xαi = eα • ėi = eα • dei (X). (9.1.16)
a
If M happens to be an open subset of Grk (V ), then we can use the forms θαi to describe
the metric h. More precisely, the equalities (9.1.14) and (9.1.15) show that
X
h= θαi ⊗ θαi . (9.1.17)
α,i
In other words, the collection (θαi ) is an orthonormal frame of the cotangent bundle
T ∗ Grk (V ).
The metric h = hn,k is O(V )-invariant so that the associated Riemannian volume
defines an invariant density. We will denote this invariant density by |dγn,k |, where n =
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 391
dim V , and we will refer to it as the kinematic density on Grk (V ). Since the action of
the group O(V ) is transitive, we deduce that any other invariant density is equivalent to
a constant multiple of this metric density. We would like to give a local description of
|dγn,k |.
Set n := dim V . If O is a sufficiently small open subset of Grk (V ) then we can find
smooth maps
eA : O → V, A = 1, · · · , n,
with the following properties.
• For every L ∈ O the collection eA (L) 1≤A≤n is an orthonormal frame of V .
• For every L ∈ O the collection ei (L) 1≤i≤k is an orthonormal frame of L.
For every 1 ≤ i ≤ k, and every k + 1 ≤ α ≤ 1, we have a 1-form
θαi ∈ Ω1 (O), θαi = eα • dei = −deα • ei .
As explained above, the metric h is described along O by the symmetric tensor
X
h= θαi ⊗ θαi ,
α,i
We would like to compute the volumes of the Grassmannians Grk (V ), dim V = n with
respect to the kinematic density |dγn,k |, i.e., we would like to compute
Z
Cn,k := |dγn,k |.
Grk (V )
Denote by ω n the volume of the unit ball B n ⊂ Rn , and by σn−1 the (n − 1)-dimensional
“surface area” of the unit sphere S n−1 , so that
k
π
, n = 2k,
Γ(1/2)n k!
σn−1 = nω n , and ω n = = ,
Γ(1 + n/2) 2k+1 π k k!
2
, n = 2k + 1,
(2k + 1)!
392 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
where Γ(x) is the Gamma function. We list below the values of ω n for small n.
n 0 1 2 3 4
4π π2 .
ωn 1 2 π 3 2
To compute the volume of the Grassmannians we need to use yet another description of
the Grassmannians, as homogeneous spaces.
Note first that the group O(V ) acts transitively on Grk (V ). Fix L0 ∈ Grk (V ).
Then the stabilizer of L0 with respect to the action of O(V ) on Grk (V ) is the sub-
group O(L0 ) × O(L⊥ 0 ) and thus we can identify Grk (V ) with the homogeneous space of
left cosets O(V )/O(L0 ) × O(L⊥0 ).
The computation of Cn,k is carried out in three steps.
Step 1. We equip the orthogonal groups O(Rn ) with a canonical invariant density |dγn |
called the kinematic density on O(n). Set
Z
Cn := |dγn |.
O(Rn )
originating at S0 . We set
Ẋ := γ̇X (0) ∈ TS0 O(V ) ⊂ End(V ), Ẏ := γ̇Y (0) ∈ TS0 O(V ) ⊂ End(V ).
Then Ẋ = S0 X, Ẏ = S0 Y , and
1 1 1
h(Ẋ, Ẏ ) = tr (S0 X)(S0 Y )∗ = tr S0 XY ∗ S0∗ = tr S0∗ S0 XY ∗
2 2 2
1
= tr XY ∗
2
If we choose an orthonormal basis (eA ) of V so that X and Y are given by the skew
symmetric matrices (xAB ), (yAB ), then we deduce
X
h(Ẋ, Ẏ ) = xAB yAB .
A>B
then we obtain the angular forms θAB := f A • df B . The above metric metric has the
description X
h= θAB ⊗ θAB .
A>B
Step 2. Fix an orthonormal frame (eA ) of V such that L0 = span (ei ; 1 ≤ i ≤ k). We
can identify V with Rn , O(V ) with O(n), and L0 with the subspace
Rk ⊕ 0n−k ⊂ Rn .
and that
p∗ |dγn | = Ck Cn−k |dγn,k |.
Once we have this we deduce from the Fubini formula for densities (9.1.5) that
Cn = Ck Cn−k Cn,k .
φ : U → O(n)
of the map p : O(n) → Grk (Rn ). The section can be identified with a smooth family of
orthonormal frames (φA (L), L ∈ U )1≤A≤n of Rn , such that
To such a frame
orthogonal matrix φ(L) ∈ O(n) which maps the fixed
we associate the
frame eA to the frame φB . It is a given by a matrix with entries
φ(L)AB = eA • φB .
defined as follows.
X
f α = f α (t, L) = tβα φβ (L) ∈ L⊥ , k + 1 ≤ α ≤ n. (9.1.19)
β
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 395
f A = ΨeA , ∀A.
L = LT = span (f i )1≤i≤k ,
while the matrices (sij ) and (tαβ ) are obtained via (9.1.18) and (9.1.19). More precisely, we
have
sij = φi (LT ) • f j , sαβ = φα (LT ) • f β .
Observe that, ∀s0 , s1 ∈ O(k), t0 , t1 ∈ O(n − k), we have
(φB ) ∗ (s0 , t0 ) ∗ (s1 , t1 ) = (φB ) ∗ (s0 s1 , t0 t1 ).
This means that Ψ is equivariant with respect to the right actions of O(k) × O(n − k) on
O(k) × O(n − k) × U and O(n). We have a commutative diagram
O(k) × O(n − k) × U
hhh
Ψ
' w p−1(U )
πhj *'''p'
U ⊂ Grk (Rn )
We write this as ^ ^ ^
θij ∧ θαβ ∧ θα,i .
i>j α>β α,i
396 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
V
The form α,i θα,i is the pullback of a nowhere vanishing form defined in a neighborhood
of L0 in Grk (Rn ), whose associated density is |dγn,k |. We now find ourselves in the
situation described in Remark 9.1.10. We deduce
Z ^ ^
c= θij ∧ θαβ
p−1 (L0 ) i>j α>β
Z Z
= |dγk | |dγn−k | = Ck Cn−k .
O(k) O(n−k)
Hence
Cn
Cn,k = .
Ck Cn−k
n−1
Step 3. Fix an orthonormal basis {eA } of V , and denote by S + the open hemisphere
n−1
S+ = ~v ∈ V ; |~v | = 1, ~v • e1 > 0 .
Note that Gr1 (V ) ∼ = RPn−1 is the Grassmannian of lines in V . The set of lines that do
n−1
not intersect S + is a smooth hypersurface of Gr1 (V ) diffeomorphic to RPn−2 and thus
has kinematic measure zero. We denote by Gr1 (V )∗ the open subset consisting of lines
n−1
intersecting S + . We thus have a map
ψ : Gr∗1 (V ) → S +
n−1 n−1
, ℓ 7→ ℓ ∩ S + .
This map is a diffeomorphism, and we have
Z Z Z
Cn,1 = |dγn,1 | = |dγn,1 | = (ψ −1 )∗ |dγn,1 |.
Gr1 (V ) Gr∗1 (V ) S n−1
+
We have n
k
|dνn,k | = |dγnk |.
Cn,k
Example 9.1.15. Using the computation in Example 9.1.13 we deduce
(~c, L) 7−→ ~c + L.
where [S] ∈ Grk (V ) denotes the affine subspace through the origin parallel to S,
[S] = S − S = s1 − s2 ; s1 , s2 ∈ S .
We set
~c(S) := S ∩ [S]⊥ ,
and we say that ~c(S) is the center of the affine plane S.
We equip Graff k (V ) with the structure of smooth manifold which makes A a diffeo-
morphism. Thus, we identify Graff k (V ) with a vector subbundle of the trivial bundle
V × Grk (V ) described by
U⊥ = (~c, L) ∈ V × Grk (V ); PL~c = 0 ,
π ∗ T Grk (V ) → Graff k (V ).
we obtain a density |e
γn,k | on Graff k (V ) given by
e→V
eA : O → V, ~r : O
e we have
• For every S ∈ O
S = ~r(S) + [S].
9.1. THE INTEGRAL GEOMETRY OF REAL GRASSMANNIANS 399
S = S(~r, ei ).
Observe that the center of this affine plane is the projection of ~r onto [S]⊥
X
~c(S) = ( eα • ~r )eα .
α
e by
The volume density on Graff k (V ) is described along O
^ ^
|de
γn,k | = θα ∧ θαi .
α α,i
Denote by Iso(V ) the group of affine isometries of V , i.e., the subgroup of the group of
affine transformations generated by translations, and by the rotations about a fixed point.
Any affine isometry T : V → V is described by a unique pair (t, S) ∈ V × O(V ) so that
T (v) = Sv + t, ∀v ∈ V.
The group Iso(V ) acts in an obvious fashion on Graff k (V ), and a simple computation
shows that the associated volume density |de γn,k | is Iso(V ) invariant.
If instead of the density |dγn,k | on Grk (V ) we use the density |dνn,k |, we obtain a
density |de
νn,k | on Graff k (V ) which is a constant multiple of |de γn,k |.
n
k
|de
νn,k | = |de
γnk |. (9.1.22)
Cn,k
400 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Example 9.1.17. Let us unravel the above definition in the special case Graff 1 (R2 ), the
Grassmannians of affine lines in R2 . Such a line L is determined by two quantities: the
angle θ ∈ [0, π) is makes with the x-axis, and the signed distance ρ ∈ (−∞, ∞) from the
origin. More precisely, for every ρ ∈ R and θ ∈ [0, π) we denote by Lθ,ρ the line is given
in Euclidean coordinates by the equation
x sin θ − y cos θ = ρ
As a manifold, the Grassmannian Graff 1 (R2 ) is diffeomorphic to the interior of the Möbius
band. The Fubini formula in Theorem 9.1.16 can now be rewritten
Z Z ∞ Z π
f (L) |de
γ2,1 |(L) = f (Lθ,ρ )|dθ| |dρ|,
Graff 1 (R2 ) −∞ 0
∞ (Graff (R2 ) ).
∀f ∈ Ccpt ⊔
⊓
1
M ∋ p 7→ Tp M ∈ Grm (Rn ),
which, for obvious reasons, is named the Gauss map of the submanifold M .
• We will use lower case Latin letters i, j, . . . to denote indices in the range
1 ≤ i, j, . . . ≤ m = dim M.
• We will use lower case Greek letters α, β, . . . to denote indices in the range
m < α, β, . . . ≤ n.
9.2. GAUSS–BONNET AGAIN?!? 401
• We will use the capital Latin letters A, B, C to denote indices in the range
1 ≤ A, B, C ≤ n.
Then
dX 1
D X = ... = dX.
dX n
The restriction to M of the tangent bundle T Rn admits an orthogonal decomposition
(T Rn )|M = T M ⊕ (T M )⊥ .
X τ = θ i (X)ei , X ν = θ α (X)eα .
We denote by ΘA
B the 1-forms associated to D by the frame (eA ). They satisfy Cartan’s
equations
dθ A = −ΘA B A A
B ∧ θ , DeB = ΘB eA , ΘB = −ΘA .
B
If we write
φA := θ A |M , ΦA A
B := ΘB |M ,
then, as explained Subsection 4.2.4, we deduce that (Φij ) are the 1-forms associated to the
Levi–Civita connection ∇M by the local orthonormal frame (ei |M ). This implies that
and thus,
∇M τ
X Y = (D X Y ) , ∀X, Y ∈ Vect (M ).
Consider the Gauss map
G = GM : M → Grm (Rn ), x 7→ Tx M.
The shape operator of the submanifold M ֒→ Rn is, by definition, the differential of the
Gauss map. Intuitively, the shape operator measures how fast the tanget plane changes
its location as a point in a Grassmannian.
We denote the shape operator by S M , and we would like to relate it to the structural
coefficients ΦAB.
As explained in Subsection 9.1.2, in the neighborhood U of p0 , the “moving plane”
x 7→ Tx M can be represented by the orthonormal frame (eA ) which has the property
that the first m vectors e1 (x), . . . , em (x) span Tx M . The differential of the Gauss map at
x ∈ U ∩ M is a linear map
DG : Tx M → TG(x) Grm (Rm ) = Hom(Tx M, (Tx M )⊥ ).
As explained in (9.1.16), this differential described by the (n − m) × m matrix of 1-forms
eα • dej , α,i = eα • D ej , α,i
ΦA
ij := ei ΦA 0
j ∈ Ω (M ∩ U ),
so that
ΦA A i
j = Φij ∧ φ .
We have thus obtained the following result.
Proposition 9.2.1. The shape operator of M , that is the differential of the Gauss map,
is locally described by the matrix of 1-forms (Φαi )1≤i≤m<α<n . More precisely the operator
S M (ei ) ∈ Hom(Tx M, (Tx M )⊥ )
is given by
S M (ei )ej = (D ei ej )ν = Φαij eα , ∀i, j. ⊔
⊓
In Subsection 4.2.4 we have defined the second fundamental form of the submanifold
M to be the C ∞ (M )-bilinear and symmetric map
SM : Vect (M ) × Vect (M ) → C ∞ (T M )⊥ , (X, Y ) 7→ (D X Y )ν .
Note that
SM (ei , ej ) = φα (D ei ej )eα = Φαij eα = S M (ei )ej . (9.2.1)
The equality (4.2.12) can be rewritten as
ej • (Dei eα ) = −(D ei ej ) • eα = −eα • SM (ei , ej ), ∀i, j, α. (9.2.2)
Theorema Egregium states that the Riemann curvature tensor of M is completely deter-
mined by the second fundamental form. We recall this result below.
9.2. GAUSS–BONNET AGAIN?!? 403
The Gauss map GM : M → Grm (Rm+1 ) of the embedding M ֒→ Rm+1 can be given the
description,
'' w S m
~G
M
G
)' u π
Grm (Rm+1 )
We fix an oriented orthonormal frame (f~0 , f~1 , . . . , f~m ) of Rm+1 , and we orient the unit
sphere S m ⊂ Rm+1 so that the orientation of Tf~0 S m is given by the ordered frame
(f~1 , . . . , f~m ).
1
The orientability assumption is superfluous. It follows from the Alexander duality theorem with Z/2
coefficients that a compact hypersurface of a vector space is the boundary of a compact domain, and in
particular, it is orientable.
404 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
σ 2h 22h π h h!
σ 2h = (2h + 1)ω 2r+1 =⇒ = . (9.2.3)
2 (2h)!
Hence Z
1
dAm = 1,
S m σm
so that Z
1 ~G∗ dAm = deg GM .
M
σm M
Denote by g the induced metric on M , and by R the curvature of g. We would like to
prove that the integrand ~G∗M dAm has the form
where dVM denotes the metric volume form on M and P (RM ) is a universal polynomial
of degree m2 in the curvature R of M .
Fix a positively oriented orthonormal frame ~e = (e1 , . . . , em ) of T M defined on some
open set U ⊂ M , and denote by ~θ = (θ 1 , . . . , θ m ) the dual coframe. Observe that
dVM = θ 1 ∧ · · · ∧ θ m .
We set
Sij := SM (ei , ej ) • n, Rijkℓ := g ei , R(ek , eℓ )ej .
Theorem 9.2.2 implies that
Sik Siℓ
Rijkℓ = Sik Sjℓ − Siℓ Sjk = . (9.2.4)
Sjk Sjℓ
is the 2 × 2 submatrix of S = (Sij )1≤i,j≤m lieing at the intersection of the i, j-rows with
the k, ℓ-columns. We can rephrase the equality (9.2.4) in a more convenient form.
First, we regard the curvature Rijkℓ at a point x ∈ M as a linear map
X
Λ2 Tx M → Λ2 Tx∗ M, R(ek ∧ eℓ ) = Rijkℓ θ i ∧ θ j .
i<j
S : Tx M → T ∗ M, Sej = Sij θ i .
9.2. GAUSS–BONNET AGAIN?!? 405
Λp S : Λk Tx M → Λp T ∗ M,
defined by
R = Λ2 S. (9.2.5)
We want to prove that det S can be described in terms of Λ2 S. To see this observe that
h X
^ X Y
h
= Rij,2s−1,2s θ i ∧ θ j = ǫ(ϕ) Rϕ(2s−1)ϕ(2s),2s−1,2s dVM ,
s=1 i<j ϕ∈S′m s=1
where S′m denotes the set of permutations ϕ of {1, 2, . . . , m = 2h} such that
We would like to give an alternate description of det S using the concept of pfaffian.
The Riemann curvature, like the curvature of any metric connection, can be regarded
as a 2-form with coefficients in the bundle of skew-symmetric endomorphisms of T M . It
is locally described by a m × m skew-symmetric matrix
Θ = Θg := Θij 1≤i,j≤m
Recall from Subsection 8.2.4 that the pfaffian of Θ is the differential form
(−1)h X
P f (Θ) := ǫ(ϕ)Θϕ(1)ϕ(2) ∧ · · · ∧ Θϕ(2h−1)ϕ(2h) ∈ Ω2h (U ),
2h h!
ϕ∈Sm
406 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
(−1)h X
P f (Θ) = ǫ(ϕ)Θϕ(1)ϕ(2) ∧ · · · ∧ Θϕ(2h−1)ϕ(2h) ,
h! ′ϕ∈Sm
We can simplify this some more if we introduce the set S′′m consisting of permutations
ϕ ∈ S′m such that
ϕ(1) < ϕ(3) < · · · < ϕ(2h − 1).
Observe that
S′M (2h)!
#S′m = (#S′′m )h! =⇒ #S′′m = = h = 1 · 3 · · · (2h − 1) =: γ(2h).
h! 2 h!
Then X
P f (Θ) = (−1)h ǫ(ϕ)Θϕ(1)ϕ(2) ∧ · · · ∧ Θϕ(2h−1)ϕ(2h) .
ϕ∈S′′
m
We have
1 X Y
h
P f (−Θ) = ǫ(σϕ) Rϕ(2j−1)ϕ(2j)σ(2j−1)σ(2j) dVM .
h!
(σ,ϕ)∈S′m ×S′m j=1
1 X
= ǫ(ϕ)Θϕ(1)ϕ(2) ∧ · · · ∧ Θϕ(2h−1)ϕ(2h)
#S′m ′ ϕ∈Sm
1 X Y
h h!
= ǫ(σϕ) Rϕ(2j−1)ϕ(2j)σ(2j−1)σ(2j) dVM = P f (−Θ).
(#S′m ) #S′m
(σ,ϕ)∈S′m ×S′m j=1
Hence
~G∗ dAM = (Λ2h S)(e1 ∧ e2 ∧ · · · ∧ e2h−1 ∧ e2h ) = h!
M P f (−Θ),
#S′m
so that
2 2 h! (9.2.3) h! (2h)!
~G∗ dAM = P f (−Θ) = P f (−Θ)
M ′
σ 2h σ2h #Sm #Sm 2 π h h!
′ 2h
(9.2.6) 1
= P f (−Θ)
(2π)h
The last form is the Euler form e(∇M ) associated to the Levi–Civita connection ∇M .
Using the Gauss–Bonnet–Chern theorem we obtain the following result.
9.2. GAUSS–BONNET AGAIN?!? 407
2 ~∗ 1
GM dAm = e(∇M ) = P f (−Θg ),
σm (2π)h
and Z
2 deg ~GM = e(∇M ) = χ(M ).
M
1 X Y
h
P f (−Θg ) = ǫ(σϕ) Rϕ(2j−1)ϕ(2j)σ(2j−1)σ(2j) dVM (9.2.7a)
h!
(σ,ϕ)∈S′2h ×S′m j=1
X
= ǫ(ϕ)Θϕ(1)ϕ(2) ∧ · · · ∧ Θϕ(2h−1)ϕ(2h) , (9.2.7b)
ϕ∈S′′
2h
where S′2h denotes the set of permutations ϕ of {1, . . . , 2h} such that
and X
Θij = Rijkℓ θ k ∧ θ ℓ . ⊔
⊓
k<ℓ
1, 2, 3, 4, ǫ = 1 −→ Θ1212 Θ3434 ,
1, 3, 2, 4, ǫ = −1 −→ −Θ1312 Θ2434
1, 4, 2, 3, ǫ = 1 −→ Θ1412 Θ2334
We deduce
P f (−Θg ) = Θ12 ∧ Θ34 − Θ13 ∧ Θ24 + Θ14 ∧ Θ23
1 1
= (R12ij θ i ∧ θ j ) ∧ (R34kℓ θ k ∧ θ ℓ ) − (R13ij θ i ∧ θ j ) ∧ (R24kℓ θ k ∧ θ ℓ )
4 4
1
+ (R14ij θ i ∧ θ j ) ∧ (R23kℓ θ k ∧ θ ℓ )
4
408 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Each of the three exterior products above in involves only six nontrivial terms, because
θ i ∧ θ j = 0, when i = j. Thus the total number of terms in the above expression is 36.
However, there are many repetitions due to the symmetries of the Riemann tensor
We have
(R12ij θ i ∧ θ j ) ∧ (R34kℓ θ k ∧ θ ℓ )
= R1212 R3434 + R1213 R3442 + R1214 R3423
+R1223 R3414 + R1242 R3413 + R1234 R3412 dVM , etc.
(c) We can choose the positively oriented local orthonormal frame (e1 , . . . , em ) so that it
diagonalizes SM at a given point x ∈ M . Then the eigenvalues of SM at x are called the
principal curvatures at x and are denoted by κ1 (x), . . . , κm (x). Then
P f(−Θ) = ρdVM , ρ ∈ C ∞ (M ),
where
m
Y
ρ(x) = (2h − 1)!! κi (x), ∀x ∈ M. ⊔
⊓
k=1
We deduce
1 ~∗ 1
GD dAm = det(−SD )dV∂D ,
σm σm
where dV∂D denotes the volume form on ∂D.
A smooth vector field on D̄,
X : D̄ → Rm+1
is called admissible if along the boundary it points towards the exterior of D,
X • n > 0, on ∂D.
Let us observe that the map X̄ is homotopic to the map ~GD . Indeed, for t ∈ [0, 1], define
1
Yt : ∂D → S m , Yt (p) = ((1 − t)n + tX̄
|(1 − t)n + tX̄|
Hence
deg ~GD = deg X̄,
for any admissible vector field X.
Suppose X is a nondegenerate admissible vector field. This means that X has a finite
number of stationary points,
ZX = {p1 , . . . , pν }, X(pi ) = 0,
and all of them are nondegenerate, i.e., for any p ∈ ZX , the linear map
is invertible. Define
ǫX : ZX → {±1}, ǫ(p) = sign det AX,p .
For any ε > 0 sufficiently small the closed balls of radius ε centered at the points in ZX
are disjoint. Set [
Dε = D \ Bε (p).
p∈ZX
Set
1 ∗
Ω := X̄ dAm .
σm
Observe that
1 ∗
dΩ = X̄ d(dAm ) = 0 on Dε .
σm
Stokes’ theorem then implies that
Z Z Z X Z
Ω= ~
dΩ = 0 =⇒ deg GD = Ω= Ω,
∂Dε Dε ∂D p∈ZX ∂Bε (p)
where the spheres ∂Bε (p) are oriented as boundaries of the balls Bε (p). If we let ε → 0
we deduce X
deg ~GD = ǫX (p), (9.2.8)
p∈ZX
is an isomorphism. This map is independent of the connection ∇, and we denote by ǫX̂ (q)
the sign of its determinant. Moreover
ZXb = ZX ∪ ϕ(ZX ),
satisfies
ǫX (p) = ǫXb (ϕ(p)).
Hence
X X (9.2.8)
ǫXb (q) = 2 ǫX (q) = 2 deg ~GD .
q∈ZX
b p∈ZX
On the other hand, the general Poincaré-Hopf theorem (see Corollary 7.3.48) implies that
X
ǫX̂ (q) = χ(D̂).
q∈ZX̂
E1 . Sn contains all the real algebraic subsets of Rn , i.e., the subsets described by finitely
many polynomial equations.
S = S1 ∪ · · · ∪ Sν ,
The Tarski-Seidenberg theorem implies that the projection of Zn on the subspace with
coordinates (ai ) is a semialgebraic set
Rn := ~a = (a0 , . . . , an−1 ) ∈ Rn ; ∃x ∈ R, P~a (x) = 0 ,
where
P~a (x) = a0 + a1 x + · · · an−1 xn−1 + xn .
In other words, the polynomial P~a has a real root if and only if the coefficients ~a satisfy at
least one system of polynomial inequalities from a finite, universal, collection of systems
of polynomial inequalities.
2
This is a highly condensed and special version of the traditional definition of structure. The model
theoretic definition allows for ordered fields, other than R, such as extensions of R by “infinitesimals”.
This can come in handy even if one is interested only in the field R.
9.3. CURVATURE MEASURES 413
For example, when n = 2, the resolvent set R2 is described by the well known inequality
a21
− 4a0 ≥ 0.
When n = 3, we can obtain a similar conclusion using Cardano’s formulæ. For n ≥ 5,
we know that there cannot exist any algebraic formulæ describing the roots of a degree n
polynomial, yet we can find algebraic inequalities which can decide if such a polynomial
has at least one real root. ⊔
⊓
is S-definable. The reason these sets are called definable has to do with mathematical
logic.
A formula 3 is a property defining a certain set. For example, the two different looking
formulas
x ∈ R; x ≥ 0}, x ∈ R; ∃y ∈ R : x = y 2 },
describe the same set, [0, ∞).
Given a collection of formulas, we can obtain new formulas, using the logical operations
∧, ∨, ¬, and quantifiers ∃, ∀. If we start with a collection of formulas, each describing
an S-definable set, then any formula obtained from them by applying the above logical
transformations will describe a definable set.
To see this, observe that the operators ∧, ∨, ¬ correspond to the boolean operations,
∩, ∪, and taking the complement. The existential quantifier corresponds to taking a
projection. For example, suppose we are given a formula φ(a, b), (a, b) ∈ A × B, A, B
definable, describing a definable set C ⊂ A × B. Then the formula
a ∈ A; ∃b ∈ B : φ(a, b)
is S-definable.
Observe that any Grassmannian Grk (Rn ) is a semialgebraic subset of the Euclidean
space End+ (R) of symmetric operators Rn → Rn because it is defined by the system of
algebraic (in)equalities
Grk (Rn ) = P ∈ End+ (Rn ); P 2 = P, Λk P 6= 0, Λm P = 0, ∀m > k ,
and we deduce that cl(A) is also S-definable. Let us examine the correspondence between
the operations on formulas and operations on sets on this example.
We rewrite this formula as
∀ε (ε > 0) ⇒ ∃a(a ∈ A) ∧ (x ∈ Rn ) ∧ (|x − a| < ε) .
In the above formula we see one free variable x, and the set described by this formula
consists of those x for which that formula is a true statement.
The above formula is made of the “atomic” formulæ,
which all describe definable sets. The logical connector ⇒ can be replaced by ∨¬. Finally,
we can replace the universal quantifier to rewrite the formula as a transform of atomic
formulas via the basic logical operations.
n o
¬ ∃ε¬ (ε > 0) ⇒ ∃a(a ∈ A) ∧ (x ∈ Rn ) ∧ (|x − a| < ε) .
T. Any set A ∈ S1 is a finite union of open intervals (a, b), −∞ ≤ a < b ≤ ∞, and
singletons {r}. ⊔
⊓
Example 9.3.4. (a) The collection of real semialgebraic sets Salg is a tame structure.
(b)(Gabrielov-Hironaka-Hardt) A restricted real analytic function is a function f : Rn → R
with the property that there exists a real analytic function f˜ defined in an open neigh-
borhood U of the cube Cn := [−1, 1]n such that
(
f˜(x) x ∈ Cn
f (x) =
0 x ∈ R n \ Cn .
we denote by San the structure obtained from Salg by adjoining the graphs of all the
restricted real analytic functions. For example, all the compact, real analytic submanifolds
of Rn belong to San . The structure San is tame.
(c)(Wilkie, van den Dries, Macintyre, Marker ) The structure obtained by adjoining to
San the graph of the exponential function R → R, t 7→ et , is a tame structure.
(d)(Khovanski-Speissegger ) There exists a tame structure S′ with the following properties
(d1 ) San ⊂ S′ .
is also b
San -definable. ⊔
⊓
The definable sets and functions of a tame structure have rather remarkable tame
behavior which prohibits many pathologies. It is perhaps instructive to give an example
of function which is not definable in any tame structure. For example, the function
x 7→ sin x is not definable in a tame structure because the intersection of its graph with
the horizontal axis is the countable set πZ which violates the o-minimality condition T.
We will list below some of the nice properties of the sets and function definable in a
tame structure S. Their proofs can be found in [31, 36].
• (Piecewise smoothness of tame functions and sets.) Suppose A is an S-definable set, p is
a positive integer, and f : A → R is a definable function. Then A can be partitioned into
4
Our definition of pfaffian closure is more restrictive than the original one in [78, 122], but it suffices
for many geometric applications.
416 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
finitely many S definable sets S1 , . . . , Sk , such that each Si is a C p -manifold, and each of
the restrictions f |Si is a C p -function. The dimension of A is then defined as max dim Si .
• (Dimension of the boundary.) If A is an S-definable set, then dim(cl(A) \ A) < dim A.
• (Closed graph theorem.) Suppose X is a tame set and f : X → Rn is a tame bounded
function. Then f is continuous if and only if its graph is closed in X × Rn .
• (Curve selection.) If A is an S-definable set, k > 0 an integer, and x ∈ cl(A) \ A, then
there exists an S-definable C k -map γ : (0, 1) → A such that x = limt→0 γ(t).
• (Triangulability.) For every compact definable set A, and any finite collection of defin-
able subsets {S1 , . . . , Sk }, there exists a compact simplicial complex K, and a definable
homeomorphism
Φ:K→A
such that all the sets Φ−1 (Si ) are unions of relative interiors of faces of K.
• Any definable set has finitely many connected components, and each of them is definable.
• (Definable selection.) Suppose A, Λ are S-definable. Then a definable family of subsets
of A parameterized by Λ is a subset
S ⊂ A × Λ.
We set
Sλ := a ∈ A; (a, λ) ∈ S ,
and we denote by ΛS the projection of S on Λ. Then there exists a definable function
s : ΛS → S such that
s(λ) ∈ Sλ , ∀λ ∈ ΛS .
• (Definability of dimension.) If (Sλ )λ∈Λ is a definable family of definable sets, then the
function
Λ ∋ λ 7→ dim Sλ ∈ R
is definable. In particular, its range must be a finite subset of Z.
• (Definability of Euler characteristic.) Suppose (Sλ )λ∈Λ is a definable family of compact
tame sets. Then the map
X [
τ
w S×F
[℄ π
Φ S
.
S
9.3. CURVATURE MEASURES 417
Ψ : Ψ−1 (Yk ) → Yk
is definably trivial. ⊔
⊓
is S-definable.
(c) Prove that the curve
C = (t, ϕ(t) ); t ∈ [0, 1]
has finitely many components, and each of them has finite length. ⊔
⊓
E0
A
w E1
ρ0 (g)
u u ρ (g)
1
E0
A
w E1
We will denote by HomO(V ) (E0 , E1 ) the spaces of morphisms of O(V )-modules.
The vector space V has a tautological structure of O(V )-module given by
where
hρ† (g)λ, vi = hλ, g −1 vi, ∀λ ∈ V ∗ , v ∈ V.
Equivalently, ρ† (g) = ρ(g −1 )† , where ρ(h)† denotes the transpose of ρ(h). We obtain an
action on (V ∗ )⊗n ,
(ρ† )⊗n : O(V ) → Aut (V ∗ )⊗n , g 7→ ρ† (g)⊗n .
We denote by (V ∗ )⊗n
O(V ) the subspace consisting of invariant tensors,
⊗n
ω ∈ (V ∗ )⊗n
O(V ) ⇐⇒ ρ† (g) ω = ω, ∀g ∈ O(V ).
Observe that (V ∗ )⊗n can be identified with the vector space of multi-linear maps
ω :Vn =V · · × V} → R,
| × ·{z
n
so that (V ∗ )⊗nO(V )
can be identified with the subspace of O(V )-invariant multilinear maps
n
V → R.
Hermann Weyl has produced in his classic monograph [133] an explicit description
of (V ∗ )⊗n
O(V ) . We would like to present here, without proof, this beautiful result of Weyl
since it will play an important role in the future. We follow the elegant and more modern
presentation in Appendix I of [8] to which we refer for proofs.
Observe first that the metric duality defines a natural isomorphism of vector spaces
D : V → V ∗ , v 7→ v † ,
defined by
hv † , ui = (v, u), ∀u, v ∈ V.
This isomorphism induces an isomorphism of O(V )-modules
D : (V, ρ) → (V ∗ , ρ† ).
(V ∗ )⊗(r+s) ∼
= (V ∗ ⊗r) ⊗ V ⊗s ∼
= Hom(V ⊗r , V ⊗s )
In particular,
(V ∗ )⊗(r+s) ∼
= Hom(V ⊗r , V ⊗s ) = HomO(V ) (V ⊗r , V ⊗s ).
O(V ) O(V )
Let us observe that if we denote by Sr the group of permutations of {1, . . . , r}, then for
every ϕ ∈ Sr we obtain a morphism of O(V )-modules
HomO(V ) (V ⊗r , V ⊗s ) 6= 0 ⇐⇒ r = s,
and that nX o
HomO(V ) (V ⊗r , V ⊗r ) = R[Sr ] := cϕ Tϕ ; cϕ ∈ R, .
ϕ∈Sr
(V ∗ )⊗n
O(V ) 6= 0 ⇐⇒ n = 2r, r ∈ Z≥0 ,
and (V ∗ )⊗2r
O(V ) is spanned by the multilinear forms
Pϕ : V 2r → R, (ϕ ∈ Sr ),
defined by
Pϕ (u1 , . . . , ur , v 1 , . . . , v r ) = u1 , v ϕ(1) · · · ur , v ϕ(r) .
The above result has an immediate consequence. Suppose we have a map
· · × V} → R, (v 1 , . . . , v n ) 7→ f (v 1 , . . . , v n ),
| × ·{z
f :V
n
Polf : V d1 × · · · × V dn → R
= the coefficient of the monomial t11 t12 · · · t1d1 · · · tn1 · · · tndn in the polynomial
X
d1 dn
X
Pf (t11 , t12 , . . . , t1d1 , . . . , tn1 , . . . , tndn ) = f tij uj1 , . . . , tnj ujn .
j=1 j=1
Observe that
f (v 1 , . . . , v n ) = Polf (v 1 , . . . , v 1 , . . . , v n , . . . , v n ),
| {z } | {z }
d1 dn
· · × V} → R, qij (v 1 , . . . , v n ) := (v i , v j ).
| × ·{z
qij : V
n
420 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Example 9.3.8. (a) Consider the space E = V ⊗k . Observe that a degree n homogeneous
polynomial P on E can by identified with an element in the symmetric tensor product
Symd (E ∗ ) ⊂ (V ∗ )⊗2kn .
is the trace X X
tr(T ) = Ti,j ei ⊗ ej = Tii .
i,j i
Here is briefly how to find a basis of the space of degree k invariant polynomials
P : V ⊗h → R.
First of all, such polynomials exist if and only if hk is even, hk = 2m. Fix an orthonormal
basis { e1 , · · · , ed } of V . A tensor T ∈ V ⊗h decomposes as
X
T = Ti1 ...ih ei1 ⊗ · · · ⊗ eih .
1≤i1 ,...,ih ≤d
µ : Ihk → {1, 2, . . . , d}
such that i ∼ j =⇒ µ(i) = µ(j). Equivalently, F∼ , can be identified with the set of
functions µ : Ihk / ∼→ {1, . . . , d}, so there are dm such functions.
For any matching ∼ define
X
P∼ (T ) = Tµ(1)...µ(h) · · · Tµ(hk−h+1)...µ(hk) .
µ∈F∼
The collection
P∼ ; ∼ is a matching of Ihk
is a basis of the space of degree k invariant polynomials V ⊗h → R.
For example, the space of degree 1 invariant polynomials in tensors of order 4 has
4!
dimension 2(2!) = 6. Each polynomial in a basis constructed as above is a sum of d2
terms. We see that things are getting “hairy” pretty fast. ⊔
⊓
c := codim M = n − m.
In this section we will assume that M is compact and without boundary, but we will not
assume that it is orientable.
For r > 0 we define the tube of radius r around M to be the closed set
Tr (M ) := x ∈ M ; dist (x, M ) ≤ r ,
V (M, r) ≈ vol (M ) · ω c r c ,
PM (r) = pc r c + · · · + pn r n ,
• V (M, r) = PM (r),
• pc = ω c · vol (M ),
Let N(M ) denote the orthogonal complement of T M in (T Rn )|M , and we will call it
the normal bundle of M ֒→ Rn . We define
and we set
Nr (M ) := N(M ) ∩ Dr (Rn ).
The manifold with boundary Dr (Rn ) is a bundle of n-dimensional disks over Rn , while
Nr (M ) is a bundle of c-dimensional disks over M .
The exponential map E : T Rn → Rn restricts to an exponential map
EM : N(M ) → Rn .
Observe that because M is compact, there exists r0 = r0 (M ) > 0 such that, for every
r ∈ (0, r0 ), the exponential map EM induces a diffeomorphism
EM : Nr (M ) → Tr (M ).
and thus we can identify Drc × U with the open subset π −1 (U ) ⊂ Nr (M ), and we can use
x ∈ M and ~t ∈ Drc as local coordinates on π −1 (U ). Define
Tr (U ) := EM ( Nr (U ) ) ⊂ Rn ,
and
ẽA : Tr (U ) → Rn by ẽA (x + tα eα ) = eA (x).
We have thus extended in a special way the local frame (eA ) of (T Rn )|U to a local frame
of (T Rn )|Tr (U ) so that
D ẽα ẽA = 0, ∀α, A. (9.3.3)
9.3. CURVATURE MEASURES 423
φi = π ∗ θ i , φα = dtα .
Consider the 1-forms ΘA 1
B ∈ Ω Tr (U ) associated to the Levi–Civita connection D by
the frame (ẽA ) on Tr (U ), and set
ΘA
CB = ẽC ΘA
B , ∀i,
so that
D eC eB = ΘA
CB eA .
ΦA ∗ A 1 A ∗ A ∞
B = π (ΘB |M ) ∈ Ω (Nr (U )), ΦiB := π (ΘiB |U ) ∈ C (Nr (U ) ).
EM (tα eα (x), x) = x + tα eα .
We have X X
E∗M θ A = (eA • D ei EM )φi + (eA • ∂tα EM )dtα
i α
X X
= eA • (ei + tα Dei eα )φi + δAα dtα = δAi φi + tα ΦA i α
iα φ + δAα dt .
i α
Hence X
E∗M θ j = φj + tα Φjiα φi = φj − tα Φαij φi , E∗M θ β = dtβ .
α
Φij = (Φm+1
ij , . . . , Φnij ) : U → Rc ,
so that X
E∗M θ j = φj − (~t • Φij )φi .
i
Define the m × m symmetric matrix
S := S(~t, x) = ~t • Φij (x) 1≤i,j≤m
|dVn | = |θ m+1 ∧ · · · ∧ θ n ∧ θ 1 ∧ · · · ∧ θ m .|
424 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
E∗M dVn = | det(1 − S(~t, x) | |d~t ∧ dφ| = det(1 − S(~t, x) |d~t ∧ dφ|, (9.3.5)
d~t = dtm+1 ∧ · · · ∧ dtn , dφ = dφ1 ∧ · · · ∧ dφm .
Recalling that | ∧i dθ i |M | is the metric volume density |dVM | on M , we deduce
E∗M |dVn | = det(1 − S(~t, x) |d~t| × π ∗ |dVM |,
where |d~t| denotes the volume density on Rc . For simplicity we write |dVM | instead of
π ∗ |dVM |. Now set
1
ρ := |~t|, ω := ~t,
ρ
and denote by |dω| the area density on the unit sphere in Rc . Then
E∗M |dVn | = det(1 − ρS(ω, x) ρc−1 |dρ| × |dω| × |dVM |. (9.3.6)
Observe that
m
X
det(1 − ρS(ω, x) = (−1)ν ρν Pν Φij (x) • ω ,
ν=0
uij ∈ Rc , 1 ≤ i, j ≤ m.
We set Z
P ν Φij (x) := Pν Φij (x) • ω |dω|.
S c−1
Above, P ν uij is an O(c)-invariant, homogeneous polynomial of degree ν in the variables
uij ∈ Rc , 1 ≤ i, j ≤ m. We conclude,
m
X (−1)ν
π∗ E∗M |dVn | = r c+ν P ν Φij (x) |dVM (x)|. (9.3.7)
c+ν
ν=0
We would like to determine the invariant polynomials P ν uij .
Theorem 9.3.7 on invariants of the orthogonal group O(c) implies that P ν must be a
polynomial in the quantities
qi,j,k,ℓ := uik • ujℓ .
Because these quantities are homogeneous of degree 2 in the variables uij we deduce
P ν = 0 if ν is odd. Assume therefore ν = 2h, h ∈ Z≥0 .
For every ~t ∈ Rc = span (em+1 , . . . , em+c ) we form the linear operator
= the sum of all the ν × ν minors of U (~t) symmetric with respect to the diagonal.
9.3. CURVATURE MEASURES 425
ϕj := ϕ(ij ), ∀j = 1, . . . , 2h.
For any σ, ϕ ∈ SI we denote by ǫ(σ, ϕ) the signature of the permutation σ ◦ ϕ−1 , and by
Qσ,ϕ the invariant polynomial
h
Y h
Y
QI,σ,ϕ = qϕ2j−1 ,ϕ2j ,σ2j−1 ,σ2j = uϕ2j−1 σ2j−1 • uϕ2j σ2j .
j=1 j=1
Lemma 9.3.9. There exists a constant ξ = ξm,ν,c depending only on m, ν and c such that
X
µ̄I = ξQI , QI := ǫ(σ, ϕ)QI,σ,ϕ .
ϕ,σ∈SI
Proof. We regard µI as a function on the vector space of (2h) × (2h) matrices U with
entries in Rc
U = [uij ]i,j∈I .
We observe that µI satisfies the following determinant like properties.
where
{k1 , . . . , k2h } and {ℓ1 , . . . , ℓ2h }
are permutations of I. The skew-symmetry of µI with respect to the permutations of rows
and columns now implies that µI must be a multiple of QI .
⊔
⊓
so it suffices to compute the numerator and denominator of the above fraction for some
special values of uij . We can assume I = {1, 2, . . . , 2h} and we choose
1
0
uij = .. ∈ Rc .
.
0
Then, if we set
t1
t2 1
~t =
..
∈ Rc , ω = ~t
. |~t|
tc
we deduce
t1 0 · · · 0
0 t1 · · · 0
U (uij , ~t) = .. .. . . .. , µI (uij • ~t) = |t1 |ν .
. . . .
0 0 ··· t1
Hence Z
µI (uij ) = |ω 1 |2h |dω|. (9.3.8)
S c−1
h
Y
QI,σ,ϕ = uϕ2j−1 σ2j−1 • uϕ2j σ2j ,
j=1
which is nonzero if and only if σ = ϕ. We conclude that for this particular choice of uij
we have
QI = (2h)!.
Hence Z
1
ξm,ν,c = |ω 1 |2h |dω|.
(2h)! S c−1
At this point we invoke the following result whose proof is deferred to the end of this
subsection.
Lemma 9.3.10. For any even, nonnegative integers 2h1 , . . . , 2hc we have
Z
2Γ( 2h12+1 ) · · · Γ( 2hc2+1 )
|ω 1 |2h1 · · · |ω c |2hc |dω| = ,
S c−1 Γ( c+2h
2 )
where h = h1 + · · · + hc .
9.3. CURVATURE MEASURES 427
We deduce
2Γ( 2h+1
2 )Γ(1/2)
c−1
ξm,2h,c = , (9.3.9)
(2h)!Γ( c+2h
2 )
and X
P ν (uij ) = ξm,2h,c QI (uij ). (9.3.10)
#I=ν
We denote by S2 the group of permutations of a linearly ordered set with two elements.
We observe that every element
τ = (τ1 , . . . , τh ) ∈ G = S2 × · · · × S2
| {z }
h
X h
Y
= ǫ(σ, ϕ) qϕ2j−1 ,ϕ2j ,σ2j−1 ,σ2j − qϕ2j ,ϕ2j−1 ,σ2j−1 ,σ2j
ϕ∈S′I j=1
X h
Y
= ǫ(σ, ϕ) Rϕ2j−1 ϕ2j σ2j−1 σ2j
ϕ∈S′I j=1
X X X h
Y
ǫ(σ, ϕ)QI,σ,ϕ = ǫ(σ, ϕ) Rϕ2j−1 ϕ2j σ2j−1 σ2j
σ,ϕ∈SI σ∈SI ϕ∈S′I j=1
X h
Y
= ǫ(στ, ϕ) Rϕ2j−1 ϕ2j σ2j−1 σ2j
σ∈S′I ,τ ∈G j=1
X h
Y
h
=2 ǫ(σ, ϕ) Rϕ2j−1 ϕ2j σ2j−1 σ2j
σ,ϕ∈S′I j=1
| {z }
=:QI (R)
We conclude that X
P ν (ψij ) = 2h ξm,2h,c QI (R). (9.3.11)
#I=2h
428 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
⌊m/2⌋
X Z
2h ξm,2h,c
V (M, r) = vol (Tr (M )) = ω c+2h r c+2h Qh (R)|dVM |.
(c + 2h)ω c+2h M
h=0
where for every I = {i1 < i2 < · · · < i2h } ⊂ {1, . . . , m} we define
X h
Y
QI (R) = ǫ(σ, ϕ) Rϕ(i2j−1 )ϕ(i2j )σ(i2j−1 )σ(i2j ) . ⊔
⊓
σ,ϕ∈S′I j=1
µm (M ) = vol (M ).
(b) Assume now that m is even, m = 2h, and oriented. Then c + 2h = m + c = n and
Z
1 1
µ0 (M ) = h
Qh (R)dVM .
(2π) M h!
9.3. CURVATURE MEASURES 429
Comparing the definition of Qh (R) with (9.2.7a) we deduce that the top dimensional form
1 1
Qh (R)dVM ∈ Ω2h (M )
(2π)h h!
is precisely the Euler form associated with the orientation of M and the induced metric,
so that Z
µ0 (M, g) = e(M, g). (9.3.12)
M
(c) Suppose now that M is a hypersurface in Rm+1 . Consider the second fundamental
form
S = (Sij )1≤i,j≤m , Sij = (D ei ej ) • em+1 ,
where we recall that em+1 is in fact the oriented unit normal vector field along M . Fix
a point x0 ∈ M and assume that at this point the frame (e1 , . . . , em ) diagonalizes the
second fundamental form so that
Sij = κi δij .
The eigenvalues κ1 , . . . , κm are the principal curvatures at the point x0 . We denote by
cν (κ) the elementary symmetric polynomial of degree ν in the variables κi . In this case
c = 1, and we have
m
X
E∗M dVRm+1 = det(1 − tS)dt ∧ dVM = (−1)ν tν cν (κ)dt ∧ dVM
ν=0
m Z
X r ⌊m/2⌋ 2h+1
X r
π∗ E∗M dVRm+1 = tν dt cν (κ)dVM = 2 c2h (κ)dVM
ν=0 −r 2h + 1
h=0
so that
⌊m/2⌋
X X r 2h+1 Z
⌊m/2⌋
ω 1+2h r 1+2h µm−2h (M ) = V (M, r) = 2 c2h (κ)dVM .
2h + 1 M
h=0 h=0
We conclude that
Z
2
µm−2h (M ) = c2h (κ)dVM , σ 2h = (2h + 1)ω 1+2h .
σ 2h M
(d) Using the definition of the scalar curvature we deduce that for any m-dimensional
submanifold M m ֒→ Rn we have
Z
µm−2 (M, g) = constm sg |dVg |,
M
430 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
where s denotes the scalar curvature of the induced metric g, and constm is an universal
constant, depending only on m. We see that the map g → µm−2 (M, g) is precisely the
Hilbert-Einstein functional we discussed in Subsection 4.2.6, Definition 4.2.33.
To find the constm we compute µm−2 (M ) when M = S m . Using (9.3.13) we deduce
Z
2σ m m
= constm sround |dVS m |,
σ2 2 M
where sround denotes the scalar curvature of the round metric on the unit sphere.
Observe that the Grassmannian of oriented codimension one subspaces of Rm+1 can be
identified with the unit sphere S m . Hence, the oriented Gauss map of the unit sphere S m
is the identity. In particular, the shape operator of S m ֒→ Rm+1 is the identity operator
From Theorema Egregium we deduce that all the sectional curvatures of S n are equal to
one. Using the equalities (4.2.3) we deduce
X X
m
sround = Rijij = 1=2 .
2
i,j i,j
Hence
2σ m m m
= 2σ m constm
σ2 2 2
Z
1 1 1
=⇒ constm = = =⇒ µm−2 (M, g) = sg |dVg |.
σ2 4π 4π M
(e) The polynomial Q2 still has a “reasonable form”
X
Q2 (R) = QI
#I=4
The first sum has only three different monomials, each of them appearing twice is them
sum. The second sum has 62 different monomials (corresponding to subsets of cardinality
2 of S′I ) and each of them appears twice. ⊔
⊓
µw (M ) = 0.
We will say that µw (M, g) is the weight w curvature measure of (M, g). We set
1
|dµw | := Qh (R)|dVM |,
(2π)h h!
Remark 9.3.14. (a) Let us observe that for any Riemann manifold M , orientable or not,
the quantities |dµw | are indeed well defined, i.e. independent of the choice of local frames
used in their definition. The fastest way to argue this is by invoking Nash embedding
theorem which implies that any compact manifold is can be isometrically embedded in an
Euclidean space. For submanifolds of Rn , the proof of the tube formula then implies that
these densities are indeed well defined.
We can prove this by more elementary means by observing that, for any finite set I,
the relative signature ǫ(σ, ϕ) of two permutations ϕ, σ : I → I is defined by choosing a
linear ordering on I, but it is independent of this choice.
(b) The weight of the curvature density has a very intuitive meaning. Namely, we should
think of µw (M, g) as a quantity measured in meter w . ⊔
⊓
(u = s2 )
c Z
Y ∞ Yc 2h + 1
j
= e−u thj −1/2 du = Γ .
0 2
j=1 j=1
On the other hand, using spherical coordinates, ρ = |~t|, ω = |1~t| ~t, and recalling that
h = h1 + · · · + hc , we deduce that
Z Z ∞ 2
1 2h1 c 2hc
I(h1 , . . . , hc ) = |ω | · · · |ω | dω e−ρ ρ2h+c−1 dρ
S c−1 0
(u = ρ2 )
Z Z ∞
1 1 2h1 c 2hc c+2h
= |ω | · · · |ω | dω e−u u 2 −1 du
2 S c−1 0
Z
1 c + 2h
= Γ |ω 1 |2h1 · · · |ω c |2hc dω . ⊔
⊓
2 2 S c−1
432 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
so that,
V (Mr , ε) = V (M, r + ε) − V (M, r − ε),
which implies that X
ω 1+2h ε1+2h µn−1−2h (Mr , gr )
h≥0
X n o
= ω c+2k (r + ε)c+2k − (r − ε)c+2k µn−c−2k (M, g).
k≥0
We deduce
X
2 c + 2k c−1+2k−2h
µn−1−2h (Mr , gr ) = ω c+2k r µn−c−2k (M, g).
ω 1+2h 1 + 2h
k≥0
p := n − 1 − 2h, w := n − c − 2k = m − 2k.
If in the formula (9.3.14) we assume that the manifold M is a point, then we deduce that
Mr is the (n − 1)-dimensional sphere of radius r, Mr = Srn−1 , and we conclude that
n−1 n ωn p n p
µp (Sr ) = 2 r = 2ω p r , n − p ≡ 1 mod 2, (9.3.16)
p ω n−p p
9.3. CURVATURE MEASURES 433
where np is defined by (9.1.20). The last equality agrees with our previous computation
(9.3.13).
If in the formula (9.3.14) we let p = 0 we deduce
Observe that the tube Tr (M ) is naturally oriented, even though the manifold M may not
be orientable. The Gauss–Bonnet theorem for oriented hypersurfaces implies
χ(Mr ) = µ0 (Mr , gr )
so that
1
µ0 (M, g) = χ(Mr ), ∀0 < r ≪ 1. (9.3.17)
2
Theorem 9.3.15 (Gauss–Bonnet). Suppose M is a closed, compact submanifold of an
Euclidean space Rn . Denote by g the induced metric. Then
µ0 (M, g) = χ(M ).
Proof. If m = dim M is odd, then both χ(M ) and µ0 (M ) are equal to zero and the identity
is trivial. Assume therefore that m is even. If the dimension n of the ambient space is
odd, then the Poincaré duality for the oriented n-dimensional manifold with boundary
Tr (M ) implies
χ(Mr ) = χ( ∂Tr (M ) ) = 2χ( Tr (M ) ) = 2χ(M ),
and the theorem follows from (9.3.17).
If n is even, we apply the above argument to the embedding
M ֒→ Rn ֒→ Rn+1 ,
where we observe that the metric induced by the embedding M ֒→ Rn+1 coincides with
the metric induced by the original embedding M ֒→ Rn . ⊔
⊓
Remark 9.3.16. (a) We want emphasize again that in the above theorem we did not
require that M be orientable which is the traditional assumption in the Gauss–Bonnet
theorem.
(b) By invoking the Nash embedding theorem, we deduce that the tube formula implies
the Gauss–Bonnet formula for any compact Riemann manifold, orientable or not. ⊔
⊓
Let us record for later use the following corollary of the above proof.
Equivalently,
trj B = tr Λj B : Λj V → Λj V .
We define the tube of radius r around D to be
Tr (D) := x ∈ Rm+1 ; dist (x, D) ≤ r .
EM : ∆r → Tr (D) \ D,
so that Z
vol (Tr (D) ) = vol (D) + E∗M dVRm+1 .
∆r
9.3. CURVATURE MEASURES 435
(e1 , . . . , em )
We deduce Z Z
m
X r j+1
E∗M dVRm+1 = trj (−SM )dVM .
∆r j+1 M
j=0
Define Z
1
µm−j (D) := trj (−SM )dVM , 0 ≤ j ≤ m,
σj M
and
µm+1 (D) := vol (D)
so that using the equality σ j = (j + 1)ω j+1 we deduce the tube formula for domains,
m+1
X
vol Tr (D) = ω m+1−k r m+1−k µk (D). (9.3.18)
k=0
Theorem 9.2.5 shows that, just as in the case of submanifolds, we have µ0 (D) = χ(D).
Definition 9.3.19. Suppose D is a relatively compact domain with smooth boundary of
an Euclidean space V , dim V = n. Then the curvature densities of D are the densities
|dµj | on ∂D defined by
1
|dµj | := trn−j (−SD )|dV∂D |,
σn−j
where |dV∂D | denotes the volume density on ∂D induced by the Euclidean metric on V ,
and SD denotes the co-oriented second fundamental form of ∂D. ⊔
⊓
We denote by Dm+1
r the ball of radius r in Rm+1 . Then,
Tε (Dm+1
r ) = Dm+1
r+ε ,
so that X
ω m+1 (r + ε)m+1 = ω m+1−k εm+1−k µk (Dm+1
r ).
k≥0
436 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
We conclude
ω m+1 m + 1 k m+1 k
µk (Dm+1
r )= r = ωk r . (9.3.19)
ω m+1−k k k
Suppose X ֒→ Rm+1 is a closed, compact smooth submanifold. Then for every suffi-
ciently small r > 0, the tube Dr := Tr (X) is a compact domain with smooth boundary
and
Tε (Dr ) = Tr+ε (X).
The tube formula for X implies that
X X
ω j εj µm+1−j (Dr ) = ω k r + ε)k µm+1−k (X).
j≥0 k≥0
We deduce that
1 X k k−j
µm+1−j (Dr ) = ωk r µm+1−k (X)
ωj j
k≥j
p := n − j, w := n − k.
In particular, we have
lim µp Tr (X) = µp (X), ∀0 ≤ p ≤ dim X. (9.3.21)
r→0
We rescale this volume density as in (9.1.21) to obtain a new volume density |dνc | with
total volume Z
n
|dνc | = . (9.3.22)
Grc c
As explained in Subsection 9.1.3, these two densities produce two invariant densities |de
γc |
νc | on Graff c which differ by a multiplicative constant.
and |de
9.3. CURVATURE MEASURES 437
Proof. For simplicity, we set V = Rn , m = n − 1 = dim M . We will carry out the proof
in several steps.
Step 1. We will prove that there exists a constant ξm,c,p, depending only on m, c, and p
such that Z
ξn,c,pµp+c (D) = µp (L ∩ D)|deνc (L)|.
Graff c
Step 2. We will show that the constant ξ is equal to p+c p by explicitly computing both
sides of the above equality in the special case D = D m+1 .
Step 1. We will rely on a basic trick in integral geometry. For every S ∈ Graff c we
denote by [S] ∈ Grc the parallel translate of S containing the origin, [S] = S − S. We
introduce the incidence relation
I = (v, S) ∈ V × Graff c ; v ∈ S ⊂ V × Graff c .
with inverse
V × Grc (V ) ∋ (v, L) 7−→ (v, v + L) ∈ I.
We obtain a double fibration
')'r
^[[
ℓ
I
V Graff c
and we set
I(M ) := ℓ−1 (M ) = (v, S) ∈ V × Graff c ; v ∈ S ∩ M .
Since dim I = dim V + dim Grc = n + c(n − c) we deduce
M Graff c
438 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
The map r need not be a submersion. Fortunately, Sard’s theorem shows that r fails to
be a surjection on a rather thin set.
Denote by Graff c (M ) the set of codimension c affine planes which intersect M transver-
sally. The set Graff c (M ) is an open subset of Graff c , and Sard’s theorem implies that
its complement has measure zero. We set
I(M )∗ := r −1 Graff c (M ) .
The set I(M )∗ is an open subset of I(M ), and we obtain a double fibration
I(M )∗
DAAA r
ℓ (9.3.23)
c
M Graff (M )
The fiber of r over L ∈ Graff c is the slice ML := L ∩ M which is the boundary of the
domain DL := (L ∩ D) ⊂ L.
The vertical bundle of the fibration r : I∗ (M ) → Graff c (M ) is equipped with a natural
density given along a fiber L ∩ M by the curvature density |dµk | of the domain DL . We
will denote this density by |dµL k |. As explained in Subsection 9.1.1, using the pullback
r ∗ |de
γc | we obtain a density
|dλ| := |dµL ∗
p | × r |de
γc |
on I∗ (M ) satisfying,
Z Z Z ! Z
|dλ| = |dµL
p| |de
γc (L)| = µp (L ∩ D)|de
γc (L)|.
I∗ (M ) Graff c (M ) L∩M Graff c
To complete Step 1 in our strategy it suffices to prove that there exists a constant ξ,
depending only on m and c such that
ℓ∗ |dλ| = ξ|dµc |,
• The collection
{e0 (S), e1 (S), . . . , eh (S), eh+1 (S), . . . , em (S)}
is an orthonormal frame of Rn .
θAB := eA • (D eB ).
Then ^ ^
|de
γc | = D ~r • eα × |dγc | = θ α × |dγc |,
α α
and ^
1
|dλ| = |dµL
p | × |de
γc | = det(−SL∩M )|dVL∩M | × θ α × |dγc |. (9.3.24)
σk α
The fiber of ℓ : I(M ) → M over x0 is described by
Gx0 := (~r, eA (S)) ∈ I(M ), ~r = x0 .
We set
G∗x0 := Gx0 ∩ I∗ (M ).
G∗x0 (M ) can be identified with the space of linear subspaces S of codimension c such that
Tx0 M + S = V , i.e., the affine subspace x0 + S intersects M transversally at x0 .
Denote by n a smooth unit normal vector field defined in a neighborhood of x0 in M ,
i.e.,
n(x) ⊥ Tx M, |n(x)| = 1.
440 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
|(n • e0 )| · |dVM | = |θ 1 ∧ · · · ∧ θ m |,
Proof. It suffices to verify this for one basis X1 , . . . , Xm of Tx0 M which we can choose to
consists of the orthogonal projections f 1 , . . . , f m of e1 , . . . , em . These projections form a
basis since S intersects Tx0 M transversally.
Observe that
f i = ei , ∀1 ≤ i ≤ h, f α = eα − (eα • n)n.
Then
|dVM |(f 1 , . . . , f m )2 = det(f A • f B )1≤A,B≤m
We observe that
f i • f j = δij , f i • f α = 0, ∀1 ≤ i, j ≤ 2h < α
f α • f β = δαβ − nα nβ , nα := n • eα .
We deduce
|dVM |(f 1 , . . . , f m )2 = det(1 − A),
where A denotes the c × c symmetric matrix with entries nα nβ , h + 1 ≤ α, β ≤ m = h + c.
If we denote by ~u the vector
nh+1
~u = ... ∈ Rc ,
nh+c
which we also regard as a c × 1 matrix, then we deduce that A = ~u~u† .
This matrix has a c − 1 dimensional kernel corresponding to vectors orthogonal to ~u.
The vector ~u itself is an eigenvector of A, and the corresponding eigenvalue λ is obtained
from the equality
X
λ~u = |~u|2 ~u =⇒ λ = |~u|2 = n2α = |n|2 − |n • e0 |2 = 1 − |n • e0 |2 .
α
We conclude that
We have again
ei • f j = δij , ei • f α = 0, ∀1 ≤ i, j ≤ h < α,
9.3. CURVATURE MEASURES 441
eα • f β = δαβ − nα nβ ,
so that
|θ 1 ∧ · · · ∧ θ m |(f 1 , . . . , f m ) = |n • e0 |2 .
The lemma is now proved. ⊔
⊓
that is,
SL (ei , ej ) = (n • e0 )SD (ei , ej ), ∀1 ≤ i, j ≤ 2h.
Proof. We have
SL (ei , ej ) = e0 • (D ei ej ).
Let us now observe that the vector (D ei ej ) is parallel with the plane L because the vectors
ei and ej lie in this plane. Thus, D ei ej decomposes into two components, one component
parallel to e0 , and another component, (Deei ej )τ , tangent to L ∩ M . Hence
h
X
k
D ei ej = SL (ei , ej )e0 + Sij ek .
k=1
m
1 ^
= |n • e0 |k trk (−SD |Tx M ∩[L] ) θ A × |dγc |
σk
A=1
If we denote by θ([L], Tx0 M ) the angle between [L] and the hyperplane Tx0 M we deduce
1
|dλ|/|dVM | = · | cos θ([L], Tx0 M )|k+1 trk (−SD |Tx0 M ∩[L] ) |dγc |([L]).
σk
The map G∗x0 ∋ (x0 , L) 7−→ [L] ∈ Grc identifies G∗x0 with an open subset of Grc whose
complement has measure zero. We now have the following result.
Lemma 9.3.23. Suppose that V is an Euclidean space, dim V = m + 1, H ⊂ V is a
hyperplane through the origin, and B : H × H → R a symmetric bilinear map. Denote by
O(H) the subgroup of orthogonal transformations of V which map H to itself and suppose
that
f : Grc → R
is an O(H)-invariant function. Define
GrcH := S ∈ Grc ; S intersects H trasversally .
Let us prove that Qf ≡ 0. To do this, we apply the above formula to a symmetric bilinear
form B such that
dim ker B > m − k.
Thus, at least m − k + 1 of the m eigenvalues of B vanish, so that trk (B) = 0. For such
forms we have
Ik (f, B) = Q tr1 (B), . . . , trk−1 (B) .
On the other hand, for almost all S ∈ GrcH we have
Now choose B to be the bilinear form corresponding to the inner product on H. Then
m m−c
trk (B) = and trk (B|H∩S ) = , ∀S ∈ GrcH ,
k k
9.3. CURVATURE MEASURES 443
Hence
ω m+1 m+1 ω m+1−c m + 1 − c m+1
ξ = Ic,p .
ω m+1−c−p p+c ω m+1−c−p p c
Using spherical coordinates on Rc we deduce
Z Z 1
2 p/2
Ic,p = (1 − |x| ) dVRc = σ c−1 r c−1 (1 − r 2 )p/2 dr
Rc 0
Z
σ c−1 1 c−2
s=r 2
= s 2 (1 − s)p/2 ds
2 0
(9.1.9) σ c p
(9.1.8) σ c−1 c p c−1 Γ( 2 )Γ(1 + 2 )
= B , +1 =
2 2 2 2 Γ(1 + 2c + p2 )
(9.1.10) Γ(1 + 2p )
c ω p+c
= Γ(1/2) = .
Γ(1 + 2c + 2p ) ωp
Hence
m+1 ω m+1−c ω p+c m + 1 m + 1 − c
ξω m+1 =
p+c ωp c p
ω m+1 ω p+c m + 1 m + 1 − c
= .
ωpωc c p
We deduce
m+1
ω p+c c ω p+c p + c p+c
ξ=
ω p ωc m+1 m+1−c =
ω p ωc p
=
p
.
p+c p
⊔
⊓
Proof. We know that there exists a tame structure S such that D ∈ Sn . In particular,
M = ∂D ∈ Sn . Since the Grassmannian Graff c is semialgebraic, we deduce that Graff c
is also S-definable. Hence, the incidence set
I = ((x, L) ∈ M × Graff c ; x ∈ L
π : I → Graff c , (x, L) 7→ L.
9.3. CURVATURE MEASURES 445
The fiber of π over L is the intersection L∩M . From the definability of Euler characteristic
we deduce that the map
Graff c ∋ L 7→ χ(L ∩ M ) ∈ Z
is definable. Its range is a definable subset of Z, i.e., a finite set. To see that it has
compact support it suffices to observe that since M is compact, an affine plane which is
too far from the origin cannot intersect M . ⊔
⊓
Definition 9.3.25. For any compact tame subset S ⊂ Rn , and for every integer 0 ≤ p ≤ n
we define Z
µ̂p (S) := χ(L ∩ D) |deν |(L). ⊔
⊓
Graff p (Rn )
The proof of Proposition 9.3.24 shows that the integral in the above definition is well
defined and finite. This proposition also shows that
(d) Suppose T ⊂ Rn is a triangle, i.e., the convex hull of three non collinear points.
Compute µ̂p (T ), ∀p ≥ 0. ⊔
⊓
Then the function Λ ∋ λ 7−→ fp (λ) := µ̂p (Aλ ) ∈ R is bounded and Borel measurable.
446 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Proof. Denote by Graff p (K) the set of affine planes of codimension p in Rn which intersect
K. This is a compact, definable subset of Graff p . Define
The measurability follows from the classical Fubini theorem. The boundedness is obvious.
⊔
⊓
Graff cD ∋ L 7→ µp (L ∩ D)
Proof. The set Graff cD is a tame open and dense subset of Graff c so that its complement
has measure zero. Moreover
that is,
µ̂c (M ) = µc (M ).
Proof. We can assume that k = codim M < 1. For every x ∈ Rn set d(x) := dist (x, M ).
Fix R > 0 such that for any x such that d(x) ≤ R there exists a unique point x̄ ∈ M such
that
|x − x̄| = d(x).
For every r < R consider the tube of radius r, around M ,
Dr := Tr (M ),
µc (M ) = lim µc (Dr ).
r→0
The tube Dr is a tame domain with smooth boundary, and Theorem 9.3.20 implies
Z
µc (Dr ) = χ(L ∩ Dr )|dν|(L).
Graff c
Let
Graff c (Mr ) := L ⊂ Graff c ; L intersects Mr transversally .
Observe that Graff c (M ) is an open subset of Graff c with negligible complement. For
every r > 0 we set
Xr = L ∈ Graff c (M ); L ∈ Graff c (Ms ); χ(L ∩ Ds ) = χ(L ∩ M ), ∀s ∈ (0, r]
Observe that
Graff c (M, r1 ) ⊂ Graff c (M, r0 ), ∀r1 ≥ r0 .
To proceed further we need the following technical result, whose proof will presented at
the end of this subsection.
448 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Set
Graff c∗ := L ∈ Graff c ; L ∩ DR 6= ∅ .
The region Graff c∗ (M ) is a relatively compact subset of Graff c , and thus it has finite
measure. Define
X∗r := Xr ∩ Graff c∗ , Y∗r := Graff c∗ \X∗r .
For 0 < r < R we have
Z Z
µc (Dr ) = νc |(L) =
fr (L)|de νc |(L)
fr (L)|de
Graff c Grc∗
Z Z Z Z
= νc | +
fr (L)|de νc | = 2
fr (L)|de νc | +
f0 (L)|de νc |
fr (L)|de
X∗r Y∗r X∗r Y∗r
Hence
Z Z
µc (Mr ) − νc | ≤
f0 (L)|de νc | ≤ Cvol (Y∗r ).
|fr (L)||de
X∗r Y∗r
Proof of Lemma 9.3.31. We will prove that for any given L0 ∈ Graff c (M ) there exists
and ρ = ρ(L0 ) such that
L0 ∈ Xρ .
The measurability follows from the fact that Xr is described using countably many boolean
operations on measurable sets.
Consider the normal bundle
N := (T M )⊥ → M.
0
Ny(r) L0
Dr
For every r > 0 we set Ny0 (r) := Ny0 ∩ Dr ; see Figure 9.2.
The collection (Ny0 )y∈L0 ∩M forms a vector subbundle N 0 → L0 ∩ M of (T Rn )|L0 ∩M .
The exponential map on T Rn restricts to a map
EL0 ∩M : N 0 → L0 .
δL0 = d ◦ EL0 ∩M : N 0 → R.
π : L0 ∩ Dρ → L0 ∩ M,
which is continuous and defines a locally trivial fibration with fibers Ny0 (ρ).
For every y ∈ L0 ∩ M , the fiber Ny0 (ρ) is homeomorphic to a disk of dimension k.
Thus, L0 ∩ Dρ is homeomorphic to a tube in L0 around L0 ∩ M ⊂ L0 , so that
χ(L0 ∩ Dρ ) = χ(L0 ∩ M ).
The downward gradient flow of the restriction to L0 ∩ Dρ of the distance function d(x)
produces diffeomorphisms of manifolds with boundary
L0 ∩ Dρ ∼
= L0 ∩ Dr , ∀r ∈ (0, ρ).
Hence
χ(L0 ∩ Dr ) = χ(L0 ∩ Dρ ) = χ(L0 ∩ M ), ∀r ∈ (0, ρ].
450 CHAPTER 9. CLASSICAL INTEGRAL GEOMETRY
Since the restriction to L0 ∩ Dρ of the distance function d(x) has no critical points other
than the minima y ∈ L0 ∩ M , we deduce that L0 is transversal to the level sets
This proves L0 ∈ Xρ . ⊔
⊓
Corollary 9.3.32. Suppose C ⊂ R2 is a smooth, closed, compact tame curve. For every
line L ∈ Gr1 (R2 ) = Gr1 (R2 ) we set
We set Z Z
c(R) = |de
ν (L)|, c(M ) := |de
ν (L)|.
Graff m−2 (R) Graff m−2 (M )
Using Crofton’s formula in Proposition 9.3.24 and the simple observation that
we deduce that
n
c(R) = µm−2 (DnR ) = ωm−2 Rm−2 .
m−2
9.3. CURVATURE MEASURES 451
1
Observe now that c(R) ν (L)| is a probability measure on Graff m−2 (R), and we can regard
|de
the correspondence
Graff m−2 (R) ∋ L 7−→ χ(L ∩ M ) ∈ Z
as a random variable ξR whose expectation is
1 1
hξR i := µm−2 (M ) = EM (h).
c(R) 2πc(R)
We deduce
n m−2 n
EM (h) = 2π ωm−2 R hξR i = 2π ωm−2 lim Rm−2 hξR i. ⊔
⊓
m−2 m−2 R→∞
Theorem 9.3.34 (Crofton Formula. Part 2.). Denote by Graff c (M ) the subset of Graff c
consisting of affine planes which intersect M transversally. Then, for every 0 ≤ p ≤ m − c
we have
Z Z
p+c
µp+c (M ) = µp (L ∩ M )|de
ν |(L) = µ̂p (L ∩ M )|de
ν (L)|.
p Graff c (M ) Graff c
Proof. We continue to use the same notations we used in the proof of Theorem 9.3.29.
From (9.3.15) we deduce
µp+c (M ) = lim µp+c (Dr ).
r→0
The set Dr is a tame domain with smooth boundary, and Corollary 9.3.28 implies
Z Z
p+c
µp+c (Dr ) = µp (L ∩ Dr )|de
ν |(L) = µ̂p (L ∩ Dr )|de
ν |(L).
c Graff c Graff c
Lemma 9.3.35.
lim gr (L) = g0 (L),
rց0
Denote by Graff p (S0 ) the set of codimension p affine planes in Rn which intersect S0
transversally. The set Graff p (S0 ) is an open subset of Graff p , and its complement has
measure zero. Then,
Z Z
p
gr (L) = µ̂(Sr ) = χ(U ∩ Sr )|deν (U )| = χ(U ∩ Sr )|de ν p (U )|. (9.3.26)
Graff p Graff p (S0 )
If we let r go to zero in (9.3.26), we deduce from the dominated convergence theorem that
Z
lim gr (L) = ν p (U )| = µ̂p (S0 ).
χ(U ∩ S0 )|de
rց0 Graff p
Using the fact that S0 is a smooth, compact tame manifold we deduce from Theorem
9.3.29 that
µ̂p (S0 ) = µp (S0 ) = g0 (L). ⊔
⊓
Using (9.3.25) and Lemma 9.3.35, we deduce from the dominated convergence theorem
that Z Z
lim µ̂p (L ∩ Dr )|de
ν (L)| = µ̂p (L ∩ M )|de
ν (L)|
rց0 Graff c Graff c
Z
= µp (L ∩ M )|de
ν (L)|. ⊔
⊓
Graff c (M )
Remark 9.3.36. (a) The tameness assumption in the Crofton formulaæ is not really
needed, and they are true in much more general situations; see [45], Sect. 2.10, 3.2.
We have chosen to work in this more restrictive context for two reasons. First, the
tameness assumption eliminates the need for sophisticated analytical arguments. Second,
we believe that geometers should become more familiar with tame context in which the
geometric intuition is not distracted by analytical caveats.
(b) The subject of integral geometry owes a great deal to the remarkable work of S.S.
Chern who was the first to have reformulated the main problems from a modern point of
view, and push our understanding of this subject to remarkable heights.
It was observed by Chern that the Crofton formulæ that we have proved in this section
are only special cases of the so called kinematic formulæ which explain how to recover
9.3. CURVATURE MEASURES 453
Almost all the objects in differential geometry are defined by expressions involving partial
derivatives. The curvature of a connection is the most eloquent example.
Many concrete geometric problems lead to studying such objects with specific prop-
erties. For example, we inquired whether on a given vector bundle there exist flat con-
nections. This situation can be dealt with topologically, using the Chern–Weil theory of
characteristic classes.
Very often, topological considerations alone are not sufficient, and one has look into
the microstructure of the problem. This is where analysis comes in, and more specifically,
one is led to the study of partial differential equations. Among them, the elliptic ones
play a crucial role in modern geometry.
This chapter is an introduction to this vast and dynamic subject which has numerous
penetrating applications in geometry and topology.
where, as usual, ∂i := ∂xi . This is a scalar operator in the sense that it acts on scalar
valued functions. Note that our definition of the Laplacian differs from the usual one by
a sign. The Laplacian defined as above is sometimes called the geometers’ Laplacian.
Next in line is the exterior derivative
455
456 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
N
smooth function ω : RN → R( k ) . Thus d can be viewed as an operator
N
N
d : C ∞ RN , R( k ) → C ∞ RN , R(k+1) .
C ∞ (E) → C ∞ (F ).
P DO 0 (E, F ) := Hom(E, F ).
by
ad(f )T := T ◦ f − f ◦ T = [T, f ], ∀T ∈ Op(E, F ).
Above, f denotes the C ∞ (M )-module multiplication by f . We can rephrase the equality
(10.1.1) as
Define
The elements of P DO (m) are called partial differential operators of order ≤ m. We set
[
P DO(E, F ) := P DO(m) (E, F ).
m≥0
Example 10.1.2. Denote by R the trivial line bundle over RN . The sections of R are
precisely the real functions on RN . We want
to analyze P DO(1) = P DO(1) (R, R).
Observe first that P DO0 R, R ∼ = C ∞ RN .
Let L ∈ P DO(1) , u, f ∈ C ∞ (RN ). Then [L, f ]u = σ(f ) · u, for some smooth function
σ(f ) ∈ C ∞ (RN ). On the other hand, for any f, g ∈ C ∞ (RN )
σ(f g)u = [L, f g]u = [L, f ](gu) + f ([L, g]u) = σ(f )g · u + f σ(g) · u.
Hence σ(f g) = σ(f )g + f σ(g). In other words, the map f 7→ σ(f ) is a derivation of
C ∞ (RN ), and consequently (see Exercise 3.1.9) there exists a smooth vector field X on
RN such that
σ(f ) = X · f, ∀f ∈ C ∞ (RN ).
Let µ := L(1) ∈ C ∞ (RN ). Then, for all u ∈ C ∞ (RN ), we have
supp Lu ⊂ supp u.
For any open set O such that O ⊃ supp u we can find f ≡ 1 on supp u and f ≡ 0 outside
O so that f u ≡ u). This concludes the proof of the lemma. ⊔
⊓
The above lemma shows that, in order to analyze the action of a p.d.o., one can work
in local coordinates. Thus, understanding the structure of an arbitrary p.d.o. boils down
to understanding the action of a p.d.o. in P DO(m) (Kp , Kq ), where Kp , and Kq are trivial
K-vector bundles over RN . This is done in the exercises at the end of this subsection.
Proposition 10.1.4. Let E, F, G be smooth K-vector bundles over the same manifold
M . If P ∈ P DO (m) (F, G) and Q ∈ P DO(n) (E, F ) then P ◦ Q ∈ P DO (m+n) (E, G).
Proof. According to the computation in Example 10.1.2, each partial derivative ∂i is a 1st
order p.d.o. According to the above proposition, multiple compositions of such operators
are again p.d.o.’s. ⊔
⊓
Lemma 10.1.6. Let E, F → M be two smooth vector bundles over the smooth manifold
M . Then for any P ∈ P DO (E, F ), and any f, g ∈ C ∞ (M )
Proof.
ad(f ) · (ad(g)P ) = [P, g], f
= [P, f ], g + P, [f, g] = [P, f ], g = ad(g) · (ad(f )P ).
⊔
⊓
From the above lemma we deduce that if P ∈ P DO(m) then, for any f1 , . . . , fm ∈
C ∞ (M ), the bundle morphism
We have
nX o X X
{ad(φ)Q} |x0 = ad(αj βj )Q |x0 = {[Q, αj ]βj } |x0 + {αj [Q, βj ]} |x0 = 0.
j j j
Proposition 10.1.10. Let P ∈ P DO (m) (E, F ) and Q ∈ P DO(n) (F, G). Then
σm+n (Q ◦ P ) = σn (Q) ◦ σm (P ). ⊔
⊓
Definition 10.1.13. The operator P ∈ P DOm (E, F ) is said to be elliptic if, for any
x ∈ M , and any ξ ∈ Tx∗ M \ {0}, the map
σm (P )(ξ) : Ex → Fx
is a linear isomorphism. ⊔
⊓
Show that X X αN
σm (L)(ξ) = aα (x)ξ α = aα (x)ξ1α1 · · · ξN . ⊔
⊓
|α|=m |α|=m
Exercise 10.1.16. Let L ∈ P DO (m) (E, F ) and u ∈ C ∞ (E). Show the operator
C ∞ (M ) ∋ f 7−→ [L, f ]u ∈ C ∞ (F )
Exercise 10.1.17. Let Kp and Kq denote the trivial K-vector bundles over RN of rank p
and respectively q. Show that any L ∈ P DO(m) (Kp , Kq ) has the form
X
L= Aα (x)∂ α ,
|α|≤m
10.1.2 Examples
At a first glance, the notions introduced so far may look too difficult to “swallow”. To
help the reader get a friendlier feeling towards them, we included in this subsection a
couple of classical examples which hopefully will ease this process. More specifically, we
will compute the principal symbols of some p.d.o.’s that we have been extensively using
in this book.
In the sequel, we will use the notation “ f · ” to denote the operation of multiplication
by the smooth scalar function f .
Example 10.1.18. (The Euclidean Laplacian). This is the second order p.d.o.,
X
∆ : C ∞ (RN ) → C ∞ (RN ), ∆ = − ∂i 2 , ∂i = ∂xi .
i
Hence
X X
ad(f )2 (∆) = ad(f )(∆f ) · −2 fxi · ad(f ) (∂i ) = −2 (fxi )2 · = −2|df |2 · .
i i
σ2 (∆)(ξ) = −|ξ|2 · .
∇ : C ∞ (E) → C ∞ (T ∗ M ⊗ E).
Its symbol can be read from ad(f )∇, f ∈ C ∞ (M ). For any u ∈ C ∞ (E)
By setting ξ = df we deduce σ1 (∇)(ξ) = ξ⊗, i.e., the symbol is the tensor multiplication
by ξ. ⊔
⊓
δ = ∗d∗ : Ω• (M ) → Ω•−1 (M ),
∗ : Ω• (M ) → Ωn−• (M ).
σ1 (ξ 1 )ω = 0. (10.1.3)
⊔
⊓
d + d∗ : Ω• (M ) → Ω• (M ),
Definition 10.1.23. Let E → M be a smooth vector bundle over the Riemann manifold
(M, g). A second order p.d.o.
L : C ∞ (E) → C ∞ (E)
The formal adjoint of a p.d.o. P ∈ P DO (E1 , E2 ), whose existence is not yet guaran-
teed, is denoted by P ∗ . It is worth emphasizing that P ∗ depends on the choices of g and
h·, ·ii .
464 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Thus
ad(f0 ) ad(f1 ) · · · ad(fm )L∗ = (−1)m+1 (ad(f0 )ad(f1 ) · · · ad(fm )L)∗ = 0. ⊔
⊓
σm (P ∗ ) = (−1)m σm (P )∗ ,
where the ∗ in the right-hand-side denotes the conjugate transpose of a linear map. ⊔
⊓
Let E be a Hermitian vector bundle over the oriented Riemann manifold (M, g).
Example 10.1.29. Using the integration by parts formula of Subsection 4.1.5, we deduce
that the Hodge–DeRham operator
d + d∗ : Ω• (M ) → Ω• (M ),
on an oriented Riemann manifold (M, g) is formally self-adjoint with respect with the
metrics induced by g in the various intervening bundles. In fact, d∗ is the formal adjoint
of d. ⊔
⊓
Proposition 10.1.30. Let (Ei , h·, ·ii ), i = 1, 2, be two arbitrary Hermitian vector bundle
over the oriented Riemann manifold (M, g). Then any L ∈ P DO (E1 , E2 ) admits at least
(and hence exactly) one formal adjoint L∗ .
10.1. PARTIAL DIFFERENTIAL OPERATORS: ALGEBRAIC ASPECTS 465
Sketch of proof We prove this result only in the case when E1 and E2 are trivial vector
bundles over RN . However, we do not assume that the Riemann metric over RN is the
Euclidean one. The general case can be reduced to this via partitions of unity and we
leave the reader to check it for him/her-self.
Let E1 = Cp and E2 = Cq . By choosing orthonormal moving frames, we can assume
that the metrics on Ei are the Euclidean ones. According to the exercises at the end of
Subsection 10.1.1, any L ∈ P DO(m) (E1 , E2 ) has the form
X
L= Aα (x)∂ α ,
|α|≤m
where Aα ∈ C ∞ (RN , Mq×p (C)). Clearly, the formal adjoint of Aα is the conjugate trans-
pose
t
A∗α = Aα .
To prove the proposition it suffices to show that each of the operators ∂i ∈ P DO1 (E1 , E1 )
admits a formal adjoint. It is convenient to consider the slightly more general situation.
Lemma 10.1.31. Let X = X i ∂i ∈ Vect (RN ), and denote by ∇X the first order p.d.o.
∇X u = X i ∂i u u ∈ C0∞ (Cp ).
Then
∇X = −∇X − div g (X),
where div g (X) denotes the divergence of X with respect to the metric g, i.e., the scalar
defined by the equality
LX (dVg ) = div g (X) · dVg .
Proof. Let u, v ∈ C0∞ (RN , Cp ). Choose R ≫ 0 such that the Euclidean ball BR of radius
R centered at the origin contains the supports of both u and v. From the equality
we deduce Z Z
h∇X u, vidVg = h∇X u, vidVg
RN BR
Z Z
= X · hu, vidVg − hu, ∇X vidVg . (10.1.6)
BR BR
Set f := hu, vi ∈ C0∞ (RN , C), and denote by α ∈ Ω1 (RN ) the 1-form dual to X with
respect to the Riemann metric g, i.e.,
Equivalently,
α = gij X i dxj .
466 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
i.e.,
1 X p
∇∗X = −∇X − div g (X) = −∇X − p ∂i ( |g|X i ). (10.1.7)
|g| i
The lemma and consequently the proposition is proved. ⊔
⊓
Example 10.1.32. Let E be a rank r smooth vector bundle over the oriented Riemann
manifold (M, g), dim M = m. Let h·, ·i denote a Hermitian metric on E, and consider a
connection ∇ on E compatible with this metric. The connection ∇ defines a first order
p.d.o.
∇ : C ∞ (E) → C ∞ (T ∗ M ⊗ E).
The metrics g and h·, ·i induce a metric on T ∗ M ⊗ E. We want to describe the formal
adjoint of ∇ with respect to these choices of metrics.
As we have mentioned in the proof of the previous proposition, this is an entirely local
issue. We fix x0 ∈ M , and we denote by (x1 , . . . , xm ) a collection of g-normal coordinates
on a neighborhood U of x0 . We set
∇i := ∇∂xi .
Next, we pick a local synchronous frame of E near x0 , i.e., a local orthonormal frame (eα )
such that, at x0 , we have
∇i eα = 0 ∀i = 1, . . . , m.
10.1. PARTIAL DIFFERENTIAL OPERATORS: ALGEBRAIC ASPECTS 467
dxk ⊗ : C ∞ (E |U ) → C ∞ (T ∗ U ⊗ E |U ),
is the interior derivative (contraction) along the vector field g-dual to the 1-form dxk , i.e.,
Hence, X X
∇∗ = ∇k ∗ ◦ C k = −∇k − div g (∂xk ) ◦ C k
k k
X p
=− (∇k + ∂xk log( |g|) ◦ C k . (10.1.8)
k
∆ = ∆∇ : C ∞ (E) → C ∞ (E), ∆ = ∇∗ ∇.
To justify the attribute Laplacian we will show that ∆ is indeed a generalized Laplacian.
Using (10.1.8) we deduce that over U (chosen as above) we have
( )
X p X
∆=− ∇k + ∂xk log |g| ◦ C k ◦ dxj ⊗ ∇j
k j
nX p o
=− ∇k + ∂xk log |g| ◦ g kj · ∇j
k
X p
=− gkj ∇k + ∂xk gkj + gkj ∂xk log |g| ◦ ∇j
k,j
X 1 p
=− gkj ∇k ∇j + p ∂xk |g|gkj · ∇j .
k,j
|g|
The symbol of ∆ can be read easily from the last equality. More precisely,
σ2 (∆)(ξ) = −g jk ξj ξk = −|ξ|2g .
where Γikℓ denote the Christoffel symbols of the Levi–Civita connection associated to the
metric g.
(b) Show that
∗
∆∇ = − tr2g (∇T M ⊗E ∇E ),
∗
where ∇T M ⊗E is the connection on T ∗ M ⊗ E obtained by tensoring the Levi–Civita
connection on T ∗ M and the connection ∇E on E, while
tr2g : C ∞ (T ∗ M ⊗2 ⊗ E) → C ∞ (E)
(c)
BL∗ (v, u) = −BL (u, v).
10.2. FUNCTIONAL FRAMEWORK 469
Lemma 10.2.2. Any f ∈ L1loc (D) admits at most one weak partial derivative. ⊔
⊓
Exercise 10.2.4. Let H ∈ L1loc (R) denote the Heaviside function, H(t) ≡ 1, for t ≥ 0,
H(t) ≡ 0 for t < 0. Prove that H is not weakly differentiable. ⊔
⊓
Above, L∗ denotes the formal adjoint of L with respect to the Euclidean metric on RN .
Exercise 10.2.6. Let f, g ∈ C ∞ (D). Prove that
Lf = g classically ⇐⇒ Lf = g weakly. ⊔
⊓
Definition 10.2.7. Let k ∈ Z+ , and p ∈ [1, ∞]. The Sobolev space Lk,p (D) consists of
all the functions f ∈ Lp (D) such that, for any multi-index α satisfying |α| ≤ k, the mixed
partial derivative ∂ α f exists weakly, and moreover, ∂ α f ∈ Lp (D). For every f ∈ Lk,p(D)
we set 1/p
X Z
kf kk,p := kf kk,p,D = |∂ α f |p dx ,
|α|≤k D
if p < ∞, while if p = ∞ X
kf kk,∞ = ess sup |∂ α f |.
|α|≤k
Exercise 10.2.9. Let f (t) = |t|α , t ∈ R, α > 0. Show that for every p > 1 such that
α > 1 − 1p , we have
f (t) ∈ L1,p
loc (R). ⊔
⊓
Theorem 10.2.10. Let k ∈ Z+ and 1 ≤ p ≤ ∞. Then
(a) (Lk,p (RN ), k · kk,p ) is a Banach space.
(b) If 1 ≤ p < ∞ the subspace C0∞ (RN ) is dense in Lk,p (RN ).
(c) If 1 < p < ∞ the Sobolev space Lk,p(RN ) is reflexive.
The proof of this theorem relies on a collection of basic techniques frequently used in
the study of partial differential equations. This is why we choose to cover the proof of
this theorem in some detail. We will consider only the case p < ∞, leaving the p = ∞
situation to the reader.
Proof. Using the Exercise 10.2.5, we deduce that Lk,p is a vector space. From the classical
Minkowski inequality,
ν
!1/p ν
!1/p ν
!1/p
X X X
p p p
|xi + yi | ≤ |xi | + |yi | ,
i=1 i=1 i=1
10.2. FUNCTIONAL FRAMEWORK 471
we deduce that k · kk,p is a norm. To prove that Lk,p is complete, we will use the well
established fact that Lp is complete.
✍ To simplify the notations, throughout this chapter all the extracted subsequences will
be denoted by the same symbols as the sequences they originate from.
Let (fn ) ⊂ Lk,p (RN ) be a Cauchy sequence, i.e.,
In particular, for each multi-index |α| ≤ k, the sequence (∂ α fn ) is Cauchy in Lp (RN ), and
thus,
Lp
∂ α fn → gα , ∀|α| ≤ k.
Set f = limn fn . We claim that ∂ α f = gα weakly.
Indeed, for any ϕ ∈ C0∞ (RN ) we have
Z Z
α |α|
∂ fn ϕ dx = (−1) fn ∂ α ϕ.
RN RN
We let the reader check that f ∗ g is well defined, i.e., y 7→ f (x − y)g(y) ∈ L1 , for almost
all x.
Exercise 10.2.11. (Young’s Inequality). If f ∈ Lp (RN ), and g ∈ Lq (RN ), 1 ≤ p, q ≤
∞, then f ∗ g is well defined, f ∗ g ∈ Lr (RN ), 1r = 1p + 1q − 1, and
⊔
⊓
To define the mollifiers one usually starts with a function ρ ∈ C0∞ (RN ), such that
Z
ρ ≥ 0, supp ρ ⊂ |x| < 1 , and ρ dx = 1.
RN
Note that Z
supp ρδ ⊂ |x| < δ and ρδ dx = 1.
RN
The sequence (ρδ ) is called a mollifying sequence. The next result describes the main use
of this construction.
Lemma 10.2.12. (a) For any f ∈ L1loc (RN ) the convolution ρδ ∗ f is a smooth1 function!
(b) If f ∈ Lp (RN ) (1 ≤ p < ∞) then
Lp
ρδ ∗ f → f as δ → 0.
Proof. Part (a) is left to the reader as an exercise in the differentiability of integrals with
parameters.
To establish part (b), we will use the fact that C0∞ (RN ) is dense in Lp (RN ). Fix ε > 0,
and choose g ∈ C0∞ (RN ) such that,
kf − gkp ≤ ε/3.
We have
kρδ ∗ f − f kp ≤ kρδ ∗ (f − g)kp + kρδ ∗ g − gkp + kg − f kp .
Using the inequality (10.2.1) we deduce
Hence Z
|ρδ ∗ g (x) − g(x)| ≤ ρ(z)|g(x − δz) − g(x)|dz ≤ δ sup |dg|.
RN
Since
supp ρδ ∗ g ⊂ x + z ; x ∈ supp g ; z ∈ supp ρδ ,
there exists a compact set K ⊂ RN such that,
We conclude that
Z 1/p
p p
kρδ ∗ g − gkp ≤ δ (sup |dg|) dx = vol (K)1/p δ sup |dg|.
K
The next auxiliary result describes another useful feature of the mollification technique,
especially versatile in as far as the study of partial differential equations is concerned.
Lemma 10.2.13. Let f, g ∈ L1loc (RN ) such that ∂k f = g weakly. Then ∂k (ρδ ∗ f ) = ρδ ∗ g.
More generally, if X
L= aα ∂ α
|α|≤m
L(ρδ ∗ f ) = ρδ ∗ g classically.
Proof. It suffices to prove only the second part. We will write Lx to emphasize that L
acts via derivatives with respect to the variables x = (x1 , . . . , xN ). Note that
Z
L(ρδ ∗ f ) = (Lx ρδ (x − y))f (y)dy. (10.2.3)
RN
Since
∂
∂xi ρδ (x − y) = − ρδ (x − y),
∂y i
and ∂i ∗ = −∂i , we deduce from (10.2.3) that
Z Z
∗
L(ρδ ∗ f ) = (Ly ρδ (x − y))f (y)dy = ρδ (x − y)Ly f (y) dy
RN RN
Z
= ρδ (x − y)g(y)dy = ρδ ∗ g. ⊔
⊓
RN
This fact has a fundamental importance in establishing regularity results for elliptic op-
erators. ⊔
⊓
After this rather long detour, we return to the proof of Theorem 10.2.10. Let f ∈
Lk,p (RN ). We will construct fn ∈ C0∞ (RN ) such that fn → f in Lk,p using two basic
techniques: truncation and mollification (or smoothing).
Truncation. The essentials of this technique are contained in the following result.
474 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Lemma 10.2.15. Let f ∈ Lk,p (RN ). Consider for each R > 0 a smooth function ηR ∈
C0∞ (RN ) such that η(x) ≡ 1 for |x| ≤ R, ηR (x) ≡ 0 for |x| ≥ R + 1, and |dηR (x)| ≤ 2 ∀x.
Then ηR · f ∈ Lk,p (RN ), ∀R ≥ 0, and moreover
Lk,p
ηR · f −→ f as R → ∞.
Proof. We consider only the case k = 1. The general situation can be proved by induction.
We first prove that ∂i (ηR f ) exists weakly and as expected
∂i (ηR · f ) = (∂i ηR ) · f + ηR · ∂i f.
This confirms our claim. Clearly (∂i ηR )f + ηR ∂i f ∈ Lp (RN ), so that ηR f is in L1,p (RN ).
Note that ∂i ηR ≡ 0 for |x| ≤ R and |x| ≥ R + 1. In particular, we deduce
(∂i ηR )f → 0 a.e.
Clearly |(∂i ηR )f | ≤ 2|f (x)|, so that by the dominated convergence theorem we conclude
Lp
(∂i ηR ) · f → 0 as R → ∞.
Similarly
ηR ∂i f → ∂i f a.e.,
and |ηR ∂i f | ≤ |∂i f |, which implies ηR ∂i f → ∂i f in Lp . The lemma is proved. ⊔
⊓
According to the above lemma, the space of compactly supported Lk,p -functions is
dense in Lk,p (RN ). Hence, it suffices to show that any such function can be arbitrarily
well approximated in the Lk,p -norm by smooth, compactly supported functions.
Let f ∈ Lk,p (RN ), and assume
∂ α (ρδ ∗ f ) = ρδ ∗ (∂ α f ) ∀|α| ≤ k.
Lp
which is continuous, one-to-one, and has closed range. Indeed, if ∂ α fn → fα , then, arguing
as in the proof of completeness, we deduce that
fα = ∂ α f0 weakly,
where f0 = lim fn . Hence (fα ) = T f0 . We now conclude that Lk,p (RN ) is reflexive as a
closed subspace of a reflexive space. Theorem 10.2.10 is proved. ⊔
⊓
Remark 10.2.16. (a) For p = 2 the spaces Lk,2 (RN ) are in fact Hilbert spaces. The
inner product is given by
Z X
hu, vik = ∂ α u · ∂ α vdx dx.
RN |α|≤k
(b) If D ⊂ RN is open, then Lk,p (D) is a Banach space, reflexive if 1 < p < ∞. However
C0∞ (D), is no longer dense in Lk,p(D). The closure of C0∞ (D) in Lk,p (D) is denoted by
Lk,p k,p k,p
0 (D). Intuitively, L0 (D) consists of the functions u ∈ L (D) such that
∂j u
= 0 on ∂D, ∀j = 0, 1, . . . , k − 1,
∂ν j
where ∂/∂ν denotes the normal derivative along the boundary. The above statement
should be taken with a grain of salt since at this point it is not clear how one can define
u |∂D when u is defined only almost everywhere. We refer to [4] for a way around this
issue.
The larger space C ∞ (D) ∩ Lk,p (D) is dense in Lk,p (D), provided that the boundary of
D is sufficiently regular. We refer again to [4] for details. ⊔
⊓
|dφ| ≤ const,
∂i φ(f ) = φ′ (f ) · ∂i f. ⊔
⊓
Exercise 10.2.19. Let f ∈ L1,p (RN ). Show that |f | ∈ L1,p (RN ), and
∂i f a.e. on {f ≥ 0}
∂i |f | =
−∂i f a.e. on {f < 0}
The “strength” is a measure of the size of a Sobolev size. Loosely speaking, the bigger the
strength, the more regular are the functions in that space, and thus it consists of “fewer”
functions.
Remark 10.2.20. The origin of the quantities σN (k, p) can be explained by using the
notion of conformal weight. A function u : RN → R can be thought of as a dimensionless
physical quantity. Its conformal weight is 0. Its partial derivatives ∂i u are physical quanti-
ties measured in meter −1 = variation per unit of distance, and they have conformal weight
−1. More generally, a mixed partial ∂ α u has conformal weight −|α|. The quantities |∂ α |p
have conformal weight −p|α|. The volume form dx is assigned conformal weight N : the
volume is measured in meter N . The integral of a quantity of conformal weight w is a
quantity of conformal weight w + N . For example, the quantity
Z
|∂ α u|p dx
RN
has conformal weight (N − kp)/p = −σN (k, p). Geometrically, the conformal weight is
captured by the behavior under the rescalings x = λy.
If we replace the Euclidean metric g on Rn with the metric gλ = λ2 g, λ > 0, then,
Equivalently, if we introduce the new linear variables y i related to the canonical Euclidean
variables xi by xi = λy i , then
then,
Lk,p (RN ) ֒→ Lm,q (RN ),
and the natural inclusion is continuous, i.e., there exists C = C(N, k, m, p, q) > 0 such
that
kf km,q ≤ Ckf kk,p ∀f ∈ Lk,p (RN ).
Proof. We follow the approach of [107] which relies on the following elementary, but inge-
nious lemma.
Lemma 10.2.22 (Gagliardo-Nirenberg). Let N ≥ 2 and f1 , . . . , fN ∈ LN −1 (RN −1 ). For
each x ∈ RN , and 1 ≤ i ≤ N define
ξi = (x1 , . . . , x̂i , . . . , , xN ) ∈ RN −1 .
Then
f (x) = f1 (ξ1 )f2 (ξ2 ) · · · fN (ξN ) ∈ L1 (RN ),
and moreover,
N
Y
kf k1 ≤ kfi kN −1 . ⊔
⊓
i=1
where |du|2 = |∂1 u|2 + · · · + |∂N u|2 . This result then extends by density to any u ∈
L1,1 (RN ).
We have
Z x1
|u(x1 , . . . , xN )| ≤ ∂i u(x1 , . . . , xi−1 , t, xi+1 , . . . , xN ) dt
−∞
Z x1
≤ ∂i u(x1 , . . . , xi−1 , t, xi+1 , . . . , xN ) dt =: gi (ξi ).
−∞
478 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Since
|u(x)|N/(N −1) ≤ f1 (ξ1 ) · · · fN (ξN ),
we conclude from Lemma 10.2.22 that u(x) ∈ LN/(N −1) (RN ), and
N
!1/N N
!1/N
Y Y
kukN/(N −1) ≤ kgi (ξi )k1 = k∂i uk1 .
1 1
We have thus proved that L1,1 (RN ) embeds continuously in LN/(N −1) (RN ).
Now let 1 < p < ∞ such that σN (1, p) = 1 − N/p < 0, i.e., p < N . We have to show
∗
that L1,p (RN ) embeds continuously in Lp (RN ), where
Np
p∗ = .
N −p
Let u ∈ C0∞ (RN ). Set v := |u|r−1 u, where r > 1 will be specified later. The inequality
N
!1/N
Y
kvkN/(N −1) ≤ ∂i v 1
1
implies
N
!1/N
Y
kukrrN/(N −1) ≤ r |u|r−1 ∂i u 1
.
1
If q = p/(p − 1) is the conjugate exponent of p, then using the Hölder inequality we get
r−1
|u|r−1 ∂i u ≤ ukq(r−1) k∂i u p
.
Consequently,
N
!1/N
Y
kukrrN/(N −1) ≤ r−1
rkukq(r−1) bigk ∂i u p
.
1
Now choose r such that rN/(N − 1) = q(r − 1). This gives
N −1
r = p∗ ,
N
and we get
N
!1/N
Y
kukp∗ ≤ r ∂i u p
≤ C(N, p)k |du| kp .
1
10.2. FUNCTIONAL FRAMEWORK 479
∗
This shows L1,p ֒→ Lp if 1 ≤ p < N . The general case
follows easily by induction over k. We let the reader fill in the details. ⊔
⊓
Theorem 10.2.24 (Rellich-Kondrachov). Let (k, p), (m, q) ∈ Z+ × [1, ∞) such that
Then any bounded sequence (un ) ⊂ Lk,p (RN ) supported in a ball BR (0), R > 0 has a
subsequence strongly convergent in Lm,q (RN ).
Proof. We discuss only the case k = 1, so that the condition σN (1, p) < 0 imposes 1 ≤
p < N . We follow the elegant presentation in [23, Chap. IX]. The proof will be carried
out in several steps.
Step 1. For every h ∈ RN we define the translation operator
1 1−α
=α+ . (10.2.4)
q p∗
Then there exists a positive constant C, depending only on R, p, q, N such that
Hence Z
kτh u − ukL1 (BR ) = τh u(x) − u(x) dx
BR
Z 1 Z
≤ |h| |du(x + th)|dx dt
0 BR
Z 1 Z 1/p
= |h| |du(y)|p dy vol(BR+1 )1−1/p ≤ C|h|kuk1,p,RN ,
0 th+BR
This proves the inequality (10.2.5) for smooth functions. The general case is obtained by
passing to the limit in the L1,p -norm.
480 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
We will prove that, for every ε > 0 there exists 0 < δ = δ(ε) < 1, such that
Let u ∈ U. We have
Z
|ρδ ∗ u(x) − u(x)| ≤ |τ−y u(x) − u(x)|ρδ (y)dy
Bδ
Using the convexity of the function s 7→ sq , s ≥ 0 and the fact that ρδ (y)dy is a probability
measure we deduce
Z q Z
q
|τ−y u(x) − u(x)|ρδ (y)dy ≤ |τ−y u(x) − u(x)| ρδ (y)dy
Bδ Bδ
we deduce Z
|ρδ ∗ u(x) − u(x)|q ≤ |τ−y u(x) − u(x)|q ρδ (y)dy,
Bδ
so that
Z 1/q (10.2.5)
kρδ ∗ un − un kq,BR ≤ kτ−y un − un kqq,BR ρδ (y)dy ≤ Ckuk1,p,Rn δα .
Bδ
∗ −1)/p∗
≤ C(N, p)δ−N kukp∗ · vol (Bδ )(p
10.2. FUNCTIONAL FRAMEWORK 481
Now let δ ≤ δ(ε/2), where δ(ε/2) is defined by (10.2.6). Then the family Uδ is relatively
compact in C 0 (Br+1 ) and thus it can be covered by finitely many Lq (BR+1 ) balls of radius
ε/2. Using (10.2.6) we see that we can cover U with finitely many Lq (BR+1 )-balls of radius
ε. The compactness theorem is proved. ⊔
⊓
where for any set S ⊂ D, we denoted by osc (u; S) the oscillation of u on S, i.e.,
Set
kuk∞,D := sup |u(x)|,
x∈D
and define
C 0,α (D) := u : D → R ; kuk0,α,D := kuk∞ + [u]α < ∞ .
More generally, for every integer k ≥ 0, define
C k,α(D) := u ∈ C k (D); ∂ β u ∈ C 0,α (D), ∀|β| ≤ k .
σ(k, α) := k + α.
482 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Proof. We consider only the case k = 1, and (necessarily) m = 0. The proof relies on the
following elementary observation.
Then Z
2N |du(y)|
|u(x) − u| ≤ dy. (10.2.7)
σ N −1 BR |x − y|N −1
In the above inequality σ N −1 denotes the “area” of the unit (N − 1)-dimensional round
sphere S N −1 ⊂ RN .
Proof.
Z |x−y|
∂ x−y
u(x) − u(y) = − u(x + rω)dr (ω = − ).
0 ∂r |x − y|
Integrating the above equality with respect to y we get,
Z Z |x−y|
∂
vol (BR )(u(x) − u) = − dy u(x + rω)dr.
BR 0 ∂r
1. In the above lemma, we can replace the round ball BR centered at origin by any
other ball, centered at any other point. In the sequel for any R > 0, and any x0 ∈ RN
we set Z
1
ux0 ,R := u(y)dy.
vol (BR (x0 )) BR (x0 )
Since σN (1, p) > 0, we deduce that p > N , so that its conjugate exponent q satisfies
p N
q= < .
p−1 N −1
where C(N, q) is a universal constant depending only on N , and q. Using the Hölder
inequality in (10.2.8) we conclude
Step 2. Oscillation estimates. We will show there exists C > 0 such that for all u ∈
L1,p (RN ) ∩ C 1 (RN )
[u]α ≤ Ckuk1,p .
Indeed, from the inequality (10.2.7) we deduce that, for any ball BR (x0 ), and any x ∈
BR (x0 ), Z
|du(y)|
|u(x) − ux0 ,R | ≤ C N −1
BR (x0 ) |x − y|
p
(q = p−1 )
Z !1/q
(Hölder) 1
≤ Ck |du| kp · dy ≤ Ck |du| kp Rν ,
BR (x0 ) |x − y|q(N −1)
484 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
where
1 N
ν= N − q(N − 1) = 1 − = α.
q p
Hence
|u(x) − ux0 ,R | ≤ CRα ∀x ∈ BR (x0 ),
and consequently,
osc (u; BR (x0 )) ≤ CRα .
Step 2 is completed.
Step 3. Conclusion. Given u ∈ L1,p (RN ) we can find (un ) ∈ C0∞ (RN ) such that
The estimates established at Step 1 and 2 are preserved as n → ∞. In fact, these es-
timates actually show that the sequence (un ) converges in the C 0,α -norm to a function
v ∈ C 0,α (RN ) which agrees almost everywhere with u. ⊔
⊓
Exercise 10.2.27. Let u ∈ L1,1 (RN ) satisfy a (q, ν)-energy estimate, i.e.
Z
1
∃C > 0 : N |du(y)|q dy ≤ C1 r −ν ∀x ∈ RN , 0 < r < 2,
r Br (x)
where 0 ≤ ν < q and q > 1. Show that (up to a change on a negligible set) the function
u is α-Hölder continuous, α = 1 − ν/q, and moreover
[u]α ≤ C2 ,
Remark 10.2.28. The result in the above exercise has a suggestive interpretation. If u
satisfies the (q, ν)-energy estimate then although |du| may not be bounded, on average, it
“explodes” no worse that r α−1 as r → 0. Thus
Z |x|
|u(x) − u(0)| ≈ C tα−1 dt ≈ C|x|α .
0
The energy estimate is a very useful tool in the study of nonlinear elliptic equations. ⊔
⊓
Then a simple application of the Arzela-Ascoli theorem shows that any bounded sequence
in Lk,p(RN ) admits a subsequence which converges in the C m,α norm on any bounded
open subset of RN .
The last results we want to discuss are the interpolation inequalities. They play an
important part in applications, but we chose not to include their long but elementary
proofs since they do not use any concept we will need later. The interested reader may
consult [4] or [15] for details.
Theorem 10.2.29 (Interpolation inequalities). For each R > 0 choose a smooth, cutoff
function ηR ∈ C0∞ (RN ) such that
ηR ≡ 1 if |x| ≤ R,
ηR ≡ 0 if |x| ≥ R + 1,
and
|dηR (x)| ≤ 2 ∀x ∈ RN .
Fix (m, p) ∈ Z+ × [1, ∞), and (k, α) ∈ Z+ × (0, 1).
(a) For every 0 < r ≤ R+1, there exists C = C(r, R, m, p) such that, for every 0 ≤ j < m,
ε > 0 and for all u ∈ Lm,p (RN ), we have
(b) For every 0 < r ≤ R + 1, there exists C = C(r, R, k, α) such that, for every 0 ≤ j < k,
ε > 0, and for all u ∈ C k,α(RN ), we have
The results in this, and the previous section extend verbatim to slightly more general
situations, namely to functions f : RN → H, where H is a finite dimensional complex
Hermitian space.
(ii) A volume form dVg = ∗1. In particular, dVg defines a Borel measure on M . We de-
note by Lp (M, K) the space of K-valued p-integrable functions on (M, dVg ) (modulo
the equivalence relation of equality almost everywhere).
i.e., Z
hψ, φidVg ≤ kψkp,E · kφkq,E
M
This follows immediately from the Cauchy inequality
h ψ(x), φ(x) ≤ |ψ(x)| · |φ(x)| a.e. on M,
where we used the symbol ∇ to generically denote the connections in the tensor products
T ∗ M ⊗j ⊗E induced by ∇g and ∇E
The metrics g and h induce metrics in each of the tensor bundles T ∗ M ⊗m ⊗ E, and in
particular, we can define the spaces Lp (T ∗ M ⊗m ⊗ E).
10.2. FUNCTIONAL FRAMEWORK 487
Definition 10.2.33. (a) Let u ∈ L1loc (E) and v ∈ L1loc (T ∗ M ⊗m ⊗ E). We say that
∇m u = v weakly if
Z Z
hv, φidVg = hu, (∇m )∗ φidVg ∀u ∈ C0∞ (T ∗ M ⊗m ⊗ E).
M
(b) Define Lm,p (E) as the space of sections u ∈ Lp (E) such that, ∀j = 1, . . . , m, there
exist vj ∈ Lp (T ∗ M ⊗j ⊗ E), such that ∇j u = vj weakly. We set
m
X
kukm,p := kukm,p,E = k∇j ukp . ⊔
⊓
j=1
☞ A word of warning. The Sobolev space Lm,p (E) introduced above depends on several
choices: the metrics on M and E and the connection on E. When M is non-compact this
dependence is very dramatic and has to be seriously taken into consideration.
Example 10.2.34. Let (M, g) be the space RN endowed with the Euclidean metric. The
trivial line bundle E = RM is naturally equipped with the trivial metric and connection.
Then, Lp (RM ) = Lp (M, R).
Denote by D the Levi–Civita connection. Then, for every u ∈ C ∞ (M ), and m ∈ Z+ ,
we have X
Dmu = dx⊗α ⊗ ∂ α u,
|α|=m
k • km,p,RM is equivalent with the norm k • km,p,RN introduced in the previous sections. ⊔
⊓
Proposition 10.2.35. (Lk,p (E), k · kk,p,E ) is a Banach space which is reflexive if 1 < p <
∞. ⊔
⊓
Finally set
[u]α,E := sup r −α osc (u; Br (x)); 0 < r < ρ0 , x ∈ M .
For any k ≥ 0, we define
k
X
kukk,α,E := k∇j uk∞,E + [∇m u]α,T ∗ M ⊗m ⊗E ,
j=0
and we set
C k,α(E) := u ∈ C k (E); kukk,α < ∞ .
Theorem 10.2.36. Let (M, g) be a compact, N -dimensional, oriented Riemann mani-
fold, and E a vector bundle over M equipped with a metric h, and compatible connection
∇. Then the following are true.
(a) The Sobolev space Lm,p (E), and the Hölder spaces C k,α(E) do not depend on the
metrics g, h and on the connection ∇. More precisely, if g1 is a different metric on M ,
and ∇1 is another connection on E compatible with some metric h1 then
and the identity map between these two spaces is a Banach space is continuous. A similar
statement is true for the Hölder spaces.
(b) If 1 ≤ p < ∞, then C ∞ (E) is dense in Lk,p (E).
(c) If (ki , pi ) ∈ Z+ × [1, ∞) (i = 0, 1) are such that
then Lk0 ,p0 (E) embeds continuously in Lk1 ,p1 (E). If moreover,
then the embedding Lk0 ,p0 (E) ֒→ Lk1 ,p1 (E) is compact, i.e., any bounded sequence of
Lk0 ,p0 (E) admits a subsequence convergent in the Lk1 ,p1 -norm.
(d) If (m, p) ∈ Z+ × [1, ∞), (k, α) ∈ Z+ × (0, 1) and
m − N/p ≥ k + α,
m − N/p > k + α,
We developed all the tools needed to prove this theorem, and we leave this task to the
reader. The method can be briefly characterized by two key phrases: partition of unity
and interpolation inequalities. We will see them at work in the next section.
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS489
If u0 is a minimum of E, i.e.,
E(u0 ) ≤ E(u), ∀u,
then
Z
d
0 = |t=0 E(u0 + tv) = (du0 , dv)g + u0 · v − f · v dVg ∀v. (10.3.2)
dt S2
so that necessarily,
∆u0 + u0 = f.
There are a few grey areas in this approach, and Weierstrass was quick to point them
out: what is the domain of E, u0 may not exist, and if it does, it may not be C 2 , so that
the integration by parts is illegal etc. This avenue was abandoned until the dawns of the
twentieth century, when Hilbert reintroduced them into the spotlight, and emphasized the
need to deal with these issues.
His important new point of view was that the approach suggested by Riemann does
indeed produce a solution of (10.3.1) “provided if need be that the notion of solution be
suitable extended ”. The suitable notion of solution is precisely described in (10.3.2).
Naturally, one asks when this extended notion of solution coincides with the classical one.
Clearly, it suffices that u of (10.3.2) be at least C 2 , so that everything boils down to a
question of regularity.
Riemann’s idea was first rehabilitated in Weyl’s famous treatise [131] on Riemann
surfaces. It took the effort of many talented people to materialize Hilbert’s program
formulated as his 19th and 20th problem in the famous list of 27 problems he presented
at the Paris conference at the beginning of the 20th century. We refer the reader to [1]
for more details.
This section takes up the issues raised in the above simple example. The key fact which
will allow us to legitimize Riemann’s argument is the ellipticity of the partial differential
operator involved in this equation.
490 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
(b) Let (k, α) ∈ Z+ ×(0, 1), and R > 0. Then, there exists C = C(kLkk+1,R , k, α, N, R) > 0
such that, ∀u ∈ C0∞ (E |BR (0) ), we have
kukk+m,α ≤ C kLukk,α + kuk0,α . (10.3.4)
Proof. The proof consists of two conceptually distinct parts. In the first part we establish
the result under the supplementary assumption that L has constant coefficients. In the sec-
ond part, the general result is deduced from the special case using perturbation techniques
in which the interpolation inequalities play an important role. Throughout the proof we
will use the same letter C to denote various constants C = C(kLkk+1,R , k, p, N, R) > 0
Step 1. Constant coefficients case. We assume L has the form
X
L= Aα ∂xα ,
|α|=m
We will prove the conclusions of the theorem hold in this special case. We will rely on a
very deep analytical result whose proof goes beyond the scope of this book.
For each f ∈ L1 (RN , C) denote by fˆ(ξ) its Fourier transform
Z
fˆ(ξ) := exp(−ix · ξ)dx.
RN
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS491
(b) For every 1 < p < ∞ there exists C = C(p, kmk∞ ) > 0 such that,
(c)(Korn-Lichtenstein). For every 0 < α < 1, and any R > 0, there exists C =
C(α, kmk0,α , R) > 0 such that ,∀u ∈ C00,α (BR )
[T u]α ≤ Ckuk0,α,BR . ⊔
⊓
For a proof of part (a) and (b) we refer to [125], Chap II §4.4. Part (c) is “elementary”
and we suggest the reader to try and prove it. In any case a proof, of this inequality can
be found in [15], Part II.5.
Let us now return to our problem. Assuming L has the above special form we will
prove (10.3.3). The proof of (10.3.4) is entirely similar and is left to the reader. We discuss
first the case k = 0.
Let u ∈ C0∞ (E |BR ). The function u can be viewed as a collection
If we set v = Lu, then for any multi-index β such that |β| = m, we have
L∂ β u = ∂ β Lu = ∂ β v,
492 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
because L has constant coefficients. We Fourier transform the above equality, and we get
X
(−i)m Aα ξ α ∂d
β u(ξ) = (−i)m ξ β v̂ ( ξ). (10.3.5)
|α|=m
∂\ ˆ ∀ξ 6= 0,
β u(ξ) = ξ β B(ξ)v(ξ)
where B(ξ) = A(ξ)−1 . Note that B(ξ) is homogeneous of degree −m, so that M (ξ) =
ξ β B(ξ) is homogeneous of degree 0. Thus, we can find functions mij (ξ) ∈ C ∞ (Rn \ {0}),
which are homogeneous of degree 0, and such that
X
∂d
β ui (ξ) = mij (ξ)v̂ j (ξ).
j
k∂ β ην k ≤ Cr −|β| ∀|β| ≤ m.
If u ∈ C0∞ (BR ), then
!
X X
v = Lu = L ην u = L(ην u).
ν ν
Set uν := ην u, and X
Lν := Aα (xν )∂ α .
|α|=m
def
We rewrite the equality vν = Luν as
X X
Lν uν = (Lν − L)uν + vν = εα,ν (x)∂ α uν − Aβ (x)∂ β uν + vν ,
|α|=m |β|<m
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS493
kuν km,p,Br (xν ) ≤ C(rkuν kp,Br (xν ) + kuν kp,Br (xν ) + kvν kp,Br (xν ) ).
We can now specify r > 0 such that Cr < 1/2 in the above inequality. Hence
Hence
kvν kp,Br (xν ) ≤ C(kην Lukp,Br (xν ) + r −(m−1) kukm−1,p,Br (xν ) ),
so that
kuν km,p,Br (xν ) ≤ C(kLukp,BR + r −(m−1) kukm−1,p,BR ).
We sum over ν, and taking into account that the number of spheres Br (xν ) is O((R/r)N )
we deduce
X
kukm,p,BR ≤ kuν km,p ≤ CRN (r −N kLukp,BR + r −(m+N −1) kukm−1,p,BR ).
ν
Hence
kukm,p,BR ≤ C(kLukp,BR + εkukm,p,BR + ε−(m−1) kukp,BR ).
If now we choose ε > 0 sufficiently small we deduce the case k = 0 of the inequality
(10.3.3).
494 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
εkukm+k,p,BR + Ckukp,BR ,
using the interpolation inequalities as before. The inequality (10.3.3) is completely proved.
The Hölder case is entirely similar. It is left to the reader as an exercise. Theorem 10.3.1
is proved. ⊔
⊓
Using the truncation technique and the interpolation inequalities we deduce the fol-
lowing consequence.
Corollary 10.3.3. Let L as in Theorem 10.3.1, and fix 0 < r < R. Then, for every
k ∈ Z+ , 1 < p < ∞, and α ∈ (0, 1), there exists C = C(k, p, α, N, kLkk+1,R , R, r) > 0 such
that, ∀u ∈ C ∞ (E)
kukk+m,p,Br ≤ C kLukk,p,BR + kukp,BR ,
and
kukk+m,α,Br ≤ C kLukk,α,BR + kuk0,α,BR . ⊔
⊓
0,α m,α
if there exists α ∈ (0, 1) such that, v ∈ Cloc (RN ), u ∈ Cloc (RN ), and (10.3.6) holds
everywhere.
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS495
The principal result of this subsection states that when L is elliptic the above inclusion is
an equality.
Theorem 10.3.6. Let p ∈ (1, ∞) and suppose that L : C ∞ (E) → C ∞ (E) is an elliptic
operator of order m, with smooth coefficients. If u ∈ Lploc and Lu ∈ Lploc then u ∈ Lm,p
loc (E).
Remark 10.3.7. Loosely speaking the above theorem says that if a “clever” (i.e. elliptic)
combination of mixed partial derivatives can be defined weakly, then any mixed partial
derivative (up to a certain order) can be weakly defined as well. ⊔
⊓
Proof. 2 We prove only two special special cases of the theorem. Namely we assume that
L has either constant coefficients or it is a first order operator. The essential ingredient in
the proof is the technique of mollification. For each δ > 0 we denote by Tδ the operator
of convolution with ρδ . Fix a radius R > 0 and a partial differential operator S of order
m > 0. We first prove that there exists a constant C > 0 depending only on R, N, p, S
such that
k [S, Tδ ]u kp,BR+δ ≤ Ckukm−1,p,BR+1 , ∀u ∈ C0∞ (BR+1 ). (10.3.7)
Using the equality
[S1 S2 , Tδ ] = [S1 , Tδ ]S2 + S1 [S2 , Tδ ]
we deduce that if (10.3.7) holds for the operators S1 and S2 of positive orders then it also
holds for their product S1 S2 . Clearly (10.3.7) is trivially satisfied for operators S with
constant coefficients because in this case [S, Tδ ] = 0. Thus, it suffices to verify (10.3.7) for
first order operators of the form a(x)∂i + b(x).
Denote by Mb : C0∞ → C0∞ the operator of multiplication of by the smooth bundle
morphism b(x). Then
Z
[Mb , Tδ ]u(x) = ρδ (x − y)( b(x) − b(y) )u(y)dy,
RN
and the smoothness of b implies that there exists a constant C > 0 independent of δ > 0
such that
k [Mb , Tδ ] kp,BR+δ ≤ Ckukp,BR+1 , ∀u ∈ C0∞ (BR+1 ).
2
I would like to thank my friend Weimin Chen for pointing out a subtle error in the original proof.
496 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
The inequality (10.3.7) follows by observing that |a(x) − a(y)| = O(δ) when x, y ∈ BR+1 ,
|x − y| < δ.
We deduce that if L is a first order operator or L has constant coefficients there exists
a constant C > 0 depending only on R, N, q, L such that
Suppose that u ∈ Lploc and v = Lu ∈ Lploc . Then for any φ ∈ C0∞ (BR+1 ) we have
Z Z
hLTδ u, φ idV = hTδ u, L∗ φidV
BR+1 BR+1
Z Z Z
∗ ∗
= hu, Tδ L φidV = hu, L Tδ φidV + u, [Tδ , L∗ ]φ
BR+1 BR+1 BR+1
Z Z
= hv, Tδ φidV + u, [Tδ , L∗ ]φ .
BR+1 BR+1
1 1
Let q ∈ (1, ∞)such that p + q = 1. Using (10.3.8) and Hölder inequality we deduce
Z
hLTδ u, φdV ≤ kvkp,BR+1 kTδ φkq,BR+1 + Ckukq,BR+1 kφkq,BR+1
BR+1
≤ C(kukp,BR+1 + kvkp,BR+1 kkφkq,BR+1 ,
for some constant C depending only on R, N, q, L. This implies that LTδ u is bounded in
Lp (BR+1 ). From the elliptic estimates we deduce that the familiy Tδ u it is bounded in
Lm,p (BR ) as δ → 0. Since the space Lm,p (BR ) is reflexive we deduce that a subsequence
Tδn u converges weakly as δn → 0 to a section u0 ∈ Lm,p (BR ). On the other hand, from the
Rellich-Kondrachov compactness theorem we deduce that Tδn u converges in the Lp -norm
to u0 . This implies that u0 = u so that u ∈ Lm,p
loc . ⊔
⊓
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS497
Remark 10.3.8. The proof of the general case of the regularity theorem requires consid-
erable more analytic work than we are willing to include in a geometry book. However,
the general idea is easy to describe.
Using the theory of pseudo-differential operators, one can construct a parametrix for
any elliptic operator L on RN T . This is an integral operator T : C0∞ (RN ) → C ∞ (RN )
which serves as a sort of inverse for L, i.e., the operators Sr = LT − 1 and Sℓ = T L − 1 are
smoothing operators. This means that they map locally integrable functions (and more
generally, distributions) to smooth functions. The convolution with ρδ is a prime example
of smoothing operator, while the convolution with the function |x|2−N is the parametrix
of the (geometers’) Laplacian on RN .
If Lu = v weakly then
T v = T Lu = u + Sℓ u.
Thus T v − u is a smooth function, and thus it suffices to prove that T v ∈ Lm,p
loc . This is
achieved, again using the theory of pseudo-differential operators, the Calderon-Zygmund
inequality, and the special case of Theorem 10.3.6 when L is a constant coefficients oper-
ator. For details we refer to [126]. ⊔
⊓
k+m,p
Proof. To prove that u ∈ Lloc (E) we argue by induction on k. The case k = 0 is
settled by Theorem 10.3.6. For the inductive step assume that u ∈ Lploc and Lu ∈ Lloc
k−1,p
k−1+m,p
implies that u ∈ Lloc . We want to prove that u ∈ Lploc and Lu ∈ Lk,ploc implies that
k+m,p p
u ∈ Lloc . Set v = Lu. The elliptic regularity implies that ∂i u ∈ Lloc . Moreover we
have the weak equality
∂i v = ∂i Lu = L∂i u + [∂i , L]u,
i.e.,
h∂i v, φi = hu, (L∂+i )∗ φi + hu, [∂i , L]∗ φi, ∀φ ∈ C0∞ .
To put it differently, the section ∂i u is a Lp -weak solution the equation
Observe that [∂i , L] is a p.d.o. of order ≤ m which means that the right hand side of
k−1,p k−1+m,p
the above equality is in Lloc since the inductive assumption implies that u ∈ Lloc .
k−1+m,p
Invoking the inductive assumption again we deduce that ∂i u ∈ Lloc , ∀i, i.e., u ∈
k+m,p
Lloc .
To prove the elliptic estimate pick a sequence un ∈ C0∞ (E) such that
k+m,p
un → u strongly in Lloc (E).
498 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Then
k+m,p
Lun → Lu strongly in Lloc (E),
and
kun km+k,p,Br ≤ C(kun k0,p,BR + kLun kk,p,BR ).
The desired estimate is obtained by letting n → ∞ in the above inequality. ⊔
⊓
The results in this, and the previous subsection are local, and so extend to the more
general case of p.d.o. on manifolds. They take a particularly nice form for operators on
compact manifolds.
Let (M, g) be a compact, oriented Riemann manifold and E, F → M be two metric
vector bundles with compatible connections. Denote by L ∈ P DOm (E, F ) an elliptic
operator of order m.
Theorem 10.3.11. (a) Let u ∈ Lp (E) and v ∈ Lk,p (F ), 1 < p < ∞, such that
Z Z
hv, φiF dvg = hu, L∗ φiE dVg ∀φ ∈ C0∞ (F ).
M M
Remark 10.3.12. The regularity results and the a priori estimates we have established
so far represent only the minimal information one needs to become an user of the elliptic
theory. Regrettably, we have mentioned nothing about two important topics: equations
with non-smooth coefficients, and boundary value problems. For generalized Laplacians
these topics are discussed in great detail in [51] and [102]. The boundary value problems
for first order elliptic operators require a more delicate treatment. We refer to [21] for a
very nice presentation of this subject. ⊔
⊓
(a) Show that for every u ∈ L1,2 (E), the function x 7→ |u(x)| is in L1,2 (M ), and moreover
|d |u(x)| | ≤ |∇u(x)|,
i.e.,
|u(x)|∆M (|u(x)|) ≤ h∆E u(x), u(x)iE weakly. ⊔
⊓
Exercise 10.3.14. (Local estimates). Suppose (M, g) is a connected, oriented, Rie-
mann manifolds of dimension m, not necessarily, E0 , E1 → M are two smooth hermitian
vector bundles on M , and ∇i , i = 0, 1 are Hermitian connection on Ei .
Suppose that L : C ∞ (E0 ) → C ∞ (E1 ) is an elliptic operator of order ν with smooth
coefficients. Prove that for any compact subset K ⊂ M , any relatively compact open
neighborhood U of K, and any p ∈ (1, ∞), ℓ ∈ Z≥0 , there exists a positive constat C
depending only on K, U, p, ℓ, ∇0 , ∇1 , and the coefficients of L such that, ∀u ∈ C ∞ (E0 ),
we have
kukLℓ+ν,p (K,E0) ≤ C kLukLℓ,p (U,E1 ) + kukLp (U,E0 ) ,
where the above Sobolev norms are defined in terms of the connections ∇i . ⊔
⊓
Proof. Let
1
(u − v)− = min{u − v, 0) =
{(u − v) − |u − v|}.
2
According to the Exercise 10.2.19 we have (u − v)− ∈ L1,2 (M ), and
− d(u − v) a.e. on {u < v}
d(u − v) =
0 a.e. on {u ≥ v}
Clearly
d(u − v), d(u − v)− = |d(u − v)− |2 ,
and since h is nondecreasing,
Hence, Z Z
− 2
|d(u − v) | dVg ≤ − ( h(u) − h(v) )(u − v)− dVg ≤ 0,
M M
so that
|d(u − v)− | ≡ 0.
Since M is connected, this means (u − v)− ≡ c ≤ 0. If c < 0, then
(u − v) ≡ (u − v)− ≡ c,
Exercise 10.3.17. Assume h in the above lemma is only non-decreasing, but u and v
satisfy the supplementary condition
u ≥ v.
Show the conclusion of Lemma 10.3.16 continues to hold. ⊔
⊓
We now return to the equation (10.3.9). Note first that if u and v are two weak
solutions of this equation, then
so that by the maximum principle u ≥ (≤) v. This shows the equation (10.3.9) has at
most one weak solution.
To proceed further we need the following a priori estimate.
Lemma 10.3.18. Let u be a weak solution of (10.3.9). If C is a positive constant such
that
f (C) ≥ sup s(x),
M
Fix C0 > 0 such that f (C0 ) ≥ sup s(x). Consider a strictly increasing C 2 -function
f˜ : R → R such that
f˜(u) = f (u) for u ≤ C0 ,
f˜(u) is linear for u ≥ C0 + 1.
The conditions (C1 ) and (C2 ) imply that there exist A, B > 0 such that
Proof. We deduce as in the proof of Lemma 10.3.18 that u ≤ C0 , which is precisely the
range where f coincides with f˜. ⊔
⊓
The above lemma shows that instead of looking for a weak solution of (10.3.9), we
should try to find a weak solution of (10.3.12). We will use the direct method of the
calculus of variations on a new functional
Z
1
˜
I(u) = |du(x)|2 + F̃ (u(x)) − s(x)u(x) dVg (x),
M 2
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS503
where Z u
F̃ (u) := f˜(t)dt.
0
The advantage we gain by using this new functional is clear. The inequality (10.3.11)
shows F̃ has at most quadratic growth, so that F̃ (u) ∈ L1 (M ), for all u ∈ L2 (M ). The
existence of a minimizer is a consequence of the following fundamental principle of the
calculus of variations.
Proposition 10.3.20. Let X be a reflexive Banach space and J : X → R a convex,
weakly lower semi-continuous, coercive functional, i.e., the sublevel sets
J c = {x ; J(x) ≤ c}
are respectively convex, weakly closed and bounded in X. Then J admits a minimizer, i.e.,
there exists x0 ∈ X such that
J(x0 ) ≤ J(x) ∀x ∈ X.
Since X is reflexive, and Jc is convex, weakly closed and bounded in the norm of X, we
deduce that J c is weakly compact. Hence a (generalized) subsequence (xν ) of xn converges
weakly to some x0 ∈ J c . Using the lower semi-continuity of J we deduce
Proof. The convexity is clear since F̃ is convex on account that f˜ is strictly increasing.
The functionals
Z Z
1 2
u 7→ |du| dVg and u 7→ − s(x)u(x)dVg (x)
2 M M
F̃ (u) − αu − β ≥ 0.
are both convex and strongly closed. The Hahn-Banach separation principle can now be
invoked to conclude these level sets are also weakly closed. We have thus established that
I˜ is convex and weakly lower semi-continuous.
Remark 10.3.22. We see that the lower semi-continuity, and the convexity conditions
are very closely related. In some sense they are almost equivalent. We refer to [33] for
a presentation of the direct method of the calculus of variations were the lower semi-
continuity issue is studied in great detail. ⊔
⊓
The coercivity will require a little more work. The key ingredient will be a Poincaré
inequality. We first need to introduce some more terminology.
For any u ∈ L2 (M ) we denoted by u its average. Now set
u⊥ (x) = u(x) − u.
Note the average of u⊥ is 0. This choice of notation is motivated by the fact that u⊥ is
perpendicular (with respect to the L2 (M )-inner product) to the kernel of the Laplacian
∆ which is the 1-dimensional space spanned by the constant functions.
Proof. We argue by contradiction. Assume that for any ε > 0 there exists uε ∈ L1,2 (M ),
such that Z Z
|u⊥ε |2
dV g = 1, and |duε |2 dVg ≤ ε.
M M
u⊥ 1,2
ε ⇀ v, weakly in L (M ).
We conclude
Z Z
(dv, dφ) dVg = lim (duε , dφ) dVg = 0 ∀φ ∈ L1,2 (M ).
M ε M
In particular, Z
(dv, dv) dVg = 0,
M
so that dv ≡ 0. Since M is connected, we deduce v ≡ c = const. Moreover,
Z Z
c= v(x)dVg (s) = lim u⊥ε dVg = 0.
M ε M
˜
We can now establish the coercivity of I(u). Let κ > 0, and u ∈ L1,2 (M ) such that
Z n o
1
|du(x)|2 + F̃ (u(x)) − s(x)u(x) dVg (x) ≤ κ. (10.3.13)
M 2
Since Z Z Z Z
2 ⊥ 2 2 2
|du| dVg = |du | dVg and |u| = |u| + |u⊥ |2 dVg ,
M M M M
it suffices to show that the quantities
Z Z
⊥ 2
u, |u | dVg , |du⊥ |dVg
M M
implies the boundedness of ku⊥ k2,M , so that we should concentrate only on u and kdu⊥ k2 .
The inequality (10.3.13) can be rewritten as
Z n o
1 ⊥2
|du | + F̃ (u) − s⊥ u⊥ dVg − s · u ≤ c.
M 2
so that
Cku⊥ k22 − ks⊥ k2 · ku⊥ k2 + F̃ (u) − s · u ≤ κ. (10.3.14)
Set P (t) := Ct2 − ks⊥ k2 t, and let m = inf P (t). From the inequality (10.3.14) we deduce
F̃ (u) − s · u ≤ κ − m.
Using condition (C3 ) we deduce that |u| must be bounded. Feed this information back
in (10.3.14). We conclude that P (ku⊥ k2 ) must be bounded. This forces ku⊥ k2 to be
bounded. Thus I˜ is coercive, and Lemma 10.3.21 is proved. ⊔
⊓
Step 2. The regularity of the minimizer. We will use a technique called bootstrapping,
which blends the elliptic regularity theory, and the Sobolev embedding theorems, to grad-
ually improve the regularity of the weak solution.
Let u be the weak solution of (10.3.12). Then u(x) is a weak L2 solution of
∆u = h(x) on M,
where h(x) = −f˜(u(x)) − s(x). Note that since the growth of f˜ is at most linear, we have
f˜(u(x)) ∈ L2 (M ). The elliptic regularity theory implies that u ∈ L2,2 (M ). Using Sobolev
(or Morrey) embedding theorem we can considerably improve the integrability of u. We
deduce that
(i) either u ∈ Lq (M ) if −N/q ≤ 2 − N/2 ≤ 0 (dim M = N ),
(ii) or u is Hölder continuous if 2 − N/2 > 0.
In any case, this shows u(x) ∈ Lq1 (M ), for some q1 > 2, which implies h(x) ∈ Lq1 (M ).
Using again elliptic regularity we deduce u ∈ L2,q1 and Sobolev inequality implies that
u ∈ Lq2 (M ) for some q2 > q1 .
After a finite number of steps we conclude that h(x) ∈ Lq (M ), for all q > 1. Elliptic
regularity implies u ∈ L2,q (M ), for all q > 1. This implies h(x) ∈ L2,q (M ) ,for all q > 1.
Invoking elliptic regularity again we deduce that u ∈ L4,q (M ), for any q > 1. (At this
point it is convenient to work with f , rather than with f˜ which was only C 2 ). Feed this
back in h(x), and regularity theory improves the regularity of u two orders at a time. In
10.3. ELLIPTIC PARTIAL DIFFERENTIAL OPERATORS: ANALYTIC ASPECTS507
From the theorem we have just proved we deduce immediately the following conse-
quence.
Corollary 10.3.24. Let (M, g) be a compact, connected, oriented Riemann manifold and
s(x) ∈ CR∞ (M ). Assume volg (M ) = 1. Then the following two conditions are equivalent.
(a) s = M s(x)dVg > 0
Remark 10.3.27. A long standing problem in differential geometry, which was only
relatively recently solved, is the Yamabe problem:
“If (M, g) is a compact oriented Riemann manifold, does there exist a metric conformal
to g whose scalar curvature is constant? ”
In dimension 2 this problem is related to the uniformization problem in complex anal-
ysis.
The solution of the Yamabe problem in its complete generality is due to the combined
efforts of T. Aubin, [10, 11] and R. Schoen [118]. For a very beautiful account of its proof
we recommend the excellent survey of J. Lee and T.Parker, [85]. One can formulate a
more general question than the Yamabe problem.
Given a compact oriented Riemann manifold (M, g) decide whether a smooth function
s(x) on M is the scalar curvature of some metric on M conformal to g.
508 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
This problem is known as the Kazdan-Warner problem. The case dim M = 2 is com-
pletely solved in [76]. The higher dimensional situation dim M > 2 is far more complicated,
both topologically and analytically. ⊔
⊓
s(g̃) ≡ −1.
Proof. We look for g̃ of the form g̃ = eu g. Using Exercise 10.3.26 we deduce that u should
satisfy
−1 = e−u {s(x) + ∆u},
i.e.
∆u + eu = −s(x),
where s(x) is the scalar curvature of the metric g. The Gauss–Bonnet theorem implies
that
s = 4πχ(Σ) < 0
In the following exercises (M, g) denotes a compact, oriented Riemann manifold with-
out boundary.
Exercise 10.3.31. Fix c > 0. Show that for every f ∈ L2 (M ) the equation
∆u + cu = f
∆g u = f (x, u) (10.3.17)
T : D(T ) ⊂ X → Y
be a linear operator, not necessarily continuous, defined on the linear subspace D(T ) ⊂ X.
The operator T is said to be densely defined if its domain D(T ) is dense in X.
510 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Remark 10.4.2. (a) In more concrete terms, the operator T : D(T ) ⊂ X → Y is closed if,
for any sequence (xn ) ⊂ D(T ), such that (xn , T xn ) → (x, y), it follows that (i) x ∈ D(T ),
and (ii) y = T x.
(b) If T : X → Y is a closed operator, then T is bounded (closed graph theorem).
Also note that if T : D(T ) ⊂ X → Y is a closed, densely defined operator, then ker T is a
closed subspace of X.
(c) One can show that the adjoint of any closed, densely defined operator, is a closed,
densely defined operator.
(d) The closed, densely defined operator T : D(T ) ⊂ X → X is self-adjoint if the
following two conditions hold.
• hT x, yi = hx, T yi for all x, y ∈ D(T ) and
• D(T ) = y ∈ X; ∃C > 0, |hT x, yi| ≤ C|x|, ∀x ∈ D(T ) .
If only the first condition is satisfied the operator T is called symmetric. ⊔
⊓
Let (M, g) be a compact, oriented Riemann manifold, E, F two metric vector bundles
with compatible connections, and L ∈ P DOk = P DOk (E, F ), a k-th order elliptic p.d.o.
We will denote the various L2 norms by k · k, and the Lk,2 -norms by k · kk .
Definition 10.4.3. The analytical realization of L is the linear operator
La : D(La ) ⊂ L2 (E) → L2 (E), D(La ) = Lk,2 (E),
given by u 7→ Lu for all u ∈ Lk,2 (E). ⊔
⊓
Proof. (a) Since C ∞ (E) ⊂ D(La ) = Lk,2 (E) is dense in L2 (E) , we deduce that La is
densely defined. To prove that La is also closed, consider a sequence (un ) ⊂ Lk,2 (E) such
that
un → u strongly in L2 (E), and Lun → v strongly in L2 (F ).
From the elliptic estimates we deduce
kun − um kk ≤ C kLun − Lum k + kun − um k → 0 as m, n → ∞.
Hence (un ) is a Cauchy sequence in Lk,2 (E), so that un → u in Lk,2 (E). It is now clear
that v = Lu.
(b) From the equality
Z Z
hLu, vidVg = hu, L∗ vidVg , ∀u ∈ Lk,2 (E), v ∈ Lk,2 (F ),
M M
we deduce
D (La )∗ ⊃ D (L∗ )a = Lk,2 (F ),
and (La )∗ = (L∗ )a on Lk,2 (F ). To prove that
D (La )∗ ⊂ D (L∗ )a = Lk,2 (F )
we need to show that if v ∈ L2 (F ) is such that ∃C > 0 with the property that
Z
hLu, vidVg ≤ Ckuk, ∀u ∈ Lk,2 (E),
M
then v ∈ Lk,2 (F ). Indeed, the above inequality shows that the functional
Z
u 7→ hLu, vidVg
M
extends to a continuous linear functional on L2 (E). Hence, there exists φ ∈ L2 (E) such
that Z Z
∗ ∗
(L ) u, v dVg = hu, φidVg ∀u ∈ Lk,2 (E).
M M
Following the above result we will not make any notational distinction between an
elliptic operator (on a compact manifold) and its analytical realization.
Definition 10.4.5. (a) Let X and Y be two Hilbert spaces over K = R, C and suppose
that T : D(T ) ⊂ X → Y is a closed, densely defined linear operator. The operator T is
said to be semi-Fredholm if the following hold.
(b) The operator T is called Fredholm if both T and T ∗ are semi-Fredholm. In this case,
the integer
ind T := dimK ker T − dimK ker T ∗
Remark 10.4.6. The above terminology has its origin in the work of Ivar Fredholm at
the twentieth century. His result, later considerably generalized by F. Riesz, states that
if K : H → H is a compact operator from a Hilbert space to itself, then 1H + K is a
Fredholm operator of index 0. ⊔
⊓
Hence (un ) is also bounded in Lk,2 (E). On the other hand, since M is compact, the space
Lk,2 (E) embeds compactly in L2 (E). The lemma is proved. ⊔
⊓
We will first show that dim ker L < ∞. In the proof we will rely on the classical result
of F. Riesz which states that a Banach space is finite dimensional if and only if its bounded
subsets are precompact (see [23], Chap. VI).
Note first that, according to Weyl’s lemma ker L ⊂ C ∞ (E). Next, notice that ker L is
a Banach space with respect to the L2 -norm since according to Remark 10.4.2 (a) ker L
is closed in L2 (E). We will show that any sequence (un ) ⊂ ker L which is also bounded
in the L2 -norm contains a subsequence convergent in L2 . This follows immediately from
Lemma 10.4.8 since kun k + kLun k = kun k is bounded.
To prove that the range R (T ) is closed, we will rely on the following very useful
inequality, a special case of which we have seen at work in Subsection 9.3.3.
kuk ≤ CkLuk,
Proof. We will argue by contradiction. Denote by X ⊂ Lk,2 (E) the subspace consisting
of sections L2 -orthogonal to ker L. Assume that for any n > 0 there exists un ∈ X such
that
kun k = 1 and kLun k ≤ 1/n.
Thus, kLun k → 0, and in particular, kun k + kLun k is bounded. Using Lemma 10.4.8 we
deduce that a subsequence of (un ) is convergent in L2 (E) to some u. Note that kuk = 1.
It is not difficult to see that in fact u ∈ X. We get a sequence
such that (un , Lun ) → (u, 0). Since L is closed, we deduce u ∈ D(L) and Lu = 0. Hence
u ∈ ker L ∩ X = {0}. This contradicts the condition kuk = 1. ⊔
⊓
We can now conclude the proof of Theorem 10.4.7. Consider a sequence (vn ) ⊂ R (L)
such that vn → v in L2 (F ). We want to show v ∈ R (L).
For each vn we can find a unique un ∈ X = (ker L)⊥ such that
Lun = vn .
kun − um k ≤ Ckvn − vm k.
Hence (un ) is a Cauchy sequence in Lk,2 (E) so that un → u in Lk,2 (E). Clearly Lu = v,
so that v ∈ R (L).
We have so far proved that ker L is finite dimensional, and R (L) is closed, i.e. La is
semi-Fredholm. Since (La )∗ = (L∗ )a , and L∗ is also an elliptic operator, we deduce (La )∗
is also semi-Fredholm. This completes the proof of Theorem 10.4.7. ⊔
⊓
Using the closed range theorem of functional analysis we deduce the following impor-
tant consequence.
Corollary 10.4.10 (Abstract Hodge decomposition). Any k-th order elliptic operator L :
C ∞ (E) → C ∞ (F ) over the compact manifold M defines natural orthogonal decompositions
of L2 (E) and L2 (F ),
The last corollary is really unusual. It states the equation Lu = v has a solution
provided the dual equation L∗ v = 0 has no nontrivial solution. A nonexistence hypothesis
implies an existence result! This partially explains the importance of the vanishing results
in geometry, i.e., the results to the effect that ker L∗ = 0. With an existence result in
our hands presumably we are more capable of producing geometric objects. In the next
chapter we will describe one powerful technique of producing vanishing theorems based
on the so called Weitzenböck identities.
Corollary 10.4.13. Over a compact manifold
Proof. Clearly ker L ⊂ ker L∗ L. Conversely, let ψ ∈ C ∞ (E) such that L∗ Lψ = 0. Then
Z Z
kLψk2 = hLψLψidVg = hL∗ Lψ, ψidVg = 0. ⊔
⊓
M M
The Fredholm property of an elliptic operator has very deep topological ramifications
culminating with one of the most beautiful results in mathematics: the Atiyah-Singer
index theorems. Unfortunately, this would require a lot more extra work to include it
here. However, in the remaining part of this subsection we will try to unveil some of the
natural beauty of elliptic operators. We will show that the index of an elliptic operator
has many of the attributes of a topological invariant.
We stick to the notations used so far. Denote by Ellk (E, F ) the space of elliptic
operators C ∞ (E) → C ∞ (F ) of order k. By using the attribute space when referring to
Ellk we implicitly suggested that it carries some structure. It is not a vector space, it
is not an affine space, it is not even a convex set. It is only a cone in the linear space
P DO(m) , but it carries a natural topology which we now proceed to describe.
Let L1 , L2 ∈ Ellk (E, F ). We set
δ(L1 , L2 ) = sup kL1 u − L2 uk; kukk = 1, .
Define
d(L1 , L2 ) = max δ(L1 , L2 ), δ(L∗1 , L∗2 ), .
We let the reader check that (Ellk , d) is indeed a metric space. A continuous family of
elliptic operators (Lλ )λ∈Λ , where Λ is a topological space, is then a continuous map
Λ ∋ λ 7→ Lλ ∈ Ellk .
Roughly speaking, this means that the coefficients of Lλ , and their derivatives up to order
k depend continuously upon λ.
Theorem 10.4.14. The index map
is continuous.
10.4. ELLIPTIC OPERATORS ON COMPACT MANIFOLDS 515
The proof relies on a very simple algebraic trick which requires some analytical foun-
dation.
Let X, Y be two Hilbert spaces. For any Fredholm operator L : D(L) ⊂ X → Y denote
by ıL : ker L → X (respectively by PL : X → ker L) the natural inclusion ker L ֒→ X
(respectively the orthogonal projection X → ker L). If Li : D(Li ) ⊂ X → Y (i = 0, 1) are
two Fredholm operators define
by
RL0 (L1 )(u, φ) = (L1 u + ıL∗0 φ, PL0 u), u ∈ D(L1 ), φ ∈ ker L∗0 .
In other words, the operator RL0 (L1 ) is given by the block decomposition
L1 ıL∗0
RL0 (L1 ) = .
PL0 0
We will call RL0 (L1 ) is the regularization of L1 at L0 . The operator L0 is called the center
of the regularization. For simplicity, when L0 = L1 = L, we write
RL = RL (L).
We strongly recommend the reader who feels less comfortable with basic arguments of
functional analysis to try to provide the no-surprise proof of the above result. It is a very
good “routine booster”.
Proof of Theorem 10.4.14 Let L0 ∈ Ellk (E, F ). We have to find r > 0 such that,
∀L ∈ Ellk (E, F ) satisfying d(L0 , L) ≤ r, we have
We argue by contradiction. Assume that there exists a sequence (un , φn ) ⊂ Lk,2 (E) ×
ker L∗0 , and a sequence (Ln ) ⊂ Ellk (E, F ) such that
Ln un = −φn ,
Since un ⊥ ker L0 (by (10.4.2)) we deduce from the Poincaré inequality combined with
the elliptic estimates that
kun − um kk ≤ CkL0 un − L0 um k → 0 as m, n → ∞.
Putting all the above together we conclude that there exists a pair (u, φ) ∈ Lk,2 (E)×ker L∗0 ,
such that
kukk + kφk = 1,
L0 u = −φ and u ⊥ ker L. (10.4.4)
This contradicts the abstract Hodge decomposition which coupled with (10.4.4) implies
u = 0 and φ = 0. Step 1 is completed.
Step 2. Let r > 0 as determined at Step 1, and L ∈ Ellk (E, F ). Hence
L ıL∗0
RL0 (L) =
PL0 0
is invertible. We will use the invertibility of this operator to produce an injective operator
This implies dim ker L∗ + dim ker L0 ≤ dim ker L + dim ker L∗0 , i.e.,
A dual argument, with L replaced by L∗ , and L0 replaced by L∗0 , will produce the opposite
inequality, and thus finish the proof of Theorem 10.4.14. Now let us provide the details.
First, we orthogonally decompose
Set U := ker L ⊕ ker L∗0 , and V := ker L∗ ⊕ ker L0 . We will regard RL0 (L) as an operator
where
T : Lk,2 (E) ∩ (ker L)⊥ ⊂ (ker L)⊥ → (ker L∗ )⊥ = Range (L)
denotes the restriction of L to (ker L)⊥ . The operator T is invertible and its inverse is
bounded.
Since RL0 (L) is invertible, for any v ∈ V we can find a unique pair (φ, u) ∈ (ker L)⊥ ⊕
U , such that
φ 0
RL0 (L) = .
u v
This means
T φ + Au = 0 and Bφ + Cu = v.
We can view both φ and u as (linear) functions of v, φ = φ(v) and u = u(v). We claim
the map v 7→ u = u(v) is injective.
Indeed, if u(v) = 0 for some v, then T φ = 0, and since T is injective φ must be zero.
From the equality v = Bφ + Cu we deduce v = 0. We have thus produced the promised
injective map V ֒→ U . Theorem 10.4.14 is proved. ⊔
⊓
The theorem we have just proved has many topological consequences. We mention
only one of them.
Corollary 10.4.17. Let L0 , L1 ∈ Ellk (E, F ) if σk (L0 ) = σk (L1 ) then ind (L0 ) = ind (L1 ).
Proof. For every t ∈ [0, 1] Lt = (1 − t)L0 + tL1 is a k-th order elliptic operator depend-
ing continuously on t. (Look at the symbols). Thus ind (Lt ) is an integer depending
continuously on t so it must be independent of t. ⊔
⊓
This corollary allows us to interpret the index as as a continuous map from the elliptic
symbols to the integers. The analysis has vanished! This is (almost) a purely algebraic-
topologic object. There is one (major) difficulty. These symbols are “polynomials with
coefficients in some spaces of endomorphism”.
The deformation invariance of the index provides a very powerful method for com-
puting it by deforming a “complicated” situation to a “simpler” one. Unfortunately, our
deformation freedom is severely limited by the “polynomial” character of the symbols.
518 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
There aren’t that many polynomials around. Two polynomial-like elliptic symbols may
be homotopic in a larger classes of symbols which are only positively homogeneous along
the fibers of the cotangent bundle).
At this point one should return to analysis, and try to conceive some operators that
behave very much like elliptic p.d.o. and have more general symbols. Such objects exist,
and are called pseudo-differential operators. We refer to [83, 121] for a very efficient
presentation of this subject. We will not follow this path, but we believe the reader who
reached this point can complete this journey alone.
Exercise 10.4.18. Let L ∈ Ellk (E, F ). A finite dimensional subspace V ⊂ L2 (F ) is
called a stabilizer of L if the operator
is surjective.
(a) Show that any subspace V ⊂ L2 (F ) containing ker L∗ is a stabilizer of L. More gener-
ally, any finite dimensional subspace of L2 (F ) containing a stabilizer is itself a stabilizer.
Conclude that if V is a stabilizer, then
ker SL,V1 ⊕ V 2 ∼
= ker SL,V2 ⊕ V1 .
Thus the line bundle det ker SL,V ⊗ det V ∗ → Λ is independent of the uniform stabilizer
V . It is called the determinant line bundle of the family L and is denoted by det ind(L).
⊔
⊓
Consider as usual, a compact, oriented Riemann manifold (M, g), and a complex vec-
tor bundle E → M endowed with a Hermitian metric h•, •i and compatible connection.
Throughout this subsection L will denote a k-th order, formally self-adjoint elliptic oper-
ator L : C ∞ (E) → C ∞ (E). Its analytical realization
λ ∈ spec(L) ⇐⇒ ker(λ − L) 6= 0.
Thus, the spectrum of L consists only of eigenvalues of finite multiplicities. The main
result of this subsection states that one can find an orthonormal basis of L2 (E) which
diagonalizes La .
Theorem 10.4.20. Let L ∈ Ellk (E) be a formally self-adjoint elliptic operator. Then the
following are true.
(a) The spectrum spec(L) is real, spec(L) ⊂ R, and for each λ ∈ spec(L) , the subspace
ker(λ − L) is finite dimensional and consists of smooth sections.
(b) The spectrum spec(L) is a closed, countable, discrete, unbounded set.
(c) There exists an orthogonal decomposition
M
L2 (E) = ker(λ − L).
λ∈spec(L)
The first identity is true over the entire L2 (E) while part (d) of the theorem shows the
domain of validity of the second equality is precisely the domain of L.
Proof. (a) We only need to show that ker(λ − L) consists of smooth sections. In view of
Weyl’s lemma this is certainly the case since λ − L is an elliptic operator.
(b)&(c) We first show spec(L) is discrete.
More precisely, given λ0 ∈ spec(L), we will find ε > 0 such that ker(λ − L) = 0,
∀|λ − λ0 |, ε, λ 6= λ0 . Assume for simplicity λ0 = 0.
We will argue by contradiction. Thus there exist λn → 0 and un ∈ C ∞ (E), such that
Lun = λn un , kun k = 1.
520 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Clearly un ∈ R (L) = (ker L∗ )⊥ = (ker L)⊥ , so that the Poincaré inequality implies
T = (t0 − L)−1 .
Conversely, if X
λ2 kPλ ψk2 < ∞,
λ
Hence ψ ∈ Lk,2 (E) as a Lk,2 -limit of smooth sections. The theorem is proved. ⊔
⊓
∂
L = −i : C ∞ (S 1 , C) → C ∞ (S 1 , C).
∂θ
The operator L is clearly a formally self-adjoint elliptic p.d.o. The eigenvalues and the
eigenvectors of L are determined from the periodic boundary value problem
∂u
−i = λu, u(0) = u(2π),
∂θ
which implies
u(θ) = C exp(iλθ) and exp(2πλi) = 1.
Hence
spec(L) = Z and ker(n − L) = spanC {exp(inθ)}.
The orthogonal decomposition
M
L2 (S 1 ) = ker(n − L)
n
if and only if X
(1 + n2 )|un |2 < ∞. ⊔
⊓
n∈Z
Exercise 10.4.22. Let V ∈ Lk,2 (E) be a finite dimensional invariant subspace of L, i.e.
L(V ) ⊂ V . Show that
2. The quantity
nZ o
⊥
λ(V ) = inf hLu, uidvg ; u ∈ Lk,2 (E) ∩ V ⊥ , kuk = 1
M
⊔
⊓
⊔
⊓
Exercise 10.4.24. Use the results in the above exercises to show that if L is a bounded
from below, k-th order formally self-adjoint elliptic p.do. over an N -dimensional manifold
then
λm (L) = O(mk/N ) as m → ∞,
and
d(Λ) = dim ⊕λ≤Λ ker(λ − L) = O(ΛN/k ) as Λ → ∞. ⊔
⊓
Remark 10.4.25. (a) When L is a formally self-adjoint generalized Laplacian then the
result in the above exercise can be considerably sharpened. More precisely H.Weyl showed
that
rank (E) · volg (M )
lim Λ−N/2 d(Λ) = . (10.4.5)
Λ→∞ (4π)N/2 Γ(N/2 + 1)
10.4. ELLIPTIC OPERATORS ON COMPACT MANIFOLDS 523
The very ingenious proof of this result relies on another famous p.d.o., namely the heat
operator ∂t + L on R × M . For details we refer to [14], or [121].
In the next chapter we will investigate the spectrum of the Laplacian ∆ of a compact
Riemann manifold using another famous p.d.o., the wave operator = ∂t2 + ∆ on R × M .
In particular, we will obtain a refinement of (10.4.5).
(b) Assume L is the scalar Laplacian ∆ on a compact Riemann manifold (M, g) of
dimension M . Weyl’s formula shows that the asymptotic behavior of the spectrum of ∆
contains several geometric informations about M : we can read the dimension and the
volume of M from it. If we think of M as the elastic membrane of a drum then the
eigenvalues of ∆ describe all the frequencies of the sounds the “drum” M can produce.
Thus “we can hear” the dimension and the volume of a drum. This is a special case of
a famous question raised by V.Kac in [71]: can one hear the shape of a drum? In more
rigorous terms this question asks how much of the geometry of a Riemann manifold can
be recovered from the spectrum of its Laplacian. This is what spectral geometry is all
about.
It has been established recently that the answer to Kac’s original question is negative.
We refer to [53] and the references therein for more details. In the next chapter we can
reconstruct the geometry of the manifold if besides the spectrum we also have information
about the eigenfunctions of the Laplace operator. ⊔
⊓
Exercise 10.4.26. Compute the spectrum of the scalar Laplacian on the torus T 2 equipped
with the flat metric, and then use this information to prove the above Weyl asymptotic
formula in this special case. ⊔
⊓
Exercise 10.4.27. Denote by ∆S n−1 the scalar Laplacian on the unit sphere S n−1 ⊂ Rn
equipped with the induced metric. We assume n ≥ 2. A polynomial p = p(x1 , . . . , xn ) is
called harmonic if ∆Rn p = 0.
(a) Show that if p is a homogenous harmonic polynomial of degree k, and p̄ denotes its
restriction to the unit sphere S n−1 centered at the origin, then
(d) Prove Weyl’s asymptotic estimate (10.4.5) in the special case of the operator ∆S n−1 .
⊔
⊓
D D Dm−1
0 → C ∞ (E0 ) →0 C ∞ (E1 ) →1 · · · → C ∞ (Em ) → 0
is exact. ⊔
⊓
Example 10.4.29. The DeRham complex (Ω∗ (M ), d) is an elliptic complex. In this case,
the associated sequence of principal symbols is (e(ξ) = exterior multiplication by ξ)
is the Koszul complex of Exercise 7.1.23 of Subsection 7.1.3 where it is shown to be exact.
Hence, the DeRham complex is elliptic. We will have the occasion to discuss another
famous elliptic complex in the next chapter. ⊔
⊓
Theorem 10.4.30 (Hodge). Assume that M is compact and oriented. Then the following
are true.
1. H i (E· , D· ) ∼
= ker ∆i ⊂ C ∞ (Ei ), ∀i.
Proof. Set
Note that according to Corollary 10.4.13 we have ker ∆ = ker D̂, so that we have an
orthogonal decomposition
L2 (E) = ker ∆ ⊕ R (D̂). (10.4.6)
This is precisely part (c) of Hodge’s theorem.
For each i denote by Pi the orthogonal projection L2 (Ei ) → ker ∆i . Set
Z i = u ∈ C ∞ (Ei ); Di u = 0 and B i = Di−1 (C ∞ (Ei−1 )),
so that
H i (E· , D· ) = Z i /B i .
We claim that the map Pi : Z i → ker ∆i descends to an isomorphism H i (E· , D· ) → ker ∆i .
This will complete the proof of Hodge theorem. The above claim is a consequence of
several simple facts.
Fact 1. ker ∆i ⊂ Z i . This follows from the equality ker ∆ = ker D̂.
Fact 2. If u ∈ Zi then u − Pi u ∈ Bi . Indeed, using the decomposition (10.4.6) we have
Since u − Pi u ∈ C ∞ (Ei ), we deduce from Weyl’s lemma that ψ ∈ C ∞ (E). Thus, there
exist v ∈ C ∞ (Ei−1 ) and w ∈ C ∞ (Ei ) such that ψ = v ⊕ w and
u = Pi u + Di−1 v + Di∗ w.
u − Pi u = Di−1 v ∈ B i .
and we deduce that no two distinct elements in ker ∆i are cohomologous since otherwise
their difference would have been orthogonal to ker ∆i . Hence, the induced linear map
Pi : ker ∆i → H i (E· , D· ) is injective. Fact 1 shows it is also surjective so that
ker ∆i ∼
= H i (E· , D· ).
Let us apply Hodge theorem to the DeRham complex on a compact oriented Riemann
manifold
d d d
0 → Ω0 (M ) −→ Ω1 (M ) −→ · · · −→ Ωn (M ) → 0, n = dim M.
We know that the formal adjoint of d : Ωk (M ) → Ωk+1 (M ) is
d∗ = (−1)νn,k ∗ d∗,
where νn,k = nk + n + 1, and ∗ denotes the Hodge ∗-operator defined by the Riemann
metric g and the fixed orientation on M . Set
∆ = dd∗ + d∗ d.
d ∗ ω = ± ∗ d ∗ ω = 0,
and
∗d ∗ (∗ω) = ± ∗ (dω) = 0.
The Hodge ∗-operator is bijective since ∗2 = (−1)k(n−k) . ⊔
⊓
10.4. ELLIPTIC OPERATORS ON COMPACT MANIFOLDS 527
Using the L2 -inner product on Ω• (M ) we can identify Hn−k (M, g) with its dual, and
thus we can view ∗ as an isomorphism
∗
Hk → (Hn−k )∗ .
On the other hand, the Poincaré duality described in Chapter 7 induces another isomor-
phism
PD
Hk −→ (Hn−k )∗ ,
defined by Z
hP D(ω), ηi0 = ω ∧ η,
M
where h·, ·i0 denotes the natural pairing between a vector space and its dual.
Proposition 10.4.34. P D = ∗, i.e.
Z Z
h∗ω, ηig dvg = ω ∧ η ∀ω ∈ Hk η ∈ Hn−k .
M M
Proof. We have
Exercise 10.4.35. Let ω0 ∈ Ωk (M ) be a harmonic k form and denote by Cω0 its coho-
mology class. Show that
Z Z
|ω0 |2g dvg ≤ |ω|2g dvg ∀ω ∈ Cω0 ,
M M
Exercise 10.4.36. Let G denote a compact connected Lie group equipped with a bi-
invariant Riemann metric h. Prove that a differential form on G is h-harmonic if and only
if it is bi-invariant. ⊔
⊓
528 CHAPTER 10. ELLIPTIC EQUATIONS ON MANIFOLDS
Chapter 11
Spectral Geometry
On a compact Riemann manifold the Laplacian produces a wealth of data: its spectrum
and the corresponding eigenfunctions. We will refer to these as the spectral data of a
Riemann manifold. Clearly the spectral data depend on the metric. In this chapter we
discuss how much information about the metric we can extract from the spectral data.
In approaching this problem we rely on a strategy pioneered by L. Hörmander [65] and
based on the wave equation.
satisfying the following continuity property: for any compact set K ⊂ O there exist
m = mK ∈ Z≥0 and C = CK > 0 such that, for any function ϕ ∈ DK (O)
X
hu, ϕi| ≤ C sup |∂xα ϕ(x)|. (11.1.1)
x∈K
|α|≤mK
529
530 CHAPTER 11. SPECTRAL GEOMETRY
Example 11.1.3. (a) Any complex valued measure on the σ-algebra of Borel subset of
O defines a generalized function on O. In particular L1loc (O) ⊂ C −∞ (O). A function
f ∈ L1loc (O) defines a distribution uf by setting
Z
huf , ϕi := f (x)ϕ(x)dx, ∀ϕ ∈ D(O).
O
⊔
⊓
L : C −∞ (O) → C −∞ (O),
where ϕ̄ denotes the complex conjugate of ϕ. In particular, for any multi-index α ∈ Zn≥0
and any u ∈ C −∞ (O) the distributional derivative ∂ α u of u is determined by the equality
∂ α u, ϕ := (−1)|α| u, ∂ α ϕ , ∀ϕ ∈ D(O).
∂ α δ0 , ϕ = (−1)α ∂ α ϕ(0).
df
Then dx = H as generalized functions on R. ⊔
⊓
C −∞ (O1 ) ∋ u 7→ u O0
∈ C −∞ (O0 ),
u O0
, ϕ = u, ϕ , ∀u ∈ D(O0 ) ⊂ D(O1 ).
Definition 11.1.8. Let u ∈ C −∞ (O), O ⊂ Rn .
⊔
⊓
Then supp ϕu ∈ O \ sing supp u and sing supp ϕu = ∅, i.e., the generalized function
coincides with a smooth function. More precisely, it coincides with the extension by zero
of the smooth function (ϕu) O\sing supp u . ⊔
⊓
(b) For any multi-index α ∈ Zn≥0 the generalized function ∂xα δ0 satisfies
Fix a compact set K ⊂ O and denote by C-∞ (O)K the set of generalized function u
on O such that supp u ⊂ K. Observe that any u ∈ C-∞ (O)K induces a linear function
u : C ∞ (O) → R as follows: fix η ∈ D(O) such that η = 1 in a neighborhood of K. For
φ ∈ C ∞ (O) we set
hu, φi∗ := hu, ηφi. (11.1.2)
Note that hu, ηφi is independent of the choice of η.
Indeed, if η0 , η1 are two test functions equal to 1 on some neighborhood of K, then
η0 φ − η1 φ ∈ D(O \ K) so that
hu, η0 φ − η1 φi = 0.
We denote by C-∞ (O)cpt the space of distributions with compact support.
Note that the distributional derivatives of continuous functions are generalized func-
tions of finite order. We have the following converse, [118, Chap.III, Thm. XXI]
Theorem 11.1.11. Let u ∈ C −∞ (Rn ). Then for any open set Ω ⊂ Rn with compact
closure Ω̄ and for any open neighborhood O of Ω̄ we can find a continuous function
Φ : Rn → R
The distributions with support consisting of a single point have a particularly simple
structure, [118, Chap. III, Thm. XXXV].
Theorem 11.1.12. Let u ∈ C −∞ (Rn ). If supp u = {0}, then there exists an integer
m ≥ 0 and complex constants cα , α ∈ Zn≥0 , |α| ≤ m, such that
X
u= cα ∂ α δ0 . ⊔
⊓
|α|≤m
11.1. GENERALIZED FUNCTIONS AND CURRENTS 533
(z = x − y, x = z + y)
Z Z Z Z
= v(y) u(z)ϕ(z + y) dz dy = v(y) u(z)Ty∗ ϕ(z) dz dy,
Rn Rn Rn Rn
Observe that for any u ∈ C-∞ (Rn ) and any ϕ ∈ D(Rn ) the function
Rn ∋ y 7→ ψ(y) := u, Ty∗ ϕ ∈ R
Exercise 11.1.14. Let u ∈ C-∞ (R). Show that for any k ∈ Z≥0 we have
(k)
δ0 ∗ u = u(k) in the sense of distributions. ⊔
⊓
Remark 11.1.15. More generally, suppose that C ⊂ Rn is a closed convex cone satisfying
the properness condition
C × C ∋ (x, y) 7→ x + y ∈ Rn
supp u, supp v ⊂ C,
Rn ∋ y 7→ u(x − y)v(y) ∈ C
Note that
u(z)ϕ(z + y) 6= 0 =⇒ z ∈ C, y + z ∈ K =⇒ y ∈ K − C
so that
supp ψ ∈ K − C.
In particular, supp v(y)ψ(y) ⊂ K ∩ (K − C) is compact so the integral
Z
v(y)ψ(y)dy
Rn
is well defined.
More generally if u, v ∈ C −∞ (Rn ) are supported in a closed convex cone satisfying the
properness condition (11.1.4), then the convolution u ∗ v is well defined. ⊔
⊓
11.1.2 Currents
For any open subset O ⊂ Rn and any k ≤ n we denote by Ωk (O) the space of k-dimensional
currents on O, i.e., the topological dual of the space Ωkcpt (O) of complex valued, degree
k-forms on O with compact support. More precisely, a linear functional
∂[S] = [∂S],
where [∂S] is the integration current defined by the boundary ∂S equipped with the
outer=-normal-first orientation.
(b) Let η ∈ Ωn−k (O) and denote by [η] the associated k-dimensional current. from the
equality
d ϕ ∧ η) = dϕ ∧ η + (−1)k−1 ϕ ∧ η
and Stokes’ formula we deduce that
Then
Sn = S (Rn ) := ϕ ∈ C ∞ (Rn ); pk (ϕ) < ∞, ∀k ∈ Z≥0 .
The functions in Sn are called Schwartz functions. Let us observe that
2
D(Rn ) ⊂ Sn , e−|x| ∈ Sn .
Thus ϕν → ϕ as ν → ∞ iff
lim pk (ϕν − ϕ) = 0, ∀k ≥ 0.
ν→∞
u : Sn → C,
F : Sn → C ∞ (Rn ), Sn ∋ ϕ 7→ ϕ b = F [ϕ],
Z
b
ϕ(ξ) = e−i(ξ,x) ϕ(x)dx, ∀ξ ∈ Rn . ⊔
⊓
Rn
We list below a few fundamental results concerning the Fourier transform. For a proof
we refer to [39, Thm. 14.8].
11.1. GENERALIZED FUNCTIONS AND CURRENTS 537
F : Sn → Sn
is continuous.
(b) For any ϕ, ψ ∈ Sn
Z Z
b
ϕ(x)ψ(x)dx = b
ϕ(x)ψ(x)dx. (11.1.5)
Rn Rn
1
ϕ, ψ L2 (Rn )
= b ψb
ϕ, L2 (Rn )
,
(2π)n
i.e., Z Z
1 b
ϕ(x)ψ(x)dx = b ψ(ξ)dξ.
ϕ(ξ)
Rn (2π)n Rn
ϕ[
∗ ψ(ξ) = ϕ(ξ) b
b · ψ(ξ). ⊔
⊓
Using the equality (11.1.5) above we can extend the Fourier transform to temperate
distributions.
b, ψ := u, ψb , ∀ψ ∈ Sn i.
u ⊔
⊓
Theorem 11.1.24. Suppose I ⊂ R is an open interval. Then the only solutions of the
differential equation
du
= 0, u ∈ C −∞ (I)
dt
are the functions constant on I.
538 CHAPTER 11. SPECTRAL GEOMETRY
is supported on I and,
hu, ψi = hu, φ̇i = −hu̇, φi = 0.
Hence
hµI , ψi = 0 ⇒ hu, ψi = 0.
Now choose ψ0 ∈ C0∞ (I) such that
hµI , ψ0 i = 1.
then
0 = hu, ψi = hu, φi − chu, ψ0 i,
so
hu, φi = cc0 = c0 hµI , φi.
⊔
⊓
du
+ A(t)u = f (t), (11.1.6)
dt
Proof. Choose
E ∈ C ∞ I, End(Rn )
such that Ė = EA, and E(t) is invertible for any t ∈ I. We deduce that
d
( E(t)u(t) ) = E(t)f (t).
dt
If v ∈ C 1 (I, Rn ) satisfies v ′ = E(t)f (t), then we deduce
d
(E(t)u(t) − v(t)) = 0.
dt
Theorem 11.1.24 implies that there exists c ∈ Rn such that
E(t)u(t) = v(t) + c.
Hence u(t) = E(t)−1 v(t)+ E(t)−1 c is a classical solution of the above differential equation.
⊔
⊓
satisfies a linear first order ordinary differential equation with smooth coefficients and
continuous nonhomogeneous term. Corollary 11.1.25 implies that u is a classical solution.
⊔
⊓
d
Then dt uψ = 0. Using Theorem 11.1.24 we deduce that uψ is a constant function. The
map
D(Ω) ∋ ψ 7→ uψ ∈ C −∞ (I)
is obviously linear and continuous from the usual topology of D(Ω) to the weak topology
of C −∞ (I). The image of this map is the 1-dimensional subspace consisting of constant
functions so that the resulting map
D(Ω) ∋ ψ 7→ uψ ∈ R
The desired conclusion follows by observing that the linear span of functions of the form
ϕ(t)ψ(x), ϕ ∈ D(I), ψ ∈ D(Ω) is dense in D(I × Ω).
⊔
⊓
For any open interval I ⊂ R, any open set O ⊂ Rn we denote by C 0,−∞ (I × O) the
space of (weakly) continuous maps I ∋ t → ut ∈ C −∞ (O), i.e.,
t 7→ hut , ϕi
C 0 (I × O) ⊂ C 0,−∞ (I × O) ⊂ C −∞ (I × O).
where Aj (∂) : C ∞ (O) → C ∞ (O) are partial differential operators with smooth coefficients
independent of t. If F ∈ C 0,−inf ty (Rm ), then
u ∈ C ν,−∞ (I × O).
11.1. GENERALIZED FUNCTIONS AND CURRENTS 541
Moreover, if
lim Ft
tց0
lim ∂tj ut , 1 ≤ j ≤ ν − 1
tց0
exist in C −∞ (Rm ).
Proof. It suffices to show that, for any r > 0 and any bounded open set Ω ⊂ Rm , we have
Fix such a Ω.
Observe next that we can reduce (11.1.9) to a first order (in ∂t ) system of partial
differential equation by replacing u with the vector-valued generalized function
v = (∂tj u)0≤j≤ν−1 .
∂t u + A(∂x )u = F. (11.1.10)
Fix ε > 0 and set J := (−ε, r). Using Theorem 11.1.11 we deduce that we can find an
integer µ ≥ 0, multiindices αk ∈ Zm
≥0 , 0 ≤ k ≤ µ, and generalized functions
uk ∈ C 0 J × Ω ⊂ C 0,−∞ J × Ω , 0 ≤ k ≤ µ
such that !
m
X
u|J×Ω = ∂tk ∂xαk ∈ uk |J×Ω .
k=0
u = ∂xα0 u0 ∈ C 0 (J × Ω).
We set
m
X α
v =u+ ∂tj−1 A0 (∂x )∂x j uj ,
j=1
so that
∂t v = F − A0 (∂x )∂xα0 u0 ∈ C 0,−∞ ((0, r) × Ω).
| {z }
G
Now choose w ∈ C 1,−∞ ((0, r) × Ω), (0, r) ∋ t 7→ wt ∈ C −∞ (Ω) such that ∂t w = G. E.g.,
Z t
hwt , ϕi = hGs , ϕi, ∀ϕ ∈ D(Ω).
t0
Clearly
lim wt , ϕ
tց0
We can now conclude by induction that u ∈ C 1,−∞ (J × Ω). By construction, the limit
lim ut , ϕ
tց0
exists ∀ϕ ∈ D(Ω).
⊔
⊓
11.2. IMPORTANT FAMILIES OF GENERALIZED FUNCTIONS 543
plays an important role. In particular, we will use less familiar properties of this function.
They can be found in [84, 134].
An important concept we will use in the sequel is that of holomorphic family of gen-
eralized functions. A family of generalized functions (ua )a∈A ∈ C −∞ (R) parametrized by
an open subset A ⊂ C is called holomorphic if, for any ϕ ∈ D(R) the function
A ∋ a 7→ hua , ϕi ∈ C
is locally integrable and thus defines a generalized function xa+ ∈ C −∞ (R). A simple
computation shows that
x · xa+ = xa+1
+ , ∀ Re a > −1, (11.2.1a)
d a
x = axa−1 + , ∀ Re a > 0. (11.2.1b)
dx +
We want to extend the definition of xa+ to all a ∈ C. For φ ∈ D(R) the function
Z ∞
Re a > −1 ∋ a 7→ Ia (φ) := hxa+ , φi = xa ϕ(x)dx
0
(−1)k
Ia (φ) = Ia+k φ(k) , ∀ Re a > −1, k ∈ Z≥0 . (11.2.2)
(a + 1) · · · (a + k)
If a ∈ C \ Z<0 then we can define
(−1)k
Ia (ϕ) := Ia+k φ(k) , ∀k > max(0, −1 − Re a).
(a + 1) · · · (a + k)
The equality (11.2.2) shows that the right hand side of the above equality is independent
of the choice of k > max(0, −1 − Re a).
544 CHAPTER 11. SPECTRAL GEOMETRY
This gives a meromorphic extension of the function a 7→ Ia (ϕ) with simple poles located
at Z<0 and thus allows us to define the generalized function xa+ for any a ∈ C\Z<0 . Observe
that
(−1)k 1
lim (a + k)Ia (ϕ) = I0 φ(k) = φ(k−1) (0), (11.2.3)
a→−k (1 − k)(2 − k) · · · (−1)! (k − 1)!
so that
(−1)k−1 (k−1)
lim (a + k)xa+ = δ .
a→−k (k − 1)! 0
More explicitly, for a ∈ C \ Z<0 , Re a > −n − 1, n ∈ Z>0 we have
Z 1 X xk φ(k) (0)
n−1
hxa+ , φi = xa φ(x) − dx
0 k!
k=0
Z ∞ Xn
φ(k−1) (0)
+ xa φ(x)dx + .
1 (k − 1)!(a + k)
k=1
To eliminate the poles of a 7→ xa+ we will divide xa+ by a meromorphic function with simple
poles at Z<0 .
For any Re s > 0 Euler’s Gamma-function satisfies
Γ(s + 1) = sΓ(s).
We can use this equality to extend Γ(s) to a meromorphic function on C with simple poles
at Z≤0 . Note that the residue at −k ∈ Z≤0 can be determined from the equality
The correspondence
Re a > −1 ∋ a 7→ χa+ ∈ C −∞ (R)
and in this half-plane it satisfies the equality
d a+1
χ = χa+ .
dx +
We can use this equality to extend χa+ analytically to all a ∈ C,
dn a+n
χa+ = χ
dxn +
11.2. IMPORTANT FAMILIES OF GENERALIZED FUNCTIONS 545
where n is a positive integer such that n + Re a > −1. Since χ0+ is the Heaviside function
we deduce
−k−1 (k) dk
χ+ = δ0 = k δ0 , ∀k ∈ Z≥0 . (11.2.6)
dx
Similarly, for Re a > −1 we can define
(
a 0, x ≥ 0,
x− = a
|x| , x < 0,
Equivalently
hχa− , ϕi = hχa+ , φ̌i, ∀φ ∈ D(R),
where
φ̌(x) = φ(−x), ∀x ∈ R.
Note that
dn a+n
χ = (−1)n χa− .
dxn −
Observe that for Re a > −1 we have
1 (−1)(k−1) (k−1)
φ(k−1) (0) = hδ , φi.
(k − 1) (k − 1)! 0
The equality (11.2.4) shows that the derivative of this function at a = −(2j + 1) is
(−1)j+1 (j + 1)!. We conclude that, for any ϕ ∈ D(R), the function
1
a 7→ Fϕ (a) := Gϕ (a)
Γ( a+1
2 )
Note that
(−1)j+1 2(j + 1)! (2j)
|χ|−(2j+1) = δ0 . (11.2.8)
(2j)!
For y 6= 0 and a ∈ C we define
(k−1)
(x + iy)a := ea log |x+iy|+i arg(x+iy)
hδ0 , φi,
where
arg(x + iy) ∈ (−π, π), x + iy = |x + iy|ei arg(x+iy) .
We deduce that for Re a > −1 we have
|f (x + iy)| ≤ C|y|−N , ∀x + iy ∈ Z,
then f (• + iy) has a limit f0 ∈ C −∞ (R) when y → 0. Moreover, this limit is a generalized
function of order at most N + 1. We denote it by f (x + i0).
11.2. IMPORTANT FAMILIES OF GENERALIZED FUNCTIONS 547
Then (
C0 |y|1−N , N >1
|F (z)| ≤ .
−C0 log y + C1 , N = 1
dN +1
f (x + iy) = G(x + iy).
dxN +1
Since G(• + iy) converges in the sense of generalized functions to a continuous function
G(x + i0), we deduce that f (x + iy) converges as y ց 0 to the generalized function
dN+1
dxn+1
G(x + i0) which has order ≤ N + 1. ⊔
⊓
Theorem 11.2.1 implies that (11.2.9) holds for all a ∈ C \ Z<0 and that the family
a 7→ (x ± i0)a
1
xa+ = eaπi (x − i0)a − e−aπi (x + i0)a , (11.2.10a)
2i sin πa
1
xa+ = (x + i0)a − (x − i0)a . (11.2.10b)
2i sin πa
Let us observe that
d
(x + i0)a = a(x + i0)a−1 .
dx
Proposition 11.2.2.
iπ(a+1)
χc
a (ξ) = e∓
±
2 (ξ ∓ i0)−(a+1) (11.2.11a)
Proof. Let Re a > −1 and ε > 0. Set fε (x) := e−εx xa+ so that fε → f0 = xa+ in the space
S ′ of Schwartz generalized functions. The Fourier transform of fε is
Z ∞
b
fε (ξ) = e−x(ε+iξ) xa dx.
0
Set ζ := ε + iξ so that Z ∞
fbε (ξ) = ζ −a e−xζ (xζ)a dx
0
(z := xζ)
Z
= ζ −a−1 e−z z a dz = ζ −(a+1) Γ(a + 1) = Γ(a + 1)(ǫ + iξ)−(a+1)
R>0 ·ζ
iπ(a+1)
= e− 2 (ξ − iε)−(a+1) .
Hence iπ(a+1)
fb0 (ξ) = Γ(a + 1)e− 2 (ξ − i0)−(a+1) .
Arguing in a similar fashion we deduce that for Re a > −1 the Fourier transform of xa− is
iπ(a+1)
Γ(a + 1)e 2 (ξ + i0)−(a+1) .
to conclude that
1 π 1 −(a+1) π
νba (ξ) = a a |ξ| = |χ|−(a+1) (ξ).
Γ(a + 1) Γ(1 + 2 ) Γ(− 2 ) Γ(1 + a2 )
Thus
da (ξ) = 2πΓ(a + 1) −(a+1)
|χ| a |χ| (ξ).
Γ( a+1
2 )Γ(1 + 2 )
11.2. IMPORTANT FAMILIES OF GENERALIZED FUNCTIONS 549
Hence
da (ξ) = √π2a+1 |χ|−(a+1) (ξ), ∀ Re a > −1, a 6∈ Z.
|χ|
The equality (11.2.11b) now follows by unique continuation. ⊔
⊓
Corollary 11.2.3.
χa+ ∗ χb+ = χa+b+1
+ , ∀a, b.
If we define
−(1+α)
I+
α := χ+ , α ∈ C,
then we deduce
(k)
I+
k = δ0 , ∀k ∈ Z≥0 ,
and
+ +
I+
α ∗ I β = I α+β , ∀α, β ∈ C.
Noting that
(k)
I+
k ∗ f = δ0 ∗ f = f
(k)
, ∀f ∈ D(R),
we could conclude that the operator
D(R) ∋ u 7→ I + ∞
α ∗ u ∈ C (R)
Ψ∗ : D(X) → D(Y )
motivated follows. The co-area formula (9.1.12) implies that for any ϕ ∈ D(X), u ∈ D(Y )
we have
Z Z Z
1
(Ψ∗ u)(x)ϕ(x)|dx| = u(y) ϕ(x)|dVΨ−1 (y) (x)| |dy|,
X Y Ψ−1 (y) |∇Ψ|(x)
550 CHAPTER 11. SPECTRAL GEOMETRY
where |∇Ψ|(x) denotes the Jacobian of Ψ at the point x ∈ X, and |dVΨ−1 (y) (x)| denotes
the Euclidean volume density on the fiber Ψ−1 (x). More precisely,
p
|∇Ψ|(x) = det(DΨ)(x)∗ (DΨ)(x)
More conceptually, the density Ψ∗ ϕ(y)|dy| on Y is the pushforward of the density ϕ(x) |dx|
on X. Note that Ψ∗ ϕ ∈ D(Y ). We define the pullback
Ψ∗ : C −∞ (Y ) → C −∞ (X)
More generally, if Ψ is not a submersion, but its singular values are outside an open neigh-
borhood of U of sing supp u, then we can define Ψ∗ u via the following cutoff construction.
Fix a smooth function η ∈ D(U ) such that η = 1 on a neighborhood V of sing supp u
in U . The pullback Ψ∗ (ηu) is well defined via the above procedure since the restriction of
Ψ to Ψ−1 (U ) is a submersion.
According to Remark 11.1.9, the distribution (1 − η)u ∈ C −∞ ∗ Y ) is smooth and we
define Ψ∗ ( (1 − η)u) as the usual pullback of a smooth function via a smooth map. We
then set
Ψ∗ u := Ψ∗ (ηu) + Ψ∗ ( (1 − η)u ).
One can verify that the above definition is independent of the choice of η with the above
properties.
Indeed, if η0 , η1 are two such functions, then
Ψ∗ (η0 u) = Ψ∗ (η1 η0 u) + Ψ∗ (1 − η1 )η0 u ,
Ψ∗ (1 − η0 )u = Ψ∗ (η1 (1 − η0 )u) + Ψ∗ (1 − η1 )(1 − η0 )u .
By adding the above equalities and using the linearity of Ψ∗ we deduce
Ψ∗ (η0 u) + Ψ∗ (1 − η0 )u = Ψ∗ (η1 u) + Ψ∗ (1 − η1 )u .
11.2. IMPORTANT FAMILIES OF GENERALIZED FUNCTIONS 551
Qc : R → R, Qc (t) = t2 − c2 .
Note that only the point s0 = −c2 is a critical value of this map, and Q−1 (s0 ) = {0}. For
s > s0 we have p
Q−1c (s) = ± s + c .
2
Hence
1
hQ∗c δ0 , ϕ) = ϕ(c) + ϕ(−c) .
2c
Using the cutoff construction of the pullback we deduce
1
Q∗c δ0 = (δc + δ−c ). ⊔
⊓
2c
Rt : Rn → Rn , x 7→ tx.
Rt (X) = X, ∀t > 0.
Rt∗ u = ta u, ∀t > 0.
If we differentiate the last equality with respect to t at t = 1 we deduce the Euler identity
~
(a + n)hu, ϕi + hu, Eϕi = 0, ∀ϕ ∈ D(X), (11.2.15)
~ is the Euler vector field
where E
n
X
~ =
E xj ∂xj .
j=1
552 CHAPTER 11. SPECTRAL GEOMETRY
Conversely, the equality (11.2.15) implies (11.2.14). Indeed, for t ∈ R define the continuous
linear operator
St : C −∞ (X) → C −∞ (X), St u = e−ta Re∗t u.
Then
S0 = 1, St0 ◦ St0 = St0 +t1 , ∀t0 , t1 ∈ R.
The equality (11.2.15) is equivalent to
d
|t=0 St u = 0.
dt
so that
1
lim (Sh − S0 )u = 0,
h→0 h
in the sense of generalized functions. Hence
1 1
0 = St0 lim (Sh − S0 )u = lim (St0 +h − St0 )u.
h→0 h h→0 h
Hence
d
|t=t0 St u = 0, ∀t0 .
dt
Hence the function t 7→ St u ∈ C −∞ (X) is constant, i.e., u is homogeneous of degree a.
Example 11.2.5. (a) The generalized functions xa± , (x ± i0)a and χa+ are homogeneous
of degree a on R \ 0.
(b) The Dirac generalized function δ0 ∈ C −∞ (Rn ) is homogeneous of degree −n. Indeed
hδ0 , (Rt )∗ ϕi = t−n hδ0 , ϕ(tx)i = t−n ϕ(0) = t−n hδ0 , ϕi.
⊔
⊓
^
(∂ e, ∀j = 1, . . . , n.
xj u) = ∂xj u (11.2.16)
11.2. IMPORTANT FAMILIES OF GENERALIZED FUNCTIONS 553
Proof. Uniqueness. Suppose that that ũ and ū are two homogeneous extensions of u to
Rn . Then ũ − ū is supported at the origin. Hence it is a linear combinations of derivatives
of the Dirac generalized function. Such a linear combination cannot be homogeneous of
degree ∈ C \ Z≤−n . This proves the uniqueness of the extension.
Existence. For any φ ∈ D(Rn ) we define
a+n−1
T a φ(x) := t+ , φx (t) R
, φx (t) := φ(tx), x ∈ Rn \ 0,
where h−, −iR denotes the pairing between a generalized function and a test function on
R. Note that T a φ ∈ C ∞ (Rn \ 0).
Lemma 11.2.8. For any a ∈ C \ Z≤−n and any ϕ ∈ D(Rn ) the function T a φ is homoge-
neous of degree −a − n.
The key moment in the existence proof is the following auxiliary result whose proof
we postpone.
Lemma 11.2.9. If u ∈ C −∞ (Rn \ 0) is a homogeneous generalized function of degree a,
then
hu, ηi = 0 if η ∈ D(Rn \ 0) and T a η(x) = 0, ∀x ∈ Rn \ 0.
⊔
⊓
The sought for extension of u can now be easily described. More precisely, we set
the right-hand side of the above equality is well defined since ψT a φ ∈ D(Rn \ 0), ∀φ ∈
D(Rn ).
If P is a homogeneous polynomial, then P u is a homogeneous generalized function of
degree a + deg P ∈ C \ Z≤−n . Clearly Pfu coincides with P u
e on Rn \ 0 and the uniqueness
of the extension implies that
Pfu = P u
e on Rn .
The equality
^
(∂ e
xj u) = ∂xj u
~ + (a + n)φ = η.
η ∈ D(Rn \ 0), T a η = 0 ⇒ ∃φ ∈ D(Rn \ 0) : Eφ (11.2.18)
(11.2.15 )
~ + (a + n)φi
hu, ηi = hu, Eφ = 0.
d 1~
φ(tx) = Eφ(tx), ∀|x| = 1.
dt t
Hence
d a+n
t ~
φ(tx) = ta+n−1 Eφ(tx) + (a + n)φ(tx) .
dt
Thus (11.2.18) is equivalent to
d a+n
t φ(tx) = ta+n−1 η(tx), ∀|x| = 1.
dt
The function φ defined by
Z ∞
1
φ(tx) = − τ a+n−1 η(τ x)dτ
ta+n t
M : C ∞ (R × M ) → C ∞ (R × M ), M u = ∂t2 u + ∆M u.
As we will see later in this section, this initial value problem has a unique solution which
we denote by Ut (ϕ).
Using a technique pioneered by J. Hadamard [57] and clarified by M. Riesz [114], we
will produce asymptotic expansions Ut (ϕ) for |t| small. Our approach follows Hörmander’s
modern presentation in [67] of the ideas of Hadamard and Riesz.2 As Hörmander explained
in his groundbreaking work [65], these short term asymptotics can be used to obtain rather
precise information about the spectrum of ∆M .
E = δ0 .
(0, ∞) ∋ t 7→ Et ∈ C −∞ (Rm ),
such that
m
X
∂t2 Et − ∂x2j Et = 0, t > 0, (11.3.1a)
j=1
lim Et = 0, (11.3.1b)
tց0
[0, ∞) ∋ t 7→ Et , ϕ(t, −) ∈ R
Proposition 11.3.1. If (Et )t>0 is a Cauchy fundamental solution for the operator L,
e is a fundamental solution of L. We say that E
then the associated generalized function E e
is the fundamental solution associated to the Cauchy fundamental solution (Et )t≥0 .
The map
C+ × C+ ∋ (x, y) → x + y ∈ Rn
is obviously proper, so the cone C+ satisfies the properness condition (11.1.4). We have
the following important uniqueness result.
Proposition 11.3.2. The wave operator admits at most one fundamental solution
supported in the forward light cone C+ .
11.3. THE WAVE EQUATION 557
Proof. Suppose that E, E ′ are two fundamental solutions with support in C+ . Then
E ′ = E + u, where u is a generalized function with support in C+ such that
u = (E ′ − E) = 0.
Since the cone C+ satisfies the properness condition (11.1.4) we deduce from Remark
11.1.15 that the convolution E ∗ u is well defined and we have
0 = (E ∗ u) = (E ∗ u) = (E) ∗ u = δ ∗ u = u.
⊔
⊓
Proof. (a) The Lie algebra of Lom can be tautologically identified with a subspace of the
space of linear vector fields on R1,m . A basis is given by the vector fields
Xi · u = Xij · u = 0, ∀1 ≤ i, j ≤ m.
These equalities can be rewritten in the compact form du ∧ dq = 0. Part (b) follows easily
from (a). ⊔
⊓
η = dq ∧ ω
The above equation has many solutions ω, but all their restrictions to the fiber are the
same.
In particular, we have a dual pullback map
(q + )! : Ω1 (R) → Ωn (O+ ).
The image of (q + )! consists of Lom -invariant currents. We want to prove that, in fact,
any invariant current on O+ lies in the image of (q + )! .
Let T+ = {t > 0} ⊂ R1,m . On this open set the functions (q, x1 , . . . , xm ) define a
global coordinate system. More precisely, T+ can be identified with the open set in the
(q, x1 , . . . , xm )-space defined by the inequality
|x|2 + q > 0.
11.3. THE WAVE EQUATION 559
Fix a function α ∈ C ∞ (R1,m ), such that supp α ⊂ T+ and for any λ ∈ R we have
supp α ∩ {q = λ} is compact,
Z
α(λ, x1 , . . . , xm )dx1 ∧ · · · ∧ dxm = 1.
Rm
Now set
γ = αdx1 ∧ · · · ∧ dxm ∈ Ωn−1 (T+ ).
We have a projection formula
S = q∗ (q + )! γ ∩ S , ∀S ∈ Ω1 (R) (11.3.6)
Proof. Observe that if (11.3.7) were true, than the current S is uniquely determined by T
via the projection formula
S = q∗+ (γ ∩ T ).
We will prove that
T = (q + )! q∗+ (γ ∩ T ) on T+ (11.3.8)
The equality (11.3.7) follows from (11.3.8) by invoking the invariance of T and the fact
that the collection A · T+ , A ∈ Lom is an open cover of O+ .
We set
S := q∗+ (γ ∩ T ), T0 := T − (q + )! S |T+ .
We can write
β = b(q, x1 , . . . , xm )dq ∧ dx1 ∧ · · · ∧ dxm ∧ dq.
From the equality
0 = q!+ η − (q + )∗ (q!+ η) ∧ γ
we deduce Z
b(λ, x1 , . . . , xm )dx1 ∧ · · · ∧ dxm = 0, ∀λ ∈ R.
560 CHAPTER 11. SPECTRAL GEOMETRY
(x1 , . . . , xm ) 7→ b(λ, x1 , . . . , xm )
has compact support. Using the Poincaré lemma for compact supports Theorem 7.2.1, or
rather its proof, we can find a family of forms Bq ∈ Ωm−1 m
cpt (R ), depending smoothly on
q ∈ R, such that, ∀q ∈ R we have
supp Bq ⊂ |x|2 + q > 0 ,
b(q, x)dx1 ∧ · · · ∧ dxm = dx Bq ,
where dx denotes the exterior derivative on the space Rm with coordinates x = (x1 , . . . , xm ).
We can regard the family (Bq ) as defining an (m − 1)-form B on the space T+ with coor-
dinates (q, x) and as such we have
β = d(B ∧ dq)
and
(11.3.9 ) (11.3.5 )
hη, T0 i = hd(B ∧ dq), T i = hB ∧ dq, ∂T i = hB, dq ∩ ∂T i = 0.
⊔
⊓
✍ In the sequel we will use the notation u = (q + )∗ v when referring to the generalized
function u on the punctured space R1,m \ 0 defined by (11.3.11), where v ∈ C −∞ (R),
supp v ⊂ R≥0 . Informally
(q + )∗ v = v(t2+ − |x|2 ).
d2 d
Pn = 4q + 2n .
dq 2 dq
and
d d−2
(q + )∗ χ+2 = (2d + 2m − 2)(q + )∗ χ+2 , on R1,m \ 0. (11.3.12)
Proof. We know that u = (q + )∗ v for some v ∈ C −∞ (R) with support in [0, ∞). Since
u is homogeneous of degree d and q is homogeneous of degree 2 we deduce that v is
homogeneous of degree d2 ∈ R \Z<0 . Theorem 11.2.7 implies that v is uniquely determined
by its restriction to (0, ∞), while Theorem 11.2.6 shows that the restriction of v to (0, ∞)
d
is equal to constχ+2 .
To prove the second statement observe that
d d−2 d−2
Pn χ+2 = (2d + 2n − 4)χ+2 = (2d + 2m − 2)χ+2 .
⊔
⊓
where
Γ( a+1
2 )
n := m + 1, cm (a) := m .
π Γ(a)Γ( a−m+1
2
2 )
a is homogeneous of degree a − n.
Observe that R+
a is locally integrable if a > m − 1.
Lemma 11.3.7. Let m > 1. The function R+
562 CHAPTER 11. SPECTRAL GEOMETRY
Proof. As we have seen in the proof of Proposition 11.3.4 on the open set O+ we can use
as coordinates the functions q, x1 , . . . , xm . In these coordinates we have
p
t = |x|2 + q
so that
m
X xi 1
dt = p dxi + p dq
i=1 |x|2 +q 2 |x|2 + q
and
1
dt ∧ dx1 ∧ · · · ∧ dxm = p dq ∧ dx1 ∧ · · · ∧ dxm .
2 |x|2 + q
In the coordinates (q, x) the cone C+ becomes the half-plane q ≥ 0 and the function
a−n
q 2
p
|x|2 + q
is locally integrable on this half-plane because
a−n Re a−n
q 2 q 2
p ≤ , ∀q ≥ 0, x ∈ Rm \ 0,
|x|2 + q |x|
and the function is the right hand side is locally integrable if m > 1 and
Re a − n
> −1⇐⇒ Re a > m − 1.
2
⊔
⊓
of degree a − n > −2 and its support is contained in C+ . Corollary 11.3.6 shows that R+ a
Proposition 11.3.8.
a b a+b
R+ ∗ R+ = R+ , ∀a, b > m − 1.
If we set Z
T (a) := e−t R+
a
(t, x)dtdx,
R1,m
so that
T (a)T (b)
c(a, b) = .
T (a + b)
Now observe that
Z Z !
T (a) a−m−1
= e−t (t2 − |x|2 ) 2 dx dt
cm (a) R>0 |x|≤t
Z Z t
a−m−1
−t 2 2 m−1
= σ m−1 e (t − r ) 2 r dr dt,
R>0 0
σ m−1 Γ(a)Γ( m
2 )Γ(
a−m+1
2 ) 1
= a+1 = .
2 Γ( 2 ) cm (a)
Thence T (a) = 1 and c(a, b) = 1. ⊔
⊓
564 CHAPTER 11. SPECTRAL GEOMETRY
Using the equality (11.3.12) and the definition of dm (a) we deduce that
a a−2
R+ = R+ on R1,m \ 0, ∀ Re a > m + 1.
Invoking (11.2.16) we deduce that the above equality holds on R1,m . We have thus proved
the following result.
Proposition 11.3.9.
a b a+b
R+ ∗ R+ = R+ , ∀ Re a, Re b > m − 1, (11.3.13a)
a a−2
R+ = R+ on R1,m , ∀ Re a > m + 1. (11.3.13b)
We can now use the above equalities to define R+ a ∈ C −∞ (R1,m ) for any a ∈ C by
setting
a
R+ := k R+a+2k
, (11.3.14)
where k is a positive integer such that Re a + 2k > m − 1. The family
a
C ∋ a 7→ R+ ∈ C −∞ (R1,m )
a is homogeneous of
is holomorphic. Moreover, for any a ∈ C the generalized function R+
degree a, it is Lom -invariant and
a
supp R+ ⊂ C+ .
Moreover, the equalities (11.3.13a) and (11.3.13b) extend by unique continuation to all
a, b ∈ C. The family (R+a) a
a∈ C is called the wave family and the generalized functions R+
are the Riesz generalized functions, [114].
Proposition 11.3.10.
a a−2
(a − 2)∂xi R+ = −xi R+ , ∀a ∈ C, 1 ≤ i ≤ m. (11.3.15)
Proof. To prove (11.3.15) we observe that both sides of this equality are generalized func-
tions depending holomorphically on a. Hence it suffices to prove it only for Re a ≫ 0. In
this case a−n a−2−n
a
∂xi R+ = dm (a)∂xi (q + )∗ χ+2 = dm (a)(∂xi q)(q + )∗ χ 2
dm (a) a−2−n dm (a)
= −2xi dm (a − 2)(q + )∗ χ 2 = −2xi R a−2
dm (a − 2) dm (a − 2) +
Γ( a+1
2 )Γ(a − 2) a−2
a−1
a−2 xi
= −2xi R+ = −2x i
2
R+ =− R a−2 .
a−1
Γ(a)Γ( 2 ) (a − 1)(a − 2) a−2 +
⊔
⊓
11.3. THE WAVE EQUATION 565
Theorem 11.3.11. The unique fundamental solution of on R1,m , m > 1, with support
2.
in C+ is the Riesz generalized function R+
Γ( a+1
2 )
a 7→ dm (a) = m , Re a > m − 1
π 2 Γ(a)
Note that
a a
R̄+ = R+ |R1,m \0 , ∀ Re a > m − 1.
By unique continuation we deduce that the above equality holds for all a ∈ C. Using
(11.3.13b) we deduce that
a a−2
R̄+ = R̄+ on R1,m \ 0, ∀a ∈ C.
Hence
0
supp R+ = {0}.
0 is homogeneous of degree −n we deduce that R 0 = cδ , for some constant c.
Since R+ + 0
Then
a a 0 a a
R+ = R+ ∗ R+ = cR+ ∗ δ0 = cR+ .
This shows that c = 1. ⊔
⊓
exist in C −∞ (Rm ).
566 CHAPTER 11. SPECTRAL GEOMETRY
Proposition 11.3.12. The family ( E(t) )t>0 is a Cauchy fundamental solution of the
wave operator, i.e., it satisfies the conditions (11.3.1a), (11.3.1b) and (11.3.1c).
Proof. We follow the approach in [66, Sec. 6.2].The equality (11.3.1a) follows from the
fact that E = δ0 . We have to show that E0 = 0 and E0′ = δ0 . To achive this we first
need to have a clearer idea on the nature of the generalized functions E(t).
Let us compute the restriction of
1 2−n 1 1−m
(m−1)/2
q ∗ χ+2 = (m−1)/2
q ∗ χ+ 2
2π 2π
to the punctured space R1+m \ 0. Note that
p
|∇q(t, x)| = 2 t2 + |x|2 .
and we get
Z ∞ Z
1 rdr 1
q∗ ϕ(s) = ϕ̃(t, r) 2 1/2
= ϕ̃ t, (t2 − s)1/2 dt.
2 0 (s + r ) 2 t2 >s
11.3. THE WAVE EQUATION 567
Hence
Z
1 (1−m)/2
E+ , ϕ = χ+ , ϕ̃ t, (t2 − ·)1/2 dt, ∀ϕ ∈ D(R1+m \ 0). (11.3.16)
4π (m−1)/2 s<t2
m
For any ψ ∈ D R we set
Z
m−2
ψ̃(r) := r ψ(rω)|dω|.
|ω|=1
The equality (11.3.16) shows that for t > 0 and ψ ∈ D(Rm ) we have
1 D 1−m E
E+ (t), ψ = (m−1)/2 χ+ 2 (s), ψ̃ (t2 − s)1/2 . (11.3.17)
4π
We rewrite (11.3.16) as
Z ∞
E+ , ϕ = E+ (t), ϕ(t, x) dt, ∀ϕ ∈ D(R1+m \ 0).
0
We claim that Z ∞
E+ , ϕ = E+ (t), ϕ(t, x) dt, ∀ϕ ∈ D(R1+m ). (11.3.18)
0
To see this observe that since limtց0 E(t) exists weakly we deduce from the uniform
boundedness principle that the right-hand side of (11.3.18) exists and defines a homoge-
neous distribution of degree 1 − n on Rn = R1+m . This coincides with the homogeneous
distribution E+ on the punctured space Rn \ 0. We conclude from Proposition that they
coincide on Rn .
To show that E0 = 0 and E0′ = δ0 we argue as in the proof of Proposition 11.3.1. Let
ϕ = ϕ(t, x) ∈ D(R1+m ). We have
ϕ(0, 0) = hδ0,0 , ϕi = E+ , ϕ = E+ , ϕ
Z ∞
(11.3.18 )
= lim E+ (t), (ϕ)(t, −) dt
εց0 ε
Z ∞ Z ∞
= lim E+ (t), (∆ϕ)(t, −) dt + lim E+ (t), (∂t2 ϕ)(t, −) dt
εց0 ε εց0 ε
Z ∞ Z ∞
= lim ∆Et , ϕ(t, −) dt − lim E+ (ε), ∂t ϕ(ε, −) − lim ∂t Et , (∂t ϕ)(t, −) dt
εց0 ε εց0 εց0 ε
Z ∞
= − E0 , ∂t ϕ(0, −) + lim ∆Et , ϕ(t, −) dt
εց0 ε
Z ∞
+ lim ∂t Eε , ϕ(ε, −) + lim ∂t2 Et , ϕ(t, −) dt
εց0 εց0 ε
Z ∞
′
= − E0 , ∂t ϕ(0, −) + E0 , ϕ(0, −) + lim ∂t2 Et + ∆Et , ϕ(t, −) dt
εց0 ε
satisfying
(∂t2 + ∆)R+
2 2 +
(t) = 0, t > 0, R+ 2 +
(0 ) = 0, ∂t R+ (0 ) = δ0 . (11.3.19)
near (or at) the origin. If x ∈ Rm \ 0, then R+2 (t, x) = 0 for |x| > t. In other words, the
so that
2
R+ = dm (2)χ−k 2 2
+ (t+ − |x| )
has support on ∂C+ . In this case disturbance leaves the point x immediately after it
reaches this point. This is sometimes referred to as Huygens’ Principle.
If m is even, the disturbance will linger at x forever, with diminishing strength. ⊔
⊓
2.
Here is a simple but important application of the fundamental solution R+
Theorem 11.3.14. For arbitrary ϕ0 , ϕ1 ∈ Rm and f ∈ C ∞ ([0, ∞) × Rm ) the Cauchy
problem
u = f, on (0, ∞) × Rm ,
(11.3.20)
u(0, x) = ϕ0 (x), ∂t u(0, x) = ϕ1 (x), ∀x ∈ Rm ,
solution for the wave operator. Using Proposition 11.1.28 we deduce that the correspon-
dence
2
(0, ∞) ∋ t 7→ R+ (t) ∈ C −∞ (Rm )
is smooth. Moreover the support of R+ 2 (t) is contained in the ball {|x| ≤ t}. Thus,
the expression in the right-hand side of (11.3.21) is a well defined smooth function on
[0, ∞) × Rm . The fact that u given by (11.3.21) is a solution of the initial value problem
(11.3.20) follows by direct computation using the equalities
2 d 2
R+ = 0 in (0, ∞) × Rm , R+
2
(0+) = 0, R (0+) = δ0 .
dt +
11.3. THE WAVE EQUATION 569
Uniqueness. There are many approaches. Ours is based on finite speed propagation of
solutions. We follow closely the presentation in [32, VI.6.1].
For every T > 0 we consider the dependence cone (see Figure 11.1)
T
C− := (t, x); t ∈ [0, T ], |x| ≤ (T − t) .
C_T
x2
x1
T.
Figure 11.1: The dependence cone C−
T the base of
We denote by ∂+ C∗T conical part of the boundary of the cone and by ∂0 C−
the cone; see Figure 11.1. The unit outer normal along the (smooth part of the) boundary
of C−T has the form
Xn
n = a0 ∂t + ai ∂xi .
i=1
Along the conical part of the boundary, described by the quadratic equation
(T − t)2 = |x|2 ,
Along the base of the cone, described by {t = 0, |x| ≤ T }, the unit other normal is
n0 = −∂t . For any smooth function u = u(t, x) we set
m
X m
X
ˆ := ut dt +
du ˆ 2 = u2 +
uxi dxi , |du| u2xi .
t
i=1 i=1
We have
X
m
ˆ 2 +
2ut u = ∂t |du| ∂xi (−2ut uxi ).
i=1
570 CHAPTER 11. SPECTRAL GEOMETRY
Hence Z Z Z
(ut u)dtdx = div X u dtdx = (X u , n) dS
T
C− T
C− T
∂C−
Z Z m m
!
X X
=− ˆ
|du(0, x)|2 dx + a0 u2xi − 2 ai ut uxi dS
|x|≤T T
∂∗ C− i=0 i=1
Z Z m
!
(11.3.22 ) 1 X
= − ˆ
|du(0, x)|2 dx + (a0 uxi − ai ut )2 dS.
|x|≤T T
∂∗ C− a0
i=1
If additionally
ˆ
du(0, x) = 0,
T we have
then we deduce that along ∂∗ C−
a0 uxi − ai ut = 0, ∀i = 1, . . . , m. (11.3.24)
Vi · u = 0, ∀i = 1, . . . , m.
They imply that u is constant along ∂∗ C0T . If we assume additionally that u(0, x) = 0,
then we conclude that u ≡ 0 on ∂C− T , so that u(T, 0) = 0, ∀T > 0. Applying the same
Remark 11.3.15. (a) The above proof of the uniqueness result justifies the name depen-
dence cone we gave to C−T . If u, v are smooth solutions of the wave equation satisfying
the equalities
u(0, x) − v(0, x) = ut (0, x) − vt (0, x) = 0, ∀|x| ≤ T,
then u(0, T ) = v(0, T ). In other words, the value at (T, 0) is only determined by the initial
data inside the dependence cone.
11.3. THE WAVE EQUATION 571
(b) There is an alternate proof of the uniqueness, based on the so called energy method
and in which the dependence cone also play a central role. Suppose the u is a C 2 solution
of the wave equation. For t ≤ T we define
Z
1 ˆ
Eu (t) = |du(t, x)|2 dx.
2 |x|≤T −t
∂t2 u
e + |ξ|2 u
e = 0.
g 2 sin(|ξ|t+ )
R+ (t, ξ) = .
|ξ|
⊔
⊓
C+
U0
x0 x
Lx
0
This shows that for Re a ≫ 0 the restriction of χa+ (t2 − |x|2 ) to Lx0 is given by
D E D 1 p E
χa+ (t2 − |x|2 )|Lx0 , ϕ = χa+ (q), p ϕ( q + |x0 |2 ) , (11.3.25)
2 q + |x0 |2
∀ϕ ∈ D(R). The right-hand side of the above equality depends holomorphically on a and
we define χa+ (t2+ − |x|2 ) to the line Lx0 for any a ∈ C by the same equality (11.3.25) using
unique continuation.
11.3. THE WAVE EQUATION 573
1
|χ|a (t) := a
a+1 |t| .
Γ( 2 )
a− n
Proof. For x0 6= 0 fixed we set c = c(x0 ) = |x0 | and we denote by χ+ 2
(t2+ − c2 ) the
a− n
restriction of χ+ 2 (t2+ − |x|2 ) to the temporal line
For Re a ≫ 0 we have
a− n a−1− n
∂t χ+ 2
(t2+ − c2 ) = 2t+ χ+ 2
(t2+ − c2 ) ∈ C −∞ (R),
a− n a− n
∂t χ+ 2 (t2+ − c2 ) − χ+ 2 (t2− − c2 )
a−1− n a−1− n
= 2t+ χ+ 2
(t2+ − c2 ) + 2t− χ+ 2
(t2− − c2 )
a−1− n2
= 2|t|χ+ (t2 − c2 ).
By unique continuation we conclude that the above equality holds for any a, where the
multiplications |t| · χa+ (t2 − c2 ) are well defined since the singular supports of |t| and
χa+ (t2 − c2 ) are disjoint. We deduce that for any ϕ ∈ D(R) we have
D a−1− n
E D a−1− n
E
|t|χ+ 2
(t2 − c2 ), ϕ(t) = t+ χ+ 2
(t2 − c2 ), ϕ(t) + ϕ(−t)
p
(q = t2+ − c2 , t+ = q + c2 )
* +
(11.3.25 ) p a−1− n 1 p p
= ( q + c2 )χ+ 2
(q), p ϕ( q + c2 ) + ϕ(− q + c2 )
2 q + c2
a−1− n 1 p 2
p
2
= χ+ 2
(q), ϕ q+c +ϕ − q+c .
2
574 CHAPTER 11. SPECTRAL GEOMETRY
1
(ϕ(t) + ϕ(−t)) = η(t2 ), (11.3.27)
2
for some η ∈ D(R). Hence
1 p p
ϕ q + c2 + ϕ − q + c2 = η(q + c2 ),
2
D a− n a− n E D a−1− n E
lim ∂t χ+ 2 (t2+ − c2 ) − χ+ 2 (t2− − c2 ) , ϕ(t) = χ+ 2
(q), η(q) .
c→0
We want to express hχa+ (q), η(q)i in terms of the function ϕ(t) satisfying (11.3.27). Note
that the function
a 7→ hχa+ (q), η(q)i
is holomorphic, so it suffices to determine it for a in an open set in the complex plane.
We assume Re a > 0. In this case
Z ∞
a 1
hχ+ (q), η(q)i = q a η(q)dq
Γ(a + 1) 0
Z ∞
q=s2 1
= s2a+1 (ϕ(s) + ϕ(−s))ds
Γ(a + 1) 0
Z
1
= |s|2a+1 ϕ(s)ds = |χ|2a+1 , ϕ .
Γ(a + 1) R
The map a 7→ h|χ|2a+1 , ϕ is holomorphic and we can invoke unique continuation to
conclude the validity of the obove equality for any a. The equality (11.3.26) follows from
the above since n
2 a−1− + 1 = 2a − 1 − n = 2a − 2 − m.
2
⊔
⊓
where √ Qk 2j−1
Γ( 2k+1
2 )
π j=1 2 1
dm (2k) = m = m = m−1 ,
π Γ(2k)
2 π (2k − 1)!
2 2π 2 (k − 1)!
so that
2k 1 k− n
R+ = m−1 χ+ 2
(t2+ − |x|2 ). (11.3.29)
π 2 (k − 1)!
3
We follow [56, p. 343]. Let n an tn be the Taylor expansion of ϕ0 (t) = 12 (ϕ(t)+ϕ(−t)) at t = 0. Since
P
P 2n
ϕ0 is even we deduce an = 0 for n odd so that ϕ0 (t) = a2n t . ByPBorel’s theorem, [66, Thm. 1.2.6],
there exists a smooth function u(s) whose Taylor expansion
√ at s = 0 is n a2n sn . Then v(t) = ϕ 2
√0 (t)−u(t )
vanishes with all its derivatives at t = 0 and hence v( t) is smooth. We set η(s) := u(s) + v( s).
11.3. THE WAVE EQUATION 575
Using (11.2.11b) we deduce that the Fourier transform of ∂t R+ 2 − Ř 2 )(t, 0) with respect
+
to the t-variable is
√
2 2
2 π (m−1) 2π
Ft ∂t R+ − Ř+ )(t, 0) (τ ) = m−1 |χ| (τ ) = m
m
|τ |m−1
m m
2 π 2 Γ( 2 )
2 π 2
m
2π 2 m−1 σ m−1 m−1 mω m m−1
=π m m |τ | =π m
|τ | =π |τ | .
(2π) Γ( 2 ) (2π) (2π)m
Hence !
1 2 2
1 d ωmτ m
Ft ∂t R+ − Ř+ )(t, 0) (τ ) = sign τ . (11.3.31)
2π 2 dτ (2π)m
⊔
⊓
11.3.3 Local parametrices for the wave equation with variable coeffi-
cients
We want to extend the identities (11.3.13b) and (11.3.15) to slightly more general situa-
tions.
Consider a new linear change of coordinates on Rm , x = T y, where T : Rm → Rm is a
linear isomorphism. More explicitly,
X
xi = tij y j .
j
We denote by (gjk ) the matrix associated to the operator (T ∗ T ) in the y-coordinates and
by (gjk ) its inverse. More precisely
X
gjk := tij tik .
i
We set
|g| := det(gjk ) = det(T ∗ T ).
Then in the y-coordinates we have
∆ = ∆y := −∂yj gjk ∂yk .
Observe that
X
|x|2 = |y|2g := y j y k , q(t, x) = t2 − |x|2 = qg (t, y) = t2 − |y|2g .
jk
Then
(∂t2 + ∆y )R+
a a−2
(t, T y) = R+ (t, T y), (11.3.32)
576 CHAPTER 11. SPECTRAL GEOMETRY
and X
a−2
(a − 2) a
gjk ∂yk R+ (t, T y) = −y j R+ (t, T y). (11.3.33)
k
If we multiply both sides of the above equality with gij and then we sum over j we deduce
X
(a − 2)∂yi R+ a
(t, T y) = − gij y j R+
a−2
(t, T y). (11.3.34)
j
Let us observe that if δ0 (x) ∈ C −∞ (Rm ) denotes the Dirac generalized function concen-
trated at 0, then
1
δ0 (T y) = T ∗ δ0 = |g|− 2 δ0 (y). (11.3.35)
a (t, |y| ) ∈ C −∞ (R1,m \ 0) using the technique in (11.3.11)
For Re a ≫ m we define R̄+ g
a−n
a
R̄+ (t, |y|g ) := dm (a)(qg+ )∗ χ 2 .
The generalized function R̄+a (t, |y| ) admits a unique homogeneous extension R a (t, |y| )
g + g
to R 1,m which coincides with R+ a (t, T y). We can then use (11.3.32) to show that
a a
R̄+ (t, |y|g ) = R+ (t, T y)|R1,m, \0 , ∀a.
For this reason we will use the more suggestive notation R+ a (t, |y| ) when referring to
g
a
R+ (t, T y).
More generally, consider a Riemann metric g = (gij ) on Rm ,
X
g= gij dxi dxj .
i,j
Moreover4 X X
gjk (x)xk = gjk (0)xk , (11.3.37)
k k
4
The equality (11.3.37)
P is an algebraic rephrasing of the Gauss Lemma, Lemma 4.1.29, stating that the
the radial vector field j xj ∂xj is the gradient of the function 12 |x|2g .
11.3. THE WAVE EQUATION 577
gjk (x) − gjk (0) = O |x|2 , ∀j, k. (11.3.38)
We set
X (11.3.37 ) 1
h(x) := bj (x)gjk (0)xk = − g ∇ log |g|, x . (11.3.39)
2
j,k
We deduce from (11.3.32) and the above identity that for Re a ≫ 0 we have
X
∂t2 − ∂xj gjk ∂xk R+ a a−2
(t, |x|g ) = R+ (t, |x|g ). (11.3.40)
j,k
By unique continuation we conclude that the above equality holds for all a ∈ C.
More generally, if u = u(x) is a t-independent smooth function, then we deduce
from(11.3.33), (11.3.34), (11.3.39) and (11.3.40) that for any a 6= 2 we have
a−2
∂t2 + ∆g u(x)R+ a
(t, |x|g ) = uR+ + (∆g u)R+a
1 X (11.3.41)
− hu − 2 xj ∂xj u R+a−2
.
a−2
j
When a = 2 we have
m−1 1− n
R 2 (t, |x|g ) = (qg+ )∗ f2 , f2 = π − 2 χ+ 2
, (t, x) 6= 0.
1− n
Since χ+ 2 is homogeneous of degree 2 − n on R1,m \ 0 it admits a unique extension to
R1,m as a homogeneous generalized function of degree 2 − n. The generalized function
−n
xj (qg+ )∗ χ+ 2 ∈ C −∞ (R1,m \ 0)
1
∂t2 + ∆g u(x)R+2
(t, |x|g ) = u(0)|g0 |− 2 δ0 + (∆g u)R+ 2
m−1 X −n (11.3.43)
−2π − 2 hu − 2 xj ∂xj u (qg+ )∗ χ+ 2 .
j
N
X −1
2k 1 1
+ R+ ∆g uk + uk+1 − huk+1 + x · ∇uk+1
2k k
k=1
m−1 n
−2π − 2 (hu1 − 2x · ∇u1 ) (qg+ )∗ χ− 2 .
Proposition 11.3.20. For any smooth function h ∈ C ∞ (Rm ) such that h(0) = 0 there
exists a unique sequence of smooth functions U1 , U2 , . . . , UN , . . . defined on a small neigh-
borhood of 0 ∈ Rm and satisfying the differential equations
1
∂r U1 = hU1 .
2r
Since h(0) = 0, we deduce that for any θ ∈ Rm , |θ| = 1 we have
Z 1
h(rθ) = r ∂θ h(trθ)dt = rg(rθ),
0
Lemma 11.3.21. For any k ∈ Z>0 , and any smooth function w defined in a neighborhood
of 0 ∈ Rm , the differential equation
h
x · ∇U + k − U =w
2
so that Z r
1 Rρ h(tρθ)
V (rθ) = k ρk−1 w(ρθ)e− 0 2t
dt
dρ.
r 0
⊔
⊓
Observe that for any positive integer ν we can find N = N (ν) such that
2N
(∆g UN )R+ ∈ Cν.
n
More precisely, since χN − 2 is homogeneous of degree N − n2 , we deduce that
2N
2N − n > 2ν ⇒ R+ ∈ Cν.
Remark 11.3.22. We want to discuss a rather confusing fact that unfortunately perme-
ates the more analytically and less geometrically oriented texts.
A metric g on M induces a natural bijection
C ∞ (M ) → C ∞ |Λ|M , f 7→ f |dVg |.
Invariantly, the generalized functions on M are continuous linear functionals on the space
C ∞ |Λ|M of smooth densities on M .
The Dirac measure concentrated at a point x0 ∈ M is the continuous linear map
δx0 : C(M ) → R, δc ∞
x0 (f ) = f (x0 ), ∀f ∈ C (M ).
In local coordinates around x0 , the Dirac measure can be identified with a multiple of
the Dirac generalized function. More precisely, we have the equality of locally defined
generalized functions
1
δxM0 = p δx0 .
|g|
To understand this equality, consider a family of nonnegative smooth functions
such that, Z
φε |dVg | = 1, supp φε ⊂ Bε (x0 ), ∀ε > 0.
M
xi (p) = 0, i = 1, . . . , m.
where
1
bj = − ∇j log |g|.
2
Let
s : M × M → [0, ∞), (x, y) 7→ sy (x) = distg (x, y)
be the geodesic distance function determined by g. For y ∈ M we denote by ~ry the radial
vector field on Ty M . Using the exponential map exp : Ty M → M we can regard ~ry as
a vector field defined on a small geodesic ball Uy centered at y. For fixed y we obtain a
generalized function
(t, x) 7→ R a t, sy (x) , distg (x, y) < c
a on
which in normal coordinates at y coincides with the wave function R+
R × Ty M = R × Rm .
By choosing an orthonormal frame of T M over Up0 we can identify Ty M with Rm and then
we can regard the correspondence y 7→ R+ 2a (t, s (x)) as a smooth family of generalized
y
functions defined on a neighborhood origin in R × Rm .
Equivalently, we can identify a neighborhood of the diagonal in M × M with a neigh-
borhood of the zero section in T M . For fixed y ∈ M the generalized function R+ 2a (t, s (x))
y
is defined on Ty M .
Identifying Up0 with a neighborhood of 0 ∈ Rm , we can further view this family as a
generalized function on a neighborhood of the origin in R × Rm × Rm ,
2a
R+ (t, sy (x)) ∈ C −∞ (−c, c) × Up0 × Up0 .
n
dm (2a)χa− 2 t2+ − distg (x, y)2 .
Thus
2a 2a
R+ (t, sy (x)) = R+ (t, sx (y)), Re a ≫ 0
and by unique continuation we deduce that the above symmetry holds for any a,
2a 2a
R+ (t, sy (x)) = R+ (t, sx (y)), ∀a ∈ C. (11.3.46)
We deduce that for (t, x, y) ∈ (−∞, c)×M ×M we have the equality of generalized functions
N
X
∂t2 + ∆x 2k
Uk (x, y)R+ t, sy (x)
k=1 (11.3.48)
− 12 2N
= |gy | δy + (∆x UN (x, y) )R+ t, sy (x) .
1
Above |gy |− 2 δy ∈ C −∞ (R × M ) is the Dirac-delta generalized function on R × M con-
centrated at (0, y) ∈ R × M . More precisely (see Remark 11.3.22) for any function
u ∈ C0∞ (R × M ) we have
1
|gy |− 2 δy , u|dVg | = u(0, y).
⊔
⊓
The function EM,g is called the spectral function of the Riemann manifold (M, g). If we
denote by Hτ the space
Hτ := span Ψk ; λk ≤ τ ,
11.4. SPECTRAL GEOMETRY 583
We set
X XZ
N (τ ) = Ng (τ ) := dim Hτ = 1= Ψk (x)2 |dVg (x)|
λk ≤τ λk ≤τ M
Z (11.4.49)
= Eτ (x, x)|dVg (x)|.
M
For any positive integer k, any p ∈ [0, ∞), and any u ∈ C ∞ (M ) we set
1
XZ
p
j≤k M
Theorem 11.4.1. For any ν ∈ Z≥0 there exists Cν > 0 such that
m
kEτ kC ν (M ×M ) ≤ Cν τ ν+ 2 , ∀τ > 0. (11.4.50)
Proof of the lemma. We will prove that there exists a constant C > 0 such that for
any u ∈ C0∞ (Rm ) and any λ > 1 we have
Z X Z
m X
λk−ν− 2 sup |∂xα u(x)|2 ≤ C |∂xα u(x)|2 |dx| + λk |u(x)|2 |dx| .
x Rm |α|=k Rm
|α|=ν
(11.4.52)
The inequality (11.4.51) follows from (11.4.52) via partitions of unity, so it suffices to prove
(11.4.52) .
We make the substitution
1
x := λ− 2 y.
Then
∂y 1 m
∂x = ∂y = λ 2 y , |dx| = λ− 2 |dy|,
∂x
584 CHAPTER 11. SPECTRAL GEOMETRY
and if we set
1
v(y) = u(x) = u(λ− 2 y),
then we see that it suffices to prove (11.4.52) only in the case λ = 1 for the function v.
From the Morrey inequalities we deduce
X Z X
α 2
sup |∂ u(x)| ≤ C |∂ α u(x)|2 |dx|.
x Rm |α|≤k
|α|=ν
Using Parseval’s formula we deduce that there exists a constant C ′ > 0 such that
Z X Z X Z
|∂ α u(x)|2 |dx| ≤ C ′ |∂ α u(x)|2 |dx| + |u(x)|2 |dx| ,
Rm |α|≤k Rm |α|=k Rm
so that
kuτ kL2 ≤ C(τ + 1)k kukL2 (M ) ,
2k (M )
Fix a point p0 ∈ M and geodesic coordinates x at p0 . Observe that for any multi-index
α such that |α| = ν we have
Hence
ν m
kVα,p0 kL2 (M ) ≤ C(τ + 1) 2 + 4 .
Now observe that Vα,p0 ∈ Hτ so that
Note that Z
2
N (τ ) = Ep,p (τ ) |dVg (p)|.
M
We will obtain sharp estimates for N (τ ) via sharp estimates for Ep,p (τ ).
Theorem 11.4.1 shows that, for fixed p, Ep,p (τ ) is a temperate generalized function in
the τ -variable. Its τ -derivative is the generalized function
1 d 1X
Ep,q (τ ) = Ψk (p)Ψk (q) δ√λk (τ ) + δ−√λk (τ ) ∈ C −∞ (R).
2 dτ 2
k
Our good fortune is that we can obtain quite a bit of information about the Fourier
transform of the generalized function
1 d
τ 7→ Ep,q (τ ).
2 dτ
More precisely
h1 d i 1 X −it√λk √
Fτ Ep,q (τ ) (t) = Kt := e + eit λk Ψk (p)Ψk (q)
2 dτ 2
k
X p
= cos t λk Ψk (p)Ψk (q).
k
We will devote the next subsection to a more in-depth investigation of Kt . But, before we
do so, we want to give an explicit description of the spectral function in a special case.
586 CHAPTER 11. SPECTRAL GEOMETRY
Example 11.4.3. (Spectral function of a flat torus.) Denote by Tm the flat torus
Tm := Rm /(2πZ)m
|~k|2 , ~k = (k1 , . . . , km ) ∈ Zm .
We have
1 2 X 2 X
ER2 (~θ, ϕ
~) = + sinh~k, θi
~ sinh~k, ϕ
~ i + cosh~k, ~θi cosh~k, ϕ
~i
(2π)m (2π)m (2π)m
|~k|≤R |~k|≤R
~k≻~0 ~k≻~0
1 2 X 1 X ~ ~
= + cosh~k, θ~ − ϕ
~ i = eihk,θ−~ϕi .
(2π)m (2π)m (2π)m
|~k|≤R |~k|≤R
~k≻~0
Hence X
1 ~ ~
ER2 (~
θ, 0) = eihk,θi .
(2π)m
|~k|≤R
~ and ~x ∈ Rm we define
For every θ,
~
fθ~ (x) = w(|~x|2 )eih~x,θi
so that fθ~ ∈ S (Rm ). Hence
X
1 1~
E¯w,R (~θ) = f θ~ R
k .
(2πR)m
~k∈Zm
Proof. To prove Poisson’s formula is suffices to consider the case a = 1. The general case
follows from this case applied to the Schwartz function
1
φa (~x) = φ ~x .
a
Consider the (2πZ)m -periodic function
X
Φ : Rm → C, Φ(~x) = φ ~x + 2π~k .
~k∈Zm
This can be viewed as a smooth function on the torus Tm and, as such, it admits a Fourier
(spectral) decomposition
X Z
1 ~
Φ(~x) = m
C~k e ν ,~
ih~ xi
, C~ν = Φ(~θ)e−ih~ν ,θi |dεθ|.
(2π) m [0,2π]m
~
ν ∈Z
588 CHAPTER 11. SPECTRAL GEOMETRY
The above series converges in C ∞ . The sum in the left-hand side of (11.4.59) is Φ(0). We
have
1 X
Φ(0) = C~ν .
(2π)m m ~
ν ∈Z
⊔
⊓
Now let
~
φ(~x) = eih~x,θi w(|~x|2 ) .
| {z }
=:u(~
x)
We deduce that
b =u
φ(ξ) ~
b(ξ − θ).
Hence
1 X
E¯w,R (~
θ) = b(2πR~ν − ~θ)
u
(2π)m m~
ν ∈Z
1 1 X
= b(−~
u θ) + u ~
b(2πR~ν − θ).
(2π)m (2π)m
ν ∈Zm \0
~
1
E¯w,R (~
θ) = b(~θ) + O(R−p ), as R → ∞,
u (11.4.60)
(2π)m
uniformly on the compacts of the ~ θ-spaces. Note that (11.4.57) is formally a special case
of (11.4.60) in which u is the characteristic function of the unit ball in the ~x-space.
where X
kuk2L2 (M ) = |uk |2 .
k≥0
We set √ X p
cos t ∆ u = cos t λk uk Ψk . (11.4.61)
k≥0
Note that √ X p
k cos(t ∆)uk2L2 (M ) = cos2 (t λk )|uk |2 ≤ kuk2L2 (M )
k≥0
√
so that the operator cos(t ∆) is indeed well defined and bounded L2 (M ) → L2 (M ).
Note that if u ∈ C ∞ (M ), then for any N ∈ Z≥0 we have
X
∆N u = λN
k uk Ψ k
k≥0
and we deduce
√ √ X p
cos(t ∆)∆N u = ∆N cos(t ∆)u = λN
k cos(t λk )uk Ψk (p).
k≥0
Theorem 11.4.1 shows that the above series is absolutely convergent for any η ∈ S (R).
This generalized function is the Fourier transform of the temperate generalized function
1 d 1X
Ep,q (τ ) = Ψk (p)Ψk (q) δ√λk (τ ) + δ−√λk (τ ) ∈ S ′ (R). (11.4.62)
2 dτ 2
k
Thus, an approximation for Kt will yield an approximation for the Fourier transform of
the derivative of the modified spectral function
1
τ 7→ (sign τ )E (τ 2 , p, q).
2
We will spend the rest of this section constructing an approximation for the wave kernel
Kt . √
If f ∈ C ∞ (M ), then the function u(t, p) = cos(t ∆)f ∈ C ∞ (R × M ) satisfies the
initial value problem
(∂t2 + ∆)u = 0, on (0, ∞) × M, (11.4.63a)
u(0, p) = f (p), ut (0, p) = 0, ∀p ∈ M. (11.4.63b)
We deduce that the kernel Kt defines a smooth map
(0, ∞) × M ∋ (t, p) 7→ Kt (p, q) ∈ C −∞ (M ).
The a priori estimates described in our next result will play a key role in our search for
an approximation for Kt .
Proposition 11.4.5. Let v ∈ C ∞ ([0, 1] × M ) be a solution of the non-homogeneous wave
equation
(∂t2 + ∆)v = h on [0, 1] × M
(11.4.64)
v(0, p) = vt (0, p) = 0, ∀p ∈ M.
Assume that
∂tj h(0, p) = 0, ∀p ∈ M, j < k. (11.4.65)
Then there exists Ck > 0 independent of h such that for any t ∈ (0, 1) we have
k+1
X
∂tk+1−j v(t, −) L2j (M )
≤
j=0
(11.4.66)
Z t k−1
X
Ck ∂sk h(s, −) L2 (M ) ds + ∂tk−1−j h(t, −) L2 (M ) .
0 j=0
11.4. SPECTRAL GEOMETRY 591
Z t
(11.4.67 )
= C kh(s, −)kL2 ds + k∂t2 v(t, −)kL2 + kh(t, −)kL2 .
k k
0
N
X 1
∂t2 +∆x 2k
Uk (x, y)R+ t, sy (x) = |gy |− 2 δy +(∆x UN (x, y) )R+
2N
t, sy (x) . (11.4.69)
|k=1 {z }
=:EN (t,x,y)
We need to explain the above construction. The generalized function EN defines a smooth
map
(0, ∞) × M ∋ (t, x) 7→ EN (t, x, −) ∈ C −∞ (M ),
and thus
WfN (t, x) = EN (t, x, −), f |dVg | .
Using (11.3.19) we deduce that
Z
Wf (0+ , x) = 0, ∂t Wf (0+ , x) = 2 +
U1 (x, y)∂t R+ (0 , sy (x) )f (y)|dVg (y)|
M
(11.3.46 ) 1
= |gx |− 2 δx , U1 (x, −)f (−)|dVg | = U1 (x, x)f (x) = f (x).
Observe that Z
(∂t2 + ∆)Wf (t, x, y) = SN (t, x, y)f (y)|dVg (y)|,
M
where
2N
SN (t, x, y) = (∆x UN (x, y) )R+ t, sy (x) .
11.4. SPECTRAL GEOMETRY 593
Thus,√the function ∂t Wf satisfies the same initial conditions (11.4.63b) as the function
cos(t ∆)f . Consider their difference, i.e., the function
√
Vf (t, x) = VfN (t, x) = cos(t ∆)f − ∂t WfN (t, x).
where Z
hf = hN
f (t, x) := − ∂t SN (t, x, y)f (y)|dVg (y)|
M
Z
= ∂t SN (t, x, y)f (y)|dVg (y)|.
dist(y,x)<t
≤ C t4N −m−4−2k + λk t4N −m−4 kf k2L2 .
594 CHAPTER 11. SPECTRAL GEOMETRY
so that
kVfN (t, −)kC ν ≤ t2N −n−1−ν kf kL2 , ∀t ∈ [0, c], ∀N ≫ 0.
We deduce similarly that
where
Z Z
j
h∆j f := − ∂t SN (t, x, y)∆ f (y)|dVg (y)| = − ∆jy ∂t SN (t, x, y)f (y)|dVg (y)|
M M
Z
=− ∆jy ∂t SN (t, x, y)f (y)|dVg (y)|.
dist(y,x)<t
Arguing as before we deduce that for any k, j ∈ Z≥0 and N sufficiently large,
We deduce that for any multi-index α and any j ≥ 0 there exist N = N (α, j) and C > 0
such that for any N ≥ N (α, j), f ∈ C ∞ (M ) and any t ∈ (0, c) we have
α
sup ∂t,x TN (t, x, y) , ∆jy f (y)|dVg (y)| ≤ Ct2N −n−1−2j−|α|kf kL2 . (11.4.72)
x∈M
11.4. SPECTRAL GEOMETRY 595
From the above inequality we deduce that for any (t, x) the generalized function ∆y TN (t, x, y)
is represented by a function in L2 (M ) for N sufficiently large. From elliptic regularity we
deduce that for N sufficiently large and any (t, x) the function y 7→ TN (t, x, y) belongs to
the Sobolev space L2j,2 and
α
k∂t,x ∆jy TN (t, x, y)kL2 ≤ Ct2N −n−1−2j−|α| .
Using elliptic estimates for ∆y and Lemma 11.4.2 we deduce that for any k, ν ∈ Z≥0 there
exist N (k, ν) and C > 0 such that, for any N ≥ N (ν, k), any x ∈ M , t ∈ [0, c], and any
multi-index |α| ≤ k we have
α
k∂t,x Tn (t, x, −)kC ν ≤ Ct2N −n−1−|α|−ν .
This implies that for any µ ∈ Z≥0 there exists N = N (µ) and C > 0 such that for any
N > N (µ) we have TN (t, x, y) ∈ C µ ( (0, c) × M × M ) and for any j ≤ µ we have
We set
Ee(t, x, y) := EN (t, x, y) − EN (−t, x, y),
eN (t, x, y) := Kt (x, y) − ∂t EeN (t, x, y).
T
We can rewrite (11.4.75) as
eN (t, −, −)
∂tj T ≤ C|t|2N −n−1−µ , |t| ≤ c, j ≤ µ, N ≥ N (µ). (11.4.76)
C µ−j (M ×M )
Set
n := 1 + m = 1 + dim M, η(x, y) := dist(x, y)2 ,
a− n a− n
H2a (t, x, y) := dm (2a)∂t χ+ 2 t2+ − η(x, y) − χ+ 2 t2− − η(x, y)
(11.4.77)
2a 2a
= ∂t R+ t, sy (x) − R+ −t, sy (x) .
eN (t, y, y) ∈ R.
(−c, c) → T
N
X
=2 Uk (y, y)dm (2k)|χ|2k−1−n (t).
k=1
N
2 X Γ(k − n2 )
= m Uk (y, y)|χ|2k−1−n (t).
π 2 k=1 Γ(2k − 1 − n)
We have thus proved the following result.
Corollary 11.4.7. For any y ∈ M the restriction of Kt (x, y) to the line
R × {(y, y)} ⊂ R × M × M
meaning that for any µ > 0 there exists N (µ) > 0 such that for N ≥ N (µ) we have
N
X
eN (t, y, y) = Kt (y, y) − 2
T Uk (y, y)dm (2k)|χ|2k−1−n (t) ∈ C µ ((−c, c), ×M ).
k=1
and
eN (t, y, y) = Cj |t|2N −n−1−j , |t| ≤ c, j ≤ µ; |α| ≤ µ − j
∂tj ∂yα T
Theorem 11.4.8. There exists a constant K > 0 such that, for any y ∈ M we have
X 1
Ψk (y)Ψk (y) − ω m τ m ≤ Kτ m−1 , as τ → ∞. (11.4.80)
(2π)m
λk ≤τ 2
In particular
ω m Rm
Ng (R2 ) = volg (M ) + O R m−1
as R → ∞. (11.4.81)
(2π)m
1
µ(τ ) = Ey,y (τ ), ν(τ ) = ω m |τ |m sign(τ ),
(2π)m
where Ex,y (τ ) is the modified spectral function defined in (11.4.56). We will prove that
µ(τ ) − ν(τ ) = O τ m−1 as τ → ∞. (11.4.82)
We will obtain information about Ey,y (τ ) by recalling that, for every x, y, the Fourier
transform in the temporal variable of 21 dµ
dt is the wave kernel Kt (x, y).
On the other hand, according to (11.3.31) the Fourier transform 12 dτ dν
is the first term
in the asymptotic expansion (11.4.79) of Kt for t near zero.
To put this fortunate coincidence to good use requires some technical skill and imagi-
nation. What follows is inspired from [67, Sec.17.5].
Choose a positive function φ(τ ) ∈ S (R) such that
φ = |ψ|2 ∗ |ψ|2 ,
where
Observe that 2πψ(−τ ) is the Fourier transform of the function ψb ∈ C0∞ (R). We deduce
that it is real analytic5 thus its zero set is discrete. In particular
Then d 2
b = |ψ|
φ(t) 2 (t) ≥0
and Z 2
φ(0) d2 (0)2 =
b = |ψ| 2
|ψ(τ )| dτ = 1.
R
Fix a cut-off function φ satisfying (11.4.83). For any a ≫ 0 we set
1 τ
φa (τ ) := φ
a a
ca (t) = φ(at)
so that φ b has support in [− ac , ac ]. We deduce that for any a > 0 we have
∞
X
ca (t)Kt ∼ 2
φ ca (t)|χ|2k−1−n (t),
Uk (y, y)dm (2k)φ (11.4.84)
k=1
where the constant C is independent of a. Here we can be even more specific. It suffices
to choose N such that
m l mm
2N − 1 − n ≥ 0⇐⇒N ≥ 1 + ⇐⇒N ≥ N (m) := 1 + .
2 2
Hence
eN (t, y, y) ≤ C|t|2N (m)−n−1 ≤ C|t|m , |t| ≤ min(1, c).
ca (t)T
φ
As we know, the inverse Fourier transform of Kt is the generalized function
1 d X
Ep,q (τ ) = Ψk (p)Ψk (q) δ√λk (τ ) + δ−√λk (τ ) ∈ C −∞ (R).
2 dτ
k
For this reason we plan to take the inverse Fourier transform of the equality
X
ca (t)Kt = 2
φ ca (t)|χ|2k−1−n (t)
Uk (y, y)dm (2k)φ
2≤2k<n+1
ca (t)T
+φ eN (m) (t, y, y).
5
The Fourier-Laplace transform of a smooth compactly supported function is an entire function.
11.4. SPECTRAL GEOMETRY 599
Using the identity (11.2.11b) we deduce that the inverse Fourier transform of |χ|2k−1−n (t)
is the function
22k−1−n n−2k 22k−1−n m+1−2k
√ |χ| (τ ) = √ |χ| (τ ).
π π
Hence
1 d X 22k−m−1 dm (2k)
(φa ∗ Ey,y (τ ) ) = Uk (y, y) √ φa ∗ |χ|m+1−2k (τ )
2 dτ π
2≤2k≤n (11.4.85)
h i
+ F −1 φ eN (m) (t, y, y) ,
ca (t)T
Z τc
C a C
≤ sm ds ≤ .
|τ |m+1 0 am+1
Next, we need to estimate terms of the form φa ∗ |χ|p (τ ), p ∈ Z>0 . In other words we need
to estimate functions of the form
Z
1 τ − s p
fp (τ ) := φ |s| ds.
a R a
600 CHAPTER 11. SPECTRAL GEOMETRY
Note that Z Z
p p p
fp (aτ ) = a φ(τ − s)|s| ds = a |τ − s|p φ(s)ds.
R R
Using the inequality
|τ − s|p ≤ (|τ | + |s|)p ≤ 2p−1 |τ |p + |s|p ,
we deduce
Z Z
|fp (aτ )| ≤ 2p−1
(a|τ |)
p
φ(s)ds +ap |s|p φ(s)ds
.
R R
| {z }
=1
so that p
|fp (τ )| ≤ Cp |τ | + a , ∀τ. (11.4.87)
Using (11.4.86) and (11.4.87) we deduce that there exist positive constants K0 , K1 , inde-
pendent of y, such that
dν
≤ K0 |τ |m−1 , ∀τ, (11.4.88a)
dt
d m−3
φa ∗ (µ − ν)(τ ) ≤ K1 |τ | + a , ∀τ. (11.4.88b)
dt
Arguing as in the proof of (11.4.87) we deduce
d m−1
φa ∗ ν(τ ) ≤ K2 |τ | + a .
dt
Set
ε0 := min φ(τ ) > 0.
|τ |≤ 21
Hence Z τ + a2
−1
ε0 a dµ ≤ dµ ∗ φa (τ ) = (dµ − dν) ∗ φa (τ ) + dν ∗ φa (τ )
τ − a2
m−3 m−1
≤ K1 |τ | + a + K2 |τ | + a .
We deduce that for any b ∈ [0, a] we have
Z τ
µ(τ ) − µ(τ − b) ≤ µ(τ ) − µ(τ − a) ≤ dµ
τ −a
m−3 m−1
3a 3a
≤ K1 |τ | + + K2 |τ | + .
2 2
11.4. SPECTRAL GEOMETRY 601
For s > 0, we divide (0, s) into ν ≤ s + 1 intervals of length ≤ 1 and we deduce from the
above inequality that
!
3a m−3 3a m−1
µ(τ ) − µ(τ − as) ≤ (s + 1) K1 |τ | + + K2 |τ | + .
2 2
Similarly
m−3 m−1 !
3a 3a
µ(τ + as) − µ(τ ) ≤ (s + 1) K1 |τ + as| + + K2 |τ + as| +
2 2
Observe that
Z Z
1 τ −s
µ ∗ φa (τ ) = µ(s)φ ds = µ(τ − as)φ(s)ds.
a R a R
Hence Z
|µ(τ ) − µ ∗ φa (τ )| = (µ(τ ) − µ(τ − as))φ(s)ds
R
Z
≤ (|s| + 1) K3 ( |τ | + a(|s| + 1) )m−3 + K4 (|τ | + |a|(|s| + 1) )m−1 φ(s)ds
R
≤ K5 |τ |m−3 + |τ |m−1 + am−3 + am−1 = O |τ |m−1 .
Arguing similarly we deduce that
|ν(τ ) − ν ∗ φa (τ )| ≤ K6 |τ |m−1 + am−1 = O |τ |m−1 .
Hence
1/τ
then Ew coincides with the spectral
√ function we have investigated earlier. Note also that
2
when w(x) = e−x , then w√t ( ∆) = e−t∆ . This is the so called heat operator.
In this section, building on the results developed so far we present in some detail
information about the behavior along the diagonal of this kernel as ε ց 0 when w is a
Schwartz function.
Theorem 11.4.9. Suppose that w ∈ S (R) is an even, nonnegative Schwartz function,
and (M, g) is a smooth, compact, connected m-dimensional Riemann manifold. We define
X p
E ε = Ewε : M × M → R, Ewε (p, q) = w(ε λk )Ψk (p)Ψk (q),
k≥0
|α|−|β| Z
−m−2d(α,β) i
∂xα ∂yβ E ε (x, y)|x=y=0 =ε w(|x|)x α+β 2
dx + O(ε ) , (11.4.89)
(2π)m Rm
where
|α + β|
d(α, β) := .
2
Moreover, the constant implied by the symbol O(ε) in (11.4.89) uniformly bounded with
respect to p0 .
11.4. SPECTRAL GEOMETRY 603
1
wε = εr |χ|r , w
|χ|r , ρc b + O(εN ) as ε → 0.
ε
(b) For every positive integer r we have
√ 1−r Z
−r π2
h|χ| , wi
b = |τ |r−1 w(τ )dτ.
Γ( 2r ) R
Proof. (a) For transparency we will use the integral notation for the pairing between a
generalized function and a test function. We have
Z Z
r 1 r
|χ| , ηcwε = |χ| (t)ρ(t)w(t/ε)dt
b = |χ|r (εt)ρ(εt)w(t)dt
b
ε R R
Z
r
=ε |χ|r (t)ρ(εt)w(t)dt
b = εr h|χ|r , ρε wi,
b ρε (t) = ρ(εt).
R
b−w
Now observe that ρε w b = w(ρ
b ε − 1) → 0 in S (R). More precisely for k ≥ 0 we have
∂k
(ρε − 1) = O(εN tN ) as ε → 0.
∂k
This implies that
|χ|r , w(ρ
b ε − 1) = O(εN ) as ε → 0,
so that
h|χ|r , ρε wi
b = h|χ|r , wi+
b |χ|r , w(ρ
b ε − 1) = h|χ|r , wi
b + O(εN ) as ε → 0.
(b) We have
[ (11.2.11b ) √ 1−r
h|χ|−r , wi
b = |χ|−r , w = π2 |χ|r−1 (τ ), w(τ )
√ Z
π21−r
= |τ |r−1 w(τ )dτ.
Γ( 2r ) R
⊔
⊓
|~k|2 , ~k = (k1 , . . . , km ) ∈ Zm .
Then X
1 ~ ~
E ε (~
θ, ϕ
~) = m
w(ε|~k|)eihk,θ−~ϕi ,
(2π)
~k∈Zm
so that
i|α|−|β| X ~ ~
∂θ~α ∂ϕβ~ E ε (~
θ, 0) = m
wε (|~k|)~kα+β eihk,θi .
(2π)
~k∈Zm
Define
Wm , uε : Rm → R, Wm (x) = w(|x|), uε (x) = Wm (εx)xα+β .
Using Poisson’s summation formula (11.4.59) we deduce
i|α|−|β| X
∂θ~α ∂ϕβ~ E ε (0, 0) = bε (2π~ν ).
u
(2π)m m~
ν ∈Z
Observe that
Z Z
−ihξ,xi α+β α+β −ihξ,xi
bε (ξ) =
u e w(ε|x|)x dx = (i∂ξ ) e Wm (εx)dx
Rm Rm
Z
−m α+β −ih 1ε ξ,yi −m cm 1
=ε (i∂ξ ) e Wm (y)dy =ε (i∂ξ )α+β W ξ .
Rm ε
Hence
i|α|−|β| X 1
∂θ~α ∂ϕβ~ E ε (~
θ, 0) = m
(i∂ξ )α+β c
Wm ξ .
(2πε) m
ε ξ=2π~
ν
~
ν ∈Z
606 CHAPTER 11. SPECTRAL GEOMETRY
As ε → 0 we have
i|α|−|β|
∂θ~α ∂ϕβ~ E ε (0, 0) = ε−m−|α+β| (i∂ξ )α+β c
W m (0) + O(εN
) , ∀N.
(2π)m
Qk α +β +1
Z 2 i=1 Γ( i 2 i ) , α + β ∈ (2Z )m ,
≥0
xα+β dA(x) = Zm,α,β
m+|α+β|
:= Γ( 2
) (11.4.99)
|x|=1
0, otherwise.
i|α|−|β|Zm,α,β
Dm,α,β = ε−m−|α+β| Im,α,β (w). (11.4.100)
(2π)m
⊔
⊓
1. The spectrum of ∆g
spec(∆g ) ⊂ [0, ∞)
11.4. SPECTRAL GEOMETRY 607
⊔
⊓
To use a terminology dear to the late great Vadimir Arnold (1937–2010), the spectral
perestroika question asks how much of the geometry of the Riemann manifold (M, g) can
be recovered from its spectral data.
Note that the multiplicity n(λ) of λ ∈ spec(∆g ) is computable from the spectral data
as the dimension of the range of P λ . The operator P λ is an integral operator whose kernel
K λ (p, q) is given by
n(λ)
X
K λ (p, q) = Φj (p)Φj (q),
j=1
where (Ψj )1≤j≤n(λ) is any orthonormal basis of ker(λ − ∆g ) consisting of real valued
eigenfunctions. This means that
Z
(P λ u)(p) = K λ (p, q)u(q) |dVg (q)|.
M
0 = λ 0 < λ1 ≤ λ 2 ≤ · · · ,
where each eigenvalue appears exactly as many times as its multiplicity. Fix an orthonor-
mal basis (Ψk )k≥0 of L2 (M ) such that
∆Ψk = λk Ψk , ∀k ≥ 0
Then X
K λ (p, q) = Ψk (p)Ψk (q).
λk =λ
We can think of w(t) as a smooth approximation of the indicator function of the interval
[−1, 1]; see Figure 11.3.
As in the previous subsection we set
X p X √
Ewε (p, q) := w ε λk Ψk (p)Ψk (q) = w ε λ K λ (p, q).
k≥0 λ∈spec(∆)
The above sum consists of finitely may terms since it has only contributions from eigen-
values < ε−2 so it has O(ε−m ) terms.
608 CHAPTER 11. SPECTRAL GEOMETRY
Fix a point p ∈ M and normal coordinates (xi ) near p. From (11.4.89) with |α| =
|β| = 1 we deduce
Z
ε −m−2 1 i j 2
∂xi ∂yj E (x, y)|x=y=0 = ε w(|x|)x x dx + O(ε )
(2π)m Rm
δij Z
−m−2
=ε w(|x|)(x1 )2 dx +O(ε2 ) .
(2π)m Rm
| {z }
=:Bm (w)
The positive constant Bm (w) can be determined using (11.4.100). More generally, for two
vector fields X1 , X2 ∈ Vect (M ) we have
X
X1 (x)X2 (y)E ε (x, y)|x=y=0 = X1i ∂xi X2j (y)∂yj E ε (x, y)|x=y=0
i,j
1 X
j
= ε−m−2 δij X i
1 (0)X 2 (0) B m (w) + O(ε2
) (11.4.101)
(2π)m
i,j
1
= ε−m−2 gp X 1 (p), X 2 (p) B m (w) + O(ε2
) .
(2π)m
For X1 , X2 ∈ Vect (M ) fixed, the above estimate is uniform in p ∈ M . We set
X
D(ε) := dim ker(λ − ∆g ).
λ<ε−2
uniformly in p ∈ M . This proves that for ε > 0 sufficiently small, the map F ε is an
immersion and the metric gε on M induced by F ε is C 0 -close to g.
To prove that F ε is injective for ε > 0 sufficiently small we argue by contradiction.
Assume that this is not true. This means that for any n ∈ N there exist points pn , q n ∈ M
such that
Ψk (pn ) = Ψk (q n ), ∀k = 1, . . . , n.
After eventually extracting subsequence we can assume that the sequences (pn )n∈N and
(q n )n∈N are convergent
p := lim pn , q := lim q n .
n→∞ n→∞
Note that
Ψk (p) = Ψk (q), ∀k ∈ N. (11.4.103)
The vector space spanned by (Ψk )k∈N is dense in C 0 (M ) so (11.4.103) implies
f (p) = f (q), ∀f ∈ C 0 (M ).
dΨk (V )p 6= 0. (11.4.104)
Note that, for large n, γ n is the unique unit speed minimal geodesic connecting pn to q n .
Applying the mean value theorem to the function
fk (t) = Ψk γn (t) , t ∈ [0, δn ]
dΨk (V )p = 0.
εm+2 (2π)m ε
K ε = Kwε : M × M → R, K ε (p, q) = Ew (p, q) (11.4.106)
Bm (w)
From the above corollary we deduce that the volume measure dVgε converges weakly
to the volume measure dVg . We can say a bit more. Fix g-normal coordinates x near a
point p ∈ M . In these coordinates the metric gε is described by the matrix
ε
gij = δij + O(ε2 ). (11.4.107)
The formula (4.2.23) describing the Christoffel symbols in terms of correlators coupled
with (11.4.107) implies that
kAε (p)k = O(ε2 )
11.4. SPECTRAL GEOMETRY 611
Example 11.4.15. Let us assume that dim M = 2 and in the neighborhood of a point
the metric g is Euclidean. Suppose that on the same neighborhood gε is conformal to g
p
gε = efε (r) g, r = x2 + y 2 , fε ∈ C ∞ (R).
We deduce Z Z
ε t/ε
fε (ε) = ε dt w(ds)ds ≤ Iε2 ,
0 0
Z r/ε Z r/ε
ε 1
∆g fε = −w(r/ε) + w(s)ds = w(s)ds − w(r/ε) ≥ 0.
r 0 r/ε 0
This shows that the scalar curvature function sε does not converge uniformly to 0, the
scalar curvature of the flat Euclidean metric.
As a curiosity, note that
ε
sε (r) = 0, ∀r ≤ .
2
This shows that the metric gε is flat on the disk of radius ε/2 centered at the origin, but
becomes very curved along tiny circle of radius ε centered at the origin. ⊔
⊓
612 CHAPTER 11. SPECTRAL GEOMETRY
We want to investigate the behavior of the Riemann curvature tensor of the metric
gε . Fix a point p and g-normal coordinates (xi ) at p. Then, the local frame ∂xi is nearly
gε -normal in the following sense
(11.4.107 )
gε (∂xi , ∂xj )p = δij + O(ε)2 ,
(11.4.108 )
∇ε∂ i ∂xj p
= O(ε2 ).
x
Denote by Rε the Riemann curvature tensor of gε . Using the above estimates and (4.2.24)
we deduce
Theorem 11.4.16. Fix a point p ∈ M and normal coordinates (xi ) near p. For any
multi-induces α, β ∈ (Z≥0 )m we set
ε
Eα;β := ∂xα ∂yβ E ε (x, y)|x=y=0 .
Then there exists a universal constant Zm that depends only on the dimension of M and
the weight w such that
ε ε
Eii;jj − Eij;ij = Zm Rijij (p)ε−m−2 1 + O(ε2 ) as ε → 0. (11.4.109)
In particular, the above equalities show that the 1st and 3rd order derivatives of Ha are
O(1). We have
∂x21 ∂y22 (Uk Hk ) = ∂x21 ∂y22 Uk Hk + 2∂y2 Uk ∂y2 Hk + Uk ∂y22 Hk
= ∂x21 ∂y22 Uk Hk + ∂y22 Uk ∂x21 Hk ) + ∂x21 Uk ∂y22 Hk ) (11.4.117)
+4(∂x21 y2 Uk )(∂x21 y2 Hk ) + Uk ∂x21 ∂y22 Hk + O(1),
where the coefficients Tkj are degree j polynomials in the derivatives of Uk and η at the
point (x, y) = (0, 0). Using (11.4.116) we deduce
Tk4 = Tk3 = 0 .
614 CHAPTER 11. SPECTRAL GEOMETRY
Moreover, in view to (11.4.116), the terms in Tk2 are due only to the 4-th order derivatives of
Hk . Upon inspecting (11.4.117) and (11.4.118) (pay attention to the boxed terms in these
equalities) we see that the 4th order derivatives of Hk are multiplied by Uk . According to
(U k,h ) the function Uk is O(1) if k > 1. Hence Tk2 = 0 for k > 1. We deduce
∞
X
Ktii,jj − Ktij,ij ∼ dm (2k) Tk0 Hk + Tk1 Hk−1 + Tk2 Hk−2 |x=y=0
k=1
where the coefficients Am,α,β,0 are defined as in (11.4.94). Using (11.4.96) and (11.4.98)
we deduce
B−1 = 0.
To compute T11 we observe first that
X
η(x − y) = (xi − y i )2 + higer order terms. (11.4.119)
i
Using (11.4.113) we can simplify (11.4.117) and (11.4.118) in the case k = 1 as follows.
∂x21 ∂y22 (U1 H1 ) = ∂x21 ∂y22 U1 H1 + U1 ∂x21 ∂y22 H1 + O(1), (11.4.120)
∂x21 x2 ∂y21 y2 (U1 H1 ) = ∂x21 x2 ∂y21 y2 U1 H1 + ∂x21 y1 U1 ∂x22 y2 H1
(11.4.121)
+∂x22 y2 U1 ∂x1 y1 H1 + U1 ∂x21 x2 ∂y21 y2 H1 + O(1).
Using (11.4.113), (11.4.115) and (11.4.119) we deduce that
T11 = ∂x21 ∂y22 − ∂x21 x2 ∂y21 y2 η|(0,0)
+2 ∂x21 U1 + ∂y22 U1 |(0,0) + 2 ∂x21 y1 U1 + ∂x22 y2 U1 |(0,0) .
Using the transport equation (U 1,h ) with h given by (11.3.47) we deduce that
These conditions imply that the Hessian of U1 (x, y) at (0, 0) is a quadratic form in the
variables ui = (xi − y i ) so that
Hence
T11 = ∂x21 ∂y22 − ∂x21 x2 ∂y21 y2 η|(0,0) .
11.4. SPECTRAL GEOMETRY 615
The equality (11.4.109) now follows from the above equality by invoking (11.3.26), the
definition (11.4.77) of Hk and Lemma 11.4.10. ⊔
⊓
We can now tie all the loose ends together. Fix a point p ∈ M and g-normal coordinates
(xi ) at p. We denote by Rε the Riemann curvature of the metric gε determined by the
correlator (11.4.106) and by Kij ε the sectional curvature of the metric g along the plane
ε
spanned by ∂xi , ∂xj ,
ε (p)
Rijji
ε
Kij (p) = .
areagε (∂xi , ∂xj )
From Proposition 11.4.13 and Theorem 11.4.16 we deduce that there exists a constant
L = Lm (w) that depends on the dimension m and the function w, but it is independent
of the manifold M and the metric g such that
ε
lim Kij (p) = Lm Kij (p).
εց0
deduce that the correlator K ε (p, q) is invariant with respect to the diagonal action of
SO(m + 1) on S m × S m . This shows that the metric gε is SO(m + 1)-invariant. Since
the action of SO(m + 1) on S m is transitive we deduce that there exists a constant cε > 0
such that
gε = cε g.
616 CHAPTER 11. SPECTRAL GEOMETRY
We can say a bit more. We want to show that the spectral data determine the
whole Riemann curvature tensor of g. Fix a point p ∈ M and denote by Gr2 (Tp M )
the Grassmannian of 2-planes in Tp M and by S+ 2 (T M ) the cone of inner products on
p
Tp M . For any Riemann metric h on M the sectional curvature at p is a continuous
function Kph : Gr2 (Tp M ) → R.
There exists a continuous function
R : C 0 Gr2 (Tp M ) × S+ 2
(Tp M ) → Λ2 Tp∗ M ⊗ Λ2 Tp∗ M, (K, h) 7→ R(K, h),
described by an explicit formula6 in [28, Eq. (1.12)], such that, for any Riemann metric g
on M , the quantity
R Kpg , gp ) ∈ Λ2 Tp∗ M ⊗ Λ2 Tp∗ M
coincides with the Riemann curvature tensor of g at p.
The sectional curvatures Kpε of the metric gε at p converges to the sectional curvature
g p p ε
p and gε → g . Denote by Rp the Riemann curvature of gε at p. We deduce that
Corollary 11.4.18. The spectral data determine the Riemann curvature of the original
metric g. ⊔
⊓
Remark 11.4.19. (a) Using the very deep results in [5, 109] one can show that the C 0 -
convergence of the metrics gε coupled with the convergence of sectional curvatures imply
that gε converges to g in C ∞ . We can rephrase this by saying that, for ε > 0 sufficiently
small, the map F ε : M → RD(ε) defined in (11.4.102) is a nearly isometric embedding.
(b) Let us observe that for any even, nonnegative temperate function w ∈ S (R) we can
define a map
s
Fw X εm+2 (2π)m p
M ∋ p 7−→ε
w ε λk Ψk (p)Ψk ∈ L2 (M, g)
Bm (w)
k≥0
When w has compact support, the range of this map is contained in a finite dimensional
subspace.
2
The case w(r) = e−r was first investigated in [17] where, among many other things,
the authors show that for small ε > 0 the map F w
ε is a nearly isometric embedding of M
in the Hilbert space L2 (M, g). The image of this embedding is not be contained in any
finite dimensional subspace.
6
The explicit formula of R is rather complicated, but it is polynomial in K and h so it is continuous.
11.4. SPECTRAL GEOMETRY 617
This is a rather unexpected conclusion: we can learn the “shape” of the manifold by
investigating how the heat propagates or, equivalently, how the diffusion takes place. ⊔
⊓
Remark 11.4.20. (a) As explained in Remark 4.2.30, the correlators associated to em-
beddings have probabilistic interpretations. For example, when w is as in Figure 11.3, the
correlator Ewε (p, q) is associated to random functions on M of the form
X
Uε := Xkε Ψk ,
k≥0
where (Xkε )k≥ are independent random variables with mean zero and variances
εm+2 (2π)m p
variance (Xkε ) = w(ε λk ).
Bm (w)
As ε ց 0 this random function converges in a precise sense to the “perfect chaos”, the
so called called Gaussian white-noise. A plastic way of phrasing the conclusions of the
previous results is that there is a pattern in devolution to chaos which is very sensitive to
the background geometry.
(b) For every p ∈ M we have a random symmetric bilinear form
Gε := dU ε (p) ⊗ dU ε (p) : Tp M × Tp M → R.
For ε > 0 small, its expectation is the metric gε induced by the embedding F ε defined
in (11.4.102). We are inclined to believe that the random variable Gε converges almost
surely and L1 to the (deterministic) g. In other words Gε is highly concentrated around
its mean as ε ց 0.
(c) Here is another little gem hidden under the probability cloak. Assume M oriented
and even dimensional m = dim M = 2h. The differential of the random function Uε is a
random section dU ε of the cotangent bundle. One can show that, with probability 1, dUε
intersects the zero section of T ∗ M transversally, i.e., dUε has nondegenerate zeros. To
618 CHAPTER 11. SPECTRAL GEOMETRY
such a nondegenerate zero p we can associate a local index ǫ(Uε , p) = ±1 (see Exercise
7.3.47) and from the Poincaré-Hopf formula in Corollary 7.3.48 we deduce that
X
ǫ(Uε , p) = χ(M ). (11.4.124)
dUε (p)=0
In fact, above we have produced quite a bit more, namely a 0-dimensional current
X
ZdUε = ǫ(Uε , p)δp ∈ Ω0 (M ).
dUε (p)=0
The ”integral” of a given 0-form f over this random current is the random variable
X
f, ZdUε := ǫ(Uε , p)f (p).
dUε (p)=0
The expectation of this random variable has a rather striking geometric description [106].
More precisely we have Z
E f, ZdUε = f e(∇gε ), (11.4.125)
M
where e(∇gε ) is the Euler form associated to the Levi–Civita connection of the metric gε ,
1
e(∇gε ) = Pf − Rε ,
(2π)m/2
This is the Gauss–Bonnet theorem for the metric gε . On the other hand, we know that Rε
converges uniformly to the Riemann curvature R of the original metric g. If we let ε ց 0
we obtain the Gauss–Bonnet theorem for the original metric g!
This “accident” is a manifestation of a more general phenomenon. In particular, the
general Gauss–Bonnet theorem has a probabilistic interpretation. For details we refer to
[105, 106]. ⊔
⊓
Chapter 12
Dirac Operators
We devote this last chapter to a presentation of a very important class of first order
elliptic operators which have numerous applications in modern geometry. We will first
describe their general features, and then we will spend the remaining part discussing some
frequently encountered examples.
D : C ∞ (E) → C ∞ (E),
Note that the Dirac operators are first order elliptic p.d.o.-s.
Example 12.1.2. (Hamilton–Floer). Denote by E the trivial vector bundle R2n over
the circle S 1 . Thus C ∞ (E) can be identified with the space of smooth functions
u : S 1 → R2n .
Let J : C ∞ (E) → C ∞ (E) denote the endomorphism of E which has the block decompo-
sition
0 −1Rn
J=
1Rn 0
619
620 CHAPTER 12. DIRAC OPERATORS
Example 12.1.3. (Cauchy–Riemann). Consider the trivial bundle C2 over the com-
plex plane C equipped with the standard Euclidean metric. The Cauchy–Riemann oper-
ator is the first order p.d.o. D : C2 → C2 defined by
u 0 −∂z u
7−→ 2 · ,
v ∂z̄ 0 v
Example 12.1.4. Suppose E is a vector bundle over the Riemann manifold (M, g), and
D : C ∞ (M ) → C ∞ (M ) is a Dirac operator. Denote by M̂ the cylinder R × M , and by ĝ
the cylindrical metric ĝ = dt2 + g on M̂ , where t denotes the coordinate along the factor
R.
We have a natural projection π : M̂ → M , (t, x) 7→ x, and we set Ê := π ∗ E, the
pullback of E to M̂ via π. A section û of Ê is then a smooth 1-parameter family of
sections u(t) of E, t ∈ R. In particular, we can define unambiguously
1
∂t û(t0 , x0 ) = lim u(t0 + h, x0 ) − u(t0 , x0 ) ,
h→0 h
where the above limit is in the t-independent vector space Ex0 . Now define
by
u(t) 0 −∂t + D u(t)
= .
v(t) ∂t + D 0 v(t)
Then D̂ is a Dirac operator called the suspension of D. Note that the Cauchy–Riemann
operator is the suspension of the Hamilton–Floer operator. ⊔
⊓
Hamilton–Floer
Example 12.1.5. (Hodge–DeRham). Let (M, g) be an oriented Riemann manifold.
Then, according to the computations in the previous chapter, the Hodge–DeRham oper-
ator
d + d∗ : Ω• (M ) → Ω• (M ).
is a Dirac operator. ⊔
⊓
12.1. THE STRUCTURE OF DIRAC OPERATORS 621
Let D : C ∞ (E) → C ∞ (E) be a Dirac operator over the oriented Riemann manifold
(M, g). Its symbol is an endomorphism σ(D) : π ∗ E → π ∗ E, where π : T ∗ M → M denotes
the natural projection. Thus, for any x ∈ M , and any ξ ∈ Tx∗ M , the operator
c(ξ) := σ(D)(ξ, x)
c : T ∗ M ⊗ E → E (ξ, e) 7→ c(ξ)e,
we conclude that
{c(ξ), c(η)} = −2g(ξ, η)1Ex ,
where for any linear operators A, B we denoted by {A, B} their anticommutator
{A, B} := AB + BA.
Definition 12.1.6. (a) A Clifford structure on a vector bundle E over a Riemann man-
ifold (M, g) is a smooth bundle morphism c : T ∗ M ⊗ E → E, such that
where for every 1-form α we denoted by c(α) the bundle morphism c(α) : E → E given
by
c(α)u = c(α, u), ∀u ∈ C ∞ (E).
The morphism c is usually called the Clifford multiplication of the (Clifford) structure. A
pair (vector bundle, Clifford structure) is called a Clifford bundle.
(b) A Z2 -grading of a Clifford bundle E → M is a splitting E = E + ⊕ E − such that,
∀α ∈ Ω1 (M ), the Clifford multiplication by α is an odd endomorphism of the superspace
C ∞ (E+ ) ⊕ C ∞ (E− ), i.e., c(α)C ∞ (E ± ) ⊂ C ∞ (E ∓ ). ⊔
⊓
Proposition 12.1.7. Let E → M be a smooth vector bundle over the Riemann manifold
(M, g). Then the following conditions are equivalent.
(a) There exists a Dirac operator D : C ∞ (E) → C ∞ (E).
(b) The bundle E admits a Clifford structure.
Proof. We have just seen that (a)⇒(b). To prove the reverse implication let
c : T ∗M ⊗ E → E
∇ : C ∞ (E) → C ∞ (T ∗ M ⊗ E),
622 CHAPTER 12. DIRAC OPERATORS
the composition
∇ c
D = c ◦ ∇ : C ∞ (E) → C ∞ (T ∗ M ⊗ E) → C ∞ (E)
Example 12.1.8. Let (M, g) be a Riemann manifold. For each x ∈ M , and ξ ∈ Tx∗ M
define
c(ξ) : Λ• Tx∗ M → Λ• Tx∗ M
by
c(ξ)ω = (eξ − iξ )ω,
where eξ denotes the (left) exterior multiplication by ξ, while iξ denotes the interior
differentiation along ξ ∗ ∈ Tx M - the metric dual of ξ. The Exercise 2.2.55 of Section 2.2.4
shows that c defines a Clifford multiplication on Λ• T ∗ M . If ∇ denotes the Levi–Civita
connection on Λ• T ∗ M , then the Dirac operator c ◦ ∇ is none other than the Hodge–
DeRham operator. ⊔
⊓
The above proposition reduces the problem of describing which vector bundles admit
Dirac operators to an algebraic-topological one: find the bundles admitting a Clifford
structure. In the following subsections we will address precisely this issue.
{ρi , ρj } = −2δij 1E .
The collection (ρi ) generates an associative subalgebra in End (E), and it is natural to try
to understand its structure. We will look at the following universal situation.
Definition 12.1.10. Let V be a real, finite dimensional vector space, and
q :V ×V →R
a symmetric bilinear form. The Clifford algebra Cl(V, q) is the associative R–algebra with
unit, generated by V , and subject to the relations
Proposition 12.1.11. The Clifford algebra Cl(V, q) exists, and is uniquely defined by
its universality property: for every linear map : V → A such that A is an associative
R-algebra with unit, and {(u), (v)} = −2q(u, v) · 1, there exists an unique morphism of
algebras Φ : Cl(V, q) → A such that the diagram below is commutative.
[ w Cl(V, q)
ı
V
[[
℄[ uΦ
A
The map ı is the composition V ֒→ A → Cl(V, q) where the second arrow is the natural
projection. We let the reader check the universality property. ⊔
⊓
Corollary 12.1.14. Let (Vi , qi ) (i = 1, 2) be two real, finite dimensional vector spaces
endowed with quadratic forms qi : V → R. Then, any linear map T : V1 → V2 such
that q2 (T v) = q1 (v), ∀v ∈ V1 induces a unique morphism of algebras T# : Cl(V1 , q1 ) →
Cl(V2 , q2 ) such that T# (V1 ) ⊂ V2 , where we view Vi as a linear subspace in Cl(Vi , qi ).
The correspondence T 7→ T# constructed above is functorial, i.e., (1Vi )# = 1Cl(Vi ,qi ) ,
and (S ◦ T )# = S# ◦ T# , for all admissible S, and T .
The above corollary shows that the algebra Cl(V, q) depends only on the isomorphism
class of the pair (V, q)= vector space + quadratic form. It is known from linear algebra
that the isomorphism classes of such pairs are classified by some simple invariants:
We will be interested in the special case when dim V = rank q = sign q = n, i.e., when q
is an Euclidean metric on the n-dimensional space V . In this case, the Clifford algebra
Cl(V, q) is usually denoted by Cl(V ), or Cln . If (ei ) is an orthonormal basis of V , then we
624 CHAPTER 12. DIRAC OPERATORS
can alternatively describe Cln as the associative R–algebra with 1 generated by (ei ), and
subject to the relations
ei ej + ej ei = −2δij .
Using the universality property of Cln , we deduce that the map
V → Cl(V ), v 7→ −v ∈ Cl(V )
i.e., the automorphism α naturally defines a Z2 -grading of Cl(V ). In other words, the
Clifford algebra Cl(V ) is naturally a super-algebra.
Let (C̃l(V ), +, ∗) denote the opposite algebra of Cl(V ). Then C̃l(V ) coincides with
Cl(V ) as a vector space, but its multiplication ∗ is defined by
x ∗ y := y · x ∀x, y ∈ C̃l(V ),
where “·” denotes the usual multiplication in Cl(V ). Note that for any u, v ∈ V
u · v + v · u = u ∗ v + v ∗ u,
so that, using the universality property of Clifford algebras, we conclude that the natural
injection V ֒→ C̃l(V ) extends to a morphism of algebras Cl(V ) → C̃l(V ). This may as
well be regarded as an antimorphism Cl(V ) → Cl(V ) which we call the transposition map,
x 7→ x♭ . Note that
(u1 · u2 · · · ur )♭ = ur · · · u1 , ∀ui ∈ V.
For x ∈ Cl(V ) we set
x† := (α(x))♭ = α(x♭ ).
The element x† is called the adjoint of x.
For each v ∈ V define c(v) ∈ End (Λ• V ) by
c(v)ω := (ev − iv )ω ∀ω ∈ Λ• V,
where as usual ev denotes the (left) exterior multiplication by v, while iv denotes the
interior derivative along the metric dual of v. Invoking again the Exercise 2.2.55 we
deduce
c(v)2 = −|v|2g ,
so that, by the universality property of the Clifford algebras, the map c extends to a
morphism of algebras c : Cl(V ) → End (Λ• V ).
12.1. THE STRUCTURE OF DIRAC OPERATORS 625
c(x† ) = c(x)∗ ,
where the ∗ in the right-hand-side denotes the adjoint of c(x) viewed as a linear operator
on the linear space Λ• V endowed with the metric induced by the metric on V . ⊔
⊓
Cl(V ) ∋ x 7→ σ(x) ∈ Λ• V,
This shows that the symbol map is bijective since the ordered monomials
form a basis of Cl(V ). The inverse of the symbol map is called the quantization map and
is denoted by q : Λ• V → Cl(V ).
We now see that what we originally called a Clifford structure is precisely a Clifford
module.
In the following two subsections we intend to describe the complex Clifford modules.
The real theory is far more elaborate. For more information we refer the reader to the
excellent monograph [83].
626 CHAPTER 12. DIRAC OPERATORS
Cl(V ) ∼
= EndC (S) as C−algebras.
{e1 , f1 , . . . , ek , fk }
The metric on V defines a Hermitian metric on the complex vector space (V, J)
=C (V, J) ∼
V 0,1 ∼ =C Vc∗ ∼
=C (V 1,0 )∗ .
(Above, Vc∗ denotes the complex dual of the complex space (V, J)). With respect to this
Hermitian metric the collection
1
εj := √ (ej − ifj ); 1 ≤ j ≤ k
2
V ⊗ C = V 1,0 ⊕ V 0,1 .
Hence, we have to specify the action of each of the components V 1,0 and V 0,1 .
The elements w ∈ V 1,0 will act by exterior multiplication
√ √
c(w)ω = 2e(w)ω = 2w ∧ ω, ∀ω ∈ Λ•,0 V.
The elements w ∈ V 0,1 can be identified with complex linear functionals on V 1,0 , and as
such, they will act by interior differentiation
√
c(w)w1 ∧ · · · ∧ wℓ = − 2i(w)(w1 ∧ · · · ∧ wℓ )
ℓ
√ X
= 2 (−1)j gC (wj , w)w1 ∧ · · · ∧ ŵj ∧ · · · ∧ wℓ ,
j=1
We have
1 i
ei = √ (εi + εi ), fj = √ (εj − εj ),
2 2
so that,
c(ei ) = e(εi ) − i(εi ), c(fj ) = i(e(εj ) + i(εj )).
The anti-commutation rules follow as in the Exercise 2.2.55 using the equalities
gC (εi , εj ) = δij .
This shows S2k is naturally a Cl(V )-module. Note that dimC S2k = 2k so that
A little work (left to the reader) shows the morphism c is injective. This completes the
proof of the proposition. ⊔
⊓
Definition 12.1.20. The Cl(V )-module S(V ) is known as the (even) complex spinor
module. ⊔
⊓
628 CHAPTER 12. DIRAC OPERATORS
Using basic algebraic results about the representation theory of the algebra of endo-
morphisms of a vector space we can draw several useful consequences. (See [120] for a
very nice presentation of these facts.)
Corollary 12.1.21. There exists an unique (up to isomorphism) irreducible complex Cl2k -
module and this is the complex spinor module S2k . ⊔
⊓
Corollary 12.1.22. Any complex Cl2k -module has the form S2k ⊗ W , where W is an
arbitrary complex vector space. The action of Cl2k on S2k ⊗ W is defined by
v · (s ⊗ w) = c(v)s ⊗ w.
The vector space W is called the twisting space of the given Clifford module. ⊔
⊓
Remark 12.1.23. Given a complex Cl2k -module E, its twisting space can be recovered
as the space of morphisms of Clifford modules
Assume now that (V, g) is an oriented, 2k-dimensional Euclidean space. For any posi-
tively oriented orthonormal basis e1 , . . . , e2k we can form the element
Γ = ik e1 · · · e2k ∈ Cl(V ).
One can check easily this element is independent of the oriented orthonormal basis, and
thus it is an element intrinsically induced by the orientation. It is called the chirality
operator defined by the orientation. Note that
Let S = S(V ) denote the spinor module of Cl(V ). The chirality operator defines an
involutive endomorphism of S, and thus defines a Z2 -grading on S
and hence a Z2 -grading of End(S). Since {v, Γ} = 0, ∀v ∈ V , we deduce the Clifford mul-
tiplication by v is an odd endomorphism of S. This means that any algebra isomorphism
Cl(V ) → End (S(V )) is an isomorphism of Z2 -graded algebras.
S+/− ∼
= Λeven/odd,0 V. ⊔
⊓
12.1. THE STRUCTURE OF DIRAC OPERATORS 629
The above considerations extend to arbitrary Clifford modules. The chirality operator
introduces a Z2 -grading in any complex Clifford module which we call the chiral grading.
However this does not exhaust the family of Clifford s-modules. The family of s-modules
can be completely described as
ˆ ; W complex s − space ,
S⊗W
where ⊗ˆ denotes the s-tensor product. The modules endowed with the chiral grading form
the subfamily in which the twisting s-space W is purely even.
Example 12.1.25. Let (V, g) be a 2k-dimensional, oriented, Euclidean space. Then
Λ•R V ⊗ C is naturally a Clifford module. Thus, it has the form
Λ•R V ⊗ C ∼
= S ⊗ W.
To find the twisting space, we pick a complex structure on V whose induced orientation
agrees with the given orientation of V . This complex structure produces an isomorphism
Λ•R V ⊗R C ∼
= Λ•,0 V ⊗ Λ0,• V ∼
= S ⊗ (Λ•,0 V )∗ ∼
= S ⊗R S ∗ .
To understand the chiral grading we need to describe the action of the chiral operator on
Λ•R V ⊗ C. This can be done via the Hodge ∗-operator. More precisely we have
Γ · ω = ik+p(p−1) ∗ ω ∀ω ∈ Λp V ⊗ C. (12.1.1)
⊔
⊓
In order to formulate the final result of this subsection we need to extend the automor-
phism α, and the anti-automorphism ♭ to the complexified Clifford algebra Cl(V ). The
automorphism α can be extended by complex linearity
(x ⊗ z)♭ = x♭ ⊗ z.
Proposition 12.1.27. Let S(V ) denote the spinor module of the 2k-dimensional Eu-
clidean space (V, g). Then for every isomorphism of algebras
Each ρ(g) is an unitary operator with respect to this metric. We leave the reader to check
that this is the metric we are after. The uniqueness follows from the irreducibility of S(V )
using Schur’s lemma. ⊔
⊓
Corollary 12.1.28. Let (V, g) as above, and ρ : Cl(V ) → EndC (E) be a complex Clifford
module. Then E admits at least one Hermitian metric with respect to which ρ is self-
adjoint.
Proof. Pick an orthonormal basis {e0 , e1 , . . . , em } of the standard Euclidean space Rm+1 .
These generate the algebra Clm+1 . We view Clm as the Clifford algebra generated by
{e1 , . . . , em }. Now define
Ψ : Clm → Cleven 0 1 0 1
m+1 , Ψ(x + x ) = x + e0 · x ,
Proposition 12.1.30. Let (V, g) be a (2k + 1)-dimensional Euclidean space. Then there
exist two complex, irreducible Cl(V )-modules S+ (V ) and S− (V ) such that
Cl(V ) ∼
= EndC (S+ ) ⊕ EndC (S− ) as ungraded algebras.
or(V ⊕ R) = or ∧ or(R).
Choose an isomorphism
ρ : Cl(V ⊕ R) → EndC (S2k+2 ).
Then S2k+2 becomes naturally a super Cl2k+2 -module
S2k+2 = S+ −
2k+2 ⊕ S2k+2 ,
Cl(V ) ∼
= Cl(V ⊕ R)even ∼
= Endeven (S2k+2 ) ∼
= End (S+ −
2k+2 ) ⊕ End (S2k+2 ). ⊔
⊓
As in the previous section, we can choose Hermitian metrics on S± (V ) such that the
morphism
ρ : Cl(V ) → EndC S(V )
is self-adjoint, i.e.,
ρ(u† ) = ρ(u)∗ , ∀u ∈ Cl(V ).
The above result can be used to describe the complex (super)modules of Cl2k+1 . We will
not present the details since the applications we have in mind do not require these facts.
For more details we refer to [83].
(c) A bundle of Clifford modules, i.e. a vector bundle E → M , together with a morphism
c : C → End (E) whose restrictions to the fibers are morphisms of algebras.
632 CHAPTER 12. DIRAC OPERATORS
(d) A connection on E.
The above collection of data can be constructed from bundles associated to a common
principal bundle. The symmetry group of this principal bundle has to be a Lie group with
several additional features which we now proceed to describe.
Let (V, g) denote the standard fiber of T ∗ M , and denote by AutV the group of auto-
morphisms φ of Cl(V ) such that φ(V ) ⊂ V . The group AutcV is defined similarly, using
the complexified algebra Cl(V ) instead of Cl(V ). For brevity, we discuss only the real
case. We need the following objects.
1
In practice one requires a little more namely that ∇M is precisely the Levi–Civita connection on T ∗ M .
This leads to significant simplifications in many instances.
12.1. THE STRUCTURE OF DIRAC OPERATORS 633
Apparently, the chances that a Lie group G with the above properties exists are very
slim. The very pleasant surprise is that all these, and even more, happen in many geo-
metrically interesting situations.
Example 12.1.33. Let (V, g) be an oriented Euclidean space. Using the universality
property of Clifford algebras we deduce that each g ∈ SO(V ) induces an automorphism
of Cl(V ) preserving V ֒→ Cl(V ). Moreover, it defines an orthogonal representation on the
canonical Clifford module
c : Cl(V ) → End (Λ• V ),
such that
The next two sections discuss two important examples of Lie groups with the above
properties. These are the spin groups Spin(n) and its “complexification” Spinc (n). It
turns out that all the groups one needs to build Dirac operators are these three classes:
SO, Spin, and Spinc .
where Cl(V )⋆ denotes the group of invertible elements. We will instead try to understand
the Clifford group
Γ(V ) = x ∈ Cl⋆ (V ) α(x) · V · x−1 ⊂ V ,
where α : Cl(V ) → Cl(V ) denotes the involutive automorphism of Cl(V ) defining its
Z2 -grading.
634 CHAPTER 12. DIRAC OPERATORS
x = x0 + x1 ,
In terms of the s-commutator, the above two equalities can be written as one
In particular, the even part x0 lies in the center of Cl(V ). We let the reader check the
following elementary fact.
Lemma 12.1.35. The center of the Clifford algebra is the field of scalars R ⊂ Cl(V ). ⊔
⊓
1. N (Γ(V )) ⊂ R∗ .
12.1. THE STRUCTURE OF DIRAC OPERATORS 635
so that,
α((x♭ )−1 ) · v · x♭ ∈ V,
that is, (x♭ )−1 ∈ Γ(V ). Hence x† ∈ Γ(V ), ∀x ∈ Γ(V ). In particular, since α(Γ(V )) ⊂ Γ(V ),
we deduce N (Γ(V )) ⊂ Γ(V ).
For any v ∈ V we have
α(N (x)) · v · (N (x))−1 = α(x♭ x) · v · (x♭ x)−1 = α(x♭ ) · {α(x) · v · x−1 } · (x♭ )−1 .
On the other hand, y = α(x) · v · x−1 is an element in V , which implies y † = α(y ♭ ) = −y.
Hence
α(N (x)) · v · (N (x))−1 = −α(x♭ ) · y † · (x♭ )−1
= −α(x♭ ) · α(x♭ )−1 · v † · x♭ · (x♭ )−1 = −v † = v.
This means N (x) ∈ ker ρ = R∗ .
(b) If x, y ∈ Γ(V ), then
Theorem 12.1.38. (a) For every x ∈ Γ(V ), the transformation ρ(x) of V is orthogonal.
(b) There exists a short exact sequence of groups
ρ
1 → R∗ ֒→ Γ(V ) → O(V ) → 1.
Proof. (a) Note that, ∀v ∈ V , we have N (v) = −|v|2g . For every x ∈ Γ(V ) we get
N (x)2 = N (α(x))2 .
Hence, N (ρ(x)(v)) = ±N (v). Since both N (v) and N (ρx (v)) are negative numbers, we
deduce that the only possible choice of signs in the above equality is +. This shows that
ρ(x) is an orthogonal transformation.
636 CHAPTER 12. DIRAC OPERATORS
(b) & (c) We only need to show ρ(Γ(V )) = O(V ). For x ∈ V with |x|g = 1 we have
α(x) = −x = x−1 .
ρ(x)v = −λx + u.
In other words, ρ(x) is the orthogonal reflection in the hyperplane through origin which
is perpendicular to x. Since any orthogonal transformation of V is a composition of such
reflections (Exercise), we deduce that for each T ∈ O(V ) we can find v1 , . . . , vk ∈ V such
that
T = ρ(v1 ) · · · ρ(vk ).
Incidentally, this also establishes (c). ⊔
⊓
Exercise 12.1.39. Suppose V is a finite dimensional Euclidean space. Prove that any
orthogonal operator T ∈ O(V ) is a product of at most dim V reflections. ⊔
⊓
Set
Γ0 (V ) := Γ(V ) ∩ Cleven .
Note that
ρ(Γ0 (V )) ⊂ SO(V ) = T ∈ O(V ); det T = 1 .
Hence, we have a short exact sequence
ρ
1 → R∗ ֒→ Γ0 (V ) → SO(V ) → 1.
and
Spin(V ) := x ∈ Γ(V ); N (x) = 1 = Pin(V ) ∩ Γ0 (V ). ⊔
⊓
The results we proved so far show that Spin(V ) can be alternatively described by the
following “friendlier” equality
In particular, this shows that Spin(V ) is a compact topological group. Observe that
Spin(V ) is a closed subgroup of the Lie group GL( Cl(V ) ). This implies (see Remark
1.2.31 and [61, 128]) that Spin(V ) is in fact a Lie group, and the map Spin(V ) ֒→ Cl(V )
is a smooth embedding. The proof of the following result is left to the reader.
Proposition 12.1.41. There exist short exact sequences
1 → Z2 → Pin(V ) → O(V ) → 1
1 → Z2 → Spin(V ) → SO(V ) → 1. ⊔
⊓
12.1. THE STRUCTURE OF DIRAC OPERATORS 637
Proof. The fact that ρ : Spin(V ) → SO(V ) is a covering map is an elementary consequence
of the following simple observations.
Since SO(V ) is connected if dim V ≥ 2, the fact that Spin(V ) is connected would
follow if we showed that any points in the same fiber of ρ can be connected by arcs. It
suffices to look at the fiber ρ−1 (1) = {−1, 1}.
If u, v ∈ V are such that |u| = |v| = 1, u ⊥ v, then the path
u
Spin(V )
ρ
w SO(V
u ) u
Spin(V ) y w Cl(Vu )
i
y y , y y
Spin(U ) ρ
w SO(U ) Spin(U ) w Cl(U )
Hence, it suffices to show that if dim V > 3, then for every smooth, closed path
In particular, the vectors v(t) and x are linearly independent, for any t. Denote by f (t)
the vector obtained by applying the Gramm-Schmidt orthonormalization process to the
ordered linearly independent set {x, v(t)}. In other words,
1
f (t) := f0 (t), f0 (t) := v(t) − (v(t) • x)x,
|f0 (t)|
such that,
θ(0) = θ(1) = 0, v(t) = x cos 2θ(t) + f (t) sin 2θ(t).
Set
σ(t) := cos θ + xf sin θ ∈ Cl(V ).
Note that θ θ θ θ
σ(t) = x cos − f sin −x cos − f sin ,
2 2 2 2
so that σ(t) ∈ Spin(V ).
On the other hand,
σ(t)−1 (t) = cos θ − xf sin θ,
and
In other words,
ρσ(t) x = v(t).
For every s ∈ [0, 1] define
σs (t) := ûs (t) = cos sθ + xf sin sθ, ûs (t) := σs (t)−1 û(t)σs (t),
and
us (t) := ρ−1
σs (t) u(t)ρσs (t) ∈ SO(V ).
Observe that for s = 0 we have û0 (t) = û(t). On the other hand, for s = 1 we have
u1 (t)x = ρ−1
σs (t) u(t)ρσs (t) x = ρσs (t)−1 v(t) = x.
We can view Spin(V ) as a submanifold of Cl(V ), and as such we can identify its Lie
algebra spin(V ) with a linear subspace of Cl(V ). The next result offers a more precise
description.
12.1. THE STRUCTURE OF DIRAC OPERATORS 639
spin(V ) = q(Λ2 V ).
Proof. The group Γ(V ) is a Lie group as a closed subgroup of the group of linear trans-
formations of Cl(V ). Since the elements of Γ(V ) are either purely even, or purely odd, we
deduce that the tangent space at 1 ∈ Γ(V ) is a subspace of
E= x ∈ Cleven (V ); xv − vx ∈ V, ∀v ∈ V .
x = x0 + e1 x1 ,
where x0 ∈ Cl(V )even , and x1 ∈ Cl(V )odd are linear combinations of monomials involving
only the vectors e2 , . . . , en . Since [x0 , e1 ] = 0, and {x1 , e1 } = 0, we deduce
1 1
e1 x1 = [e1 , x1 ] = [e1 , x] ∈ V.
2 2
In particular this means x1 ∈ R ⊕ V ∈ Cl(V ).
Repeating the same argument with every vector ei we deduce that
Thus,
T1 Γ0 (V ) ⊂ R ⊕ q(Λ2 V ).
The tangent space to Spin(V ) satisfies a further restriction obtained by differentiating the
condition N (x) = 1. This gives
Since dim spin(V ) = dim so(V ) = dim Λ2 V we conclude that the above inclusion is in fact
an equality of vector spaces.
Now consider two smooth paths x, y : (−ε, ε) → Spin(V ) such that x(0) = y(0) = 1.
The Lie bracket of ẋ(0) and ẏ(0) is then found (using the Exercise 3.1.21) from the equality
where the above bracket is the commutator of ẋ(0) and ẏ(0) viewed as elements in the
associative algebra Cl(V ). ⊔
⊓
ρ∗ : spin(V ) → so(V ),
640 CHAPTER 12. DIRAC OPERATORS
then X
ρ∗ (x)ej = −2 xij ei , (xij = −xji ).
i
with respect to an oriented orthonormal basis {e1 , . . . , en }, (n = dim V ), then the 2-form
associated to A has the form X
ωA = − aij ei ∧ ej ,
i<j
so that
1X 1X
ρ−1
∗ (A) = − aij ei ej = − aij ei ej . (12.1.3)
2 4
i<j i,j
The above negative sign is essential, and in many concrete problems it makes a world of
difference. ⊔
⊓
The (complex) representation theory described in the previous sections can be used to
determine the representations of Spin(V ).
12.1. THE STRUCTURE OF DIRAC OPERATORS 641
S(V ) = S+ (V ) ⊕ S− (V ).
and
S2k+1 ∼
= S+ (R2k+1 ) ∼
=Spin(2k+1) S− (R2k+1 ) if n = 2k + 1.
The module Sn will be called the fundamental complex spinor module of Spin(n). ⊔
⊓
Note that q ∈ Spin(V ) so that ∆(q) preserves the parities when acting on Λ∗,0 (V ). Hence
∆(q) = ∆+ (q) ⊕ ∆− (q) where ∆+/− (q) acts on Λeven/odd,0 V . Compute tr (∆+/− (q)) and
then conclude that Λeven,/odd,0 V are non-isomorphic Spin(V )-modules. ⊔
⊓
Proposition 12.1.46. Let φ : Cl(V ) → End (E) be a self-adjoint Clifford module. Then
the induced representation of Spin(V ) is orthogonal (unitary).
Exercise 12.1.48. Prove that the group Spin(V ) satisfies all the conditions discussed in
Subsection 10.1.5. ⊔
⊓
642 CHAPTER 12. DIRAC OPERATORS
α : Cln → Cln
(v ⊗ z)♭ = v ⊗ z.
1 → S 1 → Pinc (V ) → O(V ) → 1. ⊔
⊓
Pinc (V ) ∼
= (Pin(V ) × S 1 )/ ∼,
where “∼” is the equivalence relation
(Pin(V ) × S 1 )/ ∼→ Cl(V ).
ρc : Pinc (V ) → O(V ).
Spinc (V ) ∼
= (Spin(V ) × S 1 )/ ∼∼
= (Spin(V ) × S 1 )/Z2 .
12.1. THE STRUCTURE OF DIRAC OPERATORS 643
Exercise 12.1.51. Prove Spinc (V ) satisfies all the conditions outlined in Subsection
10.1.5. ⊔
⊓
called the complex spinorial representation of Spinc . It is not irreducible since, once we fix
an orientation on V , the space End (S(V )) has a natural superstructure and, by definition,
Spinc acts through even automorphism. As in the real case, S(V ) splits into a direct sum
of irreducible representations S± (V ).
Fix a complex structure J on V . This complex structure determines two things.
ξ
J
Proposition 12.1.52. There exists a natural group morphism U (V, J) −→ Spinc (V ) such
that the diagram below is commutative.
''ı w Spin (V )
ξJ c
U (V, J)
')' ρ J
u
c
SO(V )
Define ξ(ω) to be the image of (γ̃(1), δ(1)) in Spinc (V ). We have to check two things.
ρ
The elements γ̃(1) and η̃(1) lie in the same fiber of the covering Spin(V ) → SO(V ) so
that they differ by an element in ker ρ. Hence
We can identify ǫ as the holonomy of the covering Spin(V ) → SO(V ) along the loop γ ∗ η −
which goes from 1 to ω along γ, and then back to 1 along η − (t) = η(1 − t).
The map det : U (V ) → S 1 induces an isomorphism between the fundamental groups
(see Exercise 6.2.35 of Subsection 6.2.5). Hence, in describing the holonomy ǫ it suffices
to replace the loop γ ∗ η − ⊂ U (V ) by any loop ν(t) such that
Such a loop will be homotopic to γ ∗ η −1 in U (V ), and thus in SO(V ) as well. Select ν(t)
of the form
ν(t)e1 = ∆(t)e1 , ν(t)ei = ei ∀i ≥ 2,
where (ei ) is a complex, orthonormal basis of (V, J). Set fi = Jei . With respect to the
real basis (e1 , f1 , e2 , f2 , . . . ) the operator ν(t) (viewed as an element of SO(V )) has the
matrix description
cos θ(t) − sin θ(t) · · ·
sin θ(t) cos θ(t) · · ·
,
.. ..
. . 1
where θ : [0, 1] → R is a continuous map such that ∆(t) = eiθ(t) .
The lift of ν(t) to Spin(V ) has the form
θ(t) θ(t)
ν̃(t) = cos − e1 f1 sin .
2 2
We see that the holonomy defined by ν̃(t) is nontrivial if and only if the holonomy of the
z2
loop t 7→ δ(t) in the double cover S 1 → S 1 is nontrivial. This means that δ(1) and λ(1)
differ by the same element of Z2 as γ̃(1) and η̃(1). This proves ξ is well defined. We leave
the reader to check that ξ is indeed a smooth morphism of groups. ⊔
⊓
Example 12.1.54. (The case n = 2). The Clifford algebra Cl2 is isomorphic with the
algebra of quaternions H. This can be seen by choosing an orthonormal basis {e1 , e2 } in
R2 . The isomorphism is given by
1 7→ 1, e1 7→ i, e2 7→ j, e1 e2 7→ k,
12.1. THE STRUCTURE OF DIRAC OPERATORS 645
Spin(2) = {a + bk ; a, b ∈ R, a2 + b2 = 1} ∼
= S 1.
Example 12.1.55. (The case n = 3). The Clifford algebra Cl3 is isomorphic, as an
ungraded algebra, to the direct sum H ⊕ H. More relevant is the isomorphism Cleven ∼
=
3
Cl2 ∼
= H given by
1 7→ 1, e1 e2 7→ i, e2 e3 7→ j, e3 e1 7→ k,
where {e1 , e2 , e3 } is an orthonormal basis in R3 . Under this identification the operation
x 7→ x♭ coincides with the conjugation in H
x = a + bi + cj + dk 7→ x = a − bi − cj − dk.
N (a + bi + cj + dk) = a2 + b2 + c2 + d2 .
1
x−1 = x♭ .
N (x)
Γ0 (R3 ) ∼
= H \ {0}.
Hence
Spin(3) ∼
= { x ∈ H; |x| = 1 } ∼
= SU (2).
The natural map Spin(3) → SO(3) is precisely the map described in the Exercise 6.2.8 of
Subsection 6.2.1.
The isomorphism Spin(3) ∼
= SU (2) can be visualized by writing each q = a+bi+cj +dk
as
q = u + jv, u = a + bi, v = (c − di) ∈ C.
To a quaternion q = u + jv one associates the 2 × 2 complex matrix
u −v̄
Sq = ∈ SU (2).
v ū
A simple computation shows that T (Ri q) = iT q, i.e., T is a complex linear map. Moreover,
∀q ∈ Spin(3) ∼
= S 3 , the matrix Sq is in SU (2), and the diagram below is commutative.
H
T
w C2
L q
u T S u2
q
H wC
In other words, the representation
Rj : S3 → S3 .
β : S3 × S3 → C.
This plays an important part in the formulation of the recently introduced Seiberg-Witten
equations (see [135]). ⊔
⊓
(i) complex with respect to the above introduced complex structure on H, and
(ii) it is isomorphic with the complex spinorial representation described by the left mul-
tiplication.
⊔
⊓
12.1. THE STRUCTURE OF DIRAC OPERATORS 647
Example 12.1.57. (The case n = 4). The Clifford algebra Cl4 can be realized as the
algebra of 2 × 2 matrices with entries in H. To describe this isomorphism we have to start
from a natural embedding R4 ֒→ M2 (H) given by the correspondence
∼ 4 0 −x
H = R ∋ x 7→
x 0
A simple computation shows that the conditions in the universality property of a Clifford
algebra are satisfied, and this correspondence extends to a bona-fide morphism of algebras
Cl4 → M2 (H). We let the reader check this morphism is also injective. A dimension count
concludes it must also be surjective. ⊔
⊓
Clearly |x| = |q1 | · |x| · |q2 | = |Tq1 ,q2 x| ∀x ∈ H, so that each Tq1 ,q2 is an orthogonal
transformation of H. Since SU (2) × SU (2) is connected, all the operators Tq1 ,q2 belong to
the component of O(4) containing 1, i.e,. T defines an (obviously smooth) group morphism
Note that ker T = {1, −1}, so that T is 2 : 1. In order to prove T is a double cover it
suffices to show it is onto. This follows easily by noticing T is a submersion (verify this! ),
so that its range must contain an entire neighborhood of 1 ∈ SO(4). Since the range
of T is closed (verify this!) we conclude that T must be onto because the closure of the
subgroup (algebraically) generated by an open set in a connected Lie group coincides with
the group itself (see Subsection 1.2.3). ⊔
⊓
so(4) ∼
= spin(4) ∼
= su(2) ⊕ su(2) ∼
= so(3) ⊕ so(3).
Exercise 12.1.59. Using the identification Cl4 ∼ = M2 (H) show that Spin(4) corresponds
to the subgroup
diag (p, q); p, q ∈ H, |p| = |q| = 1 ⊂ M2 (H). ⊔
⊓
Exercise 12.1.60. Let {e1 , e2 , e3 , e4 } be an oriented orthonormal basis of R4 . Let ∗ denote
the Hodge operator defined by the canonical metric and the above chosen orientation. Note
that
∗ : Λ2 R4 → Λ2 R4
is involutive, ∗2 = 1, so that we can split Λ2 into the ±1 eigenspaces of ∗
Λ2 R4 = Λ2+ R4 ⊕ Λ2− R4 .
648 CHAPTER 12. DIRAC OPERATORS
S+
4 : SU (2) × SU (2) ∋ (p, q) 7→ p ∈ GL(2; C),
S−
4 : SU (2) × SU (2) ∋ (p, q) 7→ q ∈ GL(2; C).
Exactly as in the case of Spin(3), these representations can be given quaternionic descrip-
tions.
Exercise 12.1.61. The space R4 ∼ = H has a canonical complex structure defined by Ri
which defines (following the prescriptions in §10.1.3) an isomorphism
c : Cl4 → End (Λ• C2 ).
Identify C2 in the obvious way with Λ1 C2 Λodd C2 , and with Λeven C2 via the map
e1 7→ 1 ∈ Λ0 C2 , e2 7→ e1 ∧ e2 .
Show that under these identifications we have
c(x) = Tx ∀x ∈ R4 ∼
= (H, Ri ) ∼
= C2 ,
where Tx is the odd endomorphism of C2 ⊕ C2 defined in (12.1.4). ⊔
⊓
12.1. THE STRUCTURE OF DIRAC OPERATORS 649
q : Λ• V → Cl(V )
the quantization map, and fix an isomorphism ∆ : Cl(V ) → End (S(V )) of Z2 -graded alge-
bras. Show that for any η ∈ Λ2+ (V ), the image ∆ ◦ q(η) ∈ End (S(V )) is an endomorphism
of the form T ⊕ 0 ∈ End (S+ (V )) ⊕ End (S− (V )). ⊔
⊓
∇X (c(α)u) = c(∇M
X α)u + c(α)(∇X u),
The next result addresses the fundamental consistency question: do there exist Dirac
bundles?
650 CHAPTER 12. DIRAC OPERATORS
Proposition 12.1.65. Let E → M be a Clifford bundle over the oriented Riemann man-
ifold (M, g). Then there exist Dirac structures on E.
Proof. Denote by D the (possible empty) family of Dirac structure on E. Note that if
(hi , ∇i ) ∈ D, i = 1, 2, and f ∈ C ∞ (M ), then
(f h1 + (1 − f )h2 , f ∇1 + (1 − f )∇2 ) ∈ D.
This elementary fact shows that the existence of Dirac structures is essentially a local
issue: local Dirac structures can be patched-up via partitions of unity.
Thus, it suffices to consider only the case when M is an open subset of Rn , and E is
a trivial vector bundle. On the other hand, we cannot assume that the metric g is also
trivial (Euclidean) since the local obstructions given by the Riemann curvature cannot be
removed. We will distinguish two cases.
A. n = dim M is even. The proof will be completed in three steps.
Step 1. A special example. Consider the (complex) spinor module (representation)
1 X ij i j
ω̃ := ρ−1
∗ (ω) = − ω e · e ∈ Ω1 (M ) ⊗ spin(n).
2
i<j
1X
∇S u = du − ωij ⊗ c(ei ) · c(ej )u, ∀u ∈ C ∞ (SM ).
2
i<j
connection ∇E⊗W . These two data define a Dirac structure on E ⊗ W (Exercise 12.1.68
at the end of this subsection).
The representation theory of the Clifford algebra Cln with n even shows that any
Clifford bundle over M must be a twisting SM ⊗ W of the spinor bundle SM . This
completes the proof of the proposition when dim M is even.
Step 3 Conclusion. The odd case is dealt with similarly using the different representation
theory of the Cl2k+1 . Now instead of one generating model S there are two, but the proof
is conceptually identical. The straightforward details are left to the reader. ⊔
⊓
Denote by (E, h, ∇) a Dirac bundle over the oriented Riemann manifold (M, g). There
exists a Dirac operator on E canonically associated to this structure given by
∇ c
D = c ◦ ∇ : C ∞ (E) → C ∞ (T ∗ M ⊗ E) → C ∞ (E).
Proof. The assertion in the above proposition is local so we can work with local orthonor-
mal moving frames. Fix x0 ∈ M , and denote by (xi ) a collection of normal coordinates
near x0 . Set ei = ∂xi . Denote by (ei ) the dual coframe of (ei ). If (h, ∇) is a Dirac structure
on the Clifford bundle E, then at x0 the associated Dirac operator can be described as
X
D= c(ei )∇i (∇i = ∇ei ).
i
We deduce
D ∗ = (∇i )∗ c(ei )∗ .
Since the connection ∇ is compatible with h and div g (ei ) |x0 = 0, we deduce
(∇i )∗ = −∇i .
On the other hand, the Clifford multiplication is covariant constant, and (∇M i
i e ) |x0 = 0
so we conclude
[∇i , c(ei )] |x0 = 0 ∀i. ⊔
⊓
Let D be a geometric operator associated to the Dirac bundle (E, h, ∇). By definition,
D 2 is a generalized Laplacian, and according to Proposition 10.1.34 we must have an
equality of the form
D2 = ∇ ˜ ∗∇
˜ + R,
˜ is a connection on E, and R is the Weitzenböck remainder which is an endo-
where ∇
˜ = ∇, and this remainder can be given
morphism of E. For geometric Dirac operators, ∇
652 CHAPTER 12. DIRAC OPERATORS
On the other hand, the Clifford multiplication c : Cl(T ∗ M ) → End (E) defines a linear
map
Cl(T ∗ M ) ⊗ End (E) → End(E), ω ⊗ T 7→ c(ω) ◦ T.
This map associates to the element q(F ) an endomorphism of E which we denote by c(F ).
If (ei ) is a local, oriented, orthonormal moving frame for T ∗ M , then we can write
X
F (∇) = ei ∧ ej ⊗ Fij ,
i<j
and X 1X
c(F ) = c(ei )c(ej )Fij = c(ei )c(ej )Fij .
2
i<j i,j
D 2 = ∇∗ ∇ + c(F (∇)).
Proof. Fix x ∈ M , and then choose an oriented, local orthonormal moving frame (ei ) of
T M near x such that
[ei , ej ] |x = (∇M ei ) |x = 0,
where ∇M denotes the Levi–Civita connection. Such a choice is always possible because
the torsion of the Levi–Civita connection is zero. Finally, denote by (ei ) the dual coframe
of (ei ). Then
X X
D 2 |x = c(ei )∇i c(ej )∇j .
i j
We want to emphasize again the above equalities hold only at x. The theorem now follows
by observing that !
X
∗ 2
(∇ ∇) |x = − ∇i |x . ⊔
⊓
i
12.2. FUNDAMENTAL EXAMPLES 653
Exercise 12.1.68. Let (E, h, ∇) be a Dirac bundle over the oriented Riemann manifold
(M, g) with associated Dirac operator D. Consider a Hermitian bundle W → M and a
connection ∇W compatible with the Hermitian metric hW .
(a) Show that (E ⊗ W, h ⊗ hW , ∇ˆ = ∇ ⊗ 1W + 1E ⊗ ∇W ) defines a Dirac structure on
E ⊗ W in which the Clifford multiplication by α ∈ Ω1 (M ) is defined by
Show that
2
DW ˆ ∗∇
=∇ ˆ + c(F (∇)) + cW (F (∇W )). ⊔
⊓
Λ•C T ∗ M := Λ• T ∗ M ⊗ C.
d + d∗ : Ω• (M ) → Ω• (M )
is a Dirac operator. In fact, we will prove this operator is a geometric Dirac operator.
Continue to denote by g the Hermitian metric induced by the metric g on the com-
plexification Λ•C T ∗ M . We will denote by ∇g all the Levi–Civita connection on the tensor
bundles of M . When we want to be more specific about which Levi–Civita connection we
are using at a given moment we will indicate the bundle it acts on as a superscript. E.g.,
∗
∇T M is the Levi–Civita connection on T ∗ M .
Proposition 12.2.1. The pair (g, ∇g ) defines a Dirac structure on the Clifford bundle
ΛC T ∗ M and d + d∗ is the associated Dirac operator.
654 CHAPTER 12. DIRAC OPERATORS
Proof. In Subsection 4.1.5 we have proved that d can be alternatively described as the
composition
∇ ε
C ∞ (Λ• T ∗ M ) → C ∞ (T ∗ M ⊗ Λ• T ∗ M ) → C ∞ (Λ• T ∗ M ),
where ε denotes the exterior multiplication map. Thus
d + d∗ = ε ◦ ∇ + ∇∗ ◦ ε∗ .
If X1 , · · · , Xn is a local orthonormal frame of T M , and θ 1 , · · · , θ n is its dual coframe, then
for any ordered multi-index I we have
X
ε∗ (θ I ) = iXj θ I .
j
Fortunately, we have the freedom to choose the frame (Xk ) in any manner we find conve-
nient.
Fix an arbitrary point x0 ∈ M and choose (Xk ) such that, at x0 , we have Xk =
∂xk , where (xk ) denotes a collection of normal coordinates near x0 . With such a choice
div g (Xk ) = 0 at x0 , and thus
X
d∗ ω |x0 = − i∂k ∇∂k ω.
k
We want to spend some time elucidating the structure of the Weitzenböck remainder.
First of all, we need a better description of the curvature of ∇g viewed as a connection on
Λ• T ∗ M .
Denote by R the Riemann curvature tensor, i.e., the curvature of the Levi–Civita
connection
∇g : C ∞ (T M ) → C ∞ (T ∗ M ⊗ T M ).
Thus R is a bundle morphism
R : C ∞ (T M ) → C ∞ (Λ2 T ∗ M ⊗ T M ).
We have a dual morphism
R̃ : C ∞ (T ∗ M ) → C ∞ (Λ2 T ∗ M ⊗ T ∗ M ),
uniquely determined by the equality
(R̃(Y, Z)α)(X) = −α(R(Y, Z)X) ∀α ∈ Ω1 (M ) X, Y, Z ∈ Vect (M ).
12.2. FUNDAMENTAL EXAMPLES 655
∗M
Lemma 12.2.2. R̃ is the curvature of the Levi–Civita connection ∇T .
Since R0 ≡ 0, the first interesting case is R1 . To understand its form, pick normal
coordinates (xi ) near x0 ∈ M . Set ei = ∂xi |x0 ∈ Tx0 M and ei = dxi |x0 ∈ Tx∗0 M .
656 CHAPTER 12. DIRAC OPERATORS
i e =R
where R(ek , eℓ )ej = Rjkℓ i ijkℓ ei . Using Lemma 12.2.2 we get
and
c(ek )c(eℓ )ei = ek ∧ eℓ ∧ ei − δiℓ ek − δkℓ ei + δik eℓ .
Hence
X 1 X
R1 ( αj ej ) = αj Rijkℓ (ek ∧ eℓ ∧ ei − δiℓ ek − δkℓ ei + δik eℓ ).
2
j i,j,k,ℓ
Hence
X 1 X
R1 ( αj ej ) = αj Rijkℓ (δik eℓ − δiℓ ek )
2
j i,j,k,ℓ
1X 1X X X
= αj Rijiℓ eℓ − αj Rijki ek = αj Rijik ek = αj Ricjk ek ,
2 2
i,j,ℓ i,j,k i,j,k jk
P
where Ricjk = i Rijik denotes the Ricci tensor at x0 . Hence
R1 = Ric . (12.2.1)
In the above equality, the Ricci tensor Ric is regarded (via the metric duality) as a
self-adjoint endomorphism of T ∗ M .
The identity (12.2.1) has a beautiful consequence.
12.2. FUNDAMENTAL EXAMPLES 657
Theorem 12.2.5 (Bochner). Let (M, g) be a compact, connected, oriented Riemann man-
ifold.
(a) If the Ricci tensor is non-negative definite, then b1 (M ) ≤ dim M .
(b) If Ricci tensor is non-negative definite, but it is somewhere strictly positive definite
then b1 (M ) = 0.
(Recall that b1 (M ) denotes the first Betti number of M ).
The above result is truly remarkable. The condition on the Ricci tensor is purely
local, but it has global consequences. We have proved a similar result using geodesics (see
Myers Theorem, Subsection 5.2.2 , 6.2.4 and 6.2.5). Under more restrictive assumptions
on the Ricci tensor (uniformly positive definite) one deduces a stronger conclusion namely
that the fundamental group is finite. If the uniformity assumption is dropped then the
conclusion of the Myers theorem no longer holds (think of the flat torus).
This result gives yet another explanation for the equality H 1 (G) = 0, where G is
a compact semisimple Lie group. Recall that in this case the Ricci curvature is 41 ×
{the Killing metric}.
∆1 = dd∗ + d∗ d : Ω1 (M ) → Ω1 (M ).
Hodge theory asserts that b1 (M ) = dim ker ∆1 so that, to find the first Betti number, we
need to estimate the “number” of solutions of the elliptic equation
∆1 η = 0, η ∈ Ω1 .
Using the Bochner-Weitzenböck theorem and the equality (12.2.1) we deduce that if η ∈
ker ∆1 , then
∇∗ ∇η + Ric η = 0, on M.
Taking the L2 -inner product by η, and then integrating by parts, we get
Z Z
2
|∇η| dvg + (Ric η, η)dvg = 0. (12.2.2)
M M
Since Ric is non-negative definite we deduce ∇η = 0, so that any harmonic 1-form must be
covariant constant. In particular, since M is connected, the number of linearly independent
harmonic 1-forms is no greater than the rank of T ∗ M which is dim M .
(b) Using the equality (12.2.2) we deduce that any harmonic 1-form η must satisfy
(Ric(x)ηx , ηx )x = 0 ∀x ∈ M.
If the Ricci tensor is positive at some x0 ∈ M , then η(x0 ) = 0. Since η is also covariant
constant, and M is connected, we conclude that η ≡ 0. ⊔
⊓
Remark 12.2.6. For a very nice survey of some beautiful applications of this technique
we refer to [13]. ⊔
⊓
658 CHAPTER 12. DIRAC OPERATORS
Note that any almost complex manifold is necessarily even dimensional and orientable,
so from the start we know that not any manifold admits almost complex structures. In
fact, the existence of such a structure is determined by topological invariants finer than
the dimension and orientability.
Example 12.2.8. (a) A complex manifold M is almost complex. Indeed, the manifold
M is locally modelled by Cn , and the transition maps are holomorphic maps Cn → Cn .
The multiplication by i defines a real endomorphism on R2n ∼ = Cn , and the differential
of a holomorphic map Cn → Cn commutes with the endomorphism of T Cn given by
multiplication by i in the fibers. Hence, this endomorphism induces the almost complex
structure on T M .
(b) For any manifold M , the total space of its tangent bundle T M can be equipped in
many different ways with an almost complex structure.
To see this, denote by E the tangent bundle of T M , E = T (T M ). The bundle E has
a natural subbundle V, the vertical subbundle, which is the kernel of the differential of the
canonical projection π : T M → M .
If x ∈ M , v ∈ Tx M , then a tangent vector in T(v,x) T M is vertical if and only if it
is tangent to a smooth path which entirely the fiber Tx M ⊂ T M . Note that we have a
canonical identification
V(v,x) ∼
= Tx M.
Fix a Riemann metric h on the manifold T M , and denote by H ⊂ E, the subbundle of E
which is the orthogonal complement of V in E. For every (v, x) ∈ T M , the differential of
π induces an isomorphism
π∗ : H(v,x) → Tx M ∼
= Vv,x .
Thus, π∗ induces a bundle isomorphism A : H → V. Using the direct sum decomposition
E = H ⊕ V we define the endomorphism J of E in the block form
0 −A−1
J= . ⊔
⊓
A 0
Let (M, J) be an almost complex manifold. Using the results of Subsection 2.2.5 we
deduce that the complexified tangent bundle T M ⊗ C splits as
T M ⊗ C = (T M )1,0 ⊗ (T M )0,1 .
T ∗ M ⊗ C = (T ∗ M )1,0 ⊕ (T ∗ M )0,1 .
where we use capital Latin letters A, B, C . . . to denote ordered multi-indices, and for each
such index A we set
dz A =: dz A1 ∧ · · · ∧ dz Ap .
The differential form dz̄ B is defined similarly. ⊔
⊓
(The converse is also true, and it is known as the Newlander-Nirenberg theorem. Its proof
is far from trivial. For details we refer to the original paper [103]). ⊔
⊓
660 CHAPTER 12. DIRAC OPERATORS
Thus, the Newlander-Nirenberg theorem mentioned above states that the condition
(12.2.3) is necessary and sufficient for an almost complex structure to be integrable.
vanishes identically. ⊔
⊓
In the sequel we will exclusively consider only complex manifolds. Let M be such a
manifold. The above exercise shows that the exterior derivative
d : ΩkC (M ) → Ωk+1
C (M )
The component Ωp,q → Ωp+1,q is denoted by ∂ = ∂ p,q , while the component Ωp,q → Ωp,q+1
is denoted by ∂¯ = ∂¯p,q .
∂¯ ∂¯
0 → Ωp,0 (M ) −→ Ωp,1 (M ) −→ · · ·
is a cochain complex known as the p-th Dolbeault complex of the complex manifold M .
Its cohomology groups are denoted by H∂p,q
¯ (M ).
Proof. The symbol of ∂¯p,q is very similar to the symbol of the exterior derivative. For any
x ∈ M and any ξ ∈ T ∗ X
is (up to a multiplicative constant) the (left) exterior multiplication by ξ 0,1 , where ξ 0,1
denotes the (0, 1) component of ξ viewed as an element of the complexified tangent space.
More precisely,
1
ξ 0,1 = (ξ + iJ0 ξ),
2
where J0 : Tx M → Tx M denotes the canonical complex structure induced on Tx∗ M by the
∗ ∗
Since ξ 0,1 6= 0, for any ξ 6= 0, the Koszul complex is exact (see Subsection 7.1.3) . This
proves that the Dolbeault complex is elliptic. ⊔
⊓
To study the Dirac nature of this complex we need to introduce a Hermitian metric
h on T M . Its real part is a Riemann metric g on M , and the canonical almost complex
structure on T M is a skew-symmetric endomorphism with respect to this real metric. The
associated 2-form Ωh = −Im h is nondegenerate in the sense that Ωn (n = dimC M ) is
a volume form on M . According to the results of Subsection 2.2.5 the orientation of M
defined by Ωn coincides with the orientation induced by the complex structure.
We form the Hodge–Dolbeault operator
¯ 1 1
σ(∂)(ξ) = (−1)p e(ξ + iη) = (e(ξ) + ie(η)),
2 2
662 CHAPTER 12. DIRAC OPERATORS
¯
where as usual e(·) denotes the (left) exterior multiplication. The adjoint of σ(∂)(ξ) is
¯ 1
(−1)p σ(∂)(ξ)∗
= ξ∗ −iη ∗ ),
2
where ξ ∗ (respectively η ∗ ) denotes the metric dual of ξ (respectively η), and ξ ∗ denotes
the contraction by ξ ∗ . We set
and we deduce
2 1
¯
σ(∂)(ξ) ¯
− σ(∂)(ξ)∗
= { e(ξ) + ie(η) − ξ ∗ +iη ∗ }2
4
1 2 1 2
= e(ξ) − ξ ∗ + i e(η) + η ∗ = c(ξ) + ic̃(η)
4 4
1
= c(ξ)2 − c̃(η)2 + i c(ξ)c̃(η) + c(η)c(ξ) .
4
Note that
c(ξ)2 = − e(ξ)ξ ∗ +ξ ∗ e(ξ) = −|ξ|2 ,
and
c̃(η)2 = e(η)η ∗ +η ∗ e(η) = |η|2 .
On the other hand, since ξ ⊥ η, we deduce as above that
c(ξ)c̃(η) + c̃(η)c(ξ) = 0.
(−1)p
c(ξ) = √ e(ξ) + ie(η) − ξ ∗ −iη ∗ ,
2
where η = Jξ, we deduce the Clifford multiplication is covariant constant if ∇g J = 0.
Definition 12.2.17. Let M be a complex manifold, and h a Hermitian metric on T M
(viewed as a complex bundle). Then h is said to be a Kähler metric if ∇J = 0, where ∇
denotes the Levi–Civita connection associated to the Riemann metric Re h, and J is the
canonical almost complex structure on T M . A pair (complex manifold, Kähler metric) is
called a Kähler manifold. ⊔
⊓
12.2. FUNDAMENTAL EXAMPLES 663
Exercise 12.2.18. Let (M, J) be an almost complex manifold and h a Hermitian metric
on T M . Let Ω = −Im h. Using the exercise 12.2.13 show that if dΩh = 0, then the
almost complex is integrable, and the metric h is Kähler.
Conversely, assuming that J is integrable, and h is Kähler show that dΩ = 0. ⊔
⊓
We see that on a Kähler manifold the above Clifford multiplication is covariant con-
stant. In fact, a more precise statement is true.
∗ : T M → T M,
Example 12.2.22. Perhaps the favorite example of Kähler manifold is the complex
projective space Cn . To describe this structure consider the tautological line bundle
L1 → CPn . It can be naturally viewed as a subbundle of the trivial bundle Cn+1 → CPn .
Denote by h0 the canonical Hermitian metric on Cn+1 , and by ∇0 the trivial connec-
tion. If we denote by P : Cn+1 → L1 the orthogonal projection, then ∇ = P ◦ ∇0 |L1
defines a connection on L1 compatible with h1 = h |L1 . Denote by ω the first Chern form
associated to this connection
i
ω= F (∇),
2π
and set
hF S (X, Y )x := −ωx (X, JY ) + iω(X, Y ) ∀x ∈ M, X, Y ∈ Tx M.
Exercise 12.2.23. Describe hF S in projective coordinates, and then prove that h is indeed
a Hermitian metric, i.e. it is positive definite). ⊔
⊓
664 CHAPTER 12. DIRAC OPERATORS
In other words, ǫ• defines a Čech 2-cocycle, and thus defines an element in the Čech
cohomology group H 2 (M, Z2 ).
It is not difficult to see that this element is independent of the various choices: the
cover U, the gluing data gαβ , and the lifts g̃αβ . This element is intrinsic to the tangent
bundle T M . It is called the second Stiefel-Whitney class of M , and it is denoted by
w2 (M ). We see that if w2 (M ) 6= 0 then M cannot admit a spin structure. In fact, the
converse is also true.
Proposition 12.2.25. An oriented Riemann manifold M admits a spin structure if and
only if w2 (M ) = 0. ⊔
⊓
Remark 12.2.27. The usefulness of the above proposition depends strongly on the ability
of computing w2 . This is a good news/bad news situation. The good news is that algebraic
topology has produced very efficient tools for doing this. The bad news is that we will not
mention them since it would lead us far astray. See [83] and [98] for more details. ⊔
⊓
Remark 12.2.28. The definition of isomorphism of spin structures is rather subtle (see
[95]). More precisely, two spin structures defined by the cocycles g̃•• and h̃•• are isomorphic
if there exists a collection εα ∈ Z2 ⊂ Spin(n) such that the diagram below is commutative
for all x ∈ Uαβ
Spin(n)
εα
w Spin(n)
u g̃ βα (x)
u h̃ (x)
βα
Spin(n)
εβ
w Spin(n)
The group H 1 (M, Z2 ) acts on Spin(M ) as follows. Take an element ε ∈ H 1 (M, Z2 )
represented by a Čech cocycle, i.e., a collection of continuous maps εαβ : Uαβ → Z2 ⊂
Spin(n) satisfying the cocycle condition
Then the collection ε•• · g̃•• is a Spin(n) gluing cocycle defining a spin structure that we
denote by ε · σ.
It is easy to check that the isomorphism class of ε · σ is independent of the various
choice, i.e, the Čech representatives for ε and σ. Clearly, the correspondence
defines a left action of H 1 (M, Z2 ) on Spin(M ). This action is transitive and free. ⊔
⊓
is an even integer.
(c) Any compact oriented surface is spinnable. Any sphere S n admits a unique spin
structure. The product of two spinnable manifolds is canonically a spinnable manifold.
(d) w2 (RPn ) = 0 if and only if n ≡ 3 (mod 4), while CPn admits spin structures if and
only if n is odd. ⊔
⊓
As usual, we regard the collection gαβ as defining the principal SO(n)-bundle of oriented
frames of T M . We call this bundle PSO(M ) .
The collection g̃αβ defines a principal Spin(n)-bundle which we denote by PSpin(M ) .
We can regard PSO(M ) as a bundle associated to PSpin(M ) via ρ : Spin(n) → SO(n). Using
the unitary spinorial representation
called the complex spinor bundle. Its sections are called (complex) spinors. Let us
pont out that when M is even dimensional, the spinor bundle is equipped with a natural
Z/2-grading
S(M ) = S+ (M ) ⊕ S− (M ).
From Lemma 12.1.31 we deduce that S(M ) is naturally a bundle of Cl(T M ) modules.
Then, the natural isomorphism Cl(T M ) ∼ = Cl(T ∗ M ) induces on S(M ) a structure of
Cl(T ∗ M )-module,
c : Cl(T M ) → EndC ( S(M ) ).
12.2. FUNDAMENTAL EXAMPLES 667
Moreover, when M is even dimensional, the above morphism is compatible with the Z/2-
gradings of Cl(T M ) and EndC ( S(M ) ) As it turns out, the bundle S(M ) has a natural
Dirac structure whose associated Dirac operator is the spin Dirac operator on M . We will
denote it by D.
Since ∆n is a unitary representation, we can equip the spinor bundle S(M ) with a
natural metric with respect to which the Clifford multiplication
is self-adjoint, i.e.,
c(u† ) = c(u)∗ , ∀u ∈ C ∞ (Cl(T M )).
Since Cl(T M ) is locally generated as an algebra by Vect (M ), we can equivalently rewrite
the self-adjointness condition as
All we now need is to describe a natural connection on S(M ) with respect to which the
Clifford multiplication is covariant constant.
We start with the Levi–Civita connection ∇g which we can regard as a connection on
the principal bundle PSO(M ) . Alternatively, ∇g can be defined by a collection of so(n)-
valued 1-forms ωα ∈ Ω1 (Uα ) ⊗ so(n) such that
−1 −1
ωβ = gαβ dgαβ + gαβ ωα gαβ on Uαβ .
ρ∗ : spin(n) → so(n).
∇ej = ek ⊗ ωkj
i
ei ,
j j
so that ωα = ek ⊗(ωki ), where for each k the collection (ωki )i,j is a skew-symmetric matrix.
−1
Using the concrete description of ρ∗ given in (12.1.3) we deduce
X 1X i 1X i k
ω̃α = − ek ⊗ ( ωkj ei ej ) = − ωkj e ⊗ ei ej .
2 4
k i<j i,j,k
is the Riemann curvature tensor, (in the above trivializations over Uα ) we deduce that the
ˆ is
curvature of ∇
X 1 XX k
R̃ = ek ∧ eℓ ⊗ ρ−1
∗ (Rkℓ ) = − e ∧ eℓ ⊗ Rijkℓ ei ej .
4
k<ℓ k<ℓ ij
In the above sum, the terms corresponding to indices (i, j, k, ℓ) such that i = j or k = ℓ
vanish due to the corresponding skew-symmetry of the Riemann tensor. Thus, we can
write XX
ˆ = −1
c(F (∇)) Rijkℓ c(ei )c(ej )c(ek )c(eℓ )
8
i6=j k6=ℓ
Using the first Bianchi identity we deduce that the second sum vanishes. The first sum is
equal to
1X 1X s
− Rijij (c(ei )c(ej ))2 = Rijij = ,
4 4 4
i,j i,j
where s denotes the scalar curvature of M . We have thus proved the following result, [86].
12.2. FUNDAMENTAL EXAMPLES 669
defined in Exercise 12.1.68. The operator DW is Z2 -graded, and hence it has a block
decomposition
0 6 ∗W,+
D
DW = ,
6 W,+
D 0
6 W,+ : C ∞ (S+ (M ) ⊗ W ) → C ∞ (S− (M ) ⊗ W ). Show that
where D
6 ∗W,+D
D 6 W,+ = ∇ ˜ + s + c F+ (∇) ,
˜ ∗∇
4
˜ denotes the product connection of S(M ) ⊗ W , and
where ∇
1
F+ (∇) = F (∇) + ∗F (∇) ,
2
denotes the self-dual part of the curvature of the bundle (W, ∇). ⊔
⊓
Exercise 12.2.35. The torus T 3 = S 1 × S 1 × S 1 equipped with the product metric (of
volume (2π)3 ) has eight spin structures. Since T 3 is also a Lie group, its tangent bundle
admits a canonical trivialization. We denote by σ0 the spin structure determined by this
trivialization. Using the free and transitive action of H 1 (T 3 , Z2 ) on Spin(T 3 ) we deduce
that we have a canonical bijection
H 1 (T 3 , Z2 ) → Spin(T 3 ), H 1 (T 3 , Z2 ) ∋ ǫ 7→ σǫ := ǫ · σ0 .
Compute the spectrum of the spin Dirac operator Dǫ determined by the spin structure
σǫ . ⊔
⊓
oriented orthonormal frames. Assume PSO(M ) is defined by a good open cover U = (Uα )
and transition maps
gαβ : Uαβ → SO(n).
The manifold M is said to posses a spinc structure (or complex spin structure) if there
exists a principal Spinc (n) -bundle PSpinc such that PSO(M ) is associated to PSpinc via the
natural morphism ρc : Spinc (n) → SO(n):
Equivalently, this means there exist smooth maps g̃αβ : Uαβ → Spinc (n), satisfying the
cocycle condition, such that
ρc (g̃αβ ) = gαβ .
As for spin structures, there are obstructions to the existence spinc structures, but they are
less restrictive. Let us try to understand what can go wrong. We stick to the assumption
that all the overlaps Uαβ···γ are contractible.
Since Spinc (n) = (Spin(n) × S 1 )/Z2 , lifting the SO(n) structure (gαβ ) reduces to
finding smooth maps
such that
ρ(hαβ ) = gαβ ,
and
(ǫαβγ , ζαβγ ) := hαβ hβγ hγα , zαβ zβγ zγα ∈ (−1, −1), (1, 1) . (12.2.4)
2 :U 1
If we set λαβ := zαβ αβ → S , we deduce from (12.2.4) that the collection (λαβ ) should
satisfy the cocycle condition. In particular, it defines a principal S 1 -bundle over M , or
equivalently, a complex Hermitian line bundle2 L. This line bundle should be considered
as part of the data defining a spinc structure.
The collection (ǫαβγ ) is an old acquaintance: it is a Čech 2-cocycle representing the
second Stieffel-Whitney class.
As in Subsection 8.2.2, we can represent the cocycle λαβ as
λαβ = exp(iθαβ ).
The collection
1
nαβγ = θαβ + θβγ + θγα ,
2π
defines a 2-cocycle of the constant sheaf Z representing the topological first Chern class
of L. The condition (12.2.4) shows that
To summarize, we see that the existence of a spinc structure implies the existence of a
complex Hermitian line bundle L such that
ctop
1 (L) = w2 (M ) (mod 2).
It is not difficult to prove the above condition is also sufficient. In fact, one can be more
precise.
Denote by Spinc (M ) the collection of isomorphism classes of spinc structures on the
manifold M . Any σ ∈ Spinc (M ) is defined by a lift (hαβ , zαβ ) as above. We denote by
det σ the complex Hermitian line bundle defined by the gluing data (zαβ ). We have seen
that
ctop
1 (det σ) ≡ w2 (M ) (mod 2).
Spinc (M ) → LM , σ 7→ ctop
1 (det σ).
The smooth Picard group Pic∞ (M ) of isomorphisms classes of complex line bundles
(with group operation given by the tensor product) acts on Spinc (M ) by
and L is given by the S 1 cocycle ζαβ : Uαβ → S 1 , then σ ⊗ L is given by the cocycle
[hαβ , zαβ ζαβ ]. Note that
det(σ ⊗ L) = det σ ⊗ L2 ,
so that
ctop
1 det(σ ⊗ L) = ctop top
1 (det σ) + 2c1 (L).
Proposition 12.2.38. The above action of Pic∞ (M ) on Spinc (M ) is free and transitive.
Proof. Consider two spinc structures σ 1 and σ 2 defined by the good cover (Uα ), and the
gluing cocycles
(i) (i)
[hαβ , zαβ ], i = 1, 2.
(1) (2) (2)
Since ρc (hαβ ) = ρc (hαβ ) = gαβ , we can assume (eventually modifying the maps hαβ by a
sign) that
(1) (2)
hαβ = hαβ .
This implies that the collection
(2) (1)
ζαβ = zαβ /zαβ
672 CHAPTER 12. DIRAC OPERATORS
Given two spinc structures σ1 and σ2 we can define their “difference” σ2 /σ1 as the
unique complex Hermitian line bundle L such that σ2 = σ1 ⊗ L. This shows that the
collection of spinc structures is (non-canonically) isomorphic with H 2 (M, Z) ∼= Pic∞ (M ).
2
It is a sort of affine space modelled on H (M, Z) in the sense that the “difference ”
between two spinc structures is an element in H 2 (M, Z), but there is no distinguished
origin of this space. A structure as above is usually called a H 2 (M, Z)-torsor.
The set Spinc (M ) is equipped with a natural involution
Spinc (M ) ∋ σ 7→ σ̄ ∈ Spinc (M ),
defined as follows. If σ is defined by the cocycle [g̃αβ , zαβ ], then σ̄ is defined by the cocycle
[g̃αβ , zαβ −1 ]. Observe that
σ/σ̄ = det σ.
Without sufficient background in algebraic topology the above results may look of very
little help in detecting spinc structures. This is not the case, and to convince the reader
we will list below (without proofs) some examples of spinc manifolds.
Example 12.2.39. (a) Any spin manifold admits a spinc structure.
(b) Any almost complex manifold has a natural spinc structure.
(c) (Hirzebruch-Hopf, [64]; see also [102]) Any oriented manifold of dimension ≤ 4 admits
a spinc structure. ⊔
⊓
Let us analyze the first two examples above. If M is a spin manifold, then the lift
g̃αβ : Uαβ → Spin(n)
of the SO-structure to a spin structure canonically defines a spinc structure via the trivial
morphism
Spin(n) → Spinc (n) ×Z2 S 1 , g 7→ (g, 1) mod the Z2 − action.
We see that in this case the associated complex line bundle is the trivial bundle. This is
called the canonical spinc structure of a spin manifold. We thus have a map
Spin(M ) → Spinc (M ).
Suppose we have fixed a spin structure on M given by a Spin-lift g̃βα of an SO-gluing
cocycle gβα .
To any complex Hermitian line bundle L defined by the S 1 -cocycle (zαβ ) we can
associate the spinc structure σ ⊗ L defined by the gluing data
(g̃αβ , zαβ ) .
The complex Hermitian line bundle det σ⊗L associated to this structure is det σ⊗L = L⊗2 .
Since the topological Picard group Pic∞ (M ) acts freely and transitively on Spinc (M ), we
deduce that to any pair (ǫ, σ) ∈ Spin(M ) × Spinc (M ) we can canonically associate a
complex line bundle L = Lǫ,σ such that L⊗2 ∼= det σ, i.e., Lǫ,σ is a “square root of det σ.
12.2. FUNDAMENTAL EXAMPLES 673
Exercise 12.2.40. Show that for any σ ∈ Spinc (M ) there exists a natural bijection
between the set Spin(M ), and the set of isomorphisms of complex line bundles L such
that L⊗2 ∼
= det σ. ⊔
⊓
Exercise 12.2.41. Prove that the image of the natural map Spin(M ) → Spinc (M ) coin-
cides with the fixed point set of the involution σ 7→ σ̄. ⊔
⊓
Z3 ֒→ R3 → T 3 ,
so that, to any group morphism ρ : Z3 → U (1) we can associate a complex line bundle
Lρ → T 3 .
(a) Prove that Lρ1 ·ρ2 ∼
= Lρ1 ⊗ Lρ2 , ∀ρ1 , ρ1 ∈ Hom(Z3 , S 1 ), and use this fact to describe
the first Chern class of Lρ by explicitly producing a Hermitian connection on Lρ .
(b) Prove that any complex line bundle on T 3 is isomorphic to a line bundle of the form
Lρ , for some morphism ρ ∈ Hom(Z3 , S 1 ). .
(c) Show that the image of the map Spin(T 3 ) → Spinc (T 3 ) consists of a single point. ⊔ ⊓
Exercise 12.2.43. Prove that Spin(RP3 ) consists of precisely two isomorphisms classes
of spin structures and moreover, the natural map
is a bijection. ⊔
⊓
induces a spinc structure defined by the gluing data ξ(hαβ ). Its associated line bundle is
given by the S 1 -cocycle
detC (hαβ ) : Uαβ → S 1 ,
and it is precisely the determinant line bundle
The dual of this line bundle, detC (T ∗ M )1,0 = Λk,0 T ∗ M plays a special role in algebraic
geometry. It usually denoted by KM , and it is called the canonical line bundle. Thus the
−1 def
line bundle associated to this spinc structure is KM = KM ∗ .
From the considerations in Subsection 11.1.5 and 11.1.7 we see that many (complex)
vector bundles associated to the principal Spinc bundle of a spinc manifold carry natural
674 CHAPTER 12. DIRAC OPERATORS
Clifford structures, and in particular, one can speak of Dirac operators. We want to discuss
in some detail a very important special case.
Assume that (M, g) is an oriented, n-dimensional Riemann manifold. Fix σ ∈ Spinc (M )
(assuming there exist spinc structures). Denote by (gαβ ) a collection of gluing data defin-
ing the SO structure PSO(M ) on M with respect to some good open cover (Uα ). Moreover,
we assume σ is defined by the data
Sσ (M ) = PSpinc ×∆n Sn ,
which has a natural Clifford structure. This is called the bundle of complex spinors
associated to σ.
Example 12.2.44. (a) Assume M is a spin manifold. We denote by σ0 the spinc structure
corresponding to the fixed spin structure. The corresponding bundle of spinors S0 (M )
coincides with the bundle of pure spinors defined in the previous section. Moreover for
any complex line bundle L we have
SL := Sσ ∼
= S0 ⊗ L,
where σ = σ0 ⊗ L. Note that in this case L2 = det σ so one can write
Sσ ∼
= S0 ⊗ (det σ)1/2 .
(b) Assume M is an almost complex manifold. The bundle of complex spinors associated
−1
to the canonical spinc structure σ (such that det σ = KM ) is denoted by SC (M ). Note
that
SC (M ) ∼
= Λ0,• T ∗ M. ⊔
⊓
We will construct a natural family of Dirac structures on the bundle of complex spinors
associated to a spinc structure.
Consider the warm-up case when T M is trivial. Then we can assume gαβ ≡ 1, and
Sσ (M ) ∼
= S(M ) ⊗ (det σ)1/2 .
12.2. FUNDAMENTAL EXAMPLES 675
As in the exercise 12.1.68 of Subsection 11.1.9, we deduce that twisting the canonical
connection on the bundle of pure spinors S0 (M ) with any U (1)-connection on det σ 1/2 we
obtain a Clifford connection on Sσ (M ). Notice that if the collection
ωα ∈ u(1) ⊗ Ω1 (Uα )
1
F̂ = F. (12.2.5)
2
Hence any connection on det σ defines in a unique way a Clifford connection on Sσ (M ).
Assume now that T M is not necessarily trivial. We can however cover M by open sets
(Uα ) such that each T Uα is trivial. If we pick from the start a Hermitian connection on
det σ, this induces a Clifford connection on each Sσ (Uα ). These can be glued back to a
Clifford connection on Sσ (M ) using partitions of unity. We let the reader check that the
connection obtained in this way is independent of the various choices.
Here is an equivalent way of associating a Clifford connection on Sσ to any Hermitian
connection on det σ. Fix an open cover (Uα ) of M consisting of geodesically convex
open sets. The restriction of T M to any Uα is trivializable. Fixing such (orthogonal)
trivializations leading to the gluing cocycle
Aα ∈ Ω1 (Uα ) ⊗ so(n),
gβα = ρ(ĝβα ).
676 CHAPTER 12. DIRAC OPERATORS
where ρ∗ : spin(n) → so(n) is the natural Lie algebra isomorphism described in (12.1.3).
The collection (Âα ) satisfies the transition rules
−1 −1
Âβ = −(dĝβα )ĝβα + ĝβα Âα ĝβα .
Observe that although ĝ•• is only defined up to a ±1 ambiguity, this ambiguity is lost in
the above equality.
Consider now a connection ω on the Hermitian line bundle det σ defined by the gluing
2 ). It is defined by a collection
cocycle (z••
In particular, the operators Dρ (γ∇γ −1 ) and Dρ (∇) have the same spectrum.
(b)∗ We say that two Hermitian connections ∇, ∇′ on Lρ are gauge equivalent if and only
if there exists a unitary automorphism γ of L such that ∇′ = γ∇γ −1 . Prove that two
connections ∇ and ∇′ are gauge equivalent if and only if
Z
1
dθ j ∧ F (∇′ ) − F (∇) ∈ 2πiZ, ∀j = 1, 2, 3.
2π T 3
(c) Fix a Hermitian connection ∇ on Lρ . Describe the spectrum of Dρ (∇) in terms of the
real numbers Z
1
sj = dθ j ∧ F (∇), j = 1, 2, 3. ⊔
⊓
2πi T 3
678 CHAPTER 12. DIRAC OPERATORS
Bibliography
[5] M.T. Anderson, J. Cheeger: C α -compactness for manifolds with Ricci curvature and
injectivity radius bounded below, J. Diff. Geom., 35(1992), 265-281.
[8] M. F. Atiyah, R. Bott, V. K. Patodi: On the heat equation and the index theorem,
Invent. Math., 19(1973),279-330.
[13] P. Bérard: From vanishing theorems to estimating theorems: the Bochner technique
revisited, Bull. A.M.S. 19(1988), p.371-406.
[14] N. Berline, E. Getzler, M. Vergne: Heat Kernels and Dirac Aperators, Springer Verlag,
1992.
679
680 BIBLIOGRAPHY
[16] A. Besse: Einstein Manifolds, Ergebnisse der Mathematik, Bd.10, Springer Verlag,
1987.
[18] X. Bin: Derivatives of the spectral function and Sobolev norms of eigenfunctions on a
closed Riemannian manifold, Ann. Global. Analysis an Geometry, 26(2004), 231-252.
[20] J. Bochnak, M. Coste, M.-F. Roy: Real Algebraic Geometry, Springer Verlag, 1998.
[23] H. Brezis: Functional Analysis, Sobolev Spaces and Partial Differential Equations,
Universitext, Springer Verlag, 2011.
[25] E. Cartan: Sur les invariants integraux des certains espaces homogènes et les pro-
priétés topologiques de ces espaces, Ann. Soc. Pol. Math, 8(1929), p.181-225.
[26] H. Cartan: La transgression dans un group de Lie et dans un espace fibré principal,
p.1268-1282, vol. III of H. Cartan’s “ Collected Works”, Springer-Verlag, 1979.
[29] S. S. Chern: A simple intrinsic proof of the Gauss-Bonnet formula for closed Riemann
manifolds, Ann. Math, 45(1946), p.747-752.
[31] M. Coste: An Introduction to o-minimal geometry, Real Algebraic and Analytic Ge-
ometry Network,
http://www.ihp-raag.org/publications.php
[32] R. Courant, D. Hilbert: Methods of Mathematicsl Physics, vol. II, John Wiley &
Sons, 1989.
[36] L. van der Dries: Tame Topology and o-minimal Structures, London Math. Soc. Series,
vol. 248, Cambridge University Press, 1998.
[37] L. van der Dries, C. Miller: Geometric categories and o-minimal structures Duke
Math. J. 84(1996), 497-540.
[43] J. D. Elworthy, Y. LeJan, X.-M. Lin: On the Geometry of Diffusion Operators and
Stochastic Flows, Lect. Notes in Math., vol. 1720, Springer Verlag, 1999.
[44] L. C. Evans: Partial Differential Equations, 2nd Edition, Grad. Studies in Math., vol.
19, Amer. Math. Soc., 2010
[48] S. Gallot, D. Hulin, J. Lafonte: Riemannian Geometry, 3rd Edition, Springer Verlag,
2004.
[49] I.M. Gelfand, G.E. Shilov: Generalized Functions. Properties and Operations, vol. 1,
Academic Press, 1964.
[50] I.M. Gelfand, G.E. Shilov: Generalized Functions. Spaces of Fundamental and Gen-
eralized functions, vol. 2, Academic Press, 1968.
[52] R. Godement: Topologie Algébrique et Théorie des Faisceaux, Hermann et Cie, Paris,
1964.
682 BIBLIOGRAPHY
[53] C. Gordon. D. L. Webb, S. Wolpert: One cannot hear the shape of a drum, Bull.
A.M.S., 27(1992), p.134-138.
[58] B. Hall: Lie Groups, Lie Algebras, and Reprsentations. An Elementary Introduction,
Springer Verlag 2003.
[59] G. H. Hardy, E.M Wright: An Introduction to the Theory of Numbers, Oxford Uni-
versity Press, 1962.
[60] P. Hartman: Ordinary Differential Equations, 2nd Edition, Classics in Applied Math-
ematics, SIAM 2002.
[61] S. Helgason: Differential Geometry, Lie Groups, and Symmetric Spaces, Graduate
Studies in Math. v. 34, Amer. Math. Soc., 2001.
[66] L. Hörmander: The Analysis of Linear Partial Differential Operators I., Springer
Verlag, 2003.
[67] L. Hörmander: The Analysis of Linear Partial Differential Operators III., Springer
Verlag, 1985.
[70] J. Jost: Riemannian Goemtry and Geometric Analysis, Universitext, Springer Verlag,
1998.
BIBLIOGRAPHY 683
[71] M. Kac: Can one hear the shape of a drum?, Amer. Math. Monthly, 73(1966) Part
II, p.1-23.
[75] T. Kato: Perturbation Theory for Linear Operators, Springer Verlag, 1984.
[76] J. Kazdan, F. Warner: Curvature functions for compact 2-manifolds, Ann. Math.
99(1974), p.14-47.
[78] A. Khovanskii: Fewnomials, Transl. Math. Monogr., 88, Amer.Math. Soc., 1991.
[81] K. Kodaira, D. C. Spencer: Groups of complex line bundles over compact Kähler va-
rieties. Divisor class group on algebraic varieties, Proc. Nat. Acad. Sci. USA 39(1953),
p. 868-877.
[83] H. B. Lawson, M.-L. Michelson: Spin Geometry, Princeton University Press, 1989.
[85] J. M. Lee, T. H. Parker: The Yamabe problem, Bull. A.M.S. 17 (1987), p. 37-91.
[86] A. Lichnerowicz: Laplacien sur une variété riemannienne et spineure, Atti Accad.
Naz. dei Lincei, Rendicotti 33(1962), 187-191.
[88] I. G. Macdonald: Symmetric Functions and Hall Polynomials, Clarendon Press, Ox-
ford, 1979.
[89] S. MacLane: Categories for the Working Mathematician, Springer Verlag, 1971.
[91] W. S. Massey: Cross products of vectors in higher dimensional euclidian spaces, Amer.
Math. Monthly, 93(1983), p.697-700.
684 BIBLIOGRAPHY
[92] P. -D. Mehtée: Sur les distributions invariantes dans le groupe des rotations de
Lorentz, Comm. Math. Helv. 28(1954), 225-269.
[96] J. W. Milnor: Curvatures of left invariant metrics on Lie groups, Adv. in Math.,
21(1976), p.293-329.
[97] J. W. Milnor: Topology from the Differentiable Viewpoint, Princeton University Press,
1965.
[98] J. W. Milnor, J. D. Stasheff: Characteristic Classes, Ann. Math. Studies 74, Princeton
University Press, Princeton, 1974.
[99] M. Mimura, H. Toda: Topology of Lie Groups, I and II, Translations of Mathematical
Monographs, vol. 91, Amer. Math. Soc., 1991.
[100] R. Miron, M. Anastasiei: The Geometry of Lagrange Spaces: Theory and Applica-
tions, Kluwer Academic Publishers, 1994.
[102] C. Morrey: Multiple Integrals in the Calculus of Variations, Springer Verlag, 1966.
[107] L. Nirenberg: On elliptic partial differential equations, Ann. Scuola Norm. Sup.
Pisa, 13(1959), p.1-48.
[108] K. Nomizu, H. Ozeki: The existence of complete Riemann metrics, Proc. A.M.S. ,
12(1961), 889-891.
[113] M. M. Postnikov: Lectures in Geometry. Semester V: Lie Groups and Lie Algebras,
Mir Publishers, Moscow, 1986.
[124] N. E. Steenrod: The Topology of Fibre bundles, Princeton University Press, 1951.
[127] F. Trèves: Topological Vector Spaces, Distributions and Kernels, Dover, 2006.
[131] H.Weyl: Die Idee Der Riemannschen Fläche B.G. Tuebner, Berlin 1913.
[133] H. Weyl: The Classical Groups. Their Invariants and Representations, Princeton
University Press, 1939.
[134] E.T. Whittaker, G.N. Watson: A Course of Modern Analysis, Cambridge Univ.
Press, 1996.
[136] S. Zelditch: Real and complex zeros of Riemannian random waves, Spectral analysis
in geometry and number theory, Contemp. Math., 484, Amer. Math. Soc., Providence,
RI, 2009. arXiv:0803.433v1
687
688 INDEX
derivative, 6 algebra, 51
Fréchet, 6 derivative, 92
determinant line, 51, 61 product, 51
diffeomorphism, 9 extremal, 204
of manifolds with boundary, 126
differential, 30 five lemma, 264
differential form, 72 flow, 81
closed, 243 infinitesimal generator of a, 82
local, 81, 83
exact, 243
formula
Dirac function, 530
duplication, 549
distribution, 529
Parseval, 584
temperate, 536
Poisson summation, 587
distributional derivative, 531
polarization, 338
duality, 59
reflection, 548
Hodge, 65
co-area, 384, 385
metric, 59
coarea, 383
natural, 59
Crofton, 437, 446, 451
symplectic, 59
Euler-Meusnier , 441
duplication formula, 549
Gauss, 179
Dynkin polynomials, 90
Green, 165, 468
Künneth, 265
Einstein
Parseval, 537
convention, 48
Stokes, 127
equation, 195
Weyl integration, 303
manifold, 195
Fourier transform, 490, 536
tensor, 195
Fréchet derivative, 5
elliptic complex, 524
framing, 42, 72
embedding, 34
Fredholm index, 512
energy method, 571
function, 5
equation
C k, 5
Einstein, 195
Fréchet differentiable, 5
Euler-Lagrange, 204
harmonic, 165
Hamilton-Jacobi, 223
smooth, 5, 9
Jacobi, 216
functors, 234
Euler
contravariant, 234
characteristic, 54, 198, 249, 255, 260,
covariant, 234
290
fundamental solution, 555
class, 290, 353 Cauchy, 555
geometric, 354
topological, 354 Gamma function, 166, 386, 392
universal, 353 gauge equivalence, see connection
form, 173, 182, 353 gauge transformation, 38, 337
Euler-Lagrange equation, 204 Gauss
exponential map see manifold 150 curvature, 172, 183
exterior, 51 lemma, 153
INDEX 691
wave